WorldWideScience

Sample records for psychometric test scores

  1. Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

    Science.gov (United States)

    Kolen, Michael J.; Lee, Won-Chan

    2011-01-01

    This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

  2. Testing Students with Special Educational Needs in Large-Scale Assessments - Psychometric Properties of Test Scores and Associations with Test Taking Behavior.

    Science.gov (United States)

    Pohl, Steffi; Südkamp, Anna; Hardt, Katinka; Carstensen, Claus H; Weinert, Sabine

    2016-01-01

    Assessing competencies of students with special educational needs in learning (SEN-L) poses a challenge for large-scale assessments (LSAs). For students with SEN-L, the available competence tests may fail to yield test scores of high psychometric quality, which are-at the same time-measurement invariant to test scores of general education students. We investigated whether we can identify a subgroup of students with SEN-L, for which measurement invariant competence measures of adequate psychometric quality may be obtained with tests available in LSAs. We furthermore investigated whether differences in test-taking behavior may explain dissatisfying psychometric properties and measurement non-invariance of test scores within LSAs. We relied on person fit indices and mixture distribution models to identify students with SEN-L for whom test scores with satisfactory psychometric properties and measurement invariance may be obtained. We also captured differences in test-taking behavior related to guessing and missing responses. As a result we identified a subgroup of students with SEN-L for whom competence scores of adequate psychometric quality that are measurement invariant to those of general education students were obtained. Concerning test taking behavior, there was a small number of students who unsystematically picked response options. Removing these students from the sample slightly improved item fit. Furthermore, two different patterns of missing responses were identified that explain to some extent problems in the assessments of students with SEN-L.

  3. ETS Psychometric Contributions: Focus on Test Scores. Research Report. ETS RR-13-15. ETS R&D Scientific and Policy Contributions Series. ETS SPC-13-03

    Science.gov (United States)

    Moses, Tim

    2013-01-01

    The purpose of this report is to review ETS psychometric contributions that focus on test scores. Two major sections review contributions based on assessing test scores' measurement characteristics and other contributions about using test scores as predictors in correlational and regression relationships. An additional section reviews additional…

  4. Development of Scoring Procedures for the Performance Based Measurement (PBM) Test: Psychometric and Criterion Validity Investigation

    Science.gov (United States)

    2011-11-29

    39 Vertical Tracking Test ( VTT ): Scoring Strategies and...Validities ................................................. 43 Item-level CTT and IRT Analyses and Results for the VTT ...44 IRT Calibration of the 9 VTT Items

  5. Psychometrics of Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT) scores.

    Science.gov (United States)

    Brannick, Michael T; Wahi, Monika M; Goldin, Steven B

    2011-08-01

    A sample of 183 medical students completed the Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT V2.0). Scores on the test were examined for evidence of reliability and factorial validity. Although Cronbach's alpha for the total scores was adequate (.79), many of the scales had low internal consistency (scale alphas ranged from .34 to .77; median = .48). Previous factor analyses of the MSCEIT are critiqued and the rationale for the current analysis is presented. Both confirmatory and exploratory factor analyses of the MSCEIT item parcels are reported. Pictures and faces items formed separate factors rather than loading on a Perception factor. Emotional Management appeared as a factor, but items from Blends and Facilitation failed to load consistently on any factor, rendering factors for Emotional Understanding and Emotional Facilitation problematic.

  6. Psychometric properties of the Mayo Elbow Performance Score.

    Science.gov (United States)

    Celik, Derya

    2015-06-01

    To translate and culturally adapt the Mayo Elbow Performance Score (MEPS), a widely used instrument for evaluating disability associated with elbow injuries, into Turkish (MEPS-T) and to determine psychometric properties of the translated version. The MEPS was translated into Turkish using published methodological guidelines. The measurement properties of the MEPS-T (construct validity and floor and ceiling effects) were tested in 91 patients with elbow pathology. The reproducibility of the MEPS-T was tested in 59 patients over 7-14 days. The responsiveness of the MEPS-T was tested in a subgroup of 46 patients diagnosed with lateral epicondylitis and who received conservative treatment for 6 weeks. The interclass correlation coefficient (ICC) was used to estimate the test-retest reliability. The construct validity was analyzed with the disabilities of the arm, shoulder and hand (DASH), Visual Analog Scale (VAS) and the Short Form 36 (SF-36). Effect size (ES) was used to assess the responsiveness. The distribution of floor and ceiling effects was determined. The MEPS-T showed very good test-retest reliability (ICC 0.89). The correlation coefficients between the MEPS-T and DASH and VAS were -0.61 and -0.53, respectively (p elbow disease.

  7. Physical capability scale: psychometric testing.

    Science.gov (United States)

    Resnick, Barbara; Boltz, Marie; Galik, Elizabeth; Wells, Chris

    2013-02-01

    The purpose of this study was to describe the psychometric testing of the Basic Physical Capability Scale. The study was a secondary data analysis of combined data sets from three studies. Study participants included 93 older adults, recruited from 2 acute-care settings and 110 older adults living in long-term care facilities. Rasch analysis was used for the testing of the measurement model. There was some support for construct validity based on the fit of the items to the scale across both samples. In addition, there was support for hypothesis testing as physical function was significantly associated with physical capability. There was evidence for internal consistency (Alpha coefficients of .77-.83) and interrater reliability based on an intraclass correlation of .81. This study provided preliminary support for the reliability and validity of the Basic Physical Capability Scale, and guidance for scale revisions and continued use.

  8. Psychometrics behind Computerized Adaptive Testing.

    Science.gov (United States)

    Chang, Hua-Hua

    2015-03-01

    The paper provides a survey of 18 years' progress that my colleagues, students (both former and current) and I made in a prominent research area in Psychometrics-Computerized Adaptive Testing (CAT). We start with a historical review of the establishment of a large sample foundation for CAT. It is worth noting that the asymptotic results were derived under the framework of Martingale Theory, a very theoretical perspective of Probability Theory, which may seem unrelated to educational and psychological testing. In addition, we address a number of issues that emerged from large scale implementation and show that how theoretical works can be helpful to solve the problems. Finally, we propose that CAT technology can be very useful to support individualized instruction on a mass scale. We show that even paper and pencil based tests can be made adaptive to support classroom teaching.

  9. Test Scoring [book review].

    Science.gov (United States)

    Meijer, Rob R.

    2003-01-01

    This book discusses how to obtain test scores and, in particular, how to obtain test scores from tests that consist of a combination of multiple choice and open-ended questions. The strength of the book is that scoring solutions are presented for a diversity of real world scoring problems. (SLD)

  10. Confessions, scapegoats and flying pigs: Psychometric testing and the law

    OpenAIRE

    Callie Theron

    2007-01-01

    The use of psychometric tests in personnel selection has been regarded with an extraordinary degree of suspicion and scepticism. This is especially true when selection occurs in respect of a diverse applicant group. Concern is expressed about the seemingly uncritical embracing of specific tenets related to the use of psychometric tests in personnel selection in the absence of any systematic coherent psychometric argument to justify these beliefs. The absence of such a supporting psychometric ...

  11. Psychometric testing and Human Resource Management

    Directory of Open Access Journals (Sweden)

    R. P. van der Merwe

    2002-09-01

    Full Text Available This is a cumulative report on the findings of various exploratory research that were done with regard to the practice of psychometric testing in the Eastern Cape. Recent and ongoing developments in the South African labour legislation, and especially the implications of the Employment Equity Act, highlight once again the importance of the validation of all instruments to be used for human assessment and selection purposes. Information was gathered to establish which psychometric tests are used, and for what purposes, in industry today. Biographical information on each organisation is supplied, including the number of employees. The role of psychometric testing in the selection procedure is discussed. The different tests used, as well as the test users, are also indicated. The findings of other, related research, as well as comments, recommendations and shortcomings, are discussed. Opsomming Hierdie is ‘n kumulatiewe verslag wat die resultate verstrek van verskeie verkennende ondersoeke wat gedoen is na die aanwending van psigometriese toetsing in die Oos-Kaap. Onlangse en voortdurende ontwikkelinge in die Suid-Afrikaanse arbeidswetgewing, en veral die implikasies van die Wet op Gelyke Indiensneming, beklemtoon weer eens die belangrikheid van die validering van enige instrumente wat gebruik word vir evaluerings- en keuringsdoeleindes van individue. Inligting is ingewin om te bepaal watter psigometriese toetse, sowel as vir watter doel, vandag in die bedryf gebruik word. Biografiese inligting oor die onderskeie organisasies, insluitende hul aantal werknemers, word verstrek. Die rol van psigometriese toetsing in die keuringsproses word bespreek. Die verskillende toetse wat deur die organisasies gebruik word, sowel as die toetsge-bruikers, word ook aangedui. Die bevindinge van ander, relevante navorsing, sowel as opmerkings, aanbevelings en tekortkominge word bespreek.

  12. Do Test Scores Buy Happiness?

    Science.gov (United States)

    McCluskey, Neal

    2017-01-01

    Since at least the enactment of No Child Left Behind in 2002, standardized test scores have served as the primary measures of public school effectiveness. Yet, such scores fail to measure the ultimate goal of education: maximizing happiness. This exploratory analysis assesses nation level associations between test scores and happiness, controlling…

  13. Development and psychometric validation of the verbal affective memory test

    DEFF Research Database (Denmark)

    Jensen, Christian Gaden; Hjordt, Liv V; Stenbæk, Dea S

    2015-01-01

    -taboo words. Second, we studied the test's psychometric properties in healthy adults. Finally, we investigated whether individuals diagnosed with Seasonal Affective Disorder (SAD) differed from healthy controls on seasonal changes in affective recall. Recall rates were internally consistent and reliable...... psychometric properties. VAMT-24 seems especially sensitive to measuring positive verbal recall bias, perhaps due to the application of common, non-taboo words. Based on the psychometric and clinical results, we recommend VAMT-24 for international translations and studies of affective memory....

  14. The AMC Linear Disability Score project in a population requiring residential care: psychometric properties

    Directory of Open Access Journals (Sweden)

    Vermeulen Marinus

    2004-08-01

    Full Text Available Abstract Background Currently there is a lot of interest in the flexible framework offered by item banks for measuring patient relevant outcomes, including functional status. However, there are few item banks, which have been developed to quantify functional status, as expressed by the ability to perform activities of daily life. Method This paper examines the psychometric properties of the AMC Linear Disability Score (ALDS project item bank using an item response theory model and full information factor analysis. Data were collected from 555 respondents on a total of 160 items. Results Following the analysis, 79 items remained in the item bank. The remaining 81 items were excluded because of: difficulties in presentation (1 item; low levels of variation in response pattern (28 items; significant differences in measurement characteristics for males and females or for respondents under or over 85 years old (26 items; or lack of model fit to the data at item level (26 items. Conclusions It is conceivable that the item bank will have different measurement characteristics for other patient or demographic populations. However, these results indicate that the ALDS item bank has sound psychometric properties for respondents in residential care settings and could form a stable base for measuring functional status in a range of situations, including the implementation of computerised adaptive testing of functional status.

  15. The AMC Linear Disability Score project in a population requiring residential care: psychometric properties

    Science.gov (United States)

    Holman, Rebecca; Lindeboom, Robert; Vermeulen, Marinus; de Haan, Rob J

    2004-01-01

    Background Currently there is a lot of interest in the flexible framework offered by item banks for measuring patient relevant outcomes, including functional status. However, there are few item banks, which have been developed to quantify functional status, as expressed by the ability to perform activities of daily life. Method This paper examines the psychometric properties of the AMC Linear Disability Score (ALDS) project item bank using an item response theory model and full information factor analysis. Data were collected from 555 respondents on a total of 160 items. Results Following the analysis, 79 items remained in the item bank. The remaining 81 items were excluded because of: difficulties in presentation (1 item); low levels of variation in response pattern (28 items); significant differences in measurement characteristics for males and females or for respondents under or over 85 years old (26 items); or lack of model fit to the data at item level (26 items). Conclusions It is conceivable that the item bank will have different measurement characteristics for other patient or demographic populations. However, these results indicate that the ALDS item bank has sound psychometric properties for respondents in residential care settings and could form a stable base for measuring functional status in a range of situations, including the implementation of computerised adaptive testing of functional status. PMID:15291958

  16. Beyond the Test Scores.

    Science.gov (United States)

    Thibodeau, Janice J.

    1985-01-01

    A diagnostic-prescriptive scheme is illustrated using subtests of the Slingerland Screening Tests for Identifying Children with Specific Language Disability and the Detroit Tests of Learning Aptitude. The scheme is intended to focus on the child's learning style by examining the task and the strategies employed. (CL)

  17. Psychometric Functions for Shortened Administrations of a Speech Recognition Approach Using Tri-Word Presentations and Phonemic Scoring

    Science.gov (United States)

    Gelfand, Stanley A.; Gelfand, Jessica T.

    2012-01-01

    Method: Complete psychometric functions for phoneme and word recognition scores at 8 signal-to-noise ratios from -15 dB to 20 dB were generated for the first 10, 20, and 25, as well as all 50, three-word presentations of the Tri-Word or Computer Assisted Speech Recognition Assessment (CASRA) Test (Gelfand, 1998) based on the results of 12…

  18. Psychometric Characteristics of the Gesell School Readiness Screening Test.

    Science.gov (United States)

    Lichtenstein, Robert

    The Gesell School Readiness Screening Test (GSRST) is widely used to identify "developmentally immature" children for placement in extra-year, transition programs in spite of a problematic absence of psychometric evidence and research support. In this study of psychometric characteristics of the GSRST, teacher ratings of classroom performance and…

  19. A Psychometric Evaluation of the Mayer-Salovey-Caruso Emotional Intelligence Test Version 2.0

    Science.gov (United States)

    Palmer, B.R.; Gignac, G.; Manocha, R.; Stough, C.

    2005-01-01

    and discussed.There has been some debate recently over the scoring, reliability and factor structure of ability measures of emotional intelligence (EI). This study examined these three psychometric properties with the most recent ability test of EI, the Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT V2.0; Mayer, Salovey, & Caruso,…

  20. Confessions, scapegoats and flying pigs: Psychometric testing and the law

    Directory of Open Access Journals (Sweden)

    Callie Theron

    2007-01-01

    Full Text Available The use of psychometric tests in personnel selection has been regarded with an extraordinary degree of suspicion and scepticism. This is especially true when selection occurs in respect of a diverse applicant group. Concern is expressed about the seemingly uncritical embracing of specific tenets related to the use of psychometric tests in personnel selection in the absence of any systematic coherent psychometric argument to justify these beliefs. The absence of such a supporting psychometric rationale seems unfortunate in as far as it probably would inhibit the independent critical evaluation of the psychometric merits of these generally accepted beliefs. Specific beliefs related to selection fairness, measurement bias and adverse impact are critically examined.

  1. Psychometric Properties of Scores on Three Black Racial Identity Scales

    Science.gov (United States)

    Simmons, Crystal; Worrell, Frank C.; Berry, Jane M.

    2008-01-01

    In this study, we examined the internal consistency and the structural validity of scores on the African Self-Consciousness Scale (ASCS), the Multidimensional Inventory of Black Identity (MIBI), and the Cross Racial Identity Scale (CRIS). Participants consisted of 225 African American college students--75 attending predominantly White institutions…

  2. Psychometric Properties and Responsiveness to Change of 15- and 28-Item Versions of the SCORE: A Family Assessment Questionnaire.

    Science.gov (United States)

    Hamilton, Elena; Carr, Alan; Cahill, Paul; Cassells, Ciara; Hartnett, Dan

    2015-09-01

    The SCORE (Systemic Clinical Outcome and Routine Evaluation) is a 40-item questionnaire for completion by family members 12 years and older to assess outcome in systemic therapy. This study aimed to investigate psychometric properties of two short versions of the SCORE and their responsiveness to therapeutic change. Data were collected at 19 centers from 701 families at baseline and from 433 of these 3-5 months later. Results confirmed the three-factor structure (strengths, difficulties, and communication) of the 15- and 28-item versions of the SCORE. Both instruments had good internal consistency and test-retest reliability. They also showed construct and criterion validity, correlating with measures of parent, child, and family adjustment, and discriminating between clinical and nonclinical cases. Total and factor scales of the SCORE-15 and -28 were responsive to change over 3-5 months of therapy. The SCORE-15 and SCORE-28 are brief psychometrically robust family assessment instruments which may be used to evaluate systemic therapy. © 2015 Family Process Institute.

  3. Raven's Test Performance of Sub-Saharan Africans: Average Performance, Psychometric Properties, and the Flynn Effect

    Science.gov (United States)

    Wicherts, Jelte M.; Dolan, Conor V.; Carlson, Jerry S.; van der Maas, Han L. J.

    2010-01-01

    This paper presents a systematic review of published data on the performance of sub-Saharan Africans on Raven's Progressive Matrices. The specific goals were to estimate the average level of performance, to study the Flynn Effect in African samples, and to examine the psychometric meaning of Raven's test scores as measures of general intelligence.…

  4. Psychometric testing of the Italian and French versions of the Care Dependency Scale.

    Science.gov (United States)

    Zürcher, Simeon Joel; Vangelooven, Christa; Borter, Natalie; Schnyder, Daniel; Hahn, Sabine

    2016-12-01

    The aim of this study was to test psychometrically the Italian and French versions of the Care Dependency Scale. The Care Dependency Scale assesses changes in patients' level of care dependency including important functional and mental dimensions. Evaluation of the psychometric properties of the Italian version is still ongoing. The French version has to date not been validated. Nationwide cross-sectional point prevalence study. Data were extracted from the national, annual prevalence survey of hospital-acquired pressure ulcers and inpatient falls in Swiss acute care hospitals in 2011. A total of 799 Italian and 1068 French-speaking patients were included in the analysis. For the evaluation, the psychometric properties were tested for each language both separately and conjointly. The scales revealed high internal consistency. Factor analysis presented a one-factor solution for both versions separately as well as combined. Comparison of internal structure revealed an excellent degree of equivalence between the versions. Highly significant Spearman correlations between the Care Dependency Scale and the Braden Scale sum scores indicated satisfactory criterion validity. Both the Italian and the French versions of the Care Dependency Scale showed satisfactory psychometric properties and a high level of equivalence. Further psychometric testing, using modern test theory approaches, is required. However, the scale is recommended as a valid instrument for further use in Italian and French. © 2016 John Wiley & Sons Ltd.

  5. The 'caring experience': Testing the psychometric properties of the Caring Efficacy Scale.

    Science.gov (United States)

    Reid, Carol; Courtney, Mary; Anderson, Debra; Hurst, Cameron

    2015-12-01

    The purpose of the study was to undertake rigorous psychometric testing of the Caring Efficacy Scale in a sample of Registered Nurses. A cross-sectional survey of 2000 registered nurses was undertaken. The Caring Efficacy Scale was utilized to inform the psychometric properties of the selected items of the Caring Efficacy Scale. Cronbach's Alpha identified reliability of the data. Exploratory Factor Analysis and Confirmatory Factor Analysis were undertaken to validate the factors. Confirmatory factor analysis confirmed the development of two factors; Confidence to Care and Doubts and Concerns. The Caring Efficacy Scale has undergone rigorous psychometric testing, affording evidence of internal consistency and goodness-of-fit indices within satisfactory ranges. The Caring Efficacy Scale is valid for use in an Australian population of registered nurses. The scale can be used as a subscale or total score reflective of self-efficacy in nursing. This scale may assist nursing educators to predict levels of caring efficacy.

  6. Psychometric Testing of the Gordon Facial Muscle Weakness Assessment Tool

    Science.gov (United States)

    Gordon, Shirley C.; Blum, Cynthia Ann; Parcells, Dax Andrew

    2010-01-01

    School nurses may be the first health professionals to assess the onset of facial paralysis/muscle weakness in school-age children. The purpose of this study was to test the psychometric properties of the Gordon Facial Muscle Weakness Assessment Tool (GFMWT) developed by Gordon. Data were collected in two phases. In Phase 1, 4 content experts…

  7. Measurement of empathy among Japanese medical students: psychometrics and score differences by gender and level of medical education.

    Science.gov (United States)

    Kataoka, Hitomi U; Koide, Norio; Ochi, Koji; Hojat, Mohammadreza; Gonnella, Joseph S

    2009-09-01

    To examine psychometric properties of a Japanese translation of the Jefferson Scale of Physician Empathy (JSPE), and to study differences in empathy scores between men and women, and students in different years of medical school. The student version of the JSPE was translated into Japanese using back-translation procedures and administered to 400 Japanese students from all six years at the Okayama University Medical School. Item-total score correlations were calculated. Factor analysis was used to examine the underlying components of the Japanese version of the JSPE. Cronbach coefficient alpha was calculated to assess the internal consistency aspect of reliability of the instrument. Finally, empathy scores for men and women were compared using t test, and score differences by year of medical school were examined using analysis of variance. Factor analysis confirmed the three components of "perspective taking," "compassionate care," and "ability to stand in patient's shoes," which had emerged in American and Mexican medical students. Item-total score correlations were all positive and statistically significant. Cronbach coefficient alpha was .80. Women outscored men, and empathy scores increased as students progressed through medical school in this cross-sectional study. Findings provide support for the construct validity and reliability of the Japanese translated version of the JSPE for medical students. Cultural characteristics and educational differences in Japanese medical schools that influence empathic behaviors are described, and implications for cross-cultural study of empathy are discussed.

  8. Psychometric testing of the Duke Activity Status Index in patients with heart failure.

    Science.gov (United States)

    Fan, Xiuzhen; Lee, Kyoung Suk; Frazier, Susan K; Lennie, Terry A; Moser, Debra K

    2015-06-01

    Patients with heart failure (HF) experience difficulty performing activities of daily living. As impaired functional status is adversely associated with outcomes, it is important to accurately evaluate patient functional status. The Duke Activity Status Index (DASI) is a 12-item, self-administered questionnaire to measure functional status. However, its psychometric properties have not been determined in patients with HF. The purpose of this study was to examine the psychometric properties of the DASI in patients with HF. We used data from 297 patients with a diagnosis of HF (age 61±11 years, 31% female) for psychometric testing. Internal consistency reliability of the DASI was high (Cronbach's alpha=0.86). Criterion-related validity was supported by significantly different DASI scores for each New York Heart Association classification. Construct validity was supported by significant correlation of DASI scores with health-related quality of life (r = -0.64), depressive symptoms (r = -0.44), and N-terminal B-type natriuretic peptide (r = -0.14). Our data support the psychometric properties of the DASI as a measure of functional status in patients with HF. This instrument can be used to evaluate functional status and enhance health care provider understanding of functional status related to daily living from the patient perspective. © The European Society of Cardiology 2014.

  9. Spinal appearance questionnaire: factor analysis, scoring, reliability, and validity testing.

    Science.gov (United States)

    Carreon, Leah Y; Sanders, James O; Polly, David W; Sucato, Daniel J; Parent, Stefan; Roy-Beaudry, Marjolaine; Hopkins, Jeffrey; McClung, Anna; Bratcher, Kelly R; Diamond, Beverly E

    2011-08-15

    Cross sectional. This study presents the factor analysis of the Spinal Appearance Questionnaire (SAQ) and its psychometric properties. Although the SAQ has been administered to a large sample of patients with adolescent idiopathic scoliosis (AIS) treated surgically, its psychometric properties have not been fully evaluated. This study presents the factor analysis and scoring of the SAQ and evaluates its psychometric properties. The SAQ and the Scoliosis Research Society-22 (SRS-22) were administered to AIS patients who were being observed, braced or scheduled for surgery. Standard demographic data and radiographic measures including Lenke type and curve magnitude were also collected. Of the 1802 patients, 83% were female; with a mean age of 14.8 years and mean initial Cobb angle of 55.8° (range, 0°-123°). From the 32 items of the SAQ, 15 loaded on two factors with consistent and significant correlations across all Lenke types. There is an Appearance (items 1-10) and an Expectations factor (items 12-15). Responses are summed giving a range of 5 to 50 for the Appearance domain and 5 to 20 for the Expectations domain. The Cronbach's α was 0.88 for both domains and Total score with a test-retest reliability of 0.81 for Appearance and 0.91 for Expectations. Correlations with major curve magnitude were higher for the SAQ Appearance and SAQ Total scores compared to correlations between the SRS Appearance and SRS Total scores. The SAQ and SRS-22 Scores were statistically significantly different in patients who were scheduled for surgery compared to those who were observed or braced. The SAQ is a valid measure of self-image in patients with AIS with greater correlation to curve magnitude than SRS Appearance and Total score. It also discriminates between patients who require surgery from those who do not.

  10. The psychometric testing of the Nursing Teamwork Survey in Iceland.

    Science.gov (United States)

    Bragadóttir, Helga; Kalisch, Beatrice J; Smáradóttir, Sigríður Bríet; Jónsdóttir, Heiður Hrund

    2016-06-01

    The purpose of this study was to test the psychometric properties of the Nursing Teamwork Survey-Icelandic (NTS-Icelandic), which was translated from US English to Icelandic. The Nursing Teamwork Survey, with 33 items, measures overall teamwork and five factors of teamwork: trust, team orientation, backup, shared mental models, and team leadership. The psychometric testing of the NTS-Icelandic was carried out on data from a pilot study and a national study. The sample for a pilot study included 123 nursing staff from five units, and the sample for a national study included 925 nursing staff from 27 inpatient units. The overall test-retest intraclass correlation coefficient in the pilot study was 0.693 (lower bound = 0.498, upper bound = 0.821) (p teamwork. The NTS-Icelandic tested valid and reliable in this study. Study findings support further use of the Nursing Teamwork Survey internationally.

  11. Psychometric methods for automated test design

    NARCIS (Netherlands)

    Geerlings, H.

    2012-01-01

    Flexible forms of testing, such as adaptive testing and testing on demand, usually require large item pools to avoid overexposure of the items. After an initial investment in the development of an item generator, test items can be (semi-)automatically generated in a negligible amount of time. A dist

  12. Development and psychometric testing of the nursing culture assessment tool.

    Science.gov (United States)

    Kennerly, Susan M; Yap, Tracey L; Hemmings, Annette; Beckett, Gulbahar; Schafer, John C; Borchers, Andrea

    2012-11-01

    A valid and reliable nursing culture assessment tool aimed at capturing general aspects of nursing culture is needed for use in health care settings to assess and then reshape indicated troubled areas of the nursing culture. This article summarizes the Nursing Culture Assessment Tool's (NCAT) development and reports on a cross-sectional, exploratory investigation of its psychometric properties. The research aims were to test the tool's psychometric properties; discover its dimensionality; and refine the item structure to best represent the construct of nursing culture, an occupational subset of organizational culture. Empirical construct validity was tested using a sample of licensed nurses and nursing assistants (n = 340). Exploratory and confirmatory factor analysis (CFA) and logistical regression yielded a 6-factor, 19-item solution. Evidence supports the tool's validity for assessing nursing culture as a basis for shaping the culture into one that supports change, thereby accelerating, improving, and advancing nursing best practices and care outcomes.

  13. Family Self-Efficacy for Diabetes Management: Psychometric Testing.

    Science.gov (United States)

    Mcewen, Marylyn M; Pasvogel, Alice; Murdaugh, Carolyn L

    2016-01-01

    Type 2 diabetes mellitus (T2DM) self-management among Hispanic adults occurs in a family context. Self-efficacy (SE) affects T2DM self-management behaviors; however, no instruments are available to measure family diabetes self-efficacy. The study's purpose was to test the psychometric properties of the Family Self-Efficacy for Diabetes Scale (FSE). Family members (n = 113) of adults with T2DM participated. Psychometric analysis included internal consistency reliability and concurrent and construct validity. Internal consistency reliability was .86. Items loaded on 2 factors, Family SE for Supporting Healthy Behaviors and Family SE for Supporting General Health, accounting for 71% of the variance. FSE correlated significantly with 3 diabetes-related instruments. The FSE is a reliable and valid instrument. Further testing is needed in diverse populations and geographic areas.

  14. Psychometric properties of 2-minute walk test: a systematic review.

    Science.gov (United States)

    Pin, Tamis W

    2014-09-01

    To systematically review the psychometric evidence on the 2-minute walk test (2MWT). Electronic searches of databases including MEDLINE, CINAHL, Academic Search Premier, SPORTDiscus, PsycINFO, EMBASE, the Cochrane Library, and DARE were done until February 2014 using a combination of subject headings and free texts. Studies were included if psychometric properties of the 2MWT were (1) evaluated; (2) written as full reports; and (3) published in English language peer-reviewed journals. A modified consensus-based standard for the selection of health measurement instruments checklist was used to rate the methodological quality of the included studies. A quality assessment for statistical outcomes was used to assess the measurement properties of the 2MWT. Best-evidence synthesis was collated from 25 studies of 14 patient groups. Only 1 study was found that examined the 2MWT in the pediatric population. The testing procedures of the 2MWT varied across the included studies. Reliability, validity (construct and criterion), and responsiveness of the 2MWT also varied across different patient groups. Moderate to strong evidence was found for reliability, convergent validity, discriminative validity, and responsiveness of the 2MWT in frail elderly patients. Moderate to strong evidence for reliability, convergent validity, and responsiveness was found in adults with lower limb amputations. Moderate to strong evidence for validity (convergent and discriminative) was found in adults who received rehabilitation after hip fractures or cardiac surgery. Limited evidence for the psychometric properties of the 2MWT was found in other population groups because of methodological flaws. There is inadequate breadth and depth of psychometric evidence of the 2MWT for clinical and research purposes-specifically, minimal clinically important changes and responsiveness. More good-quality studies are needed, especially in the pediatric population. Consensus on standardized testing procedures of

  15. Testing psychometric properties of the 30-item general health questionnaire.

    Science.gov (United States)

    Klainin-Yobas, Piyanee; He, Hong-Gu

    2014-01-01

    This study aimed to evaluate the psychometric properties of the General Health Questionnaire (GHQ-30) given conflicting findings in the literature. A cross-sectional, nonexperimental research was used with a convenience sample of 271 American female health care professionals. Data were collected by using self-reported questionnaires. A series of exploratory factor analyses (EFAs), confirmatory factor analyses (CFAs), and structural equation modeling (SEM) were performed to examine underlying dimensions of the GHQ-30. Results from EFAs and CFAs revealed the three-factor composition (positive affect, anxiety, and depressed mood). All factor loadings were statistically significant, and one pair of error variance was allowed to be correlated. All factors contained questionnaire items with acceptable face validity and demonstrated good internal consistency reliability. Results from SEM further confirmed underlying constructs of the scale. To our knowledge, this is the first study that extensively tested the psychometric properties of the GHQ-30, taking both statistical and substantive issues into consideration.

  16. Scoring and psychometric validation of the Perception of Anticoagulant Treatment Questionnaire (PACT-Q©

    Directory of Open Access Journals (Sweden)

    Essers B

    2009-04-01

    Full Text Available Abstract Background The 'Perception of Anti-Coagulant Treatment Questionnaire' (PACT-Q was developed to assess patients' expectations of, and satisfaction with their anticoagulant treatment. This questionnaire needs to be finalised and psychometrically validated. Methods The PACT-Q was included in the United States, the Netherlands and France into three phase III multinational clinical trials conducted to evaluate efficacy and safety of a new long-acting anticoagulant drug (idraparinux compared to vitamin K antagonist (VKA. PACT-Q was administered to patients with deep venous thrombosis (DVT, atrial fibrillation (AF or pulmonary embolism (PE at Day 1, to assess patients' expectations, and at 3 and 6 months to assess patients' satisfaction and treatment convenience and burden. The final structure of the PACT-Q (Principal Component Analysis – PCA – with Varimax Rotation was first determined and its psychometric properties were then measured with validity of the structure (Multitrait analysis, internal consistency reliability (Cronbach's alpha coefficients and known-group validity. Results PCA and multitrait analyses showed the multidimensionality of the "Treatment Expectations" dimension, comprising 7 items that had to be scored independently. The "Convenience" and "Burden of Disease and Treatment" dimensions of the hypothesised original structure of the questionnaire were combined, thus resulting in 13 items grouped into the single dimension "Convenience". The "Anticoagulant Treatment Satisfaction" dimension remained unchanged and included 7 items. All items of the "Convenience" and "Anticoagulant Treatment Satisfaction" dimensions displayed good convergent and discriminant validity. The internal consistency reliability was good, with a Cronbach's alpha of 0.84 for the "Convenience" dimension, and 0.76 for the "Anticoagulant Treatment Satisfaction" dimension. Known-group validity was good, especially with regard to occurrence of

  17. Psychometric Testing of the Method of Generations’ Mentality Type Measurement

    Directory of Open Access Journals (Sweden)

    Vlada I. Pishchik

    2013-01-01

    Full Text Available The article analyzes the mentality phenomenon in psychology and states the lack of clear, definite positions, concerning mentality, presents author's concept of mentality system, including nuclear and peripheral elements. On this basis, the initial survey, helping to develop the method of mentality measurement was conducted. The article presents author's method of generations’ mentality type measurement, carries out psychometric testing of method paragraphs. The paragraphs were tested on reliability-stability, reliability-conformity, factorial, convergent, constructive and empirical validities. The data of method testing on different samples are cited

  18. Development and psychometric properties of the Basic Amputee Mobility Score for use in patients with a major lower extremity amputation

    DEFF Research Database (Denmark)

    Kristensen, Morten Tange; Nielsen, Anni Østergaard; Topp, Ulla Madsen

    2017-01-01

    AIM: To develop and examine the psychometric properties, including responsiveness and interrater reliability, of a new outcome measure for the evaluation of basic mobility activities after a major lower extremity amputation - The Basic Amputee Mobility Score (BAMS). METHODS: The four following...

  19. Translation, Cross-cultural Adaptation, and Psychometric Properties of the German Version of the Hip Disability and Osteoarthritis Outcome Score

    NARCIS (Netherlands)

    Blasimann, A.; Dauphinee, S.W.; Staal, J.B.

    2014-01-01

    Study Design Clinical measurement. Objectives To translate and cross-culturally adapt the Hip disability and Osteoarthritis Outcome Score (HOOS) from English into German, and to study its psychometric properties in patients after hip surgery. Background There is no specific hip questionnaire in Germ

  20. Psychometric Properties of the Scores on the Behavioral Inhibition and Activation Scales in a Sample of Norwegian Children

    Science.gov (United States)

    Bjornebekk, Gunnar

    2009-01-01

    The primary aim of this study was to examine the psychometric properties of the scores on a version for children of the Carver and White Behavioral Inhibition and Activation scales (the BIS-BAS scales). This involved administering the BIS-BAS scales, the Positive and Negative Affect Schedule, the Junior Eysenck Personality Questionnaire…

  1. The development and psychometric testing of the horizontal violence scale.

    Science.gov (United States)

    Longo, Joy; Newman, David

    2014-12-01

    Inappropriate behaviors of healthcare workers can threaten the delivery of safe, quality care. The purpose of this research was to develop a research instrument specific to the construct of horizontal violence and conduct foundational psychometric testing of the newly developed instrument. The overall findings on the fit indices suggest that this model does adequately measure the underlying construct of horizontal violence. The instrument can be used as a way to determine the existence and extent of horizontal violence in practice settings and used to augment the study of the work environment, particularly in relation to healthy work environments.

  2. A New Clinical Pain Knowledge Test for Nurses: Development and Psychometric Evaluation.

    Science.gov (United States)

    Bernhofer, Esther I; St Marie, Barbara; Bena, James F

    2017-08-01

    All nurses care for patients with pain, and pain management knowledge and attitude surveys for nurses have been around since 1987. However, no validated knowledge test exists to measure postlicensure clinicians' knowledge of the core competencies of pain management in current complex patient populations. To develop and test the psychometric properties of an instrument designed to measure pain management knowledge of postlicensure nurses. Psychometric instrument validation. Four large Midwestern U.S. hospitals. Registered nurses employed full time and part time August 2015 to April 2016, aged M = 43.25 years; time as RN, M = 16.13 years. Prospective survey design using e-mail to invite nurses to take an electronic multiple choice pain knowledge test. Content validity of initial 36-item test "very good" (95.1% agreement). Completed tests that met analysis criteria, N = 747. Mean initial test score, 69.4% correct (range 27.8-97.2). After revision/removal of 13 unacceptable questions, mean test score was 50.4% correct (range 8.7-82.6). Initial test item percent difficulty range was 15.2%-98.1%; discrimination values range, 0.03-0.50; final test item percent difficulty range, 17.6%-91.1%, discrimination values range, -0.04 to 1.04. Split-half reliability final test was 0.66. A high decision consistency reliability was identified, with test cut-score of 75%. The final 23-item Clinical Pain Knowledge Test has acceptable discrimination, difficulty, decision consistency, reliability, and validity in the general clinical inpatient nurse population. This instrument will be useful in assessing pain management knowledge of clinical nurses to determine gaps in education, evaluate knowledge after pain management education, and measure research outcomes. Copyright © 2017 American Society for Pain Management Nursing. Published by Elsevier Inc. All rights reserved.

  3. Nurse Competence Scale--psychometric testing in a Norwegian context.

    Science.gov (United States)

    Wangensteen, Sigrid; Johansson, Inger S; Nordström, Gun

    2015-01-01

    The aim was to test the construct validity of the Nurse Competence Scale. The Nurse Competence Scale has been used in hospital settings for various purposes in several countries. Despite this, confirmatory factor analyses are scarcely reported. The present study is based on re-analyses of data from 2007 (i.e. psychometric testing) and 593 newly graduated nurses working in various contexts were included. Confirmatory as well as exploratory factor analyses (Principal Component Analysis) were carried out. The original 7-factor model of the Nurse Competence Scale (73 items) was not confirmed. The exploratory factor analyses resulted in a Norwegian Nurse Competence Scale consisting of 46 items in the following competence categories: Planning and delivery of care, Teaching functions, Professional leadership, Research utilization and nursing values and Professional awareness. The results underline the needs for psychometric testing of an instrument after translation processes. The instrument is suitable for describing and comparing nurse competence for various reasons. It may also be helpful in creating competence development programs at an individual as well as at an organizational level. Further studies with a broader sample are recommended.

  4. Conditional Reliability Coefficients for Test Scores.

    Science.gov (United States)

    Nicewander, W Alan

    2017-04-06

    The most widely used, general index of measurement precision for psychological and educational test scores is the reliability coefficient-a ratio of true variance for a test score to the true-plus-error variance of the score. In item response theory (IRT) models for test scores, the information function is the central, conditional index of measurement precision. In this inquiry, conditional reliability coefficients for a variety of score types are derived as simple transformations of information functions. It is shown, for example, that the conditional reliability coefficient for an ordinary, number-correct score, X, is equal to, ρ(X,X'|θ)=I(X,θ)/[I(X,θ)+1] Where: θ is a latent variable measured by an observed test score, X; p(X, X'|θ) is the conditional reliability of X at a fixed value of θ; and I(X, θ) is the score information function. This is a surprisingly simple relationship between the 2, basic indices of measurement precision from IRT and classical test theory (CTT). This relationship holds for item scores as well as test scores based on sums of item scores-and it holds for dichotomous as well as polytomous items, or a mix of both item types. Also, conditional reliabilities are derived for computerized adaptive test scores, and for θ-estimates used as alternatives to number correct scores. These conditional reliabilities are all related to information in a manner similar-or-identical to the 1 given above for the number-correct (NC) score. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  5. Psychometric testing of the Persian version of the Belongingness Scale-Clinical Placement Experience.

    Science.gov (United States)

    Ashktorab, Tahereh; Hasanvand, Shirin; Seyedfatemi, Naemeh; Zayeri, Farid; Levett-Jones, Tracy; Pournia, Yadollah

    2015-03-01

    Belongingness has been identified both as a fundamental human need and as a prerequisite for nursing students' clinical learning. Belongingness has also been associated with students' academic achievement, retention, self-esteem, self-directed learning, and self-efficacy. The Belongingness Scale-Clinical Placement Experience is a valid and reliable measure of nursing students' belongingness scores; however, a Persian version of this scale is not currently available. This study aimed to translate the Belongingness Scale-Clinical Placement Experience into Persian, to evaluate its psychometric properties, and to measure the belongingness experiences of Iranian nursing students. Following translation and initial validity and reliability testing of the scale, 300 nursing students from three universities in Iran completed the survey. Further psychometric testing was undertaken followed by analysis of descriptive statistics. Based on the results of confirmatory factor analysis two items were removed from the scale. The mean score of Persian version of the Belongingness Scale-Clinical Placement Experience was 3.21 (0.57). The whole scale had a high internal consistency (Cronbach's alpha=0.92). The alpha coefficients of the subscales of "self-esteem", "connectedness", and "efficacy" were 0.85, 0.86, and 0.80 respectively. Similar to previous versions of the Belongingness Scale-Clinical Placement Experience, the Persian version demonstrated strong psychometric properties with strong validity and reliability, indicating its utility and appropriateness when measuring Iranian nursing students' belongingness experiences. Further testing with other cohorts would strengthen these results. Copyright © 2014 Elsevier Ltd. All rights reserved.

  6. What do educational test scores really measure?

    DEFF Research Database (Denmark)

    McIntosh, James; D. Munk, Martin

    measure of pure cognitive ability. We find that variables which are not closely associated with traditional notions of intelligence explain a significant proportion of the variation in test scores. This adds to the complexity of interpreting test scores and suggests that school culture, attitudes...

  7. Psychometric Properties of the Cyberbullying Test, a Screening Instrument to Measure Cybervictimization, Cyberaggression, and Cyberobservation.

    Science.gov (United States)

    Garaigordobil, Maite

    2015-08-19

    The purpose of the study was to analyze the psychometric properties of the Cyberbullying Test. The sample included 3,026 participants from the Basque Country (northern Spain), aged 12 to 18 years. Results confirmed high internal consistency and moderate temporal stability. Exploratory factor analysis yielded three moderately correlated factors (cyberobserver, cyberaggressor, and cybervictim). Confirmatory factor analysis ratified adequate model fit of the three factors. Convergent and discriminant validity were confirmed: (a) cybervictims use a variety of conflict resolution strategies, scoring high in neuroticism, openness, antisocial behavior, emotional attention, school-academic problems, shyness-withdrawal, psychopathological disorders, anxiety, and psychosomatic complaints, and low in agreeableness, responsibility, self-esteem, and social adjustment and (b) cyberaggressors use many aggressive conflict resolution strategies, scoring high in neuroticism, antisocial behavior, school-academic problems, psychopathological and psychosomatic disorders, and low in empathy, agreeableness, responsibility, emotion regulation, and social adjustment. The study confirms the test's reliability and validity.

  8. Enhancing rigour in the validation of patient reported outcome measures (PROMs: bridging linguistic and psychometric testing

    Directory of Open Access Journals (Sweden)

    Roberts Gwerfyl

    2012-06-01

    Full Text Available Abstract Background A strong consensus exists for a systematic approach to linguistic validation of patient reported outcome measures (PROMs and discrete methods for assessing their psychometric properties. Despite the need for robust evidence of the appropriateness of measures, transition from linguistic to psychometric validation is poorly documented or evidenced. This paper demonstrates the importance of linking linguistic and psychometric testing through a purposeful stage which bridges the gap between translation and large-scale validation. Findings Evidence is drawn from a study to develop a Welsh language version of the Beck Depression Inventory-II (BDI-II and investigate its psychometric properties. The BDI-II was translated into Welsh then administered to Welsh-speaking university students (n = 115 and patients with depression (n = 37 concurrent with the English BDI-II, and alongside other established depression and quality of life measures. A Welsh version of the BDI-II was produced that, on administration, showed conceptual equivalence with the original measure; high internal consistency reliability (Cronbach’s alpha = 0.90; 0.96; item homogeneity; adequate correlation with the English BDI-II (r = 0.96; 0.94 and additional measures; and a two-factor structure with one overriding dimension. Nevertheless, in the student sample, the Welsh version showed a significantly lower overall mean than the English (p = 0.002; and significant differences in six mean item scores. This prompted a review and refinement of the translated measure. Conclusions Exploring potential sources of bias in translated measures represents a critical step in the translation-validation process, which until now has been largely underutilised. This paper offers important findings that inform advanced methods of cross-cultural validation of PROMs.

  9. Effects of common chronic medical conditions on psychometric tests used to diagnose minimal hepatic encephalopathy

    DEFF Research Database (Denmark)

    Lauridsen, M M; Poulsen, L; Rasmussen, C K

    2016-01-01

    Many chronic medical conditions are accompanied by cognitive disturbances but these have only to a very limited extent been psychometrically quantified. An exception is liver cirrhosis where hepatic encephalopathy is an inherent risk and mild forms are diagnosed by psychometric tests. The preferred....../15, p psychometrically measurable cognitive deficits, whereas those with ESRF or DMII had not. This adds to the understanding of the clinical consequences of chronic heart- and lung disease, and implies...... that the psychometric tests should be interpreted with great caution in cirrhosis patients with heart- or lung comorbidity....

  10. Development and Psychometric Evaluation of the Hansen Research Services Matrix Adaptive Test: A Measure of Nonverbal IQ.

    Science.gov (United States)

    Hansen, John A

    2016-10-13

    Assessment of individuals on the autism spectrum often includes a measure of nonverbal IQ. One such measure is the Raven's Standard Progressive Matrices (RSPM). For large research studies with participants distributed nationally it is desirable for assessments to be available online. Because time is a premium, it is ideal that the measure produces accurate scores quickly. The Hansen Research Services Matrix Adaptive Test (HRS-MAT) addresses these needs and with similar psychometric properties of the RSPM. Scores based on the HRS-MAT correlated at r = .81 with those of the RSPM. In adult-child pairs, HRS-MAT scores correlated at approximately r = .50. Details from respondents in a national sample and psychometric properties including reliability and validity are discussed.

  11. Does composition medium affect the psychometric properties of scores on an exercise designed to assess written medical communication skills?

    Science.gov (United States)

    Boulet, John R; McKinley, Danette W; Rebbecchi, Thomas; Whelan, Gerald P

    2007-05-01

    The ECFMG Clinical Skills Assessment (CSA) was developed to evaluate whether graduates of international medical schools are ready to enter graduate training programs in the United States. The performance-based patient note exercise is specifically used to assess an examinee's ability to summarize, synthesize and interpret the data collected in a patient interview. The purpose of this study was to investigate whether choice of composition method (typing or writing) affected the psychometric properties of the scores. Using data for a 1-year period, the validity and reliability of typed and written notes was contrasted. Although the characteristics of individuals who chose whether or not to type were different, the statistical analyses indicated that, controlling for examinee ability, physician examiner ratings of the written summaries were not influenced by composition method. The results of this study suggest that the psychometric properties of the patient note scores are invariant with respect to composition method.

  12. [Standardization of the Test Your Memory and evaluation of their concordance with the outcome of the psychometric examination].

    Science.gov (United States)

    Ferrero-Arias, J; Turrión-Rojo, M A

    2016-05-01

    To explore the relationship between scores on the Test Your Memory (TYM) battery and findings from a more exhaustive neurocognitive assessment. The TYM and fourteen psychometric tests were administered to 84 subjects aged 50 or older who attended an outpatient neurology clinic due to cognitive symptoms. Each patient's cognitive state was determined independently from his/her score on the TYM (CDR 0, n=25; CDR 0.5, n=45; CDR 1, n=14). We analysed concurrent validity of TYM scores and results from the psychometric tests, as well as the degree of concordance between the two types of measurement, by contrasting normalised data from each instrument. Although the intraclass correlation coefficient was 0.67 (confidence interval 95%, 0.53-0.77), analysis of the Bland-Altman plot and the curve on the survival-agreement plot (Luiz et al. method) demonstrates that the individual distances between the two methods exhibit excessive dispersion from a clinical viewpoint. TYM-based predictions of the mean z-score on psychometric tests differed substantially from real results in 30% of the subjects. Concordance of 95% can only be achieved by accepting absolute inter-instrument differences of up to 0.87 as identical values. Furthermore, the TYM underestimates cognitive performance for low values and overestimates it for high values. The TYM is a cognitive screening test which should not be used to predict results on psychometric tests or to detect cognitive changes in clinical trials. Copyright © 2014 Sociedad Española de Neurología. Published by Elsevier España, S.L.U. All rights reserved.

  13. A Psychometric Review of Norm-Referenced Tests Used to Assess Phonological Error Patterns

    Science.gov (United States)

    Kirk, Celia; Vigeland, Laura

    2014-01-01

    Purpose: The authors provide a review of the psychometric properties of 6 norm-referenced tests designed to measure children's phonological error patterns. Three aspects of the tests' psychometric adequacy were evaluated: the normative sample, reliability, and validity. Method: The specific criteria used for determining the psychometric…

  14. The application of personal computers in psychometric testing

    Directory of Open Access Journals (Sweden)

    l. S. Shaw

    1988-05-01

    Full Text Available Personal computers in psychometric testing represent an efficient, inexpensive, user-friendly, and reliable tool for research and training. They offer a number of substantial advantages for both test subjects and test directors. Personal computer-aided testing programs provide for the instruction of test subjects, the presentation of test patterns, the automatic evaluation of results, automatic data collection and statistical processing. Test results can be saved, compared with standard values, and printed out in various formats. User-friendly programming reduces the test directors' work to a minimum and relieve them from routine tasks. Flexibility and low cost are further advantages. Opsomming Die gebruik van persoonlike rekenaars vir .psigometriese toetse word as doeltreffende, goedkoop, gebruikerriendelike en betroubare werktuie vir navorsing en opieiding beskou. Hulle bled 'n aantal belangrike voordele aan vir toetslinge asook toetsleiers. Persoonlike rekenaarondersteunde toetsprogramme behels die voorligting van toetslinge, die vertoning van toetspatrone, die outomatiese evaluering van toetsresultate, outomatiese dataversameling asook statistiese dataverwerking. Toetsresultate kan gestoor, met standaardwaardes vergelyk en in verskeie formate uitgedruk word. Die toetsleier se werkbelading word deur middel van gebruikervriendelike programmering tot 'n minimum verminder en hy word ook van routine take bevry. Verdere voordele soos buigsaamheid en lae koste word ook beklemtoon.

  15. Care Dependency Scale - psychometric testing of the Polish version

    NARCIS (Netherlands)

    Dijkstra, Ate; Muszalik, Marta; Kedziora-Kornatowska, Kornelia; Kornatowski, Tomasz

    2010-01-01

    The importance of this study lies in the availability of psychometrically sound assessment instruments, which are of critical importance for the study of patient's care dependency and the provision of care to these patients. The aim of this study was to identify the psychometric properties of the Ca

  16. Psychometric testing of the Pecka Grading Rubric for evaluating higher-order thinking in distance learning.

    Science.gov (United States)

    Pecka, Shannon; Schmid, Kendra; Pozehl, Bunny

    2014-12-01

    This article describes development of the Pecka Grading Rubric (PGR) as a strategy to facilitate and evaluate students' higher-order thinking in discussion boards. The purpose of this study was to describe psychometric properties of the PGR. Rubric reliability was pilot tested on a discussion board assignment used by 15 senior student registered nurse anesthetist enrolled in an Advanced Principles of Anesthesia course. Interrater and intrarater reliabilities were tested using an interclass correlation coefficient (ICC) to evaluate absolute agreement of scoring. Raters gave each category a score, scores of the categories were summed, and a total score was calculated for the entire rubric. Interrater (ICC = 0.939, P < .001) and intrarater (ICC = 0.902 to 0.994, P < .001) reliabilities were excellent for total point scores. A content validity index was used to evaluate content validity. Raters evaluated content validity of each cell of the PGR. The content validity index (0.8-1.0) was acceptable. Known-group validity was evaluated by comparing graduate student registered nurse anesthetists (N = 7) with undergraduate senior nursing students (N = 13). Beginning evidence indicates a valid and reliable instrument that measures higher-order thinking in the student registered nurse anesthetist.

  17. Psychometric assessment of the Rat Grimace Scale and development of an analgesic intervention score.

    Directory of Open Access Journals (Sweden)

    Vanessa Oliver

    Full Text Available Our limited ability to assess spontaneous pain in rodent models of painful human conditions may be associated with a translational failure of promising analgesic compounds in to clinical use. If measurement of spontaneous pain behaviours can be used to generate an analgesic intervention score their use could expand to guide the use of analgesics, as mandated by regulatory bodies and ethical and welfare obligations. One such measure of spontaneous pain, the Rat Grimace Scale (RGS, has recently been described and shown to exhibit reliability. However, reliability of measurement scores is context and content specific, and further testing required to assess translation to a heterogenous setting (different model, raters, environment. The objectives of this study were to perform reliability testing with the Rat Grimace Scale in a heterogenous setting and generate an analgesic intervention score for its use. In a randomised, blinded study, sixteen adult female rats received one of three analgesia treatments (0.05 mg/kg buprenorphine subcutaneously, 1 mg/kg meloxicam subcutaneously, 0.2 mg/kg oral buprenorphine in jelly peri-operatively (telemetry unit implantation surgery. Rats were video-recorded (before, 1-6 and 12 hours post-operatively and images collected for independent scoring by three blinded raters using the RGS, and five experts based on "pain/no pain" assessment. Scores were used to calculate inter- and intra-rater reliability with an intraclass correlation coefficient and generate an analgesic intervention score with receiver operating characteristic curve analysis. The RGS scores showed very good inter- and intra-rater reliability (0.85 [0.78-0.90 95% CI] and 0.83 [0.76-0.89], respectively. An analgesic intervention threshold of greater than 0.67 was determined. These data demonstrate that the RGS is a useful tool which can be successfully employed in a heterogenous setting, and has the potential to guide analgesic intervention.

  18. Health Belief Model Scale for Human Papilloma Virus and its Vaccination: Adaptation and Psychometric Testing.

    Science.gov (United States)

    Guvenc, Gulten; Seven, Memnun; Akyuz, Aygul

    2016-06-01

    To adapt and psychometrically test the Health Belief Model Scale for Human Papilloma Virus (HPV) and Its Vaccination (HBMS-HPVV) for use in a Turkish population and to assess the Human Papilloma Virus Knowledge score (HPV-KS) among female college students. Instrument adaptation and psychometric testing study. The sample consisted of 302 nursing students at a nursing school in Turkey between April and May 2013. Questionnaire-based data were collected from the participants. Information regarding HBMS-HPVV and HPV knowledge and descriptive characteristic of participants was collected using translated HBMS-HPVV and HPV-KS. Test-retest reliability was evaluated and Cronbach α was used to assess internal consistency reliability, and exploratory factor analysis was used to assess construct validity of the HBMS-HPVV. The scale consists of 4 subscales that measure 4 constructs of the Health Belief Model covering the perceived susceptibility and severity of HPV and the benefits and barriers. The final 14-item scale had satisfactory validity and internal consistency. Cronbach α values for the 4 subscales ranged from 0.71 to 0.78. Total HPV-KS ranged from 0 to 8 (scale range, 0-10; 3.80 ± 2.12). The HBMS-HPVV is a valid and reliable instrument for measuring young Turkish women's beliefs and attitudes about HPV and its vaccination. Copyright © 2015 North American Society for Pediatric and Adolescent Gynecology. Published by Elsevier Inc. All rights reserved.

  19. Psychometric Properties of IRT Proficiency Estimates

    Science.gov (United States)

    Kolen, Michael J.; Tong, Ye

    2010-01-01

    Psychometric properties of item response theory proficiency estimates are considered in this paper. Proficiency estimators based on summed scores and pattern scores include non-Bayes maximum likelihood and test characteristic curve estimators and Bayesian estimators. The psychometric properties investigated include reliability, conditional…

  20. What Do Test Scores Really Mean? A Latent Class Analysis of Danish Test Score Performance

    DEFF Research Database (Denmark)

    Munk, Martin D.; McIntosh, James

    2014-01-01

    Latent class Poisson count models are used to analyze a sample of Danish test score results from a cohort of individuals born in 1954-55, tested in 1968, and followed until 2011. The procedure takes account of unobservable effects as well as excessive zeros in the data. We show that the test scores...... of intelligence explain a significant proportion of the variation in test scores. This adds to the complexity of interpreting test scores and suggests that school culture and possible incentive problems make it more di¢ cult to understand what the tests measure....

  1. What do educational test scores really measure?

    DEFF Research Database (Denmark)

    McIntosh, James; D. Munk, Martin

    Latent class Poisson count models are used to analyze a sample of Danish test score results from a cohort of individuals born in 1954-55 and tested in 1968. The procedure takes account of unobservable effects as well as excessive zeros in the data. The bulk of unobservable effects are uncorrelate...

  2. Analyzing Test-Taking Behavior: Decision Theory Meets Psychometric Theory.

    Science.gov (United States)

    Budescu, David V; Bo, Yuanchao

    2015-12-01

    We investigate the implications of penalizing incorrect answers to multiple-choice tests, from the perspective of both test-takers and test-makers. To do so, we use a model that combines a well-known item response theory model with prospect theory (Kahneman and Tversky, Prospect theory: An analysis of decision under risk, Econometrica 47:263-91, 1979). Our results reveal that when test-takers are fully informed of the scoring rule, the use of any penalty has detrimental effects for both test-takers (they are always penalized in excess, particularly those who are risk averse and loss averse) and test-makers (the bias of the estimated scores, as well as the variance and skewness of their distribution, increase as a function of the severity of the penalty).

  3. A call for policy guidance on psychometric testing in doping control in sport

    DEFF Research Database (Denmark)

    Petróczi, Andrea; Backhouse, Susan H.; Barkoukis, Vassilis

    2015-01-01

    confounds self-reported psychometric test results. Further, the cognitive complexity surrounding test performance means that the response-time based measures and the lie detector tests for revealing concealed life-events (e.g., doping use) are prone to produce false or non-interpretable outcomes in field...... settings. Differences in social-cognitive characteristics of doping behaviour that are tested at group level (doping users vs. non-users) cannot be extrapolated to individuals; nor these psychometric measures used for individual diagnostics. In this paper, we present a position statement calling for policy...... guidance on appropriate use of psychometric assessments in the pursuit of clean sport. We argue that both self-reported and response-time based psychometric tests for doping have been designed, tested and validated to explore how athletes feel and think about doping in order to develop a better...

  4. Psychometric equivalence of recorded spondaic words as test items.

    Science.gov (United States)

    Bilger, R C; Matthies, M L; Meyer, T A; Griffiths, S K

    1998-06-01

    In the determination of the speech-reception threshold (SRT), spondaic words are assumed to be homogeneous with respect to intelligibility; and the assumption of equal intelligibility requires that the words be comparable for all signal levels. Previous attempts to assess the equal intelligibility assumption using word thresholds as the sole criterion are not an adequate basis for specifying the equality of intelligibility. In the present study, the recorded spondaic words (Tillman recording) were analyzed in an attempt to create a more homogeneous set of spondaic words for future laboratory work. To achieve this goal, the data reported by Young, Dudley, and Gunter (1982) and data collected in our laboratory were fitted to a logistic function (psychometric function) from which a 50% point (threshold) and slope were obtained. To specify their acoustical parameters, the recorded spondaic words were digitized and the RMS level and duration of each syllable and word were calculated. None of the RMS or duration measures were correlated with word thresholds, so no attempt was made to equate level or duration. On the other hand, when the threshold of each word was adjusted to equal the mean threshold of the set (n = 36), the dispersion among word thresholds and slopes was greatly reduced. Further, we recommend that small sets of "equally intelligible" spondaic words not be used for clinical testing because set size is a strong factor in determining threshold for spondees (Meyer & Bilger, 1997; Punch & Howard, 1985).

  5. A pilot evaluation of a computer-based psychometric test battery designed to detect impairment in patients with cirrhosis

    Science.gov (United States)

    Cook, Nicola A; Kim, Jin Un; Pasha, Yasmin; Crossey, Mary ME; Schembri, Adrian J; Harel, Brian T; Kimhofer, Torben; Taylor-Robinson, Simon D

    2017-01-01

    Background Psychometric testing is used to identify patients with cirrhosis who have developed hepatic encephalopathy (HE). Most batteries consist of a series of paper-and-pencil tests, which are cumbersome for most clinicians. A modern, easy-to-use, computer-based battery would be a helpful clinical tool, given that in its minimal form, HE has an impact on both patients’ quality of life and the ability to drive and operate machinery (with societal consequences). Aim We compared the Cogstate™ computer battery testing with the Psychometric Hepatic Encephalopathy Score (PHES) tests, with a view to simplify the diagnosis. Methods This was a prospective study of 27 patients with histologically proven cirrhosis. An analysis of psychometric testing was performed using accuracy of task performance and speed of completion as primary variables to create a correlation matrix. A stepwise linear regression analysis was performed with backward elimination, using analysis of variance. Results Strong correlations were found between the international shopping list, international shopping list delayed recall of Cogstate and the PHES digit symbol test. The Shopping List Tasks were the only tasks that consistently had P values of battery correlated very strongly with the digit symbol component of PHES in discriminating severity of HE. These findings would indicate that components of the current PHES battery with the international shopping list tasks of Cogstate would be discriminant and have the potential to be used easily in clinical practice. PMID:28919805

  6. Critical Thinking: More than Test Scores

    Science.gov (United States)

    Smith, Vernon G.; Szymanski, Antonia

    2013-01-01

    This article is for practicing or aspiring school administrators. The demand for excellence in public education has lead to an emphasis on standardized test scores. This article explores the development of a professional enhancement program designed to prepare teachers to teach higher order thinking skills. Higher order thinking is the primary…

  7. Development and psychometric testing of the satisfaction with Cultural Simulation Experience Scale.

    Science.gov (United States)

    Courtney-Pratt, Helen; Levett-Jones, Tracy; Lapkin, Samuel; Pitt, Victoria; Gilligan, Conor; Van der Riet, Pamela; Rossiter, Rachel; Jones, Donovan; Everson, Naleya

    2015-11-01

    Decreasing the numbers of adverse health events experienced by people from culturally diverse backgrounds rests, in part, on the ability of education providers to provide quality learning experiences that support nursing students in developing cultural competence, an essential professional attribute. This paper reports on the implementation and evaluation of an immersive 3D cultural empathy simulation. The Satisfaction with Cultural Simulation Experience Scale used in this study was adapted and validated as the first stage of this study. Exploratory factor analysis and confirmatory factor analysis were undertaken to investigate the psychometric properties of the scale using two randomly-split sub-samples. Cronbach's Alpha was used to examine internal consistency reliability. Descriptive statistics were used for analysis of mean satisfaction scores and qualitative comments to open-ended questions were analysed and coded. A purposive sample (n = 497) of second of nursing students participated in the study. The overall Cronbach's alpha for the scale was 0.95 and each subscale demonstrated high internal consistency: 0.92; 0.92; 0.72 respectively. The mean satisfaction score was 4.64 (SD 0.51) out of a maximum of 5 indicating a high level of participant satisfaction with the simulation. Three factors emerged from qualitative analysis: "Becoming culturally competent", "Learning from the debrief" and "Reflecting on practice". The cultural simulation was highly regarded by students. Psychometric testing of the Satisfaction with Cultural Simulation Experience Scale demonstrated that it is a reliable instrument. However, there is room for improvement and further testing in other contexts is therefore recommended.

  8. Psychological stress and infertility. Part 2: Psychometric test data.

    Science.gov (United States)

    Brand, H J

    1982-12-01

    The hypothesis tested in this study was that a group of female functional infertile patients would show significantly more personality maladjustment than a group with definite organic reproductive pathology and a normal fertile group. No significant differences between the functional and organic groups were found on any of the subscores of four personality questionnaires. The normal group (wives of sterile men) scored higher on extraversion than both the functional and organic groups. On self-control the normal group also scored lower (negative connotation) than the other two groups. In general no evidence for personality maladjustment in functional infertility was found.

  9. Development and psychometric testing of the Visual Analogue Scale for Irritable Bowel Syndrome (VAS-IBS).

    OpenAIRE

    Ohlsson Bodil; Bengtsson Mariette; Ulander Kerstin

    2007-01-01

    Abstract Background The aim of this study was to develop and psychometrically test a short, patient-reported questionnaire to be used in clinical practice for patients with Irritable Bowel Syndrome (IBS). The Visual Analogue Scale for Irritable Bowel Syndrome (VAS-IBS) questionnaire was designed to measure the treatment response of symptoms and well-being in patients suffering from IBS. Methods The VAS-IBS was psychometrically tested for content and criterion validity, scale acceptability, it...

  10. Development and psychometric testing of the Visual Analogue Scale for Irritable Bowel Syndrome (VAS-IBS)

    OpenAIRE

    Bengtsson, Mariette; Ohlsson, Bodil; Ulander, Kerstin

    2007-01-01

    Background: The aim of this study was to develop and psychometrically test a short, patient-reported questionnaire to be used in clinical practice for patients with Irritable Bowel Syndrome (IBS). The Visual Analogue Scale for Irritable Bowel Syndrome (VAS-IBS) questionnaire was designed to measure the treatment response of symptoms and well-being in patients suffering from IBS. Methods: The VAS-IBS was psychometrically tested for content and criterion validity, scale acceptability, item-redu...

  11. Development and psychometric testing of the Ascent to Competence Scale.

    Science.gov (United States)

    McCoy, Michelle A; Levett-Jones, Tracy; Pitt, Victoria

    2013-01-01

    This paper reports the development and psychometric testing of the Ascent to Competence Scale, an instrument designed to measure nursing students' perceptions of the quality of their clinical placement experience. The key purpose of clinical placements is to facilitate students' learning and progress toward the attainment of competence. The attainment of competence requires personal commitment and active involvement of students; support and guidance of clinical and academic staff; and clinical environments that are welcoming and inclusive of students. The items for the Ascent to Competence Scale were identified following a critical review of the literature. Content and face validity were established by an expert panel. During 2010 the instrument was tested with third year nursing students (n=88) from one Australian university. Exploratory factor analysis with promax oblique rotation was used to determine construct validity and Cronbach's coefficient alpha determined the scale's internal consistency reliability. The final scale demonstrated satisfactory internal consistency (alpha 0.98). Exploratory factor analysis yielded a three-component structure termed "Being welcomed"; "Belongingness" and "Learning and competence". Each subscale demonstrated high internal consistency: 0.89; 0.96; and 0.95 respectively. The Ascent to Competence Scale provides a fresh perspective on clinical placements as it allows for the relationship between belongingness, learning and competence to be explored. The scale was reliable and valid for this cohort. Further research in different contexts would be valuable in extending upon this work. The Ascent to Competence Scale profiled in this paper will be of benefit to both educational and healthcare institutions. The use of a quantified yardstick, such as the Ascent to Competence Scale, is important in evaluating the efficacy of programs, placements and partnerships between higher education and health services. Copyright © 2011 Elsevier Ltd

  12. Development and psychometric testing of the Supportive Supervisory Scale.

    Science.gov (United States)

    McGilton, Katherine S

    2010-06-01

    To describe the development and psychometric testing of the Supportive Supervisory Scale (SSS). The development of the items of the scale was based on Winnicott's relationship theory and on focus groups with 26 healthcare aides (HCAs) and 30 supervisors from six long-term care (LTC) facilities in Ontario, Canada. Content validity of the 15-item instrument was established by a panel of experts. Based on a secondary analysis of data collected from 222 HCAs in 10 LTC facilities in Ontario, Canada, the SSS was subjected to principal components analysis with oblique rotation. A two-factor solution was accepted, which is consistent with the theoretical conceptualization of the instrument. Factor I was labeled Respects Uniqueness and Factor II was labeled Being Reliable. Internal consistency of Factor I was .95, and that of Factor II was .91. Discriminant validity was also established. The focus groups revealed that "being available to staff" while "recognizing the HCA as an individual, and taking a moment to get to know them" was essential to feeling supported by their supervisor. The SSS is a reliable and valid measure of supervisory support of supervisors working in LTC facilities. At the core of supportive supervision is the supervisor's ability to develop and maintain positive relationships with each HCA. It is through respecting the uniqueness of each HCA and being reliable that supervisor-HCA relationships can flourish. Supportive leadership in LTC settings is a major contributor to HCAs' job satisfaction and retention and to quality of patient care. Therefore, a tool developed and tested to measure supervisors' supportive capacities in LTC is primal to evaluate the effectiveness of supervisors in these environments.

  13. Functional limitations in Romanian children with haemophilia: further testing of psychometric properties of the Paediatric Haemophilia Activities List.

    Science.gov (United States)

    Groen, W; van der Net, J; Lacatusu, A M; Serban, M; Helders, P J M; Fischer, K

    2013-05-01

    Children with haemophilia often experience limitations in activities of daily life. Recently the Paediatric Haemophilia Activities List (PedHAL) has been developed and tested in Dutch children with intensive replacement therapy. The psychometric properties of the PedHAL in children not receiving intensive replacement therapy are not known. The objective was to gain further insight into the psychometric properties of the PedHAL and to study the functional health status of Romanian children and adolescents with haemophilia. Children attending to the rehabilitation centre of Buzias in Romania were sampled consecutively. Construct validity of the PedHAL was evaluated by concurrent testing with objective and subjective measures of physical function and functional ability. Reproducibility was tested by a 3-day test-retest by intraclass correlation coefficient (ICC) and limits of agreement (LOA). Responsiveness to rehabilitation was assessed by Haemophilia Joint Health Score (HJHS) and PedHAL. Twenty-nine children with severe (n = 25) or moderate (n = 4) haemophilia participated. Mean age was 13.2 years (SD 4.0). Median score of the PedHAL was 83.5 (IQR 47.9-90.5). The PedHAL correlated moderately with HJHS (rho = -0.59), Functional Independence Score in& Haemophilia (rho = 0.65) and Child Health Questionnaire-physical function (rho = 0.40) and not with Child Health Questionnaire-mental health, Child Health Questionnaire-behaviour and 6MWT. Test-retest reliability was good (ICC = 0.95). LOA was 17.4 points for the sum score. HJHS scores improved slightly after rehabilitation, whereas PedHAL scores did not change. In general, construct validity and test-retest reliability were good, test-retest agreement showed some variability. Therefore, currently the PedHAL may be more appropriate for research purposes than for individual patient monitoring in clinical practice.

  14. Testing the Psychometric Properties of a Chinese Version of the Level of Expressed Emotion Scale

    Directory of Open Access Journals (Sweden)

    Wai Tong Chien

    2014-01-01

    Full Text Available This study tested the psychometric properties of a Chinese version of the level of expressed emotion scale in Hong Kong Chinese patients with severe mental illness and their family caregivers. First, the semantic equivalence with the original English version and test-retest reliability at 2-week interval of the Chinese version was examined. After that, the reproducibility, construct validity, and internal consistency of the Chinese version were tested. The Chinese version indicated good semantic equivalence with the English version (kappa values = 0.76–0.95 and ICC = 0.81–0.92, test-retest reliability (r = 0.89–0.95, P<0.01, and internal consistency (Cronbach’s α = 0.86–0.92. Among 262 patients with severe mental illness and their caregivers, the 50-item Chinese version had substantial loadings on one of the four factors identified (intrusiveness/hostility, attitude towards patient, tolerance, and emotional involvement, accounting for 71.8% of the total variance of expressed emotion. In confirmatory factor analysis, the identified four-factor model showed the best fit based on all fit indices (χ2/df = 1.93, P=0.75; AGFI = 0.96; TLI = 1.02; RMSEA = 0.031; WRMR = 0.78 to the collected data. The four-factor Chinese version also indicated a good concurrent validity with significant correlations with family functioning (r = −0.54 and family burden (r = 0.49 and a satisfactory reproducibility over six months (intraclass correlation coefficient of 0.90. The mean scores of the overall and subscale of the Chinese version in patients with unipolar disorder were higher than in other illness groups (schizophrenia, psychotic disorders, and bipolar disorder; P<0.01. The Chinese version demonstrates sound psychometric properties to measure families’ expressed emotion in Chinese patients with severe mental illness, which are found varied across countries.

  15. Psychometric analysis and critical appraisal of the original, revised, and modified versions of the Japanese Orthopaedic Association score in the assessment of patients with cervical spondylotic myelopathy.

    Science.gov (United States)

    Furlan, Julio C; Catharine Craven, B

    2016-06-01

    OBJECTIVE Cervical spondylotic myelopathy (CSM) is the most common cause of nontraumatic spinal cord impairment and disability in the world. Given that the Japanese Orthopaedic Association (JOA) score is the most frequently used outcome measure in clinical research and practice for treating patients with CSM, this review was undertaken to comprehensively and critically evaluate the psychometric properties of the JOA score. METHODS The authors identified studies (published in the period of January 1975 to November 2015) on the psychometric properties of the original, revised, and modified versions of the JOA score in Medline, PsycINFO, Excerpta Medica dataBASE (EMBASE), American College of Physicians Journal Club, and Cochrane Database of Systematic Reviews. Additional publications were captured in a secondary search of the bibliographies in both original research articles and literature reviews identified in the original search. The JOA scores were evaluated for item generation and reduction, internal consistency, reliability, validity, and responsiveness. This review included all those versions of the JOA score whose psychometric properties had been reported in at least 2 published studies. RESULTS The primary search strategy identified 59 studies, of which 9 fulfilled the inclusion and exclusion criteria. An additional 18 publications were captured in the secondary search and included in the analysis. The key findings from the 27 studies analyzed indicated the following: 1) the original JOA score (1975) was the source for the revised JOA score (1994) and 3 modified versions (1991, 1993, and 1999 JOA scores) reported or used in at least 2 published studies; 2) the revised and modified versions of the JOA score are markedly different from each other; 3) only the revised JOA score (1994) was validated with the original JOA score; and 4) the 1975 JOA score is the most appropriate instrument for assessing patients in Asian populations (especially from Japan) because

  16. The Mayer-Salovey-Caruso emotional intelligence test: Psychometric properties of the Serbian version

    Directory of Open Access Journals (Sweden)

    Altaras-Dimitrijević Ana

    2010-01-01

    Full Text Available The field of emotional intelligence (EI research has yielded two rather distinct ways of conceptualizing and measuring the construct, which also differ greatly in terms of their scientific value: mixed models of EI, although commercially successful, prove inadequate when subject to scientific scrutiny; by contrast, the Mayer and Salovey ability model of EI meets most of the proposed criteria for establishing a scientifically meaningful EI construct. Its current operationalization, the MSCEIT, has thus far been found to exhibit good reliability, as well as convergent-dicriminant and structural validity. The present study examined the psychometric properties of the Serbian version of the test in a sample of 250 high school graduates. Our results show that the reliabilities of Serbian MSCEIT scores are equivalent to those reported for the American standardization sample; more specifically, reliability coefficients are high for the two Area scores and the Total score (r≥ .86. The Total EI score shows low to moderate correlations with standard measures of academic intelligence (r=.244 - .429; p<.01, and very low correlations with the Big Five personality traits (r<.25 - a pattern that is regarded optimal in establishing convergent-discriminant validity. The factor structure of the Serbian MSCEIT closely corresponds to that of the original test: the theoretically proposed 1-, 2- and 4-factor solutions all prove empirically sustainable; however, in the 2- and 4-factor solutions, the structure of certain EI branches diverges from that which is theoretically expected and incorporated in the scoring system. The obtained data certainly recommend the Serbian MSCEIT for standardization and practical use, in which case they should be supplemented with data regarding the test’s predictive validity. On a more basic level, our results suggest the possibility of assessing a scientifically meaningful EI construct, defined as the ability to reason with

  17. An international psychometric testing of the Care Dependency Scale

    NARCIS (Netherlands)

    Dijkstra, A.; Brown, L.; Havens, B.; Romeren, T.I.; Zanotti, R.; Dassen, T.; van den Heuvel, W.

    2000-01-01

    In an international study, psychometric properties of the Care Dependency Scale (CDS) were examined by analysing data gathered in Dutch, Canadian, Italian and Norwegian nursing homes. For that purpose, from these countries a convenience sample was developed consisting of 525 patients with dementia.

  18. Clinical Teaching Effectiveness Instrument: Development and Psychometric Testing.

    Science.gov (United States)

    Copeland, H. Liesel; Hewson, Mariana

    This report describes the development and psychometric qualities of a new instrument to assess clinical teaching effectiveness in medical education. The strength of the instrument is seen to lie in the qualitative development process involving iterative checking with key stakeholders; its high reliability, validity, and feasibility; and its ease…

  19. ITC Guidelines on Quality Control in Scoring, Test Analysis, and Reporting of Test Scores

    Science.gov (United States)

    Allalouf, Avi

    2014-01-01

    The Quality Control (QC) Guidelines are intended to increase the efficiency, precision, and accuracy of the scoring, analysis, and reporting process of testing. The QC Guidelines focus on large-scale testing operations where multiple forms of tests are created for use on set dates. However, they may also be used for a wide variety of other testing…

  20. ITC Guidelines on Quality Control in Scoring, Test Analysis, and Reporting of Test Scores

    Science.gov (United States)

    Allalouf, Avi

    2014-01-01

    The Quality Control (QC) Guidelines are intended to increase the efficiency, precision, and accuracy of the scoring, analysis, and reporting process of testing. The QC Guidelines focus on large-scale testing operations where multiple forms of tests are created for use on set dates. However, they may also be used for a wide variety of other testing…

  1. Investigating the Value of Section Scores for the "TOEFL iBT"® Test. "TOEFL iBT"® Research Report. TOEFL iBT-21. ETS Research Report RR-13-35

    Science.gov (United States)

    Sawaki, Yasuyo; Sinharay, Sandip

    2013-01-01

    This study investigates the value of reporting the reading, listening, speaking, and writing section scores for the "TOEFL iBT"® test, focusing on 4 related aspects of the psychometric quality of the TOEFL iBT section scores: reliability of the section scores, dimensionality of the test, presence of distinct score profiles, and the…

  2. The factor structure and psychometric properties of the Spanish version of the Mayer-Salovey-Caruso Emotional Intelligence Test.

    Science.gov (United States)

    Sanchez-Garcia, Manuel; Extremera, Natalio; Fernandez-Berrocal, Pablo

    2016-11-01

    This research examined evidence regarding the reliability and validity of scores on the Spanish version of the Mayer-Salovey-Caruso Emotional Intelligence Test, Version 2.0 (MSCEIT; Mayer, Salovey, & Caruso, 2002). In Study 1, we found a close convergence of the Spanish consensus scores and the general and expert consensus scores determined with Mayer, Salovey, Caruso, and Sitarenios (2003) data. The MSCEIT also demonstrated adequate evidence of reliability of test scores as estimated by internal consistency and test-retest correlation after 12 weeks. Confirmatory factor analysis supported a 3-level higher factor model with 8 manifest variables (task scores), 4 first-level factors (corresponding to the 4-branch model of Mayer & Salovey [1997], with 2 tasks for each branch), 2 second-level factors (experiential and strategic areas, with 2 branches for each area), and 1 third-level factor (overall emotional intelligence [EI]), and multigroup analyses supported MSCEIT cross-gender invariance. Study 2 found evidence for the discriminant validity of scores on the MSCEIT subscales, which were differentially related to personality and self-reported EI. Study 3 provided evidence of the incremental validity of scores on the MSCEIT, which added significant variance to the prospective prediction of psychological well-being after controlling for personality traits. The psychometric properties of the Spanish MSCEIT are similar to those of the original English version, supporting its use for assessing emotional abilities in the Spanish population. (PsycINFO Database Record

  3. A tutorial on aphasia test development in any language: Key substantive and psychometric considerations

    Science.gov (United States)

    Ivanova, Maria V.; Hallowell, Brooke

    2013-01-01

    Background There are a limited number of aphasia language tests in the majority of the world's commonly spoken languages. Furthermore, few aphasia tests in languages other than English have been standardized and normed, and few have supportive psychometric data pertaining to reliability and validity. The lack of standardized assessment tools across many of the world's languages poses serious challenges to clinical practice and research in aphasia. Aims The current review addresses this lack of assessment tools by providing conceptual and statistical guidance for the development of aphasia assessment tools and establishment of their psychometric properties. Main Contribution A list of aphasia tests in the 20 most widely spoken languages is included. The pitfalls of translating an existing test into a new language versus creating a new test are outlined. Factors to consider in determining test content are discussed. Further, a description of test items corresponding to different language functions is provided, with special emphasis on implementing important controls in test design. Next, a broad review of principal psychometric properties relevant to aphasia tests is presented, with specific statistical guidance for establishing psychometric properties of standardized assessment tools. Conclusions This article may be used to help guide future work on developing, standardizing and validating aphasia language tests. The considerations discussed are also applicable to the development of standardized tests of other cognitive functions. PMID:23976813

  4. Development and psychometric testing of the clinical networks engagement tool

    Science.gov (United States)

    Hecker, Kent G.; Rabatach, Leora; Noseworthy, Tom W.; White, Deborah E.

    2017-01-01

    Background Clinical networks are being used widely to facilitate large system transformation in healthcare, by engagement of stakeholders throughout the health system. However, there are no available instruments that measure engagement in these networks. Methods The study purpose was to develop and assess the measurement properties of a multiprofessional tool to measure engagement in clinical network initiatives. Based on components of the International Association of Public Participation Spectrum and expert panel review, we developed 40 items for testing. The draft instrument was distributed to 1,668 network stakeholders across different governance levels (leaders, members, support, frontline stakeholders) in 9 strategic clinical networks in Alberta (January to July 2014). With data from 424 completed surveys (25.4% response rate), descriptive statistics, exploratory and confirmatory factor analysis, Pearson correlations, linear regression, multivariate analysis, and Cronbach alpha were conducted to assess reliability and validity of the scores. Results Sixteen items were retained in the instrument. Exploratory factor analysis indicated a four-factor solution and accounted for 85.7% of the total variance in engagement with clinical network initiatives: global engagement, inform (provided with information), involve (worked together to address concerns), and empower (given final decision-making authority). All subscales demonstrated acceptable reliability (Cronbach alpha 0.87 to 0.99). Both the confirmatory factor analysis and regression analysis confirmed that inform, involve, and empower were all significant predictors of global engagement, with involve as the strongest predictor. Leaders had higher mean scores than frontline stakeholders, while members and support staff did not differ in mean scores. Conclusions This study provided foundational evidence for the use of this tool for assessing engagement in clinical networks. Further work is necessary to evaluate

  5. Psychometric Evaluation of the Revised Michigan Diabetes Knowledge Test (V.2016) in Arabic: Translation and Validation

    Science.gov (United States)

    Alhaiti, Ali Hassan; Alotaibi, Alanod Raffa; Jones, Linda Katherine; DaCosta, Cliff

    2016-01-01

    Objective. To translate the revised Michigan Diabetes Knowledge Test into the Arabic language and examine its psychometric properties. Setting. Of the 139 participants recruited through King Fahad Medical City in Riyadh, Saudi Arabia, 34 agreed to the second-round sample for retesting purposes. Methods. The translation process followed the World Health Organization's guidelines for the translation and adaptation of instruments. All translations were examined for their validity and reliability. Results. The translation process revealed excellent results throughout all stages. The Arabic version received 0.75 for internal consistency via Cronbach's alpha test and excellent outcomes in terms of the test-retest reliability of the instrument with a mean of 0.90 infraclass correlation coefficient. It also received positive content validity index scores. The item-level content validity index for all instrument scales fell between 0.83 and 1 with a mean scale-level index of 0.96. Conclusion. The Arabic version is proven to be a reliable and valid measure of patient's knowledge that is ready to be used in clinical practices. PMID:27995149

  6. Development and psychometric testing of a visual analog scale for dental fluorosis

    Directory of Open Access Journals (Sweden)

    R Ramya

    2014-01-01

    Full Text Available Objectives: The aim was to develop and psychometrically test a visual analog scale (VAS for dental fluorosis. Materials and Methods: In a cross-sectional study, esthetic perceptions of varying severities of fluorosis in permanent maxillary incisor teeth presented in 36 photographs were graded on a 10 cm VAS by two groups - an expert panel of dental specialists and lay group comprising 1 st year dental students. The photographs were also graded using Thylstrup-Fejerskov index (TFI. Inter and intra-examiner reliability testing and comparison of specialists′ and 1 st year students′ perceptions were made using intra-class correlation coefficient (ICC. Mean VAS values were validated against TFI scores using Spearman correlation coefficient. Results: A VAS for dental fluorosis was constructed with the help of mean values of subjective ratings for each of the photographs. Inter-rater reliability was found to be moderate (ICC = 0.59. The intra-rater reliability for both the lay group (ICC range: 0.91-0.98 and dental specialists′ group (ICC range: 0.87-0.98 was found to be excellent. High correlation was obtained between lay group′s and dental specialists′ perception of fluorosis on VAS (r = 0.93. VAS scores correlated moderately with TFI scores for the photographs (Spearman′s ρ = 0.725. All the findings were found to be statistically significant (P < 0.05. Conclusion: The developed VAS was found to be a valid and reliable tool to assess dental fluorosis. Validation of the scale on a larger sample from the general population is recommended before it is applied in clinical and epidemiological settings.

  7. Psychometric properties of the foot and ankle outcome score in a community-based study of adults with and without osteoarthritis

    DEFF Research Database (Denmark)

    Golightly, Yvonne M; Devellis, Robert F; Nelson, Amanda E

    2014-01-01

    examined psychometric properties of the Foot and Ankle Outcome Score (FAOS) subscales (pain, other symptoms, activities of daily living [ADL], sport and recreational function [Sport/Recreation], and foot and ankle related quality of life [QOL]) in a large, community-based sample of African American...

  8. Psychometric properties of the Turkish versions of the Drug Use Disorders Identification Test (DUDIT) and the Drug Abuse Screening Test (DAST-10) in the prison setting.

    Science.gov (United States)

    Evren, Cuneyt; Ogel, Kultegin; Evren, Bilge; Bozkurt, Muge

    2014-01-01

    The aim of this study was to evaluate psychometric properties of the Drug Use Disorders Identification Test (DUDIT) and the Drug Abuse Screening Test (DAST-10) in prisoners with (n = 124) or without (n = 78) drug use disorder. Participants were evaluated with the DUDIT, the DAST-10, and the Addiction Profile Index-Short (API-S). The DUDIT and the DAST-10 were found to be psychometrically sound drug abuse screening measures with high convergent validity when compared with each other (r = 0.86), and API-S (r = 0.88 and r = 0.84, respectively), and to have a Cronbach's α of 0.93 and 0.87, respectively. In addition, a single component accounted for 58.28% of total variance for DUDIT, whereas this was 47.10% for DAST-10. The DUDIT had sensitivity and specificity scores of 0.95 and 0.79, respectively, when using the optimal cut-off score of 10, whereas these scores were 0.88 and 0.74 for the DAST-10 when using the optimal cut-off score of 4. Additionally, both the DUDIT and the DAST-10 showed good discriminant validity as they differentiated prisoners with drug use disorder from those without. Findings support the Turkish versions of both the DUDIT and the DAST-10 as reliable and valid drug abuse screening instruments that measure unidimensional constructs.

  9. Validating the Interpretations and Uses of Test Scores

    Science.gov (United States)

    Kane, Michael T.

    2013-01-01

    To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…

  10. Development and psychometric properties of the Basic Amputee Mobility Score for use in patients with a major lower extremity amputation.

    Science.gov (United States)

    Kristensen, Morten Tange; Nielsen, Anni Østergaard; Topp, Ulla Madsen; Holmehave-Brandt, Jakob; Petterson, Charlotte Falkenberg; Gebuhr, Peter

    2017-08-31

    To develop and examine the psychometric properties, including responsiveness and interrater reliability, of a new outcome measure for the evaluation of basic mobility activities after a major lower extremity amputation - The Basic Amputee Mobility Score (BAMS). The four following essential activities were chosen through consensus meetings with experienced amputee physiotherapists: (i) supine in bed to sitting on the edge of the bed; (ii) bed to wheelchair transfer; (iii) indoor wheelchair mobility; and (iv) get up from a wheelchair to standing on the non-amputated leg. Each activity is scored from 0 to 2 (0 = not able to; 1 = able to with assistance/guiding; and 2 = independent), and cumulated to a 1-day BAMS score of 0-8. Validity and responsiveness were established in 106 consecutive in-hospital patients with a major dysvascular lower extremity amputation, while reliability and agreement were examined in an additional sample of 30 patients. The 30-day mortality risk was reduced by 88% (HR = 0.12, 95% CI 0.02-0.68) for those out of bed (BAMS ≥2 points) at the first physiotherapy assessment, while BAMS scores improved between the first and the discharge assessment, with a standardized response mean of 1.3. Reliability assessments resulted in a weighted Kappa value of 0.98, a standard error of measurement of 0.32 and a minimal detectable change of 0.89 points. No systematic between-rater bias was seen (P = 0.3). The BAMS was feasible in all patients, and showed a large responsiveness, excellent interrater reliability and with a change of 1 point indicating a real change in performances. Geriatr Gerontol Int 2017; ••: ••-••. © 2017 Japan Geriatrics Society.

  11. Applicability of the Classroom Assessment Scoring System in Chinese Preschools Based on Psychometric Evidence

    Science.gov (United States)

    Hu, Bi Ying; Fan, Xitao; Gu, Chuanhua; Yang, Ning

    2016-01-01

    Research Findings: This study examined the applicability of the Classroom Assessment Scoring System (CLASS) Pre-K (Pianta, La Paro, & Hamre, 2008) and its underpinning framework of teaching through interactions in typical Chinese kindergarten classrooms. A sample of 180 kindergarten classrooms in China was selected, and the CLASS was used to…

  12. Psychometric Properties of Scores on a New Measure of Psychological Type.

    Science.gov (United States)

    Vacha-Haase, Tammi; Thompson, Bruce

    Instruments measuring Carl Jung's (1921/1971) theory of psychological types have been widely used in various counseling contexts. The most popular measure of types has been the Myers-Briggs Type Indicator (K. Briggs and I. Myers). This measure has been criticized for dichotomous scoring, forced-choice response formats, and differential gender…

  13. Sensitivity and validity of psychometric tests for assessing driving impairment: effects of sleep deprivation.

    Directory of Open Access Journals (Sweden)

    Stefan Jongen

    Full Text Available To assess drug induced driving impairment, initial screening is needed. However, no consensus has been reached about which initial screening tools have to be used. The present study aims to determine the ability of a battery of psychometric tests to detect performance impairing effects of clinically relevant levels of drowsiness as induced by one night of sleep deprivation.Twenty four healthy volunteers participated in a 2-period crossover study in which the highway driving test was conducted twice: once after normal sleep and once after one night of sleep deprivation. The psychometric tests were conducted on 4 occasions: once after normal sleep (at 11 am and three times during a single night of sleep deprivation (at 1 am, 5 am, and 11 am.On-the-road driving performance was significantly impaired after sleep deprivation, as measured by an increase in Standard Deviation of Lateral Position (SDLP of 3.1 cm compared to performance after a normal night of sleep. At 5 am, performance in most psychometric tests showed significant impairment. As expected, largest effect sizes were found on performance in the Psychomotor Vigilance Test (PVT. Large effects sizes were also found in the Divided Attention Test (DAT, the Attention Network Test (ANT, and the test for Useful Field of View (UFOV at 5 and 11 am during sleep deprivation. Effects of sleep deprivation on SDLP correlated significantly with performance changes in the PVT and the DAT, but not with performance changes in the UFOV.From the psychometric tests used in this study, the PVT and DAT seem most promising for initial evaluation of drug impairment based on sensitivity and correlations with driving impairment. Further studies are needed to assess the sensitivity and validity of these psychometric tests after benchmark sedative drug use.

  14. Facilitating the Interpretation of English Language Proficiency Scores: Combining Scale Anchoring and Test Score Mapping Methodologies

    Science.gov (United States)

    Powers, Donald; Schedl, Mary; Papageorgiou, Spiros

    2017-01-01

    The aim of this study was to develop, for the benefit of both test takers and test score users, enhanced "TOEFL ITP"® test score reports that go beyond the simple numerical scores that are currently reported. To do so, we applied traditional scale anchoring (proficiency scaling) to item difficulty data in order to develop performance…

  15. Secondary Psychometric Examination of the Dimensional Obsessive-Compulsive Scale: Classical Testing, Item Response Theory, and Differential Item Functioning.

    Science.gov (United States)

    Thibodeau, Michel A; Leonard, Rachel C; Abramowitz, Jonathan S; Riemann, Bradley C

    2015-12-01

    The Dimensional Obsessive-Compulsive Scale (DOCS) is a promising measure of obsessive-compulsive disorder (OCD) symptoms but has received minimal psychometric attention. We evaluated the utility and reliability of DOCS scores. The study included 832 students and 300 patients with OCD. Confirmatory factor analysis supported the originally proposed four-factor structure. DOCS total and subscale scores exhibited good to excellent internal consistency in both samples (α = .82 to α = .96). Patient DOCS total scores reduced substantially during treatment (t = 16.01, d = 1.02). DOCS total scores discriminated between students and patients (sensitivity = 0.76, 1 - specificity = 0.23). The measure did not exhibit gender-based differential item functioning as tested by Mantel-Haenszel chi-square tests. Expected response options for each item were plotted as a function of item response theory and demonstrated that DOCS scores incrementally discriminate OCD symptoms ranging from low to extremely high severity. Incremental differences in DOCS scores appear to represent unbiased and reliable differences in true OCD symptom severity.

  16. D-Catch instrument : development and psychometric testing of a measurement instrument for nursing documentation in hospitals

    NARCIS (Netherlands)

    Paans, Wolter; Sermeus, Walter; Nieweg, Roos M. B.; van der Schans, Cees P.

    2010-01-01

    P>Title. D-Catch instrument: development and psychometric testing of a measurement instrument for nursing documentation in hospitals. Aim. This paper is a report of the development and testing of the psychometric properties of an instrument to measure the accuracy of nursing documentation in general

  17. Development and Psychometric Properties of the Taiwan Odd-Even Number Sequencing Test: A Nonalphabetic Measure of Working Memory.

    Science.gov (United States)

    Chen, Yen-Ting; Peng, Chung-Yu; Hua, Mau-Sun; Liu, Chen-Chung; Chen, Hsin-Yi; Hwu, Hai-Gwo

    2016-05-09

    Alphabetic working memory (WM) tests, such as the Wechsler Adult Intelligence Scale-III and IV Letter Number Sequencing, are not appropriate for nonalphabetic cultures. This study examined the psychometric properties of the Taiwan Odd-Even Number Sequencing Test (TOENST) and identified representative norms. The TOENST and other mental screening tasks were administered to 300 randomly selected healthy participants, 32 purposive sampling patients with schizophrenia, and 32 quota sampling controls. To investigate reliability and validity, a subset of the 300 healthy participants was randomly selected to receive a second TOENST (n = 30) or conventional WM tests (n = 42). The split-half reliability of the TOENST ranged from 0.69 to 0.95, and its test-retest reliability was 0.75. Criterion validity was demonstrated by significant correlations with conventional WM measures (all p < .05, except semantic verbal fluency), and construct validity was demonstrated by significant correlations with aging (main effect, F10,259 = 10.99, p < .001). Normative data were established, and performance was significantly associated with age and education. TOENST scores of patients with schizophrenia were significantly lower and correlated with frontal lobe tests, but not demographical or clinical characteristics. The TOENST has adequate psychometric properties and clinical utility and is as a viable alternative WM task for nonalphabetic cultures.

  18. Psychometric properties of the NOMO 1.0 tested among adult powered-mobility users

    DEFF Research Database (Denmark)

    Sund, Terje; Brandt, Åse; Anttila, Heidi

    2017-01-01

    (Participation Repertoire). PURPOSE: This study aimed to investigate a range of psychometric properties of the NOMO 1.0 in a sample of adult powered mobility device (PMD) users. METHOD: Data collected from PMD users ( N = 248) in Denmark, Finland, and Norway as part of a larger study were analyzed using state...... scale and six components of the Frequency scale. IMPLICATIONS: The NOMO 1.0 should be used for research purposes and not for clinical practice. Better reliability should be established for the Need for Assistance and Ease/Difficulty scales prior to further psychometric testing to establish the validity...

  19. A Psychometric Study of the Kinetic-House-Tree-Person Scoring System for People with Psychiatric Disorders in Taiwan

    Directory of Open Access Journals (Sweden)

    Chih-Ying Li

    2014-06-01

    Conclusion: The KHTP scoring system has acceptable construct validity, inter-rater reliability, and test–retest reliability. Because drawing tests have the advantage of expressing nonverbal characteristics, the scoring system should prove to be very useful for those who are unwilling or unable to communicate verbally. This study therefore provides valuable information for clinical application, particularly for the psychiatric rehabilitation professions.

  20. A Procedure for Linear Polychotomous Scoring of Test Items

    Science.gov (United States)

    1993-10-01

    associated with the response categories of test items . When tests are scored using these scoring weights, test reliability increases. The new procedure is...program POLY. The example demonstrates how polyweighting can be used to calibrate and score test items drawn from an item bank that is too large to

  1. Psychometric Properties of the Iranian Version of the Diabetes Numeracy Test-15.

    Science.gov (United States)

    Moradi, Yousef; Baradaran, Hamid Reza; Khamseh, Mohammad Ebrahim

    2016-01-01

    Low health literacy (HL) of patients has obtained more attention as a risk factor for poor adherence to treatment and adverse outcomes in chronic disease's management particular in diabetes care. Diabetes Numeracy Test-15 (DNT-15) has been developed specifically for this purpose. The objective of the current study is to evaluate psychometric properties of Iranian (Persian) version of the DNT-15. The shortened version of the DNT (15-items) was completed by 120 patients with diabetes. The Kuder-Richardson Formula 20 for internal consistency was conducted. Content validity, criterion-related validity, and construct validity were also evaluated. The average score on the DNT was 72% and took an average of 25 minutes to complete. The DNT-15 had a very good internal reliability (KR-20 = 0.90) and also content validity (content validity ratio: 089 and content validity index: 0.86). The DNT-15 (Persian version) is a reliable and valid measure of diabetes-related numeracy skills for Iranian patients with diabetes; however, additional studies are needed to further explore the association between diabetes-specific numeracy and acculturation and their impact on diabetes-related outcomes in Iranian population.

  2. Questionnaire for measuring organisational attributes in dental-care practices: psychometric properties and test-retest reliability.

    Science.gov (United States)

    Goetz, Katja; Hasse, Philipp; Szecsenyi, Joachim; Campbell, Stephen M

    2016-04-01

    The consideration of organisational aspects, such as shared goals and clear communication, within the health care team is important to ensure good quality care. In primary health care, the instrument Survey of Organizational Attributes for Primary Care (SOAPC) is available to measure organisational attributes of care. However, there is no instrument available for dental care. The aim of the present study was to investigate psychometric properties and test-retest reliability of the version of SOAPC adapted for dental care, namely the Survey of Organizational Attributes in Dental Care (SOADC). The SOADC consists of 21 items in the following four subscales: communication; decision making; stress/chaos; and history of change. Convergent construct validity was measured using the job satisfaction scale. A total of 287 dental-care practices were asked to participate in the validation study. Psychometric properties and test-retest reliability were observed. A total of 43 dental-care practices responded to the survey. At baseline, 178 dental-care staff completed the questionnaire, and 4 weeks later 138 did so. Internal consistency, measured by Cronbach's alpha, was 0.718 or higher in the subscales. The test-retest reliability for each subscale and the overall SOADC score demonstrated good correlations over the 4-week test-retest interval, except for 'history of change'. A strong correlation with the aggregated job-satisfaction scale showed high convergent construct validity of SOADC. The consideration of organisational aspects from the perspective of dental-care teams is important for providing good quality of care. The SOADC is a reliable instrument with good psychometric properties and is suitable for the evaluation of organisational attributes in dental-care practices. © 2015 FDI World Dental Federation.

  3. Psychometric Testing of the Chinese Version of the Decisional Balance Scale (CDBS)

    Science.gov (United States)

    Chen, Huey-Shys; Sheu, Jiunn-Jye; Chen, W. William

    2006-01-01

    The purpose of this study was to conduct psychometric testing on the Chinese version of the decisional balance scale (CDBS) with Taiwanese seventh, eighth, and ninth graders who were recruited from the Taipei metropolitan area. A random cluster sampling method was used with 554 adolescents between the ages of 13 and 17 years. Factor analysis…

  4. Psychometric Evaluation of the Italian Adaptation of the Test of Inferential and Creative Thinking

    Science.gov (United States)

    Faraci, Palmira; Hell, Benedikt; Schuler, Heinz

    2016-01-01

    This article describes the psychometric properties of the Italian adaptation of the "Analyse des Schlussfolgernden und Kreativen Denkens" (ASK; Test of Inferential and Creative Thinking) for measuring inferential and creative thinking. The study aimed to (a) supply evidence for the factorial structure of the instrument, (b) describe its…

  5. Development and Psychometric Testing of the Dogs and WalkinG Survey (DAWGS)

    Science.gov (United States)

    Richards, Elizabeth A.; McDonough, Meghan H.; Edwards, Nancy E.; Lyle, Roseann M.; Troped, Philip J.

    2013-01-01

    Purpose: Dog owners represent 40% of the population, a promising audience to increase population levels of physical activity. The purpose of this study was to develop and test the psychometric properties of a new instrument to assess social-cognitive theory constructs related to dog walking. Method: Dog owners ("N" = 431) completed the…

  6. Psychometric Characteristics of the Boder Test of Reading-Spelling Patterns: Take One Giant Step Backwards.

    Science.gov (United States)

    Reynolds, Cecil R.

    1984-01-01

    The Boder Test of Reading-Spelling Patterns is designed as an assessment of reading and spelling skills that allows specific diagnosis of the source and typology of reading problems. From a purely psychometric perspective, the BTRSP fails on virtually every characteristic examined. (BW)

  7. Psychometric Evaluation of the Italian Adaptation of the Test of Inferential and Creative Thinking

    Science.gov (United States)

    Faraci, Palmira; Hell, Benedikt; Schuler, Heinz

    2016-01-01

    This article describes the psychometric properties of the Italian adaptation of the "Analyse des Schlussfolgernden und Kreativen Denkens" (ASK; Test of Inferential and Creative Thinking) for measuring inferential and creative thinking. The study aimed to (a) supply evidence for the factorial structure of the instrument, (b) describe its…

  8. Consistency and Raw Scores Survive Another Test: A Last Response to Prediger and His Colleagues

    Science.gov (United States)

    Holland, John L.

    1976-01-01

    Prediger confuses observations about the data with Holland's theoretical statement, performs some uninterpretable analyses, omits much relevant data, and provides an incomplete account of what psychometric authorities have said about raw scores in interest inventories. (Author)

  9. Psychometric issues and test quality in the Netherlands

    NARCIS (Netherlands)

    Egberink, Iris; Hurks, Petra

    2016-01-01

    The Dutch Committee on Tests and Testing (COTAN) aims at improving test use and test quality in the Netherlands. Improving test use is done by raising standards in the use of psychological tests. Improving test quality is done by assessing the quality of different psychological instruments that are

  10. Psychometric issues and test quality in the Netherlands

    NARCIS (Netherlands)

    Egberink, I.J.L.; Hurks, P.P.M.

    The Dutch Committee on Tests and Testing (COTAN) aims at improving test use and test quality in the Netherlands. Improving test use is done by raising standards in the use of psychological tests. Improving test quality is done by assessing the quality of different psychological instruments that are

  11. The Test Score Decline: A Review and Annotated Bibliography

    Science.gov (United States)

    1981-08-01

    J.R., The Test Score Decline: Are the Public Schools the Scapegoat? Part Two =129. K%’apfer. P., Kapfer , M., & Woodruff, A., Declining Test Scores...Michigan State University, August 1976. 129. Kapfer , P.F., Kapfer , M.B., & Woodruff, A.D., Declining test scores: Inter- pretations, issues, and relationship

  12. Psychometric properties of the Spanish version of the Jefferson Scale of Empathy: making sense of the total score through a second order confirmatory factor analysis.

    Science.gov (United States)

    Ferreira-Valente, Alexandra; Costa, Patrício; Elorduy, Marta; Virumbrales, Montserrat; Costa, Manuel J; Palés, Jorge

    2016-09-19

    Empathy is a key aspect of the physician-patient interactions. The Jefferson Scale of Empathy (JSE) is one of the most used empathy measures of medical students. The development of cross-cultural empathy studies depends on valid and reliable translations of the JSE. This study sought to: (1) adapt and assess the psychometric properties in Spanish students of the Spanish JSE validated in Mexican students; (2) test a second order latent factor model. The Spanish JSE was adapted from the Spanish JSE-S, resulting in a final version of the measure. A non-probabilistic sample of 1104 medical students of two Spanish medical schools completed a socio-demographic and the Spanish JSE-S. Descriptive statistics, along with a confirmatory factor analysis, the average variance extracted (AVE), Cronbach's alphas and composite reliability (CR) coefficients were computed. An independent samples t-test was performed to access sex differences. The Spanish JSE-S demonstrated acceptable to good sensitivity (individual items - except for item 2 - and JSE-S total score: -2.72 factor analysis supported the three-factor solution and the second order latent factor model. The findings provide support for the sensitivity, construct validity and reliability of the adapted Spanish JSE-S with Spanish medical students. Data confirm the hypothesized second order latent factor model. This version may be useful in future research examining empathy in Spanish medical students, as well as in cross-cultural studies.

  13. Psychometric and psychological effects of review on computerized fixed and adaptive tests

    OpenAIRE

    Olea Díaz, Julio; Revuelta Menéndez, Javier; Ximénez, Carmen; Abad,Francisco José

    2000-01-01

    Two computerized versions of an English vocabulary test for Spanish speakers (an adaptive and a fixed one) were applied in a Spanish sample of first-year psychology undergraduate students. The effects of test type (computer-adaptive vs. computerized-fixed) and review condition (allowed vs. not allowed) on several psychological variables were examined. Withinsubject variables were measured both before and after review to study the effects of review on the psychological and psychometric variabl...

  14. [Internal structure and standardised scores of the Torrance Test of Creative Thinking].

    Science.gov (United States)

    Ferrando, Mercedes; Ferrándiz, Carmen; Bermejo, María R; Sánchez, Cristina; Parra, Joaquín; Prieto, María D

    2007-08-01

    The present work sets out to study the internal structure of the Torrance Test of Creative Thinking (TTCT) and to establish standardised scores that will enable the test to be used in both a diagnostic and educational context. 649 students (319 girls and 330 boys), aged 5 to 12 years from various schools in Murcia and Alicante (SE Spain), took part in the study. The findings suggest that the psychometric characteristics of TTCT are satisfactory, and its internal structure can be attributed to three factors that are responsible for a high percentage of the variance (73.8%). The standardised score tables, which are provided for first time in this context, will be useful in the evaluation of creativity and the identification of students with high intellectual abilities.

  15. The development and psychometric testing of the geriatric health promotion scale.

    Science.gov (United States)

    Wang, Jeng; Lee, Chia-Mou; Chang, Chong-Fang; Jane, Sui-Whi; Chen, Mei-Yen

    2015-03-01

    As life expectancy increases, being old is no longer synonymous with frailty, weakness, or poor health. Today, most people expect to maintain or promote their health by adopting healthy lifestyle habits. This study developed and tested the psychometric Geriatric Health Promotion (GHP) scale. The GHP scale is a multidimensional instrument that attempts to evaluate lifestyle patterns prevalent among senior citizens in Taiwan. Data used in this study were acquired from a large health-promotion study that targeted elderly residents of rural areas in Taiwan. All subjects were aged 65 years or older and resided in the four communities with the highest mean age ratios in the nation. Trained research assistants assisted the participants to answer the research instrument questions. A total of 520 participants over 65 years of age (mean = 75.02 ± 6.04) were enrolled in this study. The initial draft of the GHP scale included 33 items. After conducting the item analysis, correlation analysis, and factor analysis, the final draft of the GHP was reduced to 22 items. A six-factor solution with Varimax rotation offered the most parsimonious model, with health habits, community participation, health responsibilities, healthy diet, regular exercise, and oral health, explaining 68% of the total variance. The Cronbach's alpha for the developed model scored as high as .94, with a test-retest reliability of .72 (p adults. The GHP scale is easily administered in less than 10 minutes. We recommend that healthcare providers use the GHP scale to evaluate the effectiveness of health promotion programs targeting older adults.

  16. Nontraditional Scoring of C-tests

    CERN Document Server

    Tamara, Tretjakova

    2007-01-01

    In C-tests the hypothesis of items local independence is violated, which doesn't permit to consider them as real tests. It is suggested to determine the distances between separate C-test items (blanks) and to combine items into clusters. Weights, inversely proportional to the number of items in corresponding clusters, are assigned to items. As a result, the C-test structure becomes similar to the structure of classical tests, without violation of local independence hypothesis.

  17. Development and psychometric properties of a health knowledge test on six chronic conditions.

    Science.gov (United States)

    Gellert, Paul; Detel, Sibylle; Ernsting, Clemens; Oedekoven, Monika; Kuhlmey, Adelheid

    2016-12-01

    The aim of this study was to develop and test the psychometric properties of a health knowledge test on common chronic conditions in the general population. Operationalization based on a facet design led to 108 knowledge items on six conditions: cardiovascular disease, cancer, respiratory diseases, musculoskeletal system conditions, depression, and chronic pain. We refined the items (qualitative study 1; N=20) and selected the items by applying a mixed Rasch model (study 2; N=861). The psychometric properties (Study 3; N=4144) of the remaining 24 items were tested using exploratory (split sample N=2110) and confirmatory factor analyses (split sample N=2034). 108 items were refined within study 1, 24 of which were selected in study 2. In study 3, a general health knowledge factor was confirmed based on six subscales on specific conditions. Convergent validity was confirmed by the overlap of health knowledge with education and perceived health knowledge. The development and evaluation of the psychometric properties of a health knowledge test on six common conditions will improve future research on health knowledge. Chronic conditions present a challenge; assessing the level of health knowledge is the first step to prevent and to cope with these conditions. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  18. Emotional intelligence: Part 1: the development of scales and psychometric testing.

    Science.gov (United States)

    Akerjordet, Kristin; Severinsson, Elisabeth

    2009-03-01

    This article, the first in a series of four, describes the development of two scales for deductive and inductive measurement of emotional intelligence (EI), based on the literature and the identification of the psychometric properties of the scales. The data collection comprised two parts: (i) a literature search on the subject of emotional intelligence; and (ii) psychometric testing of the scales. The Emotional Intelligence Scale, comprising 23 items, and the Emotional Reactions and Thoughts Scale, containing 25 items, were tested on a sample of 250 postnatal mothers. The response rate was 80%. An explorative factor analysis was used to investigate the construct validity of the underlying dimensions of emotional intelligence and yielded a three-factor solution for the Emotional Intelligence Scale and a four-factor solution for the Emotional Reactions and Thoughts Scale. The internal consistency of the scales was satisfactory. How well the factor solutions fit in clinical practice remains to be validated.

  19. A NOTE ON INCONSISTENCY OF THE SCORE TEST

    Directory of Open Access Journals (Sweden)

    Sumathi K

    2010-12-01

    Full Text Available The score test proposed by Rao (1947 has been widely used in the recent years for data analysis and model building because of its simplicity. However, at the time of its computation, it has been found that the value of the score test statistic becomes negative. Freedman (2007 discussed some of the theoretical reasons for this inconsistency of the score test and observed that the test was inconsistent when the observed Fisher information matrix was used rather than the expected Fisher information matrix. The present paper is an attempt to demonstrate the inconsistency of the score test in terms of the power function.

  20. Psychometric Properties of the Iranian Version of the Diabetes Numeracy Test-15

    OpenAIRE

    Yousef Moradi; Hamid Reza Baradaran; Mohammad Ebrahim Khamseh

    2016-01-01

    Background: Low health literacy (HL) of patients has obtained more attention as a risk factor for poor adherence to treatment and adverse outcomes in chronic disease′s management particular in diabetes care. Diabetes Numeracy Test-15 (DNT-15) has been developed specifically for this purpose. The objective of the current study is to evaluate psychometric properties of Iranian (Persian) version of the DNT-15. Methods: The shortened version of the DNT (15-items) was completed by 120 patients...

  1. The Nurse–Nurse Collaboration Behavior Scale: Development and psychometric testing

    Directory of Open Access Journals (Sweden)

    Chunli Liao

    2015-12-01

    Conclusion: The NNCB Scale developed in this study demonstrates acceptable reliability and validity for measuring the level of NNCB. Its implementration on a broader scale would at the very least guide and promote collaborative relationships between nurses involved in patient care. It should be noted that the scale requires further psychometric testing using a larger sample size of nurses who also represent a wider diversity of backgrounds, as well as researchers who are encouraged to improve the instrument.

  2. THE USAGE OF PSYCHOMETRIC TEST: ¿AN ABSURD IN AN EST?

    Directory of Open Access Journals (Sweden)

    LUZ MERY CARVAJAL MARÍN

    2007-01-01

    Full Text Available This article, presents an analysis on the usage of psychometric tests in a company of temporary services (EST ofBogota, Colombia, based on the socialconstructionist theory. The investigation give the evidence of the existence ofa gap among the Psychology as a discipline and a profession, as well as a displacement of the knowledge of thescientific Psychology towards the psychology of the common sense, as an answer from the organizational psychologiststo the logics of the market.

  3. Psychometric Study of the Raven Progressive Matrices Tests in elementary students from Lima

    OpenAIRE

    Delgado Vásquez, Ana; Facultad de Psicología, Universidad Nacional Mayor de San Marcos, Lima, Perú; Escurra Mayaute, Luis M.; Facultad de Psicología, Universidad Nacional Mayor de San Marcos, Lima, Perú; Bulnes Bedón, Mario; Facultad de Psicología, Universidad Nacional Mayor de San Marcos, Lima, Perú; Quesada Murillo, María Rosario; Facultad de Psicología, Universidad Nacional Mayor de San Marcos, Lima, Perú

    2014-01-01

    It was made a psychometric study of Raven Coloured Progressive Matrices Test, which is a non verbal instrument that evaluates abilities related to make comparisons, thinking for analogy and spatial perceptions organization. It was designed mainly as a measure of factor "g" of Spearman or general intelligence, composed by two abilities, named educational and reproductive. The participants were 2496 students obtained at random considering all school districts (USE) of the city of Lima, Peru. Th...

  4. Applications of the Linear Logistic Test Model in Psychometric Research

    Science.gov (United States)

    Kubinger, Klaus D.

    2009-01-01

    The linear logistic test model (LLTM) breaks down the item parameter of the Rasch model as a linear combination of some hypothesized elementary parameters. Although the original purpose of applying the LLTM was primarily to generate test items with specified item difficulty, there are still many other potential applications, which may be of use…

  5. Measuring benefits and patients' satisfaction when glasses are not needed after cataract and presbyopia surgery: scoring and psychometric validation of the Freedom from Glasses Value Scale (FGVS©)

    Science.gov (United States)

    2010-01-01

    Background The purpose of this study was to reduce the number of items, create a scoring method and assess the psychometric properties of the Freedom from Glasses Value Scale (FGVS), which measures benefits of freedom from glasses perceived by cataract and presbyopic patients after multifocal intraocular lens (IOL) surgery. Methods The 21-item FGVS, developed simultaneously in French and Spanish, was administered by phone during an observational study to 152 French and 152 Spanish patients who had undergone cataract or presbyopia surgery at least 1 year before the study. Reduction of items and creation of the scoring method employed statistical methods (principal component analysis, multitrait analysis) and content analysis. Psychometric properties (validation of the structure, internal consistency reliability, and known-group validity) of the resulting version were assessed in the pooled population and per country. Results One item was deleted and 3 were kept but not aggregated in a dimension. The other 17 items were grouped into 2 dimensions ('global evaluation', 9 items; 'advantages', 8 items) and divided into 5 sub-dimensions, with higher scores indicating higher benefit of surgery. The structure was validated (good item convergent and discriminant validity). Internal consistency reliability was good for all dimensions and sub-dimensions (Cronbach's alphas above 0.70). The FGVS was able to discriminate between patients wearing glasses or not after surgery (higher scores for patients not wearing glasses). FGVS scores were significantly higher in Spain than France; however, the measure had similar psychometric performances in both countries. Conclusions The FGVS is a valid and reliable instrument measuring benefits of freedom from glasses perceived by cataract and presbyopic patients after multifocal IOL surgery. PMID:20497555

  6. Measuring benefits and patients' satisfaction when glasses are not needed after cataract and presbyopia surgery: scoring and psychometric validation of the Freedom from Glasses Value Scale (FGVS©

    Directory of Open Access Journals (Sweden)

    Arnould Benoit

    2010-05-01

    Full Text Available Abstract Background The purpose of this study was to reduce the number of items, create a scoring method and assess the psychometric properties of the Freedom from Glasses Value Scale (FGVS, which measures benefits of freedom from glasses perceived by cataract and presbyopic patients after multifocal intraocular lens (IOL surgery. Methods The 21-item FGVS, developed simultaneously in French and Spanish, was administered by phone during an observational study to 152 French and 152 Spanish patients who had undergone cataract or presbyopia surgery at least 1 year before the study. Reduction of items and creation of the scoring method employed statistical methods (principal component analysis, multitrait analysis and content analysis. Psychometric properties (validation of the structure, internal consistency reliability, and known-group validity of the resulting version were assessed in the pooled population and per country. Results One item was deleted and 3 were kept but not aggregated in a dimension. The other 17 items were grouped into 2 dimensions ('global evaluation', 9 items; 'advantages', 8 items and divided into 5 sub-dimensions, with higher scores indicating higher benefit of surgery. The structure was validated (good item convergent and discriminant validity. Internal consistency reliability was good for all dimensions and sub-dimensions (Cronbach's alphas above 0.70. The FGVS was able to discriminate between patients wearing glasses or not after surgery (higher scores for patients not wearing glasses. FGVS scores were significantly higher in Spain than France; however, the measure had similar psychometric performances in both countries. Conclusions The FGVS is a valid and reliable instrument measuring benefits of freedom from glasses perceived by cataract and presbyopic patients after multifocal IOL surgery.

  7. Pictures Speak Louder than Test Scores.

    Science.gov (United States)

    McCabe, Deborah; Hilmo, Joellen

    1985-01-01

    The Goodenough-Harris Draw-a-Person Test, if given at regular intervals during periods of remediation, may show clear evidence of improvement in behavior and attitude of learning disabled students. (CL)

  8. Psychometric Testing of the FACES III with Rural Adolescents

    Science.gov (United States)

    Ide, Bette; Dingmann, Colleen; Cuevas, Elizabeth; Meehan, Maurita

    2010-01-01

    This study tests the validity and reliability of the Family Adaptability and Cohesion Scale III (FACES III) in two samples of rural adolescents. The underlying theory is the linear 3-D circumplex model. The FACES III was administered to 1,632 adolescents in Grades 7 through 12 in two counties in a rural western state. The FACES III Scale and the…

  9. Video game addiction test: validity and psychometric characteristics.

    NARCIS (Netherlands)

    Rooij, A.J. van; Schoenmakers, T.M.; Eijnden, R.J.J.M. van den; Vermulst, A.A.; Mheen, D. van de

    2012-01-01

    The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study

  10. Video Game Addiction Test: Validity and Psychometric Characteristics

    NARCIS (Netherlands)

    Rooij, A.J. van; Schoenmakers, T.M.; Eijnden, R.J.J.M. van den; Vermulst, A.A.; Mheen, H. van de

    2012-01-01

    The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study

  11. Video game addiction test: validity and psychometric characteristics.

    NARCIS (Netherlands)

    Rooij, A.J. van; Schoenmakers, T.M.; Eijnden, R.J.J.M. van den; Vermulst, A.A.; Mheen, D. van de

    2012-01-01

    The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study

  12. Video Game Addiction Test: Validity and Psychometric Characteristics

    NARCIS (Netherlands)

    Rooij, A.J. van; Schoenmakers, T.M.; Eijnden, R.J.J.M. van den; Vermulst, A.A.; Mheen, H. van de

    2012-01-01

    The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study

  13. Psychometric Properties of the Alcohol Use Disorders Identification Test-Consumption (AUDIT-C) in Public First Responders.

    Science.gov (United States)

    Jeong, Hyeonseok S; Park, Shinwon; Lim, Soo Mee; Ma, Jiyoung; Kang, Ilhyang; Kim, Jungyoon; Kim, Eui-Jung; Choi, Yejee J; Lim, Jae-Ho; Chung, Yong-An; Lyoo, In Kyoon; Yoon, Sujung; Kim, Jieun E

    2017-03-21

    Problematic alcohol consumption is prevalent among first responders because alcohol is commonly used to cope with occupational stress and frequent exposure to traumatic incidents, making them an at-risk population for alcohol use disorders (AUD). This study investigated the psychometric properties of the Korean version of the Alcohol Use Disorders Identification Test-Consumption (AUDIT-C) among public first responders. The Structured Clinical Interview for the Diagnostic and Statistical Manual of Mental Disorders-IV-Text Revision (SCID), AUDIT-C, AUDIT, and CAGE were administered to 222 public first responders, who were recruited by convenience sampling. One-week test-retest reliability was evaluated in a subsample (n = 24). Receiver operating characteristic (ROC) curve analyses were conducted to evaluate the diagnostic accuracy and estimate the optimal cut-off scores for any AUD and alcohol dependence. Three different analytic criteria were utilized to calculate the cut-off scores. The AUDIT-C demonstrated good test-retest reliability (intraclass correlation coefficient for test-retest reliability = 0.91) and satisfactory convergent validity. The areas under the ROC curves for any AUD and alcohol dependence of the AUDIT-C were 0.87 and 0.93, respectively. For any AUD, all three criteria suggested a cut-off score of 7.5 (sensitivity = 81.8%, specificity = 79.8%), whereas for alcohol dependence, a cut-off score of 8.5 (sensitivity = 85.7%, specificity = 86.1%) was derived from two criteria. In conclusion, the AUDIT-C demonstrated good reliability and validity and proved to be a brief and effective screening test for AUD among first responders.

  14. Development and psychometric testing of an instrument to measure safety climate perceptions in community pharmacy.

    Science.gov (United States)

    Newham, Rosemary; Bennie, Marion; Maxwell, David; Watson, Anne; de Wet, Carl; Bowie, Paul

    2014-12-01

    A positive and strong safety culture underpins effective learning from patient safety incidents in health care, including the community pharmacy (CP) setting. To build this culture, perceptions of safety climate must be measured with context-specific and reliable instruments. No pre-existing instruments were specifically designed or suitable for CP within Scotland. We therefore aimed to develop a psychometrically sound instrument to measure perceptions of safety climate within Scottish CPs. The first stage, development of a preliminary instrument, comprised three steps: (i) a literature review; (ii) focus group feedback; and (iii) content validation. The second stage, psychometric testing, consisted of three further steps: (iv) a pilot survey; (v) a survey of all CP staff within a single health board in NHS Scotland; and (vi) application of statistical methods, including principal components analysis and calculation of Cronbach's reliability coefficients, to derive the final instrument. The preliminary questionnaire was developed through a process of literature review and feedback. This questionnaire was completed by staff in 50 CPs from the 131 (38%) sampled. 250 completed questionnaires were suitable for analysis. Psychometric evaluation resulted in a 30-item instrument with five positively correlated safety climate factors: leadership, teamwork, safety systems, communication and working conditions. Reliability coefficients were satisfactory for the safety climate factors (α > 0.7) and overall (α = 0.93). The robust nature of the technical design and testing process has resulted in the development of an instrument with sufficient psychometric properties, which can be implemented in the community pharmacy setting in NHS Scotland. © 2014 John Wiley & Sons, Ltd.

  15. Improving Scores on the IELTS Speaking Test

    Science.gov (United States)

    Issitt, Steve

    2008-01-01

    This article presents three strategies for teaching students who are taking the IELTS speaking test. The first strategy is aimed at improving confidence and uses a variety of self-help materials from the field of popular psychology. The second encourages students to think critically and invokes a range of academic perspectives. The third strategy…

  16. Improving Scores on the IELTS Speaking Test

    Science.gov (United States)

    Issitt, Steve

    2008-01-01

    This article presents three strategies for teaching students who are taking the IELTS speaking test. The first strategy is aimed at improving confidence and uses a variety of self-help materials from the field of popular psychology. The second encourages students to think critically and invokes a range of academic perspectives. The third strategy…

  17. The Trait Structure of Cloze Test Scores.

    Science.gov (United States)

    Bachman, Lyle F.

    1982-01-01

    Presents study designed to examine trait structure of a cloze test using confirmatory factor analysis. Results suggest that a modified cloze passage, using rational deletions, is capable of measuring syntactic- and discourse-level relationships in a text, and this advantage may outweigh considerations of reduced redundancy which underlie random…

  18. Fuzzy Math: A Meditation on Test Scoring

    Science.gov (United States)

    Jacks, Meredith

    2011-01-01

    As a public school English teacher, the author observes standardized testing season each year with a sort of grim fascination. "So this is it," she thinks as she paces around her silent classroom, peering over kids' shoulders at articles about parasailing. Line graphs tracking the rainfall in Tulsa. Parts of speech. Functions of "x." "These are…

  19. Psychometric properties of the Knee injury and Osteoarthritis Outcome Score for Children (KOOS-Child) in children with knee disorders

    DEFF Research Database (Denmark)

    Ortqvist, Maria; Iversen, Maura D; Janarv, Per-Mats

    2014-01-01

    -Child was developed. This study aims to evaluate psychometric properties of the final KOOS-Child when used in children with knee disorders. METHODS: 115 children (boys/girls 51/64, 7-16 years) with knee disorders were recruited. All children (n=115) completed the KOOS-Child, the Child-Health Assessment Questionnaire...... better. CONCLUSIONS: The final KOOS-Child demonstrates good psychometric properties and supports the use of the KOOS-Child when evaluating children with knee disorders....

  20. Development and psychometric testing of the Suicide Caring Competence Scale (SCCS) for family caregivers in Taiwan.

    Science.gov (United States)

    Sun, Fan-Ko; Chiang, Chun-Ying; Chen, Wei-Jen; Wang, Ruey-Hsia; Huang, Hui-Man; Lin, Hung-Yen

    2014-08-01

    Suicide caring competence is important for family caregivers to care their relatives with suicidal tendencies. The purpose of this study was to develop and test the psychometric properties of the Suicide Caring Competence Scale (SCCS) for family caregivers in Taiwan. A 20-item SCCS was tested on 165 family caregivers. Confirmatory factor analysis indicated that five subscales with 19 items best fit the data. The Cronbach's α and test-retest reliability of the SCCS was 0.90 and 0.81, respectively. The SCCS demonstrated acceptable construct validity and reliability. Nurses can use the SCCS to assess the suicide caring competence of family caregivers.

  1. A Human Capital Model of Educational Test Scores

    DEFF Research Database (Denmark)

    McIntosh, James; D. Munk, Martin

    measure of pure cognitive ability. We find that variables which are not closely associated with traditional notions of intelligence explain a significant proportion of the variation in test scores. This adds to the complexity of interpreting test scores and suggests that school culture, attitudes...

  2. A Psychometric Approach to the Development of a 5E Lesson Plan Scoring Instrument for Inquiry-Based Teaching

    Science.gov (United States)

    Goldston, M. Jenice; Dantzler, John; Day, Jeanelle; Webb, Brenda

    2013-01-01

    This research centers on the psychometric examination of the structure of an instrument, known as the 5E Lesson Plan (5E ILPv2) rubric for inquiry-based teaching. The instrument is intended to measure an individual's skill in developing written 5E lesson plans for inquiry teaching. In stage one of the instrument's development, an exploratory…

  3. Video game addiction test: validity and psychometric characteristics.

    Science.gov (United States)

    van Rooij, Antonius J; Schoenmakers, Tim M; van den Eijnden, Regina J J M; Vermulst, Ad A; van de Mheen, Dike

    2012-09-01

    The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study used data (n=2,894) from a large-sample paper-and-pencil questionnaire study, conducted in 2009 on secondary schools in Netherlands. Thus, the main source of data was a large sample of schoolchildren (aged 13-16 years). Measurements included the proposed VAT, the Compulsive Internet Use Scale, weekly hours spent on various game types, and several psychosocial variables. The VAT demonstrated excellent reliability, excellent construct validity, a one-factor model fit, and a high degree of measurement invariance across gender, ethnicity, and learning year, indicating that the scale outcomes can be compared across different subgroups with little bias. In summary, the VAT can be helpful in the further study of video game addiction, and it contributes to the debate on possible inclusion of behavioral addictions in the upcoming DSM-V.

  4. Development and psychometric testing of the Visual Analogue Scale for Irritable Bowel Syndrome (VAS-IBS

    Directory of Open Access Journals (Sweden)

    Ohlsson Bodil

    2007-05-01

    Full Text Available Abstract Background The aim of this study was to develop and psychometrically test a short, patient-reported questionnaire to be used in clinical practice for patients with Irritable Bowel Syndrome (IBS. The Visual Analogue Scale for Irritable Bowel Syndrome (VAS-IBS questionnaire was designed to measure the treatment response of symptoms and well-being in patients suffering from IBS. Methods The VAS-IBS was psychometrically tested for content and criterion validity, scale acceptability, item-reduction, internal reliability consistency, simplicity, and speed. Two samples were used. One expert panel (five physicians and four registered nurses, who gave their opinion on the content validity, and one of 71 patients with IBS (mean age 38 years SD +13, range 19–65, who completed the VAS-IBS, as well as the Gastrointestinal Symptom Rating Scale and the Psychological General Well-Being Index for criterion validity. Results The items in the VAS-IBS capture the main physical concerns women with IBS might present and the psychometric testing confirmed that the VAS-IBS is an acceptable homogeneous patient-reported questionnaire indicated by Cronbach's alpha internal consistency reliability coefficient, with a value of 0.85. All correlations to test the criterion validity performed by using Pearson's correlation test, were statistically significant (p Conclusion The VAS-IBS appears to be reliable and user-friendly, for patients as well as for health professionals. The final version of the VAS-IBS including nine items needs to be further tested in clinical practice cross-culturally in women as well as in men.

  5. A call for policy guidance on psychometric testing in doping control in sport.

    Science.gov (United States)

    Petróczi, Andrea; Backhouse, Susan H; Barkoukis, Vassilis; Brand, Ralf; Elbe, Anne-Marie; Lazuras, Lambros; Lucidi, Fabio

    2015-11-01

    One of the fundamental challenges in anti-doping is identifying athletes who use, or are at risk of using, prohibited performance enhancing substances. The growing trend to employ a forensic approach to doping control aims to integrate information from social sciences (e.g., psychology of doping) into organised intelligence to protect clean sport. Beyond the foreseeable consequences of a positive identification as a doping user, this task is further complicated by the discrepancy between what constitutes a doping offence in the World Anti-Doping Code and operationalized in doping research. Whilst psychology plays an important role in developing our understanding of doping behaviour in order to inform intervention and prevention, its contribution to the array of doping diagnostic tools is still in its infancy. In both research and forensic settings, we must acknowledge that (1) socially desirable responding confounds self-reported psychometric test results and (2) that the cognitive complexity surrounding test performance means that the response-time based measures and the lie detector tests for revealing concealed life-events (e.g., doping use) are prone to produce false or non-interpretable outcomes in field settings. Differences in social-cognitive characteristics of doping behaviour that are tested at group level (doping users vs. non-users) cannot be extrapolated to individuals; nor these psychometric measures used for individual diagnostics. In this paper, we present a position statement calling for policy guidance on appropriate use of psychometric assessments in the pursuit of clean sport. We argue that, to date, both self-reported and response-time based psychometric tests for doping have been designed, tested and validated to explore how athletes feel and think about doping in order to develop a better understanding of doping behaviour, not to establish evidence for doping. A false 'positive' psychological profile for doping affects not only the individual

  6. Improving personality facet scores with multidimensional computer adaptive testing

    DEFF Research Database (Denmark)

    Makransky, Guido; Mortensen, Erik Lykke; Glas, Cees A W

    2013-01-01

    Narrowly defined personality facet scores are commonly reported and used for making decisions in clinical and organizational settings. Although these facets are typically related, scoring is usually carried out for a single facet at a time. This method can be ineffective and time consuming when...... personality tests contain many highly correlated facets. This article investigates the possibility of increasing the precision of the NEO PI-R facet scores by scoring items with multidimensional item response theory and by efficiently administering and scoring items with multidimensional computer adaptive...... testing (MCAT). The increase in the precision of personality facet scores is obtained from exploiting the correlations between the facets. Results indicate that the NEO PI-R could be substantially shorter without attenuating precision when the MCAT methodology is used. Furthermore, the study shows...

  7. [The methodological assessment and qualitative evaluation of psychometric performance tests based on the example of modern tests that assess reading and spelling skills].

    Science.gov (United States)

    Galuschka, Katharina; Rothe, Josefine; Schulte-Körne, Gerd

    2015-09-01

    This article looks at a means of objectively evaluating the quality of psychometric tests. This approach enables users to evaluate psychometric tests based on their methodological characteristics, in order to decide which instrument should be used. Reading and spelling assessment tools serve as examples. The paper also provides a review of German psychometric tests for the assessment of reading and spelling skills. This method facilitates the identification of psychometric tests.of high methodological quality which can be used for the assessment of reading and spelling skills. Reading performance should ideally be assessed with the following instruments: ELFE 1-6, LGVT 6-12, LESEN 6-7, LESEN 8-9, or WLLP-R. The tests to be used for the evaluation of spelling skills are DERET 1-2+, DERET 3-4+, WRT 1+, WRT 2+, WRT 3+, WRT 4+ or HSP 1-10.

  8. Development and testing of a portfolio evaluation scoring tool.

    Science.gov (United States)

    Karlowicz, Karen A

    2010-02-01

    This study focused on development of a portfolio evaluation tool to guide the assignment of valid and reliable scores. Tool development was facilitated by a literature review, guidance of a faculty committee, and validation by content experts. Testing involved a faculty team that evaluated 60 portfolios. Calculation of interrater reliability and a paired-samples t test were used to judge effectiveness. Interrater reliability was 0.78 for overall scores, 0.81 for the seven program outcomes criteria scores, and more than 0.65 for scores assigned by 11 of 13 pairs of raters. There were no significant differences between raters' scores in 10 of 13 pairs. The portfolio evaluation tool demonstrated high reliability and should be tested by other schools using portfolio evaluation.

  9. A test for the Assessment of Pragmatic Abilities and Cognitive Substrates (APACS: Normative data and psychometric properties

    Directory of Open Access Journals (Sweden)

    Giorgio eArcara

    2016-02-01

    Full Text Available The Assessment of Pragmatic Abilities and Cognitive Substrates (APACS test is a new tool to evaluate pragmatic abilities in patients with acquired communicative deficits, ranging from schizophrenia to neurodegenerative diseases. APACS focuses on two main domains, namely discourse and non-literal language, combining traditional tasks with refined linguistic materials in Italian, in a unified framework inspired by language pragmatics. The test includes six tasks (Interview, Description, Narratives, Figurative Language 1, Humor, Figurative Language 2 and three composite scores (Pragmatic Productions, Pragmatic Comprehension, APACS Total. Psychometric properties and normative data were computed on a sample of 119 healthy participants representative of the general population. The analysis revealed acceptable internal consistency and good test-retest reliability for almost every APACS task, suggesting that items are coherent and performance is consistent over time. Factor analysis supports the validity of the test, revealing two factors possibly related to different facets and substrates of the pragmatic competence. Finally, excellent match between APACS items and scores and the pragmatic constructs measured in the test was evidenced by experts’ evaluation of content validity. The performance on APACS showed a general effect of demographic variables, with a negative effect of age and a positive effect of education. The norms were calculated by means of state-of-the-art regression methods. Overall, APACS is a valuable tool for the assessment of pragmatic deficits in verbal communication. The short duration and easiness of administration make the test especially suitable to use in clinical settings. In presenting APACS, we also aim at promoting the inclusion of pragmatics in the assessment practice, as a relevant dimension in defining the patient’s cognitive profile, given its vital role for communication and social interaction in daily life. The

  10. A Test for the Assessment of Pragmatic Abilities and Cognitive Substrates (APACS): Normative Data and Psychometric Properties

    Science.gov (United States)

    Arcara, Giorgio; Bambini, Valentina

    2016-01-01

    The Assessment of Pragmatic Abilities and Cognitive Substrates (APACS) test is a new tool to evaluate pragmatic abilities in clinical populations with acquired communicative deficits, ranging from schizophrenia to neurodegenerative diseases. APACS focuses on two main domains, namely discourse and non-literal language, combining traditional tasks with refined linguistic materials in Italian, in a unified framework inspired by language pragmatics. The test includes six tasks (Interview, Description, Narratives, Figurative Language 1, Humor, Figurative Language 2) and three composite scores (Pragmatic Productions, Pragmatic Comprehension, APACS Total). Psychometric properties and normative data were computed on a sample of 119 healthy participants representative of the general population. The analysis revealed acceptable internal consistency and good test-retest reliability for almost every APACS task, suggesting that items are coherent and performance is consistent over time. Factor analysis supports the validity of the test, revealing two factors possibly related to different facets and substrates of the pragmatic competence. Finally, excellent match between APACS items and scores and the pragmatic constructs measured in the test was evidenced by experts' evaluation of content validity. The performance on APACS showed a general effect of demographic variables, with a negative effect of age and a positive effect of education. The norms were calculated by means of state-of-the-art regression methods. Overall, APACS is a valuable tool for the assessment of pragmatic deficits in verbal communication. The short duration and easiness of administration make the test especially suitable to use in clinical settings. In presenting APACS, we also aim at promoting the inclusion of pragmatics in the assessment practice, as a relevant dimension in defining the patient's cognitive profile, given its vital role for communication and social interaction in daily life. The combined

  11. A Test for the Assessment of Pragmatic Abilities and Cognitive Substrates (APACS): Normative Data and Psychometric Properties.

    Science.gov (United States)

    Arcara, Giorgio; Bambini, Valentina

    2016-01-01

    The Assessment of Pragmatic Abilities and Cognitive Substrates (APACS) test is a new tool to evaluate pragmatic abilities in clinical populations with acquired communicative deficits, ranging from schizophrenia to neurodegenerative diseases. APACS focuses on two main domains, namely discourse and non-literal language, combining traditional tasks with refined linguistic materials in Italian, in a unified framework inspired by language pragmatics. The test includes six tasks (Interview, Description, Narratives, Figurative Language 1, Humor, Figurative Language 2) and three composite scores (Pragmatic Productions, Pragmatic Comprehension, APACS Total). Psychometric properties and normative data were computed on a sample of 119 healthy participants representative of the general population. The analysis revealed acceptable internal consistency and good test-retest reliability for almost every APACS task, suggesting that items are coherent and performance is consistent over time. Factor analysis supports the validity of the test, revealing two factors possibly related to different facets and substrates of the pragmatic competence. Finally, excellent match between APACS items and scores and the pragmatic constructs measured in the test was evidenced by experts' evaluation of content validity. The performance on APACS showed a general effect of demographic variables, with a negative effect of age and a positive effect of education. The norms were calculated by means of state-of-the-art regression methods. Overall, APACS is a valuable tool for the assessment of pragmatic deficits in verbal communication. The short duration and easiness of administration make the test especially suitable to use in clinical settings. In presenting APACS, we also aim at promoting the inclusion of pragmatics in the assessment practice, as a relevant dimension in defining the patient's cognitive profile, given its vital role for communication and social interaction in daily life. The combined

  12. Evaluating the Predictive Validity of Graduate Management Admission Test Scores

    Science.gov (United States)

    Sireci, Stephen G.; Talento-Miller, Eileen

    2006-01-01

    Admissions data and first-year grade point average (GPA) data from 11 graduate management schools were analyzed to evaluate the predictive validity of Graduate Management Admission Test[R] (GMAT[R]) scores and the extent to which predictive validity held across sex and race/ethnicity. The results indicated GMAT verbal and quantitative scores had…

  13. Evaluating the Predictive Validity of Graduate Management Admission Test Scores

    Science.gov (United States)

    Sireci, Stephen G.; Talento-Miller, Eileen

    2006-01-01

    Admissions data and first-year grade point average (GPA) data from 11 graduate management schools were analyzed to evaluate the predictive validity of Graduate Management Admission Test[R] (GMAT[R]) scores and the extent to which predictive validity held across sex and race/ethnicity. The results indicated GMAT verbal and quantitative scores had…

  14. Group differences in the heritability of items and test scores

    NARCIS (Netherlands)

    Wicherts, J.M.; Johnson, W.

    2009-01-01

    It is important to understand potential sources of group differences in the heritability of intelligence test scores. On the basis of a basic item response model we argue that heritabilities which are based on dichotomous item scores normally do not generalize from one sample to the next. If groups

  15. Measuring hospital nurses' well-being at work - psychometric testing of the scale.

    Science.gov (United States)

    Päätalo, Kati; Kyngäs, Helvi

    2016-12-01

    Valid and reliable methods to measure nurses' well-being at work are needed. Well-being at work refers to a positive viewpoint and experience in a work context. The aim of this study is to test the psychometric properties of nurses' well-being at work-scale (NWB-scale). The NWB-scale was tested in a Finnish hospital (N = 233) using statistical methods. The NWB-scale consists of 67 items and 12 factors: patients' experience of high-quality care, assistance and support among nurses, nurses' togetherness and collaboration, satisfying practical organization of work, challenging and meaningful work, freedom to express diverse feelings in a work community, well-conducted everyday nursing, status related to the work itself, fair and supportive leadership, opportunities for professional development, fluent communication with other professionals, and being together with other nurses in an informal way. Content validity was good based on expert evaluations. Construct validity of the scale was also very good based on exploratory factor analysis. Reliability (internal consistency) was assessed by Cronbach's alpha coefficient, which ranged from 0.66 to 0.91. Based on psychometric properties, NWB is a valid and reliable scale to measure hospital nurses' well-being at work. Extrapolation to other professionals and settings needs required modification and careful testing.

  16. Counselor Simulation by Film in Test Score Reporting Interviews

    Science.gov (United States)

    Collins, Tom

    1972-01-01

    The responsible and innovative utilization of media, not only in test score reporting but also in other guidance functions, may assist the counselor in permitting him more time to function with clients in counseling relationships. (Author)

  17. SUBCLINICAL HEPATIC ENCEPHALOPATHY DETECTED THROUGH PSYCHOMETRIC TESTS IN PATIENTS WITH HEPATIC CELLULAR CARCINOMA AND CIRRHOSIS

    Institute of Scientific and Technical Information of China (English)

    李薇; 吴积坰; 李惠芳; 顾海蔚; 范晓方

    2002-01-01

    Objective To study subclinical hepatic encephalopathy (SHE) in patients with hepatic cellular carcinoma by using psychometric tests.Methods 112 cases of hepatic cellular carcinoma complicated with cirrhosis and 65 controls were detected with number connection test (NCT), digital span test (DSP) and digital symbol test (DSY).Results The abnormal rate of NCT, DSY and DSP in patients was 23.2% (26/112), 14.3% (16/112) and 11.6%(13/112) respectively. Patients in the Child-Pugh B group had a significantly higher abnormal rate than that in the Child-Pugh A group.Conclusion SHE has an incidence of 25.0% (28/112) in patients with hepatic cellular carcinoma and cirrhosis in this study population; its occurrence is associated with the severity of liver impairment.

  18. Does parental physical violence reduce children's standardized test score performance?

    Science.gov (United States)

    Peek-Asa, Corinne; Maxwell, Leah; Stromquist, Ann; Whitten, Paul; Limbos, Mary Ann; Merchant, James

    2007-11-01

    Many negative cognitive and behavioral outcomes have been identified among children living in households with parental violence, but few studies have examined academic performance. In a rural population-based cohort, we examine the role of parental violence on standardized test score performance. The cohort included 306 children ages 6 through 17. Parents responded to a health interview that included questions about physical violence. Children's standardized test scores were collected prospectively for 5 years after the parent interview. Hierarchical multivariate models clustering on school, household, and repeated individual test scores and controlling for children's and parent's characteristics were run to predict test score performance. One in five children lived in a household in which parents reported at least one act of physical violence. Children whose parents reported intimate partner violence (IPV) performed an average of 12.2 percentile points lower than children whose parents reported no IPV (95% CI, -19.2--5.2; p Parent-reported IPV led to larger test score reductions for girls than for boys and for children less than 12 years old than for older children. Parental physical violence was common, and children in homes with violence had significantly poorer performance on standardized test scores.

  19. A Monte Carlo Comparison of Three Optimal Test Scoring Procedures,

    Science.gov (United States)

    theory, with conventional scoring. The effect of the COWS procedures is to differentially weight test items as a function of examine ability and the...item characteristics. Low ability examinees are given very low weights on difficult test items , lowering the effects of guessing, and decreasing test error.

  20. Grades and Test Scores: Accounting for Observed Differences.

    Science.gov (United States)

    Willingham, Warren W.; Pollack, Judith M.; Lewis, Charles

    2002-01-01

    Proposed a framework of possible differences between grades and test scores and tested the framework with data on 8,454 high school seniors from the National Education Longitudinal Study. Identified differences and correlations among achievement factors. Differences between grades and tests give these measures complementary strengths in…

  1. Prediction of true test scores from observed item scores and ancillary data.

    Science.gov (United States)

    Haberman, Shelby J; Yao, Lili; Sinharay, Sandip

    2015-05-01

    In many educational tests which involve constructed responses, a traditional test score is obtained by adding together item scores obtained through holistic scoring by trained human raters. For example, this practice was used until 2008 in the case of GRE(®) General Analytical Writing and until 2009 in the case of TOEFL(®) iBT Writing. With use of natural language processing, it is possible to obtain additional information concerning item responses from computer programs such as e-rater(®). In addition, available information relevant to examinee performance may include scores on related tests. We suggest application of standard results from classical test theory to the available data to obtain best linear predictors of true traditional test scores. In performing such analysis, we require estimation of variances and covariances of measurement errors, a task which can be quite difficult in the case of tests with limited numbers of items and with multiple measurements per item. As a consequence, a new estimation method is suggested based on samples of examinees who have taken an assessment more than once. Such samples are typically not random samples of the general population of examinees, so that we apply statistical adjustment methods to obtain the needed estimated variances and covariances of measurement errors. To examine practical implications of the suggested methods of analysis, applications are made to GRE General Analytical Writing and TOEFL iBT Writing. Results obtained indicate that substantial improvements are possible both in terms of reliability of scoring and in terms of assessment reliability.

  2. Construction and psychometric testing of the EMPATHIC questionnaire measuring parent satisfaction in the pediatric intensive care unit

    NARCIS (Netherlands)

    Latour, Jos M.; van Goudoever, Johannes B.; Duivenvoorden, Hugo J.; Albers, Marcel J. I. J.; van Dam, Nicolette A. M.; Dullaart, Eugenie; van Heerde, Marc; de Neef, Marjorie; Verlaat, Carin W. M.; van Vught, Elise M.; Hazelzet, Jan A.

    2011-01-01

    To construct and test the reliability and validity of the EMpowerment of PArents in THe Intensive Care (EMPATHIC) questionnaire measuring parent satisfaction in the pediatric intensive care unit (PICU). Structured development and psychometric testing of a parent satisfaction-with-care instrument wit

  3. Construction and psychometric testing of the EMPATHIC questionnaire measuring parent satisfaction in the pediatric intensive care unit

    NARCIS (Netherlands)

    J.M. Latour (Jos); J.B. van Goudoever (Hans); H.J. Duivenvoorden (Hugo); M.J.I.J. Albers (Marcel); N.A.M. van Dam (Nicolette); E. Dullaart (Eugenie); M. van Heerde (Marc); M. de Neef (Marjorie); C.W.M. Verlaat (Carin); E.M. van Vught (Elise); J.A. Hazelzet (Jan)

    2011-01-01

    textabstractAbstract PURPOSE: To construct and test the reliability and validity of the EMpowerment of PArents in THe Intensive Care (EMPATHIC) questionnaire measuring parent satisfaction in the pediatric intensive care unit (PICU). METHODS: Structured development and psychometric testing of a paren

  4. Psychometric evaluation of the Sinonasal Outcome Test-16 and activity impairment assessment in acute bacterial sinusitis.

    Science.gov (United States)

    Quadri, Nuz; Lloyd, Andrew; Keating, Karen N; Nafees, Beenish; Piccirillo, Jay; Wild, Diane

    2013-07-01

    To validate the Sinonasal Outcome Test-16 and Activity Impairment Assessment in patients with acute bacterial sinusitis. Data were used from a phase III clinical trial designed to evaluate the efficacy and safety of moxifloxacin 400 mg once daily for 5 consecutive days in the treatment of acute bacterial sinusitis. The psychometric properties and factor structure of the 2 measures were assessed. Participants were given the measures to self-complete using either a telephone voice response system or a paper-and-pencil format. Three hundred seventy-four patients with acute bacterial sinusitis were used in the analysis. Patients received either a placebo or 400 mg moxifloxacin once daily. Patients were then reviewed at test of cure and follow-up. All analyses were conducted on a combined sample of placebo and active treatment patients. The Sinonasal Outcome Test-16 was associated with minimal missing data at baseline but a higher proportion by test of cure. There was no evidence of floor or ceiling effects and no significant skew. The Activity Impairment Assessment also had low missing data at baseline and no obvious floor or ceiling effects, but the data were not normally distributed. Both measures had good internal consistency. Convergent and divergent validity as well as sensitivity and the minimally important difference are also reported. The measures both have good psychometric properties and are suitable for use with patients with acute bacterial sinusitis. Both instruments are sensitive. The minimal important difference estimates for the Sinonasal Outcome Test-16 are quite high but are similar to estimates reported previously.

  5. High Test Scores: The Wrong Road to National Economic Success

    Science.gov (United States)

    Baker, Keith

    2011-01-01

    A widely held view is that good schools are essential to a nation's international economic success and that high test scores on international tests of academic skills and knowledge indicate how good a nation's schools are. The widespread belief that good schools are an important contributor to a nation's economic success in the world is supported…

  6. Effort Analysis: Individual Score Validation of Achievement Test Data

    Science.gov (United States)

    Wise, Steven L.

    2015-01-01

    Whenever the purpose of measurement is to inform an inference about a student's achievement level, it is important that we be able to trust that the student's test score accurately reflects what that student knows and can do. Such trust requires the assumption that a student's test event is not unduly influenced by construct-irrelevant factors…

  7. High Test Scores: The Wrong Road to National Economic Success

    Science.gov (United States)

    Baker, Keith

    2011-01-01

    A widely held view is that good schools are essential to a nation's international economic success and that high test scores on international tests of academic skills and knowledge indicate how good a nation's schools are. The widespread belief that good schools are an important contributor to a nation's economic success in the world is supported…

  8. A Human Capital Model of Educational Test Scores

    DEFF Research Database (Denmark)

    McIntosh, James; D. Munk, Martin

    Latent class Poisson count models are used to analyze a sample of Danish test score results from a cohort of individuals born in 1954-55 and tested in 1968. The procedure takes account of unobservable effects as well as excessive zeros in the data. The bulk of unobservable effects are uncorrelate...

  9. Accountancy, teaching methods, sex, and American College Test scores.

    Science.gov (United States)

    Heritage, J; Harper, B S; Harper, J P

    1990-10-01

    This study examines the significance of sex, methodology, academic preparation, and age as related to development of judgmental and problem-solving skills. Sex, American College Test (ACT) Mathematics scores, Composite ACT scores, grades in course work, grade point average (GPA), and age were used in studying the effects of teaching method on 96 students' ability to analyze data in financial statements. Results reflect positively on accounting students compared to the general college population and the women students in particular.

  10. A psychometric comparison of the Internet Addiction Test, the Internet-Related Problem Scale, and self-diagnosis.

    Science.gov (United States)

    Widyanto, Laura; Griffiths, Mark D; Brunsden, Vivienne

    2011-03-01

    One of the more prominent issues in the field of Internet addiction is the validity of the instrument used to assess users' level of Internet involvement. Many of the instruments used to assess Internet addiction have high face validity but have yet to be tested psychometrically. The aim of this study is to compare two of the most used Internet addiction research measures, the Internet Addiction Test (IAT) and the Internet-Related Problem Scale (IRPS), along with a self-diagnostic question simply asking Internet users if they thought they were addicted to the Internet. A total of 225 Internet users participated in the study (69 males and 156 females). Participants who defined themselves as Internet addicts had higher scores on both the IAT and IRPS, and the three different Internet addiction measures were strongly correlated to each other. For the IAT, factor analysis generated three factors (emotional/psychological conflict; time management issues; mood modification) explaining 56.3% of the variance. For the IRPS, factor analysis generated four factors (negative effects of Internet use; mood modification; loss of control; increased Internet use) explaining 60.2% of the variance. The implications for these findings are discussed.

  11. Cutoff Scores for MMPI-2 and MMPI-2-RF Cognitive-Somatic Validity Scales for Psychometrically Defined Malingering Groups in a Military Sample.

    Science.gov (United States)

    Jones, Alvin

    2016-07-12

    This research examined cutoff scores for MMPI-2 and MMPI-2-RF validity scales specifically developed to assess non-credible reporting of cognitive and/or somatic symptoms. The validity scales examined included the Response Bias Scale (RBS), the Symptom Validity Scales (FBS, FBS-r), Infrequent Somatic Responses scale (Fs), and the Henry-Heilbronner Indexes (HHI, HHI-r). Cutoffs were developed by comparing a psychometrically defined non-malingering group with three psychometrically defined malingering groups (probable, probable to definite, and definite malingering) and a group that combined all malingering groups. The participants in this research were drawn from a military sample consisting largely of patients with traumatic brain injury (mostly mild traumatic brain injury). Specificities for cutoffs of at least 0.90 are provided. Sensitivities, predictive values, and likelihood ratios are also provided. RBS had the largest mean effect size (d) when the malingering groups were compared to the non-malingering group (d range = 1.23-1.58). Published by Oxford University Press 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.

  12. What Do Test Scores in Texas Tell Us?

    Directory of Open Access Journals (Sweden)

    Stephen P. Klein et al

    2000-10-01

    Full Text Available We examine the results on the Texas Assessment of Academic Skills (TAAS, the highest-profile state testing program and one that has recorded extraordinary recent gains in math and reading scores. To investigate whether the dramatic math and reading gains on the TAAS represent actual academic progress, we have compared these gains to score changes in Texas on another test, the National Assessment of Educational Progress (NAEP. Texas students did improve significantly more on a fourth-grade NAEP math test than their counterparts nationally. But, the size of this gain was smaller than their gains on TAAS and was not present on the eighth-grade math test. The stark differences between the stories told by NAEP and TAAS are especially striking when it comes to the gap in average scores between whites and students of color. According to the NAEP results, that gap in Texas is not only very large but increasing slightly. According to TAAS scores, the gap is much smaller and decreasing greatly. Many schools are devoting a great deal of class time to highly specific TAAS preparation. While this preparation may improve TAAS scores, it may not help students develop necessary reading and math skills. Schools with relatively large percentages of minority and poor students may be doing this more than other schools. We raise serious questions about the validity of those gains, and caution against the danger of making decisions to sanction or reward students, teachers and schools on the basis of test scores that may be inflated or misleading. Finally, we suggest some steps that states can take to increase the likelihood that their test results merit public confidence and provide a sound basis for educational policy.

  13. Propriedades psicométricas apresentadas em manuais de testes de inteligência Psychometric parameters in intelligence test directions

    Directory of Open Access Journals (Sweden)

    Ana Paula Porto Noronha

    2003-06-01

    Full Text Available A pesquisa teve como objetivo verificar quais os parâmetros psicométricos apresentados nos manuais de 19 instrumentos de avaliação da inteligência. Os elementos avaliados nos instrumentos foram: análise de itens, padronização, validade e precisão. Os resultados encontrados mostraram que, dos 19 testes avaliados, 89,5% apresentaram estudos de padronização, sendo que o procedimento mais utilizado na escolha dos sujeitos foi o não aleatório (62,2% dos testes. No que se refere à validade, a de construto foi a mais freqüente dentre os testes (94,7%. Observou-se que todos os instrumentos apresentaram verifica��ão da precisão, sendo o método de consistência interna o mais aplicado (78,9%. Conclui-se que, embora os autores concordem que todos os testes devam realizar estudos de verificação dos parâmetros psicométricos e devam possuir normas regionais, tal prática ainda não se encontra totalmente difundida na avaliação psicológica brasileira,This research aimed to verify the psychometric parameters presented in manuals of 19 intelligence tests. The psychometric properties included in the analysis were: item analysis, validity, reliability, and norms studies. The results indicated that 89.5% of the 19 tests presented norming studies. The procedure of sample selection was mostly non-random (62.2% of the tests. Construct validity was the most frequent method used among the studies (94.7%. All tests presented reliability studies, most of them using internal consistency coefficient (78.9%. It is concluded that although the authors agree that all tests need studies to verify psychometric parameters and studies to obtain regional norms this action isn’t divulged totally yet in the Brazilian psychological assessment.

  14. Psychometric testing of an instrument measuring core competencies of nursing students: an application of Mokken scaling.

    Science.gov (United States)

    Perng, Shoa-Jen; Watson, Roger

    2013-01-01

    Assessing the core competencies of nursing students provides information about students' learning outcomes for educational evaluation and improvement. The aim of this study was to develop the Nursing Students Core Competencies scale to measure 8 core competencies of nursing students in Taiwan. The study employed factor analysis and Mokken scaling analysis for psychometric testing of this instrument between a group of nursing graduates and their evaluators. The results indicated that the Nursing Students Core Competencies scale has demonstrated evidence of internal consistency, structural validity, unidimensionality, and a hierarchy of items for students' self-assessment and instructor's rating. The use of Mokken scaling analysis extends the knowledge of developing competence assessment tools; it can be used to reveal the domains or items of competency nursing students perceive that are easy or difficult, providing information for curricular design.

  15. A Job-Seeking Self-Efficacy Scale for People with Physical Disabilities: Preliminary Development and Psychometric Testing.

    Science.gov (United States)

    Barlow, Julie; Wright, Chris; Cullen, Lesley

    2002-01-01

    Study sought to develop and conduct preliminary testing of the psychometric properties of a job-seeking self-efficacy (JSS) scale that reflected the experiences of people with physical disabilities. Greater job seeking self-efficacy and perceived ability to manage disability at interview were associated with more positive psychological well-being.…

  16. Investigation of Psychometric Properties of the Test for Creative Thinking-Drawing Production: Evidence from Study in Latvia

    Science.gov (United States)

    Kalis, Emils; Roke, Liga; Krumina, Indra

    2016-01-01

    The Test for Creative Thinking-Drawing Production (TCT-DP) is designed as an effective drawing-based instrument for measuring creative potential. Many studies report adaptation efforts in other cultures pointing out good psychometric properties of the instrument nonetheless revealing also some trouble spots. The present study includes adaptation…

  17. Movement skill assessment in children with profound multiple disabilities : a psychometric analysis of the Top Down Motor Milestone Test

    NARCIS (Netherlands)

    van der Putten, A; Vlaskamp, C; Reynders, K; Nakken, H

    Objective: To analyse the psychometric properties of the Top Down Motor Milestone Test (TDMMT), an internationally used instrument in the planning and evaluation of movement-oriented interventions. Setting: Centres for special education in the Netherlands. Subjects: Children with profound multiple

  18. America's Mediocre Test Scores: Education Crisis or Poverty Crisis?

    Science.gov (United States)

    Petrilli, Michael J.; Wright, Brandon L.

    2016-01-01

    At a time when the national conversation is focused on lagging upward mobility, it is no surprise that many educators point to poverty as the explanation for mediocre test scores among U.S. students compared to those of students in other countries. If American teachers in struggling U.S. schools taught in Finland, says Finnish educator Pasi…

  19. A Latent Class Approach to Estimating Test-Score Reliability

    Science.gov (United States)

    van der Ark, L. Andries; van der Palm, Daniel W.; Sijtsma, Klaas

    2011-01-01

    This study presents a general framework for single-administration reliability methods, such as Cronbach's alpha, Guttman's lambda-2, and method MS. This general framework was used to derive a new approach to estimating test-score reliability by means of the unrestricted latent class model. This new approach is the latent class reliability…

  20. Commentary on "Validating the Interpretations and Uses of Test Scores"

    Science.gov (United States)

    Brennan, Robert L.

    2013-01-01

    Kane's paper "Validating the Interpretations and Uses of Test Scores" is the most complete and clearest discussion yet available of the argument-based approach to validation. At its most basic level, validation as formulated by Kane is fundamentally a simply-stated two-step enterprise: (1) specify the claims inherent in a particular interpretation…

  1. The Correlational Relationship between Homeschooling Demographics and High Test Scores.

    Science.gov (United States)

    Burns, Johnna

    Homeschooling, one of the fastest growing educational alternatives, is enjoying increasing respect from educators and parents alike. This is partly because homeschooling children score as well and often better on standardized tests than their publicly schooled counterparts. However, the vast majority of homeschooled students come from the…

  2. America's Mediocre Test Scores: Education Crisis or Poverty Crisis?

    Science.gov (United States)

    Petrilli, Michael J.; Wright, Brandon L.

    2016-01-01

    At a time when the national conversation is focused on lagging upward mobility, it is no surprise that many educators point to poverty as the explanation for mediocre test scores among U.S. students compared to those of students in other countries. If American teachers in struggling U.S. schools taught in Finland, says Finnish educator Pasi…

  3. Scoring Rod-and-Frame Tests: Quantitative and Qualitative Considerations.

    Science.gov (United States)

    Haller, Otto; Edgington, Eugene S.

    1982-01-01

    Current scoring procedures depend on unrealistic assumptions about subjects' performance on the rod-and-frame test. A procedure is presented which corrects for constant error, is sensitive to response strategy and consistency, and examines qualitative and quantitative aspects of performance and individual differences in laterality bias as defined…

  4. The Weighted Airman Promotion System: Standardizing Test Scores

    Science.gov (United States)

    2008-01-01

    42 Approaches to Standardizing PFE /SKT Scores...NPS non–prior service OSD Office of the Secretary of Defense OSI Office of Special Investigations PFE Promotion Fitness Exam RAW Retrieval Application...both the Promotion Fitness Exam ( PFE ) and Specialty Knowledge Test (SKT). Using a percentile ranking was one way to standardize because every AFSC

  5. Factors influencing success in quality-improvement collaboratives: development and psychometric testing of an instrument

    Directory of Open Access Journals (Sweden)

    Grol Richard PTM

    2010-10-01

    Full Text Available Abstract Background To increase the effectiveness of quality-improvement collaboratives (QICs, it is important to explore factors that potentially influence their outcomes. For this purpose, we have developed and tested the psychometric properties of an instrument that aims to identify the features that may enhance the quality and impact of collaborative quality-improvement approaches. The instrument can be used as a measurement instrument to retrospectively collect information about perceived determinants of success. In addition, it can be prospectively applied as a checklist to guide initiators, facilitators, and participants of QICs, with information about how to perform or participate in a collaborative with theoretically optimal chances of success. Such information can be used to improve collaboratives. Methods We developed an instrument with content validity based on literature and the opinions of QIC experts. We collected data from 144 healthcare professionals in 44 multidisciplinary improvement teams participating in two QICs and used exploratory factor analysis to assess the construct validity. We used Cronbach's alpha to ascertain the internal consistency. Results The 50-item instrument we developed reflected expert-opinion-based determinants of success in a QIC. We deleted nine items after item reduction. On the basis of the factor analysis results, one item was dropped, which resulted in a 40-item questionnaire. Exploratory factor analysis showed that a three-factor model provided the best fit. The components were labeled 'sufficient expert team support', 'effective multidisciplinary teamwork', and 'helpful collaborative processes'. Internal consistency reliability was excellent (alphas between .85 and .89. Conclusions This newly developed instrument seems a promising tool for providing healthcare workers and policy makers with useful information about determinants of success in QICs. The psychometric properties of the instrument are

  6. Development and psychometric testing of a tool to measure missed nursing care.

    Science.gov (United States)

    Kalisch, Beatrice J; Williams, Reg Arthur

    2009-05-01

    In a qualitative study, medical-surgical and intensive care staff nurses reported that they did not complete a significant amount of nursing care on regular basis. Thus, it was determined that a quantitative tool was needed to measure the amount and type of missed nursing care and the reasons for missing care. The authors report the results of a psychometric evaluation of this tool (The Missed Nursing Care Survey [MISSCARE Survey]) to measure missed nursing care (part A) and the reasons for missed nursing care (part B). Two studies were conducted--study 1 (n = 459) and study 2 (n = 639). A sample of staff nurses was drawn from 35 medical-surgical, rehabilitation, and intensive care patient units in 4 acute care hospitals. Acceptability was high, with 85% of the respondents answering all items on the survey. Factor analysis with Varimax rotation resulted in a 3-factor solution for part 2 (communication, labor resources, and material resources). Cronbach alpha values ranged from 0.64 to 0.86. Confirmatory factor analysis demonstrated a good fit of the data. Using a contrasting group approach, a comparison of nurse's perceptions of missed care on intensive care units versus rehabilitation units resulted, as hypothesized, in a significantly lower amount of missed care on intensive care units. Pearson correlation coefficient on a test-retest of the same subjects yielded a value of 0.87 on part A and 0.86 on part B. Although further validation of the MISSCARE Survey is needed, current evidence demonstrates that the tool meets stringent psychometric standards.

  7. A questionnaire measuring staff perceptions of Lean adoption in healthcare: development and psychometric testing.

    Science.gov (United States)

    Kaltenbrunner, Monica; Bengtsson, Lars; Mathiassen, Svend Erik; Engström, Maria

    2017-03-24

    During the past decade, the concept of Lean has spread rapidly within the healthcare sector, but there is a lack of instruments that can measure staff's perceptions of Lean adoption. Thus, the aim of the present study was to develop a questionnaire measuring Lean in healthcare, based on Liker's description of Lean, by adapting an existing instrument developed for the service sector. A mixed-method design was used. Initially, items from the service sector instrument were categorized according to Liker's 14 principles describing Lean within four domains: philosophy, processes, people and partners and problem-solving. Items were lacking for three of Liker's principles and were therefore developed de novo. Think-aloud interviews were conducted with 12 healthcare staff from different professions to contextualize and examine the face validity of the questionnaire prototype. Thereafter, the adjusted questionnaire's psychometric properties were assessed on the basis of a cross-sectional survey among 386 staff working in primary care. The think-aloud interviews led to adjustments in the questionnaire to better suit a healthcare context, and the number of items was reduced. Confirmatory factor analysis of the adjusted questionnaire showed a generally acceptable correspondence with Liker's description of Lean. Internal consistency, measured using Cronbach's alpha, for the factors in Liker's description of Lean was 0.60 for the factor people and partners, and over 0.70 for the three other factors. Test-retest reliability measured by the intra-class correlation coefficient ranged from 0.77 to 0.88 for the four factors. We designed a questionnaire capturing staff's perceptions of Lean adoption in healthcare on the basis of Liker's description. This Lean in Healthcare Questionnaire (LiHcQ) showed generally acceptable psychometric properties, which supports its usability for measuring Lean adoption in healthcare. We suggest that further research focus on verifying the usability of

  8. Simplifying multivariate survival analysis using global score test methodology

    Science.gov (United States)

    Zain, Zakiyah; Aziz, Nazrina; Ahmad, Yuhaniz

    2015-12-01

    In clinical trials, the main purpose is often to compare efficacy between experimental and control treatments. Treatment comparisons often involve multiple endpoints, and this situation further complicates the analysis of survival data. In the case of tumor patients, endpoints concerning survival times include: times from tumor removal until the first, the second and the third tumor recurrences, and time to death. For each patient, these endpoints are correlated, and the estimation of the correlation between two score statistics is fundamental in derivation of overall treatment advantage. In this paper, the bivariate survival analysis method using the global score test methodology is extended to multivariate setting.

  9. Psychometric evaluation of a motor control test battery of the craniofacial region.

    Science.gov (United States)

    von Piekartz, H; Stotz, E; Both, A; Bahn, G; Armijo-Olivo, S; Ballenberger, N

    2017-09-11

    The primary objective of this study was to determine the structural and known-group validity as well as the inter-rater reliability of a test battery to evaluate the motor control of the craniofacial region. Seventy volunteers without TMD and 25 subjects with TMD (Axes I) per the DC/TMD were asked to execute a test battery consisting of eight tests. The tests were video-taped in the same sequence in a standardised manner. Two experienced physical therapists participated in this study as blinded assessors. We used exploratory factor analysis to identify the underlying component structure of the eight tests. Internal consistency (Cronbach's α), inter-rater reliability (intra-class correlation coefficient) and construct validity (ie, hypothesis testing-known-group validity) (receiver operating curves) were also explored for the test battery. The structural validity showed the presence of one factor underlying the construct of the test battery. The internal consistency was excellent (0.90) as well as the inter-rater reliability. All values of reliability were close to 0.9 or above indicating very high inter-rater reliability. The area under the curve (AUC) was 0.93 for rater 1 and 0.94 for rater two, respectively, indicating excellent discrimination between subjects with TMD and healthy controls. The results of the present study support the psychometric properties of test battery to measure motor control of the craniofacial region when evaluated through videotaping. This test battery could be used to differentiate between healthy subjects and subjects with musculoskeletal impairments in the cervical and oro-facial regions. In addition, this test battery could be used to assess the effectiveness of management strategies in the craniofacial region. © 2017 John Wiley & Sons Ltd.

  10. [Development and clinical testing of the Russian version of the Acute Cystitis Symptom Score - ACSS].

    Science.gov (United States)

    Alidjanov, J F; Abdufattaev, U A; Makhmudov, D Kh; Mirkhamidov, D Kh; Khadzhikhanov, F A; Azgamov, A V; Pilatz, A; Naber, K G; Wagenlehner, F M; Akilov, F A

    2014-01-01

    The Acute Cystitis Symptom Score - ACSS was originally developed in the Uzbek language and has demonstrated high reliability and validity. The study was aimed to develop a Russian version of the ACSS questionnaire and evaluate its psychometric properties. Translation and adaptation of the ACSS questionnaire containing 18 questions, 6 of them - for the typical symptoms of acute cystitis (AC), 4 - for the differential diagnosis; 3 - for the quality of life, and 5 - for the conditions that may affect the choice of treatment, were performed according to the recommendations developed by the Mapi Research Institute. Study involved 83 Russian-speaking women (mean age, 35.6 ±13.7 years); 38 (45.8%) patients were in the main group (patients with AC), and 45 (54.2%) - in the control group (without AC). Medical examination and appropriate treatment of the respondents were conducted in accordance with approved standards. After completing the course of therapy, 19 (50%) patients of the main group came for the control examination. There was statistically significant difference in the scores obtained in the two groups. Score profiles positively correlated with the results of laboratory tests (rho = 0.26-0.48). Cronbach's alpha for the Russian version of the questionnaire was 0.86 (95% CI, 0.81-0.91), area under the curve in the ROC analysis was 0.96. The results of testing the Russian version correspond to those of the original version. The Russian version of the ACSS questionnaire has high. reliability and validity, and can be recommended for clinical research and diagnosis of primary AC, and dynamic monitoring of the effectiveness of the treatment of the Russian-speaking population of patients.

  11. The stroke caregiver unmet resource needs scale: development and psychometric testing.

    Science.gov (United States)

    King, Rosemarie B; Hartke, Robert J; Lee, Jungwha; Raad, Jason

    2013-12-01

    The purposes of this study were to develop and validate a measure of unmet resource needs of the caregivers of survivors of stroke and to describe the caregivers' unmet needs during 1 year. A longitudinal, descriptive design was used to test the reliability and validity of the Unmet Resource Needs (URN) measure. Item development was based on literature review and preliminary study findings. A stress and coping conceptual model framed the hypotheses. Psychometric testing was based on 6-month postdischarge data (n = 166). Content and structural construct validity, internal consistency reliability through 1 year, and concurrent validity were tested. Change in URN over time was examined. Content validity was supported by floor and ceiling effects less than 5%. Principal axis factoring yielded a 12-item, two-factor solution reflecting general and technology unmet needs. Internal consistency reliability was satisfactory for the total scale and subscales at all times, excepting the baseline three-item technology scale (α = .56). Concurrent validity was supported by significant correlations with model constructs (threat, positive problem solving, depression, preparedness; p < .01) in the expected direction. Functional status and resource use were not associated with the URN. Repeated measures analysis of variance (n = 123) indicated a significant decrease in unmet needs from baseline to 3, 6, and 12 months postdischarge (p < .001). Nevertheless, 42% reported one or more unmet needs at year 1. Assessment and counseling on unmet needs is indicated throughout the caregiving trajectory to reduce negative outcomes.

  12. Evaluation of the psychometric properties of the Spanish version of the Denver Developmental Screening Test II.

    Science.gov (United States)

    De-Andrés-Beltrán, Beatriz; Rodríguez-Fernández, Ángel L; Güeita-Rodríguez, Javier; Lambeck, Johan

    2015-03-01

    The objective of this study was to examine the psychometric properties of the Spanish version of the Denver Developmental Screening Test II in a population of Spanish children. Two hundred children ranging from 9 month to 6 years were grouped into two samples (healthy/with psychomotor delay) and screened in order to check whether they suffered from psychomotor delay. Children from three Early Intervention Centres and three schools participated in this study. Criterion validity was calculated by the method of extreme groups, comparing healthy children to those with development delay. Interobserver and intraobserver reliability were calculated using Cohen Kappa coefficient, and internal consistency was calculated via the Kuder-Richardson coefficient. The scale demonstrated 89% sensitivity, 92% specificity, a positive predicted value of 91% and a negative predicted value of 89%, whereas the positive and negative likelihood ratio was 11.12 and 0.12, respectively. Intraobserver reliability ranged from 0.662 to 1, and interobserver reliability ranged from 0.886 to 1. The Kuder-Richardson coefficient values ranged from 87.5 to 97.6%. The Spanish version of the Denver Developmental Screening Test II was found to have a good criterion validity, reliability and internal consistency and is a suitable screening test for use in a population of Spanish children.

  13. Neuropsychological assessment with the Visual Gestalt Test: psychometric properties and differential diagnostic probabilities.

    Science.gov (United States)

    la Cour, Peter; Andersen, Ruth

    2006-02-01

    The Visual Gestalt Test is a neuropsychological instrument developed for evaluation of learning and memory of visuo-spatial material. A revised strategy of scoring has motivated the present study, where data from 153 normal persons, 99 epilepsy patients, and 24 depressed patients are presented and compared. The Visual Gestalt Test is observed to discriminate between normal and diagnosed groups in several ways. Additionally it is found to discriminate between depressed and brain damage subgroups of patients. Data are presented in order to supplement previously published ways of scoring and norms. Practical guidelines for the clinical applications of the test are suggested as perspectives.

  14. Psychometric Characteristics and Appropriate Use of the Gesell School Readiness Screening Test.

    Science.gov (United States)

    Lichtenstein, Robert

    1990-01-01

    Investigated the adequacy of the Gesell School Readiness Screening Test (GSRST) to gauge 46 kindergartners' readiness. Found agreement between GSRST and teacher assessments of student readiness. Test-retest and interrater reliability were below acceptable levels, and lower than figures yielded by a quantitative scoring method. Concluded that GSRST…

  15. Psychometric testing of the Self-Care of Heart Failure Index Version 6.2.

    Science.gov (United States)

    Vellone, Ercole; Riegel, Barbara; Cocchieri, Antonello; Barbaranelli, Claudio; D'Agostino, Fabio; Antonetti, Giovanni; Glaser, Dale; Alvaro, Rosaria

    2013-10-01

    The Self-Care of Heart Failure Index Version 6.2 (SCHFI v.6.2) is widely used, but its psychometric profile is still questioned. In a sample of 659 heart failure patients from Italy, we performed confirmatory factor analysis (CFA) to test the original construct of the SCHFI v.6.2 scales (Self-Care Maintenance, Self-Care Management, and Self-Care Confidence), with limited success. We then used exploratory factor analysis to determine the presence of separate scale dimensions, followed by CFA in a separate sub-sample. Construct validity of individual scales showed excellent fit indices: CFI = .92, RMSEA = .05 for the Self-Care Maintenance Scale; CFI = .95, RMSEA = .07 for the Self-Care Management Scale; CFI = .99, RMSEA = .02 for the Self-Care Confidence scale. Contrasting groups validity, internal consistency, and test-retest reliability were supported as well. This evidence provides a new understanding of the structure of the SCHFI v.6.2 and supports its use in clinical practice and research.

  16. Development and psychometric testing of the active aging scale for Thai adults

    Directory of Open Access Journals (Sweden)

    Thanakwang K

    2014-07-01

    Full Text Available Kattika Thanakwang,1,2 Sang-arun Isaramalai,2,3 Urai Hatthakit3 1Institute of Nursing, Suranaree University of Technology, Nakhon Ratchasima, Thailand; 2Research Center for Caring System of Thai Elderly, 3Faculty of Nursing, Prince of Songkla University, Songkla, Thailand Background: Active aging is central to enhancing the quality of life for older adults, but its conceptualization is not often made explicit for Asian elderly people. Little is known about active aging in older Thai adults, and there has been no development of scales to measure the expression of active aging attributes. Purpose: The aim of this study was to develop a culturally relevant composite scale of active aging for Thai adults (AAS-Thai and to evaluate its reliability and validity. Methods: Eight steps of scale development were followed: 1 using focus groups and in-depth interviews, 2 gathering input from existing studies, 3 developing preliminary quantitative measures, 4 reviewing for content validity by an expert panel, 5 conducting cognitive interviews, 6 pilot testing, 7 performing a nationwide survey, and 8 testing psychometric properties. In a nationwide survey, 500 subjects were randomly recruited using a stratified sampling technique. Statistical analyses included exploratory factor analysis, item analysis, and measures of internal consistency, concurrent validity, and test–retest reliability. Results: Principal component factor analysis with varimax rotation resulted in a final 36-item scale consisting of seven factors of active aging: 1 being self-reliant, 2 being actively engaged with society, 3 developing spiritual wisdom, 4 building up financial security, 5 maintaining a healthy lifestyle, 6 engaging in active learning, and 7 strengthening family ties to ensure care in later life. These factors explained 69% of the total variance. Cronbach’s alpha coefficient for the overall AAS-Thai was 0.95 and varied between 0.81 and 0.91 for the seven subscales

  17. Basic competence in intensive and critical care nursing: development and psychometric testing of a competence scale.

    Science.gov (United States)

    Lakanmaa, Riitta-Liisa; Suominen, Tarja; Perttilä, Juha; Ritmala-Castrén, Marita; Vahlberg, Tero; Leino-Kilpi, Helena

    2014-03-01

    To develop a scale to assess basic competence in intensive and critical care nursing. In this study, basic competence denotes preliminary competence to practice in an intensive care unit. There is a need for competence assessment scales in intensive care nursing practice and education. The nursing care performed in the intensive care unit is special by its nature and needs to be assessed as such. At this moment, however, there is no tested, reliable and valid scale in this field. A multi-phase, multi-method development and psychometric testing of the scale was conducted. The scale was developed in three phases. First, following a literature review and Delphi study, the items were created. Second, the scale was pilot tested twice by nursing students (n1 = 18, n2 = 56) and intensive care nurses (n1 = 12, n2 = 54), and revisions were made. Third, reliability and construct validity were tested by graduating nursing students (n = 139) and intensive care nurses (n = 431). The Intensive and Critical Care Nursing Competence Scale (ICCN-CS-1) is a self-assessment test consisting of 144 items. Basic competence is divided into patient-related clinical competence and general professional competence. In addition, basic competence is comprised of knowledge base, skill base, attitude and value base and experience base. ICCN-CS-1 is a reliable and tolerably valid scale. The ICCN-CS-1 is a promising scale for use among nursing students and nurses. Future research is needed to evaluate its construct validity further and to assess its suitability for completion during intensive care unit's orientation programmes and nursing students' clinical practice in an intensive care unit. The ICCN-CS-1 can be used for basic competence assessment in professional development discussions in intensive care units, in mentor evaluation situations during nursing students' clinical practice and in intensive care nursing education. © 2013 Blackwell Publishing Ltd.

  18. Developing a nanoparticle test for prostate cancer scoring

    Directory of Open Access Journals (Sweden)

    Huo Qun

    2012-03-01

    Full Text Available Abstract Background Over-diagnosis and treatment of prostate cancer has been a major problem in prostate cancer care and management. Currently the most relevant prognostic factor to predict a patient's risk of death due to prostate cancer is the Gleason score of the biopsied tissue samples. However, pathological analysis is subjective, and the Gleason score is only a qualitative estimate of the cancer malignancy. Molecular biomarkers and diagnostic tests that can accurately predict prostate tumor aggressiveness are rather limited. Method We report here for the first time the development of a nanoparticle test that not only can distinguish prostate cancer from normal and benign conditions, but also has the potential to predict the aggressiveness of prostate cancer quantitatively. To conduct the test, a prostate tissue lysate sample is spiked into a blood serum or human IgG solution and the spiked sample is incubated with a citrate-protected gold nanoparticle solution. IgG is known to adsorb to citrate-protected gold nanoparticles to form a "protein corona" on the nanoparticle surface. From this study, we discovered that certain tumor-specific molecules can interact with IgG and change the adsorption behavior of IgG to the gold nanoparticles. This change is reflected in the nanoparticle size of the assay solution and detected by a dynamic light scattering technique. Assay data were analyzed by one-way ANOVA for multiple variant analysis, and using the Student t-test or nonparametric Mann-Whitney U-tests for pairwise analyses. Results An inverse, quantitative correlation of the average nanoparticle size of the assay solution with tumor status and histological diagnostic grading was observed from the nanoparticle test. IgG solutions spiked with prostate tumor tissue exhibit significantly smaller nanoparticle size than the solutions spiked with normal and benign tissues. The higher grade the tumor is, the smaller the nanoparticle size is. The test

  19. Development and psychometric testing of the Nurse Practitioner Primary Care Organizational Climate Questionnaire.

    Science.gov (United States)

    Poghosyan, Lusine; Nannini, Angela; Finkelstein, Stacey R; Mason, Emanuel; Shaffer, Jonathan A

    2013-01-01

    Policy makers and healthcare organizations are calling for expansion of the nurse practitioner (NP) workforce in primary care settings to assure timely access and high-quality care for the American public. However, many barriers, including those at the organizational level, exist that may undermine NP workforce expansion and their optimal utilization in primary care. This study developed a new NP-specific survey instrument, Nurse Practitioner Primary Care Organizational Climate Questionnaire (NP-PCOCQ), to measure organizational climate in primary care settings and conducted its psychometric testing. Using instrument development design, the organizational climate domain pertinent for primary care NPs was identified. Items were generated from the evidence and qualitative data. Face and content validity were established through two expert meetings. Content validity index was computed. The 86-item pool was reduced to 55 items, which was pilot tested with 81 NPs using mailed surveys and then field-tested with 278 NPs in New York State. SPSS 18 and Mplus software were used for item analysis, reliability testing, and maximum likelihood exploratory factor analysis. Nurse Practitioner Primary Care Organizational Climate Questionnaire had face and content validity. The content validity index was .90. Twenty-nine items loaded on four subscale factors: professional visibility, NP-administration relations, NP-physician relations, and independent practice and support. The subscales had high internal consistency reliability. Cronbach's alphas ranged from.87 to .95. Having a strong instrument is important to promote future research. Also, administrators can use it to assess organizational climate in their clinics and propose interventions to improve it, thus promoting NP practice and the expansion of NP workforce.

  20. Development and psychometric testing of the Chinese Postnatal Risk Factors Questionnaire (CPRFQ) for postpartum depression.

    Science.gov (United States)

    Yan, Xiaoyu; Lu, Jun; Shi, Shenxun; Wang, Ximei; Zhao, Rui; Yan, Yuan; Chen, Gang

    2015-04-01

    This article describes the development and psychometric assessment of the Chinese Postnatal Risk Factors Questionnaire (CPRFQ). There were four phases in this process: (1) the items were generated using a literature review and a focus group, (2) content validity was evaluated by an expert panel, (3) a pilot study was conducted with 45 postpartum women to refine the scale, and (4) a convenience sample of 256 postpartum women in China was recruited to complete the questionnaire. Construct validity was established by exploratory factor analysis; a four-factor structure of the scale was accepted (social and family, personality and relationship, mother and infant, maternal feelings and 'doing the month'). These factors explained 47.46 % of the variance. Pearson's correlation coefficient was conducted to test convergent validity with the Edinburgh Postnatal Depression Scale (EPDS) (r = 0.54; p < 0.001). The Cronbach's alpha coefficient of the four subscales ranged from 0.58 to 0.71. The final 18-item version of the questionnaire is potentially a valuable tool for assessing postnatal risk factors in Chinese postpartum mothers.

  1. The K-factor, Covitality, and personality : A Psychometric Test of Life History Theory.

    Science.gov (United States)

    Figueredo, Aurelio José; Vásquez, Geneva; Brumbach, Barbara Hagenah; Schneider, Stephanie M R

    2007-03-01

    We present a psychometric test of life history theory as applied to human individual differences using MIDUS survey data (Brim et al. 2000). Twenty scales measuring cognitive and behavioral dimensions theoretically related to life history strategy were constructed using items from the MIDUS survey. These scales were used to construct a single common factor, the K-factor, which accounted for 70% of the reliable variance. The scales used included measures of personal, familial, and social function. A second common factor, Covitality, was constructed from scales for physical and mental health. Finally, a single general factor, Personality, was constructed from scales for the "Big Five" factors of personality. The K-factor, covitality factor, and general personality factor correlated significantly with each other, supporting the prediction that high K predicts high somatic effort and also manifests in behavioral display. Thus, a single higher-order common factor, the Super-K factor, was constructed that consisted of the K-factor, covitality factor, and personality factor.

  2. Psychometric testing of the Jefferson Scale of Empathy Health Profession Students' version with Australian paramedic students.

    Science.gov (United States)

    Williams, Brett; Brown, Ted; Boyle, Malcolm; Dousek, Simon

    2013-03-01

    Evidence now suggests that improved empathic behaviors can have a positive impact on healthcare outcomes. Therefore, having psychometrically-sound empathy scales is important for healthcare educators. In this study, the factor structure of the 20-item Jefferson Scale Empathy-Health Profession Students' version, when completed by a group of undergraduate paramedic students from a large Australian university, was investigated. Data from the Scale completed by 330 paramedic students were analyzed using principal components analysis followed by a maximum likelihood confirmatory factor analysis to test goodness of fit to the sample data. Two factors emerged from the principal components analysis, "compassionate care" and "perspective taking", accounting for 44.2% of the total variance. The 17-item two-factor model produced good model fit and good reliability estimates. Three of the original items did not fit the model. Results from the confirmatory factor analysis suggest that the 17-item Jefferson Scale Empathy-Health Profession Students' version is a valid and reliable measure for undergraduate paramedic students' empathy levels.

  3. Psychometric Testing of the Self-Efficacy for Interdisciplinary Plans of Care Scale.

    Science.gov (United States)

    Molle, Elizabeth; Froman, Robin

    2017-01-01

    Computerized interdisciplinary plans of care have revitalized nurse-centric care plans into dynamic and meaningful electronic documents. To maximize the benefits of these documents, it is important to understand healthcare professionals' attitudes, specifically their confidence, for making computerized interdisciplinary care plans useful and meaningful documents. The purpose of the study was to test the psychometric properties of the Self-Efficacy for Interdisciplinary Plans of Care instrument intended to measure healthcare professionals' self-efficacy for using such documents. Content validity was assessed by an expert review panel. Content validity indices ranged from 0.75 to 1.00, with a scale CVI of 0.94. A sample of 389 healthcare providers completed the 14-item instrument. Principal axis factoring was used to assess factor structure. The exploratory factor analysis yielded a single-factor structure accounting for 71.76% of covariance. Cronbach internal consistency coefficient for the single factor solution was .97. The corrected item-total correlations ranged from 0.71 to 0.90. The coefficient of stability, during a 2-week period, with a subset of the sample (n = 38), was estimated at 0.82. The results of this study suggest that the Self-Efficacy for Interdisciplinary Plans of Care has sturdy reliability and validity for measuring the self-efficacy of healthcare providers to make computerized interdisciplinary plans of care meaningful and useful documents.

  4. Development and psychometric testing of the Chinese nurses job satisfaction scale

    Directory of Open Access Journals (Sweden)

    Ying Liu

    2016-10-01

    Full Text Available A proposed Chinese Nurses Job Satisfaction Scale was developed and its psychometric properties tested. Instrument development was conducted in four phases. The constructs and items of the instrument were developed from literature review and were confirmed by interviewing six Chinese clinical experts in phase I. During phase II, five experts evaluated the scale’s content validity. In phase III, exploratory factor analysis was used to examine the instrument constructs (n = 302. In phase IV, a multi-stage random sampling was used to recruit 510 subjects to examine the proposed model by confirmatory factor analysis. The Content Validity Index of the instrument for items ranged from 0.8 to 1.0. Seven factors were extracted from exploratory factor analysis, which explained 71.47% of the total variance. A confirmatory factor analysis supported the modified measurement model of nurses’ job satisfaction with 34 items. The scale overall internal consistency Cronbach’s alpha coefficient was 0.93.

  5. Psychometric Testing of the Greek Version of the Clinical Learning Environment-Teacher (CLES+T)

    Science.gov (United States)

    Papastavrou, Evridiki; Dimitriadou, Maria; Tsangari, Haritini

    2016-01-01

    Clinical practice is an important part of nursing education, and robust instruments are required to evaluate the effectiveness of the hospital setting as a learning environment. The study aim is the psychometric test of the Clinical Learning Environment+Teacher (CLES+T) scale-Greek version.: 463 students practicing in acute care hospitals participated in the study. The reliability of the instrument was estimated with Cronbach’s alpha coefficients. The construct validity was evaluated using exploratory factor analysis (EFA) with Varimax rotation. Convergent validity was examined by measuring the bivariate correlations between the scale/subscales. Content, validity and semantic equivalence were examined through reviews by a panel of experts. The total scale showed high internal consistency (α=0.95). EFA was identical to the original scale, had eigen values larger than one and explained a total of 67.4% of the variance. The factor with the highest eigen value and the largest percentage of variance explained was “supervisory relationship”, with an original eigenvalue of 13.1 (6.8 after Varimax rotation) and an explanation of around 38% of the variance (or 20% after rotation). Convergent validity was examined by measuring the bivariate correlations between the scale and a question that measured the general satisfaction. The Greek version of the CLES+T is a valid and reliable instrument that can be used to examine students’ perceptions of the clinical learning environment. PMID:26652075

  6. The Effect of Logical Choice Weight and Corrected Scoring Methods on Multiple Choice Agricultural Science Test Scores

    Directory of Open Access Journals (Sweden)

    B. K. Ajayi

    2012-12-01

    Full Text Available The study focused on the effect of logical choice weight and corrected scoring methods on multiple choice Agricultural science test scores the study also investigated the interaction effect of logical choice weight and corrected scoring methods in schools ,and types of school in multiple choice agricultural science test. The researcher used a combination of survey type and one short experimental design. The sample for the study consisted of 600 students selected by stratified random sampling techniques in south western Nigeria. Overall performance of students in percentage, and correlation was analyzed. The hypotheses were generated and tested at 0.05 level of significance. The study revealed that there was a significant difference in the academic performance of students in logical choice weight and corrected scoring methods in multiple choice agricultural science test scores. The result also shown that there was no interaction effect on the two scoring methods in the type of schools, the location of schools in multiple choices agricultural science test. The study revealed that logical choice weight scoring method was the best method that favoured the scoring of the students’ scripts in multiple choices agricultural science test. On the basis of these findings, logical choice weight should be introduced to the teachers to use in the classroom as a new method of scoring multiple choice agricultural science the logical choice weight method is recommended in the ministry of education, in Examination Division, and to junior secondary schools for scoring JSS (3 three multiple choice test. Examination bodies such as West Africa Examination Council (WAEC, National Examination Council (NECO, Joint Admission and Matriculation Board (JAMB should adopt the use of logical choice weight method in scoring multiple choice tests. The method could be used in tertiary institutions for post ‘JAMB’ Unify Matriculation Examination (UME test. It is also

  7. Development and psychometric testing of a new instrument to measure the caring behaviour of nurses in Italian acute care settings.

    Science.gov (United States)

    Piredda, Michela; Ghezzi, Valerio; Fenizia, Elisa; Marchetti, Anna; Petitti, Tommasangelo; De Marinis, Maria Grazia; Sili, Alessandro

    2017-07-16

    To develop and psychometrically test the Italian-language Nurse Caring Behaviours Scale, a short measure of nurse caring behaviour as perceived by inpatients. Patient perceptions of nurses' caring behaviours are a predictor of care quality. Caring behaviours are culture-specific, but no measure of patient perceptions has previously been developed in Italy. Moreover, existing tools show unclear psychometric properties, are burdensome for respondents, or are not widely applicable. Instrument development and psychometric testing. Item generation included identifying and adapting items from existing measures of caring behaviours as perceived by patients. A pool of 28 items was evaluated for face validity. Content validity indexes were calculated for the resulting 15-item scale; acceptability and clarity were pilot tested with 50 patients. To assess construct validity, a sample of 2,001 consecutive adult patients admitted to a hospital in 2014 completed the scale and was split into two groups. Reliability was evaluated using nonlinear structural equation modelling coefficients. Measurement invariance was tested across subsamples. Item 15 loaded poorly in the exploratory factor analysis (n = 983) and was excluded from the final solution, positing a single latent variable with 14 indicators. This model fitted the data moderately. The confirmatory factor analysis (n = 1018) returned similar results. Internal consistency was excellent in both subsamples. Full scalar invariance was reached, and no significant latent mean differences were detected across subsamples. The new instrument shows reasonable psychometric properties and is a promising short and widely applicable measure of inpatient perceptions of nurse caring behaviours. © 2017 John Wiley & Sons Ltd.

  8. Correlation of the Scores on Barron's Ego Strength Scale with the Scores on the Bender-Gestalt Test.

    Science.gov (United States)

    Martin, John D.; And Others

    1979-01-01

    The degree of relationship between scores on the Barron Ego Strength Scale and the scores on the Bender-Gestalt Test was investigated on a sample of college students. Correlations were moderate to low. Racial differences were observed on the Bender-Gestalt Test. (Author/JKS)

  9. Detection of Invalid Test Scores on Admission Tests : A Simulation Study Using Person-Fit Statistics

    NARCIS (Netherlands)

    Tendeiro, Jorge N.; Meijer, Rob R.; Albers, Casper J.

    While an admission test may strongly predict success in university or law school programs for most test takers, there may be some test takers who are mismeasured. To address this issue, a class of statistics called person-fit statistics is used to check the validity of individual test scores.

  10. A new scale for disaster nursing core competencies: Development and psychometric testing.

    Science.gov (United States)

    Al Thobaity, Abdulellah; Williams, Brett; Plummer, Virginia

    2016-02-01

    All nurses must have core competencies in preparing for, responding to and recovering from a disaster. In the Kingdom of Saudi Arabia (KSA), as in many other countries, disaster nursing core competencies are not fully understood and lack reliable, validated tools. Thus, it is imperative to develop a scale for exploring disaster nursing core competencies, roles and barriers in the KSA. This study's objective is to develop a valid, reliable scale that identifies and explores core competencies of disaster nursing, nurses' roles in disaster management and barriers to developing disaster nursing in the KSA. This study developed a new scale testing its validity and reliability. A principal component analysis (PCA) was used to develop and test psychometric properties of the new scale. The PCA used a purposive sample of nurses from emergency departments in two hospitals in the KSA. Participants rated 93 paper-based, self-report questionnaire items from 1 to 10 on a Likert scale. PCA using Varimax rotation was conducted to explore factors emerging from responses. The study's participants were 132 nurses (66% response rate). PCA of the 93 questionnaire items revealed 49 redundant items (which were deleted) and 3 factors with eigenvalues of >1. The remaining 44 items accounted for 77.3% of the total variance. The overall Cronbach's alpha was 0.96 for all factors: 0.98 for Factor 1, 0.92 for Factor 2 and 0.86 for Factor 3. This study provided a validated, reliable scale for exploring nurses' core competencies, nurses' roles and barriers to developing disaster nursing in the KSA. The new scale has many implications, such as for improving education, planning and curricula. Crown Copyright © 2015. Published by Elsevier Ltd. All rights reserved.

  11. The Logical Grammatical Structures Test: psychometric properties and normative data in Dutch-speaking children and adolescents.

    Science.gov (United States)

    Van der Elst, Wim; Reed, Helen; Jolles, Jelle

    2013-01-01

    Logical grammatical structures comprehension is the ability to understand the relations between objects, actions, and qualities in spoken and written sentences. The Logical Grammatical Structures Test (LGST) was especially devised to assess these abilities in children and adolescents. In the present study the LGST was administered to 405 healthy Dutch children and adolescents. The aims of the present study were (i) to evaluate the psychometric properties of the LGST (using an Item Response Theory framework), and (ii) to establish demographically corrected normative data. The results showed that there was a strong curvilinear relationship between age and LGST performance, i.e., the relative improvement in ability level was much more pronounced for younger children (aged below 14 years) than for older children (aged above 14 years). Level of parental education was positively associated with the LGST performance. Normative data that took the relevant demographic variables into account were established, and it was shown that the LGST had sound psychometric properties.

  12. Psychometric properties of a computerized adaptive test for assessing mobility in older adults using novel video-animation technology.

    Science.gov (United States)

    Ip, Edward H; Rejeski, W Jack; Marsh, Anthony P; Barnard, Ryan; Chen, Shyh-Huei

    2013-10-01

    This paper reports on the psychometric properties of a computerized adaptive test (CAT) version of the Mobility Assessment Tool (MAT) for older adults (MAT-CAT). An item pool of 78 video-animation-based items for mobility was developed, and response data were collected from a sample of 234 participants aged 65-90 years. The video-animation-based instrument was designed to minimize ambiguity in the presentation of task demands. In addition to evaluating traditional psychometric properties including dimensionality, differential item functioning (DIF), and local dependence, we extensively tested the performance of several MAT-CAT measures and compared their performances with a fixed format. Operationally, the MAT-CAT was sufficiently unidimensional and had acceptable levels of local independence. One DIF item was removed. Most importantly, the CAT measures showed that even starting with a single fixed item at the mean ability, the adaptive version delivered better performance than the fixed format in terms of several criteria including the standard error of estimate. The MAT-CAT demonstrated satisfactory psychometric properties and superior performance to a fixed format. The video-animation-based adaptive instrument can be used for assessing mobility with specificity and precision.

  13. Psychometric evaluation of the Signs of Depression Scale with a revised scoring mechanism in stroke patients with communicative impairment.

    Science.gov (United States)

    van Dijk, Mariska J; de Man-van Ginkel, Janneke M; Hafsteinsdóttir, Thóra B; Schuurmans, Marieke J

    2017-05-01

    To investigate (1) the diagnostic value of the Signs of Depression Scale (SODS) in a Likert scale format and (2) whether the Likert scale improves the diagnostic value compared with the original dichotomous scale. Cross-sectional multicentre study. One general and one university hospital in the Netherlands. A total of 116 consecutive hospitalized stroke patients, of whom 53 were patients with communicative impairment. Depression was diagnosed with the Composite International Diagnostic Interview (CIDI) administered to the patients' relatives. The Barthel Index (BI) was used as an external validator. The correlation between the CIDI and the SODS-Likert or the SODS was small ( rb = 0.18), and the correlation between the Barthel Index and the SODS-Likert ( rs = -0.30) or the SODS ( rs = -0.33) was moderate. For both instruments, the discriminatory power for diagnosing depression when compared with the CIDI was best at a cut-off score of ⩾2. The internal consistency of the SODS-Likert was acceptable (α = 0.69) and slightly higher than that of the SODS (α = 0.57). The inter-rater reliability of the SODS-Likert and the SODS was acceptable (intraclass correlation coefficient (ICC) 0.66 and ICC 0.80, respectively). The clinical utility was rated good. The diagnostic value of the SODS did not improve using a Likert scale format. However, the diagnostic value of the original dichotomous SODS is reasonable for the initial mood assessment of stroke patients with communicative impairment.

  14. Psychometric Characteristics Associated with Changes on the Passage Comprehension Test of the Woodcock Reading Mastery Tests.

    Science.gov (United States)

    Halpin, Gerald; Simpson, Robert

    Forty adolescent subjects with behavior problems at home or at school were administered the Woodcock Reading Mastery Tests. Subjects ranged in age from 12 to 18, and included 25 males and 15 females. The Passage Comprehension Test for each subject was rescored using three different ceiling criteria: (1) five errors in six consecutive responses,…

  15. Testing for Race: Stanford University, Asian Americans, and Psychometric Testing in California, 1920-1935

    OpenAIRE

    Palter, David

    2014-01-01

    Between 1920 and 1935, researchers at Stanford University administered thousands of eugenic tests of intelligence and personality traits to Chinese-American and Japanese-American children in California's public schools. The researchers and their funders, a diverse coalition of white supremacists and immigrant advocacy organizations, sought to use these tests to gauge the assimilative possibility and racial worth of Asian immigrants, and to intervene in local, national, and transpacific policy...

  16. Psychometric properties of a test in evidence based practice: the Spanish version of the Fresno test

    Directory of Open Access Journals (Sweden)

    Jiménez-Villa Josep

    2010-06-01

    Full Text Available Abstract Background Validated instruments are needed to evaluate the programmatic impact of Evidence Based Practice (EBP training and to document the competence of individual trainees. This study aimed to translate the Fresno test into Spanish and subsequently validate it, in order to ensure the equivalence of the Spanish version against the original English version. Methods Before and after study performed between October 2007 and June 2008. Three groups of participants: (a Mentors of family medicine residents (expert group (n = 56; (b Family medicine physicians (intermediate experience group (n = 17; (c Family medicine residents (novice group (n = 202; Medical residents attended an EBP course, and two sets of the test were administered before and after the course. The Fresno test is a performance based measure for use in medical education that assesses EBP skills. The outcome measures were: inter-rater and intra-rater reliability, internal consistency, item analyses, construct validity, feasibility of administration, and responsiveness. Results Inter-rater correlations were 0.95 and 0.85 in the pre-test and the post-test respectively. The overall intra-rater reliability was 0.71 and 0.81 in the pre-test and post-test questionnaire, respectively. Cronbach's alpha was 0.88 and 0.77, respectively. 152 residents (75.2% returned both sets of the questionnaire. The observed effect size for the residents was 1.77 (CI 95%: 1.57-1.95, the standardised response mean was 1.65 (CI 95%:1.47-1.82. Conclusions The Spanish version of the Fresno test is a useful tool in assessing the knowledge and skills of EBP in Spanish-speaking residents of Family Medicine.

  17. The Visual Aural Digit Span Test and Bender Gestalt Test as Predictors of Wide Range Achievement Test-Revised Scores.

    Science.gov (United States)

    Smith, Teresa C.; Smith, Billy L.

    1988-01-01

    Examined Visual Aural Digit Span Test (VADS) and Bender-Gestalt (BG) scores as predictors of Wide Range Achievement Test-Revised (WRAT-R) scores among 115 elementary school students referred for low academic achievement. Divided children into three age groups. Results suggest BG and VADS Test can be effective screening devices for young children…

  18. The Graduate Management Admission Test: Technical Report on Test Development and Score Interpretation for GMAT Users.

    Science.gov (United States)

    Schrader, William B.

    This report provides information on test development, test administration, and score interpretation for the Graduate Management Admission Test (GMAT). The GMAT, first administered in 1954, provides objective measures of an applicant's abilities for use in admissions decisions by graduate management schools. It is currently composed of five…

  19. Cross-cultural psychometric testing of the Care Dependency Scale with data

    NARCIS (Netherlands)

    Dijkstra, A; Coleman, M; Tomas, C; Valimaki, M; Dassen, T

    2003-01-01

    Background. The importance of the present study lies in addressing whether it is justified to compare the care dependency status of older patients from different settings and cultures using the same instrument. Aim. The aim of this international study was to compare the psychometric properties of th

  20. Psychometric Properties of the Mandarin Version of the Childhood Autism Spectrum Test (CAST): An Exploratory Study

    Science.gov (United States)

    Sun, Xiang; Allison, Carrie; Auyeung, Bonnie; Matthews, Fiona E.; Norton, Samuel; Baron-Cohen, Simon; Brayne, Carol

    2014-01-01

    Limited studies have investigated the latent autistic traits in the mainland Chinese population for autism spectrum conditions (ASC). This study explored the psychometric properties of a Mandarin Chinese version of the CAST in a sample consisting of 737 children in mainstream schools and 50 autistic cases. A combination of categorical data factor…

  1. Your move: The effect of chess on mathematics test scores.

    Science.gov (United States)

    Rosholm, Michael; Mikkelsen, Mai Bjørnskov; Gumede, Kamilla

    2017-01-01

    We analyse the effect of substituting a weekly mathematics lesson in primary school grades 1-3 with a lesson in mathematics based on chess instruction. We use data from the City of Aarhus in Denmark, combining test score data with a comprehensive data set obtained from administrative registers. We use two different methodological approaches to identify and estimate treatment effects and we tend to find positive effects, indicating that knowledge acquired through chess play can be transferred to the domain of mathematics. We also find larger impacts for unhappy children and children who are bored in school, perhaps because chess instruction facilitates learning by providing an alternative approach to mathematics for these children. The results are encouraging and suggest that chess may be an important and effective tool for improving mathematical capacity in young students.

  2. The "Reading the Mind in the Eyes" Test: Investigation of Psychometric Properties and Test-Retest Reliability of the Persian Version

    Science.gov (United States)

    Khorashad, Behzad S.; Baron-Cohen, Simon; Roshan, Ghasem M.; Kazemian, Mojtaba; Khazai, Ladan; Aghili, Zahra; Talaei, Ali; Afkhamizadeh, Mozhgan

    2015-01-01

    The psychometric properties of the Persian "Reading the Mind in the Eyes" test were investigated, so were the predictions from the Empathizing-Systemizing theory of psychological sex differences. Adults aged 16-69 years old (N = 545, female = 51.7%) completed the test online. The analysis of items showed them to be generally acceptable.…

  3. Field Psychometric Testing of the Instrument for Assessment of Psychological Predictors of Well-Being and Quality of Life in People with HIV or AIDS.

    Science.gov (United States)

    Remor, Eduardo; Fuster-RuizdeApodaca, Maria José; Ballester-Arnal, Rafael; Gómez-Martínez, Sandra; Fumaz, Carmina R; González-Garcia, Marian; Ubillos-Landa, Silvia; Aguirrezabal-Prado, Arrate; Molero, Fernando; Ruzafa-Martínez, Maria

    2016-06-01

    The Screenphiv, a screening measure for psychological issues related to HIV, was psychometrically tested in a study involving 744 HIV-infected people in Spain. Participants ages 18-82 (M = 43.04, 72 % men, 28 % women) completed an assessment protocol that included the Screenphiv and the MOS-HIV. A trained interviewer also collected relevant illness-related clinical data and socio-demographics from the participants. A confirmatory factor analysis was used to evaluate the goodness of fit of the Screenphiv's theoretical model and confirmed six first-order factors and two second-order factors [RMSEA (IC 90 %) = 0.07 (0.07-0.08)]. No floor or ceiling effects were observed for the scores. Cronbach's alphas were acceptable for all of the factors (from 0.65 to 0.92). Criterion-related validity also achieved; Screenphiv scores were related to socio-demographic and clinical variables and MOS-HIV summary scores. The Screenphiv is a reliable and valid measure, ready to use in research and clinical settings in Spain.

  4. A psychometric comparison of the Internet Addiction Test, the Internet Related Problem Scale, and self-diagnosis

    OpenAIRE

    Widyanto, L; Griffiths, MD; Brunsden, V

    2011-01-01

    One of the more prominent issues in the field of Internet addiction is the validity of the instrument used to assess users’ level of Internet involvement. Many of the instruments used to assess Internet addiction have high face validity but have yet to be tested psychometrically. The aim of this study is to compare two of the most used Internet addiction research measures, the Internet Addiction Test (IAT) and the Internet-Related Problem Scale (IRPS), along with a self-diagnostic question si...

  5. Prediction of the difficulty level in a standardized reading comprehension test: contributions from cognitive psychology and psychometrics.

    Directory of Open Access Journals (Sweden)

    Brizuela, Armel

    2013-07-01

    Full Text Available This research seeks to identify possible predictors of the difficulty level of reading comprehension items used in a standardized psychometric test for university admission. Several potential predictors of difficulty were proposed, namely, propositional density, negations, grammatical structure, vocabulary difficulty, presence of enhancement elements (words highlighted typographically, item abstraction level and degree of similarity between correct option and relevant text to resolve the item. By Linear Logistic Test Model (Fisher, 1973 it was found that the number of propositions, the syntactic structure, and fundamentally, the presence of difficult words contributed to the prediction of the item difficulty level.

  6. Bayesian psychometric scaling

    NARCIS (Netherlands)

    Fox, G.J.A.; Berg, van den S.M.; Veldkamp, B.P.; Irwing, P.; Booth, T.; Hughes, D.

    2015-01-01

    In educational and psychological studies, psychometric methods are involved in the measurement of constructs, and in constructing and validating measurement instruments. Assessment results are typically used to measure student proficiency levels and test characteristics. Recently, Bayesian item resp

  7. Bayesian psychometric scaling

    NARCIS (Netherlands)

    Fox, Gerardus J.A.; van den Berg, Stéphanie Martine; Veldkamp, Bernard P.; Irwing, P.; Booth, T.; Hughes, D.

    2015-01-01

    In educational and psychological studies, psychometric methods are involved in the measurement of constructs, and in constructing and validating measurement instruments. Assessment results are typically used to measure student proficiency levels and test characteristics. Recently, Bayesian item

  8. Future of Psychometrics: Ask What Psychometrics Can Do for Psychology

    Science.gov (United States)

    Sijtsma, Klaas

    2012-01-01

    I address two issues that were inspired by my work on the Dutch Committee on Tests and Testing (COTAN). The first issue is the understanding of problems test constructors and researchers using tests have of psychometric knowledge. I argue that this understanding is important for a field, like psychometrics, for which the dissemination of…

  9. Future of Psychometrics: Ask What Psychometrics Can Do for Psychology

    Science.gov (United States)

    Sijtsma, Klaas

    2012-01-01

    I address two issues that were inspired by my work on the Dutch Committee on Tests and Testing (COTAN). The first issue is the understanding of problems test constructors and researchers using tests have of psychometric knowledge. I argue that this understanding is important for a field, like psychometrics, for which the dissemination of…

  10. Empirical Bayes Estimates of Domain Scores under Binomial and Hypergeometric Distributions for Test Scores.

    Science.gov (United States)

    Lin, Miao-Hsiang; Hsiung, Chao A.

    1994-01-01

    Two simple empirical approximate Bayes estimators are introduced for estimating domain scores under binomial and hypergeometric distributions respectively. Criteria are established regarding use of these functions over maximum likelihood estimation counterparts. (SLD)

  11. The Factor Structure and Psychometric Properties of the Spanish Version of the Mayer-Salovey-Caruso Emotional Intelligence Test

    OpenAIRE

    Sánchez García, Manuel; Extremera, Natalio; Fernández Berrocal, Pablo

    2016-01-01

    This research examined evidence regarding the reliability and validity of scores on the Spanish version of the Mayer-Salovey-Caruso Emotional Intelligence Test, Version 2.0 (MSCEIT; Mayer, Salovey, & Caruso, 2002). In Study 1, we found a close convergence of the Spanish consensus scores and the general and expert consensus scores determined with Mayer, Salovey, Caruso, and Sitarenios (2003) data. The MSCEIT also demonstrated adequate evidence of reliability of test scores as estim...

  12. Volatility in School Test Scores: Implications for Test-Based Accountability Systems

    Science.gov (United States)

    Kane, Thomas J.; Staiger, Douglas O.

    2002-01-01

    By the spring of 2000, forty states had begun using student test scores to rate school performance. Twenty states have gone a step further and are attaching explicit monetary rewards or sanctions to a school's test performance. In this paper, the authors focus on accountability programs in which states measure the effectiveness of individual…

  13. Applied psychometrics in clinical psychiatry: the pharmacopsychometric triangle

    DEFF Research Database (Denmark)

    Bech, P

    2009-01-01

    OBJECTIVE: To consider applied psychometrics in psychiatry as a discipline focusing on pharmacopsychology rather than psychopharmacology as illustrated by the pharmacopsychometric triangle. METHOD: The pharmacopsychological dimensions of clinically valid effects of drugs (antianxiety, antidepress......OBJECTIVE: To consider applied psychometrics in psychiatry as a discipline focusing on pharmacopsychology rather than psychopharmacology as illustrated by the pharmacopsychometric triangle. METHOD: The pharmacopsychological dimensions of clinically valid effects of drugs (antianxiety......, antidepressive, antimanic, and antipsychotic), of clinically unwanted effects of these drugs, and the patients' own subjective perception of the balance between wanted and unwanted effects are analysed using rating scales assessed by modern psychometric tests (item response theory models) RESULTS: Symptom rating...... scales fulfilling the item response theory models have been shown to be psychometrically valid outcome scales as their total scores are sufficient statistics for demonstrating dose-response relationship within the various classes of antianxiety, antidepressive, antimanic or antipsychotic drugs. The total...

  14. Development and psychometric testing of a breast cancer patient-profiling questionnaire

    Directory of Open Access Journals (Sweden)

    Gorini A

    2015-06-01

    Full Text Available Alessandra Gorini,1,2 Ketti Mazzocco,1,2 Sara Gandini,2 Elisabetta Munzone,2 Gordon McVie,2 Gabriella Pravettoni1,2 1Department of Health Science, University of Milan, Milan, Italy; 2European Institute of Oncology, Milan, Italy Introduction: The advent of “personalized medicine” has been driven by technological advances in genomics. Concentration at the subcellular level of a patient's cancer cells has meant inevitably that the “person” has been overlooked. For this reason, we think there is an urgent need to develop a truly personalized approach focusing on each patient as an individual, assessing his/her unique mental dimensions and tailoring interventions to his/her individual needs and preferences. The aim of this study was to develop and test the psychometric properties of the ALGA-Breast Cancer (ALGA-BC, a new multidimensional questionnaire that assesses the breast cancer patient's physical and mental characteristics in order to provide physicians, prior to the consultation, with a patient's profile that is supposed to facilitate subsequent communication, interaction, and information delivery between the doctor and the patient. Methods: The specific validation processes used were: content and face validity, construct validity using factor analysis, reliability and internal consistency using test–retest reliability, and Cronbach's alpha correlation coefficient. The exploratory analysis included 100 primary breast cancer patients and 730 healthy subjects. Results: The exploratory factor analysis revealed eight key factors: global self-rated health, perceived physical health, anxiety, self-efficacy, cognitive closure, memory, body image, and sexual life. Test–retest reliability and internal consistency were good. Comparing patients with a sample of healthy subjects, we also observed a general ability of the ALGA-BC questionnaire to discriminate between the two. Conclusion: The ALGA-BC questionnaire with 29 items is a valid

  15. Development and psychometric testing of a semantic differential scale of sexual attitude for the older person.

    Science.gov (United States)

    Park, Hyojung; Shin, Sunhwa

    2015-12-01

    The purpose of this study was to develop and test a semantic differential scale of sexual attitudes for older people in Korea. The scale was based on items derived from a literature review and focus group interviews. A methodological study was used to test the reliability and validity of the instrument. A total of 368 older men and women were recruited to complete the semantic differential scale. Fifteen pairs of adjective ratings were extracted through factor analysis. Total variance explained was 63.40%. To test for construct validity, group comparisons were implemented. The total score of sexual attitudes showed significant differences depending on gender and availability of sexual activity. Cronbach's alpha coefficient for internal consistency was 0.96. The findings of this study demonstrate that the semantic differential scale of sexual attitude is a reliable and valid instrument. © 2015 Wiley Publishing Asia Pty Ltd.

  16. The Relationship of Scores on Elizur's Hostility System on the Rorschach to the Acting-Out Score on the Hand Test.

    Science.gov (United States)

    Martin, John D.; And Others

    1978-01-01

    The relationship between Elizur's Hostility Scoring on the Rorschach Test and the Acting-Out Score on the Hand Test was examined. Correlations between the two measures (using several scoring procedures) ranged from .40 to .64. (JKS)

  17. Reporting Diagnostic Scores in Educational Testing: Temptations, Pitfalls, and Some Solutions

    Science.gov (United States)

    Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J.

    2010-01-01

    Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…

  18. Hemisphere Preference Test: psychometric properties and relations with academic performance among adolescents.

    Science.gov (United States)

    Díaz-Morales, Juan Francisco; Escribano Barreno, Cristina

    2014-01-01

    The present study explored, in adolescents, the dimensionality (factorial structure), reliability (internal consistency and test-retest stability) and predictive validity (in relation with academic grades) of the Hemispheric Preference Test (HPT), a widely used self-report index of thinking styles among adults. A sample of 990 Spanish adolescents aged 10-14 completed HPT and reported their academic grades. Results indicated a two-factor structure for HPT that can be interpreted in terms of left- and right-Hemisphere Preference (HP). The two-factor structure was clearer in boys compared to girls. The internal consistency and test-retest at 6 and 12 months were satisfactory. Left-HP scores decreased with age whereas girls obtained a higher mean score on right-HP sub-scale. Finally, Left-HP accounted for a significant variance percentage on academic grades after controlling for age and sex. Results suggested that Spanish version of the HPT was effective and reliable among adolescents.

  19. A quantitative test of the cultural theory of risk perceptions: comparison with the psychometric paradigm.

    Science.gov (United States)

    Marris, C; Langford, I H; O'Riordan, T

    1998-10-01

    This paper seeks to compare two frameworks which have been proposed to explain risk perceptions, namely, cultural theory and the psychometric paradigm. A structured questionnaire which incorporated elements from both approaches was administered to 129 residents of Norwich, England. The qualitative risk characteristics generated by the psychometric paradigm explained a far greater proportion of the variance in risk perceptions than cultural biases, though it should be borne in mind that the qualitative characteristics refer directly to risks whereas cultural biases are much more distant variables. Correlations between cultural biases and risk perceptions were very low, but the key point was that each cultural bias was associated with concern about distinct types of risks and that the pattern of responses was compatible with that predicted by cultural theory. The cultural approach also provided indicators for underlying beliefs regarding trust and the environment; beliefs which were consistent within each world view but divergent between them. An important drawback, however, was that the psychometric questionnaire could only allocate 32% of the respondents unequivocally to one of the four cultural types. The rest of the sample expressed several cultural biases simultaneously, or none at all. Cultural biases are therefore probably best interpreted as four extreme world views, and a mixture of qualitative and quantitative research methodologies would generate better insights into who might defend these views in what circumstances, whether there are only four mutually exclusive world views or not, and how these views are related to patterns of social solidarity, and judgments on institutional trust.

  20. Psychometric testing of the Norwegian version of the questionnaire, student satisfaction and self-confidence in learning, used in simulation.

    Science.gov (United States)

    Tosterud, Randi; Petzäll, Kerstin; Hedelin, Birgitta; Hall-Lord, Marie Louise

    2014-11-01

    Simulation is increasingly being used as an approach to learning in nurse education. There is a need for frameworks and valid evaluation tools to help guide educators in implementing the method. The questionnaire, Student Satisfaction and Self-Confidence in Learning, which consists of two subscales, has been developed by the National League for Nursing in the US for evaluating simulation used in nurse education. The aim of the present study was to test the questionnaire, Student Satisfaction and Self-Confidence in Learning, for psychometric properties in a Norwegian nurse education context. A sample consisting of 130 nursing students participated in a simulation situation, and 123 responded. When the questionnaire was tested in its entirety, psychometric testing conducted with a principal component analysis did not reveal a stable factor solution. The two subscales were then tested separately. The analysis for Satisfaction with Current Learning suggested a one-component solution, thereby explaining 62.8% of the variance, and the internal reliability was 0.84. With regard to Self-Confidence in Learning, no stable solution was achieved, and an alpha value of 0.64 was shown. To further validate the questionnaire, Student Satisfaction and Self-Confidence in Learning, more studies by various nursing programmes in different cultural contexts are recommended.

  1. A Procedure for Linear Polychotomous Scoring of Test Items (Computer Diskette).

    Science.gov (United States)

    weights that are then associated with the response categories of test items . When tests are scored using these scoring weights, test reliability...program poly. The example demonstrates how polyweighting can be used to calibrate and score test items drawn from an item bank that is too large to

  2. Psychometric Properties of the Physical Activity Questionnaire for Older Children in Italy: Testing the Validity among a General and Clinical Pediatric Population.

    Directory of Open Access Journals (Sweden)

    Erica Gobbi

    Full Text Available The purpose of this research was to assess an Italian version of the Physical Activity Questionnaire for Older Children (PAQ-C-It. Three separate studies were conducted, whereby testing general psychometric properties, construct validity, concurrent validity and the factor structure of the PAQ-C-It among general and clinical pediatric population. Study 1 (n = 1170 examined the psychometric properties, internal consistency, factor structure (exploratory factor analysis, EFA and construct validity with enjoyment perception during physical activity. Study 2 (n = 59 reported on reliability, construct validity with enjoyment and BMI, and on cross-sectional concurrent validity with objectively measured MVPA (tri-axial accelerometry over the span of seven consecutive days. Study 3 (n = 58 examined the PAQ-C-It reliability, construct validity with BMI and VO2max as the objective measurement among a population of children with congenital heart defects (CHD. In study 2 and 3, the factor structure of the PAQ-C-It was then re-examined with an EFA. The PAQ-C-It showed acceptable to good reliability (alpha .70 to .83. Results on construct validity showed moderate but significant association with enjoyment perception (r = .30 and .36, with BMI (r = -.30 and -.79 for CHD simple form, and with the VO2max (r = .55 for CHD simple form. Significant concurrent validity with the objectively measured MVPA was reported (rho = .30, p < .05. Findings of the EFA suggested a two-factor structure for the PAQ-C-It, with items 2, 3, and 4 contributing little to the total score. This study supports the PAQ-C-It as an appropriate instrument to assess the MVPA levels of Italian children, including children with simple forms of CHD. Support is given to the possible instrument effectiveness on a large international perspective in order to level out data gathering across the globe.

  3. Psychometric Properties of the Physical Activity Questionnaire for Older Children in Italy: Testing the Validity among a General and Clinical Pediatric Population.

    Science.gov (United States)

    Gobbi, Erica; Elliot, Catherine; Varnier, Maurizio; Carraro, Attilio

    2016-01-01

    The purpose of this research was to assess an Italian version of the Physical Activity Questionnaire for Older Children (PAQ-C-It). Three separate studies were conducted, whereby testing general psychometric properties, construct validity, concurrent validity and the factor structure of the PAQ-C-It among general and clinical pediatric population. Study 1 (n = 1170) examined the psychometric properties, internal consistency, factor structure (exploratory factor analysis, EFA) and construct validity with enjoyment perception during physical activity. Study 2 (n = 59) reported on reliability, construct validity with enjoyment and BMI, and on cross-sectional concurrent validity with objectively measured MVPA (tri-axial accelerometry) over the span of seven consecutive days. Study 3 (n = 58) examined the PAQ-C-It reliability, construct validity with BMI and VO2max as the objective measurement among a population of children with congenital heart defects (CHD). In study 2 and 3, the factor structure of the PAQ-C-It was then re-examined with an EFA. The PAQ-C-It showed acceptable to good reliability (alpha .70 to .83). Results on construct validity showed moderate but significant association with enjoyment perception (r = .30 and .36), with BMI (r = -.30 and -.79 for CHD simple form), and with the VO2max (r = .55 for CHD simple form). Significant concurrent validity with the objectively measured MVPA was reported (rho = .30, p PAQ-C-It, with items 2, 3, and 4 contributing little to the total score. This study supports the PAQ-C-It as an appropriate instrument to assess the MVPA levels of Italian children, including children with simple forms of CHD. Support is given to the possible instrument effectiveness on a large international perspective in order to level out data gathering across the globe.

  4. Psychometric Quality of the Dutch Version of the Children's Eating Attitude Test in a Community Sample and a Sample of Overweight Youngsters

    Directory of Open Access Journals (Sweden)

    Lotte Theuwis

    2010-12-01

    Full Text Available Introduction. Disturbed eating attitudes may be important precursors of pathological eating patterns and, therefore need to be researched adequately. The Children's Eating Attitude Test (ChEAT is indicated for detecting at-risk attitudes and concerns in youngsters. Method. The present study was designed to provide a preliminary psychometric evaluation of the Dutch version of the ChEAT, by examining reliability and validity in a sample of 166 youngsters. Results. Generally the ChEAT seems to be a reliable instrument. Concurrent validity was demonstrated by positive correlations with measures assessing pathological eating behaviour and with related psychological problems. The discriminant validity was good. Based on ChEAT scores we can distinguish overweight youngsters from the community sample and “dieters” from “non dieters”. Divergent validity and factor structure show still shortcomings. Discussion. The Dutch version of the ChEAT seems to be a promising screening- and research instrument. Future prospective research could focus on a cut-off score for identifying at-risk youngsters.

  5. Developing Test Score Reports that Work: The Process and Best Practices for Effective Communication

    Science.gov (United States)

    Zenisky, April L.; Hambleton, Ronald K.

    2012-01-01

    Test scores matter these days. Test-takers want to understand how they performed, and test score reports, particularly those for individual examinees, are the vehicles by which most people get the bulk of this information. Historically, score reports have not always met the examinees' information or usability needs, but this is clearly changing…

  6. Psychometric Evaluation of the Ford Insomnia Response to Stress Test (FIRST) in Early Pregnancy.

    Science.gov (United States)

    Gelaye, Bizu; Zhong, Qiu-Yue; Barrios, Yasmin V; Redline, Susan; Drake, Christopher L; Williams, Michelle A

    2016-04-15

    To evaluate the construct validity and factor structure of the Spanish-language version of the Ford Insomnia Response to Stress Test questionnaire (FIRST-S) when used in early pregnancy. A cohort of 647 women were interviewed at ≤ 16 weeks of gestation to collect information regarding lifestyle, demographic, and sleep characteristics. The factorial structure of the FIRST-S was tested through exploratory and confirmatory factor analyses (EFA and CFA). Internal consistency and construct validity were also assessed by evaluating the association between the FIRST-S with symptoms of depression, anxiety, and sleep quality. Item response theory (IRT) analyses were conducted to complement classical test theory (CTT) analytic approaches. The mean score of the FIRST-S was 13.8 (range: 9-33). The results of the EFA showed that the FIRST-S contained a one-factor solution that accounted for 69.8% of the variance. The FIRST-S items showed good internal consistency (Cronbach α = 0.81). CFA results corroborated the one-factor structure finding from the EFA; and yielded measures indicating goodness of fit (comparative fit index of 0.902) and accuracy (root mean square error of approximation of 0.057). The FIRST-S had good construct validity as demonstrated by statistically significant associations of FIRST-S scores with sleep quality, antepartum depression and anxiety symptoms. Finally, results from IRT analyses suggested excellent item infit and outfit measures. The FIRST-S was found to have good construct validity and internal consistency for assessing vulnerability to insomnia during early pregnancy. © 2016 American Academy of Sleep Medicine.

  7. Psychometric properties of the Neck OutcOme Score (NOOS), Neck Disability Index (NDI) and Short form-36 (SF-36) were evaluated in patients with neck pain

    DEFF Research Database (Denmark)

    Sørensen, Tina Juul; Søgaard, Karen; Davis, Aileen M;

    2016-01-01

    OBJECTIVE: To assess reliability, construct validity, responsiveness and interpretability for Neck OutcOme Score (NOOS), Neck Disability Index (NDI) and Short-form-36 (SF-36) in neck pain patients. STUDY DESIGN AND SETTING: Internal consistency was assessed by Cronbach's alpha. Test-retest reliab......OBJECTIVE: To assess reliability, construct validity, responsiveness and interpretability for Neck OutcOme Score (NOOS), Neck Disability Index (NDI) and Short-form-36 (SF-36) in neck pain patients. STUDY DESIGN AND SETTING: Internal consistency was assessed by Cronbach's alpha. Test......, valid and responsive measure of self-reported disability in neck pain patients, performing at least as well or better than the commonly used SF-36 and NDI....

  8. Measuring use of positive thinking skills: psychometric testing of a new scale.

    Science.gov (United States)

    Bekhet, Abir K; Zauszniewski, Jaclene A

    2013-09-01

    Positive thinking interventions improve adaptive functioning and quality of life in many populations. However, no direct measure of positive thinking skills taught during intervention exists. This psychometric study of a convenience sample of 109 autism spectrum disorder (ASD) caregivers examined a new eight-item Positive Thinking Skills Scale (PTSS), which measures the frequency of use of positive thinking skills. The PTSS was found to be internally consistent (α = .90). Construct validity was supported by significant correlations (p thinking skills could provide direction for future intervention.

  9. Psychometric characteristics of Spanish adaptation of a Test for Bulimia (BULIT).

    Science.gov (United States)

    Vázquez Morejón, A J; García-Bóveda, R Jiménez; Vázquez-Morejón Jiménez, R

    2007-01-01

    The high prevalence of bulimia nervosa among eating disorders and limited repertoire of assessment tools available for these disorders in Spanish have resulted in special interest regarding the adaptation of a specific instrument such as the Bulimia Test (BULIT). A total of 119 outpatients, most of them women (97.5%) seen in a community mental health center, were assessed with the Spanish adaptation of the Bulimia Test (BULIT) and by Eating Attitudes Test (EAT-40). Reliability (internal consistency and temporal reliability) and validity (convergent and discriminant) were studied. Cronbach's alpha coefficient (alpha: 0.93) shows high internal consistency while the temporal reliability is supported by the high correlation observed (r=0.83) between applications conducted in an interval of 8-10 weeks. In relationship with convergent validity, moderate correlations were observed with the global score on EAT (r=0.38) and substantial correlation with the EAT Bulimia subscale (r=0.69). By using an 88 cutoff on the BULIT, the test properly classified 90% of the bulimic subjects and 100% of those without eating disorder. The results firmly supported the reliability and validity of this Spanish adaptation, stressing its utility as a screening device to detect current or incipient cases of bulimia and to assess severity of bulimic symptoms.

  10. Discrepancies between modified Medical Research Council dyspnea score and COPD assessment test score in patients with COPD

    Directory of Open Access Journals (Sweden)

    Rhee CK

    2015-08-01

    Full Text Available Chin Kook Rhee,1 Jin Woo Kim,2 Yong Il Hwang,3 Jin Hwa Lee,4 Ki-Suck Jung,3 Myung Goo Lee,5 Kwang Ha Yoo,6 Sang Haak Lee,7 Kyeong-Cheol Shin,8 Hyoung Kyu Yoon9 1Division of Pulmonary, Allergy and Critical Care Medicine, Department of Internal Medicine, Seoul St Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul, 2Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, Uijeongbu St Mary’s Hospital, College of Medicine, The Catholic University of Korea, Uijeongbu, 3Division of Pulmonary, Allergy and Critical Care Medicine, Department of Internal Medicine, Hallym University Medical Center, Hallym University College of Medicine, Anyang, 4Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, School of Medicine, Ewha Womans University, Seoul, 5Division of Pulmonary, Allergy and Critical Care Medicine, Department of Internal Medicine, Hallym University Chuncheon Sacred Heart Hospital, Hallym University College of Medicine, Chuncheon, 6Division of Pulmonary, Allergy and Critical Care Medicine, Department of Internal Medicine, Konkuk University School of Medicine, Seoul, 7Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, St Paul’s Hospital, College of Medicine, The Catholic University of Korea, Seoul, 8Regional Center for Respiratory Disease, Yeungnam University Medical Center, Yeungnam University College of Medicine, Daegu, 9Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, Yeouido St Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea Background and objective: According to the Global Initiative for Chronic Obstructive Lung Disease (GOLD guidelines, either a modified Medical Research Council (mMRC dyspnea score of ≥2 or a chronic obstructive pulmonary disease (COPD assessment test (CAT score of ≥10 is considered to represent COPD patients who are

  11. Perceptions of coronary heart disease: the development and psychometric testing of a measurement scale.

    Science.gov (United States)

    Chan, C W

    2014-01-01

    Individuals' perceptions of coronary heart disease (CHD) have implications for the ways in which they respond to the disease, process risks, make decisions, and take action to reduce CHD risks. This study aimed to develop and evaluate the psychometric properties of the perceptions of coronary heart disease scale (PCS) among a Hong Kong Chinese population. A cross-sectional study was conducted using a convenient sample (n = 232) of participants recruited from a variety of catchments including public domains, a cardiac unit, and a cardiac rehabilitation and prevention center. Exploratory factor analysis identified a nine-item, two-factor model that accounted for 52.5% of the total explained variance. The two factors were the perceived risk (five items) and perceived seriousness (four items) of CHD. The PCS demonstrated good content validity; acceptable total, and subscale internal consistency (.73, .61 - .81); and significant contrast-group differences with higher levels of CHD perceptions among males (p = .002), younger participants (p psychometric properties as a short measurement scale, which supports its use in future research. Future validation of this scale is warranted.

  12. Achievement Testing with the Wechsler Quicktest: An Examination of Its Psychometric Properties and Applied Utility with a Greek-Cypriot Sample

    Science.gov (United States)

    Vrachimi-Souroulla, Andry; Panayiotou, Georgia; Kokkinos, Constantinos M.; Lamprianou, Iasonas

    2011-01-01

    The study aimed to field-test a Greek version of the Wechsler Quicktest and to examine its psychometric properties. The Quicktest was individually administered to 208 students, aged 5-14 years, along with a reading test. Based on the Rasch analysis, data for the Quicktest subtests showed acceptable fit to the model. Also, correlations were found…

  13. Testing the psychometric properties of the Ways of Coping Questionnaire (WCQ) in Turkish university students and community samples.

    Science.gov (United States)

    Senol-Durak, Emre; Durak, Mithat; Elagöz, Feride Özlem

    2011-01-01

    Ways of Coping Questionnaire (WCQ) has been extensively used to assess coping styles in the clinical or non-clinical context. The present study investigates the psychometric properties of WCQ in Turkish culture and attempts to get its shorter version by using three separate and independent samples (two groups of university students and one community sample). The constructive study was conducted with a sample of full-time university students (N = 472) and it indicated that the most relevant factor structure was seven-factor model. The factors were planful problem-solving, seeking refuge in supernatural forces, keep to self, seeking social support, seeking refuge in fate, escape/avoidance and accepting responsibility. After that, confirmatory factor analysis was performed to test the adequacy of these factors with two different samples. Study 1 was conducted with a sample of university students (N = 485) and Study 2 was conducted with a sample of community (N = 416). Results demonstrated that seven-factor solution revealed better results in both samples in terms of goodness of fit indexes in confirmatory factor analysis. The results of reliability and validity analysis revealed that psychometric properties of WCQ were acceptable. In addition to getting shorter version of WCQ, the present study enhances cross-cultural information that increases awareness about the coping styles in a non-western culture. Copyright © 2010 John Wiley & Sons, Ltd.

  14. Development and psychometric evaluation of an information literacy self-efficacy survey and an information literacy knowledge test.

    Science.gov (United States)

    Tepe, Rodger; Tepe, Chabha

    2015-03-01

    To develop and psychometrically evaluate an information literacy (IL) self-efficacy survey and an IL knowledge test. In this test-retest reliability study, a 25-item IL self-efficacy survey and a 50-item IL knowledge test were developed and administered to a convenience sample of 53 chiropractic students. Item analyses were performed on all questions. The IL self-efficacy survey demonstrated good reliability (test-retest correlation = 0.81) and good/very good internal consistency (mean κ = .56 and Cronbach's α = .92). A total of 25 questions with the best item analysis characteristics were chosen from the 50-item IL knowledge test, resulting in a 25-item IL knowledge test that demonstrated good reliability (test-retest correlation = 0.87), very good internal consistency (mean κ = .69, KR20 = 0.85), and good item discrimination (mean point-biserial = 0.48). This study resulted in the development of three instruments: a 25-item IL self-efficacy survey, a 50-item IL knowledge test, and a 25-item IL knowledge test. The information literacy self-efficacy survey and the 25-item version of the information literacy knowledge test have shown preliminary evidence of adequate reliability and validity to justify continuing study with these instruments.

  15. Development and psychometric testing of the attitude toward potential pregnancy scale.

    Science.gov (United States)

    Paterno, Mary T; Han, Hae-Ra

    2014-01-01

    To develop and test a comprehensive tool for measuring women's attitudes toward the possibility of becoming pregnant. Cross-sectional mixed methods study. Two obstetric/gynecologic (OB/GYN) clinics and one family planning clinic in Baltimore, Maryland. One-hundred thirty (130) nonpregnant, primarily African American women (84%) age 18 to 29. Participants completed a computer-based survey as part of a larger retrospective mixed-methods study. The Attitude Toward Potential Pregnancy Scale (APPS) was assessed using exploratory factor analysis and hypothesis testing. Cronbach's alpha for internal consistency for the APPS was 0.86. Item-total correlations ranged from 0.56 to 0.75. All items loaded on one factor. Support for construct validity was demonstrated using logistic regression, where the odds of being a highly effective contraceptive user decreased by 8% with each one-point increase in score on the APPS (odds ratio = 0.92; confidence interval [0.87, 0.98]). This study provides support for reliability and validity of the APPS. The APPS may be a useful tool for understanding pregnancy attitude in future studies and in clinical practice. Further research is needed to assess the usefulness of the scale with other groups of women, its utility in the clinical practice setting, and its potential predictive validity for unintended pregnancy. © 2014 AWHONN, the Association of Women's Health, Obstetric and Neonatal Nurses.

  16. Psychometric properties of Brief-Balance Evaluation Systems Test (Brief-BESTest) in evaluating balance performance in individuals with chronic stroke.

    Science.gov (United States)

    Huang, Meizhen; Pang, Marco Y C

    2017-03-01

    To examine the psychometric properties of the Brief-Balance Evaluation Systems Test (Brief-BESTest) in individuals with chronic stroke. This was an observational study with repeated measurements involving 50 participants with chronic stroke [mean (SD) age: 59.2 (7.3) years]. Each participant with stroke was evaluated with the Brief-BESTest, Berg balance scale (BBS), Postural Assessment Scale for Stroke Patients (PASS), Fugl-Meyer Motor Assessment (FMA), Chedoke-McMaster Stroke Assessment (CMSA), Montreal Cognitive Assessment (MoCA), and Geriatric Depression Scale (GDS). Two raters (rater 1 and 2) provided the Brief-BESTest scores of the first 27 participants independently to establish inter-rater reliability. After 15 min of rest, the same 27 participants were evaluated with the Brief-BESTest again by rater 1 to establish intra-rater reliability. The Brief-BESTest scores of the stroke group were also compared with those of the control group [n = 27, mean (SD) age: 56.7 (7.7) years]. The Brief-BESTest had no substantial floor and ceiling effects, good intra-rater (ICC 2,1 = 0.974) and inter-rater (ICC 2,1 = 0.980) reliability and internal consistency (Cronbach's alpha = 0.818). The minimal detectable change at 95% confidence level was 2 points. The Brief-BESTest showed moderate to very strong correlations with other balance (BBS and PASS) and motor impairment (FMA, CMSA) measures (rs = .547-.911, p Brief-BESTest was effective in distinguishing participants with stroke from controls (cutoff score: Brief-BESTest has good reliability and validity in assessing balance function in individuals with chronic stroke.

  17. Psychometric properties of teacher-made science tests used in national examinations for middle-grade students in Benin (West Africa): A longitudinal study

    Science.gov (United States)

    Gado, Issaou

    The purpose of this study was to determine the psychometric properties (item difficulty, item discrimination, internal consistency reliability, content validity and construct validity) of teacher-made science tests used in national examinations for Middle Grade students in Benin (West Africa) for three consecutive years. The study also described the assessment methods used in science classrooms. Research data were collected from two sources: first, a survey questionnaire administered to 250 secondary school teachers purposively selected; second, a total of 630 graded physical science copies for three consecutive years of national examinations randomly selected from the Service of Examination and Testing data sources. Descriptive statistics were used to analyze the survey questionnaire. Factor analysis was used to explore construct validity of the measurements. Classical test theory methods were used to explore item difficulty, item discrimination, and reliability of examination scores. Generalizability theory provided estimate of variance components and proportions of total variance accounted for by sources of error related to persons, items, and person-by-item interaction. The result of this study shows that teacher-made tests used in large scale high-stakes examination for three consecutive years are highly reliable and have a satisfactory level of difficulty and discrimination. However, even though the items of teacher-made tests are associated with the objectives of the national curriculum standards, the proportion of objectives tested in national examinations and the number of items across three consecutive years show a non uniform and inconsistent distribution of items across years, content domains, and within the fields of chemistry and physics. Therefore, teacher-made tests used in national examinations for three consecutive years lack content validity. Discussion of the results and suggestions about constructing exam items with high validity are provided.

  18. Improving Test Score Reporting: Perspectives from the ETS Score Reporting Conference. Research Report. ETS RR-11-45

    Science.gov (United States)

    Zapata-Rivera, Diego, Ed.; Zwick, Rebecca, Ed.

    2011-01-01

    This volume includes 3 papers based on presentations at a workshop on communicating assessment information to particular audiences, held at Educational Testing Service (ETS) on November 4th, 2010, to explore some issues that influence score reports and new advances that contribute to the effectiveness of these reports. Jessica Hullman, Rebecca…

  19. Effect of Examinee Certainty on Probabilistic Test Scores and a Comparison of Scoring Methods for Probabilistic Responses.

    Science.gov (United States)

    1983-07-01

    equal to the maximum value for this index is due to the dependence of this index upon the magnitude and sign of the factor loadings. Gorsuch (1974, p...Measurement, 1972, 9, 205-207. Gorsuch , R. L. Factor analysis. Philadelphia: W. B. Saunders Company, 1974. Guilford, J. P. A simple scoring weight for test

  20. Effect of Self-Assessment on Test Scores: Student Perceptions

    Science.gov (United States)

    Ramirez, Beatriz U.

    2010-01-01

    After a sudden increase in most of the individual grades in a multiple-choice test, students were asked to rank the three most relevant factors responsible for this outcome. Among eight others, the availability of a test for self-assessment before the final test was by far the most frequently mentioned (82.4% of the students). Questions applied…

  1. The Internet Addiction Test: assessing its psychometric properties in Bangladeshi culture.

    Science.gov (United States)

    Rezaul Karim, A K M; Nigar, Naima

    2014-08-01

    There is growing importance of the Internet Addiction Test (IAT) in Internet addiction research around the world. Since the development of the IAT (Young, 1996, 1998), a number of validation studies have been done in various cultures. The aim of this study was to translate the instrument into Bangla and validate in Bangladeshi culture, a culture vulnerable to Internet addiction. A total of 177 Internet users (77 females and 100 males) participated in the study. Exploratory factor analysis (EFA) of the data from 172 participants (who provided complete responses) identified a four factor structure of the IAT with 18 items. The four factors namely 'Neglect of duty', 'Online dependence', 'Virtual fantasies', and 'Privacy and self-defense' together explained 55.68% of the total variance. Problematic (moderate/excessive) users on the IAT scored, on average, higher on each of the four IAT factors as compared to average or non-problematic (minimal) users consistently across genders. The IAT and its factors showed good internal consistency (Cronbach's α=.89 for the IAT, and .60-.84 for the factors), strong convergent and discriminant validity. Thus, the Bangla version IAT appears to be valid and reliable and therefore may be used in further research on Internet addiction in the country.

  2. Neuropsychological test scores, academic performance, and developmental disorders in Spanish-speaking children.

    Science.gov (United States)

    Rosselli, M; Ardila, A; Bateman, J R; Guzmán, M

    2001-01-01

    Limited information is currently available about performance of Spanish-speaking children on different neuropsychological tests. This study was designed to (a) analyze the effects of age and sex on different neuropsychological test scores of a randomly selected sample of Spanish-speaking children, (b) analyze the value of neuropsychological test scores for predicting school performance, and (c) describe the neuropsychological profile of Spanish-speaking children with learning disabilities (LD). Two hundred ninety (141 boys, 149 girls) 6- to 11-year-old children were selected from a school in Bogotá, Colombia. Three age groups were distinguished: 6- to 7-, 8- to 9-, and 10- to 11-year-olds. Performance was measured utilizing the following neuropsychological tests: Seashore Rhythm Test, Finger Tapping Test (FTT), Grooved Pegboard Test, Children's Category Test (CCT), California Verbal Learning Test-Children's Version (CVLT-C), Benton Visual Retention Test (BVRT), and Bateria Woodcock Psicoeducativa en Español (Woodcock, 1982). Normative scores were calculated. Age effect was significant for most of the test scores. A significant sex effect was observed for 3 test scores. Intercorrelations were performed between neuropsychological test scores and academic areas (science, mathematics, Spanish, social studies, and music). In a post hoc analysis, children presenting very low scores on the reading, writing, and arithmetic achievement scales of the Woodcock battery were identified in the sample, and their neuropsychological test scores were compared with a matched normal group. Finally, a comparison was made between Colombian and American norms.

  3. Development and psychometric evaluation of an information literacy self-efficacy survey and an information literacy knowledge test*

    Science.gov (United States)

    Tepe, Rodger; Tepe, Chabha

    2015-01-01

    Objective To develop and psychometrically evaluate an information literacy (IL) self-efficacy survey and an IL knowledge test. Methods In this test–retest reliability study, a 25-item IL self-efficacy survey and a 50-item IL knowledge test were developed and administered to a convenience sample of 53 chiropractic students. Item analyses were performed on all questions. Results The IL self-efficacy survey demonstrated good reliability (test–retest correlation = 0.81) and good/very good internal consistency (mean κ = .56 and Cronbach's α = .92). A total of 25 questions with the best item analysis characteristics were chosen from the 50-item IL knowledge test, resulting in a 25-item IL knowledge test that demonstrated good reliability (test–retest correlation = 0.87), very good internal consistency (mean κ = .69, KR20 = 0.85), and good item discrimination (mean point-biserial = 0.48). Conclusions This study resulted in the development of three instruments: a 25-item IL self-efficacy survey, a 50-item IL knowledge test, and a 25-item IL knowledge test. The information literacy self-efficacy survey and the 25-item version of the information literacy knowledge test have shown preliminary evidence of adequate reliability and validity to justify continuing study with these instruments. PMID:25517736

  4. AMOVA ["Accumulative Manifold Validation Analysis"]: An Advanced Statistical Methodology Designed to Measure and Test the Validity, Reliability, and Overall Efficacy of Inquiry-Based Psychometric Instruments

    Science.gov (United States)

    Osler, James Edward, II

    2015-01-01

    This monograph provides an epistemological rational for the Accumulative Manifold Validation Analysis [also referred by the acronym "AMOVA"] statistical methodology designed to test psychometric instruments. This form of inquiry is a form of mathematical optimization in the discipline of linear stochastic modelling. AMOVA is an in-depth…

  5. Governing by Testing: Circulation, Psychometric Knowledge, Experts and the "Alliance for Progress" in Latin America during the 1960s and 1970s

    Science.gov (United States)

    Alarcón, Cristina

    2015-01-01

    This paper analyzes the activities, members, and effects of an inter-American expert network for the diffusion of psychometric knowledge, specifically of standardized aptitude testing for university admission in Latin America during the 1960s and 1970s. Within the framework of educational transfer studies, the role of international,…

  6. Psychometric Features of the General Aptitude Test-Verbal Part (GAT-V): A Large-Scale Assessment of High School Graduates in Saudi Arabia

    Science.gov (United States)

    Dimitrov, Dimiter M.; Shamrani, Abdul Rahman

    2015-01-01

    This study examines the psychometric features of a General Aptitude Test-Verbal Part, which is used with assessments of high school graduates in Saudi Arabia. The data supported a bifactor model, with one general factor and three content domains (Analogy, Sentence Completion, and Reading Comprehension) as latent aspects of verbal aptitude.

  7. Governing by Testing: Circulation, Psychometric Knowledge, Experts and the "Alliance for Progress" in Latin America during the 1960s and 1970s

    Science.gov (United States)

    Alarcón, Cristina

    2015-01-01

    This paper analyzes the activities, members, and effects of an inter-American expert network for the diffusion of psychometric knowledge, specifically of standardized aptitude testing for university admission in Latin America during the 1960s and 1970s. Within the framework of educational transfer studies, the role of international,…

  8. Psychometric Features of the General Aptitude Test-Verbal Part (GAT-V): A Large-Scale Assessment of High School Graduates in Saudi Arabia

    Science.gov (United States)

    Dimitrov, Dimiter M.; Shamrani, Abdul Rahman

    2015-01-01

    This study examines the psychometric features of a General Aptitude Test-Verbal Part, which is used with assessments of high school graduates in Saudi Arabia. The data supported a bifactor model, with one general factor and three content domains (Analogy, Sentence Completion, and Reading Comprehension) as latent aspects of verbal aptitude.

  9. [Cross-cultural adaptation and psychometric tests of a self-efficacy scale and an adherence scale for French adolescents with Type 1 diabetes

    Science.gov (United States)

    Colson, Sébastien; Coté, José; Collombier, Madeleine; Debout, Christophe; Bonnel, Galadriel; Reynaud, Rachel; Lagouanelle-Simeoni, Marie-Claude

    2016-12-01

    Introduction : many structured educational programs, using the concept of self-efficacy, have been studied in English-speaking countries. Background : tools were developed in English to assess this concept along with treatment adherence. However, there seems to be no French version of these tools in scientific literature. Aim : to adapt the tools to the French language and to test the psychometric properties of the Self-Efficacy for Diabetes Self-Management (SEDM) and the Diabetes Self-Management Profile (DSMP). Methods : a cross-cultural adaptation of the SEDM and DSMP in French was performed. The psychometric properties were tested in a pilot study that took place between January 1st and December 31st, 2015. Results : Cronbach’s alpha coefficient of SEDM in French was 0.84, test-retest reliability 0.80 and sensitivity to change was moderate. The Cronbach’s alpha and sensitivity to change of the French DSMP were low, and the test-retest was 0.71. Discussion and conclusions : the first results of the psychometric properties of French SEDM were rather encouraging. The use of the French version of DSMP seems compromised in terms of psychometric properties and the opinion of the participants.

  10. Attitudes toward Science: Measurement and Psychometric Properties of the Test of Science-Related Attitudes for Its Use in Spanish-Speaking Classrooms

    Science.gov (United States)

    Navarro, Marianela; Förster, Carla; González, Caterina; González-Pose, Paulina

    2016-01-01

    Understanding attitudes toward science and measuring them remain two major challenges for science teaching. This article reviews the concept of attitudes toward science and their measurement. It subsequently analyzes the psychometric properties of the "Test of Science-Related Attitudes" (TOSRA), such as its construct validity, its…

  11. Construct Validity and Test Re-Test Reliability of the Forgotten Joint Score.

    Science.gov (United States)

    Thompson, Simon M; Salmon, Lucy J; Webb, Justin M; Pinczewski, Leo A; Roe, Justin P

    2015-11-01

    Consecutive patients undergoing knee arthroplasty completed questionnaires: FJS, Knee Injury and Osteoarthritis Outcome Score (KOOS) and WOMAC Score (mean 39 months after surgery), and were mailed a repeat questionnaire after 4 to 6 weeks. The test-retest reliability was almost perfect for the FJS (ICC = 0.97), and the FJS subdomains (ICC > 0.8). Convergent construct validity of the FJS was correlated with the KOOS Subscores of Quality of Life (0.63, P = 0.001), Symptom (0.33, P = 0.001), Pain (0.68, P = 0.001) and ADL (0.66, P = 0.001) and the Total WOMAC (0.70, P = 0.001). The FJS demonstrates high test-retest reliability and construct validity compared to the Normalised WOMAC and KOOS Subscales. The FJS does not demonstrate the ceiling effect of the WOMAC or KOOS pain scores so may have greater discriminatory ability following TKR. Copyright © 2015 Elsevier Inc. All rights reserved.

  12. Further Validation of the Qualitative Scoring System for the Modified Bender-Gestalt Test.

    Science.gov (United States)

    Brannigan, Gary G.; And Others

    1995-01-01

    Compares the Qualitative Scoring System and the Developmental Scoring Systems, both Bender-Gestalt tests, in predicting achievement on the Metropolitan Achievement Test (MAT). In this study, first through fourth graders (n=409) from regular elementary schools were subjected to both tests; both systems correlated significantly with school…

  13. Are Score Comparisons across Language Proficiency Test Batteries Justified?: An IELTS-TOEFL Comparability Study.

    Science.gov (United States)

    Geranpayeh, Ardeshir

    1994-01-01

    This paper reports on a study conducted to determine if comparisons between scores on the Test of English as a Foreign Language (TOEFL) and the International English Language Testing Service (IELTS) are justifiable. The test scores of 216 Iranian graduate students who took the TOEFL and IELTS, as well as the Iranian Ministry of Culture and Higher…

  14. Comparing Graphical and Verbal Representations of Measurement Error in Test Score Reports

    Science.gov (United States)

    Zwick, Rebecca; Zapata-Rivera, Diego; Hegarty, Mary

    2014-01-01

    Research has shown that many educators do not understand the terminology or displays used in test score reports and that measurement error is a particularly challenging concept. We investigated graphical and verbal methods of representing measurement error associated with individual student scores. We created four alternative score reports, each…

  15. Detection of aberrant item score patterns in computerized adaptive testing : An empirical example using the CUSUM

    NARCIS (Netherlands)

    Egberink, Iris J. L.; Meijer, Rob R.; Veldkamp, Bernard P.; Schakel, Lolle; Smid, Nico G.

    2010-01-01

    The scalability of individual trait scores on a computerized adaptive test (CAT) was assessed through investigating the consistency of individual item score patterns. A sample of N = 428 persons completed a personality CAT as part of a career development procedure. To detect inconsistent item score

  16. Comparing Graphical and Verbal Representations of Measurement Error in Test Score Reports

    Science.gov (United States)

    Zwick, Rebecca; Zapata-Rivera, Diego; Hegarty, Mary

    2014-01-01

    Research has shown that many educators do not understand the terminology or displays used in test score reports and that measurement error is a particularly challenging concept. We investigated graphical and verbal methods of representing measurement error associated with individual student scores. We created four alternative score reports, each…

  17. PSYCHOMETRIC PROPERTIES OF ALCOHOL SCREENING TESTS IN THE EMERGENCY DEPARTMENT IN ARGENTINA, MEXICO AND THE UNITED STATES

    Science.gov (United States)

    Ledesma, Rubén Daniel; Cherpitel, Cheryl J.; Borges, Guilherme

    2010-01-01

    The objective of this article is to report psychometric characteristics of the AUDIT, CAGE, RAPS4, and TWEAK and to compare them across three countries: Argentina, Mexico, and the United States which used a similar protocol and methodology. Probability samples of patients 18 years and older were drawn from emergency departments in Mar del Plata, Argentina (n=780), Pachuca, Mexico (n=1624) and Santa Clara, U.S. (n=1220). Concurrent validity was assessed by comparing their performance against a diagnosis of alcohol dependence (DSM-IV) obtained through the Composite International Diagnostic Interview, and for the briefer measures, also by their correlation with the AUDIT. The internal consistency of the CAGE, RAPS4, and TWEAK scores was estimated by the KR-20 formula and by Cronbach’s Alpha for the AUDIT. Corrected item-total correlation and D-values were used as item discrimination measures. In Argentina and Mexico the AUDIT and the RAPS4 showed the highest validity. Reliability of all instruments was higher in the US than in Argentina or Mexico. In all three countries, reliability of the TWEAK was lowest, while the AUDIT was highest. With a few exceptions, all items showed good discrimination powers. PMID:20472341

  18. Effects of Test Media on Different EFL Test-Takers in Writing Scores and in the Cognitive Writing Process

    Science.gov (United States)

    Zou, Xiao-Ling; Chen, Yan-Min

    2016-01-01

    The effects of computer and paper test media on EFL test-takers with different computer familiarity in writing scores and in the cognitive writing process have been comprehensively explored from the learners' aspect as well as on the basis of related theories and practice. The results indicate significant differences in test scores among the…

  19. Prediction of WAIS Scores from Group Ability Tests

    Science.gov (United States)

    Watson, Charles G.; Klett, William G.

    1973-01-01

    In a search for an adequate but efficient substitute, the authors have instituted three evaluations of the relationships between potential WAIS-substitutes and the WAIS itself. The present report describes the first of these researches-- a study of the relationships between the four group ability tests and the WAIS in a mental hospital setting.…

  20. Maintaining Equivalent Cut Scores for Small Sample Test Forms

    Science.gov (United States)

    Dwyer, Andrew C.

    2016-01-01

    This study examines the effectiveness of three approaches for maintaining equivalent performance standards across test forms with small samples: (1) common-item equating, (2) resetting the standard, and (3) rescaling the standard. Rescaling the standard (i.e., applying common-item equating methodology to standard setting ratings to account for…

  1. Generation of GHS Scores from TEST and online sources

    Science.gov (United States)

    Alternatives assessment frameworks such as DfE (Design for the Environment) evaluate chemical alternatives in terms of human health effects, ecotoxicity, and fate. T.E.S.T. (Toxicity Estimation Software Tool) can be utilized to evaluate human health in terms of acute oral rat tox...

  2. Study protocol of psychometric properties of the Spanish translation of a competence test in evidence based practice: The Fresno test

    Directory of Open Access Journals (Sweden)

    Villafafila-Ferrero Rosa

    2009-02-01

    Full Text Available Abstract Background There are few high-quality instruments for evaluating the effectiveness of Evidence-Based Practice (EBP curricula with objective outcomes measures. The Fresno test is an instrument that evaluates most of EBP steps with a high reliability and validity in the English original version. The present study has the aims to translate the Fresno questionnaire into Spanish and its subsequent validation to ensure the equivalence of the Spanish version against the English original. Methods and design The questionnaire will be translated with the back translation technique and tested in Primary Care Teaching Units in Catalonia (PCTU. Participants will be: (a tutors of Family Medicine residents (expert group; (b Family Medicine residents in their second year of the Family Medicine training program (novice group, and (c Family Medicine physicians (intermediate group. The questionnaire will be administered before and after an educational intervention. The educational intervention will be an interactive four half-day sessions designed to develop the knowledge and skills required to EBP. Responsiveness statistics used in the analysis will be the effect size, the standardised response mean and Guyatt's method. For internal consistency reliability, two measures will be used: corrected item-total correlations and Cronbach's alpha. Inter-rater reliability will be tested using Kappa coefficient for qualitative items and intra-class correlation coefficient for quantitative items and the overall score. Construct validity, item difficulty, item discrimination and feasibility will be determined. Discussion The validation of the Fresno questionnaire into different languages will enable the expansion of the questionnaire, as well as allowing comparison between countries and the evaluation of different teaching models.

  3. The Mother’s Autonomy in Decision Making (MADM) scale: Patient-led development and psychometric testing of a new instrument to evaluate experience of maternity care

    Science.gov (United States)

    Vedam, Saraswathi; Stoll, Kathrin; Martin, Kelsey; Rubashkin, Nicholas; Partridge, Sarah; Thordarson, Dana; Jolicoeur, Ganga

    2017-01-01

    Shared decision making (SDM) is core to person-centered care and is associated with improved health outcomes. Despite this, there are no validated scales measuring women’s agency and ability to lead decision making during maternity care. Objective To develop and validate a new instrument that assesses women’s autonomy and role in decision making during maternity care. Design Through a community-based participatory research process, service users designed, content validated, and administered a cross-sectional quantitative survey, including 31 items on the experience of decision-making. Setting and participants Pregnancy experiences (n = 2514) were reported by 1672 women who saw a single type of primary maternity care provider in British Columbia. They described care by a midwife, family physician or obstetrician during 1, 2 or 3 maternity care cycles. We conducted psychometric testing in three separate samples. Main outcome measures We assessed reliability, item-to-total correlations, and the factor structure of the The Mothers’ Autonomy in Decision Making (MADM) scale. We report MADM scores by care provider type, length of prenatal appointments, preferences for role in decision-making, and satisfaction with experience of decision-making. Results The MADM scale measures a single construct: autonomy in decision-making during maternity care. Cronbach alphas for the scale exceeded 0.90 for all samples and all provider groups. All item-to-total correlations were replicable across three samples and exceeded 0.7. Eigenvalue and scree plots exhibited a clear 90-degree angle, and factor analysis generated a one factor scale. MADM median scores were highest among women who were cared for by midwives, and 10 or more points lower for those who saw physicians. Increased time for prenatal appointments was associated with higher scale scores, and there were significant differences between providers with respect to average time spent in prenatal appointments. Midwifery care

  4. Freudenfreude and Schadenfreude Test (FAST) scores of depressed and non-depressed undergraduates.

    Science.gov (United States)

    Chambliss, Catherine; Cattai, Ashley; Benton, Peter; Elghawy, Ahmed; Fan, Madde; Thompson, Kayleigh; Scavicchio, Daniel; Tanenbaum, Joshua

    2012-08-01

    The Freudenfreude and Schadenfreude Test (FAST) had moderate test-retest reliability in an undergraduate sample. Freudenfreude scores were lower and Schadenfreude scores were higher among mildly depressed than nondepressed students. Distinctive reactions to personal success and failure were associated with depression. Responses to others' success and failure may also be related to depression.

  5. Using Raters from India to Score a Large-Scale Speaking Test

    Science.gov (United States)

    Xi, Xiaoming; Mollaun, Pam

    2011-01-01

    We investigated the scoring of the Speaking section of the Test of English as a Foreign Language[TM] Internet-based (TOEFL iBT[R]) test by speakers of English and one or more Indian languages. We explored the extent to which raters from India, after being trained and certified, were able to score the TOEFL examinees with mixed first languages…

  6. Comparison of Two Scoring Systems for the Modified Version of the Bender-Gestalt Test.

    Science.gov (United States)

    Schachter, Steven; And Others

    1991-01-01

    Examined relative utility of two scoring systems for Modified Version of Bender-Gestalt Test in predicting performance on Developmental Test of Visual-Motor Integration. Findings from 53 kindergarten and 47 first grade students indicated that Qualitative Scoring System was significantly better predictor of visual-motor integration skills than…

  7. Many Children Left Behind? Textbooks and Test Scores in Kenya. NBER Working Paper No. 13300

    Science.gov (United States)

    Glewwe, Paul; Kremer, Michael; Moulin, Sylvie

    2007-01-01

    A randomized evaluation suggests that a program which provided official textbooks to randomly selected rural Kenyan primary schools did not increase test scores for the average student. In contrast, the previous literature suggests that textbook provision has a large impact on test scores. Disaggregating the results by students' initial academic…

  8. Beyond Correlations: Usefulness of High School GPA and Test Scores in Making College Admissions Decisions

    Science.gov (United States)

    Sawyer, Richard

    2013-01-01

    Correlational evidence suggests that high school GPA is better than admission test scores in predicting first-year college GPA, although test scores have incremental predictive validity. The usefulness of a selection variable in making admission decisions depends in part on its predictive validity, but also on institutions' selectivity and…

  9. Using Raters from India to Score a Large-Scale Speaking Test

    Science.gov (United States)

    Xi, Xiaoming; Mollaun, Pam

    2011-01-01

    We investigated the scoring of the Speaking section of the Test of English as a Foreign Language[TM] Internet-based (TOEFL iBT[R]) test by speakers of English and one or more Indian languages. We explored the extent to which raters from India, after being trained and certified, were able to score the TOEFL examinees with mixed first languages…

  10. The Effects of Developmental Placement and Early Retention on Children's Later Scores on Standardized Tests.

    Science.gov (United States)

    May, Deborah C.; Welch, Edward L.

    1984-01-01

    Examined the relationship between early school retention as a result of preschool and kindergarten developmental testing and children's later academic achievement (N=223). Results showed children who scored as immature on the Gesell Screening Test and who were retained a year had the lowest scores on all measures. (JAC)

  11. Comparing the Effects of Elementary Music and Visual Arts Lessons on Standardized Mathematics Test Scores

    Science.gov (United States)

    King, Molly Elizabeth

    2016-01-01

    The purpose of this quantitative, causal-comparative study was to compare the effect elementary music and visual arts lessons had on third through sixth grade standardized mathematics test scores. Inferential statistics were used to compare the differences between test scores of students who took in-school, elementary, music instruction during the…

  12. Life Stress and Reading Comprehension Test Scores in the Middle School Student.

    Science.gov (United States)

    Jones, Maryann Clementi

    A study determined the relationship between life stress and reading comprehension test scores on the IOWA Tests of Basic Skills. Subjects, 41 middle-school students attending Lincoln School in Garwood, New Jersey, were surveyed as to the amount of life stress prevalent in their lives. In addition, the Iowa scores for reading comprehension were…

  13. Scoring Yes-No Vocabulary Tests: Reaction Time vs. Nonword Approaches

    Science.gov (United States)

    Pellicer-Sanchez, Ana; Schmitt, Norbert

    2012-01-01

    Despite a number of research studies investigating the Yes-No vocabulary test format, one main question remains unanswered: What is the best scoring procedure to adjust for testee overestimation of vocabulary knowledge? Different scoring methodologies have been proposed based on the inclusion and selection of nonwords in the test. However, there…

  14. Comparing the Effects of Elementary Music and Visual Arts Lessons on Standardized Mathematics Test Scores

    Science.gov (United States)

    King, Molly Elizabeth

    2016-01-01

    The purpose of this quantitative, causal-comparative study was to compare the effect elementary music and visual arts lessons had on third through sixth grade standardized mathematics test scores. Inferential statistics were used to compare the differences between test scores of students who took in-school, elementary, music instruction during the…

  15. Correcting for Test Score Measurement Error in ANCOVA Models for Estimating Treatment Effects

    Science.gov (United States)

    Lockwood, J. R.; McCaffrey, Daniel F.

    2014-01-01

    A common strategy for estimating treatment effects in observational studies using individual student-level data is analysis of covariance (ANCOVA) or hierarchical variants of it, in which outcomes (often standardized test scores) are regressed on pretreatment test scores, other student characteristics, and treatment group indicators. Measurement…

  16. Health status assessment tool for the family member caregiver of patients with bipolar disorder: development and psychometric testing.

    Science.gov (United States)

    Shamsaei, Farshid; Kermanshahi, Sima Mohamad Khan; Vanaki, Zohreh; Hajizadeh, Ebrahim; Holtforth, Martin Grosse; Cheragi, Fatemeh

    2013-06-01

    The caregivers' health assessment requires the application of a valid instrument that provided based on their experiences about the health concept. The purpose of this study was to develop and test the psychometric properties of the health assessment tool for family member caregivers' of patients with bipolar disorder. This study utilized mixed research. The specific validation processes used were: content and face validity, construct validity using factor analysis, reliability and internal consistency using test-retest reliability and Cronbach's alpha correlation coefficient. The exploratory factor analysis revealed ten factors: safe life with peace, maintaining physical health, painful emotions, psychological tolerance, maintaining physical-psychological potency, families and relatives support, health care system support, moral-financial support, maintaining social relationships, and worry over the label. The internal consistency (Cronbach's alpha) was 95. Test-retest reliability of the questionnaire with interval time of two weeks was 0.93 (p Family caregivers' health assessment questionnaire with 75 items helped to determine family caregivers' health in different settings such as clinical settings, homes and research environments by health care providers. Copyright © 2013 Elsevier B.V. All rights reserved.

  17. An Item Analysis and Validity Investigation of Bender Visual Motor Gestalt Test Score Items

    Science.gov (United States)

    Lambert, Nadine M.

    1971-01-01

    This investigation attempted to demonstrate the utility of standard item analysis procedures for selecting the most reliable and valid items for scoring Bender Visual Motor Gestalt Test test records. (Author)

  18. Are Increasing Test Scores in Texas Really a Myth?

    Directory of Open Access Journals (Sweden)

    A. Toenjes

    2002-03-01

    Full Text Available Pass rates by Texas tenth-graders on the high school exit exam improved from 52 percent in 1994 to 72 percent in 1998. In his article "The Myth of the Texas Miracle in Education" (EPAA, August 2000 Professor Walt Haney argued that some part of this increased pass rate was, as he put it, an illusion. Haney contended that the combined effects of students dropping out of school prior to taking the 10th grade TAAS and special education exemptions accounted for much of the increase in TAAS pass rates. Relying on the same methodology and data that Haney used, we demonstrate that his conclusion is incorrect. None of the 20 percent improvement in the TAAS exit test pass rate between 1994 and 1998 is explained by combined increases in dropout rates or special education exemptions.

  19. The non-credible score of the Rey Auditory Verbal Learning Test: is it better at predicting non-credible neuropsychological test performance than the RAVLT recognition score?

    Science.gov (United States)

    Whitney, Kriscinda A; Davis, Jeremy J

    2015-03-01

    The ability of both the non-credible score of the Rey Auditory Verbal Learning Test (RAVLT NC) and the recognition score of the RAVLT (RAVLT Recog) to predict credible versus non-credible neuropsychological test performance was examined. Credible versus non-credible group membership was determined according to diagnostic criteria with consideration of performance on two stand-alone performance validity tests. Findings from this retrospective data analysis of outpatients seen for neuropsychological testing within a Veterans Affairs Medical Center (N = 175) showed that RAVLT Recog demonstrated better classification accuracy than RAVLT NC in predicting credible versus non-credible neuropsychological test performance. Specifically, an RAVLT Recog cutoff of ≤9 resulted in reasonable sensitivity (48%) and acceptable specificity (91%) in predicting non-credible neuropsychological test performance. Implications for clinical practice are discussed. Note: The views contained here within are those of the authors and not representative of the institutions with which they are associated.

  20. The Professional Nurse Self-Assessment Scale: Psychometric testing in Norwegian long term and home care contexts.

    Science.gov (United States)

    Finnbakk, Elisabeth; Wangensteen, Sigrid; Skovdahl, Kirsti; Fagerström, Lisbeth

    2015-01-01

    Nurses' clinical competence is vital to ensure safe and high quality care, and the continuous assessment of nurses' clinical competence is of major concern. A validated instrument for the self-assessment of nurses' clinical competence at different educational levels across specialties and countries is lacking. The aim of this study was to test the reliability and construct validity of the new Professional Nurse Self-Assessment Scale (ProffNurse SAS) questionnaire in long term and home care contexts in Norway. The questionnaire is based on the Nordic Advanced Practice Nursing model, in which the nurse-patient relationship is central. The study has a cross-sectional survey design. A purposive sample of 357 registered nurses who worked in long term and home care contexts in two geographical regions encompassing eight municipalities and three counties was included. The respondents completed the 74-item ProffNurse SAS questionnaire and demographic background data was collected. Data collection was conducted in two phases: first region autumn 2011 and second region spring 2012. Exploratory factor analyses (EFA) were used to test the psychometric properties of the questionnaire and included the following steps: assessment of the factorality of the data, factor extraction by Principal Component Analysis (PCA), oblimin (oblique) factor rotation, and interpretation. Cronbach's alpha was used to estimate the internal consistency. The PCA revealed a six-component structure, reducing the number of items in the questionnaire from 74 to 51. Based on the content of the highest-loading items, the six components were named: Direct Clinical Practice, Professional Development, Ethical Decision-Making, Clinical Leadership, Cooperation and Consultation, and Critical Thinking. The Cronbach's alpha values ranged from 0.940 (highest; Direct Clinical Practice) to 0.737 (lowest; Critical Thinking), leading to the estimation that the ProffNurse SAS is reliable. The six components support the

  1. The Relationship between Scores on the Graduate Management Admission Test and the Test of English as a Foreign Language.

    Science.gov (United States)

    Powers, Donald E.

    In addition to scores on the Graduate Management Admission Test (GMAT), which are required of applicants to a substantial number of graduate management schools, foreign candidates may also be required to submit scores on the Test of English as a Foreign Language (TOEFL) as an indication of English language proficiency. The present study provides…

  2. The Relationship between Scores on the Graduate Management Admission Test and the Test of English as a Foreign Language.

    Science.gov (United States)

    Powers, Donald E.

    In addition to scores on the Graduate Management Admission Test (GMAT), which are required of applicants to a substantial number of graduate management schools, foreign candidates may also be required to submit scores on the Test of English as a Foreign Language (TOEFL) as an indication of English language proficiency. The present study provides…

  3. Personnel Test Battery and Scoring Procedures. Memorandum No. L.S. 15.

    Science.gov (United States)

    Berson, Barry L.

    The purpose of this memo is to present tests that comprise the test battery used to select Navy personnel to train marine mammals, and to describe the scoring procedures of the tests. The test battery consists of: Biosystems General Information Test (BGIT), Personnel History Questionnaire (PHQ), Gordon Personal Inventory, Gordon Personal Profile,…

  4. Methods for Examining the Psychometric Quality of Subscores: A Review and Application

    Science.gov (United States)

    Wedman, Jonathan; Lyrén, Per-Erik

    2015-01-01

    When subscores on a test are reported to the test taker, the appropriateness of reporting them depends on whether they provide useful information above what is provided by the total score. Subscores that fail to do so lack adequate psychometric quality and should not be reported. There are several methods for examining the quality of subscores,…

  5. Effects of Targeted Test Preparation on Scores of Two Tests of Oral English as a Second Language

    Science.gov (United States)

    Farnsworth, Tim

    2013-01-01

    This study investigated the effect of targeted test preparation, or coaching, on oral English as a second language test scores. The tests in question were the Basic English Skills Test Plus (BEST Plus), a scripted oral interview published by the Center for Applied Linguistics, and the Versant English Test (VET), a computer-administered and…

  6. Cross-Cultural Adaptation and Psychometric Testing of the Brazilian Version of the Self-Care of Heart Failure Index Version 6.2

    Science.gov (United States)

    Ávila, Christiane Wahast; Riegel, Barbara; Pokorski, Simoni Chiarelli; Camey, Suzi; Silveira, Luana Claudia Jacoby; Rabelo-Silva, Eneida Rejane

    2013-01-01

    Objective. To adapt and evaluate the psychometric properties of the Brazilian version of the SCHFI v 6.2. Methods. With the approval of the original author, we conducted a complete cross-cultural adaptation of the instrument (translation, synthesis, back translation, synthesis of back translation, expert committee review, and pretesting). The adapted version was named Brazilian version of the self-care of heart failure index v 6.2. The psychometric properties assessed were face validity and content validity (by expert committee review), construct validity (convergent validity and confirmatory factor analysis), and reliability. Results. Face validity and content validity were indicative of semantic, idiomatic, experimental, and conceptual equivalence. Convergent validity was demonstrated by a significant though moderate correlation (r = −0.51) on comparison with equivalent question scores of the previously validated Brazilian European heart failure self-care behavior scale. Confirmatory factor analysis supported the original three-factor model as having the best fit, although similar results were obtained for inadequate fit indices. The reliability of the instrument, as expressed by Cronbach's alpha, was 0.40, 0.82, and 0.93 for the self-care maintenance, self-care management, and self-care confidence scales, respectively. Conclusion. The SCHFI v 6.2 was successfully adapted for use in Brazil. Nevertheless, further studies should be carried out to improve its psychometric properties. PMID:24163765

  7. Wisconsin card sorting test: a new global score, with Italian norms, and its relationship with the Weigl sorting test.

    Science.gov (United States)

    Laiacona, M; Inzaghi, M G; De Tanti, A; Capitani, E

    2000-10-01

    The Wisconsin card sorting test and the Weigl test are two neuropsychological tools widely used in clinical practice to assess frontal lobe functions. In this study we present norms useful for Italian subjects aged from 15 to 85 years, with 5-17 years of education. Concerning the Wisconsin card sorting test, a new measure of global efficiency (global score) is proposed as well as norms for some well known qualitative aspects of the performance, i.e. perseverative responses, failure to maintain the set and non-perseverative errors. In setting normative values, we followed a statistical methodology (equivalent scores) employed in Italy for other neuropsychological tests, in order to favour the possibility of comparison among these tests. A correlation study between the global score of the Wisconsin card sorting test and the score on the Weigl test was carried out and it emerges that some cognitive aspects are not overlapping in these two measures.

  8. Field testing, refinement, and psychometric evaluation of a new measure of quality of care for assisted living.

    Science.gov (United States)

    Rantz, Marilyn J; Aud, Myra A; Zwygart-Stauffacher, Mary; Mehr, David R; Petroski, Gregory F; Owen, Steven V; Madsen, Richard W; Flesner, Marcia; Conn, Vicki; Maas, Meridean

    2008-01-01

    Field test results are reported for the Observable Indicators of Nursing Home Care Quality Instrument-Assisted Living Version, an instrument designed to measure the quality of care in assisted living facilities after a brief 30-minute walk-through. The OIQ-AL was tested in 207 assisted-living facilities in two states using classical test theory, generalizability theory, and exploratory factor analysis. The 34-item scale has a coherent six-factor structure that conceptually describes the multidimensional concept of care quality in assisted living. The six factors can be logically clustered into process (Homelike and Caring, 21 items) and structure (Access and Choice; Lighting; Plants and Pets; Outdoor Spaces) subscales and for a total quality score. Classical test theory results indicate most subscales and the total quality score from the OIQ-AL have acceptable interrater, test-retest, and strong internal consistency reliabilities. Generalizability theory analyses reveal that dependability of scores from the instrument are strong, particularly by including a second observer who conducts a site visit and independently completes an instrument, or by a single observer conducting two site visits and completing instruments during each visit. Scoring guidelines based on the total sample of observations (N = 358) help guide those who want to use the measure to interpret both subscale and total scores. Content validity was supported by two expert panels of people experienced in the assisted-living field, and a content validity index calculated for the first version of the scale is high (3.43 on a four-point scale). The OIQ-AL gives reliable and valid scores for researchers, and may be useful for consumers, providers, and others interested in measuring quality of care in assisted-living facilities.

  9. Estimating Achievement Gaps from Test Scores Reported in Ordinal "Proficiency" Categories

    Science.gov (United States)

    Ho, Andrew D.; Reardon, Sean F.

    2012-01-01

    Test scores are commonly reported in a small number of ordered categories. Examples of such reporting include state accountability testing, Advanced Placement tests, and English proficiency tests. This paper introduces and evaluates methods for estimating achievement gaps on a familiar standard-deviation-unit metric using data from these ordered…

  10. Development and Psychometric Testing of a Scale for Evaluating Self-Management Needs of Knee Osteoarthritis (SMNKOA) in Taiwan.

    Science.gov (United States)

    Kao, Mei-Hua; Tsai, Yun-Fang

    2017-06-01

    Self-management of osteoarthritis (OA) of the knee is important for treating this chronic disease. This study developed and psychometrically tested a new instrument for measuring adult patients' self-management needs of knee osteoarthritis (SMNKOA). The theoretical framework of self-care guided the development of the 35-item SMNKOA scale. Participants ( N = 372) were purposively sampled from orthopedic clinics at medical centers in Taiwan. The content validity index was 0.83. Principal components analysis identified a three-factor solution, accounting for 53.19% of the variance. The divergent validity was -0.67; convergent validity was -0.51. Cronbach's alpha was .95, Pearson's correlation coefficient was .88, and the intraclass correlation coefficient was .95. The scale's reliability and validity supports the SMNKOA, as a tool to measure self-management needs of adults with knee OA. Nurses and other health care providers can use this instrument to evaluate knee OA patients and identify strategies for improving health-related outcomes and patient education.

  11. TOEFL iBT Speaking Test Scores as Indicators of Oral Communicative Language Proficiency

    Science.gov (United States)

    Bridgeman, Brent; Powers, Donald; Stone, Elizabeth; Mollaun, Pamela

    2012-01-01

    Scores assigned by trained raters and by an automated scoring system (SpeechRater[TM]) on the speaking section of the TOEFL iBT[TM] were validated against a communicative competence criterion. Specifically, a sample of 555 undergraduate students listened to speech samples from 184 examinees who took the Test of English as a Foreign Language…

  12. Optimal Scoring Methods of Hand-Strength Tests in Patients with Stroke

    Science.gov (United States)

    Huang, Sheau-Ling; Hsieh, Ching-Lin; Lin, Jau-Hong; Chen, Hui-Mei

    2011-01-01

    The purpose of this study was to determine the optimal scoring methods for measuring strength of the more-affected hand in patients with stroke by examining the effect of reducing measurement errors. Three hand-strength tests of grip, palmar pinch, and lateral pinch were administered at two sessions in 56 patients with stroke. Five scoring methods…

  13. The Impact of the 2004 Hurricanes on Florida Comprehensive Assessment Test Scores: Implications for School Counselors

    Science.gov (United States)

    Baggerly, Jennifer; Ferretti, Larissa K.

    2008-01-01

    What is the impact of natural disasters on students' statewide assessment scores? To answer this question, Florida Comprehensive Assessment Test (FCAT) scores of 55,881 students in grades 4 through 10 were analyzed to determine if there were significant decreases after the 2004 hurricanes. Results reveal that there was statistical but no practical…

  14. See It, Be It, Write It: Using Performing Arts to Improve Writing Skills and Test Scores

    Science.gov (United States)

    Blecher-Sass, Hope Sara; Moffitt, Maryellen

    2010-01-01

    Improve students' writing skills and boost their assessment scores while adding arts education, creativity, and fun to your writing curriculum. With this vibrant resource, improving writing skills goes hand-in-hand with improving test scores. Students learn how to use acting and visualization as prewriting activities to help them connect writing…

  15. TOEFL iBT Speaking Test Scores as Indicators of Oral Communicative Language Proficiency

    Science.gov (United States)

    Bridgeman, Brent; Powers, Donald; Stone, Elizabeth; Mollaun, Pamela

    2012-01-01

    Scores assigned by trained raters and by an automated scoring system (SpeechRater[TM]) on the speaking section of the TOEFL iBT[TM] were validated against a communicative competence criterion. Specifically, a sample of 555 undergraduate students listened to speech samples from 184 examinees who took the Test of English as a Foreign Language…

  16. Comparison of the Koppitz and Watkins Scoring Systems for the Bender Gestalt Test.

    Science.gov (United States)

    Johnston, Cris W.; Lanak, Brenda

    1985-01-01

    The Bender Gestalt Test was administered to 25 children (7-10 years old) referred for neuropsychological assessment and scored using the Koppitz system and the Watkins system. Although the scores obtained using the two different sets of criteria were highly correlated, the Watkins rules produced generally better performance. (Author/CL)

  17. Use of Standardized Test Scores to Predict Success in a Computer Applications Course

    Science.gov (United States)

    Harris, Robert V.; King, Stephanie B.

    2016-01-01

    The purpose of this study was to see if a relationship existed between American College Testing (ACT) scores (i.e., English, reading, mathematics, science reasoning, and composite) and student success in a computer applications course at a Mississippi community college. The study showed that while the ACT scores were excellent predictors of…

  18. See It, Be It, Write It: Using Performing Arts to Improve Writing Skills and Test Scores

    Science.gov (United States)

    Blecher-Sass, Hope Sara; Moffitt, Maryellen

    2010-01-01

    Improve students' writing skills and boost their assessment scores while adding arts education, creativity, and fun to your writing curriculum. With this vibrant resource, improving writing skills goes hand-in-hand with improving test scores. Students learn how to use acting and visualization as prewriting activities to help them connect writing…

  19. Assessing the Relationship among Defining Issues Test Scores and Crystallised and Fluid Intellectual Indices

    Science.gov (United States)

    Derryberry, W. Pitt; Jones, Kristy L.; Grieve, Frederick G.; Barger, Brian

    2007-01-01

    Differing findings exist on how Defining Issues Test (DIT) scores relate to intelligence. Further study is needed in order to address aspects of intellect not previously considered and to address how these relationships rival studies that have compared indices of intellect with constructs similar to DIT scores. In the present study, a sample of…

  20. A comparison of likelihood ratio tests and Rao's score test for three separable covariance matrix structures.

    Science.gov (United States)

    Filipiak, Katarzyna; Klein, Daniel; Roy, Anuradha

    2017-01-01

    The problem of testing the separability of a covariance matrix against an unstructured variance-covariance matrix is studied in the context of multivariate repeated measures data using Rao's score test (RST). The RST statistic is developed with the first component of the separable structure as a first-order autoregressive (AR(1)) correlation matrix or an unstructured (UN) covariance matrix under the assumption of multivariate normality. It is shown that the distribution of the RST statistic under the null hypothesis of any separability does not depend on the true values of the mean or the unstructured components of the separable structure. A significant advantage of the RST is that it can be performed for small samples, even smaller than the dimension of the data, where the likelihood ratio test (LRT) cannot be used, and it outperforms the standard LRT in a number of contexts. Monte Carlo simulations are then used to study the comparative behavior of the null distribution of the RST statistic, as well as that of the LRT statistic, in terms of sample size considerations, and for the estimation of the empirical percentiles. Our findings are compared with existing results where the first component of the separable structure is a compound symmetry (CS) correlation matrix. It is also shown by simulations that the empirical null distribution of the RST statistic converges faster than the empirical null distribution of the LRT statistic to the limiting χ(2) distribution. The tests are implemented on a real dataset from medical studies. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  1. Understanding and using the brief Implicit Association Test: recommended scoring procedures.

    Directory of Open Access Journals (Sweden)

    Brian A Nosek

    Full Text Available A brief version of the Implicit Association Test (BIAT has been introduced. The present research identified analytical best practices for overall psychometric performance of the BIAT. In 7 studies and multiple replications, we investigated analytic practices with several evaluation criteria: sensitivity to detecting known effects and group differences, internal consistency, relations with implicit measures of the same topic, relations with explicit measures of the same topic and other criterion variables, and resistance to an extraneous influence of average response time. The data transformation algorithms D outperformed other approaches. This replicates and extends the strong prior performance of D compared to conventional analytic techniques. We conclude with recommended analytic practices for standard use of the BIAT.

  2. Psychometric Evaluation of the Life Orientation Test-Revised in Treated Opiate Dependent Individuals

    Science.gov (United States)

    Hirsch, Jameson K.; Britton, Peter C.; Conner, Kenneth R.

    2010-01-01

    We examined internal consistency and test-retest reliability of a measure of dispositional optimism, the Life Orientation Test-Revised, in 121 opiate-dependent patients seeking methadone treatment. Internal consistency was adequate at baseline (alpha = 0.69) and follow-up (alpha = 0.72). Low socioeconomic status and being on disability were…

  3. THE SIMULATED SOCIAL-INTERACTION TEST - A PSYCHOMETRIC EVALUATION WITH DUTCH SOCIAL PHOBIC PATIENTS

    NARCIS (Netherlands)

    MERSCH, PPA; BREUKERS, P; EMMELKAMP, PMG

    1992-01-01

    The Simulated Social Interaction Test (SSIT) was translated and adjusted for use on a population of Dutch males and females. Seventy-four social phobic patients were assessed with the SSIT, a conversation test, and an interview with an independent observer. Results show that the SSIT is a relatively

  4. Evaluating the Psychometric Characteristics of Generated Multiple-Choice Test Items

    Science.gov (United States)

    Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André

    2016-01-01

    Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…

  5. Evaluating the Psychometric Characteristics of Generated Multiple-Choice Test Items

    Science.gov (United States)

    Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André

    2016-01-01

    Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…

  6. Strategies Underlying Psychometric Test Responses in Young and Middle-aged Adults of Varying Educational Background.

    Science.gov (United States)

    Macrae, Kristina S.

    The aim of this study was to investigate the strategies leading to test item responses in 60 young (20-25 years) and 60 middle-aged (35-40 years) adults, whose highest level of education had been either secondary, technical or university. Subjects were individually administered a 12 item test similar to Raven's Progressive Matrices, and were…

  7. Strategies Underlying Psychometric Test Responses in Young and Middle-aged Adults of Varying Educational Background.

    Science.gov (United States)

    Macrae, Kristina S.

    The aim of this study was to investigate the strategies leading to test item responses in 60 young (20-25 years) and 60 middle-aged (35-40 years) adults, whose highest level of education had been either secondary, technical or university. Subjects were individually administered a 12 item test similar to Raven's Progressive Matrices, and were…

  8. THE SIMULATED SOCIAL-INTERACTION TEST - A PSYCHOMETRIC EVALUATION WITH DUTCH SOCIAL PHOBIC PATIENTS

    NARCIS (Netherlands)

    MERSCH, PPA; BREUKERS, P; EMMELKAMP, PMG

    1992-01-01

    The Simulated Social Interaction Test (SSIT) was translated and adjusted for use on a population of Dutch males and females. Seventy-four social phobic patients were assessed with the SSIT, a conversation test, and an interview with an independent observer. Results show that the SSIT is a relatively

  9. Estimating the Relationship between Use of Test-Preparation Methods and Scores on the Graduate Management Admission Test.

    Science.gov (United States)

    Leary, Linda F.; Wightman, Lawrence E.

    This study sought to examine the relationship between five methods of test preparation and test performance as measured by Graduate Management Admission Test (GMAT) Verbal, Quantitative and Total scores. Data on method of test preparation were obtained through voluntary examinee response to five questions which appeared on the answer sheets. One…

  10. Estimating the Relationship between Use of Test-Preparation Methods and Scores on the Graduate Management Admission Test.

    Science.gov (United States)

    Leary, Linda F.; Wightman, Lawrence E.

    This study sought to examine the relationship between five methods of test preparation and test performance as measured by Graduate Management Admission Test (GMAT) Verbal, Quantitative and Total scores. Data on method of test preparation were obtained through voluntary examinee response to five questions which appeared on the answer sheets. One…

  11. Psychometric and psychological effects of review on computerized fixed and adaptive tests

    OpenAIRE

    F J. Abad; M.C. Ximénez; J. Revuelta; Olea, J. (J.)

    2000-01-01

    Efectos psicométricos y psicológicos de la revisión de respuestas en tests fijos y adaptativos informatizados. Se aplicaron dos versiones informatizadas de un test de vocabulario inglés para hispanohablantes (uno fijo y otro adaptativo) a una muestra de estudiantes españoles de primer curso de Psicología. Se estudiaron los efectos del tipo de test (fijo versus adaptativo) y de la condición de revisión (permitida versus no permitida) sobre diversas variables psico...

  12. Scoring Divergent Thinking Tests by Computer With a Semantics-Based Algorithm

    Directory of Open Access Journals (Sweden)

    Kenes Beketayev

    2016-05-01

    Full Text Available Divergent thinking (DT tests are useful for the assessment of creative potentials. This article reports the semantics-based algorithmic (SBA method for assessing DT. This algorithm is fully automated: Examinees receive DT questions on a computer or mobile device and their ideas are immediately compared with norms and semantic networks. This investigation compared the scores generated by the SBA method with the traditional methods of scoring DT (i.e., fluency, originality, and flexibility. Data were collected from 250 examinees using the “Many Uses Test” of DT. The most important finding involved the flexibility scores from both scoring methods. This was critical because semantic networks are based on conceptual structures, and thus a high SBA score should be highly correlated with the traditional flexibility score from DT tests. Results confirmed this correlation (r = .74. This supports the use of algorithmic scoring of DT. The nearly-immediate computation time required by SBA method may make it the method of choice, especially when it comes to moderate- and large-scale DT assessment investigations. Correlations between SBA scores and GPA were insignificant, providing evidence of the discriminant and construct validity of SBA scores. Limitations of the present study and directions for future research are offered.

  13. A Maturing Global Testing Regime Meets the World Economy: Test Scores and Economic Growth, 1960-2012

    Science.gov (United States)

    Kamens, David H.

    2015-01-01

    This article considers the growth of the international testing regime. It discusses sources of growth and empirically examines two related sets of issues: (1) the stability of countries' achievement scores, and (2) the influence of those national scores on subsequent economic development over different time lags. The article suggests that…

  14. Peabody Picture Vocabulary Test (PPVT-III: Psychometric properties and significance for application

    Directory of Open Access Journals (Sweden)

    Nataša Bucik

    2003-12-01

    Full Text Available The purpose of this article is to present the content, conceptual structure and methodological steps of the latest revision of the Peabody Picture Vocabulary Test (PPVT-III, which is a highly functional and valuable vocabulary test that has been in use since 1959 in different language and cultural surroundings. On the case of the PPVT-III we are presenting the procedure of development and standardization of such vocabulary tests as well as its translation and adaptation from one language and cultural milieu to another. We also note the practical use of the PPVT-III for research purposes. In Slovenian language no vocabulary tests were developed or adapted so far; PPVT-III is presented in this context, too.

  15. The Sinonasal Outcome Test 22 score in persons without chronic rhinosinusitis

    DEFF Research Database (Denmark)

    Lange, Bibi; Thilsing, T; Baelum, J

    2016-01-01

    -67 with a mean score of 10.5 (CI: 9.1 - 11.9) and the median score was 7. Persons with allergic rhinitis and blue collar workers had a significant higher score. CONCLUSION: The median value of 7 is taken as the normal SNOT 22 score in persons without CRS and can be used as a reference in clinical settings......OBJECTIVES: To determine the Sino Nasal Outcome Test 22 (SNOT 22) score in persons without chronic rhinosinusitis. DESIGN AND SETTING: As part of a trans-European study selected respondents to a survey questionnaire were invited for a clinical visit. Subjective symptoms and rhinoscopy were used...... for the clinical diagnosis of chronic rhinosinusitis according to EPOS. PARTICIPANTS: A total of 366 persons participated at the clinical visit and of these 268 did not have chronic rhinosinusitis. All participants completed the SNOT 22. MAIN OUTCOME MEASURES: The SNOT 22. RESULTS: The SNOT 22 score ranged from 0...

  16. Validity of Alternative Cut-Off Scores for the Back-Saver Sit and Reach Test

    Science.gov (United States)

    Looney, Marilyn A.; Gilbert, Jennie

    2012-01-01

    The purpose of the study was to determine if currently used FITNESSGRAM[R] cut-off scores for the Back Saver Sit and Reach Test had the best criterion-referenced validity evidence for 6-12 year old children. Secondary analyses of an existing data set focused on the passive straight leg raise and Back Saver Sit and Reach Test flexibility scores of…

  17. Development and psychometric testing of a new instrument to measure factors influencing women's breast cancer prevention behaviors (ASSISTS).

    Science.gov (United States)

    Khazaee-Pool, Maryam; Majlessi, Fereshteh; Montazeri, Ali; Pashaei, Tahereh; Gholami, Ali; Ponnet, Koen

    2016-07-22

    Breast cancer preventive behaviors have an extreme effect on women's health. Despite the benefits of preventive behaviors regarding breast cancer, they have not been implemented as routine care for healthy women. To assess this health issue, a reliable and valid scale is needed. The aim of the present study is to develop and examine the psychometric properties of a new scale, called the ASSISTS, in order to identify factors that affect women's breast cancer prevention behaviors. A multi-phase instrument development method was performed to develop the questionnaire from February 2012 to September 2014. The item pool was generated based on secondary analyses of previous qualitative data. Then, content and face validity were applied to provide a pre-final version of the scale. The scale validation was conducted with a sample of women recruited from health centers affiliated with Tehran University of Medical Sciences. The construct validity (both exploratory and confirmatory), convergent validity, discriminate validity, internal consistency reliability and test-retest analysis of the questionnaire were tested. Fifty-eight items were initially extracted from the secondary analysis of previous qualitative data. After content validity, this was reduced to 49 items. The exploratory factor analysis revealed seven factors (Attitude, supportive systems, self-efficacy, information seeking, stress management, stimulant and self-care) containing 33 items that jointly accounted for 60.62 % of the observed variance. The confirmatory factor analysis showed a model with appropriate fitness for the data. The Cronbach's alpha coefficient for the subscales ranged from 0.68 to 0.85, and the Intraclass Correlation Coefficient (ICC) ranged from 0.71 to 0.98; which is well above the acceptable thresholds. The findings showed that the designed questionnaire was a valid and reliable instrument for assessing factors affecting women's breast cancer prevention behaviors that can be used both

  18. The 24-h recall instrument for home nursing to measure the activity profile of home nurses: development and psychometric testing.

    Science.gov (United States)

    De Vliegher, Kristel; Aertgeerts, Bert; Declercq, Anja; Gosset, Christiane; Heyden, Isabelle; Van Geert, Michel; Moons, Philip

    2015-01-01

    Home health care today is challenged by a shift from an acute to a chronic health-care model, moving the focus of care from the hospital to home-care setting. This increased focus on care at home emphasizes the need for an efficient, effective, and transparent management of home health care. However, it is not precisely known what home-care nurses do; what kind of care is received by patients; what the performance of home nurses is; and what the impact of the increasing need for home nursing is on the current and future role of home nurses. In this respect, it is necessary to gain a clear insight into the activity profile of home nurses, but there is no gold standard to measure their activities. This study reports on the development and psychometric testing of the '24-hour recall instrument for home nursing' to measure the activity profile of home nurses. Five home nurses in Belgium, simultaneously with the researcher, registered the performed activities in a total of 69 patients, using the 24-h recall instrument for home nursing. The validity and the interrater reliability of this instrument were high: the proportions that observed agreement were very high; the strength of kappa agreement was substantial to almost perfect; the prevalence index showed great variety; and the bias index was low. The findings in this study support the validity evidence based on test content and the interrater reliability of the 24-h recall instrument. This instrument can help to shape practice and policy by making the home nursing profession more transparent: a clear insight into the kind of care that is provided by home nurses and is received by the patients in primary care contributes to the development of a clear definition of the role of home nurses in health care.

  19. Demands on Users for Interpretation of Achievement Test Scores: Implications for the Evaluation Profession

    Science.gov (United States)

    Della-Piana, Gabriel Mario; Gardner, Michael

    2011-01-01

    Background: Professional standards for validity of achievement tests have long reflected a consensus that validity is the degree to which evidence and theory support interpretations of test scores entailed by the intended uses of tests. Yet there are convincing lines of evidence that the standards are not adequately followed in practice, that…

  20. The Role of Psychometric Modeling in Test Validation: An Application of Multidimensional Item Response Theory

    Science.gov (United States)

    Schilling, Stephen G.

    2007-01-01

    In this paper the author examines the role of item response theory (IRT), particularly multidimensional item response theory (MIRT) in test validation from a validity argument perspective. The author provides justification for several structural assumptions and interpretations, taking care to describe the role he believes they should play in any…

  1. Testing the Psychometric Properties of the Scale of Ethnocultural Empathy in Turkey

    Science.gov (United States)

    Özdikmenli-Demir, Gözde; Demir, Serdar

    2014-01-01

    The validity and reliability of the Scale of Ethnocultural Empathy (SEE) was tested using a sample of 348 emerging adult university students living in Turkey. Different from the original scale's four-factor structure, results of principal components analyses and confirmatory factor analyses exposed that there were three factors explaining 45% of…

  2. Testing the Psychometric Features of the Academic Intellectual Leadership Scale in a University Environment

    Science.gov (United States)

    Uslu, Baris

    2015-01-01

    The purpose of this research is to develop a scale for measuring the level of academics' intellectual leadership, test the scale by examining the influence of their personal and institutional characteristics, and then investigate the relationship of academic intellectual leadership (AIL) to communication, climate, and managerial flexibility…

  3. Psychometric evaluation of the EORTC computerized adaptive test (CAT) fatigue item pool

    DEFF Research Database (Denmark)

    Petersen, Morten Aa; Giesinger, Johannes M; Holzner, Bernhard

    2013-01-01

    Fatigue is one of the most common symptoms associated with cancer and its treatment. To obtain a more precise and flexible measure of fatigue, the EORTC Quality of Life Group has developed a computerized adaptive test (CAT) measure of fatigue. This is part of an ongoing project developing a CAT v...

  4. Psychometric Properties and Factorial Structure of the Chinese Version of the Gratitude Resentment and Appreciation Test

    Science.gov (United States)

    Lin, Shu-Hui; Huang, Yun-Chen

    2016-01-01

    The purpose of this study was to validate a Chinese version of the Gratitude Resentment and Appreciation Test (GRAT) with Taiwanese students. In Study 1, a total of 2511 Taiwanese students participated and completed the translated GRAT. Exploratory factor analysis, confirmatory factor analysis and reliability analysis were undertaken to assess the…

  5. A Review of Literature Regarding Scientific Controversies Surrounding the Psychometric Properties of the Rorschach Inkblot Test

    Science.gov (United States)

    Park, Kevin Neil

    2009-01-01

    The Rorschach Inkblot Test has been the focus of intense controversy, significantly impacting clinicians who currently rely on Exner's Comprehensive System (CS; Exner, 2003) in clinical and forensic settings. This paper evaluates recent empirical CS research to determine whether or not it reveals lack of scientific merit as some skeptics have…

  6. The Alliance Negotiation Scale: A psychometric investigation.

    Science.gov (United States)

    Doran, Jennifer M; Safran, Jeremy D; Muran, J Christopher

    2016-08-01

    This study investigates the utility and psychometric properties of a new measure of psychotherapy process, the Alliance Negotiation Scale (ANS; Doran, Safran, Waizmann, Bolger, & Muran, 2012). The ANS was designed to operationalize the theoretical construct of negotiation (Safran & Muran, 2000), and to extend our current understanding of the working alliance concept (Bordin, 1979). The ANS was also intended to improve upon existing measures such as the Working Alliance Inventory (WAI; Horvath & Greenberg, 1986, 1989) and its short form (WAI-S; Tracey & Kokotovic, 1989) by expanding the emphasis on negative therapy process. The present study investigates the psychometric validity of the ANS test scores and interpretation-including confirming its original factor structure and evaluating its internal consistency and construct validity. Construct validity was examined through the ANS' convergence and divergence with several existing scales that measure theoretically related constructs. The results bolster and extend previous findings about the psychometric integrity of the ANS, and begin to illuminate the relationship between negotiation and other important variables in psychotherapy research. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  7. Preliminary test of effects of cognitive ability, experience, and teaching methods on Verbal Analogy Test scores.

    Science.gov (United States)

    Rosenberg, D; Willson-Quayle, A; Pasnak, R

    2000-06-01

    The methods from which one can choose when preparing for the GRE Verbal Analogies include books, software, audiotapes, and formal classroom instruction. What teaching method will work best for a given individual? To begin the search for an answer, Gray's test of reasoning ability was given to 28 undergraduates who also answered a questionnaire detailing their experience with analogies. They were randomly assigned to teaching conditions ranging from self-directed workbook study to intensive interactive assistance. No teaching method was superior overall, but interactions showed that (1) students who scored worst on the pretest improved the most, (2) those higher in cognitive functioning and experience performed better after intensive interactive assistance, and (3) those lower in both cognitive functioning and experience did significantly better with self-paced workbooks. This preliminary work suggests that it may be profitable to assess the prior experience and reasoning of potential students and adopt the methods for teaching formal operational thought found empirically to be most suitable.

  8. An Analysis of Cross Racial Identity Scale Scores Using Classical Test Theory and Rasch Item Response Models

    Science.gov (United States)

    Sussman, Joshua; Beaujean, A. Alexander; Worrell, Frank C.; Watson, Stevie

    2013-01-01

    Item response models (IRMs) were used to analyze Cross Racial Identity Scale (CRIS) scores. Rasch analysis scores were compared with classical test theory (CTT) scores. The partial credit model demonstrated a high goodness of fit and correlations between Rasch and CTT scores ranged from 0.91 to 0.99. CRIS scores are supported by both methods.…

  9. Generalization of the Lord-Wingersky Algorithm to Computing the Distribution of Summed Test Scores Based on Real-Number Item Scores

    Science.gov (United States)

    Kim, Seonghoon

    2013-01-01

    With known item response theory (IRT) item parameters, Lord and Wingersky provided a recursive algorithm for computing the conditional frequency distribution of number-correct test scores, given proficiency. This article presents a generalized algorithm for computing the conditional distribution of summed test scores involving real-number item…

  10. Development and Psychometric Testing of the Humanistic Nurse-Patient Scale.

    Science.gov (United States)

    Boscart, Veronique M; Pringle, Dorothy; Peter, Elizabeth; Wynn, Francine; McGilton, Katherine S

    2016-03-01

    Quality of life and well-being of older patients in chronic care facilities is often determined by their relationships with nurses. The authors developed and tested a scale to assess patients' views of what matters most when relating to nurses. Based on the humanistic nursing theory by Paterson and Zderad (1988), 69 items were created and tested with a sample of 40 patients, resulting in refinement of a scale with 24 items. This scale was factor analysed on responses from 249 patients residing in five facilities in Ontario, Canada. The Humanistic Relationship Importance Scale demonstrated strong internal consistency, stability, and reliability with a five-factor solution (α = .87). Construct validity was supported through factual identification. This scale is a valid measure of patients' perspectives of a nurse-patient relationship in chronic care and can be used to measure health professionals' relationships with their older patients and evaluate interventions to enhance relational care.

  11. Improvement in Intelligence Test Scores from 6 to 10 years in Children of Teenage Mothers

    Science.gov (United States)

    Cornelius, Marie D.; Goldschmidt, Lidush; De Genna, Natacha M.; Richardson, Gale A.; Leech, Sharon L.; Day, Richard

    2010-01-01

    Objective This study investigates change in IQ scores among 290 children born to teenage mothers and identifies social, economic, and environmental variables that may be associated with change in intelligence test performance. Methods The children of 290 teenage mothers (72% African American and 28% European American) were assessed with the Stanford-Binet Intelligence Scale-4th Edition (SBIS) at ages 6 and 10. Results The mean composite score at age 6 was 84.8 and was 91.2 at age 10, an improvement of 6.4 points. Significant cross-sectional predictors at both ages 6 and 10 of higher SBIS scores were maternal cognitive ability, school grade, Caucasian ethnicity, and caregiver education. Having more children in the household significantly predicted lower SBIS scores at age 6. Higher satisfaction with maternal social support predicted higher SBIS scores at age 10. Change in IQ scores was not related to maternal socioeconomic status, social support, home environment, ethnicity, or family interactions. Custodial stability was associated with an improvement in IQ scores, while increase in caregiver depression was related to decline in IQ scores. Conclusions Our findings suggest that improvement in IQ scores of offspring of teenage mothers may be related to stability of maternal custody. More research is needed to determine the impact of the maturation of adolescent mothers' parenting and the role of early education on improvement in cognitive abilities. PMID:20495472

  12. A Psychometric Review of Measures Assessing Discrimination Against Sexual Minorities.

    Science.gov (United States)

    Morrison, Todd G; Bishop, C J; Morrison, Melanie A; Parker-Taneo, Kandice

    2016-08-01

    Discrimination against sexual minorities is widespread and has deleterious consequences on victims' psychological and physical wellbeing. However, a review of the psychometric properties of instruments measuring lesbian, gay, and bisexual (LGB) discrimination has not been conducted. The results of this review, which involved evaluating 162 articles, reveal that most have suboptimal psychometric properties. Specifically, myriad scales possess questionable content validity as (1) items are not created in collaboration with sexual minorities; (2) measures possess a small number of items and, thus, may not sufficiently represent the domain of interest; and (3) scales are "adapted" from measures designed to examine race- and gender-based discrimination. Additional limitations include (1) summed scores are computed, often in the absence of scale score reliability metrics; (2) summed scores operate from the questionable assumption that diverse forms of discrimination are necessarily interrelated; (3) the dimensionality of instruments presumed to consist of subscales is seldom tested; (4) tests of criterion-related validity are routinely omitted; and (5) formal tests of measures' construct validity are seldom provided, necessitating that one infer validity based on the results obtained. The absence of "gold standard" measures, the attendant difficulty in formulating a coherent picture of this body of research, and suggestions for psychometric improvements are noted.

  13. Multiple Constructs and Effects of Accommodations on Accommodated Test Scores for Students with Disabilities

    Directory of Open Access Journals (Sweden)

    Stephanie W. Cawthon

    2009-10-01

    Full Text Available Students with disabilities frequently use accommodations to participate in large-scale, standardized assessments. Accommodations can include changes to the administration of the test, such as extended time, changes to the test items, such as read aloud, or changes to the student's response, such as the use of a scribe. Some accommodations or modifications risk changing the difficulty of the test items or decreasing the validity of how test scores are interpreted. Questions regarding the validity of accommodated tests are heightened when scores are used in high-stakes decisions such as grade promotion, graduation, teacher merit pay, or other accountability initiatives. The purpose of this article is to review existing literature on multiple constructs that affect validity of interpretations of accommodated assessment scores. Research on assessment accommodations continues to grow but offers few conclusive findings on whether they facilitate fair and accurate measurement of student knowledge and skill. The validity of an accommodated score appears to vary depending on several factors such as student characteristics, test characteristics, and the accommodations themselves. A multiple construct approach may facilitate more accurate evaluations of the effects of accommodated test scores

  14. Development and psychometric testing of an instrument to compare career choice influences and perceptions of nursing among healthcare students.

    Science.gov (United States)

    Liaw, Sok Ying; Wu, Ling Ting; Lopez, Violeta; Chow, Yeow Leng; Lim, Siriwan; Holroyd, Eleanor; Tan, Khoon Kiat; Wang, Wenru

    2017-04-27

    With the availability of more healthcare courses and an increased intake of nursing students, education institutions are facing challenges to attract school leavers to enter nursing courses. The comparison of career choice influences and perception of nursing among healthcare students can provide information for recruitment strategies. An instrument to compare the influences of healthcare career choice is lacking. The purpose of this study is to develop and evaluate the psychometric properties of an instrument to compare the influences of healthcare career choice with perceptions of nursing as a career choice. The study was conducted in two phases. In phase one, two sets of scales with parallel items that measure the influences of healthcare career choice and perceptions of nursing as a career choice were developed through an earlier qualitative study, literature review, and expert validation. Phase two involved testing the construct validity, concurrent validity and reliability with a convenience sample of 283 first year healthcare students who were recruited at two education institutions in Singapore. An exploratory factor analysis revealed 35-parallel items in a six-factor solution (personal interest, prior healthcare exposure, self-efficacy, perceived nature of work, job prospects, and social influences) that explained 59 and 64% of the variance for healthcare career choice and nursing as a career choice respectively. A high correlation (r = 0.76, p instrument provides opportunities for understanding the differences between influences of healthcare career choice and perceptions of nursing as a career choice. This comparative understanding of career choice influences can guide educator and policy-makers on nursing recruitment.

  15. IQs Are Very Strong but Imperfect Indicators of Psychometric "g": Results from Joint Confirmatory Factor Analysis

    Science.gov (United States)

    Farmer, Ryan L.; Floyd, Randy G.; Reynolds, Matthew R.; Kranzler, John H.

    2014-01-01

    The most global score yielded by intelligence tests, IQs, are supported by substantial validity evidence and have historically been central to the identification of intellectual disabilities, learning disabilities, and giftedness. This study examined the extent to which IQs measure the ability they target, psychometric "g." Data from…

  16. A Psychometric Investigation of the Academic Self-Concept of Asian American College Students

    Science.gov (United States)

    Cokley, Kevin; Patel, Nima

    2007-01-01

    The psychometric properties of scores from the Academic Self-Concept Scale are examined in a group of 198 Asian American college students. Using parallel analysis, a four-factor solution accounting for 46% of the variance was found. In a test of construct validity, academic self-concept was found to be negatively related to adherence to Asian…

  17. The Care Dependency Scale:psychometric testing of the Chinese version

    Institute of Scientific and Technical Information of China (English)

    Shu-Qi Zhang; Hong-Mei Wang; Xiao-Yuan Yang; Wen-Qin Ye

    2016-01-01

    Objective: To determine the validity and reliability of the Chinese version of the Care Dependency Scale (CDS) for proxy assessments by nurses in Chinese nursing homes. The CDS is an excellent tool for measuring care dependency in many countries. The Chinese version of the CDS contributes additional data regarding the scale's use. Methods: This was a cross-sectional study of 184 older adults; 128 were re-evaluated after 2 weeks. Cronbach's alpha, the mean inter-item correlation, the inter-rater reliability and the test-retest reliability were used to analyze the reliability of the CDS. A factor analysis was carried out to find common di-mensions between the CDS versions. Results: The alpha coefficient of the CDS was 0.95. The subsequent inter-rater and test-retest reliabilities were analyzed in terms of the Kappa values, which were between 0.84 and 0.89; 0.83 and 0.92, respectively. Three factors were identified in the factor analysis. Conclusions: The CDS is a suitable tool for assessing care dependency among elderly Chinese residents in nursing homes.

  18. Measuring patients’ satisfaction with their anti-TNF treatment in severe Crohn’s disease: scoring and psychometric validation of the Satisfaction for PAtients in Crohn’s diseasE Questionnaire (SPACE-Q©

    Directory of Open Access Journals (Sweden)

    Gilet H

    2014-12-01

    Full Text Available Hélène Gilet,1 Benoit Arnould,1 Fatoumata Fofana,1 Pierre Clerson,2 Jean-Frédéric Colombel,10 Olivier D’Hondt,2 Patrick Faure,4 Hervé Hagège,5 Maria Nachury,3 Stéphane Nahon,6 Gilbert Tucat,7 Luc Vandromme,8 Ines Cazala-Telinge,9 Emmanuel Thibout9 1HEOR and Strategic Market Access, Mapi, Lyon, France; 2Orgamétrie, Roubaix, France; 3Hôpital Claude Huriez, Lille, France; 4Clinique Saint-Jean du Languedoc, Toulouse, France; 5Centre Hospitalier Intercommunal, Créteil, France; 6Centre Hospitalier Intercommunal, Le Raincy Montfermeil, France; 7Gastroenterologist, Private Clinical Practice, Paris, France; 8Gastroenterologist, Private Clinical Practice, Reims, France; 9Abbvie France, Rungis, France; 10Icahn School of Medicine at Mount Sinai, New York, NY, USA Background: Severe Crohn’s disease management includes anti-tumor necrosis factor (anti-TNF drugs that differ from early-stage treatments regarding efficacy, safety, and convenience. This study aimed to finalize and psychometrically validate the Satisfaction for PAtients in Crohn’s diseasE Questionnaire (SPACE-Q©, developed to measure satisfaction with anti-TNF treatment in patients with severe Crohn’s disease. Methods: A total of 279 patients with severe Crohn’s disease receiving anti-TNF therapy completed the SPACE-Q 62-item pilot version at inclusion and 12 and 13 weeks after first anti-TNF injection. The final SPACE-Q scoring was defined using multitrait and regression analyses and clinical relevance considerations. Psychometric validation included clinical validity against Harvey–Bradshaw score, concurrent validity against Treatment Satisfaction Questionnaire for Medication (TSQM, internal consistency reliability, test–retest reliability, and responsiveness against the patient global impression of change (PGIC.Results: Quality of completion was good (55%–67% of patients completed all items. Four items were removed from the questionnaire. Eleven scores were defined

  19. Social Cognition Psychometric Evaluation: Results of the Initial Psychometric Study.

    Science.gov (United States)

    Pinkham, Amy E; Penn, David L; Green, Michael F; Harvey, Philip D

    2016-03-01

    Measurement of social cognition in treatment trials remains problematic due to poor and limited psychometric data for many tasks. As part of the Social Cognition Psychometric Evaluation (SCOPE) study, the psychometric properties of 8 tasks were assessed. One hundred and seventy-nine stable outpatients with schizophrenia and 104 healthy controls completed the battery at baseline and a 2-4-week retest period at 2 sites. Tasks included the Ambiguous Intentions Hostility Questionnaire (AIHQ), Bell Lysaker Emotion Recognition Task (BLERT), Penn Emotion Recognition Task (ER-40), Relationships Across Domains (RAD), Reading the Mind in the Eyes Task (Eyes), The Awareness of Social Inferences Test (TASIT), Hinting Task, and Trustworthiness Task. Tasks were evaluated on: (i) test-retest reliability, (ii) utility as a repeated measure, (iii) relationship to functional outcome, (iv) practicality and tolerability, (v) sensitivity to group differences, and (vi) internal consistency. The BLERT and Hinting task showed the strongest psychometric properties across all evaluation criteria and are recommended for use in clinical trials. The ER-40, Eyes Task, and TASIT showed somewhat weaker psychometric properties and require further study. The AIHQ, RAD, and Trustworthiness Task showed poorer psychometric properties that suggest caution for their use in clinical trials. © The Author 2015. Published by Oxford University Press on behalf of the Maryland Psychiatric Research Center. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  20. Propriedades psicométricas do conjunto de testes da habilidade visuo espacial Psychometric proprieties of visual-spatial ability tests kit

    Directory of Open Access Journals (Sweden)

    Cristiano Mauro Assis Gomes

    2009-04-01

    Full Text Available Este artigo analisa as propriedades psicométricas de um conjunto de testes elaborados para mensurar a habilidade visuo espacial (Gv, quanto a sua estrutura fatorial, unidimensionalidade, validade convergente e discriminante. Fizeram parte da pesquisa 558 participantes que cursavam ou concluíram o ensino médio. Utilizou-se o Teste de Visualização (Vz, Teste de Flexibilidade de Fechamento (CF, Teste de Memória Visual (MV, Teste de Fluência Figural (FF, Teste de Fluência Ideacional 1 (FI1 e Teste de Fluência Ideacional 2 (FI2. Foram usados procedimentos de análise fatorial exploratória, análise fatorial confirmatória e o ajuste ao modelo Rasch. Os índices de ajuste dos modelos aos dados estão adequados às recomendações da literatura. Os testes possuem uma estrutura fatorial de dois níveis, porém unidimensional. Convergem para mensurar a habilidade visuo espacial (Gv e divergem na mensuração da fluência, utilizada neste estudo como habilidade contrastante.This article analyses the psychometrical properties of a test kit developed to measure visual-spatial ability in terms of its factorial structure, unidimensionality, convergent validity and divergent validity. A sample of 558 students that were either attending High School or had just concluded it participated of this research. The Visualization Test (Vz, Flexibility of Closure Test (CF, Visual Memory Test (MV, Figural Fluency Test (FF, Ideational Fluency Test 1 (FI1 and Ideational Fluency Test 2 (FI2 were used. The data analysis used the exploratory factorial analysis, the confirmatory factorial analysis and the fit to the Rasch model procedures. The model fit indexes are in accordance with the cutoff values suggested by the literature. The tests showed a two-level factorial structure, although it can be taken as an unidimensional structure. The tests converge to measure the visual-spatial ability and diverge to measure the fluency ability which was used in this study as

  1. The Dental Hygiene Aptitude Tests and the American College Testing Program Tests as Predictors of Scores on the National Board Dental Hygiene Examination.

    Science.gov (United States)

    Longenbecker, Sueann; Wood, Peter H.

    1984-01-01

    Scores from the National Board Dental Hygiene Examination (NBDHE) served as the criterion variable in a comparison of the predictive validity of the Dental Hygiene Aptitude Tests (DHAT) and the ACT Assessment tests. The DHAT-Science and Verbal tests combined to produce the highest multiple correlation with NBDHE scores. (Author/DWH)

  2. Discordance between Psychometric Testing and Questionnaire-Based Definitions of Executive Function Deficits in Individuals with ADHD

    Science.gov (United States)

    Biederman, Joseph; Petty, Carter R.; Fried, Ronna; Black, Sarah; Faneuil, Alicia; Doyle, Alysa E.; Seidman, Larry J.; Faraone, Stephen V.

    2008-01-01

    Objective: One suspected source of negative outcomes associated with ADHD has been deficits in executive functions. Although both psychometrically defined and self-reported executive function deficits (EFDs) have been shown to be associated with poor academic and occupational outcomes, whether these two approaches define the same individuals…

  3. DEVELOPMENT AND TESTING PSYCHOMETRIC PROPERTIES OF AN ICF-BASED HEALTH MEASURE : THE NEUROMUSCULAR DISEASE IMPACT PROFILE

    NARCIS (Netherlands)

    Bos, Isaac; Kuks, Jan B. M.; Wynia, Klaske

    2015-01-01

    Objectives: To develop a measure that is based on the International Classification of Functioning, Disability and Health (ICF) and reflects the prevalence and severity of disabilities related to neuromuscular disorders, and to evaluate the psychometric properties of this measure. Methods: A prelimin

  4. Raven’s test performance of sub-Saharan Africans: average performance, psychometric properties, and the Flynn Effect

    NARCIS (Netherlands)

    Wicherts, J.M.; Dolan, C.V.; Carlson, J.S.; van der Maas, H.L.J.

    2010-01-01

    This paper presents a systematic review of published data on the performance of sub-Saharan Africans on Raven's Progressive Matrices. The specific goals were to estimate the average level of performance, to study the Flynn Effect in African samples, and to examine the psychometric meaning of Raven's

  5. Psychometric testing of the Revised Humane Caring Scale for adult patients in Singapore.

    Science.gov (United States)

    Goh, Mien Li; Ang, Emily N K; Chan, Yiong-Huak; He, Hong-Gu; Vehviläinen-Julkunen, Katri

    2015-09-01

    In this study, we examined the validity and reliability of the Revised Humane Caring Scale as used by adult patients in a tertiary hospital in Singapore. A three-phase descriptive quantitative study was conducted. In phase I, an expert panel of nurses and inpatients examined the content validity of the scale; phase II comprised a pilot study on 20 patients; and in phase III, a large-scale study on 235 patients was implemented to test the internal consistency of the scale. The results revealed that the content validity index of the scale ranged from 0.856 to 1, and the scale had a high inter-rater agreement kappa value of 0.940. Cronbach's alpha ranged from 0.798 to 0.877 in phase II, and from 0.579 to 0.760 in phase III, respectively. The Revised Humane Caring Scale revealed good content validity and an acceptable level of internal consistency. The scale is an acceptable measurement tool for evaluating adult patients' satisfaction during hospitalization.

  6. Measuring quality improvement in public health: the development and psychometric testing of a QI Maturity Tool.

    Science.gov (United States)

    Joly, Brenda M; Booth, Maureen; Mittal, Prashant; Shaler, George

    2012-06-01

    There is growing interest and investment in improving the quality of public health services and outcomes. Following the lead of other sectors, efforts are underway to introduce systematic quality improvement (QI) tools and approaches to state and local public health agencies. Little is known, however, about how to describe and reliably measure the level of QI maturity within a public health agency. The authors describe the development of a QI Maturity Tool using research from the fields of organizational design, psychology, health care, and complexity theory. The 37-item assessment tool is based on four quality domains derived from the literature: (a) organizational culture, (b) capacity and competency, (c) practice, and (d) alignment and spread. The tool was designed to identify features of an organization that may be enhancing or impeding QI; monitor the impact of efforts to create a more favorable environment for QI; and define potential cohorts of public health agencies for evaluation purposes. The article presents initial steps in testing and validating the QI Maturity Tool including: (a) developing a theoretical framework, (b) assuring face and content validity, (c) determining the tool's reliability based on estimates of internal consistency, (d) assessing the dimensionality, and (f) determining the construct validity of the instrument. The authors conclude that there is preliminary evidence that the QI Maturity Tool is a promising instrument. Further work is underway to explore whether self-reported survey results align with an agency's actions and the products of their QI efforts.

  7. Measuring Sexual Motives: A Test of the Psychometric Properties of the Sexual Motivations Scale.

    Science.gov (United States)

    Jardin, Charles; Garey, Lorra; Zvolensky, Michael J

    2016-11-14

    Sexual motives refer to functions served by sexual behavior. The Sex Motivations Scale (SMS) has frequently been used to assess sexual motives. At its development, the SMS demonstrated good internal consistency; convergent, divergent, and criterion validity; and configural invariance across sex, age, and Caucasians and African Americans. Yet the metric and scalar invariance of the SMS has not been examined, nor has the measurement invariance of the SMS across Hispanic and Asian Americans, sexual minority status, and relationship status been tested. The criterion validity of the SMS also has yet to be examined for nonintercourse sexual behaviors, such as sexting. The present study aimed to address these gaps in a diverse sample of 2,201 college students (77.60% female; Mage = 22.06; 27.84% Caucasian). Results further affirmed the configural, metric, and scalar invariance of the SMS. The convergent and divergent validity of the SMS was supported in relation to positive and negative affect and attachment patterns; and specific SMS subscales demonstrated associations with sexual intercourse behaviors and sexting, supporting the criterion validity of the SMS. These findings suggest the relevance of the SMS in assessing sexual motives across diverse populations and behaviors.

  8. Exploring discrepancies in the TOEFL iBT scores of repeat test takers

    OpenAIRE

    2011-01-01

    Students choosing to study abroad either independently or as a component of a local degree must first demonstrate their language proficiency by achieving the required score of their destination institution in a TOEFL iBT or IELTS examination. With such high stakes, many candidates opt to repeat the test a number of times before the submission date in the hope that one of the tests will yield a sufficiently high score. This study analyzed the test results of twenty-five students who, toward th...

  9. Examining alternative scoring rubrics on a statewide test: The impact of different scoring methods on science and social studies performance assessments

    Science.gov (United States)

    Creighton, Susan Dabney

    There is no consensus regarding the most reliable and valid scoring methods for the assessment of higher order thinking skills. Most of the research on alternative formats has focused on the scoring of writing ability. This study examined the value of different types of performance assessment scoring guides on state mandated science and social studies tests. A proportional stratified sample of raters were randomly assigned to one of four scoring groups: checklist, analytic rubric, holistic rubric, and generic rubrics. A fifth method, the weighted analytic rubric, was included by applying an algorithmic formula to the scores assigned by raters using the analytic rubric. A comparison of the mean scores for the five scoring groups suggests that there may be a difference in the way raters applied the rubric for each group. Although the literature suggests that it is possible to achieve high levels of inter-rater reliability, across forms of scoring, phi coefficients of moderate strength were obtained for three of the four constructed-response items. Results for each scoring group were compared indicating that item complexity may impact the level of inter-rate, reliability and the selection of the most reliable rubric for each discipline. Analytic rubrics appear to achieve more reliable results with less complex items. A multitrait-multimethod approach was utilized to investigate the external validity of the social studies and science tasks. As expected, there tended to be a stronger association between the PACT science constructed-response scores with scores based on science multiple-choice scores than between the science constructed-response scores and the writing ability subtest scores. A similar pattern was seen with social studies items. These results provide some evidence for the validity of the performance assessments. A post study survey completed by raters provided qualitative information regarding their thought processes and their primary focus during the

  10. An electrophysiological correlate of Eating Attitudes Test scores in female college students.

    Science.gov (United States)

    Wilson, J F; Mercer, J C

    1990-11-01

    Eating Attitudes Test (EAT) scores of forty female college students were compared to their electrodermal activity (EDA) responses when offered a plate of chocolate chip cookies. A significant positive correlation was detected between the EAT scores and the skin conductivity measures associated with the presentation of food. Women with the highest EAT scores also exhibited the greatest sympathetic nervous system responses to a plate of cookies. This finding supports the conclusion that the EAT is capable of identifying individuals who are preoccupied with food or anxious about eating.

  11. Psychometric properties of the Polish version of the Body Attitude Test

    Directory of Open Access Journals (Sweden)

    Brytek-Matera, Anna

    2014-03-01

    Full Text Available Background. Polish studies have not been focused on measures which assess attitude toward one’s own body. The purpose of the present study was to develop the Polish version of the Body Attitude Test (BAT and examine its reliability. Methods. The study included 48 patients with anorexia nervosa, 39 patients with bulimia nervosa as well as 115 female university students who had never suffered from any eating disorder. The mean age was 18.69 years (SD= 3.52 in women with anorexia nervosa, 22.28 years (SD= 3.79 in women with bulimia nervosa as well as 20.53 years (SD= 1.79 in women without a current eating disorder. The mean duration of illness was 3.31 years (SD= 2.71 in patients suffering from anorexia nervosa and 5.10 years (SD= 2.92 in patients with bulimia nervosa, respectively Results. The BAT demonstrates satisfactory levels of internal reliabilit (α = 0.892. The BAT was strongly related with the Eating Disorder Inventory subscales: drive for thinness (r = 0.687, p < 0.001, ineffectiveness (r = 0.643, p < 0.001 and body dissatisfaction (r = 0.634, p < 0.001 and the Self-Esteem Inventory subscale: familial self-esteem (r = -0.601, p < 0.001. Discussion. The results support the criterion validity of the BAT which is able to differentiate clinical from non-clinical subjects. Conclusions. The Polish version of the BAT could be used as an appropriate measure assessing subjective attitude towards the body in women with and without eating disorders in Poland.

  12. Passing score and length of a mastery test: An old problem appraoched anew

    NARCIS (Netherlands)

    van der Linden, Willem J.

    1980-01-01

    A classical problem in mastery testing is the choice of passing score and test length so that the mastery decisions are optimal. This problem has been addressed several times from a variety of viewpoints. In this paper, the usual indifference zone approach is adopted, with a new criterion for optimi

  13. An Investigation of the Effectiveness of Vocabulary Learning Strategies on Iranian EFL Learners' Vocabulary Test Score

    Science.gov (United States)

    Rahimy, Ramin; Shams, Kiana

    2012-01-01

    This study aims to investigate the effectiveness of vocabulary learning strategies on Iranian EFL learners' vocabulary test score. To achieve this aim, fifty Intermediate level students from Kish English Institute were randomly selected from among fifteen classes after administering the Oxford Placement Test (OPT). Then, an intermediate level…

  14. The Disaggregation of Value-Added Test Scores to Assess Learning Outcomes in Economics Courses

    Science.gov (United States)

    Walstad, William B.; Wagner, Jamie

    2016-01-01

    This study disaggregates posttest, pretest, and value-added or difference scores in economics into four types of economic learning: positive, retained, negative, and zero. The types are derived from patterns of student responses to individual items on a multiple-choice test. The micro and macro data from the "Test of Understanding in College…

  15. Two for One: Using QAR to Increase Reading Comprehension and Improve Test Scores

    Science.gov (United States)

    Green, Susan

    2016-01-01

    This teaching tip describes an intervention used in a third-grade classroom implemented to help students pass an end-of-grade reading comprehension test. Low scores on a practice end-of-grade comprehension test prompted a re-examination of classroom reading instruction and a plan for intervention. This teaching tip describes the phases implemented…

  16. Zertifikat Deutsch als Fremdsprache and the Oral Proficiency Interview: A Comparison of Test Scores and Examinations.

    Science.gov (United States)

    Lalande, John F.; Schweckendiek, Jurgen

    1986-01-01

    Investigates what correlations might exist between an individual's score on the Zertifikat Deutsch als Fremdsprache and on the Oral Proficiency Interview. The tests themselves are briefly described. Results indicate that the two tests appear to correlate well in their evaluation of speaking skills. (SED)

  17. A review of methods for evaluating the fit of item score patterns on a test

    NARCIS (Netherlands)

    Meijer, R.R.; Sijtsma, Klaas

    1999-01-01

    Methods are discussed that can be used to investigate the fit of an item score pattern to a test model. Model-based tests and personality inventories are administered to more than 100 million people a year and, as a result, individual fit is of great concern. Item Response Theory (IRT) modeling and

  18. The Effects of Using Selected Metacognitive Strategies on ACT Mathematics Sub-Test Scores

    Science.gov (United States)

    LeMay, Jeffrey W.

    2016-01-01

    This quasi-experimental post-test only control group designed quantitative study examined whether or not members of an experimental group of participants who utilized two metacognitive strategy training regimens experienced a significant increase in their ACT mathematics sub-test scores compared to a group of students who did not utilize either of…

  19. Zertifikat Deutsch als Fremdsprache and the Oral Proficiency Interview: A Comparison of Test Scores and Examinations.

    Science.gov (United States)

    Lalande, John F.; Schweckendiek, Jurgen

    1986-01-01

    Investigates what correlations might exist between an individual's score on the Zertifikat Deutsch als Fremdsprache and on the Oral Proficiency Interview. The tests themselves are briefly described. Results indicate that the two tests appear to correlate well in their evaluation of speaking skills. (SED)

  20. Beating the Odds: A Low Equalized Assessed Valuation Elementary School with High Standardized Test Scores

    Science.gov (United States)

    Levin, Brian

    2011-01-01

    This mixed methods study examines what makes Bluffview Elementary School a success as measured by the ISAT, the mandated state test of Illinois. Despite national reports of achievement gaps and low test scores, Bluffview Elementary has shown sustained success in educating children. This paper reviews how Bluffview Elementary students are achieving…

  1. A review of methods for evaluating the fit of item score patterns on a test

    NARCIS (Netherlands)

    Meijer, Rob R.; Sijtsma, Klaas

    1999-01-01

    Methods are discussed that can be used to investigate the fit of an item score pattern to a test model. Model-based tests and personality inventories are administered to more than 100 million people a year and, as a result, individual fit is of great concern. Item Response Theory (IRT) modeling and

  2. The Disaggregation of Value-Added Test Scores to Assess Learning Outcomes in Economics Courses

    Science.gov (United States)

    Walstad, William B.; Wagner, Jamie

    2016-01-01

    This study disaggregates posttest, pretest, and value-added or difference scores in economics into four types of economic learning: positive, retained, negative, and zero. The types are derived from patterns of student responses to individual items on a multiple-choice test. The micro and macro data from the "Test of Understanding in College…

  3. Predicting Teacher Performance with Test Scores and Grade Point Average: A Meta-Analysis

    Science.gov (United States)

    D'Agostino, Jerome V.; Powers, Sonya J.

    2009-01-01

    A meta-analysis was conducted to examine the degree to which teachers' test scores and their performance in preparation programs as measured by their collegiate grade point average (GPA) predicted their teaching competence. Results from 123 studies that yielded 715 effect sizes were analyzed, and the mediating effects of test and GPA type,…

  4. The Effects of Testing Circumstance and Education Level on MMPI-2 Correction Scale Scores

    Science.gov (United States)

    2010-02-01

    Report The Effects of Testing Circumstance and Education Level on MMPI - 2 Correction Scale Scores DOT/FAA/AM-10/3 Office of Aerospace Medicine Washington...Education Level on MMPI - 2 Correction Scale Scores 6. Performing Organization Code 7. Author(s) 8. Performing Organization Report No...Inventory- 2 ( MMPI - 2 ) is used by the Federal Aviation Administration to assess psychopathology in air traffic control specialist applicants after a

  5. Sex differences on Purpose-In-Life Test total and factorial scores among spanish undergraduates

    OpenAIRE

    2011-01-01

    The aim of this paper is to analyze the differences on Purpose-In-Life Test [PIL] (Crumbaugh & Maholic, 1969) total and factorial scores associated to sex, among 309 spanish undergratudates (207 women, 102 men), range 18 to 45 years. PIL Spanish version is used (Noblejas de la Flor, 1994). PIL evalues life meaning achievement vs. existential vacuum. Women achieve higher means on PIL total and factorial scores, and estatistical analysis show that sex is significantly associated to total PIL sc...

  6. Correlation of Simulation Examination to Written Test Scores for Advanced Cardiac Life Support Testing: Prospective Cohort Study

    Directory of Open Access Journals (Sweden)

    Suzanne L. Strom

    2015-11-01

    Full Text Available Introduction: Traditional Advanced Cardiac Life Support (ACLS courses are evaluated using written multiple-choice tests. High-fidelity simulation is a widely used adjunct to didactic content, and has been used in many specialties as a training resource as well as an evaluative tool. There are no data to our knowledge that compare simulation examination scores with written test scores for ACLS courses. Objective: To compare and correlate a novel high-fidelity simulation-based evaluation with traditional written testing for senior medical students in an ACLS course. Methods: We performed a prospective cohort study to determine the correlation between simulationbased evaluation and traditional written testing in a medical school simulation center. Students were tested on a standard acute coronary syndrome/ventricular fibrillation cardiac arrest scenario. Our primary outcome measure was correlation of exam results for 19 volunteer fourth-year medical students after a 32-hour ACLS-based Resuscitation Boot Camp course. Our secondary outcome was comparison of simulation-based vs. written outcome scores. Results: The composite average score on the written evaluation was substantially higher (93.6% than the simulation performance score (81.3%, absolute difference 12.3%, 95% CI [10.6-14.0%], p<0.00005. We found a statistically significant moderate correlation between simulation scenario test performance and traditional written testing (Pearson r=0.48, p=0.04, validating the new evaluation method. Conclusion: Simulation-based ACLS evaluation methods correlate with traditional written testing and demonstrate resuscitation knowledge and skills. Simulation may be a more discriminating and challenging testing method, as students scored higher on written evaluation methods compared to simulation.

  7. The Impact of Correction for Guessing Formula on MC and Yes/No Vocabulary Tests' Scores

    Directory of Open Access Journals (Sweden)

    abdollah baradaran

    2009-10-01

    Full Text Available A standard correction for random guessing (cfg formula on multiple-choice and Yes/Noexaminations was examined retrospectively in the scores of the intermediate female EFL learners in an English language school. The correctionwas a weighting formula for points awarded for correct answers,incorrect answers, and unanswered questions so that the expectedvalue of the increase in test score due to guessing was zero. The researcher compared uncorrected and corrected scores on examinationsusing multiple-choice and Yes/No formats. These short-answer formats eliminatedor at least greatly reduced the potential for guessing the correctanswer. The expectation for students to improve their grade by guessingon multiple-choice and Yes/No format examinations is well known. The researcher examined a method for correcting for random guessing (cfg " no knowledge" on multiple- choice and Yes/No vocabulary examinations by comparing application and non-application of correction for guessing (cfg formula on scores on these examinations. It was done to determine whether the test takers really knew the correct answer, or they had resorted to a kind of guessing. This study represented a unique opportunity to compare scores from multiple-choice and Yes/No examinations in a settingin which students were given the same number of questions ineach of the two format types testing their knowledge over thesame subject matter. The results of this study indicated that the significant differences were highlighted between the subjects' scores when cfg formula was applied and when it was not.

  8. Deriving utility scores for co-morbid conditions: a test of the multiplicative model for combining individual condition scores

    Directory of Open Access Journals (Sweden)

    Le Petit Christel

    2006-10-01

    Full Text Available Abstract Background The co-morbidity of health conditions is becoming a significant health issue, particularly as populations age, and presents important methodological challenges for population health research. For example, the calculation of summary measures of population health (SMPH can be compromised if co-morbidity is not taken into account. One popular co-morbidity adjustment used in SMPH computations relies on a straightforward multiplicative combination of the severity weights for the individual conditions involved. While the convenience and simplicity of the multiplicative model are attractive, its appropriateness has yet to be formally tested. The primary objective of the current study was therefore to examine the empirical evidence in support of this approach. Methods The present study drew on information on the prevalence of chronic conditions and a utility-based measure of health-related quality of life (HRQoL, namely the Health Utilities Index Mark 3 (HUI3, available from Cycle 1.1 of the Canadian Community Health Survey (CCHS; 2000–01. Average HUI3 scores were computed for both single and co-morbid conditions, and were also purified by statistically removing the loss of functional health due to health problems other than the chronic conditions reported. The co-morbidity rule was specified as a multiplicative combination of the purified average observed HUI3 utility scores for the individual conditions involved, with the addition of a synergy coefficient s for capturing any interaction between the conditions not explained by the product of their utilities. The fit of the model to the purified average observed utilities for the co-morbid conditions was optimized using ordinary least squares regression to estimate s. Replicability of the results was assessed by applying the method to triple co-morbidities from the CCHS cycle 1.1 database, as well as to double and triple co-morbidities from cycle 2.1 of the CCHS (2003–04. Results

  9. Measuring Nurses' Value, Implementation, and Knowledge of Evidence-Based Practice: Further Psychometric Testing of the Quick-EBP-VIK Survey.

    Science.gov (United States)

    Connor, Linda; Paul, Fiona; McCabe, Margaret; Ziniel, Sonja

    2017-02-01

    The Quick-EBP-VIK is a new instrument for measuring nurses' value, implementation, and knowledge of EBP. Psychometric testing was conducted in two parts. Part 1 describes the tool development and validity testing which resulted in the development of a 25-item survey after receiving ≥0.80 Item-Level Content Validity Index for both clarity and relevance. Part 2 describes psychometric testing was necessary to assess additional types of validity and reliability. The purpose of this paper is to further describe the psychometric testing of the Quick-EBP-VIK survey instrument. This descriptive study was designed to assess test-retest reliability, internal consistency and construct validity via a web-based survey. The survey instrument was e-mailed to all nurses at the study hospital. Nurses who responded to the first survey (Wave 1) received another e-mail invitation to complete the survey instrument again (Wave 2) for the purpose of assessing the test-retest reliability of the instrument. A total of 1,177 deliverable e-mails were sent to all nursing staff at one free standing pediatric hospital with Magnet(®) designation in the northeast. A total of 382 nurses returned completed surveys, indicating a 32.5% response rate for Wave 1. A total of 131 nurses responded to Wave 2 indicating a response rate of 34.3%. The intraclass correlation coefficients for the items included in the final instrument ranged from 0.43 to 0.80 and were deemed sufficient. These represent a sufficient intraclass correlation coefficient. The Cronbach's Alpha values for each of the three domains are all higher than 0.7 indicating that the items of each of the measurement dimension are internally consistent. However, the composite reliability of the third domain was slightly lower than 0.7 when using Raykov's Rho. The Quick-EBP-VIK instrument has gone through rigorous comprehensive testing and has demonstrated good psychometric properties. © 2016 Sigma Theta Tau International.

  10. A score based on screening tests to differentiate mild cognitive impairment from subjective memory complaints

    Directory of Open Access Journals (Sweden)

    Fábio Henrique de Gobbi Porto

    2013-09-01

    Full Text Available It is not easy to differentiate patients with mild cognitive impairment (MCI from subjective memory complainers (SMC. Assessments with screening cognitive tools are essential, particularly in primary care where most patients are seen. The objective of this study was to evaluate the diagnostic accuracy of screening cognitive tests and to propose a score derived from screening tests. Elderly subjects with memory complaints were evaluated using the Mini Mental State Examination (MMSE and the Brief Cognitive Battery (BCB. We added two delayed recalls in the MMSE (a delayed recall and a late-delayed recall, LDR, and also a phonemic fluency test of letter P fluency (LPF. A score was created based on these tests. The diagnoses were made on the basis of clinical consensus and neuropsychological testing. Receiver operating characteristic curve analyses were used to determine area under the curve (AUC, the sensitivity and specificity for each test separately and for the final proposed score. MMSE, LDR, LPF and delayed recall of BCB scores reach statistically significant differences between groups (P=0.000, 0.03, 0.001 and 0.01, respectively. Sensitivity, specificity and AUC were MMSE: 64%, 79% and 0.75 (cut off <29; LDR: 56%, 62% and 0.62 (cut off <3; LPF: 71%, 71% and 0.71 (cut off <14; delayed recall of BCB: 56%, 82% and 0.68 (cut off <9. The proposed score reached a sensitivity of 88% and 76% and specificity of 62% and 75% for cut off over 1 and over 2, respectively. AUC were 0.81. In conclusion, a score created from screening tests is capable of discriminating MCI from SMC with moderate to good accurancy.

  11. The Effect of School Poverty on Racial Gaps in Tests Scores: The Case of the Minnesota Basic Standards Tests

    Science.gov (United States)

    Myers, Samuel L.; Kim, Hyeoneui; Mandala, Cheryl

    2004-01-01

    A data from 1996,1998 and 1999 Minnesota comprehensive statewide testing on eight graders is used to analyze whether African American students perform worse than the white students who attend the poverty schools. The analyses conclude that African American-White test score gap is attributed more to the racial discriminations and racial treatments…

  12. The Effects of Group Members' Personalities on a Test Taker's L2 Group Oral Discussion Test Scores

    Science.gov (United States)

    Ockey, Gary J.

    2009-01-01

    The second language group oral is a test of second language speaking proficiency, in which a group of three or more English language learners discuss an assigned topic without interaction with interlocutors. Concerns expressed about the extent to which test takers' personal characteristics affect the scores of others in the group have limited its…

  13. The Effect of School Poverty on Racial Gaps in Tests Scores: The Case of the Minnesota Basic Standards Tests

    Science.gov (United States)

    Myers, Samuel L.; Kim, Hyeoneui; Mandala, Cheryl

    2004-01-01

    A data from 1996,1998 and 1999 Minnesota comprehensive statewide testing on eight graders is used to analyze whether African American students perform worse than the white students who attend the poverty schools. The analyses conclude that African American-White test score gap is attributed more to the racial discriminations and racial treatments…

  14. Development of and psychometric testing for the Brief Pain Inventory-Facial in patients with facial pain syndromes.

    Science.gov (United States)

    Lee, John Y K; Chen, H Isaac; Urban, Christopher; Hojat, Anahita; Church, Ephraim; Xie, Sharon X; Farrar, John T

    2010-09-01

    have sound psychometric properties and is responsive to differences between classic and atypical TN. Future studies must assess the instrument's test-retest reliability, validity in additional populations, and responsiveness with respect to changes in patient outcomes following neurosurgical interventions and medical therapies.

  15. Impact of a standardized test package on exit examination scores and NCLEX-RN outcomes.

    Science.gov (United States)

    Homard, Catherine M

    2013-03-01

    The purpose of this ex post facto correlational study was to compare exit examination scores and NCLEX-RN(®) pass rates of baccalaureate nursing students who differed in level of participation in a standardized test package. Three cohort groups emerged as a standardized test package was introduced: (a) students who did not participate in a standardized test package; (b) students with two semesters of a standardized test package; and (c) students with four semesters of a standardized test package. Benner's novice-to-expert theory framed the study in the belief that students best acquire knowledge and skills through practice and reflection. Students participating in four semesters of a standardized test package demonstrated higher exit examination scores and NCLEX-RN pass rates compared with students who did not participate in this package. This study's results could inform nurse educators about strategies to facilitate nursing student success on exit examinations and the NCLEX-RN.

  16. Do We Really Become Smarter When Our Fluid-Intelligence Test Scores Improve?

    Science.gov (United States)

    Hayes, Taylor R; Petrov, Alexander A; Sederberg, Per B

    2015-01-01

    Recent reports of training-induced gains on fluid intelligence tests have fueled an explosion of interest in cognitive training-now a billion-dollar industry. The interpretation of these results is questionable because score gains can be dominated by factors that play marginal roles in the scores themselves, and because intelligence gain is not the only possible explanation for the observed control-adjusted far transfer across tasks. Here we present novel evidence that the test score gains used to measure the efficacy of cognitive training may reflect strategy refinement instead of intelligence gains. A novel scanpath analysis of eye movement data from 35 participants solving Raven's Advanced Progressive Matrices on two separate sessions indicated that one-third of the variance of score gains could be attributed to test-taking strategy alone, as revealed by characteristic changes in eye-fixation patterns. When the strategic contaminant was partialled out, the residual score gains were no longer significant. These results are compatible with established theories of skill acquisition suggesting that procedural knowledge tacitly acquired during training can later be utilized at posttest. Our novel method and result both underline a reason to be wary of purported intelligence gains, but also provide a way forward for testing for them in the future.

  17. Grades--Scores--Predictions: A Study of the Efficiency of High School Grades and American College Test Scores in Predicting Academic Achievement at Montgomery College.

    Science.gov (United States)

    Gell, Robert L.; Bleil, David F.

    This report analyzes the relationship between high school grades, American College Test (ACT) scores, and first-semester college grades. Based on the Standard Research Service of the ACT program, 1,379 students in the fall 1969 freshman class of Montgomery College (Maryland) were studied. Measures of academic background used ACT scores in English,…

  18. Individual differences in left parietal white matter predict math scores on the Preliminary Scholastic Aptitude Test.

    Science.gov (United States)

    Matejko, Anna A; Price, Gavin R; Mazzocco, Michèle M M; Ansari, Daniel

    2013-02-01

    Mathematical skills are of critical importance, both academically and in everyday life. Neuroimaging research has primarily focused on the relationship between mathematical skills and functional brain activity. Comparatively few studies have examined which white matter regions support mathematical abilities. The current study uses diffusion tensor imaging (DTI) to test whether individual differences in white matter predict performance on the math subtest of the Preliminary Scholastic Aptitude Test (PSAT). Grades 10 and 11 PSAT scores were obtained from 30 young adults (ages 17-18) with wide-ranging math achievement levels. Tract based spatial statistics was used to examine the correlation between PSAT math scores, fractional anisotropy (FA), radial diffusivity (RD) and axial diffusivity (AD). FA in left parietal white matter was positively correlated with math PSAT scores (specifically in the left superior longitudinal fasciculus, left superior corona radiata, and left corticospinal tract) after controlling for chronological age and same grade PSAT critical reading scores. Furthermore, RD, but not AD, was correlated with PSAT math scores in these white matter microstructures. The negative correlation with RD further suggests that participants with higher PSAT math scores have greater white matter integrity in this region. Individual differences in FA and RD may reflect variability in experience dependent plasticity over the course of learning and development. These results are the first to demonstrate that individual differences in white matter are associated with mathematical abilities on a nationally administered scholastic aptitude measure.

  19. The claustrophobia scale: a psychometric evaluation.

    Science.gov (United States)

    Ost, Lars-Göran

    2007-05-01

    This article presents a psychometric evaluation of the Claustrophobia Scale (CS), consisting of one subscale for measuring anxiety (20 items, 0-4) and one for avoidance (18 items, 0-2). Participants were 87 claustrophobic patients and 200 normal controls randomly selected from the community. The results show that CS has excellent internal consistency, high test-retest reliability, concurrent and discriminant validity. The patients and controls differ significantly on the total scores of anxiety and avoidance, as well as on each individual item scores. The CS was also found to be sensitive to change after cognitive behavioral treatment. Preliminary factor analyses yielded two factors for each subscale; "Being in small enclosed spaces" and "Other people present", accounting for large proportions of the variance. The CS is useful both as a state, and as an outcome self-report measure of claustrophobia.

  20. A Score Type Test for General Autoregressive Models in Time Series

    Institute of Scientific and Technical Information of China (English)

    Jian-hong Wu; Li-xing Zhu

    2007-01-01

    This paper is devoted to the goodness-of-fit test for the general autoregressive models in time series. By averaging for the weighted residuals, we construct a score type test which is asymptotically standard chi-squared under the null and has some desirable power properties under the alternatives. Specifically, the test is sensitive to alternatives and can detect the alternatives approaching, along a direction, the null at a rate that is arbitrarily close to n-1/2. Furthermore, when the alternatives are not directional, we construct asymptotically distribution-free maximin tests for a large class of alternatives. The performance of the tests is evaluated through simulation studies.

  1. Psychometric properties of the Fagerström Test for Nicotine Dependence As propriedades psicométricas do Teste de Fagerström para Dependência de Nicotina

    Directory of Open Access Journals (Sweden)

    Izilda Carolina de Meneses-Gaya

    2009-01-01

    Full Text Available OBJECTIVE: The Fagerström Test for Nicotine Dependence (FTND is a screening instrument for physical nicotine dependence and is extensively used in various countries. The objective of the present report was to review articles related to the psychometric properties of the FTND. METHODS: A systematic search for articles published up through December of 2007 was carried out in various electronic databases. The following search terms were used: "Fagerström Test for Nicotine Dependence"; "FTND"; "psychometric"; "validity"; "reliability"; "feasibility"; and "factors". We included articles published in English, Spanish or Portuguese and in which the psychometric properties of the FTND were evaluated. RESULTS: Twenty-six studies related to the psychometric properties of the FTND were identified in the indexed literature. Analysis of the studies confirmed the reliability of the FTND for the assessment of nicotine dependence in different settings and populations. CONCLUSIONS: Further validation studies using previously validated instruments as a comparative measure are needed before the extensive use of the FTND can be justified on the basis of its psychometric qualities.OBJETIVO: O Fagerström Test for Nicotine Dependence (FTND, Teste de Fagerström para Dependência de Nicotina é um instrumento de rastreamento para dependência física de tabaco, amplamente utilizado em diversos países. Objetivou-se realizar uma revisão de artigos relacionados às propriedades psicométricas do FTND. MÉTODOS: Uma busca sistemática foi realizada usando-se vários indexadores eletrônicos até dezembro de 2007, com os seguintes descritores: "Fagerström Test for Nicotine Dependence"; "FTND"; "psychometric"; "validity"; "reliability"; "feasibility"; e "factors". Foram incluídos os artigos relacionados à avaliação das propriedades psicométricas do FTND publicados em inglês, espanhol e português. RESULTADOS: Vinte e seis estudos relativos às propriedades psicom

  2. The Fight's Not Always Fixed: Using Literary Response to Transcend Standardized Test Scores

    Science.gov (United States)

    Avila, JuliAnna

    2012-01-01

    In 2004, the National Endowment for the Arts (NEA) concluded that "literature reading is fading as a meaningful activity, especially among younger people." How can educators continue to teach students about the power of literary response when the priority is for them to achieve proficiency on standardized tests, whose scores can only be narrowly…

  3. The Effects of Family Background, Test Scores, Personality Traits and Education on Economic Success.

    Science.gov (United States)

    Jencks, Christopher; Rainwater, Lee

    Ten surveys of American men aged 25-64 were analyzed to determine the effects of family background, adolescent personality traits, cognitive test scores, and years of schooling on occupational status and earnings in maturity. Some of the findings follow: Data on brothers indicated that prior research has underestimated the effect of family…

  4. Integrating GIS in the Middle School Curriculum: Impacts on Diverse Students' Standardized Test Scores

    Science.gov (United States)

    Goldstein, Donna; Alibrandi, Marsha

    2013-01-01

    This case study conducted with 1,425 middle school students in Palm Beach County, Florida, included a treatment group receiving GIS instruction (256) and a control group without GIS instruction (1,169). Quantitative analyses on standardized test scores indicated that inclusion of GIS in middle school curriculum had a significant effect on student…

  5. End of Course Grades and Standardized Test Scores: Are Grades Predictive of Student Achievement?

    Science.gov (United States)

    Ricketts, Christine R.

    2010-01-01

    This study examined the extent to which end-of-course grades are predictive of Virginia Standards of Learning test scores in nine high school content areas. It also analyzed the impact of the variables school cluster attended, gender, ethnicity, disability status, Limited English Proficiency status, and socioeconomic status on the relationship…

  6. Score test for familial aggregation in probands studies: application to Alzheimer's disease

    NARCIS (Netherlands)

    D. Commenges; H. Jacqmin; L. Letenneur; C.M. van Duijn (Cock)

    1995-01-01

    textabstractWhen studying familial aggregation of a disease, the following two-stage design is often used: first select index subjects (cases and controls); then record data on their relatives. The likelihood corresponding to this design is derived and a score test of homogeneity is proposed for tes

  7. The Fight's Not Always Fixed: Using Literary Response to Transcend Standardized Test Scores

    Science.gov (United States)

    Avila, JuliAnna

    2012-01-01

    In 2004, the National Endowment for the Arts (NEA) concluded that "literature reading is fading as a meaningful activity, especially among younger people." How can educators continue to teach students about the power of literary response when the priority is for them to achieve proficiency on standardized tests, whose scores can only be narrowly…

  8. Relationship of Friends, Physical Education, and State Test Scores: Implications for School Counselors

    Science.gov (United States)

    Hollingsworth, Mary Ann

    2010-01-01

    This study examined the relationship between dimensions of wellness and academic performance for 634 third through fifth grade students in Title One schools in rural Mississippi, using composites of the Five Factor Wellness Inventory for Elementary Children and Reading, Language, and Math Scores of the Mississippi Curriculum Test (a state level…

  9. Permanent Income and the Black-White Test Score Gap. NBER Working Paper No. 17610

    Science.gov (United States)

    Rothstein, Jesse; Wozny, Nathan

    2011-01-01

    Analysts often examine the black-white test score gap conditional on family income. Typically only a current income measure is available. We argue that the gap conditional on permanent income is of greater interest, and we describe a method for identifying this gap using an auxiliary data set to estimate the relationship between current and…

  10. Using College Admission Test Scores to Clarify High School Placement. Leading Indicator Spotlight

    Science.gov (United States)

    Flug, Susanna

    2010-01-01

    In "Beyond Test Scores: Leading Indicators for Education," Foley and colleagues (2008) define leading indicators as those that "provide early signals of progress toward academic achievement" (p. 1) and stress that educators "need leading indicators to help them see the direction their efforts are going in and to take…

  11. Raise Test Scores without Selling Your Soul: An Interview with Scott Mandel

    Science.gov (United States)

    Curriculum Review, 2006

    2006-01-01

    With his 10th book, Improving Test Scores: A Practical Approach for Teachers and Administrators, Scott Mandel outlines steps educators can take to boost achievement on standardized exams while maintaining the integrity of their day-to-day teaching. Mandel, who holds a Ph.D. in curriculum and instruction from USC, teaches history and English at…

  12. Detecting Dissimulation in Personality Test Scores: A Comparison between Person-Fit Indices and Detection Scales.

    Science.gov (United States)

    Ferrando, Pere J.; Chico, Eliseo

    2001-01-01

    Examined whether a procedure based on item response theory (IRT) for assessing the scalability of response patterns could detect deliberate dissimulation (faking good) on scores from three tests of the Eysenck Personality Questionnaire Revised. Results for 489 and 140 undergraduates show that IRT measures were not powerful enough to detect…

  13. Testing Vegetation Flammability: The Problem of Extremely Low Ignition Frequency and Overall Flammability Score

    Directory of Open Access Journals (Sweden)

    Zorica Kauf

    2014-01-01

    Full Text Available In the recent decades changes in fire regimes led to higher vulnerability of fire prone ecosystems, with vegetation being the only component influencing fire regime which can be managed in order to reduce probability of extreme fire events. For these management practices to be effective reliable information on the vegetation flammability is being crucial. Epiradiator based testing methods are one of the methods commonly used to investigate vegetation flammability and decrease in ignition frequency is always interpreted as a decrease in flammability. Furthermore, gathered information is often combined into a single flammability score. Here we present results of leaf litter testing which, together with previously conducted research on similar materials, show that material with very low ignition frequency under certain testing conditions can be extremely flammable if testing conditions are slightly changed. Additionally, our results indicate that combining measured information into one single flammability score, even though sometimes useful, is not always meaningful and should be performed with caution.

  14. Rey's Auditory Verbal Learning Test scores can be predicted from whole brain MRI in Alzheimer's disease

    Directory of Open Access Journals (Sweden)

    Elaheh Moradi

    2017-01-01

    Full Text Available Rey's Auditory Verbal Learning Test (RAVLT is a powerful neuropsychological tool for testing episodic memory, which is widely used for the cognitive assessment in dementia and pre-dementia conditions. Several studies have shown that an impairment in RAVLT scores reflect well the underlying pathology caused by Alzheimer's disease (AD, thus making RAVLT an effective early marker to detect AD in persons with memory complaints. We investigated the association between RAVLT scores (RAVLT Immediate and RAVLT Percent Forgetting and the structural brain atrophy caused by AD. The aim was to comprehensively study to what extent the RAVLT scores are predictable based on structural magnetic resonance imaging (MRI data using machine learning approaches as well as to find the most important brain regions for the estimation of RAVLT scores. For this, we built a predictive model to estimate RAVLT scores from gray matter density via elastic net penalized linear regression model. The proposed approach provided highly significant cross-validated correlation between the estimated and observed RAVLT Immediate (R = 0.50 and RAVLT Percent Forgetting (R = 0.43 in a dataset consisting of 806 AD, mild cognitive impairment (MCI or healthy subjects. In addition, the selected machine learning method provided more accurate estimates of RAVLT scores than the relevance vector regression used earlier for the estimation of RAVLT based on MRI data. The top predictors were medial temporal lobe structures and amygdala for the estimation of RAVLT Immediate and angular gyrus, hippocampus and amygdala for the estimation of RAVLT Percent Forgetting. Further, the conversion of MCI subjects to AD in 3-years could be predicted based on either observed or estimated RAVLT scores with an accuracy comparable to MRI-based biomarkers.

  15. The reliability and validity of qualitative scores for the Controlled Oral Word Association Test.

    Science.gov (United States)

    Ross, Thomas P; Calhoun, Emily; Cox, Tara; Wenner, Carolyn; Kono, Whitney; Pleasant, Morgan

    2007-05-01

    The reliability and validity of two qualitative scoring systems for the Controlled Oral Word Association Test [Benton, A. L., Hamsher, de S. K., & Sivan, A. B. (1983). Multilingual aplasia examination (2nd ed.). Iowa City, IA: AJA Associates] were examined in 108 healthy young adults. The scoring systems developed by Troyer et al. [Troyer, A. K., Moscovich, M., & Winocur, G. (1997). Clustering and switching as two components of verbal fluency: Evidence from younger and older healthy adults. Neuropsychology, 11, 138-146] and by Abwender et al. [Abwender, D. A., Swan, J. G., Bowerman, J. T., & Connolly, S. W. (2001a). Qualitative analysis of verbal fluency output: Review and comparison of several scoring methods. Assessment, 8, 323-336] each demonstrated excellent interrater reliability (all indices at or above r(icc)=.9). Consistent with previous research [e.g., Ross, T. P. (2003). The reliability of cluster and switch scores for the COWAT. Archives of Clinical Psychology, 18, 153-164), test-retest reliability coefficients (N=53; M interval 44.6 days) for the qualitative scores were modest to poor (r(icc)=.6 to .4 range). Correlations among COWAT scores, measures of executive functioning, verbal learning, working memory, and vocabulary were examined. The idea that qualitative scores represent distinct executive functions such as cognitive flexibility or strategy utilization was not supported. We offer the interpretation that COWAT performance may require the ability to retrieve words in a non-routine manner while suppressing habitual responses and associated processing interference, presumably due to a spread of activation across semantic or lexical networks. This interpretation, though speculative at present, implies that clustering and switching on the COWAT may not be entirely deliberate, but rather an artifact of a passive (i.e., state-dependent) process. Ideas for future research, most noticeably experimental studies using cognitive methods (e.g., priming), are

  16. Rey's Auditory Verbal Learning Test scores can be predicted from whole brain MRI in Alzheimer's disease.

    Science.gov (United States)

    Moradi, Elaheh; Hallikainen, Ilona; Hänninen, Tuomo; Tohka, Jussi

    2017-01-01

    Rey's Auditory Verbal Learning Test (RAVLT) is a powerful neuropsychological tool for testing episodic memory, which is widely used for the cognitive assessment in dementia and pre-dementia conditions. Several studies have shown that an impairment in RAVLT scores reflect well the underlying pathology caused by Alzheimer's disease (AD), thus making RAVLT an effective early marker to detect AD in persons with memory complaints. We investigated the association between RAVLT scores (RAVLT Immediate and RAVLT Percent Forgetting) and the structural brain atrophy caused by AD. The aim was to comprehensively study to what extent the RAVLT scores are predictable based on structural magnetic resonance imaging (MRI) data using machine learning approaches as well as to find the most important brain regions for the estimation of RAVLT scores. For this, we built a predictive model to estimate RAVLT scores from gray matter density via elastic net penalized linear regression model. The proposed approach provided highly significant cross-validated correlation between the estimated and observed RAVLT Immediate (R = 0.50) and RAVLT Percent Forgetting (R = 0.43) in a dataset consisting of 806 AD, mild cognitive impairment (MCI) or healthy subjects. In addition, the selected machine learning method provided more accurate estimates of RAVLT scores than the relevance vector regression used earlier for the estimation of RAVLT based on MRI data. The top predictors were medial temporal lobe structures and amygdala for the estimation of RAVLT Immediate and angular gyrus, hippocampus and amygdala for the estimation of RAVLT Percent Forgetting. Further, the conversion of MCI subjects to AD in 3-years could be predicted based on either observed or estimated RAVLT scores with an accuracy comparable to MRI-based biomarkers.

  17. Effects of Classroom Ventilation Rate and Temperature on Students' Test Scores.

    Directory of Open Access Journals (Sweden)

    Ulla Haverinen-Shaughnessy

    Full Text Available Using a multilevel approach, we estimated the effects of classroom ventilation rate and temperature on academic achievement. The analysis is based on measurement data from a 70 elementary school district (140 fifth grade classrooms from Southwestern United States, and student level data (N = 3109 on socioeconomic variables and standardized test scores. There was a statistically significant association between ventilation rates and mathematics scores, and it was stronger when the six classrooms with high ventilation rates that were indicated as outliers were filtered (> 7.1 l/s per person. The association remained significant when prior year test scores were included in the model, resulting in less unexplained variability. Students' mean mathematics scores (average 2286 points were increased by up to eleven points (0.5% per each liter per second per person increase in ventilation rate within the range of 0.9-7.1 l/s per person (estimated effect size 74 points. There was an additional increase of 12-13 points per each 1°C decrease in temperature within the observed range of 20-25°C (estimated effect size 67 points. Effects of similar magnitude but higher variability were observed for reading and science scores. In conclusion, maintaining adequate ventilation and thermal comfort in classrooms could significantly improve academic achievement of students.

  18. A Psychometric Analysis of the Reading the Mind in the Eyes Test: Towards a Brief Form for Research and Applied Settings

    Directory of Open Access Journals (Sweden)

    Sally eOlderbak

    2015-10-01

    Full Text Available The Reading the Mind in the Eyes Test is a popular measure of individual differences in Theory of Mind that is often applied in the assessment of particular clinical populations (primarily, individuals on the autism spectrum. However, little is known about the test’s psychometric properties, including factor structure, internal consistency, and convergent validity evidence. We present a psychometric analysis of the test followed by an evaluation of other empirically proposed and statistically identified structures. We identified, and cross-validated in a second sample, an adequate short-form solution that is homogeneous with adequate internal consistency, and is moderately related to Cognitive Empathy, Emotion Perception, and strongly related to Vocabulary. We recommend the use of this short-form solution in normal adults as a more precise measure over the original version. Future revisions of the test should seek to reduce the test’s reliance on one’s vocabulary and evaluate the short-form structure in clinical populations.

  19. The effect of hearing impairment in older people on the spouse: development and psychometric testing of the significant other scale for hearing disability (SOS-HEAR).

    Science.gov (United States)

    Scarinci, Nerina; Worrall, Linda; Hickson, Louise

    2009-01-01

    The effects of hearing impairment on the person with the impairment and on their significant others are pervasive and affect the quality of life for all involved. The effect of hearing impairment on significant others is known as a third-party disability. This study aimed to develop and psychometrically test a scale to measure the third-party disability experienced by spouses of older people with hearing impairment. The Significant Other Scale for Hearing Disability (SOS-HEAR) was based on results of a previous qualitative study investigating the effect of hearing impairment on a spouse's everyday life. Psychometric testing with 100 spouses was conducted using item analysis, Cronbach's alpha, factor analysis, and test-retest reliability. Principal components analysis identified six key underlying factors. A combined set of 27 items was found to be reliable (alpha = 0.94), with weighted kappa for items ranging from fair to very good. The SOS-HEAR is a brief, easy to administer instrument that has evidence of reliability and validity. The SOS-HEAR could serve as a means of identifying spouses of older people with hearing impairment in need of intervention, directed towards either the couple or the spouse alone.

  20. The Relationships between Social Class, Listening Test Anxiety and Test Scores

    OpenAIRE

    Omid Talebi Rezaabadi

    2016-01-01

    This study investigated the relationships between the social anxiety, social class and listening-test anxiety of students learning English as a foreign language. The aims of the study were to examine the relationship between listening-test anxiety and listening-test performance. The data were collected using an adapted Foreign Language Listening Anxiety Scale and a newly developed Foreign Language Social Anxiety Scale. The potential correlation between social anxiety and listening-test perfor...

  1. The Relationships between Social Class, Listening Test Anxiety and Test Scores

    OpenAIRE

    Omid Talebi Rezaabadi

    2016-01-01

    This study investigated the relationships between the social anxiety, social class and listening-test anxiety of students learning English as a foreign language. The aims of the study were to examine the relationship between listening-test anxiety and listening-test performance. The data were collected using an adapted Foreign Language Listening Anxiety Scale and a newly developed Foreign Language Social Anxiety Scale. The potential correlation between social anxiety and listening-test perfor...

  2. Identifying genetic marker sets associated with phenotypes via an efficient adaptive score test

    KAUST Repository

    Cai, T.

    2012-06-25

    In recent years, genome-wide association studies (GWAS) and gene-expression profiling have generated a large number of valuable datasets for assessing how genetic variations are related to disease outcomes. With such datasets, it is often of interest to assess the overall effect of a set of genetic markers, assembled based on biological knowledge. Genetic marker-set analyses have been advocated as more reliable and powerful approaches compared with the traditional marginal approaches (Curtis and others, 2005. Pathways to the analysis of microarray data. TRENDS in Biotechnology 23, 429-435; Efroni and others, 2007. Identification of key processes underlying cancer phenotypes using biologic pathway analysis. PLoS One 2, 425). Procedures for testing the overall effect of a marker-set have been actively studied in recent years. For example, score tests derived under an Empirical Bayes (EB) framework (Liu and others, 2007. Semiparametric regression of multidimensional genetic pathway data: least-squares kernel machines and linear mixed models. Biometrics 63, 1079-1088; Liu and others, 2008. Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models. BMC bioinformatics 9, 292-2; Wu and others, 2010. Powerful SNP-set analysis for case-control genome-wide association studies. American Journal of Human Genetics 86, 929) have been proposed as powerful alternatives to the standard Rao score test (Rao, 1948. Large sample tests of statistical hypotheses concerning several parameters with applications to problems of estimation. Mathematical Proceedings of the Cambridge Philosophical Society, 44, 50-57). The advantages of these EB-based tests are most apparent when the markers are correlated, due to the reduction in the degrees of freedom. In this paper, we propose an adaptive score test which up- or down-weights the contributions from each member of the marker-set based on the Z-scores of

  3. The Relationships between Social Class, Listening Test Anxiety and Test Scores

    Science.gov (United States)

    Rezaabadi, Omid Talebi

    2016-01-01

    This study investigated the relationships between the social anxiety, social class and listening-test anxiety of students learning English as a foreign language. The aims of the study were to examine the relationship between listening-test anxiety and listening-test performance. The data were collected using an adapted Foreign Language Listening…

  4. Does Maximizing Information at the Cut Score Always Maximize Classification Accuracy and Consistency?

    Science.gov (United States)

    Wyse, Adam E.; Babcock, Ben

    2016-01-01

    A common suggestion made in the psychometric literature for fixed-length classification tests is that one should design tests so that they have maximum information at the cut score. Designing tests in this way is believed to maximize the classification accuracy and consistency of the assessment. This article uses simulated examples to illustrate…

  5. Admissions Testing at Career College and Trade School Training Programs. Test Score Guidelines, Norms, and Student Demographics.

    Science.gov (United States)

    Wonderlic, Charles F.; And Others

    This report provides a method for determining minimum score by vocational program based on the use of the Wonderlic Scholastic Level Exam (SLE). The SLE has been demonstrated to be a highly accurate and reliable measure of adult cognitive ability. It is currently in use as an admissions test at many career colleges and trade schools. The SLE test…

  6. Bayesian and Empirical Bayes Approaches to Setting Passing Scores on Mastery Tests. Publication Series in Mastery Testing.

    Science.gov (United States)

    Huynh, Huynh; Saunders, Joseph C., III

    The Bayesian approach to setting passing scores, as proposed by Swaminathan, Hambleton, and Algina, is compared with the empirical Bayes approach to the same problem that is derived from Huynh's decision-theoretic framework. Comparisons are based on simulated data which follow an approximate beta-binomial distribution and on real test results from…

  7. School Readiness and the Draw-a-Man Test: An Empiricaly Derived Alternative to Harris' Scoring System.

    Science.gov (United States)

    Simner, Marvin L.

    1985-01-01

    An abbreviated scoring system for the Goodenough-Harris Draw-A-Man Test found that three items had the same overall potential for correctly identifying at-risk kindergarteners as more time-consuming scoring methods. (CL)

  8. Hurlbert Index of Sexual Assertiveness: a study of psychometric properties in a Spanish sample.

    Science.gov (United States)

    Santos-Iglesias, Pablo; Sierra, Juan Carlos

    2010-08-01

    The study analyzed psychometric properties of a Spanish version of the Hurlbert Index of Sexual Assertiveness in a Spanish sample of 400 men and 453 women who had had a partner for the last 6 mo. or longer at the time of the study. Exploratory and confirmatory factor analyses suggested a two-factor solution with the factors Initiation and No shyness/Refusal. Internal consistency values for total scores were .87 and .83 for the factors, respectively. Convergent validity tests were also satisfactory. It is therefore reasonable to conclude that the Spanish version of the scale has appropriate psychometric properties.

  9. Spatial and verbal memory test scores following yoga and fine arts camps for school children.

    Science.gov (United States)

    Manjunath, N K; Telles, Shirley

    2004-07-01

    The performance scores of children (aged 11 to 16 years) in verbal and spatial memory tests were compared for two groups (n = 30, each), one attending a yoga camp and the other a fine arts camp. Both groups were assessed on the memory tasks initially and after ten days of their respective interventions. A control group (n = 30) was similarly studied to assess the test-retest effect. At the final assessment the yoga group showed a significant increase of 43% in spatial memory scores (Multivariate analysis, Tukey test), while the fine arts and control groups showed no change. The results suggest that yoga practice, including physical postures, yoga breathing, meditation and guided relaxation improved delayed recall of spatial information.

  10. The Twelve Promises of Alcoholics Anonymous: psychometric measure validation and mediational testing as a 12-step specific mechanism of behavior change.

    Science.gov (United States)

    Kelly, John F; Greene, M Claire

    2013-12-01

    Empirical support for the recovery utility of 12-step mutual-help organizations (MHOs) has led to increased investigation of how such organizations confer benefit. The Twelve Promises of Alcoholics Anonymous (AA) feature prominently in 12-step philosophy and culture and are one of the few documented explications of the cognitive, affective, and behavioral benefits that members might accrue. This study investigated the psychometric properties of a measure of AA's Twelve Promises and examined whether it mediated the effect of 12-step participation on abstinence. Young adults (N=302, M age 20.4 [1.6], range 18-25; 27% female; 95% White) enrolled in an addiction treatment effectiveness study completed assessments at intake and 3-, 6-, and 12-months post treatment including a 26-item, Twelve Promises Scale (TPS). Factor analyses examined the TPS' psychometrics and lagged mediational analyses tested the TPS as a mechanism of behavior change. Robust principal axis factoring extraction with Varimax rotation revealed a 2-factor solution explaining 45-58% of the variance across three administrations ("Psychological Wellbeing"=26-39%; "Freedom from Craving=17-21%); internal consistency was high (alpha=.83-.93). Both factors were found to increase in relation to greater 12-step participation, but significant mediation was found only for the Freedom from Craving factor explaining 21-34% of the effect of 12-step participation in increasing abstinence. The TPS shows potential as a conceptually relevant, and psychometrically sound measure and may be useful in helping elucidate the extent to which the Twelve Promises emerge as an independent benefit of 12-step participation and/or explain SUD remission and recovery. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  11. The relationship between the ability to identify evaluation criteria and integrity test scores

    Directory of Open Access Journals (Sweden)

    CORNELIUS J. KÖNIG

    2006-09-01

    Full Text Available It has been argued that applicants who have the ability to identify what kind of behavior is evaluated positively in a personnel selection situation can use this information to adapt their behavior accordingly. Although this idea has been tested for assessment centers and structured interviews, it has not been studied with regard to integrity tests (or other personality tests. Therefore, this study tested whether candidates’ ability to identify evaluation criteria (ATIC correlates with their integrity test scores. Candidates were tested in an application training setting (N = 92. The results supported the idea that ATIC also plays an important role for integrity tests. New directions for future research are suggested based on this finding.

  12. Student Test Scores: How the Sausage Is Made and Why You Should Care. Evidence Speaks Reports, Vol 1, #25

    Science.gov (United States)

    Jacob, Brian A.

    2016-01-01

    Contrary to popular belief, modern cognitive assessments--including the new Common Core tests--produce test scores based on sophisticated statistical models rather than the simple percent of items a student answers correctly. While there are good reasons for this, it means that reported test scores depend on many decisions made by test designers,…

  13. Psychometric evaluation of an item bank for computerized adaptive testing of the EORTC QLQ-C30 cognitive functioning dimension in cancer patients.

    Science.gov (United States)

    Dirven, Linda; Groenvold, Mogens; Taphoorn, Martin J B; Conroy, Thierry; Tomaszewski, Krzysztof A; Young, Teresa; Petersen, Morten Aa

    2017-07-13

    The European Organisation of Research and Treatment of Cancer (EORTC) Quality of Life Group is developing computerized adaptive testing (CAT) versions of all EORTC Quality of Life Questionnaire (QLQ-C30) scales with the aim to enhance measurement precision. Here we present the results on the field-testing and psychometric evaluation of the item bank for cognitive functioning (CF). In previous phases (I-III), 44 candidate items were developed measuring CF in cancer patients. In phase IV, these items were psychometrically evaluated in a large sample of international cancer patients. This evaluation included an assessment of dimensionality, fit to the item response theory (IRT) model, differential item functioning (DIF), and measurement properties. A total of 1030 cancer patients completed the 44 candidate items on CF. Of these, 34 items could be included in a unidimensional IRT model, showing an acceptable fit. Although several items showed DIF, these had a negligible impact on CF estimation. Measurement precision of the item bank was much higher than the two original QLQ-C30 CF items alone, across the whole continuum. Moreover, CAT measurement may on average reduce study sample sizes with about 35-40% compared to the original QLQ-C30 CF scale, without loss of power. A CF item bank for CAT measurement consisting of 34 items was established, applicable to various cancer patients across countries. This CAT measurement system will facilitate precise and efficient assessment of HRQOL of cancer patients, without loss of comparability of results.

  14. Pathology update to the Manchester Scoring System based on testing in over 4000 families.

    Science.gov (United States)

    Evans, D Gareth; Harkness, Elaine F; Plaskocinska, Inga; Wallace, Andrew J; Clancy, Tara; Woodward, Emma R; Howell, Tony A; Tischkowitz, Marc; Lalloo, Fiona

    2017-05-10

    While the requirement for thresholds for testing for mutations in BRCA1/2 is being questioned, they are likely to remain for individuals unaffected by a relevant cancer. It is still useful to provide pretesting likelihoods, but models need to take into account tumour pathology. The Manchester Scoring System (MSS) is a well-used, simple, paper-based model for assessing carrier probability that already incorporates pathology data. We have used mutation testing data from 4115 unrelated samples from affected non-Jewish individuals alongside tumour pathology to further refine the scoring system. Adding additional points for high-grade serous ovarian cancer <60 (HGSOC=+2) and adding grade score to those with triple-negative breast cancer, while reducing the score for those with HER2+ breast cancer (-6), resulted in significantly improved sensitivity and minor improvements in specificity to the MSS. Sporadic HGSOC <60 years thus reached a score of 15-19 points within the 10% grouping consistent with the 15/113-13.2% that were identified with a BRCA1/2 pathogenic variant. Validation in a population series of ovarian cancer from Cambridge showed high sensitivity at the 10% threshold 15/17 (88.2%). The new pathology-adjusted Manchester score MSS3 appears to provide an effective and simple-to-use estimate of the 10% and 20% thresholds for BRCA1/2 likelihood. For unaffected individuals, the 20-point (20%) threshold in their affected first-degree relative can be used to determine eligibility at the 10% threshold. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  15. Pediatric residents' learning styles and temperaments and their relationships to standardized test scores.

    Science.gov (United States)

    Tuli, Sanjeev Y; Thompson, Lindsay A; Saliba, Heidi; Black, Erik W; Ryan, Kathleen A; Kelly, Maria N; Novak, Maureen; Mellott, Jane; Tuli, Sonal S

    2011-12-01

    Board certification is an important professional qualification and a prerequisite for credentialing, and the Accreditation Council for Graduate Medical Education (ACGME) assesses board certification rates as a component of residency program effectiveness. To date, research has shown that preresidency measures, including National Board of Medical Examiners scores, Alpha Omega Alpha Honor Medical Society membership, or medical school grades poorly predict postresidency board examination scores. However, learning styles and temperament have been identified as factors that 5 affect test-taking performance. The purpose of this study is to characterize the learning styles and temperaments of pediatric residents and to evaluate their relationships to yearly in-service and postresidency board examination scores. This cross-sectional study analyzed the learning styles and temperaments of current and past pediatric residents by administration of 3 validated tools: the Kolb Learning Style Inventory, the Keirsey Temperament Sorter, and the Felder-Silverman Learning Style test. These results were compared with known, normative, general and medical population data and evaluated for correlation to in-service examination and postresidency board examination scores. The predominant learning style for pediatric residents was converging 44% (33 of 75 residents) and the predominant temperament was guardian 61% (34 of 56 residents). The learning style and temperament distribution of the residents was significantly different from published population data (P  =  .002 and .04, respectively). Learning styles, with one exception, were found to be unrelated to standardized test scores. The predominant learning style and temperament of pediatric residents is significantly different than that of the populations of general and medical trainees. However, learning styles and temperament do not predict outcomes on standardized in-service and board examinations in pediatric residents.

  16. Development and Psychometric Testing Chinese Version of the Frommelt Attitude Toward Care of the Dying Scale, Form B in Nurses and Nursing Students.

    Science.gov (United States)

    Wang, Li-Ping; Li, Ya-Jie; Yan, Wen-Zhen; Li, Guan-Mei

    2016-03-01

    Nursing students' and nurses' attitudes toward caring for the dying need to be explored. The Frommelt Attitude Toward Care of the Dying (FATCOD) scale has not previously been used in the Chinese language. The aim of this study was to examine the reliability and validity of the Chinese version of the FATCOD scale. A convenience sample of 154 nurses and 200 nursing students was recruited. The Chinese version of the FATCOD was used to test construct validity, concurrent validity, convergent validity, and internal consistency. The Cronbach's alpha coefficient of the Chinese version of the FATCOD scale, Form B (FATCOD-B-C) was 0.790. The Cronbach's alpha coefficients for each subscale ranged from 0.610 to 0.863. The test-retest reliability was satisfactory (r = 0.959, P psychometric testing is recommended to confirm the factor analysis, but this study provides further evidence of the applicability of the FATCOD-B-C in clinical care services.

  17. The effect of instructional methodology on high school students natural sciences standardized tests scores

    Science.gov (United States)

    Powell, P. E.

    Educators have recently come to consider inquiry based instruction as a more effective method of instruction than didactic instruction. Experience based learning theory suggests that student performance is linked to teaching method. However, research is limited on inquiry teaching and its effectiveness on preparing students to perform well on standardized tests. The purpose of the study to investigate whether one of these two teaching methodologies was more effective in increasing student performance on standardized science tests. The quasi experimental quantitative study was comprised of two stages. Stage 1 used a survey to identify teaching methods of a convenience sample of 57 teacher participants and determined level of inquiry used in instruction to place participants into instructional groups (the independent variable). Stage 2 used analysis of covariance (ANCOVA) to compare posttest scores on a standardized exam by teaching method. Additional analyses were conducted to examine the differences in science achievement by ethnicity, gender, and socioeconomic status by teaching methodology. Results demonstrated a statistically significant gain in test scores when taught using inquiry based instruction. Subpopulation analyses indicated all groups showed improved mean standardized test scores except African American students. The findings benefit teachers and students by presenting data supporting a method of content delivery that increases teacher efficacy and produces students with a greater cognition of science content that meets the school's mission and goals.

  18. Adults with poor reading skills: How lexical knowledge interacts with scores on standardized reading comprehension tests.

    Science.gov (United States)

    McKoon, Gail; Ratcliff, Roger

    2016-01-01

    Millions of adults in the United States lack the necessary literacy skills for most living wage jobs. For students from adult learning classes, we used a lexical decision task to measure their knowledge of words and we used a decision-making model (Ratcliff's, 1978, diffusion model) to abstract the mechanisms underlying their performance from their RTs and accuracy. We also collected scores for each participant on standardized IQ tests and standardized reading tests used commonly in the education literature. We found significant correlations between the model's estimates of the strengths with which words are represented in memory and scores for some of the standardized tests but not others. The findings point to the feasibility and utility of combining a test of word knowledge, lexical decision, that is well-established in psycholinguistic research, a decision-making model that supplies information about underlying mechanisms, and standardized tests. The goal for future research is to use this combination of approaches to understand better how basic processes relate to standardized tests with the eventual aim of understanding what these tests are measuring and what the specific difficulties are for individual, low-literacy adults. Copyright © 2015. Published by Elsevier B.V.

  19. Psychometric Properties of the Literacy Test from China Family Panel Studies%《中国家庭追踪调查》中字词测试的信度和效度分析

    Institute of Scientific and Technical Information of China (English)

    吴琼; 李佩华

    2016-01-01

    大型综合性调查经常包含认知测试部分。基于控制调查成本以及降低数据缺失率的要求,大型调查中的认知测试设计一般较为简洁,在这种情况下,其信度和效度可能受到影响。本文运用经典测试理论和项目反应理论两种方式来分析“中国家庭追踪调查”中字词测试的信度和效度。我们还同时比较了3种计分方法,它们分别是原始分计分法、最难题计分法以及基于项目反应理论的计分法。分析结果显示,“中国家庭追踪调查”中字词测试信度较高,其结构效度和效标效度良好。3种计分方法结果的相关度很高,在分析截面数据时没有实质性的差别。%Many large scale surveys contain cognitive assessment modules. The design of those cognitive tests is often brief in order to minimize test administration cost and maintain acceptable response rates. This paper evaluated the psychometric properties of the literacy test from China Family Panel Studies (CFPS) using both Classical Test Theory (CTT) and Item Response Theory (IRT) approaches. We also compared three different scoring methods:number-right, highest sequence number, and IRT scaled scores. Both CTT and IRT approaches provided positive evidence for reliability and validity of test scores from the CFPS literacy test. Three scoring methods yielded results that were highly consistent with one another.

  20. Predicting scores of the Halstead Category Test with the WAIS-III.

    Science.gov (United States)

    Titus, Jeffrey B; Retzlaff, Paul D; Dean, Raymond S

    2002-09-01

    The Halstead Category Test (HCT) and the Wechsler Adult Intelligence Scale (WAIS) are two of the most widely used neuropsychological tests. Often assessment conclusions are dependent upon the comparison of these measures. Therefore, it is crucial for clinicians to know how they relate to one another. This study examined the relationship between the HCT and the WAIS-III with undergraduate psychology students. Correlational analyses were conducted between HCT scores and WAIS-III subtests, Verbal and Performance IQ, and Full Scale IQ scores. Additionally, the new WAIS-III scales (Letter-Number Sequencing, Matrix Reasoning, and Symbol Search) were further examined. Regression analyses were run to develop predictor equations for the HCT using VIQ, PIQ, and FSIQ. Finally, predictor tables were generated between the HCT and VIQ, PIQ, and FSIQ to provide assessment of brain dysfunction for clinical use.

  1. Screening for depressed mood in an adolescent psychiatric context by brief self-assessment scales -- testing psychometric validity of WHO-5 and BDI-6 indices by latent trait analyses

    DEFF Research Database (Denmark)

    Blom, Eva Henje; Bech, Per; Högberg, Göran

    2012-01-01

    but they are time-consuming and often not fully implemented in clinical practice. Screening for depressive symptoms in the child psychiatric context using brief, user-friendly and easily managed self-assessment scales may be of clinical value and utility. The aim of the study is to test the psychometric validity.......52 and for the BDI-6 of 0.46. Rasch analysis also accepted unidimensionality when testing males versus females (p > 0.05). CONCLUSIONS: The WHO-5 is psychometrically valid in an adolescent psychiatric context including both genders to assess the wellness dimension and applicable as a first step in screening for MDD...

  2. Psychometric properties of the STAT for early autism screening.

    Science.gov (United States)

    Stone, Wendy L; Coonrod, Elaine E; Turner, Lauren M; Pozdol, Stacie L

    2004-12-01

    The STAT is an interactive screening measure for autism that assesses behaviors in the areas of play, communication, and imitation skills. In Study 1, signal detection procedures were employed to identify a cutoff score for the STAT using developmentally matched groups of 2-year-old children with autism and with nonspectrum disorders. The resulting cutoff yielded high sensitivity, specificity, and predictive values for the development sample as well as for an independent validation sample. Study 2 examined psychometric properties of the STAT and revealed acceptable levels of interrater agreement, test-retest reliability, and agreement between STAT risk category and ADOS-G classification. The STAT demonstrates strong psychometric properties and shows promising utility as a Level 2 screening measure for autism.

  3. Generalizing Terwilliger's likelihood approach: a new score statistic to test for genetic association

    OpenAIRE

    Hsu Li; Helmer Quinta; de Visser Marieke CH; Uitte de Willige Shirley; el Galta Rachid; Houwing-Duistermaat Jeanine J

    2007-01-01

    Abstract Background: In this paper, we propose a one degree of freedom test for association between a candidate gene and a binary trait. This method is a generalization of Terwilliger's likelihood ratio statistic and is especially powerful for the situation of one associated haplotype. As an alternative to the likelihood ratio statistic, we derive a score statistic, which has a tractable expression. For haplotype analysis, we assume that phase is known. Results: By means of a simulation study...

  4. Attitudes toward science: measurement and psychometric properties of the Test of Science-Related Attitudes for its use in Spanish-speaking classrooms

    Science.gov (United States)

    Navarro, Marianela; Förster, Carla; González, Caterina; González-Pose, Paulina

    2016-06-01

    Understanding attitudes toward science and measuring them remain two major challenges for science teaching. This article reviews the concept of attitudes toward science and their measurement. It subsequently analyzes the psychometric properties of the Test of Science-Related Attitudes (TOSRA), such as its construct validity, its discriminant and concurrent validity, and its reliability. The evidence presented suggests that TOSRA, in its Spanish-adapted version, has adequate construct validity regarding its theoretical referents, as well as good indexes of reliability. In addition, it determines the attitudes toward science of secondary school students in Santiago de Chile (n = 664) and analyzes the sex variable as a differentiating factor in such attitudes. The analysis by sex revealed low-relevance gender difference. The results are contrasted with those obtained in English-speaking countries. This TOSRA sample showed good psychometric parameters for measuring and evaluating attitudes toward science, which can be used in classrooms of Spanish-speaking countries or with immigrant populations with limited English proficiency.

  5. Development and psychometric testing of the Clinical Learning Environment, Supervision and Nurse Teacher evaluation scale (CLES+T): the Spanish version.

    Science.gov (United States)

    Vizcaya-Moreno, M Flores; Pérez-Cañaveras, Rosa M; De Juan, Joaquín; Saarikoski, Mikko

    2015-01-01

    The Clinical Learning Environment, Supervision and Nurse Teacher scale is a reliable and valid instrument to evaluate the quality of the clinical learning process in international nursing education contexts. This paper reports the development and psychometric testing of the Spanish version of the Clinical Learning Environment, Supervision and Nurse Teacher scale. Cross-sectional validation study of the scale. 10 public and private hospitals in the Alicante area, and the Faculty of Health Sciences (University of Alicante, Spain). 370 student nurses on clinical placement (January 2011-March 2012). The Clinical Learning Environment, Supervision and Nurse Teacher scale was translated using the modified direct translation method. Statistical analyses were performed using PASW Statistics 18 and AMOS 18.0.0 software. A multivariate analysis was conducted in order to assess construct validity. Cronbach's alpha coefficient was used to evaluate instrument reliability. An exploratory factorial analysis identified the five dimensions from the original version, and explained 66.4% of the variance. Confirmatory factor analysis supported the factor structure of the Spanish version of the instrument. Cronbach's alpha coefficient for the scale was .95, ranging from .80 to .97 for the subscales. This version of the Clinical Learning Environment, Supervision and Nurse Teacher scale instrument showed acceptable psychometric properties for use as an assessment scale in Spanish-speaking countries. Copyright © 2014 Elsevier Ltd. All rights reserved.

  6. Team-Based Learning's Effects on Standardized Test Scores and Student Reactions

    Science.gov (United States)

    Ulrich, Deborah L.; Brewer, Tracy; Steele-Johnson, Deborah; Juvina, Ion; Peyton, Elizabeth; Hammond, Crystal

    2017-01-01

    We examined the effects of team-based learning (TBL) versus traditional and enhanced lecture-based instruction over time (Study 1; N = 532 nursing students) and the psychometric properties of the Team Based Learning Student Assessment Instrument (TBL-SAI), a popular measure of reactions to TBL (Study 2; N = 323 nursing and medical students).…

  7. Open-book tests : Search behaviour, time used and test scores

    NARCIS (Netherlands)

    Westerkamp, Andrie C.; Heijne-Penninga, Marjolein; Kuks, Jan B. M.; Cohen-Schotanus, Janke

    2013-01-01

    Background: Because of the increasing medical knowledge and the focus of medical education on acquiring competencies, the use of open-book tests seems inevitable. Dealing with a large body of information, indicating which kind of information is needed to solve a problem, and finding and understandin

  8. Comparison of the Qualitative and Developmental Scoring Systems for the Modified Version of the Bender-Gestalt Test.

    Science.gov (United States)

    Brannigan, Gary G.; Brunner, Nancy A.

    1993-01-01

    Examined two scoring systems for Modified Version of the Bender-Gestalt Test. Administered Bender-Gestalt and Otis-Lennon School Ability Test to 75 first-grade and 84 second-grade students. Both systems were significantly correlated with school ability. Results of tests for differences between correlations indicated that Qualitative Scoring System…

  9. Modeling psychometric functions in R.

    Science.gov (United States)

    Yssaad-Fesselier, Rosa; Knoblauch, Kenneth

    2006-02-01

    We demonstrate some procedures in the statistical computing environment R for obtaining maximum likelihood estimates of the parameters of a psychometric function by fitting a generalized nonlinear regression model to the data. A feature for fitting a linear model to the threshold (or other) parameters of several psychometric functions simultaneously provides a powerful tool for testing hypotheses about the data and, potentially, for reducing the number of parameters necessary to describe them. Finally, we illustrate procedures for treating one parameter as a random effect that would permit a simplified approach to modeling stimulus-independent variability due to factors such as lapses or interobserver differences. These tools will facilitate a more comprehensive and explicit approach to the modeling of psychometric data.

  10. Normalization of the Psychometric Hepatic Encephalopathy score ...

    African Journals Online (AJOL)

    2017 Nigerian Journal of Clinical Practice | Published by Wolters Kluwer - Medknow. Acceptance Date: 09-05-2016. Aim: To construct normal values for .... Alzheimer's disease or Parkinson's disease, inability to read and write. Patients with ...

  11. Psychometric properties of the Cognitive Failures Questionnaire.

    Science.gov (United States)

    Bridger, Robert S; Johnsen, Svein Åge Kjøs; Brasher, Kate

    2013-01-01

    The Cognitive Failures Questionnaire (CFQ) is used in ergonomics research to measure behavioural problems associated with attentiveness and memory in everyday life. CFQ scores have been related to constructs such as accident proneness and outcomes such as human error and psychological strain. The two-year test-retest reliability of the CFQ is reported together with the findings of factor analyses of CFQ data from 535 respondents. Evidence for the predictive and criterion validity and internal reliability of the CFQ is provided. Psychological strain was measured concurrently with CFQ on both testing occasions, two years apart. The test-retest reliability of the summated CFQ score was found to be 0.71, while for the General Health Questionnaire (GHQ-12) strain measure it was 0.32.The relative variance stability was five times greater for the CFQ than the GHQ, indicating that scores on these questionnaires are not covariates. The use of the CFQ as a measure of cognitive control capacity is also discussed. Ergonomists have long been interested in human error and the role of high work demands due to poor equipment design and excessive workload. The CFQ measures attentiveness in daily life and is shown to have excellent psychometric properties that make it suitable for use in both laboratory and field studies as a trait measure of attentiveness in daily life.

  12. Classroom attributes and achievement test scores for deaf and hard of hearing students.

    Science.gov (United States)

    Holt, J

    1994-10-01

    This study examined reading comprehension and mathematics computation achievement of deaf and hard-of-hearing students in a variety of school settings. Data were collected by Gallaudet University's Center for Assessment and Demographic Studies during its 1990 standardization of the 8th Edition Stanford Achievement Test. Descriptive and inferential statistical methods were used to analyze the relationships among achievement scores, classroom attributes, and demographic factors associated with achievement. Based on the results of this study, inclusion with hearing students in regular classrooms is related to a variety of demographic factors. When reading comprehension and mathematics computation scores are adjusted for these factors, they are higher for the deaf and hard-of-hearing students in regular classrooms. However, it is not known whether the higher achievement is due to inclusion or whether students were selected for inclusion due to their higher achievement levels.

  13. Test Scores, Class Rank and College Performance: Lessons for Broadening Access and Promoting Success

    Science.gov (United States)

    Niu, Sunny X.; Tienda, Marta

    2012-01-01

    Using administrative data for five Texas universities that differ in selectivity, this study evaluates the relative influence of two key indicators for college success—high school class rank and standardized tests. Empirical results show that class rank is the superior predictor of college performance and that test score advantages do not insulate lower ranked students from academic underperformance. Using the UT-Austin campus as a test case, we conduct a simulation to evaluate the consequences of capping students admitted automatically using both achievement metrics. We find that using class rank to cap the number of students eligible for automatic admission would have roughly uniform impacts across high schools, but imposing a minimum test score threshold on all students would have highly unequal consequences by greatly reduce the admission eligibility of the highest performing students who attend poor high schools while not jeopardizing admissibility of students who attend affluent high schools. We discuss the implications of the Texas admissions experiment for higher education in Europe. PMID:23788828

  14. COMPARISON BETWEEN WOOD DRYING DEFECT SCORES: SPECIMEN TESTING X ANALYSIS OF KILN-DRIED BOARDS

    Directory of Open Access Journals (Sweden)

    Djeison Cesar Batista

    2015-04-01

    Full Text Available It is important to develop drying technologies for Eucalyptus grandis lumber, which is one of the most planted species of this genus in Brazil and plays an important role as raw material for the wood industry. The general aim of this work was to assess the conventional kiln drying of juvenile wood of three clones of Eucalyptus grandis. The specific aims were to compare the behavior between: i drying defects indicated by tests with wood specimens and conventional kiln-dried boards; and ii physical properties and the drying quality. Five 11-year-old trees of each clone were felled, and only flatsawn boards of the first log were used. Basic density and total shrinkage were determined, and the drying test with wood specimens at 100 °C was carried out. Kiln drying of boards was performed, and initial and final moisture content, moisture gradient in thickness, drying stresses and drying defects were assessed. The defect scoring method was used to verify the behavior between the defects detected by specimen testing and the defects detected in kiln-dried boards. As main results, the drying schedule was too severe for the wood, resulting in a high level of boards with defects. The behavior between the defects in the drying test with specimens and the defects of kiln-dried boards was different, there was no correspondence, according to the defect scoring method.

  15. Can the external masculinization score predict the success of genetic testing in 46,XY DSD?

    Directory of Open Access Journals (Sweden)

    Ruthie Su

    2015-05-01

    Full Text Available Genetic testing is judiciously applied to individuals with Disorders of Sex Development (DSD and so it is necessary to identify those most likely to benefit from such testing. We hypothesized that the external masculinization score (EMS is inversely associated with the likelihood of finding a pathogenic genetic variant. Patients with 46,XY DSD from a single institution evaluated from 1994-2014 were included. Results of advanced cytogenetic and gene sequencing tests were recorded. An EMS score (range 0-12 was assigned to each patient according to the team's initial external genitalia physical examination. During 1994-2011, 44 (40% patients with 46,XY DSD were evaluated and underwent genetic testing beyond initial karyotype; 23% (10/44 had a genetic diagnosis made by gene sequencing or array. The median EMS score of those with an identified pathogenic variant was significantly different from those in whom no confirmed genetic cause was identified [median 3 (95% CI, 2-6 versus 6 (95% CI, 5-7, respectively (p = 0.02], but limited to diagnoses of complete or partial androgen insensitivity (8/10 or 5-reductase deficiency (2/10. In the modern cohort (2012-2014, the difference in median EMS in whom a genetic cause was or was not identified approached significance (p = 0.05, median 3 (95% CI, 0-7 versus 7 (95% CI, 6-9, respectively. When all patients from 1994-2014 are pooled, the EMS is significantly different amongst those with compared to those without a genetic cause (median EMS 3 vs. 6, p < 0.02. We conclude that an EMS of 3 or less may indicate a higher likelihood of identifying a genetic cause of 46,XY DSD and justify genetic screening, especially when androgen insensitivity is suspected.

  16. The Student Human Papillomavirus Survey: Nurse-Led Instrument Development and Psychometric Testing to Increase Human Papillomavirus Vaccine Series Completion in Young Adults.

    Science.gov (United States)

    Thomas, Tami; Dalmida, Safyia; Higgins, Melinda

    2016-01-01

    The Student Human Papillomavirus Survey (SHPVS) was developed to examine students' perceived benefits or barriers to human papillomavirus (HPV) vaccination. Survey development included (a) 2-phase integrative literature reviews; (b) draft of survey items based on the literature; (c) critique of survey items by young adults, nursing and psychology faculty, and health care providers; and (d) pilot testing. The psychometric properties of the SHPVS were evaluated using classical item analysis and exploratory factor analysis (EFA) among a sample of 527 university students' ages 18-24 years. The estimated Cronbach's alpha for the SHPVS is .74. The SHPVS is a measure of young adults HPV perceived vulnerability, perceived severity, perceived barriers, and perceived benefits of HPV vaccination.

  17. Apgar score

    Science.gov (United States)

    ... this page: //medlineplus.gov/ency/article/003402.htm Apgar score To use the sharing features on this page, ... birth. Virginia Apgar, MD (1909-1974) introduced the Apgar score in 1952. How the Test is Performed The ...

  18. Test and Score Data Summary for TOEFL[R] Internet-Based and Paper-Based Tests. January 2008-December 2008 Test Data

    Science.gov (United States)

    Educational Testing Service, 2008

    2008-01-01

    The Test of English as a Foreign Language[TM], better known as TOEFL[R], is designed to measure the English-language proficiency of people whose native language is not English. TOEFL scores are accepted by more than 6,000 colleges, universities, and licensing agencies in 130 countries. The test is also used by governments, and scholarship and…

  19. Comparison of the Bender Gestalt Test for Both Black and White Brain-Damaged Patients Using Two Scoring Systems

    Science.gov (United States)

    Butler, Oliver T.; And Others

    1976-01-01

    This study tested for cultural bias in the Bender Visual Motor Gestalt Test. Subjects were 72 black and white patients diagnosed as either brain damaged or psychiatric. Bender protocols were scored by Pascal-Suttell and Hain systems. No race effect appeared except for the Pascal-Suttell system for which blacks scored significantly better. (Author)

  20. Interpreting the "g" Loadings of Intelligence Test Composite Scores in Light of Spearman's Law of Diminishing Returns

    Science.gov (United States)

    Reynolds, Matthew R.

    2013-01-01

    The linear loadings of intelligence test composite scores on a general factor ("g") have been investigated recently in factor analytic studies. Spearman's law of diminishing returns (SLODR), however, implies that the "g" loadings of test scores likely decrease in magnitude as g increases, or they are nonlinear. The purpose of this study was to (a)…

  1. Interpreting the "g" Loadings of Intelligence Test Composite Scores in Light of Spearman's Law of Diminishing Returns

    Science.gov (United States)

    Reynolds, Matthew R.

    2013-01-01

    The linear loadings of intelligence test composite scores on a general factor ("g") have been investigated recently in factor analytic studies. Spearman's law of diminishing returns (SLODR), however, implies that the "g" loadings of test scores likely decrease in magnitude as g increases, or they are nonlinear. The purpose of…

  2. Guided-Inquiry Lessons Raise Scores on the Sixth Grade Georgia Science Test

    Science.gov (United States)

    Page, Purlie M.

    At the local level, G Middle School has the highest district-wide percentage of 6th grade science students who are not meeting standards. It is imperative that G middle school take corrective action to reduce the number of students failing to meet state science standards. Dewey's theory of conceptual framework, which involves knowledge constructed on a person's personal experience and mind activity through active forms of learning, guided this study. The goal of the study was to determine whether inquiry-based science modules produce greater 6th grade science achievement, as measured by an equivalent instrument of the science section of the Georgia Criterion-Referenced Competency Test, when compared to traditional instruction among eastern Georgia 6th graders. The sample consisted of 230 students in the nonintervention group and 119 students in the intervention group. All students were from intact classes. At the end of the intervention, an independent t test was conducted to analyze the scores. According to the study t test, (t = 12.33, df = 304.56, p motivation towards, comprehension of, and interest in science concepts. At the local level, these inquiry lessons can be shared with science teachers across grade levels and within the district to improve county-wide science scores. An increase in student interest and comprehension of science concepts could ultimately lead to the United States producing more students in the fields of science, technology, engineering, and mathematics (STEM) education.

  3. Level and change in cognitive test scores predict risk of first stroke.

    Science.gov (United States)

    DeFries, Triveni; Avendaño, Mauricio; Glymour, M Maria

    2009-03-01

    To determine whether cognitive test scores and cognitive decline predict incidence of first diagnosed stroke. Stroke-free Health and Retirement Study participants were followed on average 7.6 years for self- or proxy-reported first stroke (1,483 events). Predictors included baseline performance on a modified Telephone Interview for Cognitive Status (Mental Status) and Word Recall test and decline between baseline and second assessment in either measure. Hazard ratios (HRs) were estimated using Cox proportional hazards models for the whole sample and stratified according to five major cardiovascular risk factors. National cohort study of noninstitutionalized adults with a mean baseline age of 64+/-9.9. Health and Retirement Study participants (n=19,699) aged 50 and older. Word Recall (HR for 1 standard deviation difference=0.92, 95% confidence interval (CI)=0.86-0.97)) and Mental Status (HR=0.89, 95% CI=0.84-0.95) predicted incident stroke. Mental Status predicted stroke risk in those with (HR=0.93, 95%=0.87-0.99) and without (HR=0.81, 95% CI=0.72-.91) one or more vascular risk factors. Word Recall declines predicted a 16% elevation in subsequent stroke risk (95% CI=1.01-1.34). Declines in Mental Status predicted a 37% elevation in stroke risk (95% CI=1.11-1.70). Cognitive test scores predict future stroke risk, independent of other major vascular risk factors.

  4. Micronucleus test for radiation biodosimetry in mass casualty events: Evaluation of visual and automated scoring

    Energy Technology Data Exchange (ETDEWEB)

    Bolognesi, Claudia, E-mail: claudia.bolognesi@istge.i [Environmental Carcinogenesis Unit, National Cancer Research Institute, Largo R. Benzi 10, 16132 Genoa (Italy); Balia, Cristina; Roggieri, Paola [Environmental Carcinogenesis Unit, National Cancer Research Institute, Largo R. Benzi 10, 16132 Genoa (Italy); Cardinale, Francesco [Clinical Epidemiology Unit, National Cancer Research Institute, Largo R. Benzi 10, 16132 Genoa (Italy); Department of Health Sciences, University of Genoa, Genoa (Italy); Bruzzi, Paolo [Clinical Epidemiology Unit, National Cancer Research Institute, Largo R. Benzi 10, 16132 Genoa (Italy); Sorcinelli, Francesca [Environmental Carcinogenesis Unit, National Cancer Research Institute, Largo R. Benzi 10, 16132 Genoa (Italy); Laboratory of Genetics, Histology and Molecular Biology Section, Army Medical and Veterinary, Research Center, Via Santo Stefano Rotondo 4, 00184 Roma (Italy); Lista, Florigio [Laboratory of Genetics, Histology and Molecular Biology Section, Army Medical and Veterinary, Research Center, Via Santo Stefano Rotondo 4, 00184 Roma (Italy); D' Amelio, Raffaele [Sapienza, Universita di Roma II Facolta di Medicina e Chirurgia and Ministero della Difesa, Direzione Generale Sanita Militare (Italy); Righi, Enzo [Frascati National Laboratories, National Institute of Nuclear Physics, Via Enrico Fermi 40, 00044 Frascati, Rome (Italy)

    2011-02-15

    In the case of a large-scale nuclear or radiological incidents a reliable estimate of dose is an essential tool for providing timely assessment of radiation exposure and for making life-saving medical decisions. Cytogenetics is considered as the 'gold standard' for biodosimetry. The dicentric analysis (DA) represents the most specific cytogenetic bioassay. The micronucleus test (MN) applied in interphase in peripheral lymphocytes is an alternative and simpler approach. A dose-effect calibration curve for the MN frequency in peripheral lymphocytes from 27 adult donors was established after in vitro irradiation at a dose range 0.15-8 Gy of {sup 137}Cs gamma rays (dose rate 6 Gy min{sup -1}). Dose prediction by visual scoring in a dose-blinded study (0.15-4.0 Gy) revealed a high level of accuracy (R = 0.89). The scoring of MN is time consuming and requires adequate skills and expertise. Automated image analysis is a feasible approach allowing to reduce the time and to increase the accuracy of the dose estimation decreasing the variability due to subjective evaluation. A good correlation (R = 0.705) between visual and automated scoring with visual correction was observed over the dose range 0-2 Gy. Almost perfect discrimination power for exposure to 1-2 Gy, and a satisfactory power for 0.6 Gy were detected. This threshold level can be considered sufficient for identification of sub lethally exposed individuals by automated CBMN assay.

  5. The National Early Warning Score: Translation, testing and prediction in a Swedish setting.

    Science.gov (United States)

    Spångfors, Martin; Arvidsson, Lisa; Karlsson, Victoria; Samuelson, Karin

    2016-12-01

    The National Early Warning Score - NEWS is a "track and trigger" scale designed to assess in-hospital patients' vital signs and detect clinical deterioration. In this study the NEWS was translated into Swedish and its association with the need of intensive care was investigated. A total of 868 patient charts, recorded by the medical emergency team at a university hospital, containing the parameters needed to calculate the NEWS were audited. The NEWS was translated into Swedish and tested for inter-rater reliability with a perfect agreement (weighted κ=1.0) among the raters. The median score for patients admitted to the ICU were higher than for those who were not (10 vs. 8, p<0.0001). AUROC for discriminating admittance to the ICU was 0.68 (95% CI: 0.622-0.739, p<0.0001). A regression analysis showed that lower oxygen saturation and a lower level of consciousness were significantly associated with ICU admission (OR 1.27 [1.06-1.52], p=0.01 and OR 1.77 [1.12-2.82], p=0.02) and may predict admission to the ICU better than the other parameters. The Swedish translated NEWS seems to have excellent inter-rater reliability and can be used without risk of linguistic misinterpretation. High scores for the parameters oxygen saturation and level of consciousness in the NEWS may predict admission to the ICU. Copyright © 2016 Elsevier Ltd. All rights reserved.

  6. Increased correlation coefficient between the written test score and tutors' performance test scores after training of tutors for assessment of medical students during problem-based learning course in Malaysia.

    Science.gov (United States)

    Jaiprakash, Heethal; Min, Aung Ko Ko; Ghosh, Sarmishtha

    2016-03-01

    This paper is aimed at finding if there was a change of correlation between the written test score and tutors' performance test scores in the assessment of medical students during a problem-based learning (PBL) course in Malaysia. This is a cross-sectional observational study, conducted among 264 medical students in two groups from November 2010 to November 2012. The first group's tutors did not receive tutor training; while the second group's tutors were trained in the PBL process. Each group was divided into high, middle and low achievers based on their end-of-semester exam scores. PBL scores were taken which included written test scores and tutors' performance test scores. Pearson correlation coefficient was calculated between the two kinds of scores in each group. The correlation coefficient between the written scores and tutors' scores in group 1 was 0.099 (pcorrelation coefficient in the group where tutors received the PBL training reinforces the importance of tutor training before their participation in the PBL course.

  7. Increased correlation coefficient between the written test score and tutors’ performance test scores after training of tutors for assessment of medical students during problem-based learning course in Malaysia

    Directory of Open Access Journals (Sweden)

    Heethal Jaiprakash

    2016-03-01

    Full Text Available This paper is aimed at finding if there was a change of correlation between the written test score and tutors’ performance test scores in the assessment of medical students during a problem-based learning (PBL course in Malaysia. This is a cross-sectional observational study, conducted among 264 medical students in two groups from November 2010 to November 2012. The first group’s tutors did not receive tutor training; while the second group’s tutors were trained in the PBL process. Each group was divided into high, middle and low achievers based on their end-of-semester exam scores. PBL scores were taken which included written test scores and tutors’ performance test scores. Pearson correlation coefficient was calculated between the two kinds of scores in each group. The correlation coefficient between the written scores and tutors’ scores in group 1 was 0.099 (p<0.001 and for group 2 was 0.305 (p<0.001. The higher correlation coefficient in the group where tutors received the PBL training reinforces the importance of tutor training before their participation in the PBL course.

  8. The Bender Gestalt Test with the Human Figure Drawing Test for Young School Children. A Manual for Use with the Koppitz Scoring System.

    Science.gov (United States)

    Koppitz, Elizabeth Munsterberg

    Presented is a manual for scoring the Bender Gestalt Test and the Human Figure Drawing Test for screening and diagnostic uses with emotionally disturbed, brain damaged, or perceptually handicapped 5- to 11-year-old children. Given are suggestions for administering and scoring the Bender test which examines distortion of shape, rotation,…

  9. The relationship between selected standardized test scores and performance in advanced placement math and science exams: Analyzing the differential effectiveness of scores for course identification and placement

    Science.gov (United States)

    Urbina, Josue N.

    There is a national need to increase the STEM-related workforce. Among factors leading towards STEM careers include the number of advanced high school mathematics and science courses students complete. Florida's enrollment patterns in STEM-related Advanced Placement (AP) courses, however, reveal that only a small percentage of students enroll into these classes. Therefore, screening tools are needed to find more students for these courses, who are academically ready, yet have not been identified. The purpose of this study was to investigate the extent to which scores from a national standardized test, Preliminary Scholastic Assessment Test/ National Merit Qualifying Test (PSAT/NMSQT), in conjunction with and compared to a state-mandated standardized test, Florida Comprehensive Assessment Test (FCAT), are related to selected AP exam performance in Seminole County Public Schools. An ex post facto correlational study was conducted using 6,189 student records from the 2010 - 2012 academic years. Multiple regression analyses using simultaneous Full Model testing showed differential moderate to strong relationships between scores in eight of the nine AP courses (i.e., Biology, Environmental Science, Chemistry, Physics B, Physics C Electrical, Physics C Mechanical, Statistics, Calculus AB and BC) examined. For example, the significant unique contribution to overall variance in AP scores was a linear combination of PSAT Math (M), Critical Reading (CR) and FCAT Reading (R) for Biology and Environmental Science. Moderate relationships for Chemistry included a linear combination of PSAT M, W (Writing) and FCAT M; a combination of FCAT M and PSAT M was most significantly associated with Calculus AB performance. These findings have implications for both research and practice. FCAT scores, in conjunction with PSAT scores, can potentially be used for specific STEM-related AP courses, as part of a systematic approach towards AP course identification and placement. For courses with

  10. CT densitovolumetry in children with obliterative bronchiolitis: correlation with clinical scores and pulmonary function test results

    Directory of Open Access Journals (Sweden)

    Helena Mocelin

    2013-12-01

    Full Text Available OBJECTIVE: To determine whether air trapping (expressed as the percentage of air trapping relative to total lung volume [AT%] correlates with clinical and functional parameters in children with obliterative bronchiolitis (OB.METHODS: CT scans of 19 children with OB were post-processed for AT% quantification with the use of a fixed threshold of −950 HU (AT%950 and of thresholds selected with the aid of density masks (AT%DM. Patients were divided into three groups by AT% severity. We examined AT% correlations with oxygen saturation (SO2 at rest, six-minute walk distance (6MWD, minimum SO2 during the six-minute walk test (6MWT_SO2, FVC, FEV1, FEV1/FVC, and clinical parameters.RESULTS: The 6MWD was longer in the patients with larger normal lung volumes (r = 0.53. We found that AT%950 showed significant correlations (before and after the exclusion of outliers, respectively with the clinical score (r = 0.72; 0.80, FVC (r = 0.24; 0.59, FEV1 (r = −0.58; −0.67, and FEV1/FVC (r = −0.53; r = −0.62, as did AT%DM with the clinical score (r = 0.58; r = 0.63, SO2 at rest (r = −0.40; r = −0.61, 6MWT_SO2 (r = −0.24; r = −0.55, FVC (r = −0.44; r = −0.80, FEV1 (r = −0.65; r = −0.71, and FEV1/FVC (r = −0.41; r = −0.52.CONCLUSIONS: Our results show that AT% correlates significantly with clinical scores and pulmonary function test results in children with OB.

  11. The Dutch version of the Forgotten Joint Score: test-retesting reliability and validation.

    Science.gov (United States)

    Shadid, Marvan B; Vinken, Nick S; Marting, Louis N; Wolterbeek, Nienke

    2016-03-01

    The aim of this study was to translate the Forgotten Joint Score (FJS) into the Dutch language. This -questionnaire was tested for internal consistency (Cronbach's alpha) and test-retest reliability (intraclass correlation coefficients (ICC)). 159 patients were included in this study; 74 with a total hip arthroplasty (THA) and 85 with a total knee arthroplasty (TKA). The FJS showed a high internal consistency (Cronbach's alpha=0.957; ICC=0.943). The FJS showed a significant correlation (r=0.751) with the WOMAC and low ceiling effects (3.1%). This study proved the Dutch FJS to be highly discriminative in patients treated with a THA or TKA. This makes the FJS a reliable patient related outcome measurement, measuring a new dimension in arthroplasty: the ability to forget an artificial joint in everyday life.

  12. Development and psychometric testing of a barriers to HIV testing scale among individuals with HIV infection in Sweden; The Barriers to HIV testing scale-Karolinska version

    OpenAIRE

    2015-01-01

    © 2015 Wiklander et al. Background: Barriers to HIV testing experienced by individuals at risk for HIV can result in treatment delay and further transmission of the disease. Instruments to systematically measure barriers are scarce, but could contribute to improved strategies for HIV testing. Aims of this study were to develop and test a barriers to HIV testing scale in a Swedish context. Methods: An 18-item scale was developed, based on an existing scale with addition of six new it...

  13. Reliability and validity of the new Tanaka B Intelligence Scale scores: a group intelligence test.

    Directory of Open Access Journals (Sweden)

    Yota Uno

    Full Text Available OBJECTIVE: The present study evaluated the reliability and concurrent validity of the new Tanaka B Intelligence Scale, which is an intelligence test that can be administered on groups within a short period of time. METHODS: The new Tanaka B Intelligence Scale and Wechsler Intelligence Scale for Children-Third Edition were administered to 81 subjects (mean age ± SD 15.2 ± 0.7 years residing in a juvenile detention home; reliability was assessed using Cronbach's alpha coefficient, and concurrent validity was assessed using the one-way analysis of variance intraclass correlation coefficient. Moreover, receiver operating characteristic analysis for screening for individuals who have a deficit in intellectual function (an FIQ<70 was performed. In addition, stratum-specific likelihood ratios for detection of intellectual disability were calculated. RESULTS: The Cronbach's alpha for the new Tanaka B Intelligence Scale IQ (BIQ was 0.86, and the intraclass correlation coefficient with FIQ was 0.83. Receiver operating characteristic analysis demonstrated an area under the curve of 0.89 (95% CI: 0.85-0.96. In addition, the stratum-specific likelihood ratio for the BIQ≤65 stratum was 13.8 (95% CI: 3.9-48.9, and the stratum-specific likelihood ratio for the BIQ≥76 stratum was 0.1 (95% CI: 0.03-0.4. Thus, intellectual disability could be ruled out or determined. CONCLUSION: The present results demonstrated that the new Tanaka B Intelligence Scale score had high reliability and concurrent validity with the Wechsler Intelligence Scale for Children-Third Edition score. Moreover, the post-test probability for the BIQ could be calculated when screening for individuals who have a deficit in intellectual function. The new Tanaka B Intelligence Test is convenient and can be administered within a variety of settings. This enables evaluation of intellectual development even in settings where performing intelligence tests have previously been difficult.

  14. Multiple tests for wind turbine fault detection and score fusion using two- level multidimensional scaling (MDS)

    Science.gov (United States)

    Ye, Xiang; Gao, Weihua; Yan, Yanjun; Osadciw, Lisa A.

    2010-04-01

    Wind is an important renewable energy source. The energy and economic return from building wind farms justify the expensive investments in doing so. However, without an effective monitoring system, underperforming or faulty turbines will cause a huge loss in revenue. Early detection of such failures help prevent these undesired working conditions. We develop three tests on power curve, rotor speed curve, pitch angle curve of individual turbine. In each test, multiple states are defined to distinguish different working conditions, including complete shut-downs, under-performing states, abnormally frequent default states, as well as normal working states. These three tests are combined to reach a final conclusion, which is more effective than any single test. Through extensive data mining of historical data and verification from farm operators, some state combinations are discovered to be strong indicators of spindle failures, lightning strikes, anemometer faults, etc, for fault detection. In each individual test, and in the score fusion of these tests, we apply multidimensional scaling (MDS) to reduce the high dimensional feature space into a 3-dimensional visualization, from which it is easier to discover turbine working information. This approach gains a qualitative understanding of turbine performance status to detect faults, and also provides explanations on what has happened for detailed diagnostics. The state-of-the-art SCADA (Supervisory Control And Data Acquisition) system in industry can only answer the question whether there are abnormal working states, and our evaluation of multiple states in multiple tests is also promising for diagnostics. In the future, these tests can be readily incorporated in a Bayesian network for intelligent analysis and decision support.

  15. [Information Concerning Mean Test Scores for the Graduate Management Admission Test (GMAT), Graduate Record Examinations (GRE), Law School Admission Test (LSAT), Preliminary Scholastic Aptitude Test (PSAT), and Scholastic Aptitude Test (SAT) for the National Commission on Excellence in Education.

    Science.gov (United States)

    Solomon, Robert J.

    Data are provided to the National Commission on Excellence in Education on the Graduate Management Admission Test (GMAT), Graduate Record Examinations (GRE), Law School Admission Test (LSAT), Preliminary Scholastic Aptitude Test (PSAT), and Scholastic Aptitude Test (SAT). Statistics are provided on the following: yearly GMAT mean scores 1965-1966…

  16. International field testing of the psychometric properties of an EORTC quality of life module for oral health: the EORTC QLQ-OH15.

    Science.gov (United States)

    Hjermstad, Marianne J; Bergenmar, Mia; Bjordal, Kristin; Fisher, Sheila E; Hofmeister, Dirk; Montel, Sébastien; Nicolatou-Galitis, Ourania; Pinto, Monica; Raber-Durlacher, Judith; Singer, Susanne; Tomaszewska, Iwona M; Tomaszewski, Krzysztof A; Verdonck-de Leeuw, Irma; Yarom, Noam; Winstanley, Julie B; Herlofson, Bente B

    2016-09-01

    This international EORTC validation study (phase IV) is aimed at testing the psychometric properties of a quality of life (QoL) module related to oral health problems in cancer patients. The phase III module comprised 17 items with four hypothesized multi-item scales and three single items. In phase IV, patients with mixed cancers, in different treatment phases from 10 countries completed the EORTC QLQ-C30, the QLQ-OH module, and a debriefing interview. The hypothesized structure was tested using combinations of classical test theory and item response theory, following EORTC guidelines. Test-retest assessments and responsiveness to change analysis (RCA) were performed after 2 weeks. Five hundred seventy-two patients (median age 60.3, 54 % females) were analyzed. Completion took EORTC module QLQ-OH15 is a short, well-accepted assessment tool focusing on oral problems and QoL to improve clinical management. ClinicalTrials.gov Identifier: NCT01724333.

  17. Development and psychometric testing of the Attitudes, Subjective Norms, Perceived Behavioural Control, and Intention to Pursue a Career in Mental Health Nursing scale.

    Science.gov (United States)

    Wilbourn, Mark; Salamonson, Yenna; Ramjan, Lucie; Chang, Sungwon

    2017-01-19

    The aim of the present study was to develop and test the psychometric properties of the Attitudes, Subjective Norms, Perceived Behavioural Control, and Intention to Pursue a Career in Mental Health Nursing (ASPIRE) scale, an instrument to assess nursing students' intention to work in mental health nursing. Understanding the factors influencing undergraduate nursing students' career intentions might lead to improved recruitment strategies. However, there are no standardized tools to measure and assess students' intention to pursue a career in mental health nursing. The present study used a cross-sectional survey design undertaken at a large tertiary institution in Western Sydney (Australia) between May and August 2013. It comprised three distinct and sequential phases: (i) items were generated representing the four dimensions of the theory of planned behaviour; (ii) face and content validity were tested by a representative reference group and panel of experts; and (iii) survey data from 1109 first- and second-year and 619 third-year students were used in exploratory and confirmatory factor analyses to test the factorial validity of the scale. Internal consistency was measured using Cronbach's alpha. Items generated for the ASPIRE scale were subject to face and content validity testing. Results showed good factorial validity and reliability for the final 14-item scale. Principal axis factoring revealed a one-factor solution, the hypothesized model being supported by confirmatory factor analysis. The ASPIRE scale is a valid and reliable instrument for measuring intention to pursue a career in mental health nursing among Bachelor of Nursing students.

  18. The psychometric properties of cancer multisymptom assessment instruments: a clinical review.

    Science.gov (United States)

    Aktas, Aynur; Walsh, Declan; Kirkova, Jordanka

    2015-07-01

    Various instruments are used to assess both individual and multiple cancer symptoms. We evaluated the psychometric properties of cancer multisymptom assessment instruments. An Ovid MEDLINE search was done. All searches were limited to adults and in English. All instruments published from 2005 to 2014 (and with at least one validity test) were included. We excluded those who only reported content validity. Instruments were categorized by the three major types of symptom measurement scales employed as follows: visual analogue (VAS), verbal rating (VRS), and numerical rating (NRS) scales. They were then examined in two areas: (1) psychometric thoroughness (number of tests) and (2) psychometric strength of evidence (validity, reliability, generalizability). We also assigned an empirical global psychometric quality score (which combined the concepts of thoroughness and strength of evidence) to rank the instruments. We analyzed 57 instruments (17 original, 40 modifications). They varied in types of scales used, symptom dimensions measured, and time frames evaluated. Of the 57, 10 used VAS, 28 VRS, and 19 NRS. The Edmonton Symptom Assessment System (ESAS), ESAS-Spanish, Hospital Anxiety and Depression Scale (HADS), Profile of Mood States (POMS), Symptom Distress Scale (SDS), M.D. Anderson Symptom Inventory (MDASI)-Russian, and MDASI-Taiwanese were the most comprehensively tested for validity and reliability. The ESAS, ESAS-Spanish, ASDS-2, Memorial Symptom Assessment Scale (MSAS)-SF, POMS, SDS, MDASI (and some translations), and MDASI-Heart Failure all showed good validity and reliability. The MDASI appeared to be the best overall from a psychometric perspective. This was followed by the ESAS, ESAS-Spanish, POMS, SDS, and some MDASI translations. VRS-based instruments were most common. There was a wide range of psychometric rigor in validation. Consequently, meta-analysis was not possible. Most cancer multisymptom assessment instruments need further extensive validation

  19. The Test Anxiety Measure for Adolescents (TAMA): Examination of the Reliability and Validity of the Scores of a New Multidimensional Measure of Test Anxiety for Middle and High School Students

    Science.gov (United States)

    Lowe, Patricia A.

    2014-01-01

    A new multidimensional measure of test anxiety, the Test Anxiety Measure for Adolescents (TAMA), specifically designed for U.S. adolescents in Grades 6 to 12 was developed and its psychometric properties were examined. The TAMA consists of five scales (Cognitive Interference, Physiological Hyperarousal, Social Concerns, Task Irrelevant Behavior,…

  20. Performance on large-scale science tests: Item attributes that may impact achievement scores

    Science.gov (United States)

    Gordon, Janet Victoria

    Significant differences in achievement among ethnic groups persist on the eighth-grade science Washington Assessment of Student Learning (WASL). The WASL measures academic performance in science using both scenario and stand-alone question types. Previous research suggests that presenting target items connected to an authentic context, like scenario question types, can increase science achievement scores especially in underrepresented groups and thus help to close the achievement gap. The purpose of this study was to identify significant differences in performance between gender and ethnic subgroups by question type on the 2005 eighth-grade science WASL. MANOVA and ANOVA were used to examine relationships between gender and ethnic subgroups as independent variables with achievement scores on scenario and stand-alone question types as dependent variables. MANOVA revealed no significant effects for gender, suggesting that the 2005 eighth-grade science WASL was gender neutral. However, there were significant effects for ethnicity. ANOVA revealed significant effects for ethnicity and ethnicity by gender interaction in both question types. Effect sizes were negligible for the ethnicity by gender interaction. Large effect sizes between ethnicities on scenario question types became moderate to small effect sizes on stand-alone question types. This indicates the score advantage the higher performing subgroups had over the lower performing subgroups was not as large on stand-alone question types compared to scenario question types. A further comparison examined performance on multiple-choice items only within both question types. Similar achievement patterns between ethnicities emerged; however, achievement patterns between genders changed in boys' favor. Scenario question types appeared to register differences between ethnic groups to a greater degree than stand-alone question types. These differences may be attributable to individual differences in cognition

  1. Survival analysis of colorectal cancer patients with tumor recurrence using global score test methodology

    Energy Technology Data Exchange (ETDEWEB)

    Zain, Zakiyah, E-mail: zac@uum.edu.my; Ahmad, Yuhaniz, E-mail: yuhaniz@uum.edu.my [School of Quantitative Sciences, Universiti Utara Malaysia, UUM Sintok 06010, Kedah (Malaysia); Azwan, Zairul, E-mail: zairulazwan@gmail.com, E-mail: farhanaraduan@gmail.com, E-mail: drisagap@yahoo.com; Raduan, Farhana, E-mail: zairulazwan@gmail.com, E-mail: farhanaraduan@gmail.com, E-mail: drisagap@yahoo.com; Sagap, Ismail, E-mail: zairulazwan@gmail.com, E-mail: farhanaraduan@gmail.com, E-mail: drisagap@yahoo.com [Surgery Department, Universiti Kebangsaan Malaysia Medical Centre, Jalan Yaacob Latif, 56000 Bandar Tun Razak, Kuala Lumpur (Malaysia); Aziz, Nazrina, E-mail: nazrina@uum.edu.my

    2014-12-04

    Colorectal cancer is the third and the second most common cancer worldwide in men and women respectively, and the second in Malaysia for both genders. Surgery, chemotherapy and radiotherapy are among the options available for treatment of patients with colorectal cancer. In clinical trials, the main purpose is often to compare efficacy between experimental and control treatments. Treatment comparisons often involve several responses or endpoints, and this situation complicates the analysis. In the case of colorectal cancer, sets of responses concerned with survival times include: times from tumor removal until the first, the second and the third tumor recurrences, and time to death. For a patient, the time to recurrence is correlated to the overall survival. In this study, global score test methodology is used in combining the univariate score statistics for comparing treatments with respect to each survival endpoint into a single statistic. The data of tumor recurrence and overall survival of colorectal cancer patients are taken from a Malaysian hospital. The results are found to be similar to those computed using the established Wei, Lin and Weissfeld method. Key factors such as ethnic, gender, age and stage at diagnose are also reported.

  2. CORRELATION BETWEEN SYMPTOM SCORE, WHEEZE, REVERSIBILITY OF PULMONARY FUNCTION TESTS AND TREATMENT RESPONSE IN ASTHMA

    Directory of Open Access Journals (Sweden)

    M.H. Boskabady

    2003-06-01

    Full Text Available Asthma management is a major concern because some asthmatic patients either do not respond or else hardly respond to treatment. Therefore in the present study, an attempt has been made to determine the predictors of treatment response in asthmatic patients.Thirty six asthmatic adults including 13 male and 23 female were studied dur¬ing a 3 month treatment period. Asthma symptom score (SS and wheezing were recorded before and after treatment. Pulmonary function tests (PFTs including forced vital capacity (FVC, forced expiratory volume in one second (FEVj, peak expiratory flow (PEF, maximal expiratory measured at the beginning and the end of the study. The increase in PFT values 10 min after 200 ug inhaled salbutamol (in percentage was considered as reversibility in airway constriction.There were significant improvements in SS (/JKO.001 and PFT variables (/;>The results of these study showed that a well conducted therapeutic program could lead to improvement in symptoms, wheeze, and PFT values. In addition symptom score, wheeze, and reversibility in FEV1 and PEF could be good indi¬cators of response to treatment in asthma.

  3. Systematic Review of Health-Related Work Outcome Measures and Quality Criteria-Based Evaluations of Their Psychometric Properties.

    Science.gov (United States)

    Mateen, Bilal Akhter; Doogan, Catherine; Hayward, Kate; Hourihan, Susan; Hurford, Joanne; Playford, E Diane

    2017-03-01

    To examine the state of psychometric validation in the health-related work outcome literature. We searched PubMed, PubMed Central, CINAHL, Embase (plus Embase Classic), and PsycINFO from inception to January 2016 using the following search terms: stroke, multiple sclerosis, epilepsy, spinal cord injury, brain injury, musculoskeletal disease, work, absenteeism, presenteeism, occupation, employment, job, outcome measure, assessment, work capacity evaluation, scale, and questionnaire. From the 22,676 retrieved abstracts, 597 outcome measures were identified. Inclusion was based on content analysis. There were 95 health-related work outcome measures retained; of these, 2 were treated as outliers and therefore are discussed separately. All 6 authors individually organized the 93 remaining scales based on their content. A follow-up search using the same sources, and time period, with the name of the outcome measures and the terms psychometric, reliability, validity, and responsiveness, identified 263 unique classical test theory psychometric property datasets for the 93 tools. An assessment criterion for psychometric properties was applied to each article, and where consensus was not achieved, the rating delivered by most of the assessors was reported. Of the articles reported, 18 reporting psychometric data were not accessible and therefore could not be assessed. There were 39 that scored 80%. The 3 outcome measures associated with the highest scoring datasets were the Sheehan Disability Scale, the Fear Avoidance Beliefs Questionnaire, and the assessment of the Subjective Handicap of Epilepsy. Finally, only 2 psychometric validation datasets reported the complete set of baseline psychometric properties. This systematic review highlights the current limitations of the health-related work outcome measure literature, including the limited number of robust tools available. Copyright © 2016. Published by Elsevier Inc.

  4. The Mote In Thy Brother's Eye, and The Beam in Thine Own: Predicting One's Own and Others' Personality Test Scores.

    Science.gov (United States)

    Furnham, Adrian; Henderson, Monika

    1983-01-01

    Examined the similarity between subjects' (N=63) ratings of themselves and others, on various tests of personality. Results revealed that subjects correctly estimated several of their own scores, but only two scores of another person. They believed themselves to be similar to their friend, thereby showing attributional errors. (JAC)

  5. Improving Personality Facet Scores with Multidimensional Computer Adaptive Testing: An Illustration with the Neo Pi-R

    Science.gov (United States)

    Makransky, Guido; Mortensen, Erik Lykke; Glas, Cees A. W.

    2013-01-01

    Narrowly defined personality facet scores are commonly reported and used for making decisions in clinical and organizational settings. Although these facets are typically related, scoring is usually carried out for a single facet at a time. This method can be ineffective and time consuming when personality tests contain many highly correlated…

  6. Expanded Koppitz Scoring System of the Bender Gestalt Visual-Motor Test for Adolescents: A Pilot Study.

    Science.gov (United States)

    Bolen, Larry M.; And Others

    1992-01-01

    Examined use of Bender Gestalt Visual-Motor Test with school-age adolescents over age 11. Mean error scores suggest that visual-motor development is not maturationally complete by age 11 years, 11 months. Suggests additional research focusing on extending normative sample or developing new scoring system for adolescents. (Author/NB)

  7. Validation of Automated Scores of TOEFL iBT Tasks against Non-Test Indicators of Writing Ability

    Science.gov (United States)

    Weigle, Sara Cushing

    2010-01-01

    Automated scoring has the potential to dramatically reduce the time and costs associated with the assessment of complex skills such as writing, but its use must be validated against a variety of criteria for it to be accepted by test users and stakeholders. This study approaches validity by comparing human and automated scores on responses to…

  8. Validation of Automated Scores of TOEFL iBT Tasks against Non-Test Indicators of Writing Ability

    Science.gov (United States)

    Weigle, Sara Cushing

    2010-01-01

    Automated scoring has the potential to dramatically reduce the time and costs associated with the assessment of complex skills such as writing, but its use must be validated against a variety of criteria for it to be accepted by test users and stakeholders. This study approaches validity by comparing human and automated scores on responses to…

  9. Assessing Statistical Aspects of Test Fairness with Structural Equation Modelling

    Science.gov (United States)

    Kline, Rex B.

    2013-01-01

    Test fairness and test bias are not synonymous concepts. Test bias refers to statistical evidence that the psychometrics or interpretation of test scores depend on group membership, such as gender or race, when such differences are not expected. A test that is grossly biased may be judged to be unfair, but test fairness concerns the broader, more…

  10. Academic self-concept, interest, grades, and standardized test scores: reciprocal effects models of causal ordering.

    Science.gov (United States)

    Marsh, Herbert W; Trautwein, Ulrich; Lüdtke, Oliver; Köller, Olaf; Baumert, Jürgen

    2005-01-01

    Reciprocal effects models of longitudinal data show that academic self-concept is both a cause and an effect of achievement. In this study this model was extended to juxtapose self-concept with academic interest. Based on longitudinal data from 2 nationally representative samples of German 7th-grade students (Study 1: N = 5,649, M age = 13.4; Study 2: N = 2,264, M age = 13.7 years), prior self-concept significantly affected subsequent math interest, school grades, and standardized test scores, whereas prior math interest had only a small effect on subsequent math self-concept. Despite stereotypic gender differences in means, linkages relating these constructs were invariant over gender. These results demonstrate the positive effects of academic self-concept on a variety of academic outcomes and integrate self-concept with the developmental motivation literature.

  11. Genetic analysis of somatic cell score in Norwegian cattle using random regression test-day models.

    Science.gov (United States)

    Odegård, J; Jensen, J; Klemetsdal, G; Madsen, P; Heringstad, B

    2003-12-01

    The dataset used in this analysis contained a total of 341,736 test-day observations of somatic cell scores from 77,110 primiparous daughters of 1965 Norwegian Cattle sires. Initial analyses, using simple random regression models without genetic effects, indicated that use of homogeneous residual variance was appropriate. Further analyses were carried out by use of a repeatability model and 12 random regression sire models. Legendre polynomials of varying order were used to model both permanent environmental and sire effects, as did the Wilmink function, the Lidauer-Mäntysaari function, and the Ali-Schaeffer function. For all these models, heritability estimates were lowest at the beginning (0.05 to 0.07) and higher at the end (0.09 to 0.12) of lactation. Genetic correlations between somatic cell scores early and late in lactation were moderate to high (0.38 to 0.71), whereas genetic correlations for adjacent DIM were near unity. Models were compared based on likelihood ratio tests, Bayesian information criterion, Akaike information criterion, residual variance, and predictive ability. Based on prediction of randomly excluded observations, models with 4 coefficients for permanent environmental effect were preferred over simpler models. More highly parameterized models did not substantially increase predictive ability. Evaluation of the different model selection criteria indicated that a reduced order of fit for sire effects was desireable. Models with zeroth- or first-order of fit for sire effects and higher order of fit for permanent environmental effects probably underestimated sire variance. The chosen model had Legendre polynomials with 3 coefficients for sire, and 4 coefficients for permanent environmental effects. For this model, trajectories of sire variance and heritability were similar assuming either homogeneous or heterogeneous residual variance structure.

  12. REPRODUCIBILITY OF THE MODIFIED STAR EXCURSION BALANCE TEST COMPOSITE AND SPECIFIC REACH DIRECTION SCORES

    Science.gov (United States)

    van Lieshout, Remko; Reijneveld, Elja A.E.; van den Berg, Sandra M.; Haerkens, Gijs M.; Koenders, Niek H.; de Leeuw, Arina J.; van Oorsouw, Roel G.; Paap, Davy; Scheffer, Else; Weterings, Stijn

    2016-01-01

    ABSTRACT Background The mSEBT is a screening tool used to evaluate dynamic balance. Most research investigating measurement properties focused on intrarater reliability and was done in small samples. To know whether the mSEBT is useful to discriminate dynamic balance between persons and to evaluate changes in dynamic balance, more research into intra- and interrater reliability and smallest detectable change (synonymous with minimal detectable change) is needed. Purpose To estimate intra- and interrater reliability and smallest detectable change of the mSEBT in adults at risk for ankle sprain. Study Design Cross-sectional, test-retest design Methods Fifty-five healthy young adults participating in sports at risk for ankle sprain participated (mean ± SD age, 24.0 ± 2.9 years). Each participant performed three test sessions within one hour and was rated by two physical therapists (session 1, rater 1; session 2, rater 2; session 3, rater 1). Participants and raters were blinded for previous measurements. Normalized composite and reach direction scores for the right and left leg were collected. Analysis of variance was used to calculate intraclass correlation coefficient values for intra- and interrater reliability. Smallest detectable change values were calculated based on the standard error of measurement. Results Intra- and interrater reliability for both legs was good to excellent (intraclass correlation coefficient ranging from 0.87 to 0.94). The intrarater smallest detectable change for the composite score of the right leg was 7.2% and for the left 6.2%. The interrater smallest detectable change for the composite score of the right leg was 6.9% and for the left 5.0%. Conclusion The mSEBT is a reliable measurement instrument to discriminate dynamic balance between persons. Most smallest detectable change values of the mSEBT appear to be large. More research is needed to investigate if the mSEBT is usable for evaluative purposes. Level of Evidence Level 2

  13. The achievement impact of the inclusion model on the standardized test scores of general education students

    Science.gov (United States)

    Garrett-Rainey, Syrena

    The purpose of this study was to compare the achievement of general education students within regular education classes to the achievement of general education students in inclusion/co-teach classes to determine whether there was a significant difference in the achievement between the two groups. The school district's inclusion/co-teach model included ongoing professional development support for teachers and administrators. General education teachers, special education teachers, and teacher assistants collaborated to develop instructional strategies to provide additional remediation to help students to acquire the skills needed to master course content. This quantitative study reviewed the end-of course test (EoCT) scores of Grade 10 physical science and math students within an urban school district. It is not known whether general education students in an inclusive/co-teach science or math course will demonstrate a higher achievement on the EoCT in math or science than students not in an inclusive/co-teach classroom setting. In addition, this study sought to determine if students classified as low socioeconomic status benefited from participating in co-teaching classrooms as evidenced by standardized tests. Inferential statistics were used to determine whether there was a significant difference between the achievements of the treatment group (inclusion/co-teach) and the control group (non-inclusion/co-teach). The findings can be used to provide school districts with optional instructional strategies to implement in the diverse classroom setting in the modern classroom to increase academic performance on state standardized tests.

  14. Poisson Approximation-Based Score Test for Detecting Association of Rare Variants.

    Science.gov (United States)

    Fang, Hongyan; Zhang, Hong; Yang, Yaning

    2016-07-01

    Genome-wide association study (GWAS) has achieved great success in identifying genetic variants, but the nature of GWAS has determined its inherent limitations. Under the common disease rare variants (CDRV) hypothesis, the traditional association analysis methods commonly used in GWAS for common variants do not have enough power for detecting rare variants with a limited sample size. As a solution to this problem, pooling rare variants by their functions provides an efficient way for identifying susceptible genes. Rare variant typically have low frequencies of minor alleles, and the distribution of the total number of minor alleles of the rare variants can be approximated by a Poisson distribution. Based on this fact, we propose a new test method, the Poisson Approximation-based Score Test (PAST), for association analysis of rare variants. Two testing methods, namely, ePAST and mPAST, are proposed based on different strategies of pooling rare variants. Simulation results and application to the CRESCENDO cohort data show that our methods are more powerful than the existing methods.

  15. Should We Stop Looking for a Better Scoring Algorithm for Handling Implicit Association Test Data? Test of the Role of Errors, Extreme Latencies Treatment, Scoring Formula, and Practice Trials on Reliability and Validity.

    Directory of Open Access Journals (Sweden)

    Juliette Richetin

    Full Text Available Since the development of D scores for the Implicit Association Test, few studies have examined whether there is a better scoring method. In this contribution, we tested the effect of four relevant parameters for IAT data that are the treatment of extreme latencies, the error treatment, the method for computing the IAT difference, and the distinction between practice and test critical trials. For some options of these different parameters, we included robust statistic methods that can provide viable alternative metrics to existing scoring algorithms, especially given the specificity of reaction time data. We thus elaborated 420 algorithms that result from the combination of all the different options and test the main effect of the four parameters with robust statistical analyses as well as their interaction with the type of IAT (i.e., with or without built-in penalty included in the IAT procedure. From the results, we can elaborate some recommendations. A treatment of extreme latencies is preferable but only if it consists in replacing rather than eliminating them. Errors contain important information and should not be discarded. The D score seems to be still a good way to compute the difference although the G score could be a good alternative, and finally it seems better to not compute the IAT difference separately for practice and test critical trials. From this recommendation, we propose to improve the traditional D scores with small yet effective modifications.

  16. The TSCA interagency testing committee`s approaches to screening and scoring chemicals and chemical groups: 1977-1983

    Energy Technology Data Exchange (ETDEWEB)

    Walker, J.D. [Environmental Protection Agency, Washington, DC (United States)

    1990-12-31

    This paper describes the TSCA interagency testing committee`s (ITC) approaches to screening and scoring chemicals and chemical groups between 1977 and 1983. During this time the ITC conducted five scoring exercises to select chemicals and chemical groups for detailed review and to determine which of these chemicals and chemical groups should be added to the TSCA Section 4(e) Priority Testing List. 29 refs., 1 fig., 2 tabs.

  17. The Effect of Transient Students' Scores on the Norm of One High School's Standardized Basic Skills Test Battery.

    Science.gov (United States)

    Hill, Carolyn Stevens

    The effect of the standardized test scores of transient students on the mean of the 9th, 10th, and 11th grade standardized test scores of a school was studied. The groups used in the study were grades 9, 10, and 11 at a new high school in Clarksburg (West Virginia). The study was conducted in the spring of 1998. Groups consisted of 204 9th…

  18. A novel questionnaire to measure staff perception of end-of-life decision making in the intensive care unit--development and psychometric testing.

    Science.gov (United States)

    Schwarzkopf, Daniel; Westermann, Isabella; Skupin, Helga; Riedemann, Niels C; Reinhart, Konrad; Pfeifer, Ruediger; Fritzenwanger, Michael; Günther, Albrecht; Witte, Otto W; Hartog, Christiane S

    2015-02-01

    The aim of this study was to create a questionnaire that measures barriers and facilitators of effective end-of-life (EOL) decision making and communication and associated stress as perceived by intensive care unit (ICU) staff. The questionnaire was developed on the basis of a theoretical framework and discussion with ICU staff. It was pretested among 15 ICU nurses and physicians. A field test was conducted in 4 interdisciplinary ICUs of one university hospital Descriptive item analysis, exploratory factor analysis, and reliability and validity analysis were performed. Overall, 174 of 284 ICU staff participated in the field test (61% response). Factor analysis indicated a 7-factor solution: (1) collaboration in the EOL context, (2) role clarity in the EOL context, (3) work-related interruptions of communication with families, (4) emotional support, (5) stress by involvement in EOL decision making and communication with families, (6) stress by work overload, and (7) taking initiative toward EOL decision making. Internal consistency of the scales was acceptable (range, 0.69-0.85). Construct validity was shown by relationships of the scales to several constructs, for example, satisfaction with EOL decision making and emotional exhaustion. Overall, 26 of 31 expected relationships achieved significance. The new questionnaire meets psychometric criteria of reliability and validity and promises to be a useful quality measure of EOL decision making in the ICU. Copyright © 2014 Elsevier Inc. All rights reserved.

  19. Rugby versus Soccer in South Africa: Content Familiarity Contributes to Cross-Cultural Differences in Cognitive Test Scores

    Science.gov (United States)

    Malda, Maike; van de Vijver, Fons J. R.; Temane, Q. Michael

    2010-01-01

    In this study, cross-cultural differences in cognitive test scores are hypothesized to depend on a test's cultural complexity (Cultural Complexity Hypothesis: CCH), here conceptualized as its content familiarity, rather than on its cognitive complexity (Spearman's Hypothesis: SH). The content familiarity of tests assessing short-term memory,…

  20. Rugby versus Soccer in South Africa: Content Familiarity Contributes to Cross-Cultural Differences in Cognitive Test Scores

    Science.gov (United States)

    Malda, Maike; van de Vijver, Fons J. R.; Temane, Q. Michael

    2010-01-01

    In this study, cross-cultural differences in cognitive test scores are hypothesized to depend on a test's cultural complexity (Cultural Complexity Hypothesis: CCH), here conceptualized as its content familiarity, rather than on its cognitive complexity (Spearman's Hypothesis: SH). The content familiarity of tests assessing short-term memory,…