WorldWideScience

Sample records for ability test scores

  1. Cognitive Ability and Personality Variables as Predictors of School Grades and Test Scores in Adolescents

    Science.gov (United States)

    Hofer, Manfred; Kuhnle, Claudia; Kilian, Britta; Fries, Stefan

    2012-01-01

    The predictive power of cognitive ability and self-control strength for self-reported grades and an achievement test was studied. It was expected that the variables use of time structure, academic procrastination, and motivational interference during learning further aid in predicting students' achievement because they are operative in situations…

  2. The Score Reliability of Draw-a-Person Intellectual Ability Test (DAP: IQ) for Rural Malawi Students

    Science.gov (United States)

    Khasu, Denis S.; Williams, Thomas O., Jr.

    2016-01-01

    In this brief article, the reliability of scores for the Draw-A-Person Intellectual Ability Test for Children, Adolescents, and Adults (DAP: IQ; Reynolds & Hickman, 2004) was examined through several analyses with a sample of 147 children from rural Malawi, Africa using a Chichewa translation of instructions. Cronbach alpha coefficients for…

  3. Visual-Constructional Ability in Individuals with Severe Obesity: Rey Complex Figure Test Accuracy and the Q-Score

    Directory of Open Access Journals (Sweden)

    Hanna L. Sargénius

    2017-09-01

    The aims of this study were to investigate visual-construction and organizational strategy among individuals with severe obesity, as measured by the Rey Complex Figure Test (RCFT), and to examine the validity of the Q-score as a measure for the quality of performance on the RCFT. Ninety-six non-demented morbidly obese (MO) patients and 100 healthy controls (HC) completed the RCFT. Their performance was calculated by applying the standard scoring criteria. The quality of the copying process was evaluated per the directions of the Q-score scoring system. Results revealed that the MO did not perform significantly lower than the HC on Copy accuracy (mean difference −0.302, CI −1.374 to 0.769, p = 0.579). In contrast, the groups did statistically differ from each other, with MO performing poorer than the HC on the Q-score (mean −1.784, CI −3.237 to −0.331, p = 0.016) and the Unit points (mean −1.409, CI −2.291 to −0.528, p = 0.002), but not on the Order points score (mean −0.351, CI −0.994 to 0.293, p = 0.284). Differences on the Unit score and the Q-score were slightly reduced when adjusting for gender, age, and education. This study presents evidence supporting the presence of inefficiency in visuospatial constructional ability among MO patients. We believe we have found an indication that the Q-score captures a wider range of cognitive processes that are not described by traditional scoring methods. Rather than considering accuracy and placement of the different elements only, the Q-score focuses more on how the subject has approached the task.

  4. A Test of the Relationship between Reading Ability & Standardized Biology Assessment Scores

    Science.gov (United States)

    Allen, Denise A.

    2014-01-01

    Little empirical evidence suggested that independent reading abilities of students enrolled in biology predicted their performance on the Biology I Graduation End-of-Course Assessment (ECA). An archival study was conducted at one Indiana urban public high school in Indianapolis, Indiana, by examining existing educational assessment data to test…

  5. Addressing criticisms of existing predictive bias research: cognitive ability test scores still overpredict African Americans' job performance.

    Science.gov (United States)

    Berry, Christopher M; Zhao, Peng

    2015-01-01

    Predictive bias studies have generally suggested that cognitive ability test scores overpredict job performance of African Americans, meaning these tests are not predictively biased against African Americans. However, at least 2 issues call into question existing over-/underprediction evidence: (a) a bias identified by Aguinis, Culpepper, and Pierce (2010) in the intercept test typically used to assess over-/underprediction and (b) a focus on the level of observed validity instead of operational validity. The present study developed and utilized a method of assessing over-/underprediction that draws on the math of subgroup regression intercept differences, does not rely on the biased intercept test, allows for analysis at the level of operational validity, and can use meta-analytic estimates as input values. Therefore, existing meta-analytic estimates of key parameters, corrected for relevant statistical artifacts, were used to determine whether African American job performance remains overpredicted at the level of operational validity. African American job performance was typically overpredicted by cognitive ability tests across levels of job complexity and across conditions wherein African American and White regression slopes did and did not differ. Because the present study does not rely on the biased intercept test and because appropriate statistical artifact corrections were carried out, the present study's results are not affected by the 2 issues mentioned above. The present study represents strong evidence that cognitive ability tests generally overpredict job performance of African Americans. (c) 2015 APA, all rights reserved.

  6. Fluctuation in Spatial Ability Scores during the Menstrual Cycle.

    Science.gov (United States)

    Moody, M. Suzanne

    Whether or not fluctuations in spatial ability as measured by S. G. Vandenberg's Mental Rotations Test occur during the menstrual cycle was studied with 133 female students from 9 undergraduate educational psychology and nursing classes. For comparison, 28 male students also took the test. Scores from 55 females fell into the relevant menstrual…

  7. Discrimination ability of the Energy score

    DEFF Research Database (Denmark)

    Pinson, Pierre; Tastu, Julija

    as appealing since being proper, we show that its discrimination ability may be limited when focusing on the dependence structure of multivariate probabilistic forecasts. For the case of a multivariate Gaussian process, a theoretical upper bound for such discrimination ability is derived and discussed. This limited...... discrimination ability may eventually get compromised by computational and sampling issues, as dimension increases....

  8. Do Test Scores Buy Happiness?

    Science.gov (United States)

    McCluskey, Neal

    2017-01-01

    Since at least the enactment of No Child Left Behind in 2002, standardized test scores have served as the primary measures of public school effectiveness. Yet, such scores fail to measure the ultimate goal of education: maximizing happiness. This exploratory analysis assesses nation level associations between test scores and happiness, controlling…

  9. Predicting occupational personality test scores.

    Science.gov (United States)

    Furnham, A; Drakeley, R

    2000-01-01

    The relationship between students' actual test scores and their self-estimated scores on the Hogan Personality Inventory (HPI; R. Hogan & J. Hogan, 1992), an omnibus personality questionnaire, was examined. Despite being given descriptive statistics and explanations of each of the dimensions measured, the students tended to overestimate their scores; yet all correlations between actual and estimated scores were positive and significant. Correlations between self-estimates and actual test scores were highest for sociability, ambition, and adjustment (r = .62 to r = .67). The results are discussed in terms of employers' use and abuse of personality assessment for job recruitment.

  10. From neural oscillations to reasoning ability: Simulating the effect of the theta-to-gamma cycle length ratio on individual scores in a figural analogy test.

    Science.gov (United States)

    Chuderski, Adam; Andrelczyk, Krzysztof

    2015-02-01

    Several existing computational models of working memory (WM) have predicted a positive relationship (later confirmed empirically) between WM capacity and the individual ratio of theta to gamma oscillatory band lengths. These models assume that each gamma cycle represents one WM object (e.g., a binding of its features), whereas the theta cycle integrates such objects into the maintained list. As WM capacity strongly predicts reasoning, it might be expected that this ratio also predicts performance in reasoning tasks. However, no computational model has yet explained how the differences in the theta-to-gamma ratio found among adult individuals might contribute to their scores on a reasoning test. Here, we propose a novel model of how WM capacity constrains figural analogical reasoning, aimed at explaining inter-individual differences in reasoning scores in terms of the characteristics of oscillatory patterns in the brain. In the model, the gamma cycle encodes the bindings between objects/features and the roles they play in the relations processed. Asynchrony between consecutive gamma cycles results from lateral inhibition between oscillating bindings. Computer simulations showed that achieving the highest WM capacity required reaching the optimal level of inhibition. When too strong, this inhibition eliminated some bindings from WM, whereas, when inhibition was too weak, the bindings became unstable and fell apart or became improperly grouped. The model aptly replicated several empirical effects and the distribution of individual scores, as well as the patterns of correlations found in the 100-person sample attempting the same reasoning task. Most importantly, the model's reasoning performance strongly depended on its theta-to-gamma ratio in the same way as the performance of human participants depended on their WM capacity. The data suggest that proper regulation of oscillations in the theta and gamma bands may be crucial for both high WM capacity and effective complex

  11. Reliability, validity, and minimal detectable change of the push-off test scores in assessing upper extremity weight-bearing ability.

    Science.gov (United States)

    Mehta, Saurabh P; George, Hannah R; Goering, Christian A; Shafer, Danielle R; Koester, Alan; Novotny, Steven

    2017-11-01

    Clinical measurement study. The push-off test (POT) was recently conceived and found to be reliable and valid for assessing weight bearing through an injured wrist or elbow. However, further research with a larger sample can lend credence to the preliminary findings supporting the use of the POT. This study examined the interrater reliability, construct validity, and measurement error for the POT in patients with wrist conditions. Participants with musculoskeletal (MSK) wrist conditions were recruited. Performance on the POT, grip strength, and isometric strength of the wrist extensors were assessed. The shortened version of the Disabilities of the Arm, Shoulder and Hand and numeric pain rating scale were completed. The intraclass correlation coefficient assessed interrater reliability of the POT. Pearson correlation coefficients (r) examined the concurrent relationships between the POT and other measures. The standard error of measurement and the minimal detectable change at 90% confidence interval were assessed as measurement error and index of true change for the POT. A total of 50 participants with different elbow or wrist conditions (age: 48.1 ± 16.6 years) were included in this study. The results of this study strongly supported the interrater reliability (intraclass correlation coefficient: 0.96 and 0.93 for the affected and unaffected sides, respectively) of the POT in patients with wrist MSK conditions. The POT showed convergent relationships with the grip strength on the injured side (r = 0.89) and the wrist extensor strength (r = 0.7). The POT showed a small standard error of measurement (1.9 kg). The minimal detectable change at 90% confidence interval for the POT was 4.4 kg for the sample. This study provides additional evidence to support the reliability and validity of the POT. This is the first study that provides the values for the measurement error and true change on the POT scores in patients with wrist MSK conditions. Further research should examine the
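
    The record above reports a standard error of measurement (SEM) of 1.9 kg and a minimal detectable change at 90% confidence (MDC90) of 4.4 kg, but does not state how the second figure follows from the first. A minimal sketch, assuming the conventional relation MDC90 = 1.645 × √2 × SEM (the formula and the function name below are assumptions, not taken from the abstract):

```python
import math

def minimal_detectable_change(sem, z=1.645):
    """Conventional MDC formula (assumed, not stated in the abstract).

    sem: standard error of measurement, in the units of the test score.
    z:   two-sided z-value for the confidence level (1.645 for 90%).
    The sqrt(2) factor accounts for measurement error in both of the
    two scores being compared.
    """
    return z * math.sqrt(2) * sem

# SEM value reported for the push-off test in the record above.
print(round(minimal_detectable_change(1.9), 1))
```

    Under that assumption the reported SEM reproduces the reported MDC90 to one decimal place, which suggests, without confirming, that the authors used this relation.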

  12. Relationship between candidate communication ability and oral certification examination scores.

    Science.gov (United States)

    Lunz, Mary E; Bashook, Philip G

    2008-12-01

    Structured case-based oral examinations are widely used in medical certifying examinations in the USA. These orals assess the candidate's decision-making skills using real or realistic patient cases. Frequently mentioned but not empirically evaluated is the potential bias introduced by the candidate's communication ability. This study aimed to assess the relationship between candidate communication ability and medical certification oral examination scores. Non-doctor communication observers rated a random sample of 90 candidates on communication ability during a medical oral certification examination. The multi-facet Rasch model was used to analyse the communication survey and the oral examination data. The multi-facet model accounts for observer and examiner severity bias. anova was used to measure differences in communication ability between passing and failing candidates and candidates grouped by level of communication ability. Pearson's correlations were used to compare candidate communication ability and oral certification examination performance. Candidate separation reliability values for the communication survey and the oral examination were 0.85 and 0.97, respectively, suggesting accurate candidate measurement. The correlation between communication scores and oral examination scores was 0.10. No significant difference was found between passing and failing candidates for measured communication ability. When candidates were grouped by high, moderate and low communication ability, there was no significant difference in their oral certification examination performance. Candidates' communication ability has little relationship to candidate performance on high-stakes, case-based oral examinations. Examiners for this certifying examination focused on assessing candidate decision-making ability and were not influenced by candidate communication ability.

  13. Mixed-Format Test Score Equating: Effect of Item-Type Multidimensionality, Length and Composition of Common-Item Set, and Group Ability Difference

    Science.gov (United States)

    Wang, Wei

    2013-01-01

    Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…

  14. Work ability score of solvent-exposed workers.

    Science.gov (United States)

    Furu, Heidi; Sainio, Markku; Hyvärinen, Hanna-Kaisa; Kaukiainen, Ari

    2018-03-28

    Occupational chronic solvent encephalopathy (CSE), characterized by neurocognitive dysfunction, often leads to early retirement. However, only the more severe cases are diagnosed with CSE, and little is known about the work ability of solvent-exposed workers in general. The aim was to study memory and concentration symptoms, work ability and the effect of both solvent-related and non-occupational factors on work ability, in an actively working solvent-exposed population. A questionnaire on exposure and health was sent to 3640 workers in four solvent-exposed fields, i.e. painters and floor-layers, boat builders, printers, and metal workers. The total number of responses was 1730. We determined the work ability score (WAS), a single question item of the Work Ability Index, and studied solvent exposure, demographic factors, Euroquest memory and concentration symptoms, chronic diseases, and employment status using univariate and multivariate analyses. The findings were compared to those of a corresponding national blue-collar reference population (n = 221), and a small cohort of workers with CSE (n = 18). The proportion of workers with memory and concentration symptoms was significantly associated with solvent exposure. The WAS of solvent-exposed workers was lower than that of the national blue-collar reference group, and the difference was significant in the oldest age group (those aged over 60). Solvent-exposed workers' WAS were higher than those of workers diagnosed with CSE. The WAS were lowest among painters and floor-layers, followed by metal workers and printers, and highest among boat builders. The strongest explanatory factors for poor work ability were the number of chronic diseases, age and employment status. Solvent exposure was a weak independent risk factor for reduced WAS, comparable to a level of high alcohol consumption. Even if memory and concentration symptoms were associated with higher solvent exposure, the effect of solvents on self

  15. TIE: An Ability Test of Emotional Intelligence

    Science.gov (United States)

    Śmieja, Magdalena; Orzechowski, Jarosław; Stolarski, Maciej S.

    2014-01-01

    The Test of Emotional Intelligence (TIE) is a new ability scale based on a theoretical model that defines emotional intelligence as a set of skills responsible for the processing of emotion-relevant information. Participants are provided with descriptions of emotional problems, and asked to indicate which emotion is most probable in a given situation, or to suggest the most appropriate action. Scoring is based on the judgments of experts: professional psychotherapists, trainers, and HR specialists. The validation study showed that the TIE is a reliable and valid test, suitable for both scientific research and individual assessment. Its internal consistency measures were as high as .88. In line with theoretical model of emotional intelligence, the results of the TIE shared about 10% of common variance with a general intelligence test, and were independent of major personality dimensions. PMID:25072656

  16. TIE: an ability test of emotional intelligence.

    Science.gov (United States)

    Śmieja, Magdalena; Orzechowski, Jarosław; Stolarski, Maciej S

    2014-01-01

    The Test of Emotional Intelligence (TIE) is a new ability scale based on a theoretical model that defines emotional intelligence as a set of skills responsible for the processing of emotion-relevant information. Participants are provided with descriptions of emotional problems, and asked to indicate which emotion is most probable in a given situation, or to suggest the most appropriate action. Scoring is based on the judgments of experts: professional psychotherapists, trainers, and HR specialists. The validation study showed that the TIE is a reliable and valid test, suitable for both scientific research and individual assessment. Its internal consistency measures were as high as .88. In line with theoretical model of emotional intelligence, the results of the TIE shared about 10% of common variance with a general intelligence test, and were independent of major personality dimensions.

  17. TIE: an ability test of emotional intelligence.

    Directory of Open Access Journals (Sweden)

    Magdalena Śmieja

    The Test of Emotional Intelligence (TIE) is a new ability scale based on a theoretical model that defines emotional intelligence as a set of skills responsible for the processing of emotion-relevant information. Participants are provided with descriptions of emotional problems, and asked to indicate which emotion is most probable in a given situation, or to suggest the most appropriate action. Scoring is based on the judgments of experts: professional psychotherapists, trainers, and HR specialists. The validation study showed that the TIE is a reliable and valid test, suitable for both scientific research and individual assessment. Its internal consistency measures were as high as .88. In line with theoretical model of emotional intelligence, the results of the TIE shared about 10% of common variance with a general intelligence test, and were independent of major personality dimensions.

  18. Work ability assessment in a worker population: comparison and determinants of Work Ability Index and Work Ability score.

    Science.gov (United States)

    El Fassi, Mehdi; Bocquet, Valery; Majery, Nicole; Lair, Marie Lise; Couffignal, Sophie; Mairiaux, Philippe

    2013-04-08

    Public authorities in European countries are paying increasing attention to the promotion of work ability throughout working life and the best method to monitor work ability in populations of workers is becoming a significant question. The present study aims to compare the assessment of work ability based on the use of the Work Ability Index (WAI), a 7-item questionnaire, with another one based on the use of WAI's first item, which consists in the worker's self-assessment of his/her current work ability level as opposed to his/her lifetime best, this single question being termed "Work Ability score" (WAS). Using a database created by an occupational health service, the study intends to answer the following questions: could the assessment of work ability be based on a single-item measure and which are the variables significantly associated with self-reported work ability among those systematically recorded by the occupational physician during health examinations? A logistic regression model was used in order to estimate the probability of observing "poor" or "moderate" WAI levels depending on age, gender, body mass index, smoking status, position held, firm size and diseases reported by the worker in a population of workers aged 40 to 65 and examined between January 2006 and June 2010 (n=12389). The convergent validity between WAS and WAI was statistically significant (rs=0.63). In the multivariable model, age (pwork ability. A work position characterized by the predominance of mental activity (OR=0.71, 95%CI [0.61-0.84]) had a favourable impact on work ability. These relations were observed regardless of the work ability measurement tool used. The convergent validity and the similarity in results between WAI and WAS observed in a large population of employed workers should thus foster the use of WAS for systematic screening of work ability. Ageing, overweight, decline in health status, holding a mostly physical job and working in a large-sized firm increase the

  19. Summary of Score Changes (in other Tests).

    Science.gov (United States)

    Cleary, T. Anne; McCandless, Sam A.

    Scholastic Aptitude Test (SAT) scores have declined during the last 14 years. Similar score declines have been observed in many different testing programs, many groups, and tested areas. The declines, while not large in any given year, have been consistent over time, area, and group. The period around 1965 is critical for the interpretation of…

  20. A Human Capital Model of Educational Test Scores

    DEFF Research Database (Denmark)

    McIntosh, James; D. Munk, Martin

    Latent class Poisson count models are used to analyze a sample of Danish test score results from a cohort of individuals born in 1954-55 and tested in 1968. The procedure takes account of unobservable effects as well as excessive zeros in the data. The bulk of unobservable effects are uncorrelated...... with observable parental attributes and, thus, are environmental rather than genetic in origin. We show that the test scores measure manifest or measured ability as it has evolved over the life of the respondent and is, thus, more a product of the human capital formation process than some latent or fundamental...... measure of pure cognitive ability. We find that variables which are not closely associated with traditional notions of intelligence explain a significant proportion of the variation in test scores. This adds to the complexity of interpreting test scores and suggests that school culture, attitudes...

  1. Work ability as prognostic risk marker of disability pension : Single-item work ability score versus multi-item work ability index

    NARCIS (Netherlands)

    Roelen, C.A.M.; Rhenen, van W.; Groothoff, J.W.; Klink, van der J.J.L.; Twisk, J.W.R.; Heymans, M.W.

    2014-01-01

    Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP.

  2. Work ability as prognostic risk marker of disability pension: single-item work ability score versus multi-item work ability index

    NARCIS (Netherlands)

    Roelen, C.A.M.; van Rhenen, W.; Groothoff, J.W.; van der Klink, J.J.L.; Twisk, J.W.R.; Heymans, M.W.

    2014-01-01

    Objectives Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP. Methods This

  3. Work ability as prognostic risk marker of disability pension : single-item work ability score versus multi-item work ability index

    NARCIS (Netherlands)

    Roelen, Corne A. M.; van Rhenen, Willem; Groothoff, Johan W.; van der Klink, Jac J. L.; Twisk, Jos W. R.; Heymans, Martijn W.

    Objectives Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP. Methods This

  4. Accountancy, teaching methods, sex, and American College Test scores.

    Science.gov (United States)

    Heritage, J; Harper, B S; Harper, J P

    1990-10-01

    This study examines the significance of sex, methodology, academic preparation, and age as related to development of judgmental and problem-solving skills. Sex, American College Test (ACT) Mathematics scores, Composite ACT scores, grades in course work, grade point average (GPA), and age were used in studying the effects of teaching method on 96 students' ability to analyze data in financial statements. Results reflect positively on accounting students compared to the general college population and the women students in particular.

  5. Measurement of ability emotional intelligence: results for two new tests.

    Science.gov (United States)

    Austin, Elizabeth J

    2010-08-01

    Emotional intelligence (EI) has attracted considerable interest amongst both individual differences researchers and those in other areas of psychology who are interested in how EI relates to criteria such as well-being and career success. Both trait (self-report) and ability EI measures have been developed; the focus of this paper is on ability EI. The associations of two new ability EI tests with psychometric intelligence, emotion perception, and the Mayer-Salovey-Caruso EI test (MSCEIT) were examined. The new EI tests were the Situational Test of Emotion Management (STEM) and the Situational Test of Emotional Understanding (STEU). Only the STEU and the MSCEIT Understanding Emotions branch were significantly correlated with psychometric intelligence, suggesting that only understanding emotions can be regarded as a candidate new intelligence component. These understanding emotions tests were also positively correlated with emotion perception tests, and STEM and STEU scores were positively correlated with MSCEIT total score and most branch scores. Neither the STEM nor the STEU were significantly correlated with trait EI tests, confirming the distinctness of trait and ability EI. Taking the present results as a starting-point, approaches to the development of new ability EI tests and models of EI are suggested.

  6. Some procedures for computerized ability testing

    NARCIS (Netherlands)

    van der Linden, Willem J.; Zwarts, Michel A.

    1989-01-01

    For computerized test systems to be operational, the use of item response theory is a prerequisite. As opposed to classical test theory, in item response models the abilities of the examinees and the properties of the items are parameterized separately. Hence, when measuring the abilities of

  7. What Do Test Scores Really Mean? A Latent Class Analysis of Danish Test Score Performance

    DEFF Research Database (Denmark)

    Munk, Martin D.; McIntosh, James

    2014-01-01

    Latent class Poisson count models are used to analyze a sample of Danish test score results from a cohort of individuals born in 1954-55, tested in 1968, and followed until 2011. The procedure takes account of unobservable effects as well as excessive zeros in the data. We show that the test scores...... of intelligence explain a significant proportion of the variation in test scores. This adds to the complexity of interpreting test scores and suggests that school culture and possible incentive problems make it more difficult to understand what the tests measure....

  8. What do educational test scores really measure?

    DEFF Research Database (Denmark)

    McIntosh, James; D. Munk, Martin

    Latent class Poisson count models are used to analyze a sample of Danish test score results from a cohort of individuals born in 1954-55 and tested in 1968. The procedure takes account of unobservable effects as well as excessive zeros in the data. The bulk of unobservable effects are uncorrelate......, and possible incentive problems make it more difficult to elicit true values of what the tests measure....

  9. Work ability as prognostic risk marker of disability pension: single-item work ability score versus multi-item work ability index.

    Science.gov (United States)

    Roelen, Corné A M; van Rhenen, Willem; Groothoff, Johan W; van der Klink, Jac J L; Twisk, Jos W R; Heymans, Martijn W

    2014-07-01

    Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP. This prospective cohort study comprised 11 537 male construction workers, who completed the WAI at baseline and reported DP after a mean 2.3 years of follow-up. WAS and WAI were calibrated for DP risk predictions with the Hosmer-Lemeshow (H-L) test and their ability to discriminate between high- and low-risk construction workers was investigated with the area under the receiver operating characteristic curve (AUC). At follow-up, 336 (3%) construction workers reported DP. Both WAS [odds ratio (OR) 0.72, 95% confidence interval (95% CI) 0.66-0.78] and WAI (OR 0.57, 95% CI 0.52-0.63) scores were associated with DP at follow-up. The WAS showed miscalibration (H-L model χ²=10.60; df=3; P=0.01) and poorly discriminated between high- and low-risk construction workers (AUC 0.67, 95% CI 0.64-0.70). In contrast, calibration (H-L model χ²=8.20; df=8; P=0.41) and discrimination (AUC 0.78, 95% CI 0.75-0.80) were both adequate for the WAI. Although associated with the risk of future DP, the single-item WAS poorly identified male construction workers at risk of DP. We recommend using the multi-item WAI to screen for risk of DP in occupational health practice.

  10. Gender, Stereotype Threat and Mathematics Test Scores

    OpenAIRE

    Ming Tsui; Xiao Y. Xu; Edmond Venator

    2011-01-01

    Problem statement: Stereotype threat has repeatedly been shown to depress women's scores on difficult math tests. An attempt to replicate these findings in China found no support for the stereotype threat hypothesis. Our math test was characterized as being personally important for the student participants, an atypical condition in most stereotype threat laboratory research. Approach: To evaluate the effects of this personal demand, we conducted three experiments. Results: ...

  11. Exploring a Source of Uneven Score Equity across the Test Score Range

    Science.gov (United States)

    Huggins-Manley, Anne Corinne; Qiu, Yuxi; Penfield, Randall D.

    2018-01-01

    Score equity assessment (SEA) refers to an examination of population invariance of equating across two or more subpopulations of test examinees. Previous SEA studies have shown that score equity may be present for examinees scoring at particular test score ranges but absent for examinees scoring at other score ranges. No studies to date have…

  12. Work ability assessment in a worker population: comparison and determinants of Work Ability Index and Work Ability score

    OpenAIRE

    El Fassi, Mehdi; Bocquet, Valery; Majery, Nicole; Lair, Marie Lise; Couffignal, Sophie; Mairiaux, Philippe

    2013-01-01

    Background Public authorities in European countries are paying increasing attention to the promotion of work ability throughout working life and the best method to monitor work ability in populations of workers is becoming a significant question. The present study aims to compare the assessment of work ability based on the use of the Work Ability Index (WAI), a 7-item questionnaire, with another one based on the use of WAI's first item, which consists in the worker's self-assessment of his/he...

  13. The computer game training effect for women may depend on initial spatial ability scores

    OpenAIRE

    Iversen, Robert

    2010-01-01

    In this project we tried to explore what it is in games that may enhance spatial abilities. Previous research has shown that action games may enhance gamers’ scores on the Mental Rotation test (MRT), while the evidence on whether puzzle games do the same is mixed. We used three different games, and one control group, with a total of 32 participants matched over these four groups. The games were Medal of Honor: Pacific Assault, which has been used as an action game in previous...

  14. ITC Guidelines on Quality Control in Scoring, Test Analysis, and Reporting of Test Scores

    Science.gov (United States)

    Allalouf, Avi

    2014-01-01

    The Quality Control (QC) Guidelines are intended to increase the efficiency, precision, and accuracy of the scoring, analysis, and reporting process of testing. The QC Guidelines focus on large-scale testing operations where multiple forms of tests are created for use on set dates. However, they may also be used for a wide variety of other testing…

  15. Validating the Interpretations and Uses of Test Scores

    Science.gov (United States)

    Kane, Michael T.

    2013-01-01

    To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…

  16. Facilitating the Interpretation of English Language Proficiency Scores: Combining Scale Anchoring and Test Score Mapping Methodologies

    Science.gov (United States)

    Powers, Donald; Schedl, Mary; Papageorgiou, Spiros

    2017-01-01

    The aim of this study was to develop, for the benefit of both test takers and test score users, enhanced "TOEFL ITP"® test score reports that go beyond the simple numerical scores that are currently reported. To do so, we applied traditional scale anchoring (proficiency scaling) to item difficulty data in order to develop performance…

  17. Biering-Sorensen test scores in coal miners

    Energy Technology Data Exchange (ETDEWEB)

    Tekin, Y.; Ortancil, O.; Ankarali, H.; Basaran, A.; Sarikaya, S.; Ozdolap, S. [Zonguldak Karaelmas University, Zonguldak (Turkey)]

    2009-05-15

    Biering-Sorensen test is an isometric back endurance test. Biering-Sorensen test scores have varied in different cultural and occupational groups. The aims of this study were to collect normative data on Biering-Sorensen holding times, to determine the discriminative ability of the Biering-Sorensen test in Turkish coal miners, and to examine the association between Biering-Sorensen test result and functional disability. One hundred and fifty male coal miners participated in this study. Trunk extensor muscle strength was measured using the Biering-Sorensen test. Oswestry disability index was used to measure the functional disability level of low back pain. The mean Biering-Sorensen holding time for the total subject group was 107.3 ± 22.5 s. The mean time of Biering-Sorensen test of the subjects with and without low back pain were 99.9 ± 19.8 and 128.6 ± 15.2 s, respectively. The difference between the subjects with and without low back pain was statistically significant (p < 0.001). There was a statistically significant negative correlation between Oswestry functional disability score and Biering-Sorensen holding time (R = -0.824, p < 0.001). Turkish coal miners have low mean back extensor endurance holding times. Biering-Sorensen test had a good discriminative ability in our study group. Trunk muscle strength has a significant effect on the disability level of low back pain. Thus trunk muscle endurance training exercise therapy may be effective for the reduction of disability in patients with low back pain.

  18. The Truth about Scores Children Achieve on Tests.

    Science.gov (United States)

    Brown, Jonathan R.

    1989-01-01

    The importance of using the standard error of measurement (SEm) in determining reliability in test scores is emphasized. The SEm is compared to the hypothetical true score for standardized tests, and procedures for calculation of the SEm are explained. (JDD)
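
    The record above refers to procedures for calculating the SEm without reproducing them. A minimal sketch of the standard classical-test-theory formula, SEm = SD × sqrt(1 − reliability), follows. The function names and the example numbers are hypothetical and do not come from the article, and the band around the observed score is the common textbook approximation rather than necessarily the procedure the author describes.

```python
import math


def standard_error_of_measurement(sd, reliability):
    """Classical test theory: SEm = SD * sqrt(1 - reliability)."""
    return sd * math.sqrt(1.0 - reliability)


def score_band(observed, sd, reliability, z=1.96):
    """Approximate band around an observed score (z = 1.96 for about 95%)."""
    sem = standard_error_of_measurement(sd, reliability)
    return observed - z * sem, observed + z * sem


# Hypothetical test: SD = 15, reliability = 0.91, observed score = 100.
sem = standard_error_of_measurement(15.0, 0.91)
low, high = score_band(100.0, 15.0, 0.91)
```

    With these illustrative inputs the SEm is about 4.5 points, so a single observed score is better read as a range of roughly nine points either side than as an exact value.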

  19. School accountability and the black-white test score gap.

    Science.gov (United States)

    Gaddis, S Michael; Lauen, Douglas Lee

    2014-03-01

    Since at least the 1960s, researchers have closely examined the respective roles of families, neighborhoods, and schools in producing the black-white achievement gap. Although many researchers minimize the ability of schools to eliminate achievement gaps, the No Child Left Behind Act (NCLB) increased pressure on schools to do so by 2014. In this study, we examine the effects of NCLB's subgroup-specific accountability pressure on changes in black-white math and reading test score gaps using a school-level panel dataset on all North Carolina public elementary and middle schools between 2001 and 2009. Using difference-in-difference models with school fixed effects, we find that accountability pressure reduces black-white achievement gaps by raising mean black achievement without harming mean white achievement. We find no differential effects of accountability pressure based on the racial composition of schools, but schools with more affluent populations are the most successful at reducing the black-white math achievement gap. Thus, our findings suggest that school-based interventions have the potential to close test score gaps, but differences in school composition and resources play a significant role in the ability of schools to reduce racial inequality. Copyright © 2013 Elsevier Inc. All rights reserved.

  20. ISSUE PAPER: What Do Test Scores in Texas Tell Us?

    National Research Council Canada - National Science Library

    Klein, Stephen

    2000-01-01

    ...) about possible unintended consequences of these programs. We conducted several analyses to examine the issue of whether TAAS scores can be trusted to provide an accurate index of student skills and abilities...

  1. Data-driven efficient score tests for deconvolution hypotheses

    NARCIS (Netherlands)

    Langovoy, M.

    2008-01-01

    We consider testing statistical hypotheses about densities of signals in deconvolution models. A new approach to this problem is proposed. We constructed score tests for the deconvolution density testing with the known noise density and efficient score tests for the case of unknown density. The

  2. Adaptive testing with equated number-correct scoring

    NARCIS (Netherlands)

    van der Linden, Willem J.

    1999-01-01

    A constrained CAT algorithm is presented that automatically equates the number-correct scores on adaptive tests. The algorithm can be used to equate number-correct scores across different administrations of the same adaptive test as well as to an external reference test. The constraints are derived

  3. Improving personality facet scores with multidimensional computer adaptive testing

    DEFF Research Database (Denmark)

    Makransky, Guido; Mortensen, Erik Lykke; Glas, Cees A W

    2013-01-01

    personality tests contain many highly correlated facets. This article investigates the possibility of increasing the precision of the NEO PI-R facet scores by scoring items with multidimensional item response theory and by efficiently administering and scoring items with multidimensional computer adaptive...

  4. Improving Scores on the IELTS Speaking Test

    Science.gov (United States)

    Issitt, Steve

    2008-01-01

    This article presents three strategies for teaching students who are taking the IELTS speaking test. The first strategy is aimed at improving confidence and uses a variety of self-help materials from the field of popular psychology. The second encourages students to think critically and invokes a range of academic perspectives. The third strategy…

  5. Item selection and ability estimation adaptive testing

    NARCIS (Netherlands)

    Pashley, Peter J.; van der Linden, Wim J.; van der Linden, Willem J.; Glas, Cornelis A.W.; Glas, Cees A.W.

    2010-01-01

    The last century saw a tremendous progression in the refinement and use of standardized linear tests. The first administered College Board exam occurred in 1901 and the first Scholastic Assessment Test (SAT) was given in 1926. Since then, progressively more sophisticated standardized linear tests

  6. Group differences in the heritability of items and test scores

    NARCIS (Netherlands)

    Wicherts, J.M.; Johnson, W.

    2009-01-01

    It is important to understand potential sources of group differences in the heritability of intelligence test scores. On the basis of a basic item response model we argue that heritabilities which are based on dichotomous item scores normally do not generalize from one sample to the next. If groups

  7. Testing the applicability of the SASS5 scoring procedure for ...

    African Journals Online (AJOL)

    A study was undertaken between 29th January and 17th February 2004 to test the applicability of the South African Scoring System Version 5 (SASS5) scoring and calculation procedure in nutrient-enriched palustrine wetlands in the midlands of KwaZulu-Natal, South Africa. Four reference wetlands and three dairy-effluent ...

  8. Evaluating the Predictive Validity of Graduate Management Admission Test Scores

    Science.gov (United States)

    Sireci, Stephen G.; Talento-Miller, Eileen

    2006-01-01

    Admissions data and first-year grade point average (GPA) data from 11 graduate management schools were analyzed to evaluate the predictive validity of Graduate Management Admission Test® (GMAT®) scores and the extent to which predictive validity held across sex and race/ethnicity. The results indicated GMAT verbal and quantitative scores had…

  9. From Test Scores to Language Use: Emergent Bilinguals Using English to Accomplish Academic Tasks

    Science.gov (United States)

    Rodriguez-Mojica, Claudia

    2018-01-01

    Prominent discourses about emergent bilinguals' academic abilities tend to focus on performance as measured by test scores and perpetuate the message that emergent bilinguals trail far behind their peers. When we remove the constraints of formal testing situations, what can emergent bilinguals do in English as they engage in naturally occurring…

  10. Relative Merits of Four Methods for Scoring Cloze Tests.

    Science.gov (United States)

    Brown, James Dean

    1980-01-01

    Describes study comparing merits of exact answer, acceptable answer, clozentropy and multiple choice methods for scoring tests. Results show differences among reliability, mean item facility, discrimination and usability, but not validity. (BK)

  11. Experimental testing of exchangeable cutting inserts cutting ability

    OpenAIRE

    Čep, Robert; Janásek, Adam; Čepová, Lenka; Petrů, Jana; Hlavatý, Ivo; Car, Zlatan; Hatala, Michal

    2013-01-01

    The article deals with experimental testing of the cutting ability of exchangeable cutting inserts. Eleven types of exchangeable cutting inserts from five different manufacturers were tested. The tested cutting inserts were of the same shape and were different especially in material and coating types. The main aim was both to select a suitable test for determination of the cutting ability of exchangeable cutting inserts and to design such testing procedure that could make it possible...

  12. Prediction of true test scores from observed item scores and ancillary data.

    Science.gov (United States)

    Haberman, Shelby J; Yao, Lili; Sinharay, Sandip

    2015-05-01

    In many educational tests which involve constructed responses, a traditional test score is obtained by adding together item scores obtained through holistic scoring by trained human raters. For example, this practice was used until 2008 in the case of GRE® General Analytical Writing and until 2009 in the case of TOEFL® iBT Writing. With use of natural language processing, it is possible to obtain additional information concerning item responses from computer programs such as e-rater®. In addition, available information relevant to examinee performance may include scores on related tests. We suggest application of standard results from classical test theory to the available data to obtain best linear predictors of true traditional test scores. In performing such analysis, we require estimation of variances and covariances of measurement errors, a task which can be quite difficult in the case of tests with limited numbers of items and with multiple measurements per item. As a consequence, a new estimation method is suggested based on samples of examinees who have taken an assessment more than once. Such samples are typically not random samples of the general population of examinees, so that we apply statistical adjustment methods to obtain the needed estimated variances and covariances of measurement errors. To examine practical implications of the suggested methods of analysis, applications are made to GRE General Analytical Writing and TOEFL iBT Writing. Results obtained indicate that substantial improvements are possible both in terms of reliability of scoring and in terms of assessment reliability. © 2015 The British Psychological Society.

  13. Predicting Freshman Grade Point Average From College Admissions Test Scores and State High School Test Scores

    OpenAIRE

    Koretz, Daniel; Yu, C; Mbekeani, Preeya Pandya; Langi, M.; Dhaliwal, Tasminda Kaur; Braslow, David Arthur

    2016-01-01

    The current focus on assessing “college and career readiness” raises an empirical question: How do high school tests compare with college admissions tests in predicting performance in college? We explored this using data from the City University of New York and public colleges in Kentucky. These two systems differ in the choice of college admissions test, the stakes for students on the high school test, and demographics. We predicted freshman grade point average (FGPA) from high school GPA an...

  14. Whole-word response scoring underestimates functional spelling ability for some individuals with global agraphia

    Directory of Open Access Journals (Sweden)

    Andrew Tesla Demarco

    2015-05-01

    These data suggest that conventional whole-word scoring may significantly underestimate functional spelling performance. Because by-letter scoring boosted pre-treatment scores to the same extent as post-treatment scores, the magnitude of treatment gains was no greater than estimates from conventional whole-word scoring. Nonetheless, the surprisingly large disparity between conventional whole-word scoring and by-letter scoring suggests that by-letter scoring methods may warrant further investigation. Because by-letter analyses may hold interest to others, we plan to make the software tool used in this study available on-line for use to researchers and clinicians at large.

  15. High Test Scores: The Wrong Road to National Economic Success

    Science.gov (United States)

    Baker, Keith

    2011-01-01

    A widely held view is that good schools are essential to a nation's international economic success and that high test scores on international tests of academic skills and knowledge indicate how good a nation's schools are. The widespread belief that good schools are an important contributor to a nation's economic success in the world is supported…

  16. Predicting Freshman Grade Point Average From College Admissions Test Scores and State High School Test Scores

    Directory of Open Access Journals (Sweden)

    Daniel Koretz

    2016-09-01

    The current focus on assessing “college and career readiness” raises an empirical question: How do high school tests compare with college admissions tests in predicting performance in college? We explored this using data from the City University of New York and public colleges in Kentucky. These two systems differ in the choice of college admissions test, the stakes for students on the high school test, and demographics. We predicted freshman grade point average (FGPA) from high school GPA and both college admissions and high school tests in mathematics and English. In both systems, the choice of tests had only trivial effects on the aggregate prediction of FGPA. Adding either test to an equation that included the other had only trivial effects on prediction. Although the findings suggest that the choice of test might advantage or disadvantage different students, it had no substantial effect on the over- and underprediction of FGPA for students classified by race-ethnicity or poverty.

  17. Explaining the black-white gap in cognitive test scores: Toward a theory of adverse impact.

    Science.gov (United States)

    Cottrell, Jonathan M; Newman, Daniel A; Roisman, Glenn I

    2015-11-01

    In understanding the causes of adverse impact, a key parameter is the Black-White difference in cognitive test scores. To advance theory on why Black-White cognitive ability/knowledge test score gaps exist, and on how these gaps develop over time, the current article proposes an inductive explanatory model derived from past empirical findings. According to this theoretical model, Black-White group mean differences in cognitive test scores arise from the following racially disparate conditions: family income, maternal education, maternal verbal ability/knowledge, learning materials in the home, parenting factors (maternal sensitivity, maternal warmth and acceptance, and safe physical environment), child birth order, and child birth weight. Results from a 5-wave longitudinal growth model estimated on children in the NICHD Study of Early Child Care and Youth Development from ages 4 through 15 years show significant Black-White cognitive test score gaps throughout early development that did not grow significantly over time (i.e., significant intercept differences, but not slope differences). Importantly, the racially disparate conditions listed above can account for the relation between race and cognitive test scores. We propose a parsimonious 3-Step Model that explains how cognitive test score gaps arise, in which race relates to maternal disadvantage, which in turn relates to parenting factors, which in turn relate to cognitive test scores. This model and results offer to fill a need for theory on the etiology of the Black-White ethnic group gap in cognitive test scores, and attempt to address a missing link in the theory of adverse impact. (c) 2015 APA, all rights reserved.

  18. Test-retest reliability of the Work Ability Index questionnaire

    NARCIS (Netherlands)

    de Zwart, B. C. H.; Frings-Dresen, M. H. W.; Van Duivenbooden, J. C.

    2002-01-01

    The goal of the study was to assess the test-retest reliability of the Work Ability Index (WAI) questionnaire. Reliability was tested using a test-retest design with a 4 week interval between measurements. Valid data were collected among 97 elderly construction workers aged 40 years and older. We

  19. A prognostic scoring system for arm exercise stress testing.

    Science.gov (United States)

    Xie, Yan; Xian, Hong; Chandiramani, Pooja; Bainter, Emily; Wan, Leping; Martin, Wade H

    2016-01-01

    Arm exercise stress testing may be an equivalent or better predictor of mortality outcome than pharmacological stress imaging for the ≥50% of patients unable to perform leg exercise. Thus, our objective was to develop an arm exercise ECG stress test scoring system, analogous to the Duke Treadmill Score, for predicting outcome in these individuals. In this retrospective observational cohort study, arm exercise ECG stress tests were performed in 443 consecutive veterans aged 64.1 (11.1) years (mean (SD)) between 1997 and 2002. From multivariate Cox models, arm exercise scores were developed for prediction of 5-year and 12-year all-cause and cardiovascular mortality and 5-year cardiovascular mortality or myocardial infarction (MI). Arm exercise capacity in resting metabolic equivalents (METs), 1 min heart rate recovery (HRR) and ST segment depression ≥1 mm were the stress test variables independently associated with all-cause and cardiovascular mortality by step-wise Cox analysis (all pstatistic of 0.81 before and 0.88 after adjustment for significant demographic and clinical covariates. Arm exercise scores for the other outcome end points yielded C-statistic values of 0.77-0.79 before and 0.82-0.86 after adjustment for significant covariates versus 0.64-0.72 for best fit pharmacological myocardial perfusion imaging models in a cohort of 1730 veterans who were evaluated over the same time period. Arm exercise scores, analogous to the Duke Treadmill Score, have good power for prediction of mortality or MI in patients who cannot perform leg exercise.

  20. A Latent Class Approach to Estimating Test-Score Reliability

    Science.gov (United States)

    van der Ark, L. Andries; van der Palm, Daniel W.; Sijtsma, Klaas

    2011-01-01

    This study presents a general framework for single-administration reliability methods, such as Cronbach's alpha, Guttman's lambda-2, and method MS. This general framework was used to derive a new approach to estimating test-score reliability by means of the unrestricted latent class model. This new approach is the latent class reliability…
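
    For orientation, Cronbach's alpha, one of the single-administration methods named in the record above, can be computed from a persons-by-items score table as alpha = k/(k − 1) × (1 − sum of item variances / variance of total scores). The sketch below is a generic illustration of that textbook formula only; it does not implement the latent class reliability approach the study proposes, and the function name and toy data are hypothetical.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of per-person item-score lists.

    rows[i][j] is person i's score on item j; needs at least two
    persons and two items, and non-constant total scores.
    """
    k = len(rows[0])
    item_variances = [variance([row[j] for row in rows]) for j in range(k)]
    total_variance = variance([sum(row) for row in rows])
    return (k / (k - 1)) * (1.0 - sum(item_variances) / total_variance)


# Hypothetical toy data: three items that rank four persons identically.
alpha = cronbach_alpha([[1, 1, 1], [2, 2, 2], [3, 3, 3], [4, 4, 4]])
```

    Perfectly consistent items, as in the toy table, give an alpha of 1; less consistent items pull the ratio of summed item variances to total variance up and alpha down.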

  1. Source Country Differences in Test Score Gaps: Evidence from Denmark

    Science.gov (United States)

    Rangvid, Beatrice Schindler

    2010-01-01

    We combine data from three studies for Denmark in the PISA 2000 framework to investigate differences in the native-immigrant test score gap by country of origin. In addition to the controls available from PISA data sources, we use student-level data on home background and individual migration histories linked from administrative registers. We find…

  2. Racial Differences in Mathematics Test Scores for Advanced Mathematics Students

    Science.gov (United States)

    Minor, Elizabeth Covay

    2016-01-01

    Research on achievement gaps has found that achievement gaps are larger for students who take advanced mathematics courses compared to students who do not. Focusing on the advanced mathematics student achievement gap, this study found that African American advanced mathematics students have significantly lower test scores and are less likely to be…

  3. Manual for Scoring the Test of Directed Imagination.

    Science.gov (United States)

    Veldman, Donald J.; And Others

    A scoring manual for the Directed Imagination Test, a projective technique wherein the subject is instructed to write four fictional stories (four minutes are allowed for each) about teachers and their experiences, is presented. The manual provides detailed instructions for rating each story by fifteen dimensions relevant to teacher education…

  4. America's Mediocre Test Scores: Education Crisis or Poverty Crisis?

    Science.gov (United States)

    Petrilli, Michael J.; Wright, Brandon L.

    2016-01-01

    At a time when the national conversation is focused on lagging upward mobility, it is no surprise that many educators point to poverty as the explanation for mediocre test scores among U.S. students compared to those of students in other countries. If American teachers in struggling U.S. schools taught in Finland, says Finnish educator Pasi…

  5. Job applicants’ attitudes towards cognitive ability and personality testing

    Directory of Open Access Journals (Sweden)

    Rachelle Visser

    2017-10-01

    Orientation: Growing research has shown that not only test validity considerations but also the test-taking attitudes of job applicants are important in the choice of selection instruments as these can contribute to test performance and the perceived fairness of the selection process. Research purpose: The main purpose of this study was to determine the test-taking attitudes of a diverse group of job applicants towards personality and cognitive ability tests administered conjointly online as part of employee selection in a financial services company in South Africa. Motivation for the study: If users understand how job applicants view specific test types, they will know which assessments are perceived more negatively and how this situation can potentially be rectified. Research design, approach and method: A non-experimental and cross-sectional survey design was used. An adapted version of the Test Attitude Survey was used to determine job applicants’ attitudes towards tests administered online as part of an employee selection process. The sample consisted of a group of job applicants (N = 160) who were diverse in terms of ethnicity and age and the educational level applicable for sales and supervisory positions. Main findings: On average, the job applicants responded equally positively to the cognitive ability and personality tests. The African job applicants had a statistically significantly more positive attitude towards the tests than the other groups, and candidates applying for the sales position viewed the cognitive ability tests significantly less positively than the personality test. Practical and managerial implications: The choice of selection tests used in combination as well as the testing conditions that are applicable should be considered carefully as they are the factors that can potentially influence the test-taking motivation and general test-taking attitudes of job applicants. Contribution: This study consolidated the

  6. Basketball ability testing and category for players with mental retardation: 8-month training effect.

    Science.gov (United States)

    Franciosi, Emanuele; Gallotta, Maria Chiara; Baldari, Carlo; Emerenziani, Gian Pietro; Guidetti, Laura

    2012-06-01

    Although sport for athletes with mental retardation (MR) is achieving an important role, the literature concerning basketball tests and training is still poor. The aims of this study were to verify whether the basketball test battery could be an appropriate modality to classify the players in the Promotion (Pro) category, to assess basketball abilities before (PRE) and after (POST) an 8-month training in players with MR in relation to Competitive (Comp) and Pro categories, to analyze the variation of specific basketball abilities based on subjects' MR diagnosis. Forty-one male basketball players with MR (17 Comp and 24 Pro; age range 18-45 years; MR: 15% mild, 54% moderate, 29% severe, and 2% profound) were assessed PRE and POST training through the basketball test battery, which assessed 4 ability levels of increasing difficulty (from I to IV), each one characterized by the analysis of fundamental areas (ball handling, reception, passing, and shooting). Level I was significantly changed after the intervention period regardless of the Category, whereas shooting was affected by the interaction between Category and Intervention. The results showed significant differences between categories in the scores of individual global, level I, level II, level III, and in all fundamental areas. Individual global score in both categories significantly increased. The players of Comp significantly improved in level III, in ball handling, reception, passing, and shooting scores. The players of Pro improved significantly in level II, in ball handling, reception, and passing scores. Individual global, ability levels I-III, and fundamental area scores were negatively correlated to the MR level indicating that the players with a lower MR obtained higher ability scores. In conclusion, it was found that the basketball test battery could be useful for improving and monitoring training in both Comp and Pro players.

  7. The Effect of Black Peers on Black Test Scores

    Science.gov (United States)

    Armor, David J.; Duck, Stephanie

    2007-01-01

    Recent studies have used increasingly complex methodologies to estimate the effect of peer characteristics--race, poverty, and ability--on student achievement. A paper by Hanushek, Kain, and Rivkin using Texas state testing data has received particularly wide attention because it found a large negative effect of school percent black on black math…

  8. Statistical tests for equal predictive ability across multiple forecasting methods

    DEFF Research Database (Denmark)

    Borup, Daniel; Thyrsgaard, Martin

    We develop a multivariate generalization of the Giacomini-White tests for equal conditional predictive ability. The tests are applicable to a mixture of nested and non-nested models, incorporate estimation uncertainty explicitly, and allow for misspecification of the forecasting model as well as ...

  9. The Ability of Career Maturity Indicators to Predict Interest Score Differentiation, Consistency, and Elevation.

    Science.gov (United States)

    Miner, Claire Usher; Osborne, W. Larry; Jaeger, Richard M.

    1997-01-01

    Uses regression analysis on career development measures to examine whether career maturity indicators are predictive of interest consistency, differentiation, and score elevation. Results indicate that interest consistency and score elevation were weakly predicted by the measure; no relationship existed between the attitudinal and cognitive…

  10. The Weighted Airman Promotion System: Standardizing Test Scores

    Science.gov (United States)

    2008-01-01

    [Figure residue removed: axis labels "Authorized" and "Top 3/E6 ratio, inventory"] ...System: Standardizing Test Scores AFHRL convened a panel to identify the relevant factors to consider, and then sit as a promotion board and rank...Costs If the Air Force decided to standardize test scores, there would be three basic types of costs: implementation costs, marketing costs, and

  11. Spinal appearance questionnaire: factor analysis, scoring, reliability, and validity testing.

    Science.gov (United States)

    Carreon, Leah Y; Sanders, James O; Polly, David W; Sucato, Daniel J; Parent, Stefan; Roy-Beaudry, Marjolaine; Hopkins, Jeffrey; McClung, Anna; Bratcher, Kelly R; Diamond, Beverly E

    2011-08-15

    Cross sectional. This study presents the factor analysis of the Spinal Appearance Questionnaire (SAQ) and its psychometric properties. Although the SAQ has been administered to a large sample of patients with adolescent idiopathic scoliosis (AIS) treated surgically, its psychometric properties have not been fully evaluated. This study presents the factor analysis and scoring of the SAQ and evaluates its psychometric properties. The SAQ and the Scoliosis Research Society-22 (SRS-22) were administered to AIS patients who were being observed, braced or scheduled for surgery. Standard demographic data and radiographic measures including Lenke type and curve magnitude were also collected. Of the 1802 patients, 83% were female; with a mean age of 14.8 years and mean initial Cobb angle of 55.8° (range, 0°-123°). From the 32 items of the SAQ, 15 loaded on two factors with consistent and significant correlations across all Lenke types. There is an Appearance (items 1-10) and an Expectations factor (items 12-15). Responses are summed giving a range of 5 to 50 for the Appearance domain and 5 to 20 for the Expectations domain. The Cronbach's α was 0.88 for both domains and Total score with a test-retest reliability of 0.81 for Appearance and 0.91 for Expectations. Correlations with major curve magnitude were higher for the SAQ Appearance and SAQ Total scores compared to correlations between the SRS Appearance and SRS Total scores. The SAQ and SRS-22 Scores were statistically significantly different in patients who were scheduled for surgery compared to those who were observed or braced. The SAQ is a valid measure of self-image in patients with AIS with greater correlation to curve magnitude than SRS Appearance and Total score. It also discriminates between patients who require surgery from those who do not.

  12. Allele-sharing models: LOD scores and accurate linkage tests.

    Science.gov (United States)

    Kong, A; Cox, N J

    1997-11-01

    Starting with a test statistic for linkage analysis based on allele sharing, we propose an associated one-parameter model. Under general missing-data patterns, this model allows exact calculation of likelihood ratios and LOD scores and has been implemented by a simple modification of existing software. Most important, accurate linkage tests can be performed. Using an example, we show that some previously suggested approaches to handling less than perfectly informative data can be unacceptably conservative. Situations in which this model may not perform well are discussed, and an alternative model that requires additional computations is suggested.

  13. Parent Ratings of Impulsivity and Inhibition Predict State Testing Scores

    Directory of Open Access Journals (Sweden)

    Rebecca A. Lundwall

    2018-03-01

    One principle of cognitive development is that earlier intervention for educational difficulties tends to improve outcomes such as future educational and career success. One possible way to help students who struggle is to determine if they process information differently. Such determination might lead to clues for interventions. For example, early information processing requires attention before the information can be identified, encoded, and stored. The aim of the present study was to investigate whether parent ratings of inattention, inhibition, and impulsivity, and whether error rate on a reflexive attention task could be used to predict child scores on state standardized tests. Finding such an association could provide assistance to educators in identifying academically struggling children who might require targeted educational interventions. Children (N = 203) were invited to complete a peripheral cueing task (which measures the automatic reorienting of the brain’s attentional resources from one location to another). While the children completed the task, their parents completed a questionnaire. The questionnaire gathered information on broad indicators of child functioning, including observable behaviors of impulsivity, inattention, and inhibition, as well as state academic scores (which the parent retrieved online from their school). We used sequential regression to analyze contributions of error rate and parent-rated behaviors in predicting six academic scores. In one of the six analyses (for science), we found that the improvement was significant from the simplified model (with only family income, child age, and sex as predictors) to the full model (adding error rate and three parent-rated behaviors). Two additional analyses (reading and social studies) showed near significant improvement from simplified to full models. Parent-rated behaviors were significant predictors in all three of these analyses. In the reading score analysis

  14. Development of the Spatial Ability Test for Middle School Students

    Science.gov (United States)

    Yildiz, Sevda Göktepe; Özdemir, Ahmet Sükrü

    2017-01-01

    The purpose of this study was to develop a test to determine spatial ability of middle school students. The participants were 704 middle school students (6th, 7th and 8th grade) who were studying at different schools from Istanbul. Item analysis, exploratory and confirmatory factor analysis, reliability analysis were used to analyse the data.…

  15. Testing the "Work Ability House" Model in hospital workers.

    Science.gov (United States)

    Martinez, Maria Carmen; Latorre, Maria do Rosário Dias de Oliveira; Fischer, Frida Marina

    2016-01-01

    To test the Work Ability House model, verifying the hierarchy of proposed dimensions, among a group of hospital workers. A cohort study (2009-2011) was conducted with a sample of 599 workers from a hospital in the city of São Paulo. A questionnaire including sociodemographics, lifestyle and working conditions was used. The Brazilian versions of Job Stress Scale, Effort-Reward Imbalance, Work-Related Activities That May Contribute To Job-Related Pain and/or Injury, and the Work Ability Index (WAI) were also used. A hierarchical logistic regression analysis was performed: the independent variables were allocated into levels according to the dimensions of the theoretical model in order to evaluate the factors associated with work ability. Variables associated with impairment of work ability in each dimension were as follows: (a) sociodemographics: age work injuries (p = 0.029), (c) professional competence: low educational level (p = 0.008), (d) values : intensified in overcommitment (p work: intensification of effort-reward imbalance (p = 0.009) and high demands (p = 0.040). The results confirmed the dimensions proposed for the Work Ability House model, indicating that it is valid as a representation of a multidimensional construct of multifactorial determination and can be used in the management of work ability.

  16. Validity of GRE General Test scores and TOEFL scores for graduate admission to a technical university in Western Europe

    Science.gov (United States)

    Zimmermann, Judith; von Davier, Alina A.; Buhmann, Joachim M.; Heinimann, Hans R.

    2018-01-01

    Graduate admission has become a critical process in tertiary education, whereby selecting valid admissions instruments is key. This study assessed the validity of Graduate Record Examination (GRE) General Test scores for admission to Master's programmes at a technical university in Europe. We investigated the indicative value of GRE scores for the Master's programme grade point average (GGPA) with and without the addition of the undergraduate GPA (UGPA) and the TOEFL score, and of GRE scores for study completion and Master's thesis performance. GRE scores explained 20% of the variation in the GGPA, while an additional 7% was explained by the TOEFL score and 3% by the UGPA. Contrary to common belief, the GRE quantitative reasoning score showed little explanatory power. GRE scores were also weakly related to study progress but not to thesis performance. Nevertheless, GRE and TOEFL scores were found to be sensible admissions instruments. Rigorous methodology was used to obtain highly reliable results.

  17. The Predictive Ability of IQ and Working Memory Scores in Literacy in an Adult Population

    Science.gov (United States)

    Alloway, Tracy Packiam; Gregory, David

    2013-01-01

    Literacy problems are highly prevalent and can persist into adulthood. Yet, the majority of research on the predictive nature of cognitive skills to literacy has primarily focused on development and adolescent populations. The aim of the present study was to extend existing research to investigate the roles of IQ scores and Working Memory…

  18. Computerized scoring algorithms for the Autobiographical Memory Test.

    Science.gov (United States)

    Takano, Keisuke; Gutenbrunner, Charlotte; Martens, Kris; Salmon, Karen; Raes, Filip

    2018-02-01

    Reduced specificity of autobiographical memories is a hallmark of depressive cognition. Autobiographical memory (AM) specificity is typically measured by the Autobiographical Memory Test (AMT), in which respondents are asked to describe personal memories in response to emotional cue words. Due to this free descriptive responding format, the AMT relies on experts' hand scoring for subsequent statistical analyses. This manual coding potentially impedes research activities in big data analytics such as large epidemiological studies. Here, we propose computerized algorithms to automatically score AM specificity for the Dutch (adult participants) and English (youth participants) versions of the AMT by using natural language processing and machine learning techniques. The algorithms showed reliable performances in discriminating specific and nonspecific (e.g., overgeneralized) autobiographical memories in independent testing data sets (area under the receiver operating characteristic curve > .90). Furthermore, outcome values of the algorithms (i.e., decision values of support vector machines) showed a gradient across similar (e.g., specific and extended memories) and different (e.g., specific memory and semantic associates) categories of AMT responses, suggesting that, for both adults and youth, the algorithms well capture the extent to which a memory has features of specific memories. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  19. ANOVA Analysis of Student Daily Test Scores in Multi-Day Test Periods

    Science.gov (United States)

    Mouritsen, Matthew L.; Davis, Jefferson T.; Jones, Steven C.

    2016-01-01

    Instructors are often concerned when giving multiple-day tests because students taking the test later in the exam period may have an advantage over students taking the test early in the exam period due to information leakage. However, exam scores seemed to decline as students took the same test later in a multi-day exam period (Mouritsen and…

  20. The Performance of the Upper Limb scores correlate with pulmonary function test measures and Egen Klassifikation scores in Duchenne muscular dystrophy.

    Science.gov (United States)

    Lee, Ha Neul; Sawnani, Hemant; Horn, Paul S; Rybalsky, Irina; Relucio, Lani; Wong, Brenda L

    2016-01-01

    The Performance of the Upper Limb scale was developed as an outcome measure specifically for ambulant and non-ambulant patients with Duchenne muscular dystrophy and is implemented in clinical trials needing longitudinal data. The aim of this study is to determine whether this novel tool correlates with functional ability using pulmonary function test, cardiac function test and Egen Klassifikation scale scores as clinical measures. In this cross-sectional study, 43 non-ambulatory Duchenne males from ages 10 to 30 years and on long-term glucocorticoid treatment were enrolled. Cardiac and pulmonary function test results were analyzed to assess cardiopulmonary function, and Egen Klassifikation scores were analyzed to assess functional ability. The Performance of the Upper Limb scores correlated with pulmonary function measures and had inverse correlation with Egen Klassifikation scores. There was no correlation with left ventricular ejection fraction and left ventricular dysfunction. Body mass index and decreased joint range of motion affected total Performance of the Upper Limb scores and should be considered in clinical trial designs. Copyright © 2016 Elsevier B.V. All rights reserved.

  1. Your move: The effect of chess on mathematics test scores.

    Science.gov (United States)

    Rosholm, Michael; Mikkelsen, Mai Bjørnskov; Gumede, Kamilla

    2017-01-01

    We analyse the effect of substituting a weekly mathematics lesson in primary school grades 1-3 with a lesson in mathematics based on chess instruction. We use data from the City of Aarhus in Denmark, combining test score data with a comprehensive data set obtained from administrative registers. We use two different methodological approaches to identify and estimate treatment effects and we tend to find positive effects, indicating that knowledge acquired through chess play can be transferred to the domain of mathematics. We also find larger impacts for unhappy children and children who are bored in school, perhaps because chess instruction facilitates learning by providing an alternative approach to mathematics for these children. The results are encouraging and suggest that chess may be an important and effective tool for improving mathematical capacity in young students.

  2. Your move: The effect of chess on mathematics test scores

    DEFF Research Database (Denmark)

    Rosholm, Michael; Mikkelsen, Mai Bjørnskov; Gumede, Kamilla Trille

    2017-01-01

    We analyse the effect of substituting a weekly mathematics lesson in primary school grades 1–3 with a lesson in mathematics based on chess instruction. We use data from the City of Aarhus in Denmark, combining test score data with a comprehensive data set obtained from administrative registers. We use two different methodological approaches to identify and estimate treatment effects and we tend to find positive effects, indicating that knowledge acquired through chess play can be transferred to the domain of mathematics. We also find larger impacts for unhappy children and children who are bored in school, perhaps because chess instruction facilitates learning by providing an alternative approach to mathematics for these children. The results are encouraging and suggest that chess may be an important and effective tool for improving mathematical capacity in young students.

  3. Your move: The effect of chess on mathematics test scores.

    Directory of Open Access Journals (Sweden)

    Michael Rosholm

    We analyse the effect of substituting a weekly mathematics lesson in primary school grades 1-3 with a lesson in mathematics based on chess instruction. We use data from the City of Aarhus in Denmark, combining test score data with a comprehensive data set obtained from administrative registers. We use two different methodological approaches to identify and estimate treatment effects and we tend to find positive effects, indicating that knowledge acquired through chess play can be transferred to the domain of mathematics. We also find larger impacts for unhappy children and children who are bored in school, perhaps because chess instruction facilitates learning by providing an alternative approach to mathematics for these children. The results are encouraging and suggest that chess may be an important and effective tool for improving mathematical capacity in young students.

  4. Specific algorithm method of scoring the Clock Drawing Test applied in cognitively normal elderly

    Directory of Open Access Journals (Sweden)

    Liana Chaves Mendes-Santos

    The Clock Drawing Test (CDT) is an inexpensive, fast and easily administered measure of cognitive function, especially in the elderly. This instrument is a popular clinical tool widely used in screening for cognitive disorders and dementia. The CDT can be applied in different ways and scoring procedures also vary. OBJECTIVE: The aims of this study were to analyze the performance of elderly on the CDT and evaluate inter-rater reliability of the CDT scored by using a specific algorithm method adapted from Sunderland et al. (1989). METHODS: We analyzed the CDT of 100 cognitively normal elderly aged 60 years or older. The CDT ("free-drawn") and Mini-Mental State Examination (MMSE) were administered to all participants. Six independent examiners scored the CDT of 30 participants to evaluate inter-rater reliability. RESULTS AND CONCLUSION: A score of 5 on the proposed algorithm ("Numbers in reverse order or concentrated"), equivalent to 5 points on the original Sunderland scale, was the most frequent (53.5%). The CDT specific algorithm method used had high inter-rater reliability (p<0.01), and mean score ranged from 5.06 to 5.96. The high frequency of an overall score of 5 points may suggest the need to create more nuanced evaluation criteria, which are sensitive to differences in levels of impairment in visuoconstructive and executive abilities during aging.

  5. Association between the gait pattern characteristics of older people and their two-step test scores.

    Science.gov (United States)

    Kobayashi, Yoshiyuki; Ogata, Toru

    2018-04-27

    The Two-Step test is one of three official tests authorized by the Japanese Orthopedic Association to evaluate the risk of locomotive syndrome (a condition of reduced mobility caused by an impairment of the locomotive organs). It has been reported that the Two-Step test score has a good correlation with one's walking ability; however, its association with the gait pattern of older people during normal walking is still unknown. Therefore, this study aims to clarify the associations between the gait patterns of older people observed during normal walking and their Two-Step test scores. We analyzed the whole waveforms obtained from the lower-extremity joint angles and joint moments of 26 older people in various stages of locomotive syndrome using principal component analysis (PCA). The PCA was conducted using a 260 × 2424 input matrix constructed from the participants' time-normalized pelvic and right-lower-limb-joint angles along three axes (ten trials of 26 participants, 101 time points, 4 angles, 3 axes, and 2 variable types per trial). The Pearson product-moment correlation coefficient between the scores of the principal component vectors (PCVs) and the scores of the Two-Step test revealed that only one PCV (PCV 2) among the 61 obtained relevant PCVs is significantly related to the score of the Two-Step test. We therefore concluded that the joint angles and joint moments related to PCV 2 (ankle plantar-flexion, ankle plantar-flexor moments during the late stance phase, and ranges of motion and moments on the hip, knee, and ankle joints in the sagittal plane during the entire stance phase) are the motions associated with the Two-Step test.
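    The 260 × 2424 matrix size follows directly from the counts given in the abstract. The short sketch below only checks that arithmetic and shows one possible way to flatten a trial into a row. The column ordering in `column_index` is an assumption made for illustration, since the abstract does not say how the columns were arranged.

```python
# Counts taken from the abstract.
n_participants = 26
trials_per_participant = 10
time_points = 101
angles = 4          # pelvis plus three right-lower-limb joints
axes = 3
variable_types = 2  # joint angle and joint moment

rows = n_participants * trials_per_participant            # one row per trial
cols = time_points * angles * axes * variable_types       # one column per sample


def column_index(t, angle, axis, var_type):
    """Hypothetical row-major flattening of one trial into a single row.

    The ordering (time, angle, axis, variable type) is an assumption;
    any fixed ordering yields the same number of columns.
    """
    return ((t * angles + angle) * axes + axis) * variable_types + var_type


print(rows, cols)  # matrix shape implied by the abstract
```

    Each trial therefore contributes one row of 2424 values, and the ten trials of each of the 26 participants give the 260 rows.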

  6. External validation of the ability of the DRAGON score to predict outcome after thrombolysis treatment

    DEFF Research Database (Denmark)

    Ovesen, Christian Aavang; Christensen, Anders; Nielsen, J K

    2013-01-01

    Easy-to-perform and valid assessment scales for the effect of thrombolysis are essential in hyperacute stroke settings. Because of this we performed an external validation of the DRAGON scale proposed by Strbian et al. in a Danish cohort. All patients treated with intravenous recombinant plasmino... and their modified Rankin Scale (mRS) was assessed after 3 months. Three hundred and three patients were included in the analysis. The DRAGON scale proved to have a good discriminative ability for predicting highly unfavourable outcome (mRS 5-6) (area under the curve-receiver operating characteristic [AUC-ROC]: 0...

  7. Score Gains on g-loaded Tests: No g

    NARCIS (Netherlands)

    te Nijenhuis, J.; van Vianen, A.E.M.; van der Flier, H.

    2007-01-01

    IQ scores provide the best general predictor of success in education, job training, and work. However, there are many ways in which IQ scores can be increased, for instance by means of retesting or participation in learning potential training programs. What is the nature of these score gains? Jensen

  8. Validity of GRE General Test Scores and TOEFL Scores for Graduate Admission to a Technical University in Western Europe

    Science.gov (United States)

    Zimmermann, Judith; von Davier, Alina A.; Buhmann, Joachim M.; Heinimann, Hans R.

    2018-01-01

    Graduate admission has become a critical process in tertiary education, whereby selecting valid admissions instruments is key. This study assessed the validity of Graduate Record Examination (GRE) General Test scores for admission to Master's programmes at a technical university in Europe. We investigated the indicative value of GRE scores for the…

  9. External validation of the ability of the DRAGON score to predict outcome after thrombolysis treatment.

    Science.gov (United States)

    Ovesen, C; Christensen, A; Nielsen, J K; Christensen, H

    2013-11-01

    Easy-to-perform and valid assessment scales for the effect of thrombolysis are essential in hyperacute stroke settings. Because of this we performed an external validation of the DRAGON scale proposed by Strbian et al. in a Danish cohort. All patients treated with intravenous recombinant plasminogen activator between 2009 and 2011 were included. Upon admission all patients underwent physical and neurological examination using the National Institutes of Health Stroke Scale along with non-contrast CT scans and CT angiography. Patients were followed up through the Outpatient Clinic and their modified Rankin Scale (mRS) was assessed after 3 months. Three hundred and three patients were included in the analysis. The DRAGON scale proved to have a good discriminative ability for predicting highly unfavourable outcome (mRS 5-6) (area under the curve-receiver operating characteristic [AUC-ROC]: 0.89; 95% confidence interval [CI] 0.81-0.96; pDRAGON scale provided good discriminative capability (AUC-ROC: 0.89; 95% CI 0.78-1.0; p=0.003) for highly unfavourable outcome. We confirmed the validity of the DRAGON scale in predicting outcome after thrombolysis treatment. Copyright © 2013 Elsevier Ltd. All rights reserved.
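    The AUC-ROC values quoted above can be read as the probability that a randomly chosen patient with the outcome has a higher score than a randomly chosen patient without it, with ties counted as one half. A minimal sketch of that pairwise definition follows. The function name and the point totals are hypothetical and are not data from the Danish cohort.

```python
def auc_roc(scores_with_outcome, scores_without_outcome):
    """AUC computed as the fraction of (with, without) pairs ranked correctly.

    A pair counts 1 if the patient with the outcome has the higher score,
    0.5 if the two scores tie, and 0 otherwise.
    """
    wins = 0.0
    for s_pos in scores_with_outcome:
        for s_neg in scores_without_outcome:
            if s_pos > s_neg:
                wins += 1.0
            elif s_pos == s_neg:
                wins += 0.5
    return wins / (len(scores_with_outcome) * len(scores_without_outcome))


# Hypothetical score totals, for illustration only.
unfavourable = [7, 8, 6, 9]
others = [2, 3, 6, 4, 5]
print(auc_roc(unfavourable, others))
```

    A value of 0.5 corresponds to a score that ranks patients no better than chance, and 1.0 to perfect separation of the two groups.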

  10. Characterizing Spatial Ability: Different Mental Processes Reflected in Accuracy and Latency Scores.

    Science.gov (United States)

    1978-08-01

    Princeton, New Jersey: Educational Testing Service, March, 1969. 6. Guilford, J. P., The Nature of Human Intelligence. New York: McGraw-Hill, 1969. 7...of visual-figural systems (CFS-V), cognition of figural transformations (CFT), and cognition of kinesthetic-figural systems (CFS-K), represents...position and direction that has occurred from the top to the bottom drawing of a motorboat heading toward a coastline. The time limit on the 60-item GZO is

  11. The Effect of Mock Tests on Iranian EFL learners’ Test Scores

    OpenAIRE

    Hossein Khodabakhshzadeh; Reza Zardkanloo

    2016-01-01

    The effect of using tests in test preparation courses has been subject to debate. While some scholars such as Yang and Badger (2015) believe it is a cause of positive washback, others argue that this issue is tentative and context-bound (Green, 2007). Therefore, this study investigated the effect of using mock tests in International English Language Testing System (IELTS) preparation courses on students’ overall IELTS scores. Fifty-one IELTS students were selected non-randomly through ...

  12. Test/score/report: Simulation techniques for automating the test process

    Science.gov (United States)

    Hageman, Barbara H.; Sigman, Clayton B.; Koslosky, John T.

    1994-01-01

    A Test/Score/Report capability is currently being developed for the Transportable Payload Operations Control Center (TPOCC) Advanced Spacecraft Simulator (TASS) system which will automate testing of the Goddard Space Flight Center (GSFC) Payload Operations Control Center (POCC) and Mission Operations Center (MOC) software in three areas: telemetry decommutation, spacecraft command processing, and spacecraft memory load and dump processing. Automated computer control of the acceptance test process is one of the primary goals of a test team. With the proper simulation tools and user interface, the task of acceptance testing, regression testing, and repeatability of specific test procedures of a ground data system can be a simpler task. Ideally, the goal for complete automation would be to plug the operational deliverable into the simulator, press the start button, execute the test procedure, accumulate and analyze the data, score the results, and report the results to the test team along with a go/no recommendation to the test team. In practice, this may not be possible because of inadequate test tools, pressures of schedules, limited resources, etc. Most tests are accomplished using a certain degree of automation and test procedures that are labor intensive. This paper discusses some simulation techniques that can improve the automation of the test process. The TASS system tests the POCC/MOC software and provides a score based on the test results. The TASS system displays statistics on the success of the POCC/MOC system processing in each of the three areas as well as event messages pertaining to the Test/Score/Report processing. The TASS system also provides formatted reports documenting each step performed during the tests and the results of each step. A prototype of the Test/Score/Report capability is available and currently being used to test some POCC/MOC software deliveries. When this capability is fully operational it should greatly reduce the time necessary

  13. Predicting Bobsled Pushing Ability from Various Combine Testing Events.

    Science.gov (United States)

    Tomasevicz, Curtis L; Ransone, Jack W; Bach, Christopher W

    2018-03-12

    The requisite combination of speed, power, and strength necessary for a bobsled push athlete coupled with the difficulty in directly measuring pushing ability makes selecting effective push crews challenging. Current practices by USA Bobsled and Skeleton (USABS) utilize field combine testing to assess and identify specifically selected performance variables in an attempt to best predict push performance abilities. Combine data consisting of 11 physical performance variables were collected from 75 subjects across two winter Olympic qualification years (2009 and 2013). These variables were sprints of 15-, 30-, and 60 m, a flying 30 m sprint, a standing broad jump, a shot toss, squat, power clean, body mass, and dry-land brake and side bobsled pushes. Discriminant Analysis (DA) in addition to Principal Component Analysis (PCA) was used to investigate two cases (Case 1: Olympians vs. non-Olympians; Case 2: National Team vs. non-National Team). Using these 11 variables, DA led to a classification rule that proved capable of identifying Olympians from non-Olympians and National Team members from non-National Team members with 9.33% and 14.67% misclassification rates, respectively. The PCA was used to find similar test variables within the combine that provided redundant or useless data. After eliminating the unnecessary variables, DA on the new combinations showed that 8 (Case 1) and 20 (Case 2) other combinations with fewer performance variables yielded misclassification rates as low as 6.67% and 13.33% respectively. Utilizing fewer performance variables can allow governing bodies in many other sports to create more appropriate combine testing that maximizes accuracy, while minimizing irrelevant and redundant strategies.
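    The misclassification rates become easier to interpret as counts of athletes. The sketch below converts each reported percentage into a number of subjects, assuming every rate was computed over all 75 subjects (the abstract does not state the denominator for each case, so this is an assumption). The helper names are hypothetical.

```python
N_SUBJECTS = 75  # total subjects reported in the abstract


def misclassified_count(rate_percent, n=N_SUBJECTS):
    """Nearest whole number of subjects implied by a misclassification rate."""
    return round(rate_percent / 100 * n)


def rate_from_count(count, n=N_SUBJECTS):
    """Misclassification rate in percent, rounded to two decimals."""
    return round(100 * count / n, 2)


reported = {
    "Case 1, all 11 variables": 9.33,
    "Case 2, all 11 variables": 14.67,
    "Case 1, best reduced set": 6.67,
    "Case 2, best reduced set": 13.33,
}

for label, rate in reported.items():
    count = misclassified_count(rate)
    # Round-trip check: the count should reproduce the reported percentage.
    print(f"{label}: {count} of {N_SUBJECTS} ({rate_from_count(count)}%)")
```

    Under that assumption, the reduced variable sets change the outcome by only a couple of athletes in Case 1 and by one athlete in Case 2.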

  14. The Effects of Video Game Experience on Computer-Based Air Traffic Controller Specialist, Air Traffic Scenario Test Scores.

    Science.gov (United States)

    1997-02-01

    application with a strong resemblance to a video game, concern has been raised that prior video game experience might have a moderating effect on scores. Much...such as spatial ability. The effects of computer or video game experience on work sample scores have not been systematically investigated. The purpose...of this study was to evaluate the incremental validity of prior video game experience over that of general aptitude as a predictor of work sample test

  15. Reporting Diagnostic Scores in Educational Testing: Temptations, Pitfalls, and Some Solutions

    Science.gov (United States)

    Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J.

    2010-01-01

    Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…

  16. The Apgar score has survived the test of time.

    Science.gov (United States)

    Finster, Mieczyslaw; Wood, Margaret

    2005-04-01

    In 1953, Virginia Apgar, M.D. published her proposal for a new method of evaluation of the newborn infant. The avowed purpose of this paper was to establish a simple and clear classification of newborn infants which can be used to compare the results of obstetric practices, types of maternal pain relief and the results of resuscitation. Having considered several objective signs pertaining to the condition of the infant at birth she selected five that could be evaluated and taught to the delivery room personnel without difficulty. These signs were heart rate, respiratory effort, reflex irritability, muscle tone and color. Sixty seconds after the complete birth of the baby a rating of zero, one or two was given to each sign, depending on whether it was absent or present. Virginia Apgar reviewed anesthesia records of 1025 infants born alive at Columbia Presbyterian Medical Center during the period of this report. All had been rated by her method. Infants in poor condition scored 0-2, infants in fair condition scored 3-7, while scores 8-10 were achieved by infants in good condition. The most favorable score 1 min after birth was obtained by infants delivered vaginally with the occiput the presenting part (average 8.4). Newborns delivered by version and breech extraction had the lowest score (average 6.3). Infants delivered by cesarean section were more vigorous (average score 8.0) when spinal was the method of anesthesia versus an average score of 5.0 when general anesthesia was used. Correlating the 60 s score with neonatal mortality, Virginia found that mature infants receiving 0, 1 or 2 scores had a neonatal death rate of 14%; those scoring 3, 4, 5, 6 or 7 had a death rate of 1.1%; and those in the 8-10 score group had a death rate of 0.13%. She concluded that the prognosis of an infant is excellent if he receives one of the upper three scores, and poor if one of the lowest three scores.
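    The scoring rule described above (five signs, each rated 0, 1 or 2, summed to a total of 0 to 10 and grouped into poor, fair and good condition) can be sketched in a few lines. The function and key names are hypothetical. The grouping thresholds are the ones stated in the abstract.

```python
SIGNS = (
    "heart_rate",
    "respiratory_effort",
    "reflex_irritability",
    "muscle_tone",
    "color",
)


def apgar_total(ratings):
    """Sum the five sign ratings, each of which must be 0, 1 or 2."""
    for sign in SIGNS:
        if ratings[sign] not in (0, 1, 2):
            raise ValueError(f"{sign} must be rated 0, 1 or 2")
    return sum(ratings[sign] for sign in SIGNS)


def condition(total):
    """Group a total score as described in the abstract."""
    if not 0 <= total <= 10:
        raise ValueError("total must be between 0 and 10")
    if total <= 2:
        return "poor"
    if total <= 7:
        return "fair"
    return "good"


# Hypothetical ratings for one infant at 60 seconds.
example = {
    "heart_rate": 2,
    "respiratory_effort": 2,
    "reflex_irritability": 1,
    "muscle_tone": 2,
    "color": 1,
}
total = apgar_total(example)
print(total, condition(total))
```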

  17. The Ability of the Acute Physiology and Chronic Health Evaluation (APACHE) IV Score to Predict Mortality in a Single Tertiary Hospital

    Directory of Open Access Journals (Sweden)

    Jae Woo Choi

    2017-08-01

    Background: The Acute Physiology and Chronic Health Evaluation (APACHE) II model has been widely used in Korea. However, there have been few studies on the APACHE IV model in Korean intensive care units (ICUs). The aim of this study was to compare the ability of APACHE IV and APACHE II in predicting hospital mortality, and to investigate the ability of APACHE IV as a critical care triage criterion. Methods: The study was designed as a prospective cohort study. Measurements of discrimination and calibration were performed using the area under the receiver operating characteristic curve (AUROC) and the Hosmer-Lemeshow goodness-of-fit test, respectively. We also calculated the standardized mortality ratio (SMR). Results: The APACHE IV score, the Charlson Comorbidity index (CCI) score, acute respiratory distress syndrome, and unplanned ICU admissions were independently associated with hospital mortality. The calibration, discrimination, and SMR of APACHE IV were good (H = 7.67, P = 0.465; C = 3.42, P = 0.905; AUROC = 0.759; SMR = 1.00). However, the explanatory power of an APACHE IV score >93 alone on hospital mortality was low at 44.1%. The explanatory power was increased to 53.8% when the hospital mortality was predicted using a model that considers APACHE IV >93 scores, medical admission, and risk factors for CCI >3 coincidentally. However, the discriminative ability of the prediction model was unsatisfactory (C index <0.70). Conclusions: The APACHE IV presented good discrimination, calibration, and SMR for hospital mortality.

  18. Prepharmacy predictors of success in pharmacy school: grade point averages, pharmacy college admissions test, communication abilities, and critical thinking skills.

    Science.gov (United States)

    Allen, D D; Bond, C A

    2001-07-01

    Good admissions decisions are essential for identifying successful students and good practitioners. Various parameters have been shown to have predictive power for academic success. Previous academic performance, the Pharmacy College Admissions Test (PCAT), and specific prepharmacy courses have been suggested as academic performance indicators. However, critical thinking abilities have not been evaluated. We evaluated the connection between academic success and each of the following predictive parameters: the California Critical Thinking Skills Test (CCTST) score, PCAT score, interview score, overall academic performance prior to admission at a pharmacy school, and performance in specific prepharmacy courses. We confirmed previous reports but demonstrated intriguing results in predicting practice-based skills. Critical thinking skills predict practice-based course success. Also, the CCTST and PCAT scores (Pearson correlation [pc] = 0.448, p critical thinking skills in pharmacy practice courses and clerkships. Further study is needed to confirm this finding and determine which PCAT components predict critical thinking abilities.

  19. The Impact of Time-Series Diagnostic Tests on the Writing Ability of Iranian EFL learners

    Directory of Open Access Journals (Sweden)

    Bahareh Molazem Atashgahi

    2014-02-01

    This study aimed to show whether administering a battery of time-series diagnostic tests (screening) has any impact on Iranian EFL learners’ writing ability. The study was conducted on intermediate EFL learners at Islamic Azad University, North Tehran Branch. The researcher administered a homogenizing test in order to exclude exceptional scores; among all the test-takers, only those whose scores were within about one standard deviation above or below the mean were selected as participants. After the participants were assigned to the control and experimental groups (30 students in each group), they were asked to write five-paragraph essays on two topics. This pretest was given to both groups to test their initial writing ability. Once scoring of the students’ writing (the five-paragraph essays) was finished, the means of the two groups were calculated and compared through t-test analysis. The result demonstrated that there was no statistically significant difference between the two groups regarding the variable under investigation. Four sets of diagnostic tests were given to the experimental group every two weeks, and after each test both the result of the exam and suitable feedback regarding students’ errors were given to them by the teacher, while the Current-Traditional Rhetoric method was used in the control group. In the posttest, which was run after giving the treatment and placebo to the experimental group and control group respectively, students took another writing test with the same characteristics in administration, topics, and scoring as the pretest. Thereafter, the significance of the difference between the obtained means of the experimental and control groups in the posttest was determined through the t-test. The result of the t-test analysis indicated a significant difference between the two groups, which consequently rejected the null hypothesis of the study. Therefore, any

  20. Effects of white noise on Callsign Acquisition Test and Modified Rhyme Test scores.

    Science.gov (United States)

    Blue-Terry, Misty; Letowski, Tomasz

    2011-02-01

    The Callsign Acquisition Test (CAT) is a speech intelligibility test developed by the US Army Research Laboratory. The test has been used to evaluate speech transmission through various communication systems but has not yet been sufficiently standardised and validated. The aim of this study was to compare CAT and Modified Rhyme Test (MRT) performance in the presence of white noise across a range of signal-to-noise ratios (SNRs). A group of 16 normal-hearing listeners participated in the study. The speech items were presented at 65 dB(A) in the background of white noise at SNRs of -18, -15, -12, -9 and -6 dB. The results showed a strong positive association (75.14%) between the two tests, but significant differences between the CAT and MRT absolute scores in the range of investigated SNRs. Based on the data, a function to predict CAT scores based on existing MRT scores and vice versa was formulated. STATEMENT OF RELEVANCE: This work compares performance data of a common speech intelligibility test (MRT) with a new test (CAT) in the presence of white noise. The results here can be used as a part of the standardisation procedures and provide insights into the predictive capabilities of the CAT to quantify speech intelligibility communication in high-noise military environments.

  1. Use of Multi-Response Format Test in the Assessment of Medical Students’ Critical Thinking Ability

    Science.gov (United States)

    Mafinejad, Mahboobeh Khabaz; Monajemi, Alireza; Jalili, Mohammad; Soltani, Akbar; Rasouli, Javad

    2017-01-01

    Introduction: To evaluate students’ critical thinking skills effectively, a change in assessment practices is a must. The assessment of a student’s ability to think critically is a constant challenge, and yet there is considerable debate on the best assessment method. There is evidence that the intrinsic nature of open and closed-ended response questions is to measure separate cognitive abilities. Aim: To assess the critical thinking ability of medical students using a multi-response format of assessment. Materials and Methods: A cross-sectional study was conducted on a group of 159 undergraduate third-year medical students. All the participants completed the California Critical Thinking Skills Test (CCTST) consisting of 34 multiple-choice questions to measure general critical thinking skills and a researcher-developed test that combines open and closed-ended questions. A researcher-developed 48-question exam, consisting of 8 short-answers and 5 essay questions, 19 Multiple-Choice Questions (MCQ), and 16 True-False (TF) questions, was used to measure critical thinking skills. Correlation analyses were performed using Pearson’s coefficient to explore the association between the total scores of tests and subtests. Results: One hundred and fifty-nine students participated in this study. The sample comprised 81 females (51%) and 78 males (49%) with an age range of 20±2.8 years (mean 21.2 years). The response rate was 64.1%. A significant positive correlation was found between types of questions and critical thinking scores, of which the correlations of MCQ (r=0.82) and essay questions (r=0.77) were strongest. The significant positive correlations between the multi-response format test and CCTST’s subscales were seen in analysis, evaluation, inference and inductive reasoning. Unlike CCTST subscales, the multi-response format test has a weak correlation with the CCTST total score (r=0.45, p=0.06). Conclusion: This study highlights the importance of considering multi-response format test in

  2. Use of Multi-Response Format Test in the Assessment of Medical Students' Critical Thinking Ability.

    Science.gov (United States)

    Mafinejad, Mahboobeh Khabaz; Arabshahi, Seyyed Kamran Soltani; Monajemi, Alireza; Jalili, Mohammad; Soltani, Akbar; Rasouli, Javad

    2017-09-01

    To evaluate students' critical thinking skills effectively, a change in assessment practices is a must. The assessment of a student's ability to think critically is a constant challenge, and yet there is considerable debate on the best assessment method. There is evidence that the intrinsic nature of open and closed-ended response questions is to measure separate cognitive abilities. To assess the critical thinking ability of medical students using a multi-response format of assessment. A cross-sectional study was conducted on a group of 159 undergraduate third-year medical students. All the participants completed the California Critical Thinking Skills Test (CCTST) consisting of 34 multiple-choice questions to measure general critical thinking skills and a researcher-developed test that combines open and closed-ended questions. A researcher-developed 48-question exam, consisting of 8 short-answers and 5 essay questions, 19 Multiple-Choice Questions (MCQ), and 16 True-False (TF) questions, was used to measure critical thinking skills. Correlation analyses were performed using Pearson's coefficient to explore the association between the total scores of tests and subtests. One hundred and fifty-nine students participated in this study. The sample comprised 81 females (51%) and 78 males (49%) with an age range of 20±2.8 years (mean 21.2 years). The response rate was 64.1%. A significant positive correlation was found between types of questions and critical thinking scores, of which the correlations of MCQ (r=0.82) and essay questions (r=0.77) were strongest. The significant positive correlations between the multi-response format test and CCTST's subscales were seen in analysis, evaluation, inference and inductive reasoning. Unlike CCTST subscales, the multi-response format test has a weak correlation with the CCTST total score (r=0.45, p=0.06). This study highlights the importance of considering multi-response format test in the assessment of critical thinking abilities of medical

  3. The Formalization of Fairness: Issues in Testing for Measurement Invariance Using Subtest Scores

    Science.gov (United States)

    Molenaar, Dylan; Borsboom, Denny

    2013-01-01

    Measurement invariance is an important prerequisite for the adequate comparison of group differences in test scores. In psychology, measurement invariance is typically investigated by means of linear factor analyses of subtest scores. These subtest scores typically result from summing the item scores. In this paper, we discuss 4 possible problems…

  4. Concurrent Validity of the Woodcock-Johnson Tests of Cognitive Ability with the WISC-R: EMR Children.

    Science.gov (United States)

    Cummings, Jack A.; Sanville, David

    1983-01-01

    Administered the Wechsler Intelligence Scale for Children-Revised (WISC-R) and the Woodcock-Johnson Tests of Cognitive Ability (WJTCA) to educable mentally retarded children (N=30). Results showed significant mean differences between WISC-R and WJTCA full-scale standard scores, providing implications for placement of children in classes for the…

  5. The Effect of Mock Tests on Iranian EFL learners’ Test Scores

    Directory of Open Access Journals (Sweden)

    Hossein Khodabakhshzadeh

    2016-07-01

    The effect of using tests in test preparation courses has been subject to debate. While some scholars such as Yang and Badger (2015) believe it is a cause of positive washback effect, others argue that this issue is tentative and context-bound (Green, 2007). Therefore, this study investigated the effect of using mock tests in International English Language Testing System (IELTS) preparation courses on students’ overall IELTS scores. Fifty-one IELTS students were selected non-randomly through the quota sampling approach out of 76 students at Mahan Language Institute in Birjand, Iran. These participants were distributed into Group 1 (n=25) and Group 2 (n=26). A complete IELTS test was administered to ensure that the groups were homogeneous and to serve as the pretest. After 10 sessions of intervention, a different IELTS test was administered as the posttest. The results of between-subjects analysis through an independent samples t-test revealed that using mock tests in IELTS preparation courses can positively affect the participants’ scores on the IELTS exam. Pedagogical implications are discussed.

  6. Work ability score and future work ability as predictors of register-based disability pension and long-term sickness absence: A three-year follow-up study.

    Science.gov (United States)

    Kinnunen, Ulla; Nätti, Jouko

    2018-05-01

    We investigated two single items of the Work Ability Index - work ability score, and future work ability - as predictors of register-based disability pension and long-term sickness absence over a three-year follow-up. Survey responses of 11,131 Finnish employees were linked to pension and long-term (more than 10 days) sickness absence register data by Statistics Finland. Work ability score was divided into poor (0-5), moderate (6-7) and good/excellent (8-10) and future work ability into poor (1-2) and good (3) work ability at baseline. Cox proportional hazard regressions were used in the analysis of disability pension, and a negative binomial model in the analysis of long-term sickness absence. The results were adjusted for several background, work- and health-related covariates. Compared with those with good/excellent work ability scores, the hazard ratios of disability pension after adjusting for all covariates were 9.84 (95% CI 6.68-14.49) for poor and 2.25 (95% CI 1.51-3.35) for moderate work ability score. For future work ability, the hazard ratio was 8.19 (95% CI 4.71-14.23) among those with poor future work ability. The incidence rate ratios of accumulated long-term sickness absence days were 3.08 (95% CI 2.19-4.32) and 1.59 (95% CI 1.32-1.92) for poor and moderate work ability scores, and 1.51 (95% CI 0.97-2.36) for poor future work ability. The single items of work ability score and future work ability predicted register-based disability pension equally well, but work ability score was a better predictor of register-based long-term sickness absence days than future work ability in a three-year follow-up. Both items seem to be of use especially when examining the risk of poor work ability for disability but also for long sick leave.
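
    The baseline grouping described in this abstract can be sketched as two small helper functions. This is only an illustration of the cut-offs stated above (0-5, 6-7, 8-10 for the work ability score; 1-2 and 3 for future work ability); the function names and error handling are assumptions, not the authors' code.

```python
def categorize_work_ability_score(score):
    """Map a 0-10 work ability score to the three groups named in the abstract."""
    if not 0 <= score <= 10:
        raise ValueError("work ability score must be between 0 and 10")
    if score <= 5:
        return "poor"
    if score <= 7:
        return "moderate"
    return "good/excellent"


def categorize_future_work_ability(rating):
    """Map a 1-3 future work ability rating to poor (1-2) or good (3)."""
    if rating not in (1, 2, 3):
        raise ValueError("future work ability rating must be 1, 2, or 3")
    return "good" if rating == 3 else "poor"
```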

  7. A Dynamic Speech Comprehension Test for Assessing Real-World Listening Ability.

    Science.gov (United States)

    Best, Virginia; Keidser, Gitte; Freeston, Katrina; Buchholz, Jörg M

    2016-07-01

    Many listeners with hearing loss report particular difficulties with multitalker communication situations, but these difficulties are not well predicted using current clinical and laboratory assessment tools. The overall aim of this work is to create new speech tests that capture key aspects of multitalker communication situations and ultimately provide better predictions of real-world communication abilities and the effect of hearing aids. A test of ongoing speech comprehension introduced previously was extended to include naturalistic conversations between multiple talkers as targets, and a reverberant background environment containing competing conversations. In this article, we describe the development of this test and present a validation study. Thirty listeners with normal hearing participated in this study. Speech comprehension was measured for one-, two-, and three-talker passages at three different signal-to-noise ratios (SNRs), and working memory ability was measured using the reading span test. Analyses were conducted to examine passage equivalence, learning effects, and test-retest reliability, and to characterize the effects of number of talkers and SNR. Although we observed differences in difficulty across passages, it was possible to group the passages into four equivalent sets. Using this grouping, we achieved good test-retest reliability and observed no significant learning effects. Comprehension performance was sensitive to the SNR but did not decrease as the number of talkers increased. Individual performance showed associations with age and reading span score. This new dynamic speech comprehension test appears to be valid and suitable for experimental purposes. Further work will explore its utility as a tool for predicting real-world communication ability and hearing aid benefit. American Academy of Audiology.

  8. Reading ability and print exposure: item response theory analysis of the author recognition test.

    Science.gov (United States)

    Moore, Mariah; Gordon, Peter C

    2015-12-01

    In the author recognition test (ART), participants are presented with a series of names and foils and are asked to indicate which ones they recognize as authors. The test is a strong predictor of reading skill, and this predictive ability is generally explained as occurring because author knowledge is likely acquired through reading or other forms of print exposure. In this large-scale study (1,012 college student participants), we used item response theory (IRT) to analyze item (author) characteristics in order to facilitate identification of the determinants of item difficulty, provide a basis for further test development, and optimize scoring of the ART. Factor analysis suggested a potential two-factor structure of the ART, differentiating between literary and popular authors. Effective and ineffective author names were identified so as to facilitate future revisions of the ART. Analyses showed that the ART is a highly significant predictor of the time spent encoding words, as measured using eyetracking during reading. The relationship between the ART and time spent reading provided a basis for implementing a higher penalty for selecting foils, rather than the standard method of ART scoring (names selected minus foils selected). The findings provide novel support for the view that the ART is a valid indicator of reading volume. Furthermore, they show that frequency data can be used to select items of appropriate difficulty, and that frequency data from corpora based on particular time periods and types of texts may allow adaptations of the test for different populations.
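
    The standard ART scoring rule quoted in this abstract (names selected minus foils selected) can be sketched as follows. The `foil_penalty` parameter is a hypothetical way to express a heavier penalty for foils; the abstract does not state the penalty value the authors used, and the placeholder item names below are invented for illustration.

```python
def art_score(selected, authors, foils, foil_penalty=1.0):
    """Score an author recognition test response.

    With foil_penalty=1.0 this is the standard rule: real author names
    selected minus foils selected. A larger foil_penalty is a hypothetical
    illustration of penalizing foil selections more heavily.
    """
    selected = set(selected)
    hits = len(selected & set(authors))
    false_alarms = len(selected & set(foils))
    return hits - foil_penalty * false_alarms


# Placeholder items, not real test content.
authors = {"author_a", "author_b", "author_c"}
foils = {"foil_x", "foil_y"}
response = {"author_a", "author_b", "foil_x"}

standard = art_score(response, authors, foils)
stricter = art_score(response, authors, foils, foil_penalty=2.0)
```

    Under these assumptions, the same response earns a lower score when foil selections are weighted more heavily, which is the direction of the adjustment the abstract describes.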

  9. Study Protocol on Intentional Distortion in Personality Assessment: Relationship with Test Format, Culture, and Cognitive Ability.

    Science.gov (United States)

    Van Geert, Eline; Orhon, Altan; Cioca, Iulia A; Mamede, Rui; Golušin, Slobodan; Hubená, Barbora; Morillo, Daniel

    2016-01-01

    Self-report personality questionnaires, traditionally offered in a graded-scale format, are widely used in high-stakes contexts such as job selection. However, job applicants may intentionally distort their answers when filling in these questionnaires, undermining the validity of the test results. Forced-choice questionnaires are allegedly more resistant to intentional distortion compared to graded-scale questionnaires, but they generate ipsative data. Ipsativity violates the assumptions of classical test theory, distorting the reliability and construct validity of the scales, and producing interdependencies among the scores. This limitation is overcome in the current study by using the recently developed Thurstonian item response theory model. As online testing in job selection contexts is increasing, the focus will be on the impact of intentional distortion on personality questionnaire data collected online. The present study intends to examine the effect of three different variables on intentional distortion: (a) test format (graded-scale versus forced-choice); (b) culture, as data will be collected in three countries differing in their attitudes toward intentional distortion (the United Kingdom, Serbia, and Turkey); and (c) cognitive ability, as a possible predictor of the ability to choose the more desirable responses. Furthermore, we aim to integrate the findings using a comprehensive model of intentional distortion. In the Anticipated Results section, three main aspects are considered: (a) the limitations of the manipulation, theoretical approach, and analyses employed; (b) practical implications for job selection and for personality assessment in a broader sense; and (c) suggestions for further research.

  10. Examining Method Effect of Synonym and Antonym Test in Verbal Abilities Measure

    Directory of Open Access Journals (Sweden)

    Wahyu Widhiarso

    2015-08-01

    Many researchers have assumed that different methods could be substituted to measure the same attributes in assessment. Various models have been developed to accommodate the amount of variance attributable to the methods, but the application of these models in empirical research is rare. The present study applied one of those models to examine whether method effects were present in synonym and antonym tests. Study participants were 3,469 applicants to graduate school. The instrument used was the Graduate Academic Potential Test (PAPS), which includes synonym and antonym questions to measure verbal abilities. Our analysis showed that measurement models using correlated trait–correlated methods minus one, CT-C(M–1), which separated trait and method effects into distinct latent constructs, yielded slightly better values for multiple goodness-of-fit indices than a one-factor model. However, for both the synonym and antonym items, the proportion of variance accounted for by the method is smaller than the trait variance. The correlation between factor scores of both methods is high (r = 0.994). These findings confirm that synonym and antonym tests represent the same attribute so that both tests cannot be treated as two unique methods for measuring verbal ability.

  11. Examining Method Effect of Synonym and Antonym Test in Verbal Abilities Measure.

    Science.gov (United States)

    Widhiarso, Wahyu; Haryanta

    2015-08-01

    Many researchers have assumed that different methods could be substituted to measure the same attributes in assessment. Various models have been developed to accommodate the amount of variance attributable to the methods, but the application of these models in empirical research is rare. The present study applied one of those models to examine whether method effects were present in synonym and antonym tests. Study participants were 3,469 applicants to graduate school. The instrument used was the Graduate Academic Potential Test (PAPS), which includes synonym and antonym questions to measure verbal abilities. Our analysis showed that measurement models using correlated trait-correlated methods minus one, CT-C(M-1), which separated trait and method effects into distinct latent constructs, yielded slightly better values for multiple goodness-of-fit indices than a one-factor model. However, for both the synonym and antonym items, the proportion of variance accounted for by the method is smaller than the trait variance. The correlation between factor scores of both methods is high (r = 0.994). These findings confirm that synonym and antonym tests represent the same attribute so that both tests cannot be treated as two unique methods for measuring verbal ability.

  12. Measuring College Students' Reading Comprehension Ability Using Cloze Tests

    Science.gov (United States)

    Williams, Rihana Shiri; Ari, Omer; Santamaria, Carmen Nicole

    2011-01-01

    Recent investigations challenge the construct validity of sustained silent reading tests. Performance of two groups of post-secondary students (e.g. struggling and non-struggling) on a sustained silent reading test and two types of cloze test (i.e. maze and open-ended) was compared in order to identify the test format that contributes greater…

  13. The Health Professions Admission Test (HPAT) score and leaving certificate results can independently predict academic performance in medical school: do we need both tests?

    LENUS (Irish Health Repository)

    Halpenny, D

    2010-11-01

    A recent study raised concerns regarding the ability of the health professions admission test (HPAT) Ireland to improve the selection process in Irish medical schools. We aimed to establish whether performance in a mock HPAT correlated with academic success in medicine. A modified HPAT examination and a questionnaire were administered to a group of doctors and medical students. There was a significant correlation between HPAT score and college results (r2: 0.314, P = 0.018, Spearman Rank) and between leaving cert score and college results (r2: 0.306, P = 0.049, Spearman Rank). There was no correlation between leaving cert points score and HPAT score. There was no difference in HPAT score across a number of other variables including gender, age and medical speciality. Our results suggest that both the HPAT Ireland and the leaving certificate examination could act as independent predictors of academic achievement in medicine.

  14. Reduce, Reuse, Recycle: The Longitudinal Value of Local Cut Scores Using State Test Data

    Science.gov (United States)

    Nelson, Peter M.; Van Norman, Ethan R.; VanDerHeyden, Amanda

    2017-01-01

    We used existing reading (n = 1,498) and math (n = 2,260) data to evaluate state test scores for screening middle school students. In Phase 1, state test data were used to create a research-derived cut score that was optimal for predicting state test performance the following year. In Phase 2, those cut scores were applied with future cohorts.…

  15. Predicting Student Success in a Major's Introductory Biology Course via Logistic Regression Analysis of Scientific Reasoning Ability and Mathematics Scores

    Science.gov (United States)

    Thompson, E. David; Bowling, Bethany V.; Markle, Ross E.

    2018-02-01

    Studies over the last 30 years have considered various factors related to student success in introductory biology courses. While much of the available literature suggests that the best predictors of success in a college course are prior college grade point average (GPA) and class attendance, faculty often require a valuable predictor of success in those courses wherein the majority of students are in the first semester and have no previous record of college GPA or attendance. In this study, we evaluated the efficacy of the ACT Mathematics subject exam and Lawson's Classroom Test of Scientific Reasoning in predicting success in a major's introductory biology course. A logistic regression was utilized to determine the effectiveness of a combination of scientific reasoning (SR) scores and ACT math (ACT-M) scores to predict student success. In summary, we found that the model—with both SR and ACT-M as significant predictors—could be an effective predictor of student success and thus could potentially be useful in practical decision making for the course, such as directing students to support services at an early point in the semester.

  16. The Effects of Listening to Music Just Before Reading Test on Students’ Test Score

    OpenAIRE

    MAHDAVI, Mojtaba

    2015-01-01

    Abstract. In this study the researcher examined the effect on reading comprehension of music played just before the test. Because the emotional consequences of music listening are evident in stress and anxiety removal, it was used as a tool to pacify the minds of the testees and boost their memory and the related cognitive processes. Experimental group did well with the mean score of) and control group (). This study confirmed that using multimedia devices such as music can not only i...

  17. Effects of Test Media on Different EFL Test-Takers in Writing Scores and in the Cognitive Writing Process

    Science.gov (United States)

    Zou, Xiao-Ling; Chen, Yan-Min

    2016-01-01

    The effects of computer and paper test media on EFL test-takers with different computer familiarity in writing scores and in the cognitive writing process have been comprehensively explored from the learners' aspect as well as on the basis of related theories and practice. The results indicate significant differences in test scores among the…

  18. Clinical implications of using the arm motor ability test in stroke rehabilitation.

    Science.gov (United States)

    O'Dell, Michael W; Kim, Grace; Finnen, Lisa Rivera; Polistena, Caitlin

    2011-05-01

    To identify all published studies using the Arm Motor Ability Test (AMAT), a standardized, laboratory-based measure for selected upper extremity activities of daily living (ADLs); and to summarize its current uses and provide recommendations for its future use. An Ovid online search was performed using the terms "Arm Motor Ability Test" and "AMAT." The reference lists of all articles obtained were reviewed for additional studies not appearing in the literature search. In addition, the original manual for the use and administration of the AMAT was reviewed. All studies examining the psychometric properties of the AMAT or using the AMAT as an outcome measure were identified. Articles simply mentioning the AMAT without providing data and case reports or abstracts (other than those addressing a specific aspect of the scale of interest) were excluded. Studies were reviewed by the primary author. No formal system of quality review was used. The AMAT has been used as an outcome measure in stroke rehabilitation research examining upper extremity robotics, functional electrical stimulation, and cortical stimulation. The most recent version contains 10 ADL tasks, each of which is composed of 1 to 3 subtasks. Of the 3 domains originally proposed, only the "functional ability" domain is routinely assessed. Psychometric studies have demonstrated good reliability and at least reasonable construct validity. The instrument's sensitivity to change over time is less well established, and no recommendation can be made regarding a minimal clinically important difference. We recommend that the 10-item version of the AMAT and assessment of only the functional ability domain be adopted as standard going forward. Further research should include examination of sensitivity over time, minimal clinically important change, reliability and validity in the mid and lower range of scores, and in neurologic diagnoses other than stroke. 
    Copyright © 2011 American Congress of Rehabilitation Medicine

  19. A physical function test for use in the intensive care unit: validity, responsiveness, and predictive utility of the physical function ICU test (scored).

    Science.gov (United States)

    Denehy, Linda; de Morton, Natalie A; Skinner, Elizabeth H; Edbrooke, Lara; Haines, Kimberley; Warrillow, Stephen; Berney, Sue

    2013-12-01

    Several tests have recently been developed to measure changes in patient strength and functional outcomes in the intensive care unit (ICU). The original Physical Function ICU Test (PFIT) demonstrates reliability and sensitivity. The aims of this study were to further develop the original PFIT, to derive an interval score (the PFIT-s), and to test the clinimetric properties of the PFIT-s. A nested cohort study was conducted. One hundred forty-four and 116 participants performed the PFIT at ICU admission and discharge, respectively. Original test components were modified using principal component analysis. Rasch analysis examined the unidimensionality of the PFIT, and an interval score was derived. Correlations tested validity, and multiple regression analyses investigated predictive ability. Responsiveness was assessed using the effect size index (ESI), and the minimal clinically important difference (MCID) was calculated. The shoulder lift component was removed. Unidimensionality of combined admission and discharge PFIT-s scores was confirmed. The PFIT-s displayed moderate convergent validity with the Timed "Up & Go" Test (r=-.60), the Six-Minute Walk Test (r=.41), and the Medical Research Council (MRC) sum score (rho=.49). The ESI of the PFIT-s was 0.82, and the MCID was 1.5 points (interval scale range=0-10). A higher admission PFIT-s score was predictive of: an MRC score of ≥48, increased likelihood of discharge home, reduced likelihood of discharge to inpatient rehabilitation, and reduced acute care hospital length of stay. Scoring of sit-to-stand assistance required is subjective, and cadence cutpoints used may not be generalizable. The PFIT-s is a safe and inexpensive test of physical function with high clinical utility. It is valid, responsive to change, and predictive of key outcomes. It is recommended that the PFIT-s be adopted to test physical function in the ICU.

  20. Improving ability measurement in surveys by following the principles of IRT: The Wordsum vocabulary test in the General Social Survey.

    Science.gov (United States)

    Cor, M Ken; Haertel, Edward; Krosnick, Jon A; Malhotra, Neil

    2012-09-01

    Survey researchers often administer batteries of questions to measure respondents' abilities, but these batteries are not always designed in keeping with the principles of optimal test construction. This paper illustrates one instance in which following these principles can improve a measurement tool used widely in the social and behavioral sciences: the GSS's vocabulary test called "Wordsum". This ten-item test is composed of very difficult items and very easy items, and item response theory (IRT) suggests that the omission of moderately difficult items is likely to have handicapped Wordsum's effectiveness. Analyses of data from national samples of thousands of American adults show that after adding four moderately difficult items to create a 14-item battery, "Wordsumplus" (1) outperformed the original battery in terms of quality indicators suggested by classical test theory; (2) reduced the standard error of IRT ability estimates in the middle of the latent ability dimension; and (3) exhibited higher concurrent validity. These findings show how to improve Wordsum and suggest that analysts should use a score based on all 14 items instead of using the summary score provided by the GSS, which is based on only the original 10 items. These results also show more generally how surveys measuring abilities (and other constructs) can benefit from careful application of insights from the contemporary educational testing literature. Copyright © 2012 Elsevier Inc. All rights reserved.

  1. Comparing the Effects of Elementary Music and Visual Arts Lessons on Standardized Mathematics Test Scores

    Science.gov (United States)

    King, Molly Elizabeth

    2016-01-01

    The purpose of this quantitative, causal-comparative study was to compare the effect elementary music and visual arts lessons had on third through sixth grade standardized mathematics test scores. Inferential statistics were used to compare the differences between test scores of students who took in-school, elementary, music instruction during the…

  2. The Implications of Family Size and Birth Order for Test Scores and Behavioral Development

    Science.gov (United States)

    Silles, Mary A.

    2010-01-01

    This article, using longitudinal data from the National Child Development Study, presents new evidence on the effects of family size and birth order on test scores and behavioral development at age 7, 11 and 16. Sibling size is shown to have an adverse causal effect on test scores and behavioral development. For any given family size, first-borns…

  3. Using Raters from India to Score a Large-Scale Speaking Test

    Science.gov (United States)

    Xi, Xiaoming; Mollaun, Pam

    2011-01-01

    We investigated the scoring of the Speaking section of the Test of English as a Foreign Language[TM] Internet-based (TOEFL iBT[R]) test by speakers of English and one or more Indian languages. We explored the extent to which raters from India, after being trained and certified, were able to score the TOEFL examinees with mixed first languages…

  4. AP Trends: Tests Soar, Scores Slip--Gaps between Groups Spur Equity Concerns

    Science.gov (United States)

    Cech, Scott J.

    2008-01-01

    More students are taking Advanced Placement tests, but the proportion of tests receiving what is deemed a passing score has dipped, and the mean score is down for the fourth year in a row. Data released here this week by the New York City-based nonprofit organization that owns the AP brand shows that a greater-than-ever proportion of students…

  5. Musical Ability and the Drake Music Memory Test

    Science.gov (United States)

    Griffin, Lawrence R.; Eisenman, Russell

    1972-01-01

    Results show that the Drake Music Memory Test should be able to discriminate between the poorest and strongest prospects for success in profiting from musical instruction, although it may not be particularly useful in individual counseling. (Authors)

  6. Validating a test to assess early childhood learners’ ability to perceive, express and appreciate emotions

    Directory of Open Access Journals (Sweden)

    Jose Miguel Mestre Navas

    2011-10-01

    Emotional Education, regardless of the school level, has an important mission in the goal of any educational project: socialising younger generations. However, it is also important to assess implemented programs by means of a valid, reliable measure of the progression of children’s cognitive and emotional development. Using a sample of 138 early childhood learners (aged from 3 to 6), this paper tested an instrument for assessing the ability to perceive, appreciate and express emotions (as defined by Mayer & Salovey’s model, 1997; 2007). Also, external criteria were developed by teachers on several issues related to children’s social and personal adaptation (school rules, achievement, impulsiveness, social acceptance of peers and hostility). Findings suggest that children from 3 to 6 years who obtain the best scores in the perception and assessment of basic emotions are considered by their teachers to better adjust to school rules, to better control impulses, to achieve better academic performance and to be less problematic. It is also important to note that the study is at its initial stages and presents some limitations, as certain important variables such as personality and verbal ability are not controlled. Nevertheless, it should be pointed out that children showed great enthusiasm in taking the test.

  7. The importance of measurement invariance in neurocognitive ability testing

    NARCIS (Netherlands)

    Wicherts, J.

    2016-01-01

    Objective: Neurocognitive test batteries such as recent editions of the Wechsler’s Adult Intelligence Scale (WAIS-III/WAIS-IV) typically use nation-level population-based norms. The question is whether these batteries function in the same manner across different subgroups based on gender, age,

  8. A process dissociation approach to objective-projective test score interrelationships.

    Science.gov (United States)

    Bornstein, Robert F

    2002-02-01

    Even when self-report and projective measures of a given trait or motive both predict theoretically related features of behavior, scores on the 2 tests correlate modestly with each other. This article describes a process dissociation framework for personality assessment, derived from research on implicit memory and learning, which can resolve these ostensibly conflicting results. Research on interpersonal dependency is used to illustrate 3 key steps in the process dissociation approach: (a) converging behavioral predictions, (b) modest test score intercorrelations, and (c) delineation of variables that differentially affect self-report and projective test scores. Implications of the process dissociation framework for personality assessment and test development are discussed.

  9. Do in-training evaluation reports deserve their bad reputations? A study of the reliability and predictive ability of ITER scores and narrative comments.

    Science.gov (United States)

    Ginsburg, Shiphra; Eva, Kevin; Regehr, Glenn

    2013-10-01

    Although scores on in-training evaluation reports (ITERs) are often criticized for poor reliability and validity, ITER comments may yield valuable information. The authors assessed across-rotation reliability of ITER scores in one internal medicine program, ability of ITER scores and comments to predict postgraduate year three (PGY3) performance, and reliability and incremental predictive validity of attendings' analysis of written comments. Numeric and narrative data from the first two years of ITERs for one cohort of residents at the University of Toronto Faculty of Medicine (2009-2011) were assessed for reliability and predictive validity of third-year performance. Twenty-four faculty attendings rank-ordered comments (without scores) such that each resident was ranked by three faculty. Mean ITER scores and comment rankings were submitted to regression analyses; dependent variables were PGY3 ITER scores and program directors' rankings. Reliabilities of ITER scores across nine rotations for 63 residents were 0.53 for both postgraduate year one (PGY1) and postgraduate year two (PGY2). Interrater reliabilities across three attendings' rankings were 0.83 for PGY1 and 0.79 for PGY2. There were strong correlations between ITER scores and comments within each year (0.72 and 0.70). Regressions revealed that PGY1 and PGY2 ITER scores collectively explained 25% of variance in PGY3 scores and 46% of variance in PGY3 rankings. Comment rankings did not improve predictions. ITER scores across multiple rotations showed decent reliability and predictive validity. Comment ranks did not add to the predictive ability, but correlation analyses suggest that trainee performance can be measured through these comments.

  10. Grammar tests increase the ability to lateralize language function in the Wada test.

    Science.gov (United States)

    Połczyńska, Monika; Curtiss, Susan; Walshaw, Particia; Siddarth, Prabha; Benjamin, Chris; Moseley, Brian D; Vigil, Celia; Jones, Michael; Eliashiv, Dawn; Bookheimer, Susan

    2014-12-01

    Grammar is a core component of the language system, yet it is rarely assessed during the Wada (intracarotid amobarbital) test. It is hypothesized that adding grammar tests to the recovery phase of the Wada test will increase our ability to lateralize language function. Sixteen individuals (nine females, fifteen right-handed, mean age 38.4 years, SD=10.7) with medically refractory temporal lobe epilepsy participated in the study. On EEG ten patients had seizures originating in the left hemisphere (LH), five in the right hemisphere (RH), and one was insufficiently lateralized. We included only patients who were LH-dominant on the standard test in the encoding phase of the Wada test. In the recovery phase of Wada testing the participants underwent evaluation with a standard language and a new test of grammar, the CYCLE-N. Ten patients underwent bilateral injections, six unilateral (one RH, five LH). As expected, injection in the LH decreased language performance to a greater extent than injection to the RH on both tests. However, the CYCLE-N produced more profound language deficits in the injected LH compared to the RH (p=0.01), whereas the standard tests did not cause such pronounced differences (p=0.2). The results suggest that the standard tests did not significantly differentiate the effects of the injections and the CYCLE-N, for the most part, did. Our results are of particular relevance to patients who are too obtunded to speak in the encoding phase. In sum, the CYCLE-N may be helpful in assessing hemispheric dominance for language. Copyright © 2014 Elsevier B.V. All rights reserved.

  11. Construction and Evaluation of Reliability and Validity of Reasoning Ability Test

    Science.gov (United States)

    Bhat, Mehraj A.

    2014-01-01

    This paper is based on the construction and evaluation of the reliability and validity of a reasoning ability test for secondary school students. In this paper an attempt was made to evaluate validity and reliability and to determine the appropriate standards to interpret the results of the reasoning ability test. The test includes 45 items to measure six types…

  12. An Examination of English Speaking Tests and Research on English Speaking Ability.

    Science.gov (United States)

    Nakamura, Yuji

    This paper examines both overseas and domestic tests of English speaking ability from the viewpoint of the crucial testing elements such as definition of speaking ability, validity, reliability, and practicality. The paper points out problems to be solved and proposes suggestions for constructing an oral proficiency test in order to determine the…

  13. A comparison of likelihood ratio tests and Rao's score test for three separable covariance matrix structures.

    Science.gov (United States)

    Filipiak, Katarzyna; Klein, Daniel; Roy, Anuradha

    2017-01-01

    The problem of testing the separability of a covariance matrix against an unstructured variance-covariance matrix is studied in the context of multivariate repeated measures data using Rao's score test (RST). The RST statistic is developed with the first component of the separable structure as a first-order autoregressive (AR(1)) correlation matrix or an unstructured (UN) covariance matrix under the assumption of multivariate normality. It is shown that the distribution of the RST statistic under the null hypothesis of any separability does not depend on the true values of the mean or the unstructured components of the separable structure. A significant advantage of the RST is that it can be performed for small samples, even smaller than the dimension of the data, where the likelihood ratio test (LRT) cannot be used, and it outperforms the standard LRT in a number of contexts. Monte Carlo simulations are then used to study the comparative behavior of the null distribution of the RST statistic, as well as that of the LRT statistic, in terms of sample size considerations, and for the estimation of the empirical percentiles. Our findings are compared with existing results where the first component of the separable structure is a compound symmetry (CS) correlation matrix. It is also shown by simulations that the empirical null distribution of the RST statistic converges faster than the empirical null distribution of the LRT statistic to the limiting χ² distribution. The tests are implemented on a real dataset from medical studies. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  14. Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

    Science.gov (United States)

    Kolen, Michael J.; Lee, Won-Chan

    2011-01-01

    This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

  15. Effects of Analytical and Holistic Scoring Patterns on Scorer Reliability in Biology Essay Tests

    Science.gov (United States)

    Ebuoh, Casmir N.

    2018-01-01

    The literature reveals that the patterns/methods of scoring essay tests have been criticized for not being reliable, and that this unreliability is likely to be greater in internal examinations than in external examinations. The purpose of this study is to find out the effects of analytical and holistic scoring patterns on scorer reliability in…

  16. Optimal Scoring Methods of Hand-Strength Tests in Patients with Stroke

    Science.gov (United States)

    Huang, Sheau-Ling; Hsieh, Ching-Lin; Lin, Jau-Hong; Chen, Hui-Mei

    2011-01-01

    The purpose of this study was to determine the optimal scoring methods for measuring strength of the more-affected hand in patients with stroke by examining the effect of reducing measurement errors. Three hand-strength tests of grip, palmar pinch, and lateral pinch were administered at two sessions in 56 patients with stroke. Five scoring methods…

  17. Individual Differences in Digit Span, Susceptibility to Proactive Interference, and Aptitude/Achievement Test Scores.

    Science.gov (United States)

    Dempster, Frank N.; Cooney, John B.

    1982-01-01

    Individual differences in digit span, susceptibility to proactive interference, and various aptitude/achievement test scores were investigated in two experiments with college students. Results indicated that digit span was strongly correlated with aptitude/achievement scores, but did not indicate that susceptibility to proactive interference…

  18. TOEFL iBT Speaking Test Scores as Indicators of Oral Communicative Language Proficiency

    Science.gov (United States)

    Bridgeman, Brent; Powers, Donald; Stone, Elizabeth; Mollaun, Pamela

    2012-01-01

    Scores assigned by trained raters and by an automated scoring system (SpeechRater[TM]) on the speaking section of the TOEFL iBT[TM] were validated against a communicative competence criterion. Specifically, a sample of 555 undergraduate students listened to speech samples from 184 examinees who took the Test of English as a Foreign Language…

  19. The Impact of the 2004 Hurricanes on Florida Comprehensive Assessment Test Scores: Implications for School Counselors

    Science.gov (United States)

    Baggerly, Jennifer; Ferretti, Larissa K.

    2008-01-01

    What is the impact of natural disasters on students' statewide assessment scores? To answer this question, Florida Comprehensive Assessment Test (FCAT) scores of 55,881 students in grades 4 through 10 were analyzed to determine if there were significant decreases after the 2004 hurricanes. Results reveal that there was statistical but no practical…

  20. Use of Standardized Test Scores to Predict Success in a Computer Applications Course

    Science.gov (United States)

    Harris, Robert V.; King, Stephanie B.

    2016-01-01

    The purpose of this study was to see if a relationship existed between American College Testing (ACT) scores (i.e., English, reading, mathematics, science reasoning, and composite) and student success in a computer applications course at a Mississippi community college. The study showed that while the ACT scores were excellent predictors of…

  1. The ability of PAM50 risk of recurrence score to predict 10-year distant recurrence in hormone receptor-positive postmenopausal women with special histological subtypes

    DEFF Research Database (Denmark)

    Laenkholm, Anne-Vibeke; Jensen, Maj-Britt; Eriksen, Jens Ole

    2018-01-01

    INTRODUCTION: The Prosigna-PAM50 risk of recurrence (ROR) score has been validated in randomized clinical trials to predict 10-year distant recurrence (DR) in hormone receptor-positive breast cancer. Here, we examine the ability of Prosigna for predicting DR at 10 years in a subgroup of postmenop...

  2. Achilles tendon Total Rupture Score at 3 months can predict patients' ability to return to sport 1 year after injury

    DEFF Research Database (Denmark)

    Hansen, Maria Swennergren; Christensen, Marianne; Budolfsen, Thomas

    2016-01-01

    PURPOSE: To investigate how the Achilles tendon Total Rupture Score (ATRS) at 3 months and 1 year after injury is associated with a patient's ability to return to work and sports as well as to investigate whether sex and age influence ATRS after 3 months and 1 year. METHOD: This is a retrospectiv...

  3. Psychometric Evaluation of the Lower Extremity Computerized Adaptive Test, the Modified Harris Hip Score, and the Hip Outcome Score.

    Science.gov (United States)

    Hung, Man; Hon, Shirley D; Cheng, Christine; Franklin, Jeremy D; Aoki, Stephen K; Anderson, Mike B; Kapron, Ashley L; Peters, Christopher L; Pelt, Christopher E

    2014-12-01

    The applicability and validity of many patient-reported outcome measures in the high-functioning population are not well understood. To compare the psychometric properties of the modified Harris Hip Score (mHHS), the Hip Outcome Score activities of daily living subscale (HOS-ADL) and sports (HOS-sports), and the Lower Extremity Computerized Adaptive Test (LE CAT). The hypothesis was that all instruments would perform well but that the LE CAT would show superiority psychometrically because a combination of CAT and a large item bank allows for a high degree of measurement precision. Cohort study (diagnosis); Level of evidence, 2. Data were collected from 472 advanced-age, active participants from the Huntsman World Senior Games in 2012. Validity evidences were examined through item fit, dimensionality, monotonicity, local independence, differential item functioning, person raw score to measure correlation, and instrument coverage (ie, ceiling and floor effects), and reliability evidences were examined through Cronbach alpha and person separation index. All instruments demonstrated good item fit, unidimensionality, monotonicity, local independence, and person raw score to measure correlations. The HOS-ADL had high ceiling effects of 36.02%, and the mHHS had ceiling effects of 27.54%. The LE CAT had ceiling effects of 8.47%, and the HOS-sports had no ceiling effects. None of the instruments had any floor effects. The mHHS had a very low Cronbach alpha of 0.41 and an extremely low person separation index of 0.08. Reliabilities for the LE CAT were excellent and for the HOS-ADL and HOS-sports were good. The LE CAT showed better psychometric properties overall than the HOS-ADL, HOS-sports, and mHHS for the senior population. The mHHS demonstrated pronounced ceiling effects and poor reliabilities that should be of concern. The high ceiling effects for the HOS-ADL were also of concern. The LE CAT was superior in all psychometric aspects examined in this study. Future

  4. Contributions of Hamstring Stiffness to Straight-Leg-Raise and Sit-and-Reach Test Scores.

    Science.gov (United States)

    Miyamoto, Naokazu; Hirata, Kosuke; Kimura, Noriko; Miyamoto-Mikami, Eri

    2018-02-01

    The passive straight-leg-raise (PSLR) and the sit-and-reach (SR) tests have been widely used to assess hamstring extensibility. However, it remains unclear to what extent hamstring stiffness (a measure of material properties) contributes to PSLR and SR test scores. Therefore, we aimed to clarify the relationship between hamstring stiffness and PSLR and SR scores using ultrasound shear wave elastography. Ninety-eight healthy subjects completed the study. Each subject completed PSLR testing, and classic and modified SR testing of the right leg. Muscle shear modulus of the biceps femoris, semitendinosus, and semimembranosus was quantified as an index of muscle stiffness. The relationships between shear modulus of each muscle and PSLR or SR scores were calculated using Pearson's product-moment correlation coefficients. Shear modulus of the semitendinosus and semimembranosus showed negative correlations with the two PSLR and two SR scores (absolute r value≤0.484). Shear modulus of the biceps femoris was significantly correlated with the PSLR score determined by the examiner and the modified SR score (absolute r value≤0.308). The present findings suggest that PSLR and SR test scores are strongly influenced by factors other than hamstring stiffness and therefore might not accurately evaluate hamstring stiffness. © Georg Thieme Verlag KG Stuttgart · New York.

  5. Relationships between spatial activities and scores on the mental rotation test as a function of sex.

    Science.gov (United States)

    Ginn, Sheryl R; Pickens, Stefanie J

    2005-06-01

    Previous results suggested that female college students' scores on the Mental Rotations Test might be related to their prior experience with spatial tasks. For example, women who played video games scored better on the test than their non-game-playing peers, whereas playing video games was not related to men's scores. The present study examined whether participation in different types of spatial activities would be related to women's performance on the Mental Rotations Test. 31 men and 59 women enrolled at a small, private church-affiliated university and majoring in art or music as well as students who participated in intercollegiate athletics completed the Mental Rotations Test. Women's scores on the Mental Rotations Test benefitted from experience with spatial activities; the more types of experience the women had, the better their scores. Thus women who were athletes, musicians, or artists scored better than those women who had no experience with these activities. The opposite results were found for the men. Efforts are currently underway to assess how length of experience and which types of experience are related to scores.

  6. A Maturing Global Testing Regime Meets the World Economy: Test Scores and Economic Growth, 1960-2012

    Science.gov (United States)

    Kamens, David H.

    2015-01-01

    This article considers the growth of the international testing regime. It discusses sources of growth and empirically examines two related sets of issues: (1) the stability of countries' achievement scores, and (2) the influence of those national scores on subsequent economic development over different time lags. The article suggests that…

  7. Identifying genetic marker sets associated with phenotypes via an efficient adaptive score test

    KAUST Repository

    Cai, T.; Lin, X.; Carroll, R. J.

    2012-01-01

    the overall effect of a marker-set have been actively studied in recent years. For example, score tests derived under an Empirical Bayes (EB) framework (Liu and others, 2007. Semiparametric regression of multidimensional genetic pathway data: least

  8. Outcome of older persons admitted to intensive care unit, mortality, prognosis factors, dependency scores and ability trajectory within 1 year: a prospective cohort study.

    Science.gov (United States)

    Level, Claude; Tellier, Eric; Dezou, Patrick; Chaoui, Karim; Kherchache, Aissa; Sejourné, Philippe; Rullion-Pac Soo, Anne Marie

    2017-12-06

    The outcome and functional trajectory of older persons admitted to the intensive care unit (ICU) remain a true question for critical care physicians and geriatricians, due to the heterogeneity of the geriatric population, heterogeneity of practices and absence of guidelines. To describe the 1-year outcome, prognosis factors and functional trajectory for older people admitted to ICU. In a prospective 1-year cohort study, all patients aged 75 years and over admitted to our ICU were included according to a global comprehensive geriatric assessment. Follow-up was conducted for 1-year survivors, in particular, ability scores and living conditions. Of 188 patients included [aged 82.3 ± 4.7 years, 46% of admissions, median SAPS II 53.5 (43-74), ADL of Katz's score 4.2 ± 1.6, median Barthel's index 71 (55-90), AGGIR scale 4.5 ± 1.5], the ICU, hospital and 1-year mortality were, respectively, 34, 42.5 and 65.5%. Prognosis factors were: SAPS 2, mechanical ventilation, comorbidity (Lee's and McCabe's scores), disability scores (ADL of Katz's score, Barthel's index and AGGIR scale), admission creatinine, hypoalbuminemia, malignant haemopathy, cognitive impairment. One-year survivors lived in their own home for 83%, with a preserved physical ability, without significant variation of the three ability assessed scores compared to prior ICU admission. The mortality of older people admitted to ICU is high, with a significant impact of disability scores, and preserved 1-year survivor independency. Other studies, including a better comprehensive geriatric assessment, seem necessary to determine a predictive "phenotype" of survival with a "satisfactory" level of autonomy.

  9. Generalization of the Lord-Wingersky Algorithm to Computing the Distribution of Summed Test Scores Based on Real-Number Item Scores

    Science.gov (United States)

    Kim, Seonghoon

    2013-01-01

    With known item response theory (IRT) item parameters, Lord and Wingersky provided a recursive algorithm for computing the conditional frequency distribution of number-correct test scores, given proficiency. This article presents a generalized algorithm for computing the conditional distribution of summed test scores involving real-number item…
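
    The recursion attributed to Lord and Wingersky in this abstract can be sketched for the classic dichotomous (right/wrong) case. This is a minimal illustration, not the generalized real-number algorithm the article presents; the two-parameter logistic item model and the item parameters in the usage lines are assumptions introduced here.

```python
import math

def prob_correct(theta, a, b):
    """Assumed 2PL item response function: P(correct | proficiency theta)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def lord_wingersky(probs):
    """Conditional distribution of the number-correct score.

    probs[i] is the probability of answering item i correctly at a fixed
    proficiency. Returns a list where entry s is P(score == s).
    """
    dist = [1.0]  # before any item is added, the score is 0 with certainty
    for p in probs:
        new = [0.0] * (len(dist) + 1)
        for score, mass in enumerate(dist):
            new[score] += mass * (1.0 - p)  # item answered incorrectly
            new[score + 1] += mass * p      # item answered correctly
        dist = new
    return dist

# Usage with hypothetical (a, b) item parameters at proficiency theta = 0.5.
items = [(1.0, -1.0), (1.2, 0.0), (0.8, 1.0)]
dist = lord_wingersky([prob_correct(0.5, a, b) for a, b in items])
print(dist)
```

    Each pass over an item extends the score distribution by one point, so the cost grows with the square of the number of items rather than with the number of possible response patterns.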

  10. Does breastfeeding contribute to the racial gap in reading and math test scores?

    Science.gov (United States)

    Peters, Kristen E; Huang, Jin; Vaughn, Michael G; Witko, Christopher

    2013-10-01

    The aim of this study was to examine the impact of divergent breastfeeding practices between Caucasian and African American mothers on the lingering achievement test gap between Caucasian and African American children. The Child Development Supplement of the Panel Study of Income Dynamics, beginning in 1997, followed a cohort of 3563 children aged 0-12 years. Reading and math test scores from 2002 for 1928 children were linked with breastfeeding history. Regression analysis was used to examine associations between ever having been breastfed and duration of breastfeeding and test scores, controlling for characteristics of child, mother, and household. African American students scored significantly lower than Caucasian children by 10.6 and 10.9 points on reading and math tests, respectively. After accounting for the impact of having been breastfed during infancy, the racial test gap decreased by 17% for reading scores and 9% for math scores. Study findings indicate that breastfeeding explains 17% and 9% of the observed gaps in reading and math scores, respectively, between African Americans and Caucasians, an effect larger than most recent educational policy interventions. Renewed efforts around policies and clinical practices that promote and remove barriers for African American mothers to breastfeed should be implemented. Copyright © 2013 Elsevier Inc. All rights reserved.

  11. Depressive status explains a significant amount of the variance in COPD assessment test (CAT) scores.

    Science.gov (United States)

    Miravitlles, Marc; Molina, Jesús; Quintano, José Antonio; Campuzano, Anna; Pérez, Joselín; Roncero, Carlos

    2018-01-01

    COPD assessment test (CAT) is a short, easy-to-complete health status tool that has been incorporated into the multidimensional assessment of COPD in order to guide therapy; therefore, it is important to understand the factors determining CAT scores. This is a post hoc analysis of a cross-sectional, observational study conducted in respiratory medicine departments and primary care centers in Spain with the aim of identifying the factors determining CAT scores, focusing particularly on the cognitive status measured by the Mini-Mental State Examination (MMSE) and levels of depression measured by the short Beck Depression Inventory (BDI). A total of 684 COPD patients were analyzed; 84.1% were men, the mean age of patients was 68.7 years, and the mean forced expiratory volume in 1 second (%) was 55.1%. Mean CAT score was 21.8. CAT scores correlated with the MMSE score (Pearson's coefficient r = -0.371) and the BDI (r = 0.620), both p CAT scores and explained 45% of the variability. However, a model including only MMSE and BDI scores explained up to 40% and BDI alone explained 38% of the CAT variance. CAT scores are associated with clinical variables of severity of COPD. However, cognitive status and, in particular, the level of depression explain a larger percentage of the variance in the CAT scores than the usual COPD clinical severity variables.

  12. Test Reviews: Ginsburg, H., & Baroody, A. (2003). "Test of Early Mathematics Ability--Third Edition." Austin, TX: Pro-Ed

    Science.gov (United States)

    Bliss, Stacy

    2006-01-01

    The Test of Early Mathematics Ability--Third Edition (TEMA-3) is a norm-referenced parallel forms test intended to identify the level of mathematical ability for children aged 3 years 0 months through 8 years 11 months. According to the authors, the instrument can also be used as a criterion referenced or diagnostic tool for older students who are…

  13. An Analysis of Cross Racial Identity Scale Scores Using Classical Test Theory and Rasch Item Response Models

    Science.gov (United States)

    Sussman, Joshua; Beaujean, A. Alexander; Worrell, Frank C.; Watson, Stevie

    2013-01-01

    Item response models (IRMs) were used to analyze Cross Racial Identity Scale (CRIS) scores. Rasch analysis scores were compared with classical test theory (CTT) scores. The partial credit model demonstrated a high goodness of fit and correlations between Rasch and CTT scores ranged from 0.91 to 0.99. CRIS scores are supported by both methods.…

  14. The Dental Hygiene Aptitude Tests and the American College Testing Program Tests as Predictors of Scores on the National Board Dental Hygiene Examination.

    Science.gov (United States)

    Longenbecker, Sueann; Wood, Peter H.

    1984-01-01

    Scores from the National Board Dental Hygiene Examination (NBDHE) served as the criterion variable in a comparison of the predictive validity of the Dental Hygiene Aptitude Tests (DHAT) and the ACT Assessment tests. The DHAT-Science and Verbal tests combined to produce the highest multiple correlation with NBDHE scores. (Author/DWH)

  15. Standardised test protocol (Constant Score) for evaluation of functionality in patients with shoulder disorders

    DEFF Research Database (Denmark)

    Ban, Ilija; Troelsen, Anders; Christiansen, David Høyrup

    2013-01-01

    INTRODUCTION: The Constant Score (CS), developed as a scoring system to evaluate overall functionality of patients with shoulder disorders, is widely used but has been criticised for relying on an imprecise terminology and for lack of a standardised methodology. A modified guideline was therefore...... differences. One of the authors of the modified CS approved both the English and the Danish test protocol. CONCLUSION: A simple test protocol of the modified CS was developed in both English and Danish. With precise terminology and definitions, the test protocol is the first of its kind. We suggest its use...

  16. Decision making under internal uncertainty: the case of multiple-choice tests with different scoring rules.

    Science.gov (United States)

    Bereby-Meyer, Yoella; Meyer, Joachim; Budescu, David V

    2003-02-01

    This paper assesses framing effects on decision making with internal uncertainty, i.e., partial knowledge, by focusing on examinees' behavior in multiple-choice (MC) tests with different scoring rules. In two experiments participants answered a general-knowledge MC test that consisted of 34 solvable and 6 unsolvable items. Experiment 1 studied two scoring rules involving Positive (only gains) and Negative (only losses) scores. Although answering all items was the dominating strategy for both rules, the results revealed a greater tendency to answer under the Negative scoring rule. These results are in line with the predictions derived from Prospect Theory (PT) [Econometrica 47 (1979) 263]. The second experiment studied two scoring rules, which allowed respondents to exhibit partial knowledge. Under the Inclusion-scoring rule the respondents mark all answers that could be correct, and under the Exclusion-scoring rule they exclude all answers that might be incorrect. As predicted by PT, respondents took more risks under the Inclusion rule than under the Exclusion rule. The results illustrate that the basic process that underlies choice behavior under internal uncertainty and especially the effect of framing is similar to the process of choice under external uncertainty and can be described quite accurately by PT. Copyright 2002 Elsevier Science B.V.

  17. Construction of an Exome-Wide Risk Score for Schizophrenia Based on a Weighted Burden Test.

    Science.gov (United States)

    Curtis, David

    2018-01-01

    Polygenic risk scores obtained as a weighted sum of associated variants can be used to explore association in additional data sets and to assign risk scores to individuals. The methods used to derive polygenic risk scores from common SNPs are not suitable for variants detected in whole exome sequencing studies. Rare variants, which may have major effects, are seen too infrequently to judge whether they are associated and may not be shared between training and test subjects. A method is proposed whereby variants are weighted according to their frequency, their annotations and the genes they affect. A weighted sum across all variants provides an individual risk score. Scores constructed in this way are used in a weighted burden test and are shown to be significantly different between schizophrenia cases and controls using a five-way cross-validation procedure. This approach represents a first attempt to summarise exome sequence variation into a summary risk score, which could be combined with risk scores from common variants and from environmental factors. It is hoped that the method could be developed further. © 2017 John Wiley & Sons Ltd/University College London.

  18. The Impact of the Use of Hierarchical Teaching on Test Scores of Students’ Technology

    Directory of Open Access Journals (Sweden)

    Zhao Guorong

    2015-01-01

    Students' technical test scores are the main basis for the physical examination and fitness evaluation of college students. When the uniform teaching mode was replaced with a stratified (hierarchical) teaching method, students' sport-specific technical level improved significantly.

  19. How Well Does the Sum Score Summarize the Test? Summability as a Measure of Internal Consistency

    NARCIS (Netherlands)

    Goeman, J.J.; De Jong, N.H.

    2018-01-01

    Many researchers use Cronbach's alpha to demonstrate internal consistency, even though it has been shown numerous times that Cronbach's alpha is not suitable for this. Because the intention of questionnaire and test constructers is to summarize the test by its overall sum score, we advocate

  20. The Disaggregation of Value-Added Test Scores to Assess Learning Outcomes in Economics Courses

    Science.gov (United States)

    Walstad, William B.; Wagner, Jamie

    2016-01-01

    This study disaggregates posttest, pretest, and value-added or difference scores in economics into four types of economic learning: positive, retained, negative, and zero. The types are derived from patterns of student responses to individual items on a multiple-choice test. The micro and macro data from the "Test of Understanding in College…
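
    The four learning types named above can be illustrated with a minimal sketch. The abstract does not spell out the item-response patterns, so the mapping below is an assumption: positive = wrong on the pretest and right on the posttest, retained = right on both, negative = right then wrong, zero = wrong on both. The function names and the response data are hypothetical.

```python
from collections import Counter

def learning_type(pre_correct: bool, post_correct: bool) -> str:
    """Classify one item response pattern (assumed mapping, see lead-in)."""
    if not pre_correct and post_correct:
        return "positive"
    if pre_correct and post_correct:
        return "retained"
    if pre_correct and not post_correct:
        return "negative"
    return "zero"

def disaggregate(pre, post):
    """Count the four learning types over paired pretest/posttest item responses."""
    counts = Counter(learning_type(a, b) for a, b in zip(pre, post))
    return {k: counts.get(k, 0) for k in ("positive", "retained", "negative", "zero")}

# Hypothetical responses of one student to five items (True = correct).
pre = [True, False, False, True, False]
post = [True, True, False, False, True]

summary = disaggregate(pre, post)
print(summary)  # {'positive': 2, 'retained': 1, 'negative': 1, 'zero': 1}

# Under this mapping the difference score equals positive minus negative learning.
value_added = sum(post) - sum(pre)
print(value_added == summary["positive"] - summary["negative"])  # True
```

    Under this reading, the same difference score can hide quite different mixes of positive and negative learning, which is the motivation for disaggregating it.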

  1. Zertifikat Deutsch als Fremdsprache and the Oral Proficiency Interview: A Comparison of Test Scores and Examinations.

    Science.gov (United States)

    Lalande, John F.; Schweckendiek, Jurgen

    1986-01-01

    Investigates what correlations might exist between an individual's score on the Zertifikat Deutsch als Fremdsprache and on the Oral Proficiency Interview. The tests themselves are briefly described. Results indicate that the two tests appear to correlate well in their evaluation of speaking skills. (SED)

  2. A weighted generalized score statistic for comparison of predictive values of diagnostic tests.

    Science.gov (United States)

    Kosinski, Andrzej S

    2013-03-15

    Positive and negative predictive values are important measures of a medical diagnostic test performance. We consider testing equality of two positive or two negative predictive values within a paired design in which all patients receive two diagnostic tests. The existing statistical tests for testing equality of predictive values are either Wald tests based on the multinomial distribution or the empirical Wald and generalized score tests within the generalized estimating equations (GEE) framework. As presented in the literature, these test statistics have considerably complex formulas without clear intuitive insight. We propose their re-formulations that are mathematically equivalent but algebraically simple and intuitive. As is clearly seen with a new re-formulation we presented, the generalized score statistic does not always reduce to the commonly used score statistic in the independent samples case. To alleviate this, we introduce a weighted generalized score (WGS) test statistic that incorporates empirical covariance matrix with newly proposed weights. This statistic is simple to compute, always reduces to the score statistic in the independent samples situation, and preserves type I error better than the other statistics as demonstrated by simulations. Thus, we believe that the proposed WGS statistic is the preferred statistic for testing equality of two predictive values and for corresponding sample size computations. The new formulas of the Wald statistics may be useful for easy computation of confidence intervals for difference of predictive values. The introduced concepts have potential to lead to development of the WGS test statistic in a general GEE setting. Copyright © 2012 John Wiley & Sons, Ltd.
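
    As a reminder of the quantities being compared, the sketch below computes positive and negative predictive values from a 2x2 table for two tests given to the same patients. It illustrates only the definitions of PPV and NPV, not the weighted generalized score statistic itself; the function name and all counts are hypothetical.

```python
def predictive_values(tp: int, fp: int, fn: int, tn: int) -> tuple[float, float]:
    """Return (PPV, NPV) from true/false positive and negative counts."""
    ppv = tp / (tp + fp)  # share of positive results that are truly diseased
    npv = tn / (tn + fn)  # share of negative results that are truly disease-free
    return ppv, npv

# Hypothetical counts for two diagnostic tests applied to the same 100 patients.
ppv_a, npv_a = predictive_values(tp=40, fp=10, fn=5, tn=45)
ppv_b, npv_b = predictive_values(tp=36, fp=4, fn=9, tn=51)

print(f"Test A: PPV={ppv_a:.2f}, NPV={npv_a:.2f}")  # Test A: PPV=0.80, NPV=0.90
print(f"Test B: PPV={ppv_b:.2f}, NPV={npv_b:.2f}")  # Test B: PPV=0.90, NPV=0.85
```

    Because both tests are applied to the same patients, the two PPV estimates are correlated, which is why a paired-design statistic is needed rather than a simple two-sample comparison.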

  3. A test for the Assessment of Pragmatic Abilities and Cognitive Substrates (APACS): Normative data and psychometric properties

    Directory of Open Access Journals (Sweden)

    Giorgio Arcara

    2016-02-01

    The Assessment of Pragmatic Abilities and Cognitive Substrates (APACS) test is a new tool to evaluate pragmatic abilities in patients with acquired communicative deficits, ranging from schizophrenia to neurodegenerative diseases. APACS focuses on two main domains, namely discourse and non-literal language, combining traditional tasks with refined linguistic materials in Italian, in a unified framework inspired by language pragmatics. The test includes six tasks (Interview, Description, Narratives, Figurative Language 1, Humor, Figurative Language 2) and three composite scores (Pragmatic Productions, Pragmatic Comprehension, APACS Total). Psychometric properties and normative data were computed on a sample of 119 healthy participants representative of the general population. The analysis revealed acceptable internal consistency and good test-retest reliability for almost every APACS task, suggesting that items are coherent and performance is consistent over time. Factor analysis supports the validity of the test, revealing two factors possibly related to different facets and substrates of the pragmatic competence. Finally, excellent match between APACS items and scores and the pragmatic constructs measured in the test was evidenced by experts’ evaluation of content validity. The performance on APACS showed a general effect of demographic variables, with a negative effect of age and a positive effect of education. The norms were calculated by means of state-of-the-art regression methods. Overall, APACS is a valuable tool for the assessment of pragmatic deficits in verbal communication. The short duration and ease of administration make the test especially suitable to use in clinical settings. In presenting APACS, we also aim at promoting the inclusion of pragmatics in the assessment practice, as a relevant dimension in defining the patient’s cognitive profile, given its vital role for communication and social interaction in daily life.

  4. The Effect of Pretest Exercise on Baseline Computerized Neurocognitive Test Scores.

    Science.gov (United States)

    Pawlukiewicz, Alec; Yengo-Kahn, Aaron M; Solomon, Gary

    2017-10-01

    Baseline neurocognitive assessment plays a critical role in return-to-play decision making following sport-related concussions. Prior studies have assessed the effect of a variety of modifying factors on neurocognitive baseline test scores. However, relatively little investigation has been conducted regarding the effect of pretest exercise on baseline testing. The aim of our investigation was to determine the effect of pretest exercise on baseline Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) scores in adolescent and young adult athletes. We hypothesized that athletes undergoing self-reported strenuous exercise within 3 hours of baseline testing would perform more poorly on neurocognitive metrics and would report a greater number of symptoms than those who had not completed such exercise. Cross-sectional study; Level of evidence, 3. The ImPACT records of 18,245 adolescent and young adult athletes were retrospectively analyzed. After application of inclusion and exclusion criteria, participants were dichotomized into groups based on a positive (n = 664) or negative (n = 6609) self-reported history of strenuous exercise within 3 hours of the baseline test. Participants with a positive history of exercise were then randomly matched, based on age, sex, education level, concussion history, and hours of sleep prior to testing, on a 1:2 basis with individuals who had reported no pretest exercise. The baseline ImPACT composite scores of the 2 groups were then compared. Significant differences were observed for the ImPACT composite scores of verbal memory, visual memory, reaction time, and impulse control as well as for the total symptom score. No significant between-group difference was detected for the visual motor composite score. Furthermore, pretest exercise was associated with a significant increase in the overall frequency of invalid test results. Our results suggest a statistically significant difference in ImPACT composite scores between

  5. The Impact of Correction for Guessing Formula on MC and Yes/No Vocabulary Tests' Scores

    Directory of Open Access Journals (Sweden)

    Abdollah Baradaran

    2009-10-01

    A standard correction for random guessing (cfg) formula on multiple-choice and Yes/No examinations was examined retrospectively in the scores of intermediate female EFL learners in an English language school. The correction was a weighting formula for points awarded for correct answers, incorrect answers, and unanswered questions so that the expected value of the increase in test score due to guessing was zero. The researcher compared uncorrected and corrected scores on examinations using multiple-choice and Yes/No formats. These short-answer formats eliminated or at least greatly reduced the potential for guessing the correct answer. The expectation for students to improve their grade by guessing on multiple-choice and Yes/No format examinations is well known. The researcher examined a method for correcting for random "no knowledge" guessing (cfg) on multiple-choice and Yes/No vocabulary examinations by comparing application and non-application of the cfg formula to scores on these examinations. This was done to determine whether the test takers really knew the correct answer or had resorted to guessing. This study represented a unique opportunity to compare scores from multiple-choice and Yes/No examinations in a setting in which students were given the same number of questions in each of the two format types, testing their knowledge of the same subject matter. The results indicated significant differences between the subjects' scores when the cfg formula was applied and when it was not.
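
    The abstract does not state the exact formula used. A minimal sketch, assuming the classical formula-scoring rule (corrected score = right answers minus wrong answers divided by the number of options minus one, with omitted items scoring zero), shows why the expected gain from blind guessing is zero. The function names and the item counts are hypothetical.

```python
def corrected_score(right: int, wrong: int, n_options: int) -> float:
    """Classical correction for guessing: R - W / (k - 1); omitted items score 0."""
    if n_options < 2:
        raise ValueError("an item needs at least two options")
    return right - wrong / (n_options - 1)

def expected_gain_per_blind_guess(n_options: int) -> float:
    """Expected change in corrected score from one purely random guess."""
    p_right = 1 / n_options
    penalty = 1 / (n_options - 1)
    return p_right * 1 - (1 - p_right) * penalty

# Hypothetical 40-item test: 28 right, 9 wrong, 3 omitted.
print(corrected_score(28, 9, n_options=4))  # 25.0 for four-option multiple choice
print(corrected_score(28, 9, n_options=2))  # 19.0 for a Yes/No format

# A blind guess is expected to gain nothing under this rule.
print(expected_gain_per_blind_guess(4))  # 0.0
print(expected_gain_per_blind_guess(2))  # 0.0
```

    With only two options, each wrong answer cancels one right answer, so the same raw responses are penalized more heavily on the Yes/No format than on four-option multiple choice.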

  6. Validation of new prognostic and predictive scores by sequential testing approach

    International Nuclear Information System (INIS)

    Nieder, Carsten; Haukland, Ellinor; Pawinski, Adam; Dalhaug, Astrid

    2010-01-01

    Background and Purpose: For practitioners, the question arises how their own patient population differs from that used in large-scale analyses resulting in new scores and nomograms and whether such tools actually are valid at a local level and thus can be implemented. A recent article proposed an easy-to-use method for the in-clinic validation of new prediction tools with a limited number of patients, a so-called sequential testing approach. The present study evaluates this approach in scores related to radiation oncology. Material and Methods: Three different scores were used, each predicting short overall survival after palliative radiotherapy (bone metastases, brain metastases, metastatic spinal cord compression). For each scenario, a limited number of consecutive patients entered the sequential testing approach. The positive predictive value (PPV) was used for validation of the respective score and it was required that the PPV exceeded 80%. Results: For two scores, validity in the own local patient population could be confirmed after entering 13 and 17 patients, respectively. For the third score, no decision could be reached even after increasing the sample size to 30. Conclusion: In-clinic validation of new predictive tools with sequential testing approach should be preferred over uncritical adoption of tools which provide no significant benefit to local patient populations. Often the necessary number of patients can be reached within reasonable time frames even in small oncology practices. In addition, validation is performed continuously as the data are collected. (orig.)

  7. Validation of new prognostic and predictive scores by sequential testing approach

    Energy Technology Data Exchange (ETDEWEB)

    Nieder, Carsten [Radiation Oncology Unit, Nordland Hospital, Bodo (Norway); Inst. of Clinical Medicine, Univ. of Tromso (Norway); Haukland, Ellinor; Pawinski, Adam; Dalhaug, Astrid [Radiation Oncology Unit, Nordland Hospital, Bodo (Norway)

    2010-03-15

    Background and Purpose: For practitioners, the question arises how their own patient population differs from that used in large-scale analyses resulting in new scores and nomograms and whether such tools actually are valid at a local level and thus can be implemented. A recent article proposed an easy-to-use method for the in-clinic validation of new prediction tools with a limited number of patients, a so-called sequential testing approach. The present study evaluates this approach in scores related to radiation oncology. Material and Methods: Three different scores were used, each predicting short overall survival after palliative radiotherapy (bone metastases, brain metastases, metastatic spinal cord compression). For each scenario, a limited number of consecutive patients entered the sequential testing approach. The positive predictive value (PPV) was used for validation of the respective score and it was required that the PPV exceeded 80%. Results: For two scores, validity in the own local patient population could be confirmed after entering 13 and 17 patients, respectively. For the third score, no decision could be reached even after increasing the sample size to 30. Conclusion: In-clinic validation of new predictive tools with sequential testing approach should be preferred over uncritical adoption of tools which provide no significant benefit to local patient populations. Often the necessary number of patients can be reached within reasonable time frames even in small oncology practices. In addition, validation is performed continuously as the data are collected. (orig.)

  8. Use of the Short Physical Performance Battery Score to predict loss of ability to walk 400 meters: analysis from the InCHIANTI study.

    Science.gov (United States)

    Vasunilashorn, Sarinnapha; Coppin, Antonia K; Patel, Kushang V; Lauretani, Fulvio; Ferrucci, Luigi; Bandinelli, Stefania; Guralnik, Jack M

    2009-02-01

    Early detection of mobility limitations remains an important goal for preventing mobility disability. The purpose of this study was to examine the association between the Short Physical Performance Battery (SPPB) and the loss of ability to walk 400 m, an objectively assessed mobility outcome increasingly used in clinical trials. The study sample consisted of 542 adults from the InCHIANTI study aged 65 and older, who completed the 400 m walk at baseline and had evaluations on the SPPB and 400 m walk at baseline and 3-year follow-up. Multiple logistic regression models were used to determine whether SPPB scores predict the loss of ability to walk 400 m at follow-up among persons able to walk 400 m at baseline. The 3-year incidence of failing the 400 m walk was 15.5%. After adjusting for age, sex, education, body mass index, Mini-Mental State Examination, number of medical conditions, and 400 m walk gait speed at baseline, SPPB score was significantly associated with loss of ability to walk 400 m after 3 years. Participants with SPPB scores of 10 or lower at baseline had significantly higher odds of mobility disability at follow-up (odds ratio [OR] = 3.38, 95% confidence interval [CI]: 1.32-8.65) compared with those who scored 12, with a graded response across the range of SPPB scores (OR = 26.93, 95% CI: 7.51-96.50; OR = 7.67, 95% CI: 2.26-26.04; OR = 8.28, 95% CI: 3.32-20.67 for SPPB 400 m. Thus, using the SPPB to identify older persons at high risk of lower body functional limitations seems a valid means of recognizing individuals who would benefit most from preventive interventions.

  9. A score based on screening tests to differentiate mild cognitive impairment from subjective memory complaints

    Directory of Open Access Journals (Sweden)

    Fábio Henrique de Gobbi Porto

    2013-09-01

    It is not easy to differentiate patients with mild cognitive impairment (MCI) from subjective memory complainers (SMC). Assessments with screening cognitive tools are essential, particularly in primary care where most patients are seen. The objective of this study was to evaluate the diagnostic accuracy of screening cognitive tests and to propose a score derived from screening tests. Elderly subjects with memory complaints were evaluated using the Mini Mental State Examination (MMSE) and the Brief Cognitive Battery (BCB). We added two delayed recalls in the MMSE (a delayed recall and a late-delayed recall, LDR), and also a phonemic fluency test of letter P fluency (LPF). A score was created based on these tests. The diagnoses were made on the basis of clinical consensus and neuropsychological testing. Receiver operating characteristic curve analyses were used to determine the area under the curve (AUC), sensitivity and specificity for each test separately and for the final proposed score. MMSE, LDR, LPF and delayed recall of BCB scores reached statistically significant differences between groups (P=0.000, 0.03, 0.001 and 0.01, respectively). Sensitivity, specificity and AUC were MMSE: 64%, 79% and 0.75 (cut off <29); LDR: 56%, 62% and 0.62 (cut off <3); LPF: 71%, 71% and 0.71 (cut off <14); delayed recall of BCB: 56%, 82% and 0.68 (cut off <9). The proposed score reached a sensitivity of 88% and 76% and specificity of 62% and 75% for cut off over 1 and over 2, respectively. The AUC was 0.81. In conclusion, a score created from screening tests is capable of discriminating MCI from SMC with moderate to good accuracy.
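
    The accuracy measures reported above can be illustrated with a short sketch. It assumes, as the "cut off <29" notation suggests, that a score below the cutoff counts as a positive screen, and it estimates the AUC as the probability that a randomly chosen case scores lower than a randomly chosen control (ties counted as one half). The function names and all scores are hypothetical, not data from the study.

```python
def sens_spec(case_scores, control_scores, cutoff):
    """Sensitivity and specificity when a score below `cutoff` is a positive screen."""
    true_pos = sum(s < cutoff for s in case_scores)
    true_neg = sum(s >= cutoff for s in control_scores)
    return true_pos / len(case_scores), true_neg / len(control_scores)

def auc(case_scores, control_scores):
    """Rank-based AUC: P(case < control) + 0.5 * P(case == control)."""
    wins = 0.0
    for c in case_scores:
        for h in control_scores:
            if c < h:
                wins += 1.0
            elif c == h:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))

# Hypothetical MMSE-like scores (lower = more impaired).
mci = [25, 27, 28, 29, 26]   # cases
smc = [29, 30, 28, 30, 29]   # controls

sensitivity, specificity = sens_spec(mci, smc, cutoff=29)
print(sensitivity, specificity)  # 0.8 0.8
print(auc(mci, smc))             # 0.9
```

    Moving the cutoff trades sensitivity against specificity, whereas the AUC summarizes discrimination over all possible cutoffs.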

  10. The Effects of Group Members' Personalities on a Test Taker's L2 Group Oral Discussion Test Scores

    Science.gov (United States)

    Ockey, Gary J.

    2009-01-01

    The second language group oral is a test of second language speaking proficiency, in which a group of three or more English language learners discuss an assigned topic without interaction with interlocutors. Concerns expressed about the extent to which test takers' personal characteristics affect the scores of others in the group have limited its…

  11. Do Standardized Tests Penalize Deep-Thinking, Creative, or Conscientious Students?: Some Personality Correlates of Graduate Record Examinations Test Scores

    Science.gov (United States)

    Powers, Donald E.; Kaufman, James C.

    2004-01-01

    The objective of the study reported here was to explore the relationship of Graduate Record Examinations (GRE) General Test scores to selected personality traits--conscientiousness, rationality, ingenuity, quickness, creativity, and depth. A sample of 342 GRE test takers completed short personality inventory scales for each trait. Analyses…

  12. Online pre-race education improves test scores for volunteers at a marathon.

    Science.gov (United States)

    Maxwell, Shane; Renier, Colleen; Sikka, Robby; Widstrom, Luke; Paulson, William; Christensen, Trent; Olson, David; Nelson, Benjamin

    2017-09-01

    This study examined whether an online course would lead to increased knowledge about the medical issues volunteers encounter during a marathon. Health care professionals who volunteered to provide medical coverage for an annual marathon were eligible for the study. Demographic information about medical volunteers including profession, specialty, education level and number of marathons they had volunteered for was collected. A 15-question test about the most commonly encountered medical issues was created by the authors and administered before and after the volunteers took the online educational course and compared to a pilot study the previous year. Seventy-four subjects completed the pre-test. Those who participated in the pilot study last year (N = 15) had pre-test scores that were an average of 2.4 points higher than those who did not (mean ranks: pilot study = 51.6 vs. non-pilot = 33.9, p = 0.004). Of the 74 subjects who completed the pre-test, 54 also completed the post-test. The overall post-pre mean score difference was 3.8 ± 2.7 (t = 10.5 df = 53 p online education demonstrated a long-term (one-year) increase in test scores. Testing also continued to show short-term improvement in post-course test scores, compared to pre-course test scores. In general, marathon medical volunteers who had no volunteer experience demonstrated greater improvement than those who had prior volunteer experience.

  13. Opportunity to learn: Investigating possible predictors for pre-course Test Of Astronomy STandards (TOAST) scores

    Science.gov (United States)

    Berryhill, Katie J.

    As astronomy education researchers become more interested in experimentally testing innovative teaching strategies to enhance learning in introductory astronomy survey courses ("ASTRO 101"), scholars are placing increased attention toward better understanding factors impacting student gain scores on the widely used Test Of Astronomy STandards (TOAST). Usually used in a pre-test and post-test research design, one might naturally assume that the pre-course differences observed between high- and low-scoring college students might be due in large part to their pre-existing motivation, interest, experience in science, and attitudes about astronomy. To explore this notion, 11 non-science majoring undergraduates taking ASTRO 101 at west coast community colleges were interviewed in the first few weeks of the course to better understand students' pre-existing affect toward learning astronomy with an eye toward predicting student success. In answering this question, we hope to contribute to our understanding of the incoming knowledge of students taking undergraduate introductory astronomy classes, but also gain insight into how faculty can best meet those students' needs and assist them in achieving success. Perhaps surprisingly, there was only weak correlation between students' motivation toward learning astronomy and their pre-test scores. Instead, the most fruitful predictor of TOAST pre-test scores was the quantity of pre-existing, informal, self-directed astronomy learning experiences.

  14. Gender Gaps in High School GPA and ACT Scores: High School Grade Point Average and ACT Test Score by Subject and Gender. Information Brief 2014-12

    Science.gov (United States)

    ACT, Inc., 2014

    2014-01-01

    Female students who graduated from high school in 2013 averaged higher grades than their male counterparts in all subjects, but male graduates earned higher scores on the math and science sections of the ACT. This information brief looks at high school grade point average and ACT test score by subject and gender

  15. The Impact of Linking Distinct Achievement Test Scores on the Interpretation of Student Growth in Achievement

    Science.gov (United States)

    Airola, Denise Tobin

    2011-01-01

    Changes to state tests impact the ability of State Education Agencies (SEAs) to monitor change in performance over time. The purpose of this study was to evaluate the Standardized Performance Growth Index (PGIz), a proposed statistical model for measuring change in student and school performance, across transitions in tests. The PGIz is a…

  16. EAP Study Recommendations and Score Gains on the IELTS Academic Writing Test

    Science.gov (United States)

    Green, Anthony

    2005-01-01

    The IELTS test is widely accepted by university admissions offices as evidence of English language ability. The test is also used to guide decisions about the amount of language study required for students to satisfy admissions requirements. Guidelines currently published by the British Association of Lecturers in English for Academic Purposes…

  17. Genetic Tests for Ability?: Talent Identification and the Value of an Open Future

    Science.gov (United States)

    Miah, Andy; Rich, Emma

    2006-01-01

    This paper explores the prospect of genetic tests for performance in physical activity and sports practices. It investigates the terminology associated with genetics, testing, selection and ability as a means towards a socio-ethical analysis of its value within sport, education and society. Our argument suggests that genetic tests need not even be…

  18. At the Interface between Language Testing and Second Language Acquisition: Language Ability and Context of Learning

    Science.gov (United States)

    Gu, Lin

    2014-01-01

    This study investigated the relationship between latent components of academic English language ability and test takers' study-abroad and classroom learning experiences through a structural equation modeling approach in the context of TOEFL iBT® testing. Data from the TOEFL iBT public dataset were used. The results showed that test takers'…

  19. Associations of maximal strength and muscular endurance test scores with cardiorespiratory fitness and body composition.

    Science.gov (United States)

    Vaara, Jani P; Kyröläinen, Heikki; Niemi, Jaakko; Ohrankämmen, Olli; Häkkinen, Arja; Kocay, Sheila; Häkkinen, Keijo

    2012-08-01

    The purpose of the present study was to assess the relationships between maximal strength and muscular endurance test scores additionally to previously widely studied measures of body composition and maximal aerobic capacity. 846 young men (25.5 ± 5.0 yrs) participated in the study. Maximal strength was measured using isometric bench press, leg extension and grip strength. Muscular endurance tests consisted of push-ups, sit-ups and repeated squats. An indirect graded cycle ergometer test was used to estimate maximal aerobic capacity (V(O2)max). Body composition was determined with bioelectrical impedance. Moreover, waist circumference (WC) and height were measured and body mass index (BMI) calculated. Maximal bench press was positively correlated with push-ups (r = 0.61, p strength (r = 0.34, p strength correlated positively (r = 0.36-0.44, p test scores were related to maximal aerobic capacity and body fat content, while fat free mass was associated with maximal strength test scores and thus is a major determinant for maximal strength. A contributive role of maximal strength to muscular endurance tests could be identified for the upper, but not the lower extremities. These findings suggest that push-up test is not only indicative of body fat content and maximal aerobic capacity but also maximal strength of upper body, whereas repeated squat test is mainly indicative of body fat content and maximal aerobic capacity, but not maximal strength of lower extremities.

  20. Robust joint score tests in the application of DNA methylation data analysis.

    Science.gov (United States)

    Li, Xuan; Fu, Yuejiao; Wang, Xiaogang; Qiu, Weiliang

    2018-05-18

    Recently, differential variability has been shown to be valuable in evaluating the association of DNA methylation with the risks of complex human diseases. The statistical tests based on both differential methylation level and differential variability can be more powerful than those based only on differential methylation level. Anh and Wang (2013) proposed a joint score test (AW) to simultaneously detect differential methylation and differential variability. However, AW's method seems to be quite conservative and has not been fully compared with existing joint tests. We proposed three improved joint score tests, namely iAW.Lev, iAW.BF, and iAW.TM, and have made extensive comparisons with the joint likelihood ratio test (jointLRT), the Kolmogorov-Smirnov (KS) test, and the AW test. Systematic simulation studies showed that: 1) the three improved tests performed better (i.e., having larger power, while keeping nominal Type I error rates) than the other three tests for data with outliers and having different variances between cases and controls; 2) for data from normal distributions, the three improved tests had slightly lower power than jointLRT and AW. The analyses of two Illumina HumanMethylation27 data sets GSE37020 and GSE20080 and one Illumina Infinium MethylationEPIC data set GSE107080 demonstrated that the three improved tests had higher true validation rates than those from jointLRT, KS, and AW. The three proposed joint score tests are robust against the violation of normality assumption and presence of outlying observations in comparison with the other three existing tests. Among the three proposed tests, iAW.BF seems to be the most robust and effective one for all simulated scenarios and also in real data analyses.

  1. Longitudinal Assessment of Intellectual Abilities of Children with Williams Syndrome: Multilevel Modeling of Performance on the Kaufman Brief Intelligence Test--Second Edition

    Science.gov (United States)

    Mervis, Carolyn B.; Kistler, Doris J.; John, Angela E.; Morris, Colleen A.

    2012-01-01

    Multilevel modeling was used to address the longitudinal stability of standard scores (SSs) measuring intellectual ability for children with Williams syndrome (WS). Participants were 40 children with genetically confirmed WS who completed the Kaufman Brief Intelligence Test--Second Edition (KBIT-2; A. S. Kaufman & N. L. Kaufman, 2004) 4-7…

  2. Relationships between the handball-specific complex test, non-specific field tests and the match performance score in elite professional handball players.

    Science.gov (United States)

    Hermassi, Souhail; Chelly, Mohamed-Souhaiel; Wollny, Rainer; Hoffmeyer, Birgit; Fieseler, Georg; Schulze, Stephan; Irlenbusch, Lars; Delank, Karl-Stefan; Shephard, Roy J; Bartels, Thomas; Schwesig, René

    2018-06-01

    This study assessed the validity of the handball-specific complex test (HBCT) and two non-specific field tests in professional elite handball athletes, using the match performance score (MPS) as the gold standard of performance. Thirteen elite male handball players (age: 27.4±4.8 years; premier German league) performed the HBCT, the Yo-Yo Intermittent Recovery (YYIR) test and a repeated shuttle sprint ability (RSA) test at the beginning of pre-season training. The RSA results were evaluated in terms of best time, total time, and fatigue decrement. Heart rates (HR) were assessed at selected times throughout all tests; the recovery HR was measured immediately post-test and 10 minutes later. The match performance score was based on various handball specific parameters (e.g., field goals, assists, steals, blocks, and technical mistakes) as seen during all matches of the immediately subsequent season (2015/2016). The parameters of run 1, run 2, and HR recovery at minutes 6 and 10 of the RSA test all showed a variance of more than 10% (range: 11-15%). However, the variance of scores for the YYIR test was much smaller (range: 1-7%). The resting HR (r2=0.18), HR recovery at minute 10 (r2=0.10), lactate concentration at rest (r2=0.17), recovery of heart rate from 0 to 10 minutes (r2=0.15), and velocity of second throw at first trial (r2=0.37) were the most valid HBCT parameters. Much effort is necessary to assess MPS and to develop valid tests. Speed and the rate of functional recovery seem the best predictors of competitive performance for elite handball players.

  3. The Relationships between Social Class, Listening Test Anxiety and Test Scores

    OpenAIRE

    Omid Talebi Rezaabadi

    2016-01-01

    This study investigated the relationships between the social anxiety, social class and listening-test anxiety of students learning English as a foreign language. The aims of the study were to examine the relationship between listening-test anxiety and listening-test performance. The data were collected using an adapted Foreign Language Listening Anxiety Scale and a newly developed Foreign Language Social Anxiety Scale. The potential correlation between social anxiety and listening-test perfor...

  4. Integrating GIS in the Middle School Curriculum: Impacts on Diverse Students' Standardized Test Scores

    Science.gov (United States)

    Goldstein, Donna; Alibrandi, Marsha

    2013-01-01

    This case study conducted with 1,425 middle school students in Palm Beach County, Florida, included a treatment group receiving GIS instruction (256) and a control group without GIS instruction (1,169). Quantitative analyses on standardized test scores indicated that inclusion of GIS in middle school curriculum had a significant effect on student…

  5. Intelligence Test Scores and Birth Order among Young Norwegian Men (Conscripts) Analyzed within and between Families

    Science.gov (United States)

    Bjerkedal, Tor; Kristensen, Petter; Skjeret, Geir A.; Brevik, John I.

    2007-01-01

    The present paper reports the results of a within and between family analysis of the relation between birth order and intelligence. The material comprises more than a quarter of a million test scores for intellectual performance of Norwegian male conscripts recorded during 1984-2004. Conscripts, mostly 18-19 years of age, were born to women for…

  6. International Test Score Comparisons and Educational Policy: A Review of the Critiques

    Science.gov (United States)

    Carnoy, Martin

    2015-01-01

    Stanford education professor Martin Carnoy examines four main critiques of how international test results are used in policymaking. Of particular interest are critiques of the policy analyses published by the Program for International Student Assessment (PISA). Using average PISA scores as a comparative measure of student achievement is misleading…

  7. Using Automated Essay Scores as an Anchor When Equating Constructed Response Writing Tests

    Science.gov (United States)

    Almond, Russell G.

    2014-01-01

    Assessments consisting of only a few extended constructed response items (essays) are not typically equated using anchor test designs as there are typically too few essay prompts in each form to allow for meaningful equating. This article explores the idea that output from an automated scoring program designed to measure writing fluency (a common…

  8. Changes in Student Populations and Average Test Scores of Dutch Primary Schools

    Science.gov (United States)

    Luyten, Hans; de Wolf, Inge

    2011-01-01

    This article focuses on the relation between student population characteristics and average test scores per school in the final grade of primary education from a dynamic perspective. Aggregated data of over 5,000 Dutch primary schools covering a 6-year period were used to study the relation between changes in school populations and shifts in mean…

  9. Evaluation of Two Methods for Modeling Measurement Errors When Testing Interaction Effects with Observed Composite Scores

    Science.gov (United States)

    Hsiao, Yu-Yu; Kwok, Oi-Man; Lai, Mark H. C.

    2018-01-01

    Path models with observed composites based on multiple items (e.g., mean or sum score of the items) are commonly used to test interaction effects. Under this practice, researchers generally assume that the observed composites are measured without errors. In this study, we reviewed and evaluated two alternative methods within the structural…

  10. Comprehensive School Reform and Standardized Test Scores in Illinois Elementary and Middle Schools

    Science.gov (United States)

    McEnroe, James D.

    2010-01-01

    The study examined the effects of the federally funded Comprehensive School Reform (CSR) program on student performance on mandated standardized tests. The study focused on the mathematics and reading scores of Illinois public elementary and middle and junior high school students. The federal CSR program provided Illinois schools with an annual…

  11. Test Score Gaps between Private and Government Sector Students at School Entry Age in India

    Science.gov (United States)

    Singh, Abhijeet

    2014-01-01

    Various studies have noted that students enrolled in private schools in India perform better on average than students in government schools. In this paper, I show that large gaps in the test scores of children in private and public sector education are evident even at the point of initial enrollment in formal schooling and are associated with…

  12. Classroom Organizational Structure in Fifth Grade Math Classrooms and the Effect on Standardized Test Scores

    Science.gov (United States)

    Lane, Dallas Marie

    2017-01-01

    The purpose of this study was to determine if there is a relationship between the classroom organizational structure and MCT2 test scores of fifth-grade math students. The researcher gained insight regarding which structure teachers believe is most beneficial to them and students, and whether or not their belief of classroom organizational…

  13. Using College Admission Test Scores to Clarify High School Placement. Leading Indicator Spotlight

    Science.gov (United States)

    Flug, Susanna

    2010-01-01

    In "Beyond Test Scores: Leading Indicators for Education," Foley and colleagues (2008) define leading indicators as those that "provide early signals of progress toward academic achievement" (p. 1) and stress that educators "need leading indicators to help them see the direction their efforts are going in and to take…

  14. Validating Score Interpretations and Uses: Messick Lecture, Language Testing Research Colloquium, Cambridge, April 2010

    Science.gov (United States)

    Kane, Michael

    2012-01-01

    The argument-based approach to validation involves two steps; specification of the proposed interpretations and uses of the test scores as an interpretive argument, and the evaluation of the plausibility of the proposed interpretive argument. More ambitious interpretations and uses tend to involve an extended network of inferences and assumptions…

  15. Virginia tech freshman class becoming more competitive; Rise in grades and test scores noted

    OpenAIRE

    Virginia Tech News

    2004-01-01

    Admission to Virginia Tech continues to become more competitive as applicants report higher grade point averages and test scores than previous years. The incoming class of 4,975 students has an average grade point average (GPA) of 3.68 and SAT 1203, up from 3.60 GPA and 1197 SAT in 2003.

  16. The Relationships between Social Class, Listening Test Anxiety and Test Scores

    Science.gov (United States)

    Rezaabadi, Omid Talebi

    2016-01-01

    This study investigated the relationships between the social anxiety, social class and listening-test anxiety of students learning English as a foreign language. The aims of the study were to examine the relationship between listening-test anxiety and listening-test performance. The data were collected using an adapted Foreign Language Listening…

  17. Conservatism and Cognitive Ability

    Science.gov (United States)

    Stankov, Lazar

    2009-01-01

    Conservatism and cognitive ability are negatively correlated. The evidence is based on 1254 community college students and 1600 foreign students seeking entry to United States' universities. At the individual level of analysis, conservatism scores correlate negatively with SAT, Vocabulary, and Analogy test scores. At the national level of…

  18. [An experimental proficiency test for ability to screen 104 residual pesticides in agricultural products].

    Science.gov (United States)

    Tsumura, Yukari; Ishimitsu, Susumu; Otaki, Kayo; Uchimi, Hiroyuki; Matsumoto, Nobuyuki; Daba, Masaki; Tsuchiya, Tetsu; Ukyo, Masaho; Tonogai, Yasuhide

    2003-10-01

    An experimental proficiency test program for ability to screen 104 residual pesticides in agricultural products has been conducted. Eight Japanese laboratories joined the program. Items tested in the present study were limit of detection, internal proficiency test (self spike) and external proficiency test (blind spike). All 104 pesticides were well detected and recovered from agricultural foods in the internal proficiency test. However, the results of the external proficiency test did not completely agree with those of the internal proficiency tests. After 5 rounds of the blind spike test, the ratio of the number of correctly detected pesticides to that of actually contained ones (49 total) ranged from 65% to 100% among laboratories. The numbers of mistakenly detected pesticides by a laboratory were 0 to 15. Thus, there was a great difference among the laboratories in the ability to screen multiresidual pesticides.

  19. Effects of Classroom Ventilation Rate and Temperature on Students' Test Scores.

    Directory of Open Access Journals (Sweden)

    Ulla Haverinen-Shaughnessy

    Using a multilevel approach, we estimated the effects of classroom ventilation rate and temperature on academic achievement. The analysis is based on measurement data from a 70 elementary school district (140 fifth grade classrooms) from Southwestern United States, and student level data (N = 3109) on socioeconomic variables and standardized test scores. There was a statistically significant association between ventilation rates and mathematics scores, and it was stronger when the six classrooms with high ventilation rates that were indicated as outliers were filtered (> 7.1 l/s per person). The association remained significant when prior year test scores were included in the model, resulting in less unexplained variability. Students' mean mathematics scores (average 2286 points) were increased by up to eleven points (0.5%) per each liter per second per person increase in ventilation rate within the range of 0.9-7.1 l/s per person (estimated effect size 74 points). There was an additional increase of 12-13 points per each 1°C decrease in temperature within the observed range of 20-25°C (estimated effect size 67 points). Effects of similar magnitude but higher variability were observed for reading and science scores. In conclusion, maintaining adequate ventilation and thermal comfort in classrooms could significantly improve academic achievement of students.

  20. Effects of Classroom Ventilation Rate and Temperature on Students' Test Scores.

    Science.gov (United States)

    Haverinen-Shaughnessy, Ulla; Shaughnessy, Richard J

    2015-01-01

    Using a multilevel approach, we estimated the effects of classroom ventilation rate and temperature on academic achievement. The analysis is based on measurement data from a 70 elementary school district (140 fifth grade classrooms) from Southwestern United States, and student level data (N = 3109) on socioeconomic variables and standardized test scores. There was a statistically significant association between ventilation rates and mathematics scores, and it was stronger when the six classrooms with high ventilation rates that were indicated as outliers were filtered (> 7.1 l/s per person). The association remained significant when prior year test scores were included in the model, resulting in less unexplained variability. Students' mean mathematics scores (average 2286 points) were increased by up to eleven points (0.5%) per each liter per second per person increase in ventilation rate within the range of 0.9-7.1 l/s per person (estimated effect size 74 points). There was an additional increase of 12-13 points per each 1°C decrease in temperature within the observed range of 20-25°C (estimated effect size 67 points). Effects of similar magnitude but higher variability were observed for reading and science scores. In conclusion, maintaining adequate ventilation and thermal comfort in classrooms could significantly improve academic achievement of students.

  1. Effects of Classroom Ventilation Rate and Temperature on Students’ Test Scores

    Science.gov (United States)

    2015-01-01

    Using a multilevel approach, we estimated the effects of classroom ventilation rate and temperature on academic achievement. The analysis is based on measurement data from a 70 elementary school district (140 fifth grade classrooms) from Southwestern United States, and student level data (N = 3109) on socioeconomic variables and standardized test scores. There was a statistically significant association between ventilation rates and mathematics scores, and it was stronger when the six classrooms with high ventilation rates that were indicated as outliers were filtered (> 7.1 l/s per person). The association remained significant when prior year test scores were included in the model, resulting in less unexplained variability. Students’ mean mathematics scores (average 2286 points) were increased by up to eleven points (0.5%) per each liter per second per person increase in ventilation rate within the range of 0.9–7.1 l/s per person (estimated effect size 74 points). There was an additional increase of 12–13 points per each 1°C decrease in temperature within the observed range of 20–25°C (estimated effect size 67 points). Effects of similar magnitude but higher variability were observed for reading and science scores. In conclusion, maintaining adequate ventilation and thermal comfort in classrooms could significantly improve academic achievement of students. PMID:26317643

  2. Rey's Auditory Verbal Learning Test scores can be predicted from whole brain MRI in Alzheimer's disease

    Directory of Open Access Journals (Sweden)

    Elaheh Moradi

    2017-01-01

    Rey's Auditory Verbal Learning Test (RAVLT) is a powerful neuropsychological tool for testing episodic memory, which is widely used for cognitive assessment in dementia and pre-dementia conditions. Several studies have shown that an impairment in RAVLT scores reflects well the underlying pathology caused by Alzheimer's disease (AD), thus making RAVLT an effective early marker to detect AD in persons with memory complaints. We investigated the association between RAVLT scores (RAVLT Immediate and RAVLT Percent Forgetting) and the structural brain atrophy caused by AD. The aim was to comprehensively study to what extent the RAVLT scores are predictable based on structural magnetic resonance imaging (MRI) data using machine learning approaches, as well as to find the most important brain regions for the estimation of RAVLT scores. For this, we built a predictive model to estimate RAVLT scores from gray matter density via an elastic net penalized linear regression model. The proposed approach provided highly significant cross-validated correlation between the estimated and observed RAVLT Immediate (R = 0.50) and RAVLT Percent Forgetting (R = 0.43) in a dataset consisting of 806 AD, mild cognitive impairment (MCI) or healthy subjects. In addition, the selected machine learning method provided more accurate estimates of RAVLT scores than the relevance vector regression used earlier for the estimation of RAVLT based on MRI data. The top predictors were medial temporal lobe structures and amygdala for the estimation of RAVLT Immediate, and angular gyrus, hippocampus and amygdala for the estimation of RAVLT Percent Forgetting. Further, the conversion of MCI subjects to AD within 3 years could be predicted based on either observed or estimated RAVLT scores with an accuracy comparable to MRI-based biomarkers.
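
    For readers unfamiliar with elastic net penalized regression, the following is a minimal sketch of the general technique, not the authors' pipeline. It assumes centered predictors and a centered response (no intercept is fitted) and minimizes (1/2n)||y - Xw||^2 + alpha*l1_ratio*||w||_1 + 0.5*alpha*(1 - l1_ratio)*||w||^2 by cyclic coordinate descent. The function names, the penalty parameterization, and the default values are assumptions chosen for illustration.

```python
def soft_threshold(value, threshold):
    """Shrink value toward zero by threshold; exactly zero inside the band."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def elastic_net(X, y, alpha=0.5, l1_ratio=0.5, n_iter=200, tol=1e-10):
    """Cyclic coordinate descent for an elastic net penalized linear model.

    Assumes centered columns of X and centered y; no intercept is fitted.
    """
    n, p = len(X), len(X[0])
    w = [0.0] * p
    residual = list(y)  # equals y - Xw for the current w (w starts at zero)
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        max_change = 0.0
        for j in range(p):
            # Correlation of feature j with the partial residual (feature j removed).
            rho = sum(X[i][j] * (residual[i] + X[i][j] * w[j]) for i in range(n)) / n
            # L1 part soft-thresholds rho; L2 part inflates the denominator.
            new_wj = soft_threshold(rho, alpha * l1_ratio) / (
                col_sq[j] + alpha * (1 - l1_ratio)
            )
            delta = new_wj - w[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= X[i][j] * delta
                w[j] = new_wj
            max_change = max(max_change, abs(delta))
        if max_change < tol:
            break
    return w
```

    The L1 part of the penalty sets coefficients of uninformative predictors exactly to zero, which is what makes it possible to read off a short list of "top predictor" regions from a model fitted to many voxels or regions.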

  3. Lower Quarter Y-Balance Test Scores and Lower Extremity Injury in NCAA Division I Athletes.

    Science.gov (United States)

    Lai, Wilson C; Wang, Dean; Chen, James B; Vail, Jeremy; Rugg, Caitlin M; Hame, Sharon L

    2017-08-01

    Functional movement tests that are predictive of injury risk in National Collegiate Athletic Association (NCAA) athletes are useful tools for sports medicine professionals. The Lower Quarter Y-Balance Test (YBT-LQ) measures single-leg balance and reach distances in 3 directions. To assess whether the YBT-LQ predicts the laterality and risk of sports-related lower extremity (LE) injury in NCAA athletes. Case-control study; Level of evidence, 3. The YBT-LQ was administered to 294 NCAA Division I athletes from 21 sports during preparticipation physical examinations at a single institution. Athletes were followed prospectively over the course of the corresponding season. Correlation analysis was performed between the laterality of reach asymmetry and composite scores (CS) versus the laterality of injury. Receiver operating characteristic (ROC) analysis was used to determine the optimal asymmetry cutoff score for YBT-LQ. A multivariate regression analysis adjusting for sex, sport type, body mass index, and history of prior LE surgery was performed to assess predictors of earlier and higher rates of injury. Neither the laterality of reach asymmetry nor the CS correlated with the laterality of injury. ROC analysis found optimal cutoff scores of 2, 9, and 3 cm for anterior, posteromedial, and posterolateral reach, respectively. All of these potential cutoff scores, along with a cutoff score of 4 cm used in the majority of prior studies, were associated with poor sensitivity and specificity. Furthermore, none of the asymmetric cutoff scores were associated with earlier or increased rate of injury in the multivariate analyses. YBT-LQ scores alone do not predict LE injury in this collegiate athlete population. Sports medicine professionals should be cautioned against using the YBT-LQ alone to screen for injury risk in collegiate athletes.
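
    The abstract does not say which criterion was used to pick the "optimal" cutoff in the ROC analysis. One common choice is Youden's J (sensitivity + specificity - 1), and the sketch below assumes it purely for illustration. The function name and the convention that an asymmetry at or above the cutoff counts as a positive screen are hypothetical.

```python
def youden_cutoff(scores, injured):
    """Return (cutoff, sensitivity, specificity) maximizing Youden's J.

    scores  : reach asymmetry values (e.g., in cm), one per athlete
    injured : truthy if the athlete was injured, falsy otherwise
    A score >= cutoff is treated as a positive screen.
    """
    positives = sum(1 for y in injured if y)
    negatives = len(injured) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("need at least one injured and one uninjured athlete")
    best = None
    for cutoff in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, injured) if y and s >= cutoff)
        tn = sum(1 for s, y in zip(scores, injured) if not y and s < cutoff)
        sens, spec = tp / positives, tn / negatives
        j = sens + spec - 1
        # Keep the first cutoff that achieves the largest J.
        if best is None or j > best[0]:
            best = (j, cutoff, sens, spec)
    return best[1], best[2], best[3]
```

    A cutoff selected this way is only the best available threshold for the sample at hand. As the abstract notes, it can still have poor sensitivity and specificity when the score does not separate injured from uninjured athletes.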

  4. Effects of correcting for prematurity on cognitive test scores in childhood.

    Science.gov (United States)

    Wilson-Ching, Michelle; Pascoe, Leona; Doyle, Lex W; Anderson, Peter J

    2014-03-01

    The American Academy of Pediatrics recommends that test scores should be corrected for prematurity up to 3 years of age, but this practice varies greatly in both clinical and research settings. The aim of this study was to contrast the effects of using chronological age and those of using corrected age on measures of cognitive outcome across childhood. A theoretical model was constructed using norms from the Bayley Scales of Infant and Toddler Development, Third Edition; the Wechsler Preschool and Primary Scale of Intelligence, Third Edition Australian; and the Wechsler Intelligence Scales for Children, Fourth Edition Australian. Baseline scores representing different levels of functioning (70, below average; 85, borderline; and 100, average) were recalculated using the normative data for ages 6 months to 16 years to account for 1, 2, 3 and 4 months of prematurity. The model created depicted the difference in standardised scores between chronological and corrected age. Compared with scores corrected for prematurity, the absolute reduction in scores using chronological age was greater for increasing degree of prematurity, younger ages at assessment and higher baseline scores and was substantial even beyond 3 years of age. However, the pattern was erratic, with considerable fluctuation evident across different ages and baseline scores. Chronological age results in a lowering of scores at all ages for preterm-born subjects that is greater in the first few years and in those born at earlier gestational ages. Whether or not to correct for prematurity depends upon the context of the assessment. © 2014 The Authors. Journal of Paediatrics and Child Health © 2014 Paediatrics and Child Health Division (Royal Australasian College of Physicians).

  5. The challenge of cross-cultural assessment--The Test of Ability To Explain for Zulu-speaking Children.

    Science.gov (United States)

    Solarsh, Barbara; Alant, Erna

    2006-01-01

    A culturally appropriate test, The Test of Ability To Explain for Zulu-speaking Children (TATE-ZC), was developed to measure verbal problem solving skills of rural, Zulu-speaking, primary school children. Principles of 'non-biased' assessment, as well as emic (culture specific) and etic (universal) aspects of intelligence formed the theoretical backdrop. In addition, specific principles relating to test translation; test content; culturally appropriate stimulus material; scoring procedures and test administration were applied. Five categories of abstract thinking skills formed the basis of the TATE-ZC. These were: (a) Explaining Inferences, (b) Determining Cause, (c) Negative Why Questions, (d) Determining Solutions and (e) Avoiding Problem. The process of test development underwent three pilot studies. Results indicate that the TATE-ZC is a reliable and valid test for the target population. A critical analysis of the efficacy of creating a test of verbal reasoning for children from the developing world concludes the article. As a result of this activity (1) the participant will have a clearer understanding of the principles that need to be followed when developing culturally appropriate test material; (2) the participant will understand the process of developing culturally appropriate test material for non-mainstream cultures; (3) the participant will be able to apply the process and principles to other cross-cultural testing situations.

  6. Identifying genetic marker sets associated with phenotypes via an efficient adaptive score test

    KAUST Repository

    Cai, T.

    2012-06-25

    In recent years, genome-wide association studies (GWAS) and gene-expression profiling have generated a large number of valuable datasets for assessing how genetic variations are related to disease outcomes. With such datasets, it is often of interest to assess the overall effect of a set of genetic markers, assembled based on biological knowledge. Genetic marker-set analyses have been advocated as more reliable and powerful approaches compared with the traditional marginal approaches (Curtis and others, 2005. Pathways to the analysis of microarray data. TRENDS in Biotechnology 23, 429-435; Efroni and others, 2007. Identification of key processes underlying cancer phenotypes using biologic pathway analysis. PLoS One 2, 425). Procedures for testing the overall effect of a marker-set have been actively studied in recent years. For example, score tests derived under an Empirical Bayes (EB) framework (Liu and others, 2007. Semiparametric regression of multidimensional genetic pathway data: least-squares kernel machines and linear mixed models. Biometrics 63, 1079-1088; Liu and others, 2008. Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models. BMC bioinformatics 9, 292-2; Wu and others, 2010. Powerful SNP-set analysis for case-control genome-wide association studies. American Journal of Human Genetics 86, 929) have been proposed as powerful alternatives to the standard Rao score test (Rao, 1948. Large sample tests of statistical hypotheses concerning several parameters with applications to problems of estimation. Mathematical Proceedings of the Cambridge Philosophical Society, 44, 50-57). The advantages of these EB-based tests are most apparent when the markers are correlated, due to the reduction in the degrees of freedom. In this paper, we propose an adaptive score test which up- or down-weights the contributions from each member of the marker-set based on the Z-scores of

  7. Higher-Order Asymptotics and Its Application to Testing the Equality of the Examinee Ability Over Two Sets of Items.

    Science.gov (United States)

    Sinharay, Sandip; Jensen, Jens Ledet

    2018-06-27

    In educational and psychological measurement, researchers and/or practitioners are often interested in examining whether the ability of an examinee is the same over two sets of items. Such problems can arise in measurement of change, detection of cheating on unproctored tests, erasure analysis, detection of item preknowledge, etc. Traditional frequentist approaches that are used in such problems include the Wald test, the likelihood ratio test, and the score test (e.g., Fischer, Appl Psychol Meas 27:3-26, 2003; Finkelman, Weiss, & Kim-Kang, Appl Psychol Meas 34:238-254, 2010; Glas & Dagohoy, Psychometrika 72:159-180, 2007; Guo & Drasgow, Int J Sel Assess 18:351-364, 2010; Klauer & Rettig, Br J Math Stat Psychol 43:193-206, 1990; Sinharay, J Educ Behav Stat 42:46-68, 2017). This paper shows that approaches based on higher-order asymptotics (e.g., Barndorff-Nielsen & Cox, Inference and asymptotics. Springer, London, 1994; Ghosh, Higher order asymptotics. Institute of Mathematical Statistics, Hayward, 1994) can also be used to test for the equality of the examinee ability over two sets of items. The modified signed likelihood ratio test (e.g., Barndorff-Nielsen, Biometrika 73:307-322, 1986) and the Lugannani-Rice approximation (Lugannani & Rice, Adv Appl Prob 12:475-490, 1980), both of which are based on higher-order asymptotics, are shown to provide some improvement over the traditional frequentist approaches in three simulations. Two real data examples are also provided.

  8. Just as smart but not as successful: obese students obtain lower school grades but equivalent test scores to nonobese students.

    Science.gov (United States)

    MacCann, C; Roberts, R D

    2013-01-01

    The obesity epidemic in industrialized nations has important implications for education, as research demonstrates lower academic achievement among obese students. The current paper compares the test scores and school grades of obese, overweight and normal-weight students in secondary and further education, controlling for demographic variables, personality, ability and well-being confounds. This study included 383 eighth-grade students (49% female; study 1) and 1036 students from 24 community colleges and universities (64% female, study 2), both drawn from five regions across the United States. In study 1, body mass index (BMI) was calculated using self-reports and parent reports of weight and height. In study 2, BMI was calculated from self-reported weight and height only. Both samples completed age-appropriate assessments of mathematics, vocabulary and the personality trait conscientiousness. Eighth-grade students additionally completed a measure of life satisfaction, with both self-reports and parent reports of their grades from the previous semester also obtained. Higher education students additionally completed measures of positive and negative affect, and self-reported their grades and college entrance scores. Obese students receive significantly lower grades in middle school (d=0.83), community college (d=0.34) and university (d=0.36), but show no statistically significant differences in intelligence or achievement test scores. Even after controlling for demographic variables, intelligence, personality and well-being, obese students obtain significantly lower grades than normal-weight students in the eighth grade (d=0.39), community college (d=0.42) and university (d=0.31). Lower grades may reflect peer and teacher prejudice against overweight and obese students rather than lack of ability among these students.

  9. Effect on intelligence test score of prenatal exposure to ionizing radiation in Hiroshima and Nagasaki

    International Nuclear Information System (INIS)

    Schull, W.J.; Otake, Masanori; Yoshimaru, Hiroshi.

    1988-10-01

    Analyses of intelligence test scores (Koga) at 10-11 years of age of individuals exposed prenatally to the atomic bombing of Hiroshima and Nagasaki, using estimates of the uterine absorbed dose based on the recently introduced system of dosimetry, the Dosimetry System 1986 (DS86), reveal the following: 1) there is no evidence of a radiation-related effect on intelligence among those individuals exposed within 0-7 weeks after fertilization or in the 26th or subsequent weeks; 2) for individuals exposed at 8-15 weeks after fertilization, and to a lesser extent those exposed at 16-25 weeks, the mean test scores but not the variances are significantly heterogeneous among exposure categories; 3) the cumulative distribution of test scores suggests a progressive shift downwards in individual scores with increasing exposure; and 4) within the group most sensitive to the occurrence of clinically recognizable severe mental retardation, individuals exposed 8 through 15 weeks after fertilization, the regression of intelligence score on estimated DS86 uterine absorbed dose is more linear than with T65DR fetal dose; the diminution in intelligence score under the linear model is 21-29 points at 1 Gy. The effect is somewhat greater when the controls receiving less than 0.01 Gy are excluded, 24-33 points at 1 Gy. These findings are discussed in the light of the earlier analysis of the frequency of occurrence of mental retardation among the prenatally exposed survivors of the A-bombing of Hiroshima and Nagasaki. It is suggested that both are the consequences of the same underlying biological process or processes. (author)

  10. A Study on Variables that Affect Class Scores of Primary Education Students in Placement Test

    OpenAIRE

    Yavuz, Mustafa

    2010-01-01

    This study aims to determine the variables that predict class scores, which are obtained by adding 70% of the Placement Test (PT) scores of the primary education sixth and seventh grade students who took it for the first time in the 2007-2008 academic year, within the framework of the system of passing to secondary education reorganized by the MNE, and 25% of their end-of-the-year passing grades. The study follows the general survey model. The study group consists of students who took the PT in the 200...

  11. ACER Mathematics Profile Series: Number Test. (Test Booklet, Answer and Record Sheet, Score Key, and Teachers Handbook).

    Science.gov (United States)

    Cornish, Greg; Wines, Robin

    The Number Test of the ACER Mathematics Profile Series contains 30 items for each of three suggested grade levels: 7-8, 8-9, and 9-10. Raw scores on all tests in the ACER Mathematics Profile Series (Number, Operations, Space and Measurement) are converted to a common scale called MAPS, a major feature of the Series. Based on the Rasch Model,…

  12. The Impact of Time-Series Diagnostic Tests on the Writing Ability of Iranian EFL Learners

    Science.gov (United States)

    Atashgahi, Bahareh Molazem

    2014-01-01

    This study aimed to show whether administering a battery of time-series diagnostic tests (screening) has any impact on Iranian EFL learners' writing ability. The study was conducted on the intermediate EFL learners at Islamic Azad University North Tehran branch. The researcher administered a homogenizing test in order to exclude the exceptional…

  13. Association of Health Sciences Reasoning Test scores with academic and experiential performance.

    Science.gov (United States)

    Cox, Wendy C; McLaughlin, Jacqueline E

    2014-05-15

    To assess the association of scores on the Health Sciences Reasoning Test (HSRT) with academic and experiential performance in a doctor of pharmacy (PharmD) curriculum. The HSRT was administered to 329 first-year (P1) PharmD students. Performance on the HSRT and its subscales was compared with academic performance in 29 courses throughout the curriculum and with performance in advanced pharmacy practice experiences (APPEs). Significant positive correlations were found between course grades in 8 courses and HSRT overall scores. All significant correlations were accounted for by pharmaceutical care laboratory courses, therapeutics courses, and a law and ethics course. There was a lack of moderate to strong correlation between HSRT scores and academic and experiential performance. The usefulness of the HSRT as a tool for predicting student success may be limited.

  14. Effects of Public Preschool Expenditures on the Test Scores of 4th Graders: Evidence from TIMSS

    Science.gov (United States)

    Waldfogel, Jane; Zhai, Fuhua

    2011-01-01

    This study examines the effects of public preschool expenditures on the math and science scores of 4th graders, holding constant child, family, and school characteristics, other relevant social expenditures, and country and year effects, in seven Organization for Economic Co-operation and Development (OECD) countries -- Australia, Japan, Netherlands, New Zealand, Norway, U.K., and U.S. -- using data from the 1995 and 2003 Trends in International Mathematics and Science Study (TIMSS). Our results indicate that there are small but significant positive effects of public preschool expenditures on the math and science scores of 4th graders and that preschool expenditures reduce the risk of children scoring at the low level of proficiency. We also find some evidence that children from low-resource homes and homes where the test language is not always spoken may tend to gain more from increased public preschool expenditures than other children. PMID:21442008

  15. Effects of Public Preschool Expenditures on the Test Scores of 4th Graders: Evidence from TIMSS.

    Science.gov (United States)

    Waldfogel, Jane; Zhai, Fuhua

    2008-02-01

    This study examines the effects of public preschool expenditures on the math and science scores of 4th graders, holding constant child, family, and school characteristics, other relevant social expenditures, and country and year effects, in seven Organization for Economic Co-operation and Development (OECD) countries -- Australia, Japan, Netherlands, New Zealand, Norway, U.K., and U.S. -- using data from the 1995 and 2003 Trends in International Mathematics and Science Study (TIMSS). Our results indicate that there are small but significant positive effects of public preschool expenditures on the math and science scores of 4th graders and that preschool expenditures reduce the risk of children scoring at the low level of proficiency. We also find some evidence that children from low-resource homes and homes where the test language is not always spoken may tend to gain more from increased public preschool expenditures than other children.

  16. Testing statistical significance scores of sequence comparison methods with structure similarity

    Directory of Open Access Journals (Sweden)

    Leunissen Jack AM

    2006-10-01

    Full Text Available Background: In the past years the Smith-Waterman sequence comparison algorithm has gained popularity due to improved implementations and rapidly increasing computing power. However, the quality and sensitivity of a database search are determined not only by the algorithm but also by the statistical significance testing for an alignment. The e-value is the most commonly used statistical validation method for sequence database searching. The CluSTr database and the Protein World database have been created using an alternative statistical significance test: a Z-score based on Monte-Carlo statistics. Several papers have described the superiority of the Z-score as compared to the e-value, using simulated data. We were interested in whether this could be validated when applied to existing, evolutionarily related protein sequences. Results: All experiments are performed on the ASTRAL SCOP database. The Smith-Waterman sequence comparison algorithm with both e-value and Z-score statistics is evaluated, using ROC, CVE and AP measures. The BLAST and FASTA algorithms are used as reference. We find that two out of three Smith-Waterman implementations with e-value are better at predicting structural similarities between proteins than the Smith-Waterman implementation with Z-score. SSEARCH especially has very high scores. Conclusion: The compute-intensive Z-score does not have a clear advantage over the e-value. The Smith-Waterman implementations give generally better results than their heuristic counterparts. We recommend using the SSEARCH algorithm combined with e-values for pairwise sequence comparisons.
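
    The idea of a Z-score based on Monte-Carlo statistics can be illustrated with a small sketch. This is not the implementation behind CluSTr or Protein World: the toy match/mismatch/gap values, the function names, and the choice to shuffle only the subject sequence are assumptions made for illustration. The observed Smith-Waterman score is compared with the scores obtained after repeatedly shuffling one sequence, and the Z-score is the distance from the shuffled mean in units of the shuffled standard deviation.

```python
import random
import statistics


def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score with a linear gap penalty (toy scoring values)."""
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, hence the floor at 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


def monte_carlo_z_score(query, subject, n_shuffles=200, seed=0):
    """Z = (observed - mean of shuffled scores) / stdev of shuffled scores."""
    rng = random.Random(seed)
    observed = smith_waterman_score(query, subject)
    residues = list(subject)
    null_scores = []
    for _ in range(n_shuffles):
        rng.shuffle(residues)  # keeps composition, destroys residue order
        null_scores.append(smith_waterman_score(query, "".join(residues)))
    spread = statistics.stdev(null_scores)
    if spread == 0:
        raise ValueError("shuffled scores have zero variance; Z-score undefined")
    return (observed - statistics.mean(null_scores)) / spread
```

    Every shuffle requires a full realignment, which is why the abstract calls the Z-score compute-intensive compared with an e-value derived from a fitted score distribution.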

  17. Validity and Relative Ability of 4 Balance Tests to Identify Fall Status of Older Adults With Type 2 Diabetes.

    Science.gov (United States)

    Marques, Alda; Silva, Alexandre; Oliveira, Ana; Cruz, Joana; Machado, Ana; Jácome, Cristina

    The Berg Balance Scale (BBS), the Balance Evaluation Systems Test (BESTest), the Mini-BESTest, and the Brief-BESTest are useful tests to assess balance; however, their clinimetric properties have not been studied well in older adults with type 2 diabetes (T2D). This study compared the validity and relative ability of the BBS, BESTest, Mini-BESTest, and Brief-BESTest to identify fall status in older adults with T2D. This study involved a cross-sectional design. Sixty-six older adults with T2D (75 ± 7.6 years) were included and asked to report the number of falls during the previous 12 months and to complete the Activities-specific Balance Confidence scale. The BBS and the BESTest were administered, and the Mini-BESTest and Brief-BESTest scores were computed based on the BESTest performance. Receiver operating characteristics were used to assess the ability of each balance test to differentiate between participants with and without a history of falls. The 4 balance tests were able to identify fall status (areas under the curve = 0.74-0.76), with similar sensitivity (60%-67%) and specificity (71%-76%). The 4 balance tests were able to differentiate between older adults with T2D with and without a history of falls. As the BBS and the BESTest require longer application time, the Brief-BESTest may be an appropriate choice to use in clinical practice to detect fall risk.
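
    The receiver operating characteristic analysis described above can be sketched in a few lines. The function names are hypothetical, no data from the study are used, and the sketch assumes that lower balance scores indicate a history of falls, so a participant is flagged when the score is at or below a cutoff. The area under the curve is computed as the proportion of faller/non-faller pairs in which the faller has the lower score, with ties counted as one half.

```python
def auc_lower_is_positive(faller_scores, non_faller_scores):
    """AUC as the share of (faller, non-faller) pairs ranked correctly.

    Assumes lower balance scores point to fall risk; ties count 0.5.
    """
    wins = 0.0
    for f in faller_scores:
        for n in non_faller_scores:
            if f < n:
                wins += 1.0
            elif f == n:
                wins += 0.5
    return wins / (len(faller_scores) * len(non_faller_scores))


def sensitivity_specificity(faller_scores, non_faller_scores, cutoff):
    """Flag a participant as at risk when the score is <= cutoff."""
    flagged_fallers = sum(1 for s in faller_scores if s <= cutoff)
    cleared_non_fallers = sum(1 for s in non_faller_scores if s > cutoff)
    sensitivity = flagged_fallers / len(faller_scores)
    specificity = cleared_non_fallers / len(non_faller_scores)
    return sensitivity, specificity
```

    Sweeping the cutoff over all observed scores and plotting sensitivity against 1 - specificity traces the ROC curve; the pairwise count above gives the same area without drawing it.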

  18. Do candidate reactions relate to job performance or affect criterion-related validity? A multistudy investigation of relations among reactions, selection test scores, and job performance.

    Science.gov (United States)

    McCarthy, Julie M; Van Iddekinge, Chad H; Lievens, Filip; Kung, Mei-Chuan; Sinar, Evan F; Campion, Michael A

    2013-09-01

    Considerable evidence suggests that how candidates react to selection procedures can affect their test performance and their attitudes toward the hiring organization (e.g., recommending the firm to others). However, very few studies of candidate reactions have examined one of the outcomes organizations care most about: job performance. We attempt to address this gap by developing and testing a conceptual framework that delineates whether and how candidate reactions might influence job performance. We accomplish this objective using data from 4 studies (total N = 6,480), 6 selection procedures (personality tests, job knowledge tests, cognitive ability tests, work samples, situational judgment tests, and a selection inventory), 5 key candidate reactions (anxiety, motivation, belief in tests, self-efficacy, and procedural justice), 2 contexts (industry and education), 3 continents (North America, South America, and Europe), 2 study designs (predictive and concurrent), and 4 occupational areas (medical, sales, customer service, and technological). Consistent with previous research, candidate reactions were related to test scores, and test scores were related to job performance. Further, there was some evidence that reactions affected performance indirectly through their influence on test scores. Finally, in no cases did candidate reactions affect the prediction of job performance by increasing or decreasing the criterion-related validity of test scores. Implications of these findings and avenues for future research are discussed. PsycINFO Database Record (c) 2013 APA, all rights reserved

  19. How Do Executive Functions Fit with the Cattell-Horn-Carroll Model? Some Evidence from a Joint Factor Analysis of the Delis-Kaplan Executive Function System and the Woodcock-Johnson III Tests of Cognitive Abilities

    Science.gov (United States)

    Floyd, Randy G.; Bergeron, Renee; Hamilton, Gloria; Parra, Gilbert R.

    2010-01-01

    This study investigated the relations among executive functions and cognitive abilities through a joint exploratory factor analysis and joint confirmatory factor analysis of 25 test scores from the Delis-Kaplan Executive Function System and the Woodcock-Johnson III Tests of Cognitive Abilities. Participants were 100 children and adolescents…

  20. Effects of age, gender, education and race on two tests of language ability in community-based older adults.

    Science.gov (United States)

    Snitz, Beth E; Unverzagt, Frederick W; Chang, Chung-Chou H; Bilt, Joni Vander; Gao, Sujuan; Saxton, Judith; Hall, Kathleen S; Ganguli, Mary

    2009-12-01

    Neuropsychological tests, including tests of language ability, are frequently used to differentiate normal from pathological cognitive aging. However, language can be particularly difficult to assess in a standardized manner in cross-cultural studies and in patients from different educational and cultural backgrounds. This study examined the effects of age, gender, education and race on performance of two language tests: the animal fluency task (AFT) and the Indiana University Token Test (IUTT). We report population-based normative data on these tests from two combined ethnically divergent, cognitively normal, representative population samples of older adults. Participants aged ≥65 years from the Monongahela-Youghiogheny Healthy Aging Team (MYHAT) and from the Indianapolis Study of Health and Aging (ISHA) were selected based on (1) a Clinical Dementia Rating (CDR) score of 0; (2) non-missing baseline language test data; and (3) race self-reported as African-American or white. The combined sample (n = 1885) was 28.1% African-American. Multivariate ordinal logistic regression was used to model the effects of demographic characteristics on test scores. On both language tests, better performance was significantly associated with higher education, younger age, and white race. On the IUTT, better performance was also associated with female gender. We found no significant interactions between age and sex, and between race and education. Age and education are more potent variables than are race and gender influencing performance on these language tests. Demographically stratified normative tables for these measures can be used to guide test interpretation and aid clinical diagnosis of impaired cognition.

  1. Student Test Scores: How the Sausage Is Made and Why You Should Care. Evidence Speaks Reports, Vol 1, #25

    Science.gov (United States)

    Jacob, Brian A.

    2016-01-01

    Contrary to popular belief, modern cognitive assessments--including the new Common Core tests--produce test scores based on sophisticated statistical models rather than the simple percent of items a student answers correctly. While there are good reasons for this, it means that reported test scores depend on many decisions made by test designers,…

  2. Classifying and scoring of molecules with the NGN: new datasets, significance tests, and generalization

    Directory of Open Access Journals (Sweden)

    Cameron Christopher JF

    2010-10-01

    Full Text Available This paper demonstrates how a Neural Grammar Network learns to classify and score molecules for a variety of tasks in chemistry and toxicology. In addition to a more detailed analysis on datasets previously studied, we introduce three new datasets (BBB, FXa, and toxicology) to show the generality of the approach. A new experimental methodology is developed and applied to both the new datasets as well as previously studied datasets. This methodology is rigorous and statistically grounded, and ultimately culminates in a Wilcoxon significance test that proves the effectiveness of the system. We further include a complete generalization of the specific technique to arbitrary grammars and datasets using a mathematical abstraction that allows researchers in different domains to apply the method to their own work. Background: Our work can be viewed as an alternative to existing methods to solve the quantitative structure-activity relationship (QSAR) problem. To this end, we review a number of approaches both from a methodological and also a performance perspective. In addition to these approaches, we also examined a number of chemical properties that can be used by generic classifier systems, such as feed-forward artificial neural networks. In studying these approaches, we identified a set of interesting benchmark problem sets to which many of the above approaches had been applied. These included: ACE, AChE, AR, BBB, BZR, Cox2, DHFR, ER, FXa, GPB, Therm, and Thr. Finally, we developed our own benchmark set by collecting data on toxicology. Results: Our results show that our system performs better than, or comparably to, the existing methods over a broad range of problem types. Our method does not require the expert knowledge that is necessary to apply the other methods to novel problems. Conclusions: We conclude that our success is due to the ability of our system to: 1) encode molecules losslessly before presentation to the learning system, and 2

  3. Pediatric residents' learning styles and temperaments and their relationships to standardized test scores.

    Science.gov (United States)

    Tuli, Sanjeev Y; Thompson, Lindsay A; Saliba, Heidi; Black, Erik W; Ryan, Kathleen A; Kelly, Maria N; Novak, Maureen; Mellott, Jane; Tuli, Sonal S

    2011-12-01

    Board certification is an important professional qualification and a prerequisite for credentialing, and the Accreditation Council for Graduate Medical Education (ACGME) assesses board certification rates as a component of residency program effectiveness. To date, research has shown that preresidency measures, including National Board of Medical Examiners scores, Alpha Omega Alpha Honor Medical Society membership, or medical school grades poorly predict postresidency board examination scores. However, learning styles and temperament have been identified as factors that affect test-taking performance. The purpose of this study is to characterize the learning styles and temperaments of pediatric residents and to evaluate their relationships to yearly in-service and postresidency board examination scores. This cross-sectional study analyzed the learning styles and temperaments of current and past pediatric residents by administration of 3 validated tools: the Kolb Learning Style Inventory, the Keirsey Temperament Sorter, and the Felder-Silverman Learning Style test. These results were compared with known, normative, general and medical population data and evaluated for correlation to in-service examination and postresidency board examination scores. The predominant learning style for pediatric residents was converging 44% (33 of 75 residents) and the predominant temperament was guardian 61% (34 of 56 residents). The learning style and temperament distribution of the residents was significantly different from published population data (P = .002 and .04, respectively). Learning styles, with one exception, were found to be unrelated to standardized test scores. The predominant learning style and temperament of pediatric residents is significantly different than that of the populations of general and medical trainees. However, learning styles and temperament do not predict outcomes on standardized in-service and board examinations in pediatric residents.

  4. [Relationship between unipedal stance test score and center of pressure velocity in elderly].

    Science.gov (United States)

    Rodrigo Antonio, Guzmán; Rony, Silvestre; Francisco Aniceto, Rodríguez; David Andrés, Arriagada; Pablo Andrés, Ortega

    2011-01-01

    Frequent falls are one of the most important health problems in the elderly population. The unipedal stance test (UPST) assesses postural stability and is used in fall risk measures. Despite this, there is little information about its relationship with the posturographic parameters (PP) that characterize postural stability. Center of pressure velocity (CoPV) is one of the best PP for describing postural stability. The aim of this study was to analyze the relation between UPST score and CoPV in an elderly population. A sample of 38 healthy elderly subjects was divided into two groups according to their UPST score, low performance (LP, n=11) and high performance (HP, n=27). The correlation between UPST score and COP mean velocity (CoPmV), recorded from a posturographic test, was analyzed in both groups. An inverse correlation between UPST score and CoPmV was found in both groups. However, this was higher in the LP group (r=-0.69, P=.02) compared to the HP (r=-0.39, P=.04). Based on the results of this investigation, it may be concluded that the achievement on UPST has an inverse relationship with CoPmV, especially in subjects with low performance in the UPST. Copyright © 2010 SEGG. Published by Elsevier España. All rights reserved.

  5. The effect of instructional methodology on high school students natural sciences standardized tests scores

    Science.gov (United States)

    Powell, P. E.

    Educators have recently come to consider inquiry-based instruction as a more effective method of instruction than didactic instruction. Experience-based learning theory suggests that student performance is linked to teaching method. However, research is limited on inquiry teaching and its effectiveness in preparing students to perform well on standardized tests. The purpose of the study was to investigate whether one of these two teaching methodologies was more effective in increasing student performance on standardized science tests. The quasi-experimental quantitative study comprised two stages. Stage 1 used a survey to identify teaching methods of a convenience sample of 57 teacher participants and determined level of inquiry used in instruction to place participants into instructional groups (the independent variable). Stage 2 used analysis of covariance (ANCOVA) to compare posttest scores on a standardized exam by teaching method. Additional analyses were conducted to examine the differences in science achievement by ethnicity, gender, and socioeconomic status by teaching methodology. Results demonstrated a statistically significant gain in test scores when taught using inquiry-based instruction. Subpopulation analyses indicated all groups showed improved mean standardized test scores except African American students. The findings benefit teachers and students by presenting data supporting a method of content delivery that increases teacher efficacy and produces students with a greater cognition of science content that meets the school's mission and goals.

  6. Effect of Mindfulness Meditation on Perceived Stress Scores and Autonomic Function Tests of Pregnant Indian Women.

    Science.gov (United States)

    Muthukrishnan, Shobitha; Jain, Reena; Kohli, Sangeeta; Batra, Swaraj

    2016-04-01

    Various pregnancy complications like hypertension, preeclampsia have been strongly correlated with maternal stress. One of the connecting links between pregnancy complications and maternal stress is mind-body intervention which can be part of Complementary and Alternative Medicine (CAM). Biologic measures of stress during pregnancy may get reduced by such interventions. To evaluate the effect of Mindfulness meditation on perceived stress scores and autonomic function tests of pregnant Indian women. Pregnant Indian women of 12 weeks gestation were randomised to two treatment groups: Test group with Mindfulness meditation and control group with their usual obstetric care. The effect of Mindfulness meditation on perceived stress scores and cardiac sympathetic functions and parasympathetic functions (Heart rate variation with respiration, lying to standing ratio, standing to lying ratio and respiratory rate) were evaluated on pregnant Indian women. There was a significant decrease in perceived stress scores, a significant decrease of blood pressure response to cold pressor test and a significant increase in heart rate variability in the test group (pwomen. The results of this study suggest that mindfulness meditation improves parasympathetic functions in pregnant women and is a powerful modulator of the sympathetic nervous system during pregnancy.

  7. A knowledge-based theory of rising scores on "culture-free" tests.

    Science.gov (United States)

    Fox, Mark C; Mitchum, Ainsley L

    2013-08-01

    Secular gains in intelligence test scores have perplexed researchers since they were documented by Flynn (1984, 1987). Gains are most pronounced on abstract, so-called culture-free tests, prompting Flynn (2007) to attribute them to problem-solving skills availed by scientifically advanced cultures. We propose that recent-born individuals have adopted an approach to analogy that enables them to infer higher level relations requiring roles that are not intrinsic to the objects that constitute initial representations of items. This proposal is translated into item-specific predictions about differences between cohorts in pass rates and item-response patterns on the Raven's Matrices (Flynn, 1987), a seemingly culture-free test that registers the largest Flynn effect. Consistent with predictions, archival data reveal that individuals born around 1940 are less able to map objects at higher levels of relational abstraction than individuals born around 1990. Polytomous Rasch models verify predicted violations of measurement invariance, as raw scores are found to underestimate the number of analogical rules inferred by members of the earlier cohort relative to members of the later cohort who achieve the same overall score. The work provides a plausible cognitive account of the Flynn effect, furthers understanding of the cognition of matrix reasoning, and underscores the need to consider how test-takers select item responses. PsycINFO Database Record (c) 2013 APA, all rights reserved.

  8. Evaluating the Effects of Differences in Group Abilities on the Tucker and the Levine Observed-Score Methods for Common-Item Nonequivalent Groups Equating. ACT Research Report Series 2010-1

    Science.gov (United States)

    Chen, Hanwei; Cui, Zhongmin; Zhu, Rongchun; Gao, Xiaohong

    2010-01-01

    The most critical feature of a common-item nonequivalent groups equating design is that the average score difference between the new and old groups can be accurately decomposed into a group ability difference and a form difficulty difference. Two widely used observed-score linear equating methods, the Tucker and the Levine observed-score methods,…

  9. The Analysis of a Teacher Test Preparation Tutorial to Learner Test Scores: An Action Research Study

    Science.gov (United States)

    Mild, Toni L. Hittle

    2014-01-01

    Many Pennsylvania colleges and universities require that teacher candidates pass a standardized assessment in order to gain formal entry into their education programs. Standardized tests are also required for Level I teacher certification within Pennsylvania. The initial assessment required of all Pennsylvania preservice teachers for…

  10. Learning Anatomy Enhances Spatial Ability

    Science.gov (United States)

    Vorstenbosch, Marc A. T. M.; Klaassen, Tim P. F. M.; Donders, A. R. T.; Kooloos, Jan G. M.; Bolhuis, Sanneke M.; Laan, Roland F. J. M.

    2013-01-01

    Spatial ability is an important factor in learning anatomy. Students with high scores on a mental rotation test (MRT) systematically score higher on anatomy examinations. This study aims to investigate if learning anatomy also conversely improves the MRT score. Five hundred first year students of medicine (n = 242, intervention) and…

  11. WEB-BASED ADAPTIVE TESTING SYSTEM (WATS) FOR CLASSIFYING STUDENTS ACADEMIC ABILITY

    Directory of Open Access Journals (Sweden)

    Jaemu LEE

    2012-08-01

    Full Text Available Computer Adaptive Testing (CAT) has been highlighted as a promising assessment method to fulfill two testing purposes: estimating student academic ability and classifying student academic level. In this paper, we introduce the Web-based Adaptive Testing System (WATS), developed to support a cost-effective assessment for classifying students’ ability into different academic levels. Instead of using a traditional paper and pencil test, the WATS is expected to serve as an alternate method to promptly diagnose and identify underachieving students through Web-based testing. The WATS can also help provide students with appropriate learning contents and necessary academic support in time. The theoretical background and structure of WATS, the item construction process based upon item response theory, and the user interfaces of WATS are discussed.
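
    The abstract does not say which item response model or selection rule WATS uses, so the following is only a generic sketch of how an adaptive test built on item response theory can work. It assumes the one-parameter Rasch model, maximum-information item selection, and a clamped Newton-Raphson ability update; the cut points, labels, and all function names are hypothetical.

```python
import bisect
import math


def rasch_probability(theta, difficulty):
    """P(correct) under the Rasch model for ability theta."""
    return 1.0 / (1.0 + math.exp(-(theta - difficulty)))


def item_information(theta, difficulty):
    """Fisher information of a Rasch item; largest when difficulty == theta."""
    p = rasch_probability(theta, difficulty)
    return p * (1.0 - p)


def pick_next_item(theta, difficulties, used):
    """Index of the unused item that is most informative at the current theta."""
    candidates = [i for i in range(len(difficulties)) if i not in used]
    return max(candidates, key=lambda i: item_information(theta, difficulties[i]))


def update_ability(theta, responses, step_limit=1.0):
    """One clamped Newton-Raphson step toward the maximum-likelihood ability.

    responses is a list of (difficulty, correct) pairs with correct in {0, 1}.
    """
    gradient = sum(c - rasch_probability(theta, b) for b, c in responses)
    information = sum(item_information(theta, b) for b, _ in responses)
    step = gradient / information
    return theta + max(-step_limit, min(step_limit, step))


def run_adaptive_test(difficulties, answer, n_items=5, theta=0.0):
    """Administer n_items adaptively; answer(i) returns True if item i is correct."""
    used, responses = set(), []
    for _ in range(n_items):
        i = pick_next_item(theta, difficulties, used)
        used.add(i)
        responses.append((difficulties[i], 1 if answer(i) else 0))
        theta = update_ability(theta, responses)
    return theta


def classify(theta, cut_points=(-0.5, 0.5), labels=("low", "middle", "high")):
    """Map an ability estimate to an academic level (hypothetical cut points)."""
    return labels[bisect.bisect_right(cut_points, theta)]
```

    The step limit keeps the estimate finite when every answer so far is correct or every answer is wrong, a case in which the unconstrained maximum-likelihood estimate diverges.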

  12. An analysis of aviation test scores to characterize Student Naval Aviator disqualification

    OpenAIRE

    Wahl, Erich J.

    1998-01-01

    Approved for public release; distribution is unlimited. The U.S. Navy uses the Aviation Selection Test Battery (ASTB) to identify those Student Naval Aviator (SNA) applicants most likely to succeed in flight training. Using classification and regression trees, this thesis concludes that individual answers to an ASTB subtest, the Biographical Inventory, are not good predictors of SNA primary flight grades. It also concludes that those SNA who score less than a 6 on the Pilot Biographical Inv...

  13. The reliability and validity of the Danish Draft Board Cognitive Ability Test: Børge Prien's Prøve.

    Science.gov (United States)

    Teasdale, Thomas W; Hartmann, Peter V W; Pedersen, Christoffer H; Bertelsen, Mette

    2011-04-01

    The Danish Draft Board has used the same test for assessing general cognitive ability, the Børge Prien's Prøve (BPP), for over 50 years during which time all men on reaching the age of 18 become liable for conscription. Data from the test has, over the decades, been used in numerous and wide-ranging research studies. Nonetheless, owing to the special circumstances of its administration, some psychometric properties, which are generally assessed for psychological tests, have not previously been investigated for the BPP. First, since the test is only used at the assessment phase, retesting with the BPP occurs only rarely and under exceptional circumstances. Therefore, its Test-Retest reliability has hitherto not been documented. Second, questions have often been raised as to whether the validity of the BPP is undermined by either a lack of motivation and under-performing among some of the men taking the test, being, as they are, compelled to do so, and/or by gradual obsolescence of the test over the decades of its use. We here present findings from three new studies to show that (a) the BPP has a satisfactory Test-Retest reliability, r=0.77, (b) BPP test scores are not positively associated with expressed attitude to being called upon to serve conscription and (c) the correlation between the BPP and a measure of educational level has remained stable (at about 0.5) through the last two decades. Taken together these three findings further support the continuing value of the BPP in research relating to cognitive ability. © 2010 The Authors. Scandinavian Journal of Psychology © 2010 The Scandinavian Psychological Associations.

  14. Sex Differences in Fluid Reasoning: Manifest and Latent Estimates from the Cognitive Abilities Test

    Directory of Open Access Journals (Sweden)

    Joni M. Lakin

    2014-06-01

    Full Text Available The size and nature of sex differences in cognitive ability continue to be a source of controversy. Conflicting findings result from the selection of measures, samples, and methods used to estimate sex differences. Existing sex differences work on the Cognitive Abilities Test (CogAT) has analyzed manifest variables, leaving open questions about sex differences in latent narrow cognitive abilities and the underlying broad ability of fluid reasoning (Gf). This study attempted to address these questions. A confirmatory bifactor model was used to estimate Gf and three residual narrow ability factors (verbal, quantitative, and figural). We found that latent mean differences were larger than manifest estimates for all three narrow abilities. However, mean differences in Gf were trivial, consistent with previous research. In estimating group variances, the Gf factor showed substantially greater male variability (around 20% greater). The narrow abilities varied: verbal reasoning showed small variability differences while quantitative and figural showed substantial differences in variance (up to 60% greater). These results add precision and nuance to the study of the variability and masking hypothesis.

  15. Impact of Answer-Switching Behavior on Multiple-Choice Test Scores in Higher Education

    Directory of Open Access Journals (Sweden)

    Ramazan BAŞTÜRK

    2011-06-01

    Full Text Available The multiple-choice format is one of the most popular selected-response item formats used in educational testing. Researchers have shown that the multiple-choice test is a useful vehicle for student assessment in core university subjects that usually have large student numbers. Even though educators, test experts and various test resources maintain that the first answer should be retained, many researchers have argued that this advice is not supported by empirical findings. The main aim of this study is to examine how answer-switching behavior affects the multiple-choice test score. Additionally, gender differences and the relationship between the amount of answer switching and item parameters (item difficulty and item discrimination) were investigated. The participants in this study consisted of 207 upper-level College of Education students from mid-sized universities. A midterm exam consisting of 20 multiple-choice questions was used. According to the results of this study, answer-switching behavior significantly increases test scores. On the other hand, there is no significant gender difference in answer-switching behavior. Additionally, there is a significant negative relationship between answer-switching behavior and item difficulties.
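
    The abstract does not state how the item parameters were estimated. As a sketch under the assumption of classical test theory, item difficulty can be taken as the proportion of examinees answering the item correctly, and item discrimination as the point-biserial correlation between the 0/1 item score and the total test score. The function names and the response-matrix layout are hypothetical.

```python
import math


def item_difficulty(item_scores):
    """Proportion of examinees who answered correctly (higher means easier)."""
    return sum(item_scores) / len(item_scores)


def pearson(x, y):
    """Pearson correlation; with a 0/1 variable this is the point-biserial r."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def item_analysis(responses):
    """responses[s][i] is 1 if student s answered item i correctly, else 0.

    Returns one (difficulty, discrimination) pair per item.
    """
    totals = [sum(row) for row in responses]
    n_items = len(responses[0])
    results = []
    for i in range(n_items):
        scores = [row[i] for row in responses]
        results.append((item_difficulty(scores), pearson(scores, totals)))
    return results
```

    Because the total score includes the item itself, this discrimination index is slightly inflated for short tests; correlating with the total of the remaining items is a common correction.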

  16. Are students' impressions of improved learning through active learning methods reflected by improved test scores?

    Science.gov (United States)

    Everly, Marcee C

    2013-02-01

    To report the transformation from lecture to more active learning methods in a maternity nursing course and to evaluate whether student perception of improved learning through active-learning methods is supported by improved test scores. The process of transforming a course into an active-learning model of teaching is described. A voluntary mid-semester survey for student acceptance of the new teaching method was conducted. Course examination results, from both a standardized exam and a cumulative final exam, among students who received lecture in the classroom and students who had active learning activities in the classroom were compared. Active learning activities were very acceptable to students. The majority of students reported learning more from having active-learning activities in the classroom rather than lecture-only and this belief was supported by improved test scores. Students who had active learning activities in the classroom scored significantly higher on a standardized assessment test than students who received lecture only. The findings support the use of student reflection to evaluate the effectiveness of active-learning methods and help validate the use of student reflection of improved learning in other research projects. Copyright © 2011 Elsevier Ltd. All rights reserved.

  17. Validity and reliability of Abbreviated Mental Test Score (AMTS) among older Iranian.

    Science.gov (United States)

    Foroughan, Mahshid; Wahlund, Lars-Olof; Jafari, Zahra; Rahgozar, Mehdi; Farahani, Ida G; Rashedi, Vahid

    2017-11-01

    Cognitive impairment is common among older people and is associated with increased morbidity and mortality. The main aim of this study was to evaluate the validity of the Persian version of the Abbreviated Mental Test Score (AMTS) as a screening tool for dementia. Data were obtained from a cross-sectional study. One hundred and one older adults who were members of the Iranian Alzheimer Association and 101 of their siblings were entered into this study by convenience sampling. The Diagnostic and Statistical Manual of Mental Disorders, 4th edition, criteria for diagnosing dementia and the Mini-Mental State Examination were used as the study tools. The gathered data were analyzed by the Mann-Whitney U-test, the Kruskal-Wallis test, Spearman's rank correlation coefficient, and the receiver-operating characteristic. The AMTS could successfully differentiate the dementia group from the non-dementia group. Scores were significantly correlated with Diagnostic and Statistical Manual of Mental Disorders diagnosis for dementia and Mini-Mental State Examination scores (P < 0.001). Educational level (P < 0.001) and male sex (P = 0.015) were positively associated with AMTS, whereas (P < 0.001) was negatively associated with AMTS. Total Cronbach's α coefficient was 0.90. The scores 6 and 7 showed the optimum balance between sensitivity (99% and 94%, respectively) and specificity (85% and 86%, respectively). The Persian version of the AMTS is a valid cognitive assessment tool for older Iranian adults and can be used for dementia screening in Iran. © 2017 Japanese Psychogeriatric Society.
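
    The trade-off between sensitivity and specificity at a given cutoff can be sketched as follows. This is a minimal illustration, not the authors' analysis: the scores and diagnoses are invented, and the rule that a score at or below the cutoff counts as screen-positive is an assumption made for the example.

```python
def sensitivity_specificity(scores, has_condition, cutoff):
    """Return (sensitivity, specificity) for a screening cutoff.

    Assumption for this sketch: a score <= cutoff is screen-positive,
    i.e. lower scores indicate more impairment.
    """
    tp = fn = tn = fp = 0
    for score, diseased in zip(scores, has_condition):
        positive = score <= cutoff
        if diseased and positive:
            tp += 1
        elif diseased:
            fn += 1
        elif positive:
            fp += 1
        else:
            tn += 1
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical data, not taken from the study
scores = [3, 5, 6, 7, 8, 9, 10, 6]
has_condition = [True, True, True, False, False, False, False, False]
sens, spec = sensitivity_specificity(scores, has_condition, cutoff=6)
```

    Repeating the call for each candidate cutoff gives the pairs from which an optimum balance, such as the one reported for scores 6 and 7, would be chosen.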

  18. Comparison of physical therapy anatomy performance and anxiety scores in timed and untimed practical tests.

    Science.gov (United States)

    Schwartz, Sarah M; Evans, Cathy; Agur, Anne M R

    2015-01-01

    Students in health care professional programs face many stressful tests that determine successful completion of their program. Test anxiety during these high stakes examinations can affect working memory and lead to poor outcomes. Methods of decreasing test anxiety include lengthening the time available to complete examinations or evaluating students using untimed examinations. There is currently no consensus in the literature regarding whether untimed examinations provide a benefit to test performance in clinical anatomy. This study aimed to determine the impact of timed versus untimed practical tests on Master of Physical Therapy student anatomy performance and test anxiety. Test anxiety was measured using the State-Trait Anxiety Inventory (STAI). Differences in performance, anxiety scores, and time taken were compared using paired sample Student's t-tests. Eighty-one of the 84 students completed the study and provided feedback. Students performed significantly higher on the untimed test (P = 0.005), with a significant reduction in test anxiety (P anxiety. If the intended goal of evaluating health care professional students is to determine fundamental competencies, these factors should be considered when designing future curricula. © 2014 American Association of Anatomists.

  19. PROMIS Pain Interference and Physical Function Scores Correlate With the Foot and Ankle Ability Measure (FAAM) in Patients With Hallux Valgus.

    Science.gov (United States)

    Nixon, Devon C; McCormick, Jeremy J; Johnson, Jeffrey E; Klein, Sandra E

    2017-11-01

    Traditional patient-reported outcome instruments like the Foot and Ankle Ability Measure (FAAM) quantify patient disability but often are limited by responder burden and incomplete questionnaires. The Patient-Reported Outcome Measurement Information System (PROMIS) overcomes such obstacles through computer-adaptive technology and can capture outcome data from various domains including physical and psychosocial function. Prior work has compared the FAAM with PROMIS physical function; however, there is little evidence comparing the association between foot and ankle-specific tools like the FAAM with more general outcomes measures of PROMIS pain interference and depression in foot and ankle conditions. (1) We asked whether there was a relationship between FAAM Activities of Daily Living (ADL) scores with PROMIS physical function, pain interference, and depression in patients with hallux valgus. (2) Additionally, we asked if we could identify specific factors that are associated with variance in FAAM and PROMIS physical function scores in patients with hallux valgus. Eighty-five new patients with either a primary or secondary diagnosis of hallux valgus based on clinic billing codes from July 2015 to February 2016 were retrospectively identified. Patients completed FAAM ADL paper-based surveys and electronic PROMIS questionnaires for physical function, pain interference, and depression from new patient visits at a single time. Spearman rho correlations were performed between FAAM ADL and PROMIS scores. Analyses then were used to identify differences in FAAM ADL and PROMIS physical function measures based on demographic variables. Stepwise linear regressions then determined which demographic and/or outcome variable(s) accounted for the variance in FAAM ADL and PROMIS physical function scores. FAAM scores correlated strongly with PROMIS physical function (r = 0.70, p hallux valgus. 
PROMIS tools allow for more-efficient data collection across multiple domains and, moving

  20. Testing of spatial ability: construction and evaluation of a new instrument

    Czech Academy of Sciences Publication Activity Database

    Květon, Petr; Jelínek, Martin; Vobořil, Dalibor

    2014-01-01

    Vol. 56, No. 3 (2014), pp. 233-252 ISSN 0039-3320 R&D Projects: GA ČR(CZ) GAP407/11/2397 Institutional support: RVO:68081740 Keywords: spatial ability * testing * psychometrics Subject RIV: AN - Psychology Impact factor: 0.442, year: 2014

  1. Mental Abilities and School Achievement: A Test of a Mediation Hypothesis

    Science.gov (United States)

    Vock, Miriam; Preckel, Franzis; Holling, Heinz

    2011-01-01

    This study analyzes the interplay of four cognitive abilities--reasoning, divergent thinking, mental speed, and short-term memory--and their impact on academic achievement in school in a sample of adolescents in grades seven to 10 (N = 1135). Based on information processing approaches to intelligence, we tested a mediation hypothesis, which states…

  2. Components of Spatial Thinking: Evidence from a Spatial Thinking Ability Test

    Science.gov (United States)

    Lee, Jongwon; Bednarz, Robert

    2012-01-01

    This article introduces the development and validation of the spatial thinking ability test (STAT). The STAT consists of sixteen multiple-choice questions of eight types. The STAT was validated by administering it to a sample of 532 junior high, high school, and university students. Factor analysis using principal components extraction was applied…

  3. A comprehensive test of evolutionarily increased competitive ability in a highly invasive plant species

    Science.gov (United States)

    Joshi, Srijana; Gruntman, Michal; Bilton, Mark; Seifan, Merav; Tielbörger, Katja

    2014-01-01

    Background and Aims A common hypothesis to explain plants' invasive success is that release from natural enemies in the introduced range selects for reduced allocation to resistance traits and a subsequent increase in resources available for growth and competitive ability (evolution of increased competitive ability, EICA). However, studies that have investigated this hypothesis have been incomplete as they either did not test for all aspects of competitive ability or did not select appropriate competitors. Methods Here, the prediction of increased competitive ability was examined with the invasive plant Lythrum salicaria (purple loosestrife) in a set of common-garden experiments that addressed these aspects by carefully distinguishing between competitive effect and response of invasive and native plants, and by using both intraspecific and interspecific competition settings with a highly vigorous neighbour, Urtica dioica (stinging nettle), which occurs in both ranges. Key Results While the intraspecific competition results showed no differences in competitive effect or response between native and invasive plants, the interspecific competition experiment revealed greater competitive response and effect of invasive plants in both biomass and seed production. Conclusions The use of both intra- and interspecific competition experiments in this study revealed opposing results. While the first experiment refutes the EICA hypothesis, the second shows strong support for it, suggesting evolutionarily increased competitive ability in invasive populations of L. salicaria. It is suggested that the use of naturally co-occurring heterospecifics, rather than conspecifics, may provide a better evaluation of the possible evolutionary shift towards greater competitive ability. PMID:25301818

  4. Development of a Culture Specific Critical Thinking Ability Test and Using It as a Supportive Diagnostic Test for Giftedness

    Science.gov (United States)

    Köksal, Mustafa Serdar

    2016-01-01

    The purposes of this study were to develop a culture-specific critical thinking ability test for 6th, 7th, and 8th grade students in Turkey and to use it as an assessment instrument for giftedness. For these purposes, an item pool of 22 items was formed by writing items focusing on the current and common events presented in (Turkish) media from…

  5. Association testing for next-generation sequencing data using score statistics

    DEFF Research Database (Denmark)

    Skotte, Line; Korneliussen, Thorfinn Sand; Albrechtsen, Anders

    2012-01-01

    ...of genotype calls into account have been proposed; most require numerical optimization which for large-scale data is not always computationally feasible. We show that using a score statistic for the joint likelihood of observed phenotypes and observed sequencing data provides an attractive approach to association testing for next-generation sequencing data. The joint model accounts for the genotype classification uncertainty via the posterior probabilities of the genotypes given the observed sequencing data, which gives the approach higher power than methods based on called genotypes. This strategy remains computationally feasible due to the use of score statistics. As part of the joint likelihood, we model the distribution of the phenotypes using a generalized linear model framework, which works for both quantitative and discrete phenotypes. Thus, the method presented here is applicable to case-control studies...

  6. A high COPD assessment test score may predict anxiety in COPD

    Directory of Open Access Journals (Sweden)

    Harryanto H

    2018-03-01

    Full Text Available Hilman Harryanto,1 Sally Burrows,2 Yuben Moodley1,2 1Department of Respiratory Medicine, Fiona Stanley Hospital, Perth, WA, Australia; 2Faculty of Health and Medical Sciences, Medical School, University of Western Australia, Perth, WA, Australia. The prevalence of anxiety is 55% in patients with COPD,1 and it is associated with worse disease control. Therefore, early recognition and institution of treatment of this comorbidity significantly improve patients' quality of life. Recently, a questionnaire called the COPD assessment test (CAT) has been incorporated into the Global Initiative for Chronic Obstructive Lung Disease (GOLD) guidelines for the management of COPD, and a higher score is associated with increased COPD symptoms.2 Considering the regular use of CAT, it was evaluated whether this tool can also be used to identify anxiety. The CAT score was correlated with the Hospital Anxiety and Depression Scale (HADS) to determine the level at which CAT may predict anxiety.

  7. Test Scores, Class Rank and College Performance: Lessons for Broadening Access and Promoting Success.

    Science.gov (United States)

    Niu, Sunny X; Tienda, Marta

    2012-04-01

    Using administrative data for five Texas universities that differ in selectivity, this study evaluates the relative influence of two key indicators for college success: high school class rank and standardized tests. Empirical results show that class rank is the superior predictor of college performance and that test score advantages do not insulate lower ranked students from academic underperformance. Using the UT-Austin campus as a test case, we conduct a simulation to evaluate the consequences of capping students admitted automatically using both achievement metrics. We find that using class rank to cap the number of students eligible for automatic admission would have roughly uniform impacts across high schools, but imposing a minimum test score threshold on all students would have highly unequal consequences by greatly reducing the admission eligibility of the highest performing students who attend poor high schools while not jeopardizing admissibility of students who attend affluent high schools. We discuss the implications of the Texas admissions experiment for higher education in Europe.

  8. COMPARISON BETWEEN WOOD DRYING DEFECT SCORES: SPECIMEN TESTING X ANALYSIS OF KILN-DRIED BOARDS

    Directory of Open Access Journals (Sweden)

    Djeison Cesar Batista

    2015-04-01

    Full Text Available It is important to develop drying technologies for Eucalyptus grandis lumber, which is one of the most planted species of this genus in Brazil and plays an important role as raw material for the wood industry. The general aim of this work was to assess the conventional kiln drying of juvenile wood of three clones of Eucalyptus grandis. The specific aims were to compare the behavior between: (i) drying defects indicated by tests with wood specimens and conventional kiln-dried boards; and (ii) physical properties and the drying quality. Five 11-year-old trees of each clone were felled, and only flatsawn boards of the first log were used. Basic density and total shrinkage were determined, and the drying test with wood specimens at 100 °C was carried out. Kiln drying of boards was performed, and initial and final moisture content, moisture gradient in thickness, drying stresses and drying defects were assessed. The defect scoring method was used to verify the behavior between the defects detected by specimen testing and the defects detected in kiln-dried boards. The main results were that the drying schedule was too severe for the wood, resulting in a high proportion of boards with defects. According to the defect scoring method, the defects in the drying test with specimens did not correspond to the defects of kiln-dried boards.

  9. The test ability of an adaptive pulse wave for ADC testing

    NARCIS (Netherlands)

    Sheng, Xiaoqin; Kerkhoff, Hans G.

    2010-01-01

    In the conventional ADC production test method, a high-quality analogue sine wave is applied to the Analogue-to-Digital Converter (ADC), which is expensive to generate. Nowadays, an increasing number of ADCs are integrated into a system-on-chip (SoC) platform design, which usually contains a digital

  10. Predicting Stereotype Threat, Test Anxiety, and Cognitive Ability Test Performance: An Examination of Three Models

    Science.gov (United States)

    Sawyer, Jr., Thomas P.; Hollis-Sawyer, Lisa A.

    2005-01-01

    As the classroom and workplace, among other contexts, become more diverse in their population characteristics, the need to be aware of specific factors impacting testing outcome issues correspondingly increases. The focus in this study, among other purposes, was to identify possible interactions between examinee's individual-difference…

  11. Stochastic order in dichotomous item response models for fixed tests, research adaptive tests, or multiple abilities

    NARCIS (Netherlands)

    van der Linden, Willem J.

    1995-01-01

    Dichotomous item response theory (IRT) models can be viewed as families of stochastically ordered distributions of responses to test items. This paper explores several properties of such distributions. The focus is on the conditions under which stochastic order in families of conditional

  12. Clock Drawing Test and the diagnosis of amnestic mild cognitive impairment: can more detailed scoring systems do the work?

    Science.gov (United States)

    Rubínová, Eva; Nikolai, Tomáš; Marková, Hana; Siffelová, Kamila; Laczó, Jan; Hort, Jakub; Vyhnálek, Martin

    2014-01-01

    The Clock Drawing Test is a frequently used cognitive screening test with several scoring systems in elderly populations. We compare simple and complex scoring systems and evaluate the usefulness of the combination of the Clock Drawing Test with the Mini-Mental State Examination to detect patients with mild cognitive impairment. Patients with amnestic mild cognitive impairment (n = 48) and age- and education-matched controls (n = 48) underwent neuropsychological examinations, including the Clock Drawing Test and the Mini-Mental State Examination. Clock drawings were scored by three blinded raters using one simple (6-point scale) and two complex (17- and 18-point scales) systems. The sensitivity and specificity of these scoring systems used alone and in combination with the Mini-Mental State Examination were determined. Complex scoring systems, but not the simple scoring system, were significant predictors of the amnestic mild cognitive impairment diagnosis in logistic regression analysis. At equal levels of sensitivity (87.5%), the Mini-Mental State Examination showed higher specificity (31.3%, compared with 12.5% for the 17-point Clock Drawing Test scoring scale). The combination of Clock Drawing Test and Mini-Mental State Examination scores increased the area under the curve (0.72; p Drawing Test did not differentiate between healthy elderly and patients with amnestic mild cognitive impairment in our sample. Complex scoring systems were slightly more efficient, yet still were characterized by high rates of false-positive results. We found psychometric improvement using combined scores from the Mini-Mental State Examination and the Clock Drawing Test when complex scoring systems were used. The results of this study support the benefit of using combined scores from simple methods.

  13. Micronucleus test for radiation biodosimetry in mass casualty events: Evaluation of visual and automated scoring

    Energy Technology Data Exchange (ETDEWEB)

    Bolognesi, Claudia, E-mail: claudia.bolognesi@istge.i [Environmental Carcinogenesis Unit, National Cancer Research Institute, Largo R. Benzi 10, 16132 Genoa (Italy); Balia, Cristina; Roggieri, Paola [Environmental Carcinogenesis Unit, National Cancer Research Institute, Largo R. Benzi 10, 16132 Genoa (Italy); Cardinale, Francesco [Clinical Epidemiology Unit, National Cancer Research Institute, Largo R. Benzi 10, 16132 Genoa (Italy); Department of Health Sciences, University of Genoa, Genoa (Italy); Bruzzi, Paolo [Clinical Epidemiology Unit, National Cancer Research Institute, Largo R. Benzi 10, 16132 Genoa (Italy); Sorcinelli, Francesca [Environmental Carcinogenesis Unit, National Cancer Research Institute, Largo R. Benzi 10, 16132 Genoa (Italy); Laboratory of Genetics, Histology and Molecular Biology Section, Army Medical and Veterinary, Research Center, Via Santo Stefano Rotondo 4, 00184 Roma (Italy); Lista, Florigio [Laboratory of Genetics, Histology and Molecular Biology Section, Army Medical and Veterinary, Research Center, Via Santo Stefano Rotondo 4, 00184 Roma (Italy); D' Amelio, Raffaele [Sapienza, Universita di Roma II Facolta di Medicina e Chirurgia and Ministero della Difesa, Direzione Generale Sanita Militare (Italy); Righi, Enzo [Frascati National Laboratories, National Institute of Nuclear Physics, Via Enrico Fermi 40, 00044 Frascati, Rome (Italy)

    2011-02-15

    In the case of a large-scale nuclear or radiological incident, a reliable estimate of dose is an essential tool for providing timely assessment of radiation exposure and for making life-saving medical decisions. Cytogenetics is considered the 'gold standard' for biodosimetry. The dicentric analysis (DA) represents the most specific cytogenetic bioassay. The micronucleus test (MN) applied in interphase in peripheral lymphocytes is an alternative and simpler approach. A dose-effect calibration curve for the MN frequency in peripheral lymphocytes from 27 adult donors was established after in vitro irradiation at a dose range 0.15-8 Gy of ¹³⁷Cs gamma rays (dose rate 6 Gy min⁻¹). Dose prediction by visual scoring in a dose-blinded study (0.15-4.0 Gy) revealed a high level of accuracy (R = 0.89). The scoring of MN is time consuming and requires adequate skills and expertise. Automated image analysis is a feasible approach that reduces the time required and increases the accuracy of the dose estimation by decreasing the variability due to subjective evaluation. A good correlation (R = 0.705) between visual and automated scoring with visual correction was observed over the dose range 0-2 Gy. Almost perfect discrimination power for exposure to 1-2 Gy, and a satisfactory power for 0.6 Gy were detected. This threshold level can be considered sufficient for identification of sublethally exposed individuals by automated CBMN assay.

  14. Test and Score Data Summary for TOEFL® Internet-Based and Paper-Based Tests. January 2008-December 2008 Test Data

    Science.gov (United States)

    Educational Testing Service, 2008

    2008-01-01

    The Test of English as a Foreign Language™, better known as TOEFL®, is designed to measure the English-language proficiency of people whose native language is not English. TOEFL scores are accepted by more than 6,000 colleges, universities, and licensing agencies in 130 countries. The test is also used by governments, and scholarship and…

  15. Scoring in genetically modified organism proficiency tests based on log-transformed results.

    Science.gov (United States)

    Thompson, Michael; Ellison, Stephen L R; Owen, Linda; Mathieson, Kenneth; Powell, Joanne; Key, Pauline; Wood, Roger; Damant, Andrew P

    2006-01-01

    The study considers data from 2 UK-based proficiency schemes and includes data from a total of 29 rounds and 43 test materials over a period of 3 years. The results from the 2 schemes are similar and reinforce each other. The amplification process used in quantitative polymerase chain reaction determinations predicts a mixture of normal, binomial, and lognormal distributions dominated by the latter 2. As predicted, the study results consistently follow a positively skewed distribution. Log-transformation prior to calculating z-scores is effective in establishing near-symmetric distributions that are sufficiently close to normal to justify interpretation on the basis of the normal distribution.
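
    The idea of log-transforming results before scoring can be sketched as below. This is only an illustration under stated assumptions: the z-score is taken as (log10 x − log10 x_a) / σ, and the assigned value x_a and the target standard deviation σ (in log10 units) are hypothetical inputs, not values from the study.

```python
import math


def log_z_score(result, assigned_value, sigma_log):
    """z-score computed on the log10 scale.

    All three arguments are illustrative assumptions: `result` and
    `assigned_value` are positive concentrations, `sigma_log` is a
    target standard deviation expressed in log10 units.
    """
    if result <= 0 or assigned_value <= 0:
        raise ValueError("log-transformation requires positive values")
    return (math.log10(result) - math.log10(assigned_value)) / sigma_log


# Hypothetical round: assigned content 1.0 %, target SD 0.2 log10 units
results = [0.5, 1.0, 2.0]
z_scores = [log_z_score(r, 1.0, 0.2) for r in results]
```

    On this scale a result that is half the assigned value and one that is double it receive z-scores of equal magnitude and opposite sign, which is the symmetry the transformation is meant to restore for positively skewed data.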

  16. DISCRIMINATIVE ANALYSIS OF TESTS FOR EVALUATING SITUATIONMOTORIC ABILITIES BETWEEN TWO GROUPS OF BASKETBALL PLAYERS SELECTED BY THE TEST OF SOCIOMETRY

    Directory of Open Access Journals (Sweden)

    Abdulla Elezi

    2011-09-01

    Full Text Available The aim of this work was to determine the differences between two groups of basketball players, selected with the modified sociometric test (Paranosić and Lazarević), in some tests for assessing situation-motor skills. The test sample consisted of the 20 basketball players who had the most positive points and the 20 basketball players who had the most negative points, 40 players in total. A t-test was applied to determine whether there are differences between the two groups of basketball players selected with the help of the sociometric test. Analyses were made with the program SPSS 8.0. The discriminative analysis determined that there are no differences in the arithmetic means between the group of basketball players who had the most positive points and the group who had the most negative points in some tests for assessing situation-motor abilities.

  17. Increased correlation coefficient between the written test score and tutors’ performance test scores after training of tutors for assessment of medical students during problem-based learning course in Malaysia

    Directory of Open Access Journals (Sweden)

    Heethal Jaiprakash

    2016-03-01

    Full Text Available This paper is aimed at finding if there was a change of correlation between the written test score and tutors’ performance test scores in the assessment of medical students during a problem-based learning (PBL) course in Malaysia. This is a cross-sectional observational study, conducted among 264 medical students in two groups from November 2010 to November 2012. The first group’s tutors did not receive tutor training; while the second group’s tutors were trained in the PBL process. Each group was divided into high, middle and low achievers based on their end-of-semester exam scores. PBL scores were taken which included written test scores and tutors’ performance test scores. Pearson correlation coefficient was calculated between the two kinds of scores in each group. The correlation coefficient between the written scores and tutors’ scores in group 1 was 0.099 (p<0.001) and for group 2 was 0.305 (p<0.001). The higher correlation coefficient in the group where tutors received the PBL training reinforces the importance of tutor training before their participation in the PBL course.

  18. Increased correlation coefficient between the written test score and tutors' performance test scores after training of tutors for assessment of medical students during problem-based learning course in Malaysia.

    Science.gov (United States)

    Jaiprakash, Heethal; Min, Aung Ko Ko; Ghosh, Sarmishtha

    2016-03-01

    This paper is aimed at finding if there was a change of correlation between the written test score and tutors' performance test scores in the assessment of medical students during a problem-based learning (PBL) course in Malaysia. This is a cross-sectional observational study, conducted among 264 medical students in two groups from November 2010 to November 2012. The first group's tutors did not receive tutor training; while the second group's tutors were trained in the PBL process. Each group was divided into high, middle and low achievers based on their end-of-semester exam scores. PBL scores were taken which included written test scores and tutors' performance test scores. Pearson correlation coefficient was calculated between the two kinds of scores in each group. The correlation coefficient between the written scores and tutors' scores in group 1 was 0.099 (p<0.001) and for group 2 was 0.305 (p<0.001). The higher correlation coefficient in the group where tutors received the PBL training reinforces the importance of tutor training before their participation in the PBL course.
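
    For readers unfamiliar with the statistic, a Pearson correlation between two kinds of scores can be computed as in the sketch below. The function is a plain textbook implementation and the score lists are invented for illustration; neither comes from the study.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squared deviations about the means
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical written-test and tutor-rating scores for five students
written = [55, 62, 70, 48, 81]
tutor = [6, 7, 7, 5, 9]
r = pearson_r(written, tutor)
```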

  19. Test analysis and research on static choice reaction ability of commercial vehicle drivers

    Science.gov (United States)

    Zhang, Lingchao; Wei, Lang; Qiao, Jie; Tian, Shun; Wang, Shengchang

    2017-03-01

    Drivers' choice reaction ability is related to safe driving, and studying its influence on traffic safety is important. First, the paper uses a choice reaction detector developed by the research group to test the choice reaction ability of commercial vehicle drivers, obtaining 2641 valid samples. Then, using mathematical statistics, the paper finds that the average reaction time of the accident group does not differ from that of the non-accident group, and introduces the variance rate of reaction time as a new index to replace it. The results show that two test indices, choice reaction errors and the variance rate of reaction time, are positively correlated with accidents. Finally, based on the detector's test results, the paper formulates a four-level detection threshold to help transportation companies assess commercial vehicle drivers.

  20. Development of a psychological test to diagnose abilities required for successful learning medicine

    Directory of Open Access Journals (Sweden)

    H.-W. Gessmann

    2014-08-01

    Full Text Available We substantiate the need for psychological tools for diagnosing the abilities required for successful learning at a medical university, and describe the progress of their development. The questionnaire is developed based on U.S. and European success tests, and its design follows the well-known “test for medical professions” (TMS). The “Kostroma test for medical professions” (KTMP) is not a translation or adaptation of the TMS to Russian conditions. It will be re-designed with new test items based on the principles of classical test construction. Creating scientifically based methods of psychological diagnosis of general cognitive ability is a prerequisite for the successful solution of a wide range of research and practical issues related to improving the effectiveness of education and training programs.

  1. A Comparison of the Approaches of Generalizability Theory and Item Response Theory in Estimating the Reliability of Test Scores for Testlet-Composed Tests

    Science.gov (United States)

    Lee, Guemin; Park, In-Yong

    2012-01-01

    Previous assessments of the reliability of test scores for testlet-composed tests have indicated that item-based estimation methods overestimate reliability. This study was designed to address issues related to the extent to which item-based estimation methods overestimate the reliability of test scores composed of testlets and to compare several…

  2. REPRODUCIBILITY OF THE MODIFIED STAR EXCURSION BALANCE TEST COMPOSITE AND SPECIFIC REACH DIRECTION SCORES.

    Science.gov (United States)

    van Lieshout, Remko; Reijneveld, Elja A E; van den Berg, Sandra M; Haerkens, Gijs M; Koenders, Niek H; de Leeuw, Arina J; van Oorsouw, Roel G; Paap, Davy; Scheffer, Else; Weterings, Stijn; Stukstette, Mirelle J

    2016-06-01

    The mSEBT is a screening tool used to evaluate dynamic balance. Most research investigating measurement properties focused on intrarater reliability and was done in small samples. To know whether the mSEBT is useful to discriminate dynamic balance between persons and to evaluate changes in dynamic balance, more research into intra- and interrater reliability and smallest detectable change (synonymous with minimal detectable change) is needed. To estimate intra- and interrater reliability and smallest detectable change of the mSEBT in adults at risk for ankle sprain. Cross-sectional, test-retest design. Fifty-five healthy young adults participating in sports at risk for ankle sprain participated (mean ± SD age, 24.0 ± 2.9 years). Each participant performed three test sessions within one hour and was rated by two physical therapists (session 1, rater 1; session 2, rater 2; session 3, rater 1). Participants and raters were blinded for previous measurements. Normalized composite and reach direction scores for the right and left leg were collected. Analysis of variance was used to calculate intraclass correlation coefficient values for intra- and interrater reliability. Smallest detectable change values were calculated based on the standard error of measurement. Intra- and interrater reliability for both legs was good to excellent (intraclass correlation coefficient ranging from 0.87 to 0.94). The intrarater smallest detectable change for the composite score of the right leg was 7.2% and for the left 6.2%. The interrater smallest detectable change for the composite score of the right leg was 6.9% and for the left 5.0%. The mSEBT is a reliable measurement instrument to discriminate dynamic balance between persons. Most smallest detectable change values of the mSEBT appear to be large. More research is needed to investigate if the mSEBT is usable for evaluative purposes. Level 2.
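
    The step from reliability to smallest detectable change can be sketched as follows. The abstract says only that the values were based on the standard error of measurement; the commonly used formulas SEM = SD × √(1 − ICC) and SDC = 1.96 × √2 × SEM are assumed here, and the standard deviation and ICC in the example are hypothetical.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from a between-subject SD and an ICC.

    Assumed formula for this sketch: SEM = SD * sqrt(1 - ICC).
    """
    return sd * math.sqrt(1.0 - icc)


def smallest_detectable_change(sem, z=1.96):
    """Smallest detectable change at the 95% level (assumed formula).

    The factor sqrt(2) reflects that a change score involves two
    measurements, each carrying measurement error.
    """
    return z * math.sqrt(2.0) * sem


# Hypothetical inputs: SD of 6.0 percentage points, ICC of 0.90
sem = sem_from_icc(sd=6.0, icc=0.90)
sdc = smallest_detectable_change(sem)
```

    Under these assumptions, even a high ICC can leave a sizeable smallest detectable change when the between-subject spread is wide, which is consistent with the abstract's remark that most values appear large.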

  3. Breast is best, but for how long? Testing breastfeeding guidelines for optimal cognitive ability

    OpenAIRE

    Doyle, Orla; Timmins, Lori

    2008-01-01

    Objectives. To investigate the relationship between breastfeeding duration and cognitive development using longitudinal survey data. The World Health Organisation (WHO) and the American Academy of Pediatrics (AAP) recommend exclusive breastfeeding until six months post-partum and a combination of complementary foods and breast milk thereafter. This study estimates non-parametric regression models to test whether these recommendations also hold for cognitive ability. Design. Longitudinal cohor...

  4. Reliability and validity of the new Tanaka B Intelligence Scale scores: a group intelligence test.

    Directory of Open Access Journals (Sweden)

    Yota Uno

    OBJECTIVE: The present study evaluated the reliability and concurrent validity of the new Tanaka B Intelligence Scale, which is an intelligence test that can be administered to groups within a short period of time. METHODS: The new Tanaka B Intelligence Scale and Wechsler Intelligence Scale for Children-Third Edition were administered to 81 subjects (mean age ± SD 15.2 ± 0.7 years) residing in a juvenile detention home; reliability was assessed using Cronbach's alpha coefficient, and concurrent validity was assessed using the one-way analysis of variance intraclass correlation coefficient. Moreover, receiver operating characteristic analysis for screening for individuals who have a deficit in intellectual function (an FIQ<70) was performed. In addition, stratum-specific likelihood ratios for detection of intellectual disability were calculated. RESULTS: The Cronbach's alpha for the new Tanaka B Intelligence Scale IQ (BIQ) was 0.86, and the intraclass correlation coefficient with FIQ was 0.83. Receiver operating characteristic analysis demonstrated an area under the curve of 0.89 (95% CI: 0.85-0.96). In addition, the stratum-specific likelihood ratio for the BIQ≤65 stratum was 13.8 (95% CI: 3.9-48.9), and the stratum-specific likelihood ratio for the BIQ≥76 stratum was 0.1 (95% CI: 0.03-0.4). Thus, intellectual disability could be ruled out or determined. CONCLUSION: The present results demonstrated that the new Tanaka B Intelligence Scale score had high reliability and concurrent validity with the Wechsler Intelligence Scale for Children-Third Edition score. Moreover, the post-test probability for the BIQ could be calculated when screening for individuals who have a deficit in intellectual function. The new Tanaka B Intelligence Test is convenient and can be administered within a variety of settings. This enables evaluation of intellectual development even in settings where performing intelligence tests has previously been difficult.

  5. Reliability and validity of the new Tanaka B Intelligence Scale scores: a group intelligence test.

    Science.gov (United States)

    Uno, Yota; Mizukami, Hitomi; Ando, Masahiko; Yukihiro, Ryoji; Iwasaki, Yoko; Ozaki, Norio

    2014-01-01

    The present study evaluated the reliability and concurrent validity of the new Tanaka B Intelligence Scale, which is an intelligence test that can be administered to groups within a short period of time. The new Tanaka B Intelligence Scale and Wechsler Intelligence Scale for Children-Third Edition were administered to 81 subjects (mean age ± SD 15.2 ± 0.7 years) residing in a juvenile detention home; reliability was assessed using Cronbach's alpha coefficient, and concurrent validity was assessed using the one-way analysis of variance intraclass correlation coefficient. Moreover, receiver operating characteristic analysis for screening for individuals who have a deficit in intellectual function (an FIQ<70) was performed. In addition, stratum-specific likelihood ratios for detection of intellectual disability were calculated. The Cronbach's alpha for the new Tanaka B Intelligence Scale IQ (BIQ) was 0.86, and the intraclass correlation coefficient with FIQ was 0.83. Receiver operating characteristic analysis demonstrated an area under the curve of 0.89 (95% CI: 0.85-0.96). In addition, the stratum-specific likelihood ratio for the BIQ≤65 stratum was 13.8 (95% CI: 3.9-48.9), and the stratum-specific likelihood ratio for the BIQ≥76 stratum was 0.1 (95% CI: 0.03-0.4). Thus, intellectual disability could be ruled out or determined. The present results demonstrated that the new Tanaka B Intelligence Scale score had high reliability and concurrent validity with the Wechsler Intelligence Scale for Children-Third Edition score. Moreover, the post-test probability for the BIQ could be calculated when screening for individuals who have a deficit in intellectual function. The new Tanaka B Intelligence Test is convenient and can be administered within a variety of settings. This enables evaluation of intellectual development even in settings where performing intelligence tests has previously been difficult.
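    The abstract notes that a post-test probability can be calculated from the stratum-specific likelihood ratios. A minimal sketch of that calculation follows, using the standard odds form of Bayes' rule. The likelihood ratios 13.8 and 0.1 come from the abstract; the pre-test probability of 0.20 and the function name are hypothetical, chosen only for illustration.

```python
def post_test_probability(pre_test_prob: float, likelihood_ratio: float) -> float:
    """Convert a pre-test probability to a post-test probability.

    Uses the odds form of Bayes' rule:
        post-test odds = pre-test odds * likelihood ratio
    """
    if not 0.0 < pre_test_prob < 1.0:
        raise ValueError("pre_test_prob must lie strictly between 0 and 1")
    if likelihood_ratio < 0.0:
        raise ValueError("likelihood_ratio must be non-negative")
    pre_odds = pre_test_prob / (1.0 - pre_test_prob)
    post_odds = pre_odds * likelihood_ratio
    return post_odds / (1.0 + post_odds)


# Hypothetical pre-test probability of intellectual disability (assumption).
pre_test = 0.20

# Stratum-specific likelihood ratios reported in the abstract.
lr_by_stratum = {"BIQ<=65": 13.8, "BIQ>=76": 0.1}

for stratum, lr in lr_by_stratum.items():
    p = post_test_probability(pre_test, lr)
    print(f"{stratum}: post-test probability = {p:.3f}")
```

    A likelihood ratio well above 1 raises the probability and one well below 1 lowers it, which is the sense in which the abstract says intellectual disability could be "ruled out or determined".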

  6. A comprehensive test of evolutionarily increased competitive ability in a highly invasive plant species.

    Science.gov (United States)

    Joshi, Srijana; Gruntman, Michal; Bilton, Mark; Seifan, Merav; Tielbörger, Katja

    2014-12-01

    A common hypothesis to explain plants' invasive success is that release from natural enemies in the introduced range selects for reduced allocation to resistance traits and a subsequent increase in resources available for growth and competitive ability (evolution of increased competitive ability, EICA). However, studies that have investigated this hypothesis have been incomplete as they either did not test for all aspects of competitive ability or did not select appropriate competitors. Here, the prediction of increased competitive ability was examined with the invasive plant Lythrum salicaria (purple loosestrife) in a set of common-garden experiments that addressed these aspects by carefully distinguishing between competitive effect and response of invasive and native plants, and by using both intraspecific and interspecific competition settings with a highly vigorous neighbour, Urtica dioica (stinging nettle), which occurs in both ranges. While the intraspecific competition results showed no differences in competitive effect or response between native and invasive plants, the interspecific competition experiment revealed greater competitive response and effect of invasive plants in both biomass and seed production. The use of both intra- and interspecific competition experiments in this study revealed opposing results. While the first experiment refutes the EICA hypothesis, the second shows strong support for it, suggesting evolutionarily increased competitive ability in invasive populations of L. salicaria. It is suggested that the use of naturally co-occurring heterospecifics, rather than conspecifics, may provide a better evaluation of the possible evolutionary shift towards greater competitive ability. © The Author 2014. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  7. Back extensor muscle endurance test scores in coal miners in Australia

    Energy Technology Data Exchange (ETDEWEB)

    Stewart, M.; Latimer, J.; Jamieson, M. [University of Sydney, Sydney, NSW (Australia). Faculty of Health and Science, School of Physiotherapy]

    2003-06-01

    Low back pain is a common complaint among those working in the Australian coal mining industry. One test that may be predictive of first-time episodes of low back pain is the Biering-Sorensen test of back extensor endurance strength. While this test has been evaluated in overseas sedentary populations, normative data and the discriminative ability of the test have not been evaluated with coal miners. Eighty-eight coal miners completed a questionnaire for known risk factors for low back pain, performed the Biering-Sorensen test, and undertook a test of aerobic fitness. Data analysis was performed to describe the groups and to determine whether any significant difference existed between those with a past history of low back pain and those without. Significantly lower than expected holding times were found in this group of coal miners (mean 113 s). This result was significantly lower than demonstrated in previous studies. When holding times for those with a past history of low back pain were compared with times for those with no history of low back pain, the difference was not statistically significant, nor was there a significant difference in fitness between those with a past history of low back pain and those without. It is concluded that coal miners in Australia have lower than normal Biering-Sorensen holding times. This lower back holding time does not differ between coal miners with a past history of low back pain and those without.

  8. Grouped to Achieve: Are There Benefits to Assigning Students to Heterogeneous Cooperative Learning Groups Based on Pre-Test Scores?

    Science.gov (United States)

    Werth, Arman Karl

    Cooperative learning has been one of the most widely used instructional practices around the world since the early 1980s. Small learning groups have been in existence since the beginning of the human race. These groups have grown in their variance and complexity over time. Classrooms are getting more diverse every year and instructors need a way to take advantage of this diversity to improve learning. The purpose of this study was to see if heterogeneous cooperative learning groups based on student achievement can be used as a differentiated instructional strategy to increase students' ability to demonstrate knowledge of science concepts and ability to do engineering design. This study includes two different groups made up of two different middle school science classrooms of 25-30 students. These students were given an engineering design problem to solve within cooperative learning groups. One class was put into heterogeneous cooperative learning groups based on students' pre-test scores. The other class was grouped based on random assignment. The study measured the difference between each class's pre-post gains, students' responses to a group interaction form and interview questions addressing their perceptions of the makeup of their groups. The findings of the study were that there was no significant difference between learning gains for the treatment and comparison groups. There was a significant difference between the treatment and comparison groups in student perceptions of their group's ability to stay on task and manage their time efficiently. Both the comparison and treatment groups had a positive perception of the composition of their cooperative learning groups.

  9. The design organization test: further demonstration of reliability and validity as a brief measure of visuospatial ability.

    Science.gov (United States)

    Killgore, William D S; Gogel, Hannah

    2014-01-01

    Neuropsychological assessments are frequently time-consuming and fatiguing for patients. Brief screening evaluations may reduce test duration and allow more efficient use of time by permitting greater attention toward neuropsychological domains showing probable deficits. The Design Organization Test (DOT) was initially developed as a 2-min paper-and-pencil alternative for the Block Design (BD) subtest of the Wechsler scales. Although initially validated for clinical neurologic patients, we sought to further establish the reliability and validity of this test in a healthy, more diverse population. Two alternate versions of the DOT and the Wechsler Abbreviated Scale of Intelligence (WASI) were administered to 61 healthy adult participants. The DOT showed high alternate forms reliability (r = .90-.92), and the two versions yielded equivalent levels of performance. The DOT was highly correlated with BD (r = .76-.79) and was significantly correlated with all subscales of the WASI. The DOT proved useful when used in lieu of BD in the calculation of WASI IQ scores. Findings support the reliability and validity of the DOT as a measure of visuospatial ability and suggest its potential worth as an efficient estimate of intellectual functioning in situations where lengthier tests may be inappropriate or unfeasible.

  10. An alternative to the balance error scoring system: using a low-cost balance board to improve the validity/reliability of sports-related concussion balance testing.

    Science.gov (United States)

    Chang, Jasper O; Levy, Susan S; Seay, Seth W; Goble, Daniel J

    2014-05-01

    Recent guidelines recommend that sports medicine professionals use balance tests to assess sensorimotor status in the management of concussions. The present study sought to determine whether a low-cost balance board could provide a valid, reliable, and objective means of performing this balance testing. Criterion validity testing relative to a gold standard and 7 day test-retest reliability. University biomechanics laboratory. Thirty healthy young adults. Balance ability was assessed on 2 days separated by 1 week using (1) a gold standard measure (ie, scientific grade force plate), (2) a low-cost Nintendo Wii Balance Board (WBB), and (3) the Balance Error Scoring System (BESS). Validity of the WBB center of pressure path length and BESS scores was determined relative to the force plate data. Test-retest reliability was established based on intraclass correlation coefficients. Composite scores for the WBB had excellent validity (r = 0.99) and test-retest reliability (R = 0.88). Both the validity (r = 0.10-0.52) and test-retest reliability (r = 0.61-0.78) were lower for the BESS. These findings demonstrate that a low-cost balance board can provide improved balance testing accuracy/reliability compared with the BESS. This approach provides a potentially more valid/reliable, yet affordable, means of assessing sports-related concussion compared with current methods.

  11. Performance on large-scale science tests: Item attributes that may impact achievement scores

    Science.gov (United States)

    Gordon, Janet Victoria

    Significant differences in achievement among ethnic groups persist on the eighth-grade science Washington Assessment of Student Learning (WASL). The WASL measures academic performance in science using both scenario and stand-alone question types. Previous research suggests that presenting target items connected to an authentic context, like scenario question types, can increase science achievement scores especially in underrepresented groups and thus help to close the achievement gap. The purpose of this study was to identify significant differences in performance between gender and ethnic subgroups by question type on the 2005 eighth-grade science WASL. MANOVA and ANOVA were used to examine relationships between gender and ethnic subgroups as independent variables with achievement scores on scenario and stand-alone question types as dependent variables. MANOVA revealed no significant effects for gender, suggesting that the 2005 eighth-grade science WASL was gender neutral. However, there were significant effects for ethnicity. ANOVA revealed significant effects for ethnicity and ethnicity by gender interaction in both question types. Effect sizes were negligible for the ethnicity by gender interaction. Large effect sizes between ethnicities on scenario question types became moderate to small effect sizes on stand-alone question types. This indicates the score advantage the higher performing subgroups had over the lower performing subgroups was not as large on stand-alone question types compared to scenario question types. A further comparison examined performance on multiple-choice items only within both question types. Similar achievement patterns between ethnicities emerged; however, achievement patterns between genders changed in boys' favor. Scenario question types appeared to register differences between ethnic groups to a greater degree than stand-alone question types. These differences may be attributable to individual differences in cognition

  12. The Mediating Effect of Listening Metacognitive Awareness between Test-Taking Motivation and Listening Test Score: An Expectancy-Value Theory Approach.

    Science.gov (United States)

    Xu, Jian

    2017-01-01

    The present study investigated test-taking motivation in L2 listening testing context by applying Expectancy-Value Theory as the framework. Specifically, this study was intended to examine the complex relationships among expectancy, importance, interest, listening anxiety, listening metacognitive awareness, and listening test score using data from a large-scale and high-stakes language test among Chinese first-year undergraduates. Structural equation modeling was used to examine the mediating effect of listening metacognitive awareness on the relationship between expectancy, importance, interest, listening anxiety, and listening test score. According to the results, test takers' listening scores can be predicted by expectancy, interest, and listening anxiety significantly. The relationship between expectancy, interest, listening anxiety, and listening test score was mediated by listening metacognitive awareness. The findings have implications for test takers to improve their test taking motivation and listening metacognitive awareness, as well as for L2 teachers to intervene in L2 listening classrooms.

  13. The Mediating Effect of Listening Metacognitive Awareness between Test-Taking Motivation and Listening Test Score: An Expectancy-Value Theory Approach

    Directory of Open Access Journals (Sweden)

    Jian Xu

    2017-12-01

    The present study investigated test-taking motivation in L2 listening testing context by applying Expectancy-Value Theory as the framework. Specifically, this study was intended to examine the complex relationships among expectancy, importance, interest, listening anxiety, listening metacognitive awareness, and listening test score using data from a large-scale and high-stakes language test among Chinese first-year undergraduates. Structural equation modeling was used to examine the mediating effect of listening metacognitive awareness on the relationship between expectancy, importance, interest, listening anxiety, and listening test score. According to the results, test takers’ listening scores can be predicted by expectancy, interest, and listening anxiety significantly. The relationship between expectancy, interest, listening anxiety, and listening test score was mediated by listening metacognitive awareness. The findings have implications for test takers to improve their test taking motivation and listening metacognitive awareness, as well as for L2 teachers to intervene in L2 listening classrooms.

  14. Survival analysis of colorectal cancer patients with tumor recurrence using global score test methodology

    Energy Technology Data Exchange (ETDEWEB)

    Zain, Zakiyah, E-mail: zac@uum.edu.my; Ahmad, Yuhaniz, E-mail: yuhaniz@uum.edu.my [School of Quantitative Sciences, Universiti Utara Malaysia, UUM Sintok 06010, Kedah (Malaysia)]; Azwan, Zairul, E-mail: zairulazwan@gmail.com; Raduan, Farhana, E-mail: farhanaraduan@gmail.com; Sagap, Ismail, E-mail: drisagap@yahoo.com [Surgery Department, Universiti Kebangsaan Malaysia Medical Centre, Jalan Yaacob Latif, 56000 Bandar Tun Razak, Kuala Lumpur (Malaysia)]; Aziz, Nazrina, E-mail: nazrina@uum.edu.my

    2014-12-04

    Colorectal cancer is the third and the second most common cancer worldwide in men and women respectively, and the second in Malaysia for both genders. Surgery, chemotherapy and radiotherapy are among the options available for treatment of patients with colorectal cancer. In clinical trials, the main purpose is often to compare efficacy between experimental and control treatments. Treatment comparisons often involve several responses or endpoints, and this situation complicates the analysis. In the case of colorectal cancer, sets of responses concerned with survival times include: times from tumor removal until the first, the second and the third tumor recurrences, and time to death. For a patient, the time to recurrence is correlated with the overall survival. In this study, global score test methodology is used to combine the univariate score statistics for comparing treatments with respect to each survival endpoint into a single statistic. The data on tumor recurrence and overall survival of colorectal cancer patients are taken from a Malaysian hospital. The results are found to be similar to those computed using the established Wei, Lin and Weissfeld method. Key factors such as ethnicity, gender, age and stage at diagnosis are also reported.

  15. Change of direction ability test differentiates higher level and lower level soccer referees

    Science.gov (United States)

    Los, Arcos A; Grande, I; Casajús, JA

    2016-01-01

    This report examines the agility and level of acceleration capacity of Spanish soccer referees and investigates the possible differences between field referees of different categories. The speed test consisted of 3 maximum acceleration stretches of 15 metres. The change of direction ability (CODA) test used in this study was a modification of the Modified Agility Test (MAT). The study included a sample of 41 Spanish soccer field referees from the Navarre Committee of Soccer Referees divided into two groups: i) the higher level group (G1, n = 20): 2ndA, 2ndB and 3rd division referees from the Spanish National Soccer League (28.43 ± 1.39 years); and ii) the lower level group (G2, n = 21): Navarre Provincial League soccer referees (29.54 ± 1.87 years). Significant differences were found with respect to the CODA between G1 (5.72 ± 0.13 s) and G2 (6.06 ± 0.30 s), while no differences were encountered between groups in acceleration ability. No significant correlations were obtained in G1 between agility and the capacity to accelerate. Significant correlations were found between sprint and agility times in the G2 and in the total group. The results of this study showed that agility can be used as a discriminating factor for differentiating between national and regional field referees; however, no observable differences were found over the 5 and 15 m sprint tests. PMID:27274111

  16. Genome scan for linkage to asthma using a linkage disequilibrium-lod score test.

    Science.gov (United States)

    Jiang, Y; Slager, S L; Huang, J

    2001-01-01

    We report a genome-wide linkage study of asthma on the German and Collaborative Study on the Genetics of Asthma (CSGA) data. Using a combined linkage and linkage disequilibrium test and the nonparametric linkage score, we identified 13 markers from the German data, 1 marker from the African American (CSGA) data, and 7 markers from the Caucasian (CSGA) data in which the p-values ranged between 0.0001 and 0.0100. From our analysis and taking into account previous published linkage studies of asthma, we suggest that three regions in chromosome 5 (around D5S418, D5S644, and D5S422), one region in chromosome 6 (around three neighboring markers D6S1281, D6S291, and D6S1019), one region in chromosome 11 (around D11S2362), and two regions in chromosome 12 (around D12S351 and D12S324) especially merit further investigation.

  17. Applying cognitive acuity theory to the development and scoring of situational judgment tests.

    Science.gov (United States)

    Leeds, J Peter

    2017-11-09

    The theory of cognitive acuity (TCA) treats the response options within items as signals to be detected and uses psychophysical methods to estimate the respondents' sensitivity to these signals. Such a framework offers new methods to construct and score situational judgment tests (SJT). Leeds (2012) defined cognitive acuity as the capacity to discern correctness and distinguish between correctness differences among simultaneously presented situation-specific response options. In this study, SJT response options were paired in order to offer the respondent a two-option choice. The contrast in correctness valence between the two options determined the magnitude of signal emission, with larger signals portending a higher probability of detection. A logarithmic relation was found between correctness valence contrast (signal stimulus) and its detectability (sensation response). Respondent sensitivity to such signals was measured and found to be related to the criterion variables. The linkage between psychophysics and elemental psychometrics may offer new directions for measurement theory.

  18. Correlations between the scores of computerized adaptive testing, paper and pencil tests, and the Korean Medical Licensing Examination

    Directory of Open Access Journals (Sweden)

    Mee Young Kim

    2005-06-01

    To evaluate the usefulness of computerized adaptive testing (CAT) in medical school, the General Examination for senior medical students was administered as a paper and pencil test (P&P) and using CAT. The General Examination is a graduate examination, which is also a preliminary examination for the Korean Medical Licensing Examination (KMLE). The correlations between the results of the CAT and P&P and KMLE were analyzed. The correlation between the CAT and P&P was 0.8013 (p=0.000); that between the CAT and P&P was 0.7861 (p=0.000); and that between the CAT and KMLE was 0.6436 (p=0.000). Six out of 12 students with an ability estimate below 0.52 failed the KMLE. The results showed that CAT could replace P&P in medical school. The ability of CAT to predict whether students would pass the KMLE was 0.5 when the criterion of the theta value was set at -0.52, which was chosen arbitrarily for the prediction of pass or failure.

  19. Treatment for Schistosoma japonicum, reduction of intestinal parasite load, and cognitive test score improvements in school-aged children.

    Directory of Open Access Journals (Sweden)

    Amara E Ezeamama

    To determine whether treatment of intestinal parasitic infections improves cognitive function in school-aged children, we examined changes in cognitive test scores over 18 months in relation to: (i) treatment-related Schistosoma japonicum intensity decline, (ii) spontaneous reduction of single soil-transmitted helminth (STH) species, and (iii) ≥2 STH infections among 253 S. japonicum-infected children. Helminth infections were assessed at baseline and quarterly by the Kato-Katz method. S. japonicum infection was treated at baseline using praziquantel. An intensity-based indicator of lower vs. no change/higher infection was defined separately for each helminth species and joint intensity declines of ≥2 STH species. In addition, S. japonicum infection-free duration was defined in four categories based on time of schistosome re-infection: >18 (i.e. cured), >12 to ≤18, 6 to ≤12 and ≤6 (persistently infected) months. There was no baseline treatment for STHs but their intensity varied possibly due to spontaneous infection clearance/acquisition. Four cognitive tests were administered at baseline, 6, 12, and 18 months following S. japonicum treatment: learning and memory domains of Wide Range Assessment of Memory and Learning (WRAML), verbal fluency (VF), and Philippine nonverbal intelligence test (PNIT). Linear regression models were used to relate changes in respective infections to test performance with adjustment for sociodemographic confounders and coincident helminth infections. Children cured (β = 5.8; P = 0.02) and those schistosome-free for >12 months (β = 1.5; P = 0.03) scored higher in WRAML memory and VF tests compared to persistently infected children independent of STH infections. A decline vs. no change/increase of any individual STH species (β:11.5-14.5; all P12 months post-treatment and those who experienced declines of ≥2 STH species scored higher in three of four cognitive tests. Our result suggests that sustained

  20. Poisson Approximation-Based Score Test for Detecting Association of Rare Variants.

    Science.gov (United States)

    Fang, Hongyan; Zhang, Hong; Yang, Yaning

    2016-07-01

    Genome-wide association study (GWAS) has achieved great success in identifying genetic variants, but the nature of GWAS has determined its inherent limitations. Under the common disease rare variants (CDRV) hypothesis, the traditional association analysis methods commonly used in GWAS for common variants do not have enough power for detecting rare variants with a limited sample size. As a solution to this problem, pooling rare variants by their functions provides an efficient way of identifying susceptible genes. Rare variants typically have low frequencies of minor alleles, and the distribution of the total number of minor alleles of the rare variants can be approximated by a Poisson distribution. Based on this fact, we propose a new test method, the Poisson Approximation-based Score Test (PAST), for association analysis of rare variants. Two testing methods, namely, ePAST and mPAST, are proposed based on different strategies of pooling rare variants. Simulation results and application to the CRESCENDO cohort data show that our methods are more powerful than the existing methods. © 2016 John Wiley & Sons Ltd/University College London.
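    The Poisson approximation that the abstract relies on can be illustrated numerically. The sketch below is not the PAST statistic itself (the abstract does not specify it); it only compares the exact distribution of a total minor-allele count with a Poisson distribution of the same mean. It assumes independent allele copies (two per individual per site), and the minor-allele frequencies, sample size, and function names are hypothetical.

```python
import math


def poisson_pmf(k: int, lam: float) -> float:
    """Probability that a Poisson(lam) variable equals k."""
    return math.exp(-lam) * lam ** k / math.factorial(k)


def poisson_binomial_pmf(probs: list[float]) -> list[float]:
    """Exact pmf of a sum of independent Bernoulli(p_i) variables.

    Built by dynamic programming: fold in one Bernoulli at a time.
    """
    pmf = [1.0]
    for p in probs:
        nxt = [0.0] * (len(pmf) + 1)
        for k, mass in enumerate(pmf):
            nxt[k] += mass * (1.0 - p)   # this allele copy is the major allele
            nxt[k + 1] += mass * p       # this allele copy is the minor allele
        pmf = nxt
    return pmf


# Hypothetical minor-allele frequencies for five rare variant sites.
mafs = [0.001, 0.002, 0.0005, 0.003, 0.001]
n_individuals = 100  # hypothetical sample size

# One Bernoulli trial per allele copy: 2 copies per individual per site.
probs = [p for p in mafs for _ in range(2 * n_individuals)]
lam = sum(probs)  # Poisson mean = expected total minor-allele count

exact = poisson_binomial_pmf(probs)
for k in range(6):
    print(f"k={k}: exact={exact[k]:.5f}  poisson={poisson_pmf(k, lam):.5f}")
```

    With small per-copy probabilities the two columns agree closely, which is the property that makes a Poisson-based test plausible for pooled rare variants. The agreement would degrade as allele frequencies grow.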

  1. The achievement impact of the inclusion model on the standardized test scores of general education students

    Science.gov (United States)

    Garrett-Rainey, Syrena

    The purpose of this study was to compare the achievement of general education students within regular education classes to the achievement of general education students in inclusion/co-teach classes to determine whether there was a significant difference in the achievement between the two groups. The school district's inclusion/co-teach model included ongoing professional development support for teachers and administrators. General education teachers, special education teachers, and teacher assistants collaborated to develop instructional strategies to provide additional remediation to help students to acquire the skills needed to master course content. This quantitative study reviewed the end-of-course test (EoCT) scores of Grade 10 physical science and math students within an urban school district. It is not known whether general education students in an inclusive/co-teach science or math course will demonstrate a higher achievement on the EoCT in math or science than students not in an inclusive/co-teach classroom setting. In addition, this study sought to determine if students classified as low socioeconomic status benefited from participating in co-teaching classrooms as evidenced by standardized tests. Inferential statistics were used to determine whether there was a significant difference between the achievements of the treatment group (inclusion/co-teach) and the control group (non-inclusion/co-teach). The findings can be used to provide school districts with optional instructional strategies to implement in diverse modern classroom settings to increase academic performance on state standardized tests.

  2. Science Teacher Efficacy and Outcome Expectancy as Predictors of Students' End-of-Instruction (EOI) Biology I Test Scores

    Science.gov (United States)

    Angle, Julie; Moseley, Christine

    2009-01-01

    The purpose of this study was to compare teacher efficacy beliefs of secondary Biology I teachers whose students' mean scores on the statewide End-of-Instruction (EOI) Biology I test met or exceeded the state academic proficiency level (Proficient Group) to teacher efficacy beliefs of secondary Biology I teachers whose students' mean scores on the…

  3. Utilizing the Six Realms of Meaning in Improving Campus Standardized Test Scores through Team Teaching and Strategic Planning

    Science.gov (United States)

    Stevenson, Rosnisha D.; Kritsonis, William Allan

    2009-01-01

    This article will seek to utilize Dr. William Allan Kritsonis' book "Ways of Knowing Through the Realms of Meaning" (2007) as a framework to improve a campus's standardized test scores, more specifically, their TAKS (Texas Assessment of Knowledge and Skills) scores. Many campuses have an improvement plan, also known as a Campus…

  4. Implications of Deployed and Nondeployed Fathers on Seventh Graders' California Achievement Test Scores during a Military Crisis.

    Science.gov (United States)

    Pisano, Mark C.

    The differences in California Achievement Test (CAT) scores from 1990 to 1991 in seventh graders, currently enrolled in Albritton Junior High School in the Fort Bragg Schools, of deployed and nondeployed fathers were analyzed. CAT percentile scores from 1990 and 1991 (1991 being the year of "Desert Storm") were obtained in reading, math…

  5. A Case Study About Why It Can Be Difficult To Test Whether Propensity Score Analysis Works in Field Experiments

    Directory of Open Access Journals (Sweden)

    William R. Shadish

    2013-02-01

    Peikes, Moreno and Orzol (2008) sensibly caution researchers that propensity score analysis may not lead to valid causal inference in field applications. But at the same time, they made the far stronger claim to have performed an ideal test of whether propensity score matching in quasi-experimental data is capable of approximating the results of a randomized experiment in their dataset, and that this ideal test showed that such matching could not do so. In this article we show that their study does not support that conclusion because it failed to meet a number of basic criteria for an ideal test. By implication, many other purported tests of the effectiveness of propensity score analysis probably also fail to meet these criteria, and are therefore questionable contributions to the literature on the effects of propensity score analysis. DOI: 10.2458/azu_jmmss.v3i2.16475

  6. A Case Study About Why It Can Be Difficult To Test Whether Propensity Score Analysis Works in Field Experiments

    Directory of Open Access Journals (Sweden)

    Thomas D. Cook

    2012-01-01

    Peikes, Moreno and Orzol (2008) sensibly caution researchers that propensity score analysis may not lead to valid causal inference in field applications. But at the same time, they made the far stronger claim to have performed an ideal test of whether propensity score matching in quasi-experimental data is capable of approximating the results of a randomized experiment in their dataset, and that this ideal test showed that such matching could not do so. In this article we show that their study does not support that conclusion because it failed to meet a number of basic criteria for an ideal test. By implication, many other purported tests of the effectiveness of propensity score analysis probably also fail to meet these criteria, and are therefore questionable contributions to the literature on the effects of propensity score analysis.

  7. Similar predictions of etravirine sensitivity regardless of genotypic testing method used: comparison of available scoring systems.

    Science.gov (United States)

    Vingerhoets, Johan; Nijs, Steven; Tambuyzer, Lotke; Hoogstoel, Annemie; Anderson, David; Picchio, Gaston

    2012-01-01

    The aims of this study were to compare various genotypic scoring systems commonly used to predict virological outcome to etravirine, and examine their concordance with etravirine phenotypic susceptibility. Six etravirine genotypic scoring systems were assessed: Tibotec 2010 (based on 20 mutations; TBT 20), Monogram, Stanford HIVdb, ANRS, Rega (based on 37, 30, 27 and 49 mutations, respectively) and virco(®)TYPE HIV-1 (predicted fold change based on genotype). Samples from treatment-experienced patients who participated in the DUET trials and with both genotypic and phenotypic data (n=403) were assessed using each scoring system. Results were retrospectively correlated with virological response in DUET. κ coefficients were calculated to estimate the degree of correlation between the different scoring systems. Correlation between the five scoring systems and the TBT 20 system was approximately 90%. Virological response by etravirine susceptibility was comparable regardless of which scoring system was utilized, with 70-74% of DUET patients determined as susceptible to etravirine by the different scoring systems achieving plasma viral load <50 HIV-1 RNA copies/ml. In samples classed as phenotypically susceptible to etravirine (fold change in 50% effective concentration ≤3), correlations with genotypic score were consistently high across scoring systems (≥70%). In general, the etravirine genotypic scoring systems produced similar results, and genotype-phenotype concordance was high. As such, phenotypic interpretations, and in their absence all genotypic scoring systems investigated, may be used to reliably predict the activity of etravirine.

  8. The TSCA interagency testing committee's approaches to screening and scoring chemicals and chemical groups: 1977-1983

    Energy Technology Data Exchange (ETDEWEB)

    Walker, J.D. [Environmental Protection Agency, Washington, DC (United States)]

    1990-12-31

    This paper describes the TSCA interagency testing committee's (ITC) approaches to screening and scoring chemicals and chemical groups between 1977 and 1983. During this time the ITC conducted five scoring exercises to select chemicals and chemical groups for detailed review and to determine which of these chemicals and chemical groups should be added to the TSCA Section 4(e) Priority Testing List. 29 refs., 1 fig., 2 tabs.

  9. The Dysexecutive Questionnaire advanced: item and test score characteristics, 4-factor solution, and severity classification.

    Science.gov (United States)

    Bodenburg, Sebastian; Dopslaff, Nina

    2008-01-01

    The Dysexecutive Questionnaire (DEX; Behavioral Assessment of the Dysexecutive Syndrome, 1996) is a standardized instrument to measure possible behavioral changes as a result of the dysexecutive syndrome. Although initially intended only as a qualitative instrument, the DEX has also been used increasingly to address quantitative problems. Until now there have been no fundamental statistical analyses of the questionnaire's testing quality. The present study is based on an unselected sample of 191 patients with acquired brain injury and reports on the data relating to the quality of the items, the reliability and the factorial structure of the DEX. Item 3 displayed too great an item difficulty, whereas item 11 was not sufficiently discriminating. The DEX's reliability in self-rating is r = 0.85. In addition to presenting the statistical values of the tests, a clinical severity classification of the overall scores of the 4 found factors and of the questionnaire as a whole is carried out on the basis of quartile standards.

  10. The use of test scores from large-scale assessment surveys: psychometric and statistical considerations

    Directory of Open Access Journals (Sweden)

    Henry Braun

    2017-11-01

    Background: Economists are making increasing use of measures of student achievement obtained through large-scale survey assessments such as NAEP, TIMSS, and PISA. The construction of these measures, employing plausible value (PV) methodology, is quite different from that of the more familiar test scores associated with assessments such as the SAT or ACT. These differences have important implications both for utilization and interpretation. Although much has been written about PVs, it appears that there are still misconceptions about whether and how to employ them in secondary analyses. Methods: We address a range of technical issues, including those raised in a recent article that was written to inform economists using these databases. First, an extensive review of the relevant literature was conducted, with particular attention to key publications that describe the derivation and psychometric characteristics of such achievement measures. Second, a simulation study was carried out to compare the statistical properties of estimates based on the use of PVs with those based on other, commonly used methods. Results: It is shown, through both theoretical analysis and simulation, that under fairly general conditions appropriate use of PVs yields approximately unbiased estimates of model parameters in regression analyses of large-scale survey data. The superiority of the PV methodology is particularly evident when measures of student achievement are employed as explanatory variables. Conclusions: The PV methodology used to report student test performance in large-scale surveys remains the state of the art for secondary analyses of these databases.

  11. Hippocampal dose volume histogram predicts Hopkins Verbal Learning Test scores after brain irradiation

    Directory of Open Access Journals (Sweden)

    Catherine Okoukoni, PhD

    2017-10-01

    Purpose: Radiation-induced cognitive decline is relatively common after treatment for primary and metastatic brain tumors; however, identifying dosimetric parameters that are predictive of radiation-induced cognitive decline is difficult due to the heterogeneity of patient characteristics. The memory function is especially susceptible to radiation effects after treatment. The objective of this study is to correlate volumetric radiation doses received by critical neuroanatomic structures to post–radiation therapy (RT) memory impairment. Methods and materials: Between 2008 and 2011, 53 patients with primary brain malignancies were treated with conventionally fractionated RT in prospectively accrued clinical trials performed at our institution. Dose-volume histogram analysis was performed for the hippocampus, parahippocampus, amygdala, and fusiform gyrus. Hopkins Verbal Learning Test-Revised scores were obtained at least 6 months after RT. Impairment was defined as an immediate recall score ≤15. For each anatomic region, serial regression was performed to correlate volume receiving a given dose (VD(Gy)) with memory impairment. Results: Hippocampal V53.4Gy to V60.9Gy significantly predicted post-RT memory impairment (P < .05). Within this range, the hippocampal V55Gy was the most significant predictor (P = .004). Hippocampal V55Gy of 0%, 25%, and 50% was associated with tumor-induced impairment rates of 14.9% (95% confidence interval [CI], 7.2%-28.7%), 45.9% (95% CI, 24.7%-68.6%), and 80.6% (95% CI, 39.2%-96.4%), respectively. Conclusions: The hippocampal V55Gy is a significant predictor for impairment, and a limiting dose below 55 Gy may minimize radiation-induced cognitive impairment.

  12. Rugby versus Soccer in South Africa: Content Familiarity Contributes to Cross-Cultural Differences in Cognitive Test Scores

    Science.gov (United States)

    Malda, Maike; van de Vijver, Fons J. R.; Temane, Q. Michael

    2010-01-01

    In this study, cross-cultural differences in cognitive test scores are hypothesized to depend on a test's cultural complexity (Cultural Complexity Hypothesis: CCH), here conceptualized as its content familiarity, rather than on its cognitive complexity (Spearman's Hypothesis: SH). The content familiarity of tests assessing short-term memory,…

  13. The Effect of Computer-Based Self-Access Learning on Weekly Vocabulary Test Scores

    Directory of Open Access Journals (Sweden)

    Jordan Dreyer

    2014-09-01

    This study sets out to clarify the effectiveness of using an online vocabulary study tool, Quizlet, in an urban high school language arts class. Previous similar studies have mostly dealt with English Language Learners in college settings (Chui, 2013), and were therefore not directed at the issue of self-efficacy that is at the heart of the problem of urban high school students in America entering remedial writing programs (Rose, 1989). The study involves 95 students over the course of 14 weeks. Students were tested weekly and were asked to use the Quizlet program in their own free time. The result of this optional involvement was that many students did not participate in the treatment and therefore acted as an elective control group. The resultant data show a strong correlation between the use of an online vocabulary review program and short-term vocabulary retention. The study also showed that students who paced themselves and spread out their study sessions outperformed those students who used the program only for last-minute “cram sessions.” The implications of the study are that students who take advantage of tools outside of the classroom are able to outperform their peers. The results are also in line with the call to include technology in the Basic Writing classroom not simply as a tool, but as a “form of discourse” (Jonaitis, 2012). Weekly vocabulary tests, combined with the daily online activity as reported by Quizlet, show that: (1) utilizing the review software improved the scores of most students, (2) those students who used Quizlet to review more than a single time (i.e., several days before the test) outperformed those who only used the product once, and (3) students who professed proficiency with the “notebook” system of vocabulary learning appeared not to need the treatment.

  14. Simple shoulder test and Oxford Shoulder Score: Persian translation and cross-cultural validation.

    Science.gov (United States)

    Naghdi, Soofia; Nakhostin Ansari, Noureddin; Rustaie, Nilufar; Akbari, Mohammad; Ebadi, Safoora; Senobari, Maryam; Hasson, Scott

    2015-12-01

    To translate, culturally adapt, and validate the simple shoulder test (SST) and Oxford Shoulder Score (OSS) into Persian language using a cross-sectional and prospective cohort design. A standard forward and backward translation was followed to culturally adapt the SST and the OSS into Persian language. Psychometric properties of floor and ceiling effects, construct convergent validity, discriminant validity, internal consistency reliability, test-retest reliability, standard error of the measurement (SEM), smallest detectable change (SDC), and factor structure were determined. One hundred patients with shoulder disorders and 50 healthy subjects participated in the study. The PSST and the POSS showed no missing responses. No floor or ceiling effects were observed. Both the PSST and POSS detected differences between patients and healthy subjects supporting their discriminant validity. Construct convergent validity was confirmed by a very good correlation between the PSST and POSS (r = 0.68). There was high internal consistency for both the PSST (α = 0.73) and the POSS (α = 0.91 and 0.92). Test-retest reliability with 1-week interval was excellent (ICCagreement = 0.94 for PSST and 0.90 for POSS). Factor analyses demonstrated a three-factor solution for the PSST (49.7 % of variance) and a two-factor solution for the POSS (61.6 % of variance). The SEM/SDC was satisfactory for PSST (5.5/15.3) and POSS (6.8/18.8). The PSST and POSS are valid and reliable outcome measures for assessing functional limitations in Persian-speaking patients with shoulder disorders.

  15. Sequential Neighborhood Effects: The Effect of Long-Term Exposure to Concentrated Disadvantage on Children's Reading and Math Test Scores.

    Science.gov (United States)

    Hicks, Andrew L; Handcock, Mark S; Sastry, Narayan; Pebley, Anne R

    2018-02-01

    Prior research has suggested that children living in a disadvantaged neighborhood have lower achievement test scores, but these studies typically have not estimated causal effects that account for neighborhood choice. Recent studies used propensity score methods to account for the endogeneity of neighborhood exposures, comparing disadvantaged and nondisadvantaged neighborhoods. We develop an alternative propensity function approach in which cumulative neighborhood effects are modeled as a continuous treatment variable. This approach offers several advantages. We use our approach to examine the cumulative effects of neighborhood disadvantage on reading and math test scores in Los Angeles. Our substantive results indicate that recency of exposure to disadvantaged neighborhoods may be more important than average exposure for children's test scores. We conclude that studies of child development should consider both average cumulative neighborhood exposure and the timing of this exposure.

  16. Linkage analysis in nuclear families. 2: Relationship between affected sib-pair tests and lod score analysis.

    Science.gov (United States)

    Knapp, M; Seuchter, S A; Baur, M P

    1994-01-01

    It is believed that the main advantage of affected sib-pair tests is that their application requires no information about the underlying genetic mechanism of the disease. However, here it is proved that the mean test, which can be considered the most prominent of the affected sib-pair tests, is equivalent to lod score analysis for an assumed recessive mode of inheritance, irrespective of the true mode of the disease. Further relationships of certain sib-pair tests and lod score analysis under specific assumed genetic modes are investigated.

  17. Evaluation of Factors Affecting Continuous Performance Test Identical Pairs Version Score of Schizophrenic Patients in a Japanese Clinical Sample

    Directory of Open Access Journals (Sweden)

    Takayoshi Koide

    2012-01-01

    Aim. Cognitive impairment in schizophrenia strongly relates to social outcome and is a good candidate for endophenotypes. When we accurately measure drug efficacy or effects of genes or variants relevant to schizophrenia on cognitive impairment, clinical factors that can affect scores on cognitive tests, such as age and severity of symptoms, should be considered. To elucidate the effect of clinical factors, we conducted multiple regression analysis using scores of the Continuous Performance Test Identical Pairs Version (CPT-IP), which is often used to measure attention/vigilance in schizophrenia. Methods. We conducted the CPT-IP (4-4 digit) and examined clinical information (sex, age, education years, onset age, duration of illness, chlorpromazine-equivalent dose, and Positive and Negative Symptom Scale (PANSS) scores) in 126 schizophrenia patients in a Japanese population. Multiple regression analysis was used to evaluate the effect of clinical factors. Results. Age, chlorpromazine-equivalent dose, and PANSS negative symptom score were associated with mean d′ score in patients. These three clinical factors explained about 28% of the variance in mean d′ score. Conclusions. CPT-IP score in schizophrenia patients is influenced by age, chlorpromazine-equivalent dose, and PANSS negative symptom score.

  18. The Sinonasal Outcome Test 22 score in persons without chronic rhinosinusitis

    DEFF Research Database (Denmark)

    Lange, Bibi; Thilsing, T; Baelum, J

    2016-01-01

    -67 with a mean score of 10.5 (CI: 9.1 - 11.9) and the median score was 7. Persons with allergic rhinitis and blue collar workers had a significantly higher score. CONCLUSION: The median value of 7 is taken as the normal SNOT 22 score in persons without CRS and can be used as a reference in clinical settings...... and research. Allergic rhinitis and occupation affect SNOT 22 in persons without CRS.

  19. Reliability characteristics and applicability of a repeated sprint ability test in male young soccer players

    DEFF Research Database (Denmark)

    Castagna, Carlo; Francini, Lorenzo; Krustrup, Peter

    2018-01-01

    The aim of this study was to examine the usefulness and reliability characteristics of a repeated sprint ability test considering 5 line sprints of 30-m interspersed with 30-s of active recovery in non-elite outfield young male soccer players. Twenty-six (age 14.9±1.2 years, height 1.72±0.12 cm......, body mass 62.2±5.1 kg) players were tested 48 hours and 7 days apart for 5x30-m performance over 5 trials (T1-T5). Short- (T1-T2) and long-term reliability (T1-T3-T4-T5) were assessed with Intraclass Correlation Coefficient (ICC) and with typical error for measurement (TEM). Short- and long...... study revealed that the 5x30-m sprint test is a reliable field test in the short and long-term when the sum of sprint times and the best sprint performance are considered as outcome variables. Sprint performance decrements variables showed large variability across trials....

  20. Survey of Expert Opinion on Intelligence: Causes of International Differences in Cognitive Ability Tests.

    Science.gov (United States)

    Rindermann, Heiner; Becker, David; Coyle, Thomas R

    2016-01-01

    Following Snyderman and Rothman (1987, 1988), we surveyed expert opinions on the current state of intelligence research. This report examines expert opinions on causes of international differences in student assessment and psychometric IQ test results. Experts were surveyed about the importance of culture, genes, education (quantity and quality), wealth, health, geography, climate, politics, modernization, sampling error, test knowledge, discrimination, test bias, and migration. The importance of these factors was evaluated for diverse countries, regions, and groups including Finland, East Asia, sub-Saharan Africa, Southern Europe, the Arabian-Muslim world, Latin America, Israel, Jews in the West, Roma (gypsies), and Muslim immigrants. Education was rated by N = 71 experts as the most important cause of international ability differences. Genes were rated as the second most relevant factor but also had the highest variability in ratings. Culture, health, wealth, modernization, and politics were the next most important factors, whereas other factors such as geography, climate, test bias, and sampling error were less important. The paper concludes with a discussion of limitations of the survey (e.g., response rates and validity of expert opinions).

  1. Exploration of analysis methods for diagnostic imaging tests: problems with ROC AUC and confidence scores in CT colonography.

    Science.gov (United States)

    Mallett, Susan; Halligan, Steve; Collins, Gary S; Altman, Doug G

    2014-01-01

    Different methods of evaluating diagnostic performance when comparing diagnostic tests may lead to different results. We compared two such approaches, sensitivity and specificity with area under the Receiver Operating Characteristic Curve (ROC AUC) for the evaluation of CT colonography for the detection of polyps, either with or without computer assisted detection. In a multireader multicase study of 10 readers and 107 cases we compared sensitivity and specificity, using radiological reporting of the presence or absence of polyps, to ROC AUC calculated from confidence scores concerning the presence of polyps. Both methods were assessed against a reference standard. Here we focus on five readers, selected to illustrate issues in design and analysis. We compared diagnostic measures within readers, showing that differences in results are due to statistical methods. Reader performance varied widely depending on whether sensitivity and specificity or ROC AUC was used. There were problems using confidence scores; in assigning scores to all cases; in use of zero scores when no polyps were identified; the bimodal non-normal distribution of scores; fitting ROC curves due to extrapolation beyond the study data; and the undue influence of a few false positive results. Variation due to use of different ROC methods exceeded differences between test results for ROC AUC. The confidence scores recorded in our study violated many assumptions of ROC AUC methods, rendering these methods inappropriate. The problems we identified will apply to other detection studies using confidence scores. We found sensitivity and specificity were a more reliable and clinically appropriate method to compare diagnostic tests.

  2. Influence of impulsivity-reflexivity when testing dynamic spatial ability: sex and g differences.

    Science.gov (United States)

    Quiroga, M Angeles; Hernández, José Manuel; Rubio, Victor; Shih, Pei Chun; Santacreu, José

    2007-11-01

    This work analyzes the possibility that the differences in the performance of men and women in dynamic spatial tasks such as the Spatial Orientation Dynamic Test-Revised (SODT-R; Santacreu & Rubio, 1998), obtained in previous works, are due to cognitive style (Reflexivity-Impulsivity) or to the speed-accuracy tradeoff (SATO) that the participants implement. If these differences are due to cognitive style, they would be independent of intelligence, whereas if they are due to SATO, they may be associated with intelligence. In this work, 1652 participants, 984 men and 668 women, ages between 18 and 55 years, were assessed. In addition to the SODT-R, the "Test de Razonamiento Analitico, Secuencial e Inductivo" (TRASI [Analytical, Sequential, and Inductive Reasoning Test]; Rubio & Santacreu, 2003) was administered as a measure of general intelligence. Impulsivity scores (Zi) of Salkind and Wright (1977) were used to analyze reflexivity-impulsivity and SATO. The results obtained indicate that (a) four performance groups can be identified: Fast-accurate, Slow-inaccurate, Impulsive, and Reflexive. The first two groups solve the task as a function of a competence variable and the last two as a function of a personality variable; (b) performance differences should be attributed to SATO; (c) SATO differs depending on sex and intelligence level.

  3. Conceptual Scoring and Classification Accuracy of Vocabulary Testing in Bilingual Children

    Science.gov (United States)

    Anaya, Jissel B.; Peña, Elizabeth D.; Bedore, Lisa M.

    2018-01-01

    Purpose: This study examined the effects of single-language and conceptual scoring on the vocabulary performance of bilingual children with and without specific language impairment. We assessed classification accuracy across 3 scoring methods. Method: Participants included Spanish-English bilingual children (N = 247) aged 5;1 (years;months) to…

  4. Assessment of emotion processing skills in acquired brain injury using an ability-based test of emotional intelligence.

    Science.gov (United States)

    Hall, Sarah E; Wrench, Joanne M; Wilson, Sarah J

    2018-04-01

    Social and emotional problems are commonly reported after moderate to severe acquired brain injury (ABI) and pose a significant barrier to rehabilitation. However, progress in assessment of emotional skills has been limited by a lack of validated measurement approaches. This study represents the first formal psychometric evaluation of the use of the Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT) V2.0 as a tool for assessing skills in perceiving, using, understanding and managing emotions following ABI. The sample consisted of 82 participants aged 18-80 years in the postacute phase of recovery (2 months-7 years) after moderate to severe ABI. Participants completed the MSCEIT V2.0 and measures of cognition and mood. Sociodemographic and clinical variables were collated from participant interview and medical files. Results revealed deficits across all MSCEIT subscales (approximately 1 SD below the normative mean). Internal consistency was adequate at overall, area, and branch levels, and MSCEIT scores correlated in expected ways with key demographic, clinical, cognitive, and mood variables. MSCEIT performance was related to injury severity and clinician-rated functioning after ABI. Confirmatory factor analysis favored a 3-factor model of EI due to statistical redundancy of the Using Emotions branch. Overall, these findings suggest that the MSCEIT V2.0 is sensitive to emotion processing deficits after moderate to severe ABI, and can yield valid and reliable scores in an ABI sample. In terms of theoretical contributions, our findings support a domain-based, 3-factor approach for characterizing emotion-related abilities in brain-injured individuals. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  5. Mathematical (Dis)abilities Within the Opportunity-Propensity Model: The Choice of Math Test Matters.

    Science.gov (United States)

    Baten, Elke; Desoete, Annemie

    2018-01-01

    This study examined individual differences in mathematics learning by combining antecedent (A), opportunity (O), and propensity (P) indicators within the Opportunity-Propensity Model. Although there is already some evidence for this model based on secondary datasets, there currently is no primary data available that simultaneously takes into account A, O, and P factors in children with and without Mathematical Learning Disabilities (MLD). Therefore, the mathematical abilities of 114 school-aged children (grade 3 till 6) with and without MLD were analyzed and combined with information retrieved from standardized tests and questionnaires. Results indicated significant differences in personality, motivation, temperament, subjective well-being, self-esteem, self-perceived competence, and parental aspirations when comparing children with and without MLD. In addition, A, O, and P factors were found to underlie mathematical abilities and disabilities. For the A factors, parental aspirations explained about half of the variance in fact retrieval speed in children without MLD, and SES was especially involved in the prediction of procedural accuracy in general. Teachers' experience contributed as O factor and explained about 6% of the variance in mathematical abilities. P indicators explained between 52 and 69% of the variance, with especially intelligence as overall significant predictor. Indirect effects pointed towards the interrelatedness of the predictors and the value of including A, O, and P indicators in a comprehensive model. The role parental aspirations played in fact retrieval speed was partially mediated through the self-perceived competence of the children, whereas the effect of SES on procedural accuracy was partially mediated through intelligence in children of both groups and through working memory capacity in children with MLD. Moreover, in line with the componential structure of mathematics, our findings were dependent on the math task used. 
    Different A, O…

  6. Test Ability of Chiller Post Re-functioning at The Facility of KH-IPSB3

    International Nuclear Information System (INIS)

    Harahap, Sentot-Alibasya; M-Taufik-Arsyad; Djunaidi

    2006-01-01

    The chiller water unit (CWU) supplies chilled water for the ventilation system, for the cooler of the canal water (T.C) purification system, and for the spent-fuel storage pool at the KH-IPSB3 facility. The CWU was re-functioned by changing its control system from electronic mode to electro-mechanical mode and by replacing other damaged parts. A chiller water unit has several components whose operation is integrated with one another, so in a partial re-functioning such as this one, where some components are old and others are new, optimum performance in the ability test is relatively difficult to obtain. With the old components in operation, optimum performance is expected to be reached through a phased and planned process. (author)

  7. Comparison of the diagnostic ability of Moorfield’s regression analysis and glaucoma probability score using Heidelberg retinal tomograph III in eyes with primary open angle glaucoma

    Science.gov (United States)

    Jindal, Shveta; Dada, Tanuj; Sreenivas, V; Gupta, Viney; Sihota, Ramanjit; Panda, Anita

    2010-01-01

    Purpose: To compare the diagnostic performance of the Heidelberg retinal tomograph (HRT) glaucoma probability score (GPS) with that of Moorfield’s regression analysis (MRA). Materials and Methods: The study included 50 eyes of normal subjects and 50 eyes of subjects with early-to-moderate primary open angle glaucoma. Images were obtained by using HRT version 3.0. Results: The agreement coefficient (weighted k) for the overall MRA and GPS classification was 0.216 (95% CI: 0.119 – 0.315). The sensitivity and specificity were evaluated using the most specific (borderline results included as test negatives) and least specific criteria (borderline results included as test positives). The MRA sensitivity and specificity were 30.61 and 98% (most specific) and 57.14 and 98% (least specific). The GPS sensitivity and specificity were 81.63 and 73.47% (most specific) and 95.92 and 34.69% (least specific). The MRA gave a higher positive likelihood ratio (28.57 vs. 3.08) and the GPS gave a higher negative likelihood ratio (0.25 vs. 0.44). The sensitivity increased with increasing disc size for both MRA and GPS. Conclusions: There was a poor agreement between the overall MRA and GPS classifications. GPS tended to have higher sensitivities, lower specificities, and lower likelihood ratios than the MRA. The disc size should be taken into consideration when interpreting the results of HRT, as both the GPS and MRA showed decreased sensitivity for smaller discs and the GPS showed decreased specificity for larger discs. PMID:20952832
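
    The likelihood ratios quoted in the abstract above can be checked from the reported sensitivity and specificity values. The following is a minimal sketch, assuming the standard definitions LR+ = sensitivity / (1 - specificity) and LR- = (1 - sensitivity) / specificity; the function and variable names are illustrative only. The pairing of each reported ratio with a particular criterion (MRA with the least specific criterion, GPS with the most specific) is inferred from the arithmetic and is not stated in the abstract.

```python
def likelihood_ratios(sensitivity, specificity):
    """Return (LR+, LR-) for sensitivity and specificity given as fractions."""
    lr_pos = sensitivity / (1.0 - specificity)
    lr_neg = (1.0 - sensitivity) / specificity
    return lr_pos, lr_neg


# Percentages from the abstract, converted to fractions.
# Assumption: these are the criteria behind the reported ratios.
mra_least_specific = likelihood_ratios(0.5714, 0.98)
gps_most_specific = likelihood_ratios(0.8163, 0.7347)

print("MRA LR+, LR-:", [round(x, 2) for x in mra_least_specific])
print("GPS LR+, LR-:", [round(x, 2) for x in gps_most_specific])
```

    Under these assumptions the computed values agree, after rounding to two decimals, with the ratios given in the abstract (28.57 vs. 3.08 and 0.25 vs. 0.44).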

  8. Comparison of the diagnostic ability of Moorfield′s regression analysis and glaucoma probability score using Heidelberg retinal tomograph III in eyes with primary open angle glaucoma

    Directory of Open Access Journals (Sweden)

    Jindal Shveta

    2010-01-01

    Full Text Available Purpose: To compare the diagnostic performance of the Heidelberg retinal tomograph (HRT) glaucoma probability score (GPS) with that of Moorfield′s regression analysis (MRA). Materials and Methods: The study included 50 eyes of normal subjects and 50 eyes of subjects with early-to-moderate primary open angle glaucoma. Images were obtained by using HRT version 3.0. Results: The agreement coefficient (weighted k) for the overall MRA and GPS classification was 0.216 (95% CI: 0.119 - 0.315). The sensitivity and specificity were evaluated using the most specific (borderline results included as test negatives) and least specific criteria (borderline results included as test positives). The MRA sensitivity and specificity were 30.61 and 98% (most specific) and 57.14 and 98% (least specific). The GPS sensitivity and specificity were 81.63 and 73.47% (most specific) and 95.92 and 34.69% (least specific). The MRA gave a higher positive likelihood ratio (28.57 vs. 3.08) and the GPS gave a higher negative likelihood ratio (0.25 vs. 0.44). The sensitivity increased with increasing disc size for both MRA and GPS. Conclusions: There was a poor agreement between the overall MRA and GPS classifications. GPS tended to have higher sensitivities, lower specificities, and lower likelihood ratios than the MRA. The disc size should be taken into consideration when interpreting the results of HRT, as both the GPS and MRA showed decreased sensitivity for smaller discs and the GPS showed decreased specificity for larger discs.

  9. The Validity of Graduate Management Admission Test Scores: A Summary of Studies Conducted from 1997 to 2004

    Science.gov (United States)

    Talento-Miller, Eileen; Rudner, Lawrence M.

    2008-01-01

    The validity of Graduate Management Admission Test (GMAT) scores is examined by summarizing 273 studies conducted between 1997 and 2004. Each of the studies was conducted through the Validity Study Service of the test sponsor and contained identical variables and statistical methods. Validity coefficients from each of the studies were corrected…

  10. Predicting Motor Skills from Strengths and Difficulties Questionnaire Scores, Language Ability, and Other Features of New Zealand Children Entering Primary School

    Science.gov (United States)

    Sargisson, Rebecca J.; Powell, Cheniel; Stanley, Peter; de Candole, Rosalind

    2014-01-01

    The motor and language skills, emotional and behavioural problems of 245 children were measured at school entry. Fine motor scores were significantly predicted by hyperactivity, phonetic awareness, prosocial behaviour, and the presence of medical problems. Gross motor scores were significantly predicted by the presence of medical problems. The…

  11. Power and sample size evaluation for the Cochran-Mantel-Haenszel mean score (Wilcoxon rank sum) test and the Cochran-Armitage test for trend.

    Science.gov (United States)

    Lachin, John M

    2011-11-10

    The power of a chi-square test, and thus the required sample size, are a function of the noncentrality parameter that can be obtained as the limiting expectation of the test statistic under an alternative hypothesis specification. Herein, we apply this principle to derive simple expressions for two tests that are commonly applied to discrete ordinal data. The Wilcoxon rank sum test for the equality of distributions in two groups is algebraically equivalent to the Mann-Whitney test. The Kruskal-Wallis test applies to multiple groups. These tests are equivalent to a Cochran-Mantel-Haenszel mean score test using rank scores for a set of C-discrete categories. Although various authors have assessed the power function of the Wilcoxon and Mann-Whitney tests, herein it is shown that the power of these tests with discrete observations, that is, with tied ranks, is readily provided by the power function of the corresponding Cochran-Mantel-Haenszel mean scores test for two and R > 2 groups. These expressions yield results virtually identical to those derived previously for rank scores and also apply to other score functions. The Cochran-Armitage test for trend assesses whether there is a monotonically increasing or decreasing trend in the proportions with a positive outcome or response over the C-ordered categories of an ordinal independent variable, for example, dose. Herein, it is shown that the power of the test is a function of the slope of the response probabilities over the ordinal scores assigned to the groups that yields simple expressions for the power of the test. Copyright © 2011 John Wiley & Sons, Ltd.
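
    To illustrate the opening statement that power is a function of the noncentrality parameter, the sketch below covers only the 1-degree-of-freedom case, where a noncentral chi-square variable is the square of a standard normal variable shifted by the square root of the noncentrality. The function names are hypothetical, and the paper's own expressions for obtaining the noncentrality parameter from a study design are not reproduced here.

```python
from math import sqrt
from statistics import NormalDist


def chi_square_power_1df(noncentrality, alpha=0.05):
    """Power of a 1-df chi-square test for a given noncentrality parameter.

    Uses the identity that a 1-df noncentral chi-square variable equals
    (Z + sqrt(noncentrality))**2 with Z standard normal.
    """
    std_normal = NormalDist()
    critical_z = std_normal.inv_cdf(1.0 - alpha / 2.0)
    shift = sqrt(noncentrality)
    return std_normal.cdf(shift - critical_z) + std_normal.cdf(-shift - critical_z)


def noncentrality_for_power_1df(power, alpha=0.05):
    """Approximate noncentrality needed for a target power (1 df).

    Ignores the negligible far-tail term, so the achieved power is very
    slightly above the target.
    """
    std_normal = NormalDist()
    return (std_normal.inv_cdf(1.0 - alpha / 2.0) + std_normal.inv_cdf(power)) ** 2


required = noncentrality_for_power_1df(0.80)
print(f"noncentrality for 80% power: {required:.3f}")
print(f"power at that noncentrality: {chi_square_power_1df(required):.4f}")
```

    With a noncentrality of zero the function returns the significance level itself, which is a quick sanity check on the formula.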

  12. A general equation to obtain multiple cut-off scores on a test from multinomial logistic regression.

    Science.gov (United States)

    Bersabé, Rosa; Rivas, Teresa

    2010-05-01

    The authors derive a general equation to compute multiple cut-offs on a total test score in order to classify individuals into more than two ordinal categories. The equation is derived from the multinomial logistic regression (MLR) model, which is an extension of the binary logistic regression (BLR) model to accommodate polytomous outcome variables. From this analytical procedure, cut-off scores are established at the test score (the predictor variable) at which an individual is as likely to be in category j as in category j+1 of an ordinal outcome variable. The application of the complete procedure is illustrated by an example with data from an actual study on eating disorders. In this example, two cut-off scores on the Eating Attitudes Test (EAT-26) scores are obtained in order to classify individuals into three ordinal categories: asymptomatic, symptomatic and eating disorder. Diagnoses were made from the responses to a self-report (Q-EDD) that operationalises DSM-IV criteria for eating disorders. Alternatives to the MLR model to set multiple cut-off scores are discussed.
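
    The cut-off rule described above (the score at which an individual is as likely to be in category j as in category j+1) can be sketched for a baseline-category multinomial logistic model with a single predictor. If log(P(category j) / P(reference)) = a_j + b_j x, the two probabilities are equal where a_j + b_j x = a_{j+1} + b_{j+1} x. This is a minimal sketch under that assumed parameterization, not necessarily the authors' published equation; the function name and the coefficients are hypothetical.

```python
def mlr_cutoffs(intercepts, slopes):
    """Cut-off scores between adjacent categories of a one-predictor MLR model.

    intercepts[j] and slopes[j] define
        log(P(category j) / P(reference)) = intercepts[j] + slopes[j] * x,
    with the reference category listed first as intercept 0.0 and slope 0.0.
    """
    if len(intercepts) != len(slopes):
        raise ValueError("intercepts and slopes must have the same length")
    cutoffs = []
    for j in range(len(intercepts) - 1):
        slope_gap = slopes[j + 1] - slopes[j]
        if slope_gap == 0:
            raise ValueError("adjacent categories with equal slopes have no cut-off")
        # Solve a_j + b_j * x = a_{j+1} + b_{j+1} * x for x.
        cutoffs.append((intercepts[j] - intercepts[j + 1]) / slope_gap)
    return cutoffs


# Hypothetical coefficients for three ordinal categories (not taken from the study).
example_cutoffs = mlr_cutoffs([0.0, -4.0, -10.0], [0.0, 0.2, 0.4])
print(example_cutoffs)
```

    With these made-up coefficients, scores below the first cut-off favour the first category, scores between the two cut-offs favour the middle category, and scores above the second cut-off favour the last.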

  13. The Effects of Teaching Descriptive Geometry in General Engineering 103 on Spatial Relations Tests Scores.

    Science.gov (United States)

    Stallings, William M.

    It was hypothesized that instruction in descriptive geometry produces an increase in SRT scores. The resultant data do not firmly support this hypothesis. It is suggested that this study be replicated with the use of randomly selected control groups. (MS)

  14. Testing the ability of a semidistributed hydrological model to simulate contributing area

    Science.gov (United States)

    Mengistu, S. G.; Spence, C.

    2016-06-01

    A dry climate, the prevalence of small depressions, and the lack of a well-developed drainage network are characteristics of environments with extremely variable contributing areas to runoff. These types of regions arguably present the greatest challenge to properly understanding catchment streamflow generation processes. Previous studies have shown that contributing area dynamics are important for streamflow response, but the nature of the relationship between the two is not typically understood. Furthermore, it is not often tested how well hydrological models simulate contributing area. In this study, the ability of a semidistributed hydrological model, the PDMROF configuration of Environment Canada's MESH model, was tested to determine if it could simulate contributing area. The study focused on the St. Denis Creek watershed in central Saskatchewan, Canada, which with its considerable topographic depressions, exhibits wide variation in contributing area, making it ideal for this type of investigation. MESH-PDMROF was able to replicate contributing area derived independently from satellite imagery. Daily model simulations revealed a hysteretic relationship between contributing area and streamflow not apparent from the less frequent remote sensing observations. This exercise revealed that contributing area extent can be simulated by a semi-distributed hydrological model with a scheme that assumes storage capacity distribution can be represented with a probability function. However, further investigation is needed to determine if it can adequately represent the complex relationship between streamflow and contributing area that is such a key signature of catchment behavior.

  15. Lead exposure and the 2010 achievement test scores of children in New York counties

    Directory of Open Access Journals (Sweden)

    Strayhorn Jillian C

    2012-01-01

    Full Text Available Background: Lead is toxic to cognitive and behavioral functioning in children even at levels well below those producing physical symptoms. Continuing efforts in the U.S. since about the 1970s to reduce lead exposure in children have dramatically reduced the incidence of elevated blood lead levels (with elevated levels defined by the current U.S. Centers for Disease Control threshold of 10 μg/dl). The current study examines how much lead toxicity continues to impair the academic achievement of children of New York State, using 2010 test data. Methods: This study relies on three sets of data published for the 57 New York counties outside New York City: school achievement data from the New York State Department of Education, data on incidence of elevated blood lead levels from the New York State Department of Health, and data on income from the U.S. Census Bureau. We studied third grade and eighth grade test scores in English Language Arts and mathematics. Using the county as the unit of analysis, we computed bivariate correlations and regression coefficients, with percent of children achieving at the lowest reported level as the dependent variable and the percent of preschoolers in the county with elevated blood lead levels as the independent variable. Then we repeated those analyses using partial correlations to control for possible confounding effects of family income, and using multiple regressions with income included. Results: The bivariate correlations between incidence of elevated lead and number of children in the lowest achievement group ranged between 0.38 and 0.47. The partial correlations ranged from 0.29 to 0.40. The regression coefficients, both bivariate and partial (both estimating the increase in percent of children in the lowest achievement group for every percent increase in the children with elevated blood lead levels), ranged from 0.52 to 1.31. All regression coefficients, when rounded to the nearest integer, were
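
    The step from bivariate to partial correlations can be illustrated with the standard first-order partial correlation formula, which removes the linear influence of one control variable (here, income) from the association between two others. The sketch below is a generic illustration; the function name is hypothetical and the input correlations are made-up values, not figures from the study.

```python
from math import sqrt


def partial_correlation(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y, controlling for z.

    r_xy, r_xz and r_yz are the three pairwise Pearson correlations.
    """
    denominator = sqrt((1.0 - r_xz ** 2) * (1.0 - r_yz ** 2))
    if denominator == 0:
        raise ValueError("z is perfectly correlated with x or y")
    return (r_xy - r_xz * r_yz) / denominator


# Hypothetical pairwise correlations (x = lead, y = low achievement, z = income).
adjusted = partial_correlation(r_xy=0.5, r_xz=0.3, r_yz=0.4)
print(f"partial correlation: {adjusted:.3f}")
```

    When the control variable is uncorrelated with both other variables, the partial correlation equals the bivariate one, which matches the intuition that there is then nothing to adjust for.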

  16. Longitudinal analysis of standardized test scores of students in the Science Writing Heuristic approach

    Science.gov (United States)

    Chanlen, Niphon

    The purpose of this study was to examine the longitudinal impacts of the Science Writing Heuristic (SWH) approach on student science achievement measured by the Iowa Test of Basic Skills (ITBS). A number of studies have reported positive impact of an inquiry-based instruction on student achievement, critical thinking skills, reasoning skills, attitude toward science, etc. So far, studies have focused on exploring how an intervention affects student achievement using teacher/researcher-generated measurement. Only a few studies have attempted to explore the long-term impacts of an intervention on student science achievement measured by standardized tests. The students' science and reading ITBS data were collected from 2000 to 2011 from a school district which had adopted the SWH approach as the main approach in science classrooms since 2002. The data consisted of 12,350 data points from 3,039 students. The multilevel model for change with discontinuity in elevation and slope technique was used to analyze changes in student science achievement growth trajectories before and after adopting the SWH approach. The results showed that the SWH approach positively impacted students by initially raising science achievement scores. The initial impact was maintained and gradually increased when students were continuously exposed to the SWH approach. Disadvantaged students who were at risk of having low science achievement had bigger benefits from experience with the SWH approach. As a result, existing problematic achievement gaps were narrowed down. Moreover, students who started experience with the SWH approach as early as elementary school seemed to have better science achievement growth compared to students who started experiencing the SWH approach only in high school. The results found in this study not only confirmed the positive impacts of the SWH approach on student achievement, but also demonstrated additive impacts found when students had longitudinal experiences

  17. Test Review: Wechsler, D., & Naglieri, J.A. (2006). "Wechsler Nonverbal Scale of Ability". San Antonio, TX--Harcourt Assessment

    Science.gov (United States)

    Massa, Idalia; Rivera, Vivina

    2009-01-01

    This article provides a review of the Wechsler Nonverbal Scale of Ability (WNV), a general cognitive ability assessment tool for individuals aged 4 years 0 months through 21 years 11 months with English language and/or communicative limitations. The test targets a population whose performance on intelligence batteries might be compromised by…

  18. A Comparison of Scores on the WISC-R and Lorge-Thorndike Intelligence Test for Disadvantaged Black Elementary School Children

    Science.gov (United States)

    Lowe, James D.; Karnes, Frances A.

    1976-01-01

    It is indicated that, although the scores [obtained on both tests] are significantly correlated, the tests yield significantly different scores with the Lorge-Thorndike consistently overestimating the WISC-R full scale I.Q. (Author)

  19. Critique of the Watson-Glaser Critical Thinking Appraisal Test: The More You Know, the Lower Your Score

    Directory of Open Access Journals (Sweden)

    Kevin Possin

    2014-12-01

    Full Text Available The Watson-Glaser Critical Thinking Appraisal Test is one of the oldest, most frequently used, multiple-choice critical-thinking tests on the market in business, government, and legal settings for purposes of hiring and promotion. I demonstrate, however, that the test has serious construct-validity issues, stemming primarily from its ambiguous, unclear, misleading, and sometimes mysterious instructions, which have remained unaltered for decades. Erroneously scored items further diminish the test’s validity. As a result, having enhanced knowledge of formal and informal logic could well result in test subjects receiving lower scores on the test. That’s not how things should work for a CT assessment test.

  20. The Extent to Which TOEFL iBT Speaking Scores Are Associated with Performance on Oral Language Tasks and Oral Ability Components for Japanese University Students

    Science.gov (United States)

    Ockey, Gary J.; Koyama, Dennis; Setoguchi, Eric; Sun, Angela

    2015-01-01

    The purpose of this study was to determine the extent to which performance on the TOEFL iBT speaking section is associated with other indicators of Japanese university students' abilities to communicate orally in an academic English environment and to determine which components of oral ability for these tasks are best assessed by TOEFL iBT. To…

  1. Validity and predictive ability of the juvenile arthritis disease activity score based on CRP versus ESR in a Nordic population-based setting

    DEFF Research Database (Denmark)

    Nordal, E B; Zak, M; Aalto, K

    2012-01-01

    To compare the juvenile arthritis disease activity score (JADAS) based on C reactive protein (CRP) (JADAS-CRP) with JADAS based on erythrocyte sedimentation rate (ESR) (JADAS-ESR) and to validate JADAS in a population-based setting.

  2. Developing a Numerical Ability Test for Students of Education in Jordan: An Application of Item Response Theory

    Science.gov (United States)

    Abed, Eman Rasmi; Al-Absi, Mohammad Mustafa; Abu shindi, Yousef Abdelqader

    2016-01-01

    The purpose of the present study is to develop a test to measure the numerical ability of students of education. The sample of the study consisted of 504 students from 8 universities in Jordan. The final draft of the test contains 45 items distributed among 5 dimensions. The results revealed acceptable psychometric properties of the test;…

  3. An Operational Definition of Learning Disabilities (Cognitive Domain) Using WISC Full Scale IQ and Peabody Individual Achievement Test Scores.

    Science.gov (United States)

    Brenton, Beatrice White; Gilmore, Doug

    An operational index of discrepancy between ability and achievement using the Wechsler Intelligence Scale for Children and the Peabody Individual Achievement Test (PIAT) was tested with 50 male and 10 female legally identified learning disabled (LD) children (mean age 9 years 2 months). Use of the index identified 74% of the males and 30% of the…

  4. CaPTHUS scoring model in primary hyperparathyroidism: can it eliminate the need for ioPTH testing?

    Science.gov (United States)

    Elfenbein, Dawn M; Weber, Sara; Schneider, David F; Sippel, Rebecca S; Chen, Herbert

    2015-04-01

    The CaPTHUS model was reported to have a positive predictive value of 100 % to correctly predict single-gland disease in patients with primary hyperparathyroidism, thus obviating the need for intraoperative parathyroid hormone (ioPTH) testing. We sought to apply the CaPTHUS scoring model in our patient population and assess its utility in predicting long-term biochemical cure. We retrospectively reviewed all parathyroidectomies for primary hyperparathyroidism performed at our university hospital from 2003 to 2012. We routinely perform ioPTH testing. Biochemical cure was defined as a normal calcium level at 6 months. A total of 1,421 patients met the inclusion criteria: 78 % of patients had a single adenoma at the time of surgery, 98 % had a normal serum calcium at 1 week postoperatively, and 96 % had a normal serum calcium level 6 months postoperatively. Using the CaPTHUS scoring model, 307 patients (22.5 %) had a score of ≥ 3, with a positive predictive value of 91 % for single adenoma. A CaPTHUS score of ≥ 3 had a positive predictive value of 98 % for biochemical cure at 1 week as well as at 6 months. In our population, where ioPTH testing is used routinely to guide use of bilateral exploration, patients with a preoperative CaPTHUS score of ≥ 3 had good long-term biochemical cure rates. However, the model only predicted adenoma in 91 % of cases. If minimally invasive parathyroidectomy without ioPTH testing had been done for these patients, the cure rate would have dropped from 98 % to an unacceptable 89 %. Even in these patients with high CaPTHUS scores, multigland disease is present in almost 10 %, and ioPTH testing is necessary.

  5. Testing measurement invariance of the schizotypal personality questionnaire-brief scores across Spanish and Swiss adolescents.

    Directory of Open Access Journals (Sweden)

    Javier Ortuño-Sierra

    Full Text Available BACKGROUND: Schizotypy is a complex construct intimately related to psychosis. Empirical evidence indicates that participants with high scores on schizotypal self-report are at a heightened risk for the later development of psychotic disorders. Schizotypal experiences represent the behavioural expression of liability for psychotic disorders. Previous factorial studies have shown that schizotypy is a multidimensional construct similar to that found in patients with schizophrenia. Specifically, using the Schizotypal Personality Questionnaire-Brief (SPQ-B), the three-dimensional model has been widely replicated. However, there has been no in-depth investigation of whether the dimensional structure underlying the SPQ-B scores is invariant across countries. METHODS: The main goal of this study was to examine the measurement invariance of the SPQ-B scores across Spanish and Swiss adolescents. The final sample was made up of 261 Spanish participants (51.7% men; M = 16.04 years) and 241 Swiss participants (52.3% men; M = 15.94 years). RESULTS: The results indicated that Raine et al.'s three-factor model presented adequate goodness-of-fit indices. Moreover, the results supported the measurement invariance (configural and partial strong invariance) of the SPQ-B scores across the two samples. Spanish participants scored higher on Interpersonal dimension than Swiss when latent means were compared. DISCUSSION: The study of measurement equivalence across countries provides preliminary evidence for the Raine et al.'s three-factor model and of the cross-cultural validity of the SPQ-B scores in adolescent population. Future studies should continue to examine the measurement invariance of the schizotypy and psychosis-risk syndromes across cultures.

  6. Use of transdermal and intravenous granisetron and the ability of the Hesketh score to assess nausea and vomiting induced by multiday chemotherapy

    Directory of Open Access Journals (Sweden)

    Boccia RV

    2012-07-01

    Full Text Available Ralph V Boccia,1 Gemma Clark,2 Julian D Howell2; 1Center for Cancer and Blood Disorders, Bethesda, MD, USA; 2ProStrakan Pharmaceuticals, Galashiels, UK. Purpose: Hesketh scores define emetogenicity of single-agent and multiagent single-day chemotherapy. This analysis determined the emetogenicity of multiagent, multiday chemotherapy and the Granisetron Transdermal System (GTDS; Sancuso®). Methods: This was a retrospective analysis of a multicenter, randomized, double-blind, phase III noninferiority trial of GTDS versus oral granisetron in patients receiving 3 days of multiagent moderately or highly emetogenic chemotherapy, regardless of granisetron formulation. Emesis was defined as vomiting/retching or the use of rescue medication. Logistic regression and classification trees were used to determine the optimal combination of Hesketh scores over the multiagent, multiday regimens for the prediction of emesis. Results: Of 393 patients, 272 (69.2%) were chemotherapy naïve. The most common types of cancer were lung (30.5%) and gynecologic (21.9%). The most common chemotherapeutic regimen (in 14.2% of patients) was cisplatin plus etoposide on days 1–3. The best binary emesis predictor was day 1 Hesketh score. Patients with a day 1 Hesketh score of 5 had the highest rate of emesis (62.5%) versus patients with a score < 5 (31.7%). For patients with day 1 Hesketh score < 5, only 14.3% of those receiving only one drug on day 1 experienced emesis. Conclusion: Hesketh emetogenicity scores of individual agents are applicable to multiday, multiagent chemotherapeutic regimens in patients receiving antiemetics. Keywords: chemotherapy-induced nausea and vomiting, emetogenicity, granisetron, clinical trial, retrospective analysis

  7. Differences in distribution of T-scores and Z-scores among bone densitometry tests in postmenopausal women (a comparative study)

    International Nuclear Information System (INIS)

    Wendlova, J.

    2002-01-01

    To determine the character of T-score and Z-score value distribution in individually selected methods of bone densitometry and to compare them using statistical analysis. We examined 56 postmenopausal women aged between 43 and 68 years with osteopenia or osteoporosis according to the WHO classification. The following measurements were made in each patient: T-score and Z-score for: 1) Stiffness index (S) of the left heel bone, USM (index); 2) Bone mineral density of the left heel bone (BMDh), DEXA (g of Ca hydroxyapatite per cm²); 3) Bone mineral density of trabecular bone of the L1 vertebra (BMDL1), QCT (mg of Ca hydroxyapatite per cm³). The densitometers used in the study were: ultrasonometer to measure heel bone, Achilles Plus, LUNAR, USA; DEXA to measure heel bone, PIXI, LUNAR, USA; QCT to measure the L1 vertebra, CT, SOMATOM Plus, Siemens, Germany. Statistical analysis: differences between measured values of T-scores (Z-scores) were evaluated by parametric or non-parametric methods of determining the 95 % confidence intervals (C.I.). Differences between Z-score and T-score values for compared measurements were statistically significant; however, these differences were lower for Z-scores. Largest differences in 95 % C.I., characterizing individual measurements of T-score values (in comparison with Z-scores), were found for those densitometers whose age range of the reference groups of young adults differed the most, and conversely, the smallest differences in T-score values were found when the differences between the age ranges of reference groups were smallest. The higher variation in T-score values in comparison to Z-scores is also caused by a non-standard selection of the reference groups of young adults for the QCT, PIXI and Achilles Plus densitometers used in the study. Age characteristics of the reference group for T-scores should be standardized for all types of densitometers. (author)
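
    The dependence of T-scores on the chosen reference group follows directly from how the two scores are conventionally defined: both are standard scores, with the T-score referenced to a young-adult population and the Z-score to an age-matched population. The sketch below uses that conventional definition; the function name and all reference values are hypothetical and are not taken from the study or from any densitometer's database.

```python
def standard_score(measured, reference_mean, reference_sd):
    """Number of reference standard deviations between a measurement and the reference mean."""
    if reference_sd <= 0:
        raise ValueError("reference_sd must be positive")
    return (measured - reference_mean) / reference_sd


# Hypothetical heel-bone BMD measurement and reference populations (g/cm^2).
measured_bmd = 0.40
t_score = standard_score(measured_bmd, reference_mean=0.50, reference_sd=0.05)  # young adults
z_score = standard_score(measured_bmd, reference_mean=0.45, reference_sd=0.05)  # age-matched

print(f"T-score = {t_score:.1f}, Z-score = {z_score:.1f}")
```

    Changing only the young-adult reference mean or standard deviation changes the T-score for the same measurement, which is consistent with the abstract's call to standardize the reference group across densitometers.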

  8. The current ability to test theories of gravity with black hole shadows

    Science.gov (United States)

    Mizuno, Yosuke; Younsi, Ziri; Fromm, Christian M.; Porth, Oliver; De Laurentis, Mariafelicia; Olivares, Hector; Falcke, Heino; Kramer, Michael; Rezzolla, Luciano

    2018-04-01

    Our Galactic Centre, Sagittarius A*, is believed to harbour a supermassive black hole, as suggested by observations tracking individual orbiting stars [1,2]. Upcoming submillimetre very-long baseline interferometry images of Sagittarius A* carried out by the Event Horizon Telescope collaboration (EHTC) [3,4] are expected to provide critical evidence for the existence of this supermassive black hole [5,6]. We assess our present ability to use EHTC images to determine whether they correspond to a Kerr black hole as predicted by Einstein's theory of general relativity or to a black hole in alternative theories of gravity. To this end, we perform general-relativistic magnetohydrodynamical simulations and use general-relativistic radiative-transfer calculations to generate synthetic shadow images of a magnetized accretion flow onto a Kerr black hole. In addition, we perform these simulations and calculations for a dilaton black hole, which we take as a representative solution of an alternative theory of gravity. Adopting the very-long baseline interferometry configuration from the 2017 EHTC campaign, we find that it could be extremely difficult to distinguish between black holes from different theories of gravity, thus highlighting that great caution is needed when interpreting black hole images as tests of general relativity.

  9. The Mediating Effect of Listening Metacognitive Awareness between Test-Taking Motivation and Listening Test Score: An Expectancy-Value Theory Approach

    OpenAIRE

    Xu, Jian

    2017-01-01

    The present study investigated test-taking motivation in L2 listening testing context by applying Expectancy-Value Theory as the framework. Specifically, this study was intended to examine the complex relationships among expectancy, importance, interest, listening anxiety, listening metacognitive awareness, and listening test score using data from a large-scale and high-stakes language test among Chinese first-year undergraduates. Structural equation modeling was used to examine the mediating...

  10. Detection of improvement in the masticatory function from old to new removable partial dentures using mixing ability test.

    Science.gov (United States)

    Asakawa, A; Fueki, K; Ohyama, T

    2005-09-01

    The aim of this study was to determine the sensitivity of the Mixing Ability Test to detect improvement of masticatory function in subjects on transition from old to new removable partial dentures. Thirty-two subjects (seven males, 25 females, mean age 65.0 years) with distal extension partially edentulous area in mandible and/or maxilla participated in the study. The following reasons were presented for replacing the old removable partial dentures with new ones: fracture and/or poor fitness of retainers, extraction of abutment teeth, poor fitness of denture base, severe wear of artificial teeth and request for metal base dentures. Masticatory function with old and new removable partial dentures after an adaptation period (mean 27.4 weeks) was evaluated by the Mixing Ability Test. Subjects were asked to masticate five two-coloured wax cubes with each removable partial denture. Mixing Ability Index was obtained from the colour mixture and shape of the masticated cubes. Wilcoxon signed-rank test was used to test the difference of Mixing Ability Indexes between old and new removable partial dentures. The Mixing Ability Index with new removable partial dentures (mean ± s.d.: 0.70 ± 0.68) was significantly higher (P …) than that with old removable partial dentures (-0.11 ± 1.13). The results suggest that the Mixing Ability Test was capable of detecting improvement in masticatory function with new removable partial dentures.
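
    The paired comparison named in this abstract, the Wilcoxon signed-rank test, ranks the absolute within-subject differences and sums the ranks separately for positive and negative differences. The sketch below computes only those two rank sums (no p-value) with average ranks for ties; it is a generic illustration with a hypothetical function name and made-up data, not the authors' analysis.

```python
def wilcoxon_signed_rank_sums(before, after):
    """Return (W+, W-) for paired samples, dropping zero differences."""
    if len(before) != len(after):
        raise ValueError("paired samples must have the same length")
    diffs = [a - b for b, a in zip(before, after) if a != b]
    # Indices of the differences, ordered by absolute size.
    ordered = sorted(range(len(diffs)), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * len(diffs)
    start = 0
    while start < len(ordered):
        # Extend the run while the absolute differences are tied.
        end = start
        while (end + 1 < len(ordered)
               and abs(diffs[ordered[end + 1]]) == abs(diffs[ordered[start]])):
            end += 1
        average_rank = (start + end) / 2.0 + 1.0  # ranks are 1-based
        for position in range(start, end + 1):
            ranks[ordered[position]] = average_rank
        start = end + 1
    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    w_minus = sum(r for r, d in zip(ranks, diffs) if d < 0)
    return w_plus, w_minus


# Made-up paired scores for five subjects (old denture, new denture).
old_scores = [1, 2, 3, 4, 5]
new_scores = [2, 4, 6, 8, 10]
rank_sums = wilcoxon_signed_rank_sums(old_scores, new_scores)
print(rank_sums)
```

    A large imbalance between the two rank sums, as in this made-up example where every subject improves, is what the test treats as evidence of a systematic shift between the paired conditions.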

  11. Differences in physical-fitness test scores between actively and passively recruited older adults : Consequences for norm-based classification

    NARCIS (Netherlands)

    van Heuvelen, M.J.G.; Stevens, M.; Kempen, G.I.J.M.

    This study investigated differences in physical-fitness test scores between actively and passively recruited older adults and the consequences thereof for norm-based classification of individuals. Walking endurance, grip strength, hip flexibility, balance, manual dexterity, and reaction time were

  12. Differential Predictive Validity of High School GPA and College Entrance Test Scores for University Students in Yemen

    Science.gov (United States)

    Al-Hattami, Abdulghani Ali Dawod

    2012-01-01

    High school grade point average and college entrance test scores are two admission criteria that are currently used by most colleges in Yemen to select their prospective students. Given their widespread use, it is important to investigate their predictive validity to ensure the accuracy of the admission decisions in these institutions. This study…

  13. Automated Scoring for the "TOEFL Junior"® Comprehensive Writing and Speaking Test. Research Report. ETS RR-15-09

    Science.gov (United States)

    Evanini, Keelan; Heilman, Michael; Wang, Xinhao; Blanchard, Daniel

    2015-01-01

    This report describes the initial automated scoring results that were obtained using the constructed responses from the Writing and Speaking sections of the pilot forms of the "TOEFL Junior"® Comprehensive test administered in late 2011. For all of the items except one (the edit item in the Writing section), existing automated scoring…

  14. Predicting Pre-Service Classroom Teachers' Civil Servant Recruitment Examination's Educational Sciences Test Scores Using Artificial Neural Networks

    Science.gov (United States)

    Demir, Metin

    2015-01-01

    This study predicts the number of correct answers given by pre-service classroom teachers in Civil Servant Recruitment Examination's (CSRE) educational sciences test based on their high school grade point averages, university entrance scores, and grades (mid-term and final exams) from their undergraduate educational courses. This study was…

  15. Using imputed genotype data in the joint score tests for genetic association and gene-environment interactions in case-control studies.

    Science.gov (United States)

    Song, Minsun; Wheeler, William; Caporaso, Neil E; Landi, Maria Teresa; Chatterjee, Nilanjan

    2018-03-01

    Genome-wide association studies (GWAS) are now routinely imputed for untyped single nucleotide polymorphisms (SNPs) based on various powerful statistical algorithms for imputation trained on reference datasets. The use of predicted allele counts for imputed SNPs as the dosage variable is known to produce a valid score test for genetic association. In this paper, we investigate how to best handle imputed SNPs in various modern complex tests for genetic associations incorporating gene-environment interactions. We focus on case-control association studies where inference for an underlying logistic regression model can be performed using alternative methods that rely to varying degrees on an assumption of gene-environment independence in the underlying population. As increasingly large-scale GWAS are being performed through consortia efforts where it is preferable to share only summary-level information across studies, we also describe simple mechanisms for implementing score tests based on standard meta-analysis of "one-step" maximum-likelihood estimates across studies. Applications of the methods in simulation studies and a dataset from GWAS of lung cancer illustrate the ability of the proposed methods to maintain type-I error rates for the underlying testing procedures. For analysis of imputed SNPs, similar to typed SNPs, the retrospective methods can lead to considerable efficiency gain for modeling of gene-environment interactions under the assumption of gene-environment independence. Methods are made available for public use through the CGEN R software package. © 2017 WILEY PERIODICALS, INC.

  16. Factors Influencing the Rise in Test Scores: Urban Connecticut Educators' Perceptions

    Science.gov (United States)

    Merlone, Carol A.

    2013-01-01

    "Education is the source of shared values essential to democracy, (...) [however], values are not enough for democracy to function well; expert skills are also needed" (Fuhrman & Lazerson, 2005, xxvi). With the turn of the 21st century, debates over the nation's public school system's ability to ensure No Child Left Behind (NCLB) Act…

  17. Pulmonary Exacerbation Score in Cystic Fibrosis Patients: Reliability and Validity Testing

    OpenAIRE

    Keller, F.

    2016-01-01

    Background: Lung disease in cystic fibrosis (CF) is characterized by recurrent pulmonary exacerbations (PEs), but consensus on diagnostic criteria for PE is lacking. The use of a consistent definition of PE as an outcome measure in CF clinical trials would allow meaningful comparison across centers. The aim of this study was to assess the reliability and validity of a simplified version of the Seattle Pulmonary Exacerbation Score (SPEX). Materials and Methods: A cross-sectional observational ...

  18. The six-spot-step test - a new method for monitoring walking ability in patients with chronic inflammatory polyneuropathy

    DEFF Research Database (Denmark)

    Kreutzfeldt, Melissa; Jensen, Henrik B; Ravnborg, Mads

    2017-01-01

    OBJECTIVE: To evaluate whether the Six-Spot-Step-Test (SSST) is more suitable for monitoring walking ability in patients with chronic inflammatory polyneuropathy than the Timed-25-Foot-Walking test (T25FW). METHOD: In the SSST, participants have to walk as quickly as possible across a field...... of effect size, standardized response means and relative efficiency. Both ambulation tests correlated moderately to PGIC. CONCLUSION: The SSST may be superior to the T25FW in terms of dynamic range, floor effect and responsiveness which makes the SSST a possible alternative for monitoring walking ability...

  19. Compensation or inhibitory failure? Testing hypotheses of age-related right frontal lobe involvement in verbal memory ability using structural and diffusion MRI

    Science.gov (United States)

    Cox, Simon R.; Bastin, Mark E.; Ferguson, Karen J.; Allerhand, Mike; Royle, Natalie A.; Maniega, Susanna Muñoz; Starr, John M.; MacLullich, Alasdair M.J.; Wardlaw, Joanna M.; Deary, Ian J.; MacPherson, Sarah E.

    2015-01-01

    Functional neuroimaging studies report increased right prefrontal cortex (PFC) involvement during verbal memory tasks amongst low-scoring older individuals, compared to younger controls and their higher-scoring contemporaries. Some propose that this reflects inefficient use of neural resources through failure of the left PFC to inhibit non-task-related right PFC activity, via the anterior corpus callosum (CC). For others, it indicates partial compensation – that is, the right PFC cannot completely supplement the failing neural network, but contributes positively to performance. We propose that combining structural and diffusion brain MRI can be used to test predictions from these theories which have arisen from fMRI studies. We test these hypotheses in immediate and delayed verbal memory ability amongst 90 healthy older adults of mean age 73 years. Right hippocampus and left dorsolateral prefrontal cortex (DLPFC) volumes, and fractional anisotropy (FA) in the splenium made unique contributions to verbal memory ability in the whole group. There was no significant effect of anterior callosal white matter integrity on performance. Rather, segmented linear regression indicated that right DLPFC volume was a significantly stronger positive predictor of verbal memory for lower-scorers than higher-scorers, supporting a compensatory explanation for the differential involvement of the right frontal lobe in verbal memory tasks in older age. PMID:25241394

  20. Compensation or inhibitory failure? Testing hypotheses of age-related right frontal lobe involvement in verbal memory ability using structural and diffusion MRI.

    Science.gov (United States)

    Cox, Simon R; Bastin, Mark E; Ferguson, Karen J; Allerhand, Mike; Royle, Natalie A; Maniega, Susanna Muñoz; Starr, John M; MacLullich, Alasdair M J; Wardlaw, Joanna M; Deary, Ian J; MacPherson, Sarah E

    2015-02-01

    Functional neuroimaging studies report increased right prefrontal cortex (PFC) involvement during verbal memory tasks amongst low-scoring older individuals, compared to younger controls and their higher-scoring contemporaries. Some propose that this reflects inefficient use of neural resources through failure of the left PFC to inhibit non-task-related right PFC activity, via the anterior corpus callosum (CC). For others, it indicates partial compensation - that is, the right PFC cannot completely supplement the failing neural network, but contributes positively to performance. We propose that combining structural and diffusion brain MRI can be used to test predictions from these theories which have arisen from fMRI studies. We test these hypotheses in immediate and delayed verbal memory ability amongst 90 healthy older adults of mean age 73 years. Right hippocampus and left dorsolateral prefrontal cortex (DLPFC) volumes, and fractional anisotropy (FA) in the splenium made unique contributions to verbal memory ability in the whole group. There was no significant effect of anterior callosal white matter integrity on performance. Rather, segmented linear regression indicated that right DLPFC volume was a significantly stronger positive predictor of verbal memory for lower-scorers than higher-scorers, supporting a compensatory explanation for the differential involvement of the right frontal lobe in verbal memory tasks in older age. Crown Copyright © 2014. Published by Elsevier Ltd. All rights reserved.

  1. Estimation of an Examinee's Ability in the Web-Based Computerized Adaptive Testing Program IRT-CAT

    Directory of Open Access Journals (Sweden)

    Yoon-Hwan Lee

    2006-11-01

    We developed a program to estimate an examinee's ability in order to provide freely available access to a web-based computerized adaptive testing (CAT) program. We used PHP and JavaScript as the programming languages, PostgreSQL as the database management system on an Apache web server, and Linux as the operating system. We constructed a system that accepts user-entered items, supports searching within them, and creates tests. We performed an ability estimation on each test based on a Rasch model and two- or three-parameter logistic models. Our system provides an algorithm for a web-based CAT, replacing previous personal computer-based ones, and makes it possible to estimate an examinee's ability immediately at the end of the test.
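As a rough illustration of what ability estimation under a Rasch model involves, the sketch below computes a maximum-likelihood estimate of ability by Newton-Raphson iteration. It is written in Python rather than the PHP/JavaScript stack the record describes, and it is not the IRT-CAT code: the function names, the item difficulties, and the stopping rule are all assumptions made for the example.

```python
import math


def rasch_prob(theta, difficulty):
    """Probability of a correct response under the Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - difficulty)))


def estimate_ability(responses, difficulties, max_iter=50, tol=1e-8):
    """Maximum-likelihood ability estimate via Newton-Raphson (hypothetical helper).

    responses: list of 0/1 item scores; difficulties: matching item difficulties.
    """
    score = sum(responses)
    if score == 0 or score == len(responses):
        # The MLE is infinite for all-wrong or all-correct response patterns.
        raise ValueError("MLE does not exist for a zero or perfect score")
    theta = 0.0
    for _ in range(max_iter):
        probs = [rasch_prob(theta, b) for b in difficulties]
        gradient = sum(u - p for u, p in zip(responses, probs))
        information = sum(p * (1.0 - p) for p in probs)
        step = gradient / information
        theta += step
        if abs(step) < tol:
            break
    return theta


# Illustrative (made-up) item difficulties and one response pattern.
difficulties = [-1.0, -0.5, 0.0, 0.5, 1.0]
responses = [1, 1, 0, 1, 0]
theta_hat = estimate_ability(responses, difficulties)
print(round(theta_hat, 3))
```

At the estimate, the expected number of correct answers equals the observed raw score, which is a convenient check on any implementation of this kind.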

  2. Recent Developments in Language Assessment and the Case of Four Large-Scale Tests of ESOL Ability

    Science.gov (United States)

    Stoynoff, Stephen

    2009-01-01

    This review article surveys recent developments and validation activities related to four large-scale tests of L2 English ability: the iBT TOEFL, the IELTS, the FCE, and the TOEIC. In addition to describing recent changes to these tests, the paper reports on validation activities that were conducted on the measures. The results of this research…

  3. Are WISC IQ scores in children with mathematical learning disabilities underestimated? The influence of a specialized intervention on test performance.

    Science.gov (United States)

    Lambert, Katharina; Spinath, Birgit

    2018-01-01

    Intelligence measures play a pivotal role in the diagnosis of mathematical learning disabilities (MLD). Probably as a result of math-related material in IQ tests, children with MLD often display reduced IQ scores. However, it remains unclear whether the effects of math remediation extend to IQ scores. The present study investigated the impact of a special remediation program compared to a control group receiving private tutoring (PT) on the WISC IQ scores of children with MLD. We included N=45 MLD children (7-12 years) in a study with a pre- and post-test control group design. Children received remediation for two years on average. The analyses revealed significantly greater improvements in the experimental group on the Full-Scale IQ, and the Verbal Comprehension, Perceptual Reasoning, and Working Memory indices, but not Processing Speed, compared to the PT group. Children in the experimental group showed an average WISC IQ gain of more than ten points. Results indicate that the WISC IQ scores of MLD children might be underestimated and that an effective math intervention can improve WISC IQ test performance. Taking limitations into account, we discuss the use of IQ measures more generally for defining MLD in research and practice. Copyright © 2017 Elsevier Ltd. All rights reserved.

  4. Metacognitive Ability Relationship with Test Result of Senior High School of Biology Teacher Competence in Sijunjung District

    Science.gov (United States)

    Ardi, A.; Fadilah, M.; Ichsani, W.

    2018-04-01

    This research aimed to reveal the relationship between metacognitive ability and the competence test results of biology teachers in Sijunjung District. The population of this descriptive research was all high school biology teachers in Sijunjung District, and the sample comprised every member of that population, namely 23 biology teachers. The instruments used were a questionnaire on teachers' metacognitive ability and documents recording the teachers' competence test results. The questionnaire was first validated by two biology lecturers and one English lecturer. Data were analyzed using the Pearson product-moment correlation. Overall, there is a low relationship between metacognitive ability and the competence test results of high school biology teachers in Sijunjung District. Considered separately, the relationship between metacognitive ability and the professional competence test result was significant, with a correlation coefficient of 0.46 and a computed t value of 2.37 against a table (critical) t value of 1.72. Metacognitive ability contributed 21.6% to the teachers' competence test results, while the remaining 78.4% was not accounted for in this research.
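The reported t value can be checked against the correlation coefficient with the usual significance test for a Pearson correlation, t = r * sqrt(n - 2) / sqrt(1 - r^2). The short sketch below assumes this is the test the authors used (the abstract does not say so explicitly); the function name is hypothetical.

```python
import math


def t_from_r(r, n):
    """t statistic for testing a Pearson correlation r from n paired observations."""
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


r, n = 0.46, 23  # values quoted in the abstract
t_value = t_from_r(r, n)
shared_variance_pct = r * r * 100

print(round(t_value, 2))              # 2.37
print(round(shared_variance_pct, 1))  # 21.2
```

The computed t of 2.37 matches the abstract. The squared coefficient gives about 21.2% rather than the reported 21.6%; the difference is consistent with the authors having used an unrounded correlation of roughly 0.465, though that is a guess.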

  5. Timed up & go test score in patients with hip fracture is related to the type of walking aid

    DEFF Research Database (Denmark)

    Kristensen, Morten T; Bandholm, Thomas; Holm, Bente

    2009-01-01

    Kristensen MT, Bandholm T, Holm B, Ekdahl C, Kehlet H. Timed Up & Go test score in patients with hip fracture is related to the type of walking aid. OBJECTIVE: To determine the relationship between Timed Up & Go (TUG) test scores and type of walking aid used during the test, and to determine...... the feasibility of using the rollator as a standardized walking aid during the TUG in patients with hip fracture who were allowed full weight-bearing (FWB). DESIGN: Prospective methodological study. SETTING: An acute orthopedic hip fracture unit at a university hospital. PARTICIPANTS: Patients (N=126; 90 women......, 36 men) with hip fracture with a mean age +/- SD of 74.8+/-12.7 years performed the TUG the day before discharge from the orthopedic ward. INTERVENTIONS: Not applicable. MAIN OUTCOME MEASURES: The TUG was performed with the walking aid the patient was to be discharged with: a walker (n=88) or elbow...

  6. Comparison of Physical Therapy Anatomy Performance and Anxiety Scores in Timed and Untimed Practical Tests

    Science.gov (United States)

    Schwartz, Sarah M.; Evans, Cathy; Agur, Anne M.R.

    2015-01-01

    Students in health care professional programs face many stressful tests that determine successful completion of their program. Test anxiety during these high stakes examinations can affect working memory and lead to poor outcomes. Methods of decreasing test anxiety include lengthening the time available to complete examinations or evaluating…

  7. Relationship Between Broiler Body Weights, Eimeria maxima Gross Lesion Scores, and Microscores in Three Anticoccidial Sensitivity Tests.

    Science.gov (United States)

    Barrios, Miguel A; Da Costa, Manuel; Kimminau, Emily; Fuller, Lorraine; Clark, Steven; Pesti, Gene; Beckstead, Robert

    2017-06-01

    Anticoccidial sensitivity tests (ASTs) serve to determine the efficacy of anticoccidial drugs against Eimeria field isolates in a controlled laboratory setting. The most commonly measured parameters are body weight gain, feed conversion ratio, gross intestinal lesion scores, and mortality. Due to the difficulty in reliably scoring gross lesion scores of Eimeria maxima, microscopic analysis of intestinal scrapings (microscores) can be used in the field to indicate the presence of this particular Eimeria. The goal of this study was to determine the relationship between E. maxima microscores and broiler body weights and gross E. maxima lesion scores in three ASTs. Day-old broiler chicks were raised for 12 days on a standard corn-soy diet. On Day 12, chicks were placed in Petersime batteries and treatment diets were provided. There were six birds per pen, four pens per treatment, and 12 treatments, for a total of 288 chicks per AST. The treatments were as follows: 1) nonmedicated, noninfected; 2) nonmedicated, infected; 3) lasalocid, infected; 4) salinomycin, infected; 5) diclazuril, infected; 6) monensin, infected; 7) decoquinate, infected; 8) narasin + nicarbazin, infected; 9) narasin, infected; 10) nicarbazin, infected; 11) robenidine, infected; and 12) zoalene, infected. On Day 14, chicks were challenged with an Eimeria field isolate by oral gavage. On Day 20, broilers were weighed, and gross lesion scores and microscores were classified from 0 to 4 depending on the severity of the gross lesion scores and E. maxima microscores. Data from three trials using different field isolates were statistically analyzed using a logarithmic regression model. There was no relationship (P = 0.1224) between microscores and body weight gain. There was a positive relationship between microscores and gross lesion scores (P = 0.004). However, there was also an interaction between isolate and treatment (P Eimeria or the amount of E. maxima in the inoculum.

  8. The Standard Error of a Proportion for Different Scores and Test Length.

    Directory of Open Access Journals (Sweden)

    David A. Walker

    2005-06-01

    This paper examines Smith's (2003) proposed standard error of a proportion index associated with the idea of reliability as sufficiency of information. A detailed table indexing all of the standard error values affiliated with assessments that range from 5 to 100 items, where students scored as low as 50% correct and 50% incorrect to as high as 95% correct and 5% incorrect, calculated in increments of 1 percentage point, is presented, along with distributional qualities. Examples using this measure for classroom teachers and higher education instructors of assessment are provided.
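A table of the kind described can be sketched with the conventional standard error of a proportion, SE = sqrt(p(1 - p) / n), where p is the proportion correct and n the number of items. Whether Smith's index is exactly this formula is an assumption here, and the function names are illustrative only.

```python
import math


def se_proportion(p_correct, n_items):
    """Conventional standard error of a proportion for a test of n_items items."""
    return math.sqrt(p_correct * (1.0 - p_correct) / n_items)


def se_table(item_counts, percents_correct):
    """Map (n_items, percent_correct) to its standard error."""
    return {
        (n, pct): se_proportion(pct / 100.0, n)
        for n in item_counts
        for pct in percents_correct
    }


# Same ranges as the abstract: 5 to 100 items, 50% to 95% correct in 1-point steps.
table = se_table(range(5, 101), range(50, 96))
print(round(table[(20, 80)], 4))  # a 20-item test with 80% correct
```

The standard error shrinks as the test gets longer and as scores move away from 50% correct, which is why short classroom tests carry comparatively wide uncertainty.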

  9. Birth order and its relationship to depression, anxiety, and self-concept test scores in children.

    Science.gov (United States)

    Gates, L; Lineberger, M R; Crockett, J; Hubbard, J

    1988-03-01

    Children (N = 404), 7 to 12 years old, were given the Children's Depression Inventory, the State-Trait Anxiety Inventory for Children, and the Piers-Harris Self-Concept Scale. First-born children scored significantly lower on depression than second-, third-, fourth-born, and youngest children. First borns showed significantly less trait anxiety than third-born children. First-born children also showed significantly higher levels of self-esteem than second-born and youngest children. Girls in this study showed significantly more trait anxiety than boys.

  10. Chronic obstructive pulmonary disease (COPD) assessment test scores corresponding to modified Medical Research Council grades among COPD patients.

    Science.gov (United States)

    Lee, Chang-Hoon; Lee, Jinwoo; Park, Young Sik; Lee, Sang-Min; Yim, Jae-Joon; Kim, Young Whan; Han, Sung Koo; Yoo, Chul-Gyu

    2015-09-01

    In assigning patients with chronic obstructive pulmonary disease (COPD) to subgroups according to the updated guidelines of the Global Initiative for Chronic Obstructive Lung Disease, discrepancies have been noted between the COPD assessment test (CAT) criteria and modified Medical Research Council (mMRC) criteria. We investigated the determinants of symptom and risk groups and sought to identify a better CAT criterion. This retrospective study included COPD patients seen between June 20, 2012, and December 5, 2012. The CAT score that can accurately predict an mMRC grade ≥ 2 versus COPD patients, the percentages of patients classified into subgroups A, B, C, and D were 24.5%, 47.2%, 4.2%, and 24.1% based on CAT criteria and 49.3%, 22.4%, 8.9%, and 19.4% based on mMRC criteria, respectively. More than 90% of the patients who met the mMRC criteria for the 'more symptoms group' also met the CAT criteria. AUROC and CART analyses suggested that a CAT score ≥ 15 predicted an mMRC grade ≥ 2 more accurately than the current CAT score criterion. During follow-up, patients with CAT scores of 10 to 14 did not have a different risk of exacerbation versus those with CAT scores COPD patients.

  11. Zero Calcium Score as a Filter for Further Testing in Patients Admitted to the Coronary Care Unit with Chest Pain.

    Science.gov (United States)

    Correia, Luis Cláudio Lemos; Esteves, Fábio P; Carvalhal, Manuela; Souza, Thiago Menezes Barbosa de; Sá, Nicole de; Correia, Vitor Calixto de Almeida; Alexandre, Felipe Kalil Beirão; Lopes, Fernanda; Ferreira, Felipe; Noya-Rabelo, Márcia

    2017-06-12

    The accuracy of zero coronary calcium score as a filter in patients with chest pain has been demonstrated at the emergency room and outpatient clinics, populations with low prevalence of coronary artery disease (CAD). To test the gatekeeping role of zero calcium score in patients with chest pain admitted to the coronary care unit (CCU), where the pretest probability of CAD is higher than that of other populations. Patients underwent computed tomography for calcium scoring, and obstructive CAD was defined by a minimum 70% stenosis on invasive angiography. In 146 patients studied, the prevalence of CAD was 41%. A zero calcium score was present in 35% of the patients. The sensitivity and specificity of zero calcium score yielded a negative likelihood ratio of 0.16. After logistic regression adjustment for pretest probability, zero calcium score was independently associated with lower odds of CAD (OR = 0.12, 95%CI = 0.04-0.36), increasing the area under the ROC curve of the clinical model from 0.76 to 0.82 (p = 0.006). Zero calcium score provided a net reclassification improvement of 0.20 (p = 0.0018) over the clinical model when using a pretest probability threshold of 10% for discharging without further testing. In patients with pretest probability zero calcium score had a negative predictive value of 95% (95%CI = 83%-99%), with a number needed to test of 2.1 for obtaining one additional discharge. Zero calcium score substantially reduces the pretest probability of obstructive CAD in patients admitted to the CCU with acute chest pain. (Arq Bras Cardiol. 2017; [online].ahead print, PP.0-0).
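To see how a negative likelihood ratio of 0.16 lowers the probability of disease, the sketch below applies the standard odds form of Bayes' rule. Using the cohort prevalence of 41% as the pretest probability is an assumption made purely for illustration; the study itself adjusts for an individual pretest probability from a clinical model.

```python
def post_test_probability(pretest_prob, likelihood_ratio):
    """Update a pretest probability with a likelihood ratio via odds."""
    pretest_odds = pretest_prob / (1.0 - pretest_prob)
    post_odds = pretest_odds * likelihood_ratio
    return post_odds / (1.0 + post_odds)


# Figures quoted in the abstract: prevalence 41%, negative likelihood ratio 0.16.
prob_after_zero_score = post_test_probability(0.41, 0.16)
print(round(prob_after_zero_score, 3))  # 0.1
```

Under that assumption, a zero calcium score takes a patient from a 41% to roughly a 10% probability of obstructive CAD, which happens to sit at the discharge threshold mentioned in the abstract.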

  12. Percentiles of the null distribution of 2 maximum lod score tests.

    Science.gov (United States)

    Ulgen, Ayse; Yoo, Yun Joo; Gordon, Derek; Finch, Stephen J; Mendell, Nancy R

    2004-01-01

    We here consider the null distribution of the maximum lod score (LOD-M) obtained upon maximizing over transmission model parameters (penetrance values, dominance, and allele frequency) as well as the recombination fraction. Also considered is the lod score maximized over a fixed choice of genetic model parameters and recombination-fraction values set prior to the analysis (MMLS) as proposed by Hodge et al. The objective is to fit parametric distributions to MMLS and LOD-M. Our results are based on 3,600 simulations of samples of n = 100 nuclear families ascertained for having one affected member and at least one other sibling available for linkage analysis. Each null distribution is approximately a mixture $p\,\chi^2(0) + (1 - p)\,\chi^2(v)$. The values of MMLS appear to fit the mixture $0.20\,\chi^2(0) + 0.80\,\chi^2(1.6)$. The mixture distribution $0.13\,\chi^2(0) + 0.87\,\chi^2(2.8)$ appears to describe the null distribution of LOD-M. From these results we derive a simple method for obtaining critical values of LOD-M and MMLS. Copyright 2004 S. Karger AG, Basel
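One way critical values could be read off such a mixture is sketched below. Because the chi-square component with 0 degrees of freedom is a point mass at zero, the upper tail of the mixture is (1 - p) times the tail of the chi-square with v degrees of freedom, so the critical value c solves (1 - p) P(X_v > c) = alpha. This is not necessarily the authors' method: the function names, the 5% level, and the choice to work on the chi-square scale (ignoring any conversion between lod units and chi-square units) are assumptions.

```python
import math


def chi2_sf(x, df):
    """Upper-tail probability of a chi-square with (possibly fractional) df.

    Uses the power series for the regularized lower incomplete gamma function.
    """
    if x <= 0:
        return 1.0
    a, z = df / 2.0, x / 2.0
    term = 1.0 / a
    total = term
    n = 1
    while abs(term) > 1e-15 * abs(total):
        term *= z / (a + n)
        total += term
        n += 1
    lower = total * math.exp(-z + a * math.log(z) - math.lgamma(a))
    return 1.0 - lower


def mixture_critical_value(p_zero, df, alpha):
    """Critical value c with P(mixture > c) = alpha, found by bisection."""
    target = alpha / (1.0 - p_zero)
    if target >= 1.0:
        return 0.0
    lo, hi = 0.0, 100.0
    for _ in range(200):
        mid = (lo + hi) / 2.0
        if chi2_sf(mid, df) > target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


# Mixture weights and degrees of freedom quoted in the abstract; alpha is illustrative.
crit_mmls = mixture_critical_value(0.20, 1.6, 0.05)
crit_lodm = mixture_critical_value(0.13, 2.8, 0.05)
print(round(crit_mmls, 2), round(crit_lodm, 2))
```

As expected, the fitted mixture for LOD-M, which has more degrees of freedom and less mass at zero, gives the larger critical value of the two.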

  13. A quantitative assessment of alkaptonuria: testing the reliability of two disease severity scoring systems.

    Science.gov (United States)

    Cox, Trevor F; Ranganath, Lakshminarayan

    2011-12-01

    Alkaptonuria (AKU) is caused by excessive accumulation of homogentisic acid (HGA) in body fluids owing to a lack of the enzyme homogentisate dioxygenase. This leads in turn to varied clinical manifestations, mainly through a process, known as ochronosis, in which HGA is converted to a polymeric melanin-like pigment. A potential treatment, the drug nitisinone, is available to decrease formation of HGA. However, successful demonstration of its efficacy in modifying the natural history of AKU requires an effective quantitative assessment tool. We have described two potential tools that could be used to quantitate disease burden in AKU. One tool scores the clinical features, including clinical assessments, investigations, and questionnaires, in 15 patients with AKU. The second tool is a scoring system that includes only items obtained from questionnaires, used in 44 people with AKU. Statistical analyses were carried out on the two patient datasets to assess the AKU tools; these included the calculation of Cronbach's alpha, multidimensional scaling, and simple linear regression analysis. The conclusion was that there was good evidence that the tools could be adopted as AKU assessment tools, but perhaps with further refinement before being used in the practical setting of a clinical trial.
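For readers unfamiliar with Cronbach's alpha, the sketch below computes it from the standard formula, alpha = k / (k - 1) * (1 - sum of item variances / variance of total scores), for k items. The item scores are invented for the example and have nothing to do with the AKU datasets.

```python
from statistics import variance


def cronbach_alpha(item_scores):
    """Cronbach's alpha; item_scores is a list of items, each a list of per-person scores."""
    k = len(item_scores)
    totals = [sum(person) for person in zip(*item_scores)]
    item_variance_sum = sum(variance(item) for item in item_scores)
    return k / (k - 1) * (1.0 - item_variance_sum / variance(totals))


# Hypothetical scores for three questionnaire items answered by five people.
items = [
    [2, 3, 4, 4, 5],
    [1, 3, 3, 5, 5],
    [2, 2, 4, 5, 4],
]
alpha = cronbach_alpha(items)
print(round(alpha, 3))
```

Values near 1 indicate that the items vary together and so plausibly measure a common construct, which is the sense in which the statistic is used to judge a scoring tool's internal consistency.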

  14. The Two-Systems Account of Theory of Mind: Testing the Links to Social-Perceptual and Cognitive Abilities

    Directory of Open Access Journals (Sweden)

    Bozana Meinhardt-Injac

    2018-01-01

    According to the two-systems account of theory of mind (ToM), understanding mental states of others involves both fast social-perceptual processes, as well as slower, reflexive cognitive operations (Frith and Frith, 2008; Apperly and Butterfill, 2009). To test the respective roles of specific abilities in either of these processes we administered 15 experimental procedures to a large sample of 343 participants, testing ability in face recognition and holistic perception, language, and reasoning. ToM was measured by a set of tasks requiring ability to track and to infer complex emotional and mental states of others from faces, eyes, spoken language, and prosody. We used structural equation modeling to test the relative strengths of a social-perceptual (face processing related) and reflexive-cognitive (language and reasoning related) path in predicting ToM ability. The two paths accounted for 58% of ToM variance, thus validating a general two-systems framework. Testing specific predictor paths revealed language and face recognition as strong and significant predictors of ToM. For reasoning, there were neither direct nor mediated effects, albeit reasoning was strongly associated with language. Holistic face perception also failed to show a direct link with ToM ability, while there was a mediated effect via face recognition. These results highlight the respective roles of face recognition and language for the social brain, and contribute closer empirical specification of the general two-systems account.

  15. The Two-Systems Account of Theory of Mind: Testing the Links to Social-Perceptual and Cognitive Abilities.

    Science.gov (United States)

    Meinhardt-Injac, Bozana; Daum, Moritz M; Meinhardt, Günter; Persike, Malte

    2018-01-01

    According to the two-systems account of theory of mind (ToM), understanding mental states of others involves both fast social-perceptual processes, as well as slower, reflexive cognitive operations (Frith and Frith, 2008; Apperly and Butterfill, 2009). To test the respective roles of specific abilities in either of these processes we administered 15 experimental procedures to a large sample of 343 participants, testing ability in face recognition and holistic perception, language, and reasoning. ToM was measured by a set of tasks requiring ability to track and to infer complex emotional and mental states of others from faces, eyes, spoken language, and prosody. We used structural equation modeling to test the relative strengths of a social-perceptual (face processing related) and reflexive-cognitive (language and reasoning related) path in predicting ToM ability. The two paths accounted for 58% of ToM variance, thus validating a general two-systems framework. Testing specific predictor paths revealed language and face recognition as strong and significant predictors of ToM. For reasoning, there were neither direct nor mediated effects, albeit reasoning was strongly associated with language. Holistic face perception also failed to show a direct link with ToM ability, while there was a mediated effect via face recognition. These results highlight the respective roles of face recognition and language for the social brain, and contribute closer empirical specification of the general two-systems account.

  16. Evaluation of the Discrepancy between the European Pharmacopoeia Test and an Adopted United States Pharmacopoeia Test Regarding the Weight Uniformity of Scored Tablet Halves: Is Harmonization Required?

    Science.gov (United States)

    Zaid, Abdel Naser; Ghoush, Abeer Abu; Al-Ramahi, Rowa'; Are'r, Mohammed

    2012-01-01

    The aim of this study was to evaluate whether there exists any discrepancy between the European Pharmacopoeia (Ph. Eur.) and adopted United States Pharmacopeia (USP) tests concerning the weight uniformity measurements of tablet halves after splitting. The USP method does not contain provisions to evaluate split tablets, so here we adopt their whole tablet weight uniformity method. Twenty-nine different commercial scored tablets (local and imported) were divided. The split units were individually weighed and the relative standard deviation (RSD) for each product was calculated and then evaluated according to both the adopted USP and the Ph. Eur. tests of weight uniformity. Twenty out of the 29 products tested failed the USP test, while 14 of them failed the Ph. Eur. test. Nine products passed both the USP and Ph. Eur. tests. Six products passed the Ph. Eur. test but failed the USP test, with all of these products having an RSD greater than 6%. The correlation coefficient between the weight and content of split halves for three randomly selected products (corotenol 100 mg, corotenol 50 mg, and lorazepam 2.5 mg) was found to be 0.986, 0.998, and 0.72, respectively. A clear difference can be seen between outcomes obtained by the two compendial tablet splitting methods with regard to weight uniformity. Results from the USP test showed that tighter measures are needed to pass the test. Our results argue that the Ph. Eur. should revise the existing weight uniformity test on scored tablets to include the RSD parameter in it. The USP should include this adopted test as a specific test for scored tablet halves, not just whole tablets. Manufacturers in some cases will need to improve the quality of the produced scored tablets in order to pass the USP test, especially those with low therapeutic indices. Finally, harmonization between the pharmacopoeias regarding the weight uniformity testing of split tablets is warranted.

  17. Test Anxiety Among College Students With Specific Reading Disability (Dyslexia): Nonverbal Ability and Working Memory as Predictors.

    Science.gov (United States)

    Nelson, Jason M; Lindstrom, Will; Foels, Patricia A

    2015-01-01

    Test anxiety and its correlates were examined with college students with and without specific reading disability (RD; n = 50 in each group). Results indicated that college students with RD reported higher test anxiety than did those without RD, and the magnitude of these differences was in the medium range on two test anxiety scales. Relative to college students without RD, up to 5 times as many college students with RD reported clinically significant test anxiety. College students with RD reported significantly higher cognitively based test anxiety than physically based test anxiety. Reading skills, verbal ability, and processing speed were not correlated with test anxiety. General intelligence, nonverbal ability, and working memory were negatively correlated with test anxiety, and the magnitude of these correlations was medium to large. When these three cognitive constructs were considered together in multiple regression analyses, only working memory and nonverbal ability emerged as significant predictors and varied based on the test anxiety measure. Implications for assessment and intervention are discussed. © Hammill Institute on Disabilities 2013.

  18. A risk score for predicting coronary artery disease in women with angina pectoris and abnormal stress test finding.

    Science.gov (United States)

    Lo, Monica Y; Bonthala, Nirupama; Holper, Elizabeth M; Banks, Kamakki; Murphy, Sabina A; McGuire, Darren K; de Lemos, James A; Khera, Amit

    2013-03-15

    Women with angina pectoris and abnormal stress test findings commonly have no epicardial coronary artery disease (CAD) at catheterization. The aim of the present study was to develop a risk score to predict obstructive CAD in such patients. Data were analyzed from 337 consecutive women with angina pectoris and abnormal stress test findings who underwent cardiac catheterization at our center from 2003 to 2007. Forward selection multivariate logistic regression analysis was used to identify the independent predictors of CAD, defined by ≥50% diameter stenosis in ≥1 epicardial coronary artery. The independent predictors included age ≥55 years (odds ratio 2.3, 95% confidence interval 1.3 to 4.0), body mass index stress imaging (odds ratio 2.8, 95% confidence interval 1.5 to 5.5), and exercise capacity statistic of 0.745 (95% confidence interval 0.70 to 0.79), and an optimized cutpoint of a score of ≤2 included 62% of the subjects and had a negative predictive value of 80%. In conclusion, a simple clinical risk score of 7 characteristics can help differentiate those more or less likely to have CAD among women with angina pectoris and abnormal stress test findings. This tool, if validated, could help to guide testing strategies in women with angina pectoris. Copyright © 2013 Elsevier Inc. All rights reserved.

  19. Reprocessing ability of high density fuels for research and test reactors

    International Nuclear Information System (INIS)

    Gay, A.; Belieres, M.

    1997-01-01

    The development of a new high-density fuel is becoming a key issue for research reactor operators. Such a fuel should be a Low Enrichment Uranium (LEU) fuel with a high density, to improve on present in-core performance. It must be compatible with reprocessing in an industrial plant to provide a steady back-end solution. Within the framework of a CEA/CERCA/COGEMA working group on new fuel development for research reactors, COGEMA has evaluated the reprocessing ability of some fuel dispersants selected as good candidates. The results will allow us to classify these fuel dispersants from a reprocessing-ability point of view. (author)

  20. Examining the Relationship between Students' Mathematics Test Scores and Computer Use at Home and at School

    Science.gov (United States)

    O'Dwyer, Laura M.; Russell, Michael; Bebell, Damian; Seeley, Kevon

    2008-01-01

    Over the past decade, standardized test results have become the primary tool used to judge the effectiveness of schools and educational programs, and today, standardized testing serves as the keystone for educational policy at the state and federal levels. This paper examines the relationship between fourth grade mathematics achievement and…

  1. Innovative testing of spatial ability: interactive responding and the use of complex stimuli material

    Czech Academy of Sciences Publication Activity Database

    Jelínek, Martin; Květon, Petr; Vobořil, Dalibor

    2015-01-01

    Vol. 16, No. 1 (2015), pp. 45-55 ISSN 1612-4782 R&D Projects: GA ČR(CZ) GAP407/11/2397 Institutional support: RVO:68081740 Keywords: Spatial ability * Navigation skill * Working memory Subject RIV: AN - Psychology Impact factor: 1.340, year: 2015

  2. Isolation and testing the cholesterol reduction ability (in-vitro) of ...

    African Journals Online (AJOL)

    Probiotics are live microbial feed supplements, which positively affect the host animal by improving its intestinal microbial balance. Studies have shown probiotic activities of Lactococci isolated from dairy foods, which include the ability to inhibit the growth of other bacteria and the reduction of cholesterol. However, there is ...

  3. Diagnosing academic language ability : An analysis of the Test of Academic Literacy for Postgraduate Students

    NARCIS (Netherlands)

    Pot, Anna; Weideman, Albert

    2015-01-01

    Following the observation that a large number of postgraduate students may not possess an adequate level of academic language ability to complete their studies successfully, this study investigates postgraduate students' strengths and weaknesses in academic literacy, with a specific focus on

  4. The effect of an intervention program on functional movement screen test scores in mixed martial arts athletes.

    Science.gov (United States)

    Bodden, Jamie G; Needham, Robert A; Chockalingam, Nachiappan

    2015-01-01

    This study assessed the basic fundamental movements of mixed martial arts (MMA) athletes using the functional movement screen (FMS) assessment and determined if an intervention program was successful at improving results. Participants were placed into 1 of 2 groups: intervention and control. The intervention group was required to complete a corrective exercise program 4 times per week, and all participants were asked to continue their usual MMA training routine. A mid-intervention FMS test was included to examine if successful results were noticed sooner than the 8-week period. Results highlighted differences in FMS test scores between the control group and intervention group (p = 0.006). Post hoc testing revealed a significant increase in the FMS score of the intervention group between weeks 0 and 8 (p = 0.00) and weeks 0 and 4 (p = 0.00) and no significant increase between weeks 4 and 8 (p = 1.00). A χ² analysis revealed that the intervention group participants were more likely to have an FMS score >14 than participants in the control group at week 4 (χ² = 7.29, p < 0.01) and week 8 (χ² = 5.2, p ≤ 0.05). Finally, a greater number of participants in the intervention group were free from asymmetry at week 4 and week 8 compared with the initial test period. The results of the study suggested that a 4-week intervention program was sufficient to improve FMS scores. Most, if not all, of the movements covered on the FMS relate to many aspects of MMA training. The knowledge that the FMS can identify movement dysfunctions and, furthermore, the fact that the issues can be improved through a standardized intervention program could be advantageous to MMA coaches, thus providing the opportunity to adapt and implement new additions to training programs.

  5. Students' Attitude toward and Acceptability of Computerized Adaptive Testing in Medical School and their Effect on the Examinees' Ability

    Directory of Open Access Journals (Sweden)

    Mee Young Kim

    2005-06-01

    An examinee's ability can be evaluated precisely using computerized adaptive testing (CAT), which is shorter than written tests and more efficient in terms of the duration of the examination. We used CAT for the second General Examination of 98 senior students in medical college on November 27, 2004. We prepared 1,050 pre-calibrated test items according to item response theory, which had been used for the General Examination administered to senior students in 2003. The computer was programmed to pose questions until the standard error of the ability estimate was smaller than 0.01. To determine the students' attitude toward and evaluation of CAT, we conducted surveys before and after the examination, via the Web. The mean of the students' ability estimates was 0.3513 and its standard deviation was 0.9097 (range -2.4680 to +2.5310). There was no significant difference in the ability estimates according to the responses of students to items concerning their experience with CAT, their ability to use a computer, or their anxiety before and after the examination (p>0.05). Many students were unhappy that they could not recheck their responses (49%), and some stated that there were too few examination items (24%). Of the students, 79% had no complaints concerning using a computer and 63% wanted to expand the use of CAT. These results indicate that CAT can be implemented in medical schools without causing difficulties for users.

  6. The Effect of Presentation Medium on Pilot Selection Test Battery Scores

    National Research Council Canada - National Science Library

    Biggerstaff, S

    1998-01-01

    .... The American Psychological Association (APA) has set guidelines to be followed to ensure both qualitative and quantitative equivalence of new test formats prior to their use in applied settings...

  7. A powerful score-based test statistic for detecting gene-gene co-association.

    Science.gov (United States)

    Xu, Jing; Yuan, Zhongshang; Ji, Jiadong; Zhang, Xiaoshuai; Li, Hongkai; Wu, Xuesen; Xue, Fuzhong; Liu, Yanxun

    2016-01-29

    The genetic variants identified by genome-wide association studies (GWAS) can only account for a small proportion of the total heritability for complex disease. The existence of gene-gene joint effects, which contain the main effects and their co-association, is one of the possible explanations for the "missing heritability" problem. Gene-gene co-association refers to the extent to which the joint effects of two genes differ from the main effects, not only due to the traditional interaction under nearly independent condition but also due to the correlation between genes. Generally, genes tend to work collaboratively within a specific pathway or network contributing to the disease, and the specific disease-associated loci will often be highly correlated (e.g. single nucleotide polymorphisms (SNPs) in linkage disequilibrium). Therefore, we proposed a novel score-based statistic (SBS) as a gene-based method for detecting gene-gene co-association. Various simulations illustrate that, under different sample sizes, marginal effects of causal SNPs and co-association levels, the proposed SBS performs better than other existing methods, including the single SNP-based and principal component analysis (PCA)-based logistic regression models, the statistics based on canonical correlations (CCU), kernel canonical correlation analysis (KCCU), partial least squares path modeling (PLSPM) and the delta-square (δ²) statistic. The real data analysis of rheumatoid arthritis (RA) further confirmed its advantages in practice. SBS is a powerful and efficient gene-based method for detecting gene-gene co-association.

  8. A framework for testing the ability of models to project climate change and its impacts

    DEFF Research Database (Denmark)

    Refsgaard, J. C.; Madsen, H.; Andréassian, V.

    2014-01-01

    Models used for climate change impact projections are typically not tested for simulation beyond current climate conditions. Since we have no data truly reflecting future conditions, a key challenge in this respect is to rigorously test models using proxies of future conditions. This paper presents a validation framework and guiding principles applicable across earth science disciplines for testing the capability of models to project future climate change and its impacts. Model test schemes comprising split-sample tests, differential split-sample tests and proxy site tests are discussed in relation to their application for projections by use of single models, ensemble modelling and space-time-substitution and in relation to use of different data from historical time series, paleo data and controlled experiments. We recommend that differential-split sample tests should be performed with best available proxy data...

  9. Test-retest reliability of the Battery for the Assessment of Auditory Sensorimotor and Timing Abilities (BAASTA)

    NARCIS (Netherlands)

    Bégel, Valentin; Verga, Laura; Benoit, Charles-Etienne; Kotz, Sonja A; Bella, Simone Dalla

    2018-01-01

    Perceptual and sensorimotor timing skills can be comprehensively assessed with the Battery for the Assessment of Auditory Sensorimotor and Timing Abilities (BAASTA). The battery has been used for testing rhythmic skills in healthy adults and patient populations (e.g., with Parkinson disease),

  10. Literature Review: Validity and Potential Usefulness of Psychomotor Ability Tests for Personnel Selection and Classification

    Science.gov (United States)

    1988-04-01

    processing capabilities. Craik and Lockhart (1972), for example, investigated limitations in the ability to store information in short-term storage...F. I. M., & Lockhart, R. S. (1972). Levels of processing: A framework for memory research. Journal of Verbal Learning and Verbal Behavior, 11, 671...preferable to an information processing taxonomy for purposes of the current selection and classification research. First, on a theoretical level

  11. Use of e-rater[R] in Scoring of the TOEFL iBT[R] Writing Test. Research Report. ETS RR-11-25

    Science.gov (United States)

    Haberman, Shelby J.

    2011-01-01

    Alternative approaches are discussed for use of e-rater[R] to score the TOEFL iBT[R] Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…

  12. Nasalance Scores of Children with Repaired Cleft Palate Who Exhibit Normal Velopharyngeal Closure during Aerodynamic Testing

    Science.gov (United States)

    Zajac, David J.

    2013-01-01

    Purpose: To determine if children with repaired cleft palate and normal velopharyngeal (VP) closure as determined by aerodynamic testing exhibit greater acoustic nasalance than control children without cleft palate. Method: Pressure-flow procedures were used to identify 2 groups of children based on VP closure during the production of /p/ in the…

  13. A Comparison of Eighth Grade Students' Testing Scores between the "Jeopardy" and "Seatwork" Types of Review.

    Science.gov (United States)

    Daft, Lee T.

    This study focused on the review process before social studies testing. The students involved in the study were 71 13- and 14-year-olds from predominantly middle to upper class social status in a Knoxville, Tennessee suburb. The influence of an interactive review based on the quiz show "Jeopardy" was compared with that of a "seatwork"…

  14. Interpreting Mathematics Scores on the New Jersey College Basic Skills Placement Test.

    Science.gov (United States)

    Dass, Jane; Pine, Charles

    The New Jersey College Basic Skills Placement Test (NJCBSPT) is designed to measure certain basic language and mathematics skills of students entering New Jersey colleges. The primary purpose of the two mathematics sections is to determine whether students are prepared to begin certain college-level work without a handicap in computation or…

  15. The Effects of Specific Reading Interventions on Elementary Students' Test Scores

    Science.gov (United States)

    Griffin, Jacqueline Laverne Meeks

    2016-01-01

    Many students in third, fourth and fifth grades struggle at the lowest levels of reading proficiency. In fact, fewer than 40% of fourth graders in the United States read at or above the "proficient" level on state standardized tests in 2009 (D'Ardenne, Barnes, Hightower, Lamason, Mason, Patterson, Stephens, Wilson, Smith & Erickson,…

  16. Detection of acute deterioration in health status visit among COPD patients by monitoring COPD assessment test score

    Directory of Open Access Journals (Sweden)

    Pothirat C

    2015-02-01

    Chaicharn Pothirat, Warawut Chaiwong, Atikun Limsukon, Athavudh Deesomchok, Chalerm Liwsrisakun, Chaiwat Bumroongkit, Theerakorn Theerakittikul, Nittaya Phetsuk. Division of Pulmonary, Critical Care and Allergy, Department of Internal Medicine, Faculty of Medicine, Chiang Mai University, Chiang Mai, Thailand. Background: The Chronic Obstructive Pulmonary Disease Assessment Test (CAT) could play a role in detecting acute deterioration in health status during monitoring visits in routine clinical practice. Objective: To evaluate the discriminative property of a change in CAT score from a stable baseline visit for detecting acute deterioration in health status visits of chronic obstructive pulmonary disease (COPD) patients. Methods: The CAT questionnaire was administered to stable COPD patients routinely attending the chest clinic of Chiang Mai University Hospital who were monitored using the CAT score every 1–3 months for 15 months. Acute deterioration in health status was defined as worsening or exacerbation. CAT scores at baseline and at subsequent visits with acute deterioration in health status were analyzed using the t-test. Receiver operating characteristic curve analysis was performed to evaluate the discriminative property of change in CAT score for detecting acute deterioration during a health status visit. Results: A total of 354 follow-up visits were made by 140 patients, aged 71.1±8.4 years, with a forced expiratory volume in 1 second of 47.49%±18.2% predicted, who were monitored for 15 months. The mean CAT score changes between stable baseline visits, by patients’ and physicians’ global assessments, were 0.05 (95% confidence interval [CI], -0.37–0.46) and 0.18 (95% CI, -0.23–0.60), respectively. At worsening visits, as assessed by patients, there was a significant increase in CAT score (6.07; 95% CI, 4.95–7.19). There were also significant increases in CAT scores at visits with mild and moderate exacerbation (5.51 [95% CI, 4.39–6

  17. The quantitative LOD score: test statistic and sample size for exclusion and linkage of quantitative traits in human sibships.

    Science.gov (United States)

    Page, G P; Amos, C I; Boerwinkle, E

    1998-04-01

    We present a test statistic, the quantitative LOD (QLOD) score, for the testing of both linkage and exclusion of quantitative-trait loci in randomly selected human sibships. As with the traditional LOD score, the boundary values of 3, for linkage, and -2, for exclusion, can be used for the QLOD score. We investigated the sample sizes required for inferring exclusion and linkage, for various combinations of linked genetic variance, total heritability, recombination distance, and sibship size, using fixed-size sampling. The sample sizes required for both linkage and exclusion were not qualitatively different and depended on the percentage of variance being linked or excluded and on the total genetic variance. Information regarding linkage and exclusion in sibships larger than size 2 increased approximately as the number of all possible pairs, n(n-1)/2, up to sibships of size 6. Increasing the recombination distance (theta) between the marker and the trait loci empirically reduced the power for both linkage and exclusion, as a function of approximately (1-2theta)^4.
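    The two approximations quoted in this abstract can be illustrated with a short sketch. It assumes that the abstract's "(1-2theta)4" denotes a fourth power, and the function names are hypothetical; it does not reproduce the QLOD statistic itself, only the pair count n(n-1)/2 and the attenuation factor.

```python
def sib_pairs(n: int) -> int:
    """Number of distinct sib pairs in a sibship of size n, i.e. n(n-1)/2."""
    if n < 2:
        return 0
    return n * (n - 1) // 2


def recombination_attenuation(theta: float) -> float:
    """Approximate attenuation factor (1 - 2*theta)**4.

    Assumption: the exponent 4 is read from the abstract's garbled
    "(1-2theta)4"; theta is the recombination fraction in [0, 0.5].
    """
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie in [0, 0.5]")
    return (1.0 - 2.0 * theta) ** 4


if __name__ == "__main__":
    # Pair counts for the sibship sizes discussed in the abstract (2 to 6).
    for n in range(2, 7):
        print(f"sibship size {n}: {sib_pairs(n)} pairs")
    # Attenuation at a few illustrative recombination fractions.
    for theta in (0.0, 0.05, 0.1, 0.2):
        print(f"theta={theta}: factor={recombination_attenuation(theta):.4f}")
```

    Under this reading, the factor falls steeply as theta grows, which is consistent with the abstract's statement that power drops with increasing recombination distance.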

  18. Stability of person ability measures in people with acquired brain injury in the use of everyday technology: the test-retest reliability of the Management of Everyday Technology Assessment (META).

    Science.gov (United States)

    Malinowsky, Camilla; Kassberg, Ann-Charlotte; Larsson-Lund, Maria; Kottorp, Anders

    2016-01-01

    To evaluate the test-retest reliability of the Management of Everyday Technology Assessment (META) in a sample of people with acquired brain injury (ABI). The META was administered twice within a two-week period to 25 people with ABI. A Rasch measurement model was used to convert the META ordinal raw scores into equal-interval linear measures of each participant's ability to manage everyday technology (ET). Test-retest reliability of the stability of the person ability measures in the META was examined by a standardized difference Z-test and an intra-class correlations analysis (ICC 1). The results showed that the paired person ability measures generated from the META were stable over the test-retest period for 22 of the 25 subjects. The ICC 1 correlation was 0.63, which indicates good overall reliability. The META demonstrated acceptable test-retest reliability in a sample of people with ABI. The results illustrate the importance of using sufficiently challenging ETs (relative to a person's abilities) to generate stable META measurements over time. Implications for Rehabilitation: The findings add evidence regarding the test-retest reliability of the person ability measures generated from the observation assessment META in a sample of people with ABI. The META might support professionals in the evaluation of interventions that are designed to improve clients' performance of activities including the ability to manage ET.
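    A standardized difference Z-test on paired person measures can be sketched as follows. The abstract does not give the formula, so this assumes the form often used with Rasch measures, z = (m1 - m2) / sqrt(se1^2 + se2^2), with a two-sided 1.96 cutoff; the function names and the example values are hypothetical.

```python
from math import sqrt


def standardized_difference(m1: float, se1: float, m2: float, se2: float) -> float:
    """z statistic for two measures of the same person with standard errors.

    Assumed form: z = (m1 - m2) / sqrt(se1**2 + se2**2).
    """
    return (m1 - m2) / sqrt(se1 ** 2 + se2 ** 2)


def is_stable(z: float, critical: float = 1.96) -> bool:
    """Treat a pair of measures as stable when |z| does not exceed the cutoff."""
    return abs(z) <= critical


if __name__ == "__main__":
    # Hypothetical test and retest measures (in logits) for one participant.
    z = standardized_difference(1.0, 0.3, 0.6, 0.4)
    print(f"z = {z:.2f}, stable = {is_stable(z)}")
```

    Applying such a check to each participant's pair of measures would yield a per-person stability count of the kind the abstract reports.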

  19. Report test scores assessment of the functioning of dosimetry service staff of the CNLV

    International Nuclear Information System (INIS)

    Alvarez R, J.T.; Tovar M, V.M.

    2002-12-01

    The ININ evaluated the personal dosimetry service of the CNLV in categories IV (high-energy photons from 137Cs) and VA (beta particles from 90Sr/90Y). In category IV the test was satisfactory; however, chart 1 shows an underestimation of 9% relative to the conventional true value under the American Standard HP. Because of this irregularity, it is recommended to revise the evaluation procedures of the process and the determination of chart 1 of the HP. In category VA the test was also satisfactory; however, when the results were contrasted with chart 2 and the HP, the values were overestimated by 29% relative to the conventional true value, and for that problem it is recommended to revise the evaluation procedures against the values determined by the standard HP. (Author)

  20. TOEFL Paper Based Test (PBT) Score Conversion System at Politeknik Negeri Cilacap Using the User Centered Design Method

    Directory of Open Access Journals (Sweden)

    Cahya Vikasari

    2017-06-01

    Interactive computer systems are used by users to support their work. The user is an important object in system development and construction. Users are the individuals directly involved in using an application. The concept of UCD is that the user is the center of the system development process, and the goals, characteristics, context, and environment of the system are all based on user experience. The TOEFL paper based test (PBT) scoring system at the language unit (UPT Bahasa) of Politeknik Negeri Cilacap was built using the UCD method. With the UCD method, the system can simplify and speed up registration for prospective registrants through a user-friendly interface, simplify data management and the recapitulation of registrant data, simplify TOEFL score conversion, which is performed automatically, and minimize errors, data duplication, and duplicated activities.

  1. Reliability and Validity of the New Tanaka B Intelligence Scale Scores: A Group Intelligence Test

    OpenAIRE

    Uno, Yota; Mizukami, Hitomi; Ando, Masahiko; Yukihiro, Ryoji; Iwasaki, Yoko; Ozaki, Norio

    2014-01-01

    OBJECTIVE: The present study evaluated the reliability and concurrent validity of the new Tanaka B Intelligence Scale, which is an intelligence test that can be administered on groups within a short period of time. METHODS: The new Tanaka B Intelligence Scale and Wechsler Intelligence Scale for Children-Third Edition were administered to 81 subjects (mean age ± SD 15.2 ± 0.7 years) residing in a juvenile detention home; reliability was assessed using Cronbach's alpha coefficient, and concurre...
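    Cronbach's alpha, the reliability coefficient named in this abstract, can be computed from a respondents-by-items score matrix. The sketch below uses the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores); the function name and the example data are hypothetical and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(scores: list[list[float]]) -> float:
    """Cronbach's alpha for a matrix with one row per respondent.

    Each row holds that respondent's scores on the k items.
    """
    n_items = len(scores[0])
    if n_items < 2 or len(scores) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(n_items)]
    # Sample variance of the respondents' total scores.
    total_var = variance([sum(row) for row in scores])
    if total_var == 0:
        raise ValueError("total scores have zero variance")
    return (n_items / (n_items - 1)) * (1 - sum(item_vars) / total_var)


if __name__ == "__main__":
    # Hypothetical data: 4 respondents, 3 items.
    data = [[2, 3, 3], [4, 4, 5], [1, 2, 2], [5, 5, 4]]
    print(f"alpha = {cronbach_alpha(data):.3f}")
```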

  2. Refining Ovarian Cancer Test accuracy Scores (ROCkeTS): protocol for a prospective longitudinal test accuracy study to validate new risk scores in women with symptoms of suspected ovarian cancer

    Science.gov (United States)

    Sundar, Sudha; Rick, Caroline; Dowling, Francis; Au, Pui; Rai, Nirmala; Champaneria, Rita; Stobart, Hilary; Neal, Richard; Davenport, Clare; Mallett, Susan; Sutton, Andrew; Kehoe, Sean; Timmerman, Dirk; Bourne, Tom; Van Calster, Ben; Gentry-Maharaj, Aleksandra; Deeks, Jon

    2016-01-01

    Introduction: Ovarian cancer (OC) is associated with non-specific symptoms such as bloating, making accurate diagnosis challenging: only 1 in 3 women with OC presents through primary care referral. National Institute for Health and Care Excellence guidelines recommend sequential testing with CA125 and routine ultrasound in primary care. However, these diagnostic tests have limited sensitivity or specificity. Improving accurate triage in women with vague symptoms is likely to improve mortality by streamlining referral and care pathways. The Refining Ovarian Cancer Test Accuracy Scores (ROCkeTS; HTA 13/13/01) project will derive and validate new tests/risk prediction models that estimate the probability of having OC in women with symptoms. This protocol refers to the prospective study only (phase III). Methods and analysis: ROCkeTS comprises four parallel phases. The full ROCkeTS protocol can be found at http://www.birmingham.ac.uk/ROCKETS. Phase III is a prospective test accuracy study. The study will recruit 2450 patients from 15 UK sites. Recruited patients complete symptom and anxiety questionnaires, donate a serum sample and undergo ultrasound scored as per International Ovarian Tumour Analysis (IOTA) criteria. Recruitment is at rapid access clinics, emergency departments and elective clinics. Models to be evaluated include those based on ultrasound derived by the IOTA group and novel models derived from analysis of existing data sets. Estimates of sensitivity, specificity, c-statistic (area under receiver operating curve), positive predictive value and negative predictive value of diagnostic tests are evaluated and a calibration plot for models will be presented. ROCkeTS has received ethical approval from the NHS West Midlands REC (14/WM/1241) and is registered on the controlled trials website (ISRCTN17160843) and the National Institute of Health Research Cancer and Reproductive Health portfolios. PMID:27507231
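    Four of the accuracy estimates listed in this protocol (sensitivity, specificity, positive and negative predictive value) follow directly from a 2x2 table of test result against disease status. The sketch below uses the standard definitions; the function name and the counts are hypothetical and are not ROCkeTS data, and the c-statistic is left out because it needs per-patient risk scores rather than a single table.

```python
def diagnostic_accuracy(tp: int, fp: int, fn: int, tn: int) -> dict[str, float]:
    """Standard accuracy measures from a 2x2 table.

    tp/fn: diseased patients testing positive/negative.
    fp/tn: non-diseased patients testing positive/negative.
    """
    if min(tp + fn, tn + fp, tp + fp, tn + fn) == 0:
        raise ValueError("every row and column of the table must be non-empty")
    return {
        "sensitivity": tp / (tp + fn),  # diseased who test positive
        "specificity": tn / (tn + fp),  # non-diseased who test negative
        "ppv": tp / (tp + fp),          # positives who are diseased
        "npv": tn / (tn + fn),          # negatives who are non-diseased
    }


if __name__ == "__main__":
    # Hypothetical counts chosen only to illustrate the arithmetic.
    for name, value in diagnostic_accuracy(tp=80, fp=30, fn=20, tn=870).items():
        print(f"{name}: {value:.3f}")
```

    Unlike sensitivity and specificity, the predictive values depend on how common the disease is among those tested, so they would shift between, say, rapid access clinics and emergency departments.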

  3. Refining Ovarian Cancer Test accuracy Scores (ROCkeTS): protocol for a prospective longitudinal test accuracy study to validate new risk scores in women with symptoms of suspected ovarian cancer.

    Science.gov (United States)

    Sundar, Sudha; Rick, Caroline; Dowling, Francis; Au, Pui; Snell, Kym; Rai, Nirmala; Champaneria, Rita; Stobart, Hilary; Neal, Richard; Davenport, Clare; Mallett, Susan; Sutton, Andrew; Kehoe, Sean; Timmerman, Dirk; Bourne, Tom; Van Calster, Ben; Gentry-Maharaj, Aleksandra; Menon, Usha; Deeks, Jon

    2016-08-09

    Ovarian cancer (OC) is associated with non-specific symptoms such as bloating, making accurate diagnosis challenging: only 1 in 3 women with OC presents through primary care referral. National Institute for Health and Care Excellence guidelines recommend sequential testing with CA125 and routine ultrasound in primary care. However, these diagnostic tests have limited sensitivity or specificity. Improving accurate triage in women with vague symptoms is likely to improve mortality by streamlining referral and care pathways. The Refining Ovarian Cancer Test Accuracy Scores (ROCkeTS; HTA 13/13/01) project will derive and validate new tests/risk prediction models that estimate the probability of having OC in women with symptoms. This protocol refers to the prospective study only (phase III). ROCkeTS comprises four parallel phases. The full ROCkeTS protocol can be found at http://www.birmingham.ac.uk/ROCKETS. Phase III is a prospective test accuracy study. The study will recruit 2450 patients from 15 UK sites. Recruited patients complete symptom and anxiety questionnaires, donate a serum sample and undergo ultrasound scored as per International Ovarian Tumour Analysis (IOTA) criteria. Recruitment is at rapid access clinics, emergency departments and elective clinics. Models to be evaluated include those based on ultrasound derived by the IOTA group and novel models derived from analysis of existing data sets. Estimates of sensitivity, specificity, c-statistic (area under receiver operating curve), positive predictive value and negative predictive value of diagnostic tests are evaluated and a calibration plot for models will be presented. ROCkeTS has received ethical approval from the NHS West Midlands REC (14/WM/1241) and is registered on the controlled trials website (ISRCTN17160843) and the National Institute of Health Research Cancer and Reproductive Health portfolios. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted

  4. Predisposing factors of pneumothorax in percutaneous transthoracic fine needle aspiration biopsy: comparison between CT emphysema score and pulmonary function test

    International Nuclear Information System (INIS)

    Lee, Chang Ho; Park, Kyung Joo; Park, Dong Won; Jung, Kyung Il; Suh, Jung Ho

    1997-01-01

    To compare the CT emphysema score with various factors of pulmonary function test by simple spirometry and to use the result as a predictor of pneumothorax in percutaneous transthoracic fine needle aspiration biopsy. The CT scans of 106 patients who had undergone percutaneous transthoracic fine needle aspiration biopsy of lung lesions within the previous 18 months were retrospectively reviewed. In 75 of these 106 cases, the results of the pulmonary function test were also reviewed. On plain chest radiography, pneumothorax was noted in 20 cases (19%). Emphysema was blindly evaluated. We divided each lung into four segments and determined the severity and involved volume of emphysema, as seen on CT. Severity was classified as one of four grades, as follows: absence of emphysema = 0; low attenuation area of less than 5 mm = 1; low attenuation area of more than 5 mm, and vascular pruning with normal lung intervening = 2; and diffuse low attenuation without intervening normal lung, and larger confluent low attenuation with vascular pruning and distortion of branching pattern occupying all or almost all the involved parenchyma = 3. The involved area was also classified as one of four grades: less than 25% = 1; 25-49% = 2; 51-74% = 3; and more than 75% = 4. The CT emphysema score was defined as the average of the grade of severity multiplied by the grade of involved area. Pulmonary function tests, consisting of simple spirometry and a pulmonologist's interpretation, were evaluated. We also evaluated depth and size of lesion as known predisposing factors in postbioptic pneumothorax. Statistical analysis was performed using the chi-square test, the Wilcoxon rank sum W test and Student's t test. A comparison between the two groups of occurrence (with or without pneumothorax) showed the emphysema scores to be 1.69±2.0 and 1.11±2.9, respectively; there was thus no significant difference between the two groups (z = -0.048, p>0.10). Nor were differences revealed by the pulmonary
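    The scoring rule described in this abstract (severity grade times area grade, averaged over lung segments) can be sketched in a few lines. The function name and the example grades are hypothetical; the abstract defines area grades 1 to 4 only, so recording an area grade of 0 for a segment without emphysema is an assumption made here, and the product is zero in that case either way.

```python
def ct_emphysema_score(segments: list[tuple[int, int]]) -> float:
    """Average of severity grade x area grade over all lung segments.

    segments: one (severity, area) pair per segment, with severity in 0..3
    and area in 0..4 (0 is an assumed value for unaffected segments).
    """
    if not segments:
        raise ValueError("at least one segment is required")
    for severity, area in segments:
        if not 0 <= severity <= 3:
            raise ValueError(f"severity grade out of range: {severity}")
        if not 0 <= area <= 4:
            raise ValueError(f"area grade out of range: {area}")
    return sum(severity * area for severity, area in segments) / len(segments)


if __name__ == "__main__":
    # Hypothetical patient: four segments per lung, eight in total.
    right_lung = [(0, 0), (0, 0), (0, 0), (0, 0)]
    left_lung = [(1, 1), (2, 2), (3, 4), (1, 2)]
    print(ct_emphysema_score(right_lung + left_lung))
```

    With four segments per lung, each patient contributes eight products, and the score ranges from 0 (no emphysema anywhere) to 12 (grade 3 severity over more than 75% of every segment).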

  5. Predisposing factors of pneumothorax in percutaneous transthoracic fine needle aspiration biopsy: comparison between CT emphysema score and pulmonary function test

    Energy Technology Data Exchange (ETDEWEB)

    Lee, Chang Ho; Park, Kyung Joo; Park, Dong Won; Jung, Kyung Il; Suh, Jung Ho [Ajou Univ. College of Medicine, Seoul (Korea, Republic of)

    1997-11-01

    To compare the CT emphysema score with various factors of pulmonary function test by simple spirometry and to use the result as a predictor of pneumothorax in percutaneous transthoracic fine needle aspiration biopsy. The CT scans of 106 patients who had undergone percutaneous transthoracic fine needle aspiration biopsy of lung lesions within the previous 18 months were retrospectively reviewed. In 75 of these 106 cases, the results of the pulmonary function test were also reviewed. On plain chest radiography, pneumothorax was noted in 20 cases (19%). Emphysema was blindly evaluated. We divided each lung into four segments and determined the severity and involved volume of emphysema, as seen on CT. Severity was classified as one of four grades, as follows: absence of emphysema = 0; low attenuation area of less than 5 mm = 1; low attenuation area of more than 5 mm, and vascular pruning with normal lung intervening = 2; and diffuse low attenuation without intervening normal lung, and larger confluent low attenuation with vascular pruning and distortion of branching pattern occupying all or almost all the involved parenchyma = 3. The involved area was also classified as one of four grades: less than 25% = 1; 25-49% = 2; 51-74% = 3; and more than 75% = 4. The CT emphysema score was defined as the average of the grade of severity multiplied by the grade of involved area. Pulmonary function tests, consisting of simple spirometry and a pulmonologist's interpretation, were evaluated. We also evaluated depth and size of lesion as known predisposing factors in postbioptic pneumothorax. Statistical analysis was performed using the chi-square test, the Wilcoxon rank sum W test and Student's t test. A comparison between the two groups of occurrence (with or without pneumothorax) showed the emphysema scores to be 1.69±2.0 and 1.11±2.9, respectively; there was thus no significant difference between the two groups (z = -0.048, p>0.10). Nor were differences revealed by the

  6. The students' ability in the mathematical literacy for uncertainty problems on the PISA adaptation test

    Science.gov (United States)

    Julie, Hongki; Sanjaya, Febi; Anggoro, Ant. Yudhi

    2017-08-01

    One of the purposes of this study was to describe the solution profile of junior high school students on a PISA adaptation test. To achieve this objective, the researchers (1) adapted the PISA test, (2) validated the adapted PISA test, (3) asked junior high school students to take the adapted PISA test, and (4) compiled the students' solution profile. The PISA problems for mathematics can be classified into four areas, namely quantity, space and shape, change and relationship, and uncertainty. The results presented in this paper are the test results for the uncertainty problems. The adapted PISA test contained fifteen questions. The subjects in this study were 18 students from 11 junior high schools in Yogyakarta, Central Java, and Banten. The study was qualitative. For the first uncertainty problem in the adapted test, 66.67% of students reached level 3. For the second uncertainty problem, 44.44% of students achieved level 4, and 33.33% reached level 3. For the third uncertainty problem, 38.89% of students achieved level 5, 11.11% reached level 4, and 5.56% achieved level 3. For part a of the fourth uncertainty problem, 72.22% of students reached level 4, and for part b, 83.33% of students achieved level 4.

  7. COMPARISON BETWEEN WOOD DRYING DEFECT SCORES: SPECIMEN TESTING X ANALYSIS OF KILN-DRIED BOARDS

    OpenAIRE

    Djeison Cesar Batista; Márcio Pereira da Rocha; Ricardo Jorge Klitzke

    2015-01-01

    It is important to develop drying technologies for Eucalyptus grandis lumber, which is one of the most planted species of this genus in Brazil and plays an important role as raw material for the wood industry. The general aim of this work was to assess the conventional kiln drying of juvenile wood of three clones of Eucalyptus grandis. The specific aims were to compare the behavior between: i) drying defects indicated by tests with wood specimens and conventional kiln-dried boards; and ii) ph...

  8. Measuring Creative Imagery Abilities

    Directory of Open Access Journals (Sweden)

    Dorota M. Jankowska

    2015-10-01

    Over the decades, creativity and imagination research developed in parallel, but they surprisingly rarely intersected. This paper introduces a new theoretical model of creative imagination, which bridges creativity and imagination research, as well as presents a new psychometric instrument, called the Test of Creative Imagery Abilities (TCIA), developed to measure creative imagery abilities understood in accordance with this model. Creative imagination is understood as constituted by three interrelated components: vividness (the ability to create images characterized by a high level of complexity and detail), originality (the ability to produce unique imagery), and transformativeness (the ability to control imagery). TCIA enables valid and reliable measurement of these three groups of abilities, yielding the general score of imagery abilities and at the same time making profile analysis possible. We present the results of eight studies on a total sample of more than 1,700 participants, showing the factor structure of TCIA using confirmatory factor analysis, as well as provide data confirming this instrument’s validity and reliability. The availability of TCIA for interested researchers may result in new insights and possibilities of integrating the fields of creativity and imagination science.

  9. Verifying the functional ability of microstructured surfaces by model-based testing

    Science.gov (United States)

    Hartmann, Wito; Weckenmann, Albert

    2014-09-01

    Micro- and nanotechnology enables the use of new product features such as improved light absorption, self-cleaning or protection, which are based, on the one hand, on the size of functional nanostructures and, on the other hand, on material-specific properties. With the need to reliably measure progressively smaller geometric features, coordinate and surface-measuring instruments have been refined and now allow high-resolution topography and structure measurements down to the sub-nanometre range. Nevertheless, in many cases it is not possible to make a clear statement about the functional ability of the workpiece or its topography because conventional concepts of dimensioning and tolerancing are solely geometry oriented and standardized surface parameters are not sufficient to consider interaction with non-geometric parameters, which are dominant for functions such as sliding, wetting, sealing and optical reflection. To verify the functional ability of microstructured surfaces, a method was developed based on a parameterized mathematical-physical model of the function. From this model, function-related properties can be identified and geometric parameters can be derived, which may be different for the manufacturing and verification processes. With this method it is possible to optimize the definition of the shape of the workpiece regarding the intended function by applying theoretical and experimental knowledge, as well as modelling and simulation. Advantages of this approach will be discussed and demonstrated by the example of a microstructured inking roll.

  10. Does Stereotype Threat Affect Post-Course Scores on the Astronomy Diagnostic Test?

    Science.gov (United States)

    Deming, G. L.; Hufnagel, B.; Landato, J. M.; Hodari, A. K.

    2003-12-01

    During the 1990s, Claude Steele and others demonstrated that women mathematics students under-performed while men over-performed on selected GRE questions when told that the exam could differentiate by gender. Stereotype threat is triggered for these women when they fear someone else may negatively stereotype them, and therefore, their performance is affected. In a limited study involving 229 students, we investigated the effect of stereotype threat on performance on the Astronomy Diagnostic Test (ADT). The ADT was administered as a pre-test in four introductory astronomy classes intended for non-science majors. The same professors taught pairs of classes at the University of Maryland, a large research institution, and W. R. Harper College, a small liberal arts school. The classes were treated the same until the final day before the post-course ADT was given. One "threatened" class at each campus was told that gender mattered so they should be sure to include it on the ADT. The "control" classes were told that gender does not matter. The results show no stereotype threat effect on the women in these introductory classes. The university men did slightly over-perform at low statistical significance. As Steele suggested, students must identify with a subject in order to strongly invoke a stereotype threat. This research was supported in part by the National Science Foundation through grants REC-0089239 to GLD, DGE-97014489 to BH, and DGE-9714452 for AKH.

  11. The implications of policy pre-post test scores for street-level bureaucratic discretion.

    Science.gov (United States)

    Dorch, Edwina L

    2009-01-01

    Substantial reductions in audit error rates observed over the past few years suggest eligibility workers have moved toward an eligibility compliance culture described by Bane and Ellwood. However, the results of this study indicate that social service caseworkers responded correctly to 49% of the targeted policy items at the pre-test stage and 68% at the post-test stage. Such findings provide preliminary support for the hypothesis that, in instances when caseworkers lack policy knowledge, they use their own discretion. Such a finding not only supports Lipsky's theory but also supports the notion that administrators should be encouraged to utilize 'mastery learning' procedures whereby caseworkers are retained in new-hire and follow-up training classes until they have mastered 100% of targeted policy information. Retention of caseworkers may also reduce federal and local audit errors and errors in crediting the reduction of caseloads to social service policies when in fact significant components of them have not been implemented (learned or utilized). And, most importantly, retention in training classes may increase the appropriate provision of services to the needy.

  12. Opioid abusers’ ability to differentiate an opioid from placebo in laboratory challenge testing*

    Science.gov (United States)

    Antoine, Denis G.; Strain, Eric C.; Tompkins, D. Andrew; Bigelow, George E.

    2013-01-01

    Background: Abuse liability assessments influence drug development, federal regulation, and clinical care. One suggested procedure to reduce variability of assessments is a qualification phase, which assesses whether study applicants adequately distinguish active drug from placebo; applicants failing to make this distinction are disqualified. The present analyses assessed differences between qualification phase qualifiers and non-qualifiers. Methods: Data were collected from 23 completers of the qualification phase of an abuse liability study. Opioid abusing participants received 30 mg oxycodone and placebo orally on separate days, and were characterized as qualifiers (vs. non-qualifiers) if their peak visual analog scale liking rating for oxycodone was at least 20 points higher than placebo’s peak rating. Groups were compared on demographic characteristics, drug history, and physiologic, subject and observer ratings. Results: 61% of participants were qualifiers and 39% were non-qualifiers. Groups had similar demographic characteristics, drug use histories, and pupillary constriction responses. However, unlike qualifiers, non-qualifiers had an exaggerated placebo response for the liking score (p=0.03) and an attenuated oxycodone response for the liking score (p<.0001). Non-qualifiers’ failure to differentiate oxycodone versus placebo was evident for subject and observer ratings. Conclusion: Different subjective responses to identical stimuli support the use of a qualification phase in abuse liability assessments. Further research should explore objective measures that may better account for these differences, determine optimal qualification criteria, and explore the developmental course of drug use. This study also documents that certain opioid abusers fail to differentiate 30 mg of oxycodone from placebo, a phenomenon deserving further study. PMID:23369645
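
    The qualification rule described in the abstract above (peak liking rating for oxycodone at least 20 points above the peak rating for placebo) can be written as a short check. This is only a sketch: the function name, the representation of each session as a list of visual analog scale ratings, and the sample ratings are hypothetical and are not taken from the study.

```python
def is_qualifier(drug_ratings, placebo_ratings, margin=20):
    """Return True if the peak liking rating under active drug exceeds
    the peak liking rating under placebo by at least `margin` points."""
    return max(drug_ratings) - max(placebo_ratings) >= margin


# Hypothetical participants: (ratings after oxycodone, ratings after placebo).
participants = {
    "P01": ([10, 55, 75, 60], [5, 20, 40, 15]),   # peaks 75 vs 40
    "P02": ([12, 30, 50, 35], [10, 45, 30, 20]),  # peaks 50 vs 45
}

for label, (drug, placebo) in participants.items():
    status = "qualifier" if is_qualifier(drug, placebo) else "non-qualifier"
    print(label, status)
```

    Comparing peaks rather than means follows the wording of the abstract; any other summary of the rating curve would be a different criterion.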

  13. DISCRIMINATIVE ANALYSIS OF TESTS FOR EVALUATING SITUATIONMOTORIC ABILITIES BETWEEN TWO GROUPS OF BASKETBALL PLAYERS SELECTED BY THE TEST OF SOCIOMETRY

    OpenAIRE

    Abdulla Elezi; Nazim Myrtaj; Florian Miftari

    2011-01-01

    The aim of this work was to determine differences between two groups of basketball players, selected with the modified sociometric test (Paranosić and Lazarević), in some tests for assessing situation-motor skills. The test sample consisted of the 20 basketball players who had the most positive points and the 20 basketball players who had the most negative points, 40 players in total. T-test was applied to determine whether there are differences between the two groups of basketball players who had bee...

  14. Associations between cadmium exposure and neurocognitive test scores in a cross-sectional study of US adults.

    Science.gov (United States)

    Ciesielski, Timothy; Bellinger, David C; Schwartz, Joel; Hauser, Russ; Wright, Robert O

    2013-02-05

    Low-level environmental cadmium exposure and neurotoxicity have not been well studied in adults. Our goal was to evaluate associations between neurocognitive exam scores and a biomarker of cumulative cadmium exposure among adults in the Third National Health and Nutrition Examination Survey (NHANES III). NHANES III is a nationally representative cross-sectional survey of the U.S. population conducted between 1988 and 1994. We analyzed data from a subset of participants, age 20-59, who participated in a computer-based neurocognitive evaluation. There were four outcome measures: the Simple Reaction Time Test (SRTT: visual motor speed), the Symbol Digit Substitution Test (SDST: attention/perception), the Serial Digit Learning Test (SDLT) trials-to-criterion, and the SDLT total-error-score (SDLT-tests: learning recall/short-term memory). We fit multivariable-adjusted models to estimate associations between urinary cadmium concentrations and test scores. 5662 participants underwent neurocognitive screening, and 5572 (98%) of these had a urinary cadmium level available. Prior to multivariable-adjustment, higher urinary cadmium concentration was associated with worse performance in each of the 4 outcomes. After multivariable-adjustment most of these relationships were not significant, and age was the most influential variable in reducing the association magnitudes. However, among never-smokers with no known occupational cadmium exposure, the relationship between urinary cadmium and SDST score (attention/perception) was significant: a 1 μg/L increase in urinary cadmium corresponded to a 1.93% (95% CI: 0.05, 3.81) decrement in performance. These results suggest that higher cumulative cadmium exposure in adults may be related to subtly decreased performance in tasks requiring attention and perception, particularly among those adults whose cadmium exposure is primarily through diet (no smoking or work based cadmium exposure). This association was observed among exposure levels

  15. The differential item functioning and structural equivalence of a nonverbal cognitive ability test for five language groups

    Directory of Open Access Journals (Sweden)

    Pieter Schaap

    2011-10-01

    Research purpose: The aim of the study was to determine the differential item functioning (DIF) and structural equivalence of a nonverbal cognitive ability test (the PiB/SpEEx Observance test [401]) for five South African language groups. Motivation for study: Cultural and language group sensitive tests can lead to unfair discrimination and are a contentious workplace issue in South Africa today. Misconceptions about psychometric testing in industry can cause tests to lose credibility if industries do not use a scientifically sound test-by-test evaluation approach. Research design, approach and method: The researcher used a quasi-experimental design and factor analytic and logistic regression techniques to meet the research aims. The study used a convenience sample drawn from industry and an educational institution. Main findings: The main findings of the study show structural equivalence of the test at a holistic level and nonsignificant DIF effect sizes for most of the comparisons that the researcher made. Practical/managerial implications: This research shows that the PiB/SpEEx Observance Test (401) is not completely language insensitive. One should see it rather as a language-reduced test when people from different language groups need testing. Contribution/value-add: The findings provide supporting evidence that nonverbal cognitive tests are plausible alternatives to verbal tests when one compares people from different language groups.

  16. Analitycal Descriptive Study of Students' Critical Mathematic Thinking Ability Through Graded Response Model (Grm)

    OpenAIRE

    nurul, didin; zahra anasha, zara

    2013-01-01

    Critical mathematical thinking ability is very important for solving daily problems. In reality, however, junior high school students' critical mathematical thinking ability is still low. Abilities such as critical mathematical thinking cannot be measured through a multiple-choice test. For that purpose, an essay test scored with a graded scoring technique is more suitable than a multiple-choice test. The result of the essay test will be analyzed to describe...

  17. Assessing the discriminating power of item and test scores in the linear factor-analysis model

    Directory of Open Access Journals (Sweden)

    Pere J. Ferrando

    2012-01-01

    Rigorous, model-based proposals for studying the imprecise concept of "discriminating power" are scarce and generally limited to nonlinear models for binary items. This article proposes a general framework for assessing the discriminating power of item and test scores that are calibrated with the common one-factor model. The proposal is organized around three criteria: (a) type of score, (b) range of discrimination, and (c) the specific aspect that is assessed. Within the proposed framework, (a) the relationships among 16 measures, 6 of which appear to be new, are discussed, and (b) the relationships among them are studied. The usefulness of the proposal in psychometric applications that use the factor model is illustrated with an empirical example.

  18. The Reliability of Clock Drawing Test Scoring Systems Modeled on the Normative Data in Healthy Aging and Nonamnestic Mild Cognitive Impairment.

    Science.gov (United States)

    Mazancova, Adela Fendrych; Nikolai, Tomas; Stepankova, Hana; Kopecek, Miloslav; Bezdicek, Ondrej

    2017-10-01

    The Clock Drawing Test (CDT) is a commonly used tool in clinical practice and research for cognitive screening among older adults. The main goal of the present study was to analyze the interrater reliability of three different CDT scoring systems (by Shulman et al., Babins et al., and Cohen et al.). We used a clock with a predrawn circle. The CDT was evaluated by three independent raters based on the normative data set of healthy older and very old adults and patients with nonamnestic mild cognitive impairment (naMCI; N = 438; aged 61-94). We confirmed a high interrater reliability measured by the intraclass correlation coefficients (ICCs): Shulman ICC = .809, Babins ICC = .894, and Cohen ICC = .862, all p < .001. We found that age and education levels have a significant effect on CDT performance, yet there was no influence of gender. Finally, the scoring systems differentiated between naMCI and age- and education-matched controls: Shulman's area under the receiver operating characteristic curve (AUC) = .84, Cohen AUC = .71, all p < .001; and a slightly lower discriminative ability was shown by Babins: AUC = .65, p = .012.

  19. ¿Exito en California? A Validity Critique of Language Program Evaluations and Analysis of English Learner Test Scores

    Directory of Open Access Journals (Sweden)

    Marilyn S. Thompson

    2002-01-01

    Several states have recently faced ballot initiatives that propose to functionally eliminate bilingual education in favor of English-only approaches. Proponents of these initiatives have argued that an overall rise in standardized achievement scores of California's limited English proficient (LEP) students is largely due to the implementation of English immersion programs mandated by Proposition 227 in 1998; hence, they claim Exito en California (Success in California). However, many such arguments presented in the media were based on flawed summaries of these data. We first discuss the background, media coverage, and previous research associated with California's Proposition 227. We then present a series of validity concerns regarding use of Stanford-9 achievement data to address policy for educating LEP students; these concerns include the language of the test, alternative explanations, sample selection, and data analysis decisions. Finally, we present a comprehensive summary of scaled-score achievement means and trajectories for California's LEP and non-LEP students for 1998-2000. Our analyses indicate that although scores have risen overall, the achievement gap between LEP and EP students does not appear to be narrowing.

  20. State-trait decomposition of Name Letter Test scores and relationships with global self-esteem.

    Science.gov (United States)

    Perinelli, Enrico; Alessandri, Guido; Donnellan, M Brent; Łaguna, Mariola

    2018-06-01

    The Name Letter Test (NLT) assesses the degree to which participants show a preference for their own initials. The NLT was often thought to measure implicit self-esteem, but recent literature reviews do not unequivocally support this hypothesis. Several authors have argued that the NLT is most strongly associated with the state component of self-esteem. The current research uses a modified STARTS model to (a) estimate the percentage of stable and transient components of the NLT and (b) estimate the covariances between stable/transient components of the NLT and stable/transient components of self-esteem and positive and negative affect. Two longitudinal studies were conducted with different time lags: In Study 1, participants were assessed daily for 7 consecutive days, whereas in Study 2, participants were assessed weekly for 8 consecutive weeks. Participants also completed a battery of questionnaires including global self-esteem, positive affect, and negative affect. In both studies, the NLT showed (a) high stability across time, (b) a high percentage of stable variance, (c) no significant covariance with stable and transient factors for global self-esteem, and (d) a different pattern of correlations with stable and transient factors of affect than global self-esteem. Collectively, these results further undermine the claim that the NLT is a valid measure of implicit self-esteem. Future work is needed to identify theoretically grounded correlates of the NLT.

  1. The Impact of Scholastic Instrumental Music and Scholastic Chess Study on the Standardized Test Scores of Students in Grades Three, Four, and Five

    Science.gov (United States)

    Martinez, Edwin E.

    2012-01-01

    This study examines the impact of instrumental music study and group chess lessons on the standardized test scores of suburban elementary public school students (grades three through five) in Levittown, New York. The study divides the students into the following groups and compares the standardized test scores of each: a) instrumental music…

  2. The test ability of fish Tawes to leachate garbage dump (TPA) Benowo

    Science.gov (United States)

    Juliardi AR, N. R.; Wiyanti, R. I.

    2018-01-01

    Leachate is a liquid from waste that contains dissolved and suspended elements. Garbage collected at the landfill site contains organic, inorganic, and heavy metal substances. When it rains, leachate with mineral, organic, and heavy metal content is produced. When leachate is allowed to flow to the soil surface, it can harm the surrounding environment, including humans. A toxicity test was conducted to determine the level of leachate toxicity to test animals living in surface water around the “TPA Benowo”. This study used Tawes fish 4-6 cm in length. The toxicity test was done in 2 stages. The first was a range-finding test, for which the concentrations were 0% (as control), 0.3%, 0.6%, 0.9%, 0.12%, and 0.15%. The next stage was the acute toxicity test, in which the concentrations were narrowed further to 0.18%, 0.36%, 0.54%, 0.72%, and 0.9%. The LC50 value obtained was 0.385%, while eyes, brown stomach skin.

  3. English Language Proficiency and Test Performance: An Evaluation of Bilingual Students with the Woodcock-Johnson III Tests of Cognitive Abilities

    Science.gov (United States)

    Sotelo-Dynega, Marlene; Ortiz, Samuel O.; Flanagan, Dawn P.; Chaplin, William F.

    2013-01-01

    In this article, we report the findings of an exploratory empirical study that investigated the relationship between English Language Proficiency (ELP) and performance on the Woodcock-Johnson Tests of Cognitive Abilities-Third Edition (WJ III) when administered in English to bilingual students of varying levels of ELP. Sixty-one second-grade…

  4. Assessing The Representative And Discriminative Ability Of Test Environments For Rice Breeding In Malaysia Using GGE Biplot

    Directory of Open Access Journals (Sweden)

    Yusuff Oladosu

    2017-11-01

    Identification of outstanding rice genotypes for target environments is complicated by genotype-by-environment interactions. Using genotype main effect plus genotype by environment interaction (GGE) biplot software, fifteen rice genotypes were evaluated at five locations representing the major rice-producing areas in peninsular Malaysia in two cropping seasons to (i) identify the ideal test environment for selecting superior rice genotypes and (ii) identify the discriminative and representative ability of the test locations. Genotype, location, year, and genotype by environment interaction effects revealed highly significant differences at the 0.01 probability level for number of tillers per hill, grains per panicle, grain weight per hill, and yield per hectare. Grain yield per hectare had a non-repeatable crossover pattern that formed a complex and single mega-environment. Based on the crossover pattern, a set of cultivars was selected for the whole region on the merit of mean performance and stability analysis. The tested environments were divided into two mega-environments. The ideal test environment analysis, which measures the discriminative and representative ability of a test location, revealed that Sekinchan (SC) is the best environment, while Kedah (KD) and Penang (PN) can also be considered favorable environments, whereas Serdang (SS) and Tanjung Karang (TK) were the poorest locations for selecting genotypes adapted to the whole region. This study serves as a reference for genotype evaluation as well as identification of test locations for rice breeding in Malaysia.

  5. Gaze Stabilization Test Asymmetry Score as an Indicator of Previous Concussion in a Cohort of Collegiate Football Players.

    Science.gov (United States)

    Honaker, Julie A; Criter, Robin E; Patterson, Jessie N; Jones, Sherri M

    2015-07-01

    Vestibular dysfunction may lead to decreased visual acuity with head movements, which may impede athletic performance and result in injury. The purpose of this study was to test the hypothesis that athletes with history of concussion would have differences in gaze stabilization test (GST) as compared with those without a history of concussion. Cross-sectional, descriptive. University Athletic Medicine Facility. Fifteen collegiate football players with a history of concussion, 25 collegiate football players without a history of concussion. Participants completed the dizziness handicap inventory (DHI), static visual acuity, perception time test, active yaw plane GST, stability evaluation test (SET), and a bedside oculomotor examination. Independent samples t test was used to compare GST, SET, and DHI scores per group, with Bonferroni-adjusted alpha at P history of concussion. The results support further research on the use of GST for sport-related concussion evaluation and monitoring. Inclusion of objective vestibular tests in the concussion protocol may reveal the presence of peripheral vestibular or visual-vestibular deficits. Therefore, the GST may add an important perspective on the effects of concussion.

  6. Evaluation of the Sealing Ability of Three Obturation Techniques Using a Glucose Leakage Test

    Directory of Open Access Journals (Sweden)

    Katarzyna Olczak

    2017-01-01

    The aim of this study was to evaluate the sealing ability of three different canal filling techniques. Sixty-four roots of extracted human maxillary anterior teeth were prepared using ProTaper® rotary instruments. The specimens were then randomly divided into 3 experimental groups (n=16) and 2 control groups (n=8). The root canals were filled using cold lateral compaction (CLC group), continuous wave condensation technique using the Elements Obturation Unit® (EOU group), and ProTaper obturators (PT group). For the negative control group, 8 roots were filled using lateral compaction as in the CLC group, and the teeth were covered twice with a layer of nail varnish (NCG group). Another 8 roots were filled using lateral compaction, but without sealer, and these were used as the positive control (PCG group). A glucose leakage model was used for quantitative evaluation of microleakage for 24 hours and 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, and 12 weeks. No significant difference in the cumulative amount of leakage was found between the three experimental groups at all observation times. In round canals, lateral condensation of cold gutta-percha can provide a seal of canal fillings similar to that achieved with thermal methods.

  7. Arithmetic Abilities in Children with Developmental Dyslexia: Performance on French ZAREKI-R Test

    Science.gov (United States)

    De Clercq-Quaegebeur, Maryse; Casalis, Séverine; Vilette, Bruno; Lemaitre, Marie-Pierre; Vallée, Louis

    2018-01-01

    A high comorbidity between reading and arithmetic disabilities has already been reported. The present study aims at identifying more precisely patterns of arithmetic performance in children with developmental dyslexia, defined with severe and specific criteria. By means of a standardized test of achievement in mathematics ("Calculation and…

  8. Limited ability of the proton-pump inhibitor test to identify patients with gastroesophageal reflux disease

    DEFF Research Database (Denmark)

    Bytzer, Peter; Jones, Roger; Vakil, Nimish

    2012-01-01

    The efficacy of proton-pump inhibitor (PPI) therapy often is assessed to determine whether patients' symptoms are acid-related and if patients have gastroesophageal reflux disease (GERD), although the accuracy of this approach is questionable. We evaluated the diagnostic performance of the PPI test...

  9. Lord-Wingersky Algorithm Version 2.0 for Hierarchical Item Factor Models with Applications in Test Scoring, Scale Alignment, and Model Fit Testing.

    Science.gov (United States)

    Cai, Li

    2015-06-01

    Lord and Wingersky's (Appl Psychol Meas 8:453-461, 1984) recursive algorithm for creating summed score based likelihoods and posteriors has a proven track record in unidimensional item response theory (IRT) applications. Extending the recursive algorithm to handle multidimensionality is relatively simple, especially with fixed quadrature because the recursions can be defined on a grid formed by direct products of quadrature points. However, the increase in computational burden remains exponential in the number of dimensions, making the implementation of the recursive algorithm cumbersome for truly high-dimensional models. In this paper, a dimension reduction method that is specific to the Lord-Wingersky recursions is developed. This method can take advantage of the restrictions implied by hierarchical item factor models, e.g., the bifactor model, the testlet model, or the two-tier model, such that a version of the Lord-Wingersky recursive algorithm can operate on a dramatically reduced set of quadrature points. For instance, in a bifactor model, the dimension of integration is always equal to 2, regardless of the number of factors. The new algorithm not only provides an effective mechanism to produce summed score to IRT scaled score translation tables properly adjusted for residual dependence, but leads to new applications in test scoring, linking, and model fit checking as well. Simulated and empirical examples are used to illustrate the new applications.
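
    For orientation, the original unidimensional Lord-Wingersky recursion that the abstract above builds on can be sketched as follows. This is not the dimension-reduced "version 2.0" algorithm of the paper: it assumes dichotomous items, a two-parameter logistic response function, a standard normal prior on an equally spaced grid, and hypothetical item parameters, and all function names are illustrative.

```python
import math


def two_pl(theta, a, b):
    """P(correct | theta) under a 2PL model (an assumed item model)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def lord_wingersky(p_correct):
    """Summed-score distribution at one theta for dichotomous items.

    Items are added one at a time: a score of s after adding an item
    arises from s with a wrong answer or from s - 1 with a right answer.
    """
    dist = [1.0]  # with no items, the summed score is 0 with certainty
    for p in p_correct:
        new = [0.0] * (len(dist) + 1)
        for score, prob in enumerate(dist):
            new[score] += prob * (1.0 - p)
            new[score + 1] += prob * p
        dist = new
    return dist


def summed_score_eap(items, nodes, weights):
    """Posterior mean of theta for each summed score (score-to-scale table)."""
    like = [lord_wingersky([two_pl(t, a, b) for a, b in items]) for t in nodes]
    table = []
    for score in range(len(items) + 1):
        num = sum(w * t * lk[score] for t, w, lk in zip(nodes, weights, like))
        den = sum(w * lk[score] for w, lk in zip(weights, like))
        table.append(num / den)
    return table


# Hypothetical (a, b) item parameters and a 41-point grid on [-4, 4].
items = [(1.0, -1.0), (1.2, 0.0), (0.8, 0.5), (1.5, 1.0)]
nodes = [-4.0 + 0.2 * k for k in range(41)]
weights = [math.exp(-0.5 * t * t) for t in nodes]  # unnormalized N(0, 1)

for score, eap in enumerate(summed_score_eap(items, nodes, weights)):
    print(score, round(eap, 3))
```

    The recursion itself costs on the order of the square of the number of items at each grid point; the exponential growth mentioned in the abstract comes from the number of grid points needed when the grid is a direct product over several dimensions.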

  10. Associations between MMPI-2-RF validity scale scores and extra-test measures of personality and psychopathology.

    Science.gov (United States)

    Forbey, Johnathan D; Lee, Tayla T C; Ben-Porath, Yossef S; Arbisi, Paul A; Gartland, Diane

    2013-08-01

    The current study explored associations between two potentially invalidating self-report styles detected by the Validity scales of the Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF), over-reporting and under-reporting, and scores on the MMPI-2-RF substantive, as well as eight collateral self-report measures administered either at the same time or within 1 to 10 days of MMPI-2-RF administration. Analyses were conducted with data provided by college students, male prisoners, and male psychiatric outpatients from a Veterans Administration facility. Results indicated that if either an over- or under-reporting response style was suggested by the MMPI-2-RF Validity scales, scores on the majority of the MMPI-2-RF substantive scales, as well as a number of collateral measures, were significantly affected in all three groups in the expected directions. Test takers who were identified as potentially engaging in an over- or under-reporting response style by the MMPI-2-RF Validity scales appeared to approach extra-test measures similarly regardless of when these measures were administered in relation to the MMPI-2-RF. Limitations and suggestions for future study are discussed.

  11. Including osteoprotegerin and collagen IV in a score-based blood test for liver fibrosis increases diagnostic accuracy.

    Science.gov (United States)

    Bosselut, Nelly; Taibi, Ludmia; Guéchot, Jérôme; Zarski, Jean-Pierre; Sturm, Nathalie; Gelineau, Marie-Christine; Poggi, Bernard; Thoret, Sophie; Lasnier, Elisabeth; Baudin, Bruno; Housset, Chantal; Vaubourdolle, Michel

    2013-01-16

    Noninvasive methods for liver fibrosis evaluation in chronic liver diseases have been recently developed, i.e. transient elastography (Fibroscan™) and blood tests (Fibrometer®, Fibrotest®, and Hepascore®). In this study, we aimed to design a new score in chronic hepatitis C (CHC) by selecting blood markers in a large panel and we compared its diagnostic performance with those of other noninvasive methods. Sixteen blood tests were performed in 306 untreated CHC patients included in a multicenter prospective study (ANRS HC EP 23 Fibrostar) using METAVIR histological fibrosis stage as reference. The new score was constructed by nonlinear regression using the most accurate biomarkers. Five markers (alpha-2-macroglobulin, apolipoprotein-A1, AST, collagen IV and osteoprotegerin) were included in the new function called Coopscore©. Using the Obuchowski Index, Coopscore© shows higher diagnostic performance than Fibrometer®, Fibrotest®, Hepascore® and Fibroscan™ in CHC. Association between Fibroscan™ and Coopscore© might avoid 68% of liver biopsies for the diagnosis of significant fibrosis. Coopscore© provides higher accuracy than other noninvasive methods for the diagnosis of liver fibrosis in CHC. The association of Coopscore© with Fibroscan™ increases its predictive value. Copyright © 2012 Elsevier B.V. All rights reserved.

  12. A test of the effect of advance organizers and reading ability on seventh-grade science achievement

    Science.gov (United States)

    Underhill, Patricia Annette

    The use of advance organizers was first introduced by Ausubel in his theory of meaningful learning. Subsequent research focused on the efficacy of advance organizers. Although earlier research produced inconclusive results, more recent research suggests advance organizers do facilitate recall. However, the bulk of the research focused on older subjects (students in high school and college and adults). Prior research did not consider that a subject's reading ability may affect the effectiveness of an advance organizer. The purposes of this study were to investigate whether (1) an advance organizer facilitates both immediate and delayed recall, (2) the reading ability of students and the type of pre-instructional material they receive affect recall, and (3) reading ability has an effect on recall with younger students. Seventy-five seventh-grade students were divided into three groups. One group received a written organizer, one group received a graphic organizer, and one group received an introductory passage before reading a learning passage. After completing the reading passage, all subjects received an immediate posttest. Fourteen days later, subjects received the same posttest incorporated in an end-of-the-chapter test. Results of the study indicate the following: (1) there was no significant difference in immediate and delayed recall of learning material between students who received a written organizer, a graphic organizer, or an introductory passage, (2) there was a main effect for time of testing and a main effect for reading ability, and (3) there was no interaction between reading ability and the type of pre-instructional material. These findings did not support previous research.

  13. [The ability of drivers to give first aid--testing by questionnaire].

    Science.gov (United States)

    Goniewicz, M

    1998-01-01

    Road accidents have become a serious social problem. The scale and complexity of this problem show clearly that citizens' ability to give first aid needs to improve, which is especially essential in the case of drivers. Special training in how to give first aid at the accident site therefore seems to be of primary importance. The objectives of this paper are to: 1) identify to what extent drivers of motor vehicles are prepared to provide first aid to road accident casualties, 2) evaluate the system for training motorists to give first aid before professional help arrives, 3) identify drivers' views on ways of decreasing the number of road accident fatalities. The questionnaire was given to 560 employees of local government institutions in the city of Lublin, who were either professional or non-professional drivers. The direct method and an anonymous questionnaire were used. The results revealed clearly that very few drivers are well prepared to give proper first aid at the accident site. Regardless of sex, education or driving experience, drivers do not have enough skills to give first aid, and the effect is compounded by various psychological barriers. The drivers questioned shared the opinion that first aid training is badly run. They stressed the poor quality of the training and the fact that it is impossible to acquire the practical skills that may be required in an emergency. Drivers' suggestions for decreasing the number of road accident fatalities included, among others, the following: a compulsory first aid exam in addition to the driving licence exam, and strict enforcement of the law that makes giving first aid mandatory.

  14. Relationship Between Jumping Ability, Agility and Sprint Performance of Elite Young Basketball Players: A Field-Test Approach

    OpenAIRE

    Abbas Asadi

    2016-01-01

    DOI: http://dx.doi.org/10.5007/1980-0037.2016v18n2p177   The purpose of this study was to determine the relationships between sprint, agility and jump performance of elite young basketball players. Sixteen elite national level young male basketball players participated in this study. The jumping ability of each player was determined using countermovement jump (CMJ), and broad long jump (BLJ). The agility T test (TT) and Illinois agility test (IAT) were assessed to determine the agilit...

  15. Abilities of Oropharyngeal pH Tests and Salivary Pepsin Analysis to Discriminate Between Asymptomatic Volunteers and Subjects With Symptoms of Laryngeal Irritation.

    Science.gov (United States)

    Yadlapati, Rena; Adkins, Christopher; Jaiyeola, Diana-Marie; Lidder, Alcina K; Gawron, Andrew J; Tan, Bruce K; Shabeeb, Nadine; Price, Caroline P E; Agrawal, Neelima; Ellenbogen, Michael; Smith, Stephanie S; Bove, Michiel; Pandolfino, John E

    2016-04-01

    It has been a challenge to confirm the association between laryngeal symptoms and physiological reflux disease. We examined the ability of oropharyngeal pH tests (with the Restech Dx-pH system) and salivary pepsin tests (with Peptest) to discriminate between asymptomatic volunteers (controls) and subjects with a combination of laryngeal and reflux symptoms (laryngeal ± reflux). We performed a physician-blinded prospective cohort study of 59 subjects at a single academic institution. Adult volunteers were recruited and separated into 3 groups on the basis of GerdQ and Reflux Symptom Index scores: controls (n = 20), laryngeal symptoms (n = 20), or laryngeal + reflux symptoms (n = 19). Subjects underwent laryngoscopy and oropharyngeal pH tests and submitted saliva samples for analysis of pepsin concentration. Primary outcomes included abnormal acid exposure and composite (RYAN) score for oropharyngeal pH tests and abnormal mean salivary pepsin concentration that was based on normative data. Complete oropharyngeal pH data were available from 53 subjects and complete salivary pepsin data from 35 subjects. We did not observe any significant differences between groups in percent of time spent below pH 4.0, 5.0, 5.5, 6.0, or RYAN scores or percent of subjects with positive results from tests for salivary pepsin (53% vs 40% vs 75%; P = .50, respectively). The laryngeal + reflux group had a significantly higher estimated mean concentration of salivary pepsin (117.9 ± 147.4 ng/mL) than the control group (32.4 ± 41.9 ng/mL) or laryngeal symptom group (7.5 ± 11.2 ng/mL) (P = .01 and P = .04, respectively). By using current normative thresholds, oropharyngeal pH testing and salivary pepsin analysis are not able to distinguish between healthy volunteers and subjects with a combination of laryngeal and reflux symptoms. Copyright © 2016 AGA Institute. Published by Elsevier Inc. All rights reserved.

  16. Polytrauma Defined by the New Berlin Definition: A Validation Test Based on Propensity-Score Matching Approach.

    Science.gov (United States)

    Rau, Cheng-Shyuan; Wu, Shao-Chun; Kuo, Pao-Jen; Chen, Yi-Chun; Chien, Peng-Chen; Hsieh, Hsiao-Yun; Hsieh, Ching-Hua

    2017-09-11

    Background: Polytrauma patients are expected to have a higher risk of mortality than that obtained by the summation of expected mortality owing to their individual injuries. This study was designed to investigate the outcome of patients with polytrauma, which was defined using the new Berlin definition, as cases with an Abbreviated Injury Scale (AIS) ≥ 3 for two or more different body regions and one or more additional variables from five physiologic parameters (hypotension [systolic blood pressure ≤ 90 mmHg], unconsciousness [Glasgow Coma Scale score ≤ 8], acidosis [base excess ≤ -6.0], coagulopathy [partial thromboplastin time ≥ 40 s or international normalized ratio ≥ 1.4], and age [≥70 years]). Methods: We retrieved detailed data on 369 polytrauma patients and 1260 non-polytrauma patients with an overall Injury Severity Score (ISS) ≥ 18 who were hospitalized between 1 January 2009 and 31 December 2015 for the treatment of all traumatic injuries, from the Trauma Registry System at a level I trauma center. Patients with burn injury or incomplete registered data were excluded. Categorical data were compared with two-sided Fisher exact or Pearson chi-square tests. The unpaired Student t-test and the Mann-Whitney U-test were used to analyze normally distributed continuous data and non-normally distributed data, respectively. A propensity-score matched cohort in a 1:1 ratio was allocated using the NCSS software with logistic regression to evaluate the effect of polytrauma on patient outcomes. Results: The polytrauma patients had a significantly higher ISS than non-polytrauma patients (median (interquartile range Q1-Q3), 29 (22-36) vs. 24 (20-25), respectively; p Polytrauma patients had a 1.9-fold higher odds of mortality than non-polytrauma patients (95% CI 1.38-2.49; p polytrauma patients, polytrauma patients had a substantially longer hospital length of stay (LOS). In addition, a higher proportion of polytrauma patients were admitted to the intensive

  17. Relationships between narrative language samples and norm-referenced test scores in language assessments of school-age children.

    Science.gov (United States)

    Danahy Ebert, Kerry; Scott, Cheryl M

    2014-10-01

    Both narrative language samples and norm-referenced language tests can be important components of language assessment for school-age children. The present study explored the relationship between these 2 tools within a group of children referred for language assessment. The study is a retrospective analysis of clinical records from 73 school-age children. Participants had completed an oral narrative language sample and at least one norm-referenced language test. Correlations between microstructural language sample measures and norm-referenced test scores were compared for younger (6- to 8-year-old) and older (9- to 12-year-old) children. Contingency tables were constructed to compare the 2 types of tools, at 2 different cutpoints, in terms of which children were identified as having a language disorder. Correlations between narrative language sample measures and norm-referenced tests were stronger for the younger group than the older group. Within the younger group, the level of language assessed by each measure contributed to associations among measures. Contingency analyses revealed moderate overlap in the children identified by each tool, with agreement affected by the cutpoint used. Narrative language samples may complement norm-referenced tests well, but age combined with narrative task can be expected to influence the nature of the relationship.

  18. Clinical score and rapid antigen detection test to guide antibiotic use for sore throats: randomised controlled trial of PRISM (primary care streptococcal management).

    Science.gov (United States)

    Little, Paul; Hobbs, F D Richard; Moore, Michael; Mant, David; Williamson, Ian; McNulty, Cliodna; Cheng, Ying Edith; Leydon, Geraldine; McManus, Richard; Kelly, Joanne; Barnett, Jane; Glasziou, Paul; Mullee, Mark

    2013-10-10

    To determine the effect of clinical scores that predict streptococcal infection or rapid streptococcal antigen detection tests compared with delayed antibiotic prescribing. Open adaptive pragmatic parallel group randomised controlled trial. Primary care in United Kingdom. Patients aged ≥ 3 with acute sore throat. An internet programme randomised patients to targeted antibiotic use according to: delayed antibiotics (the comparator group for analyses), clinical score, or antigen test used according to clinical score. During the trial a preliminary streptococcal score (score 1, n=1129) was replaced by a more consistent score (score 2, n=631; features: fever during previous 24 hours; purulence; attends rapidly (within three days after onset of symptoms); inflamed tonsils; no cough/coryza (acronym FeverPAIN)). Symptom severity reported by patients on a 7 point Likert scale (mean severity of sore throat/difficulty swallowing for days two to four after the consultation (primary outcome)), duration of symptoms, use of antibiotics. For score 1 there were no significant differences between groups. For score 2, symptom severity was documented in 80% (168/207 (81%) in delayed antibiotics group; 168/211 (80%) in clinical score group; 166/213 (78%) in antigen test group). Reported severity of symptoms was lower in the clinical score group (-0.33, 95% confidence interval -0.64 to -0.02; P=0.04), equivalent to one in three rating sore throat a slight versus moderate problem, with a similar reduction for the antigen test group (-0.30, -0.61 to -0.00; P=0.05). Symptoms rated moderately bad or worse resolved significantly faster in the clinical score group (hazard ratio 1.30, 95% confidence interval 1.03 to 1.63) but not the antigen test group (1.11, 0.88 to 1.40). In the delayed antibiotics group, 75/164 (46%) used antibiotics. Use of antibiotics in the clinical score group (60/161) was 29% lower (adjusted risk ratio 0.71, 95% confidence interval 0.50 to 0.95; P=0.02) and in the
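
    The five features listed for score 2 can be tallied as in the sketch below. The abstract names the features only; awarding one point per feature present, for a total of 0 to 5, is an assumption based on how the FeverPAIN score is commonly described, and the function and parameter names are hypothetical. No prescribing thresholds are implied.

```python
def feverpain_score(fever_last_24h, purulence, attends_within_3_days,
                    inflamed_tonsils, no_cough_or_coryza):
    """Count how many of the five listed features are present.

    Assumption: each feature contributes one point, giving a score of 0-5.
    """
    features = (fever_last_24h, purulence, attends_within_3_days,
                inflamed_tonsils, no_cough_or_coryza)
    return sum(1 for present in features if present)


# Hypothetical patient: fever and purulence, seen within three days,
# tonsils not inflamed, and with a cough.
example = feverpain_score(True, True, True, False, False)
```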

  19. Common Variance Among Three Measures of Nonverbal Cognitive Ability: WISC-R Performance Scale, WJPB-TCA Reasoning Cluster, and Halstead Category Test.

    Science.gov (United States)

    Telzrow, Cathy F.; Harr, Gale A.

    1987-01-01

    Examined the relationships among two psychometric measures of nonverbal cognitive ability - The Wechsler Intelligence Scale for Children-Revised (WISC-R) and the Woodcock-Johnson Psychoeducational Battery-Tests of Cognitive Ability (WJPB-TCA) and a neuropsychological test of abstract reasoning and concept formation (Halstead Category Test) in 25…

  20. Genetic variation of the growth hormone secretagogue receptor gene is associated with alcohol use disorders identification test scores and smoking.

    Science.gov (United States)

    Suchankova, Petra; Nilsson, Staffan; von der Pahlen, Bettina; Santtila, Pekka; Sandnabba, Kenneth; Johansson, Ada; Jern, Patrick; Engel, Jörgen A; Jerlhag, Elisabet

    2016-03-01

    The multifaceted gut-brain peptide ghrelin and its receptor (GHSR-1a) are implicated in mechanisms regulating not only the energy balance but also the reward circuitry. In our pre-clinical models, we have shown that ghrelin increases whereas GHSR-1a antagonists decrease alcohol consumption and the motivation to consume alcohol in rodents. Moreover, ghrelin signaling is required for the rewarding properties of addictive drugs including alcohol and nicotine in rodents. Given the hereditary component underlying addictive behaviors and disorders, we sought to investigate whether single nucleotide polymorphisms (SNPs) located in the pre-proghrelin gene (GHRL) and GHSR-1a gene (GHSR) are associated with alcohol use, measured by the alcohol use disorders identification test (AUDIT) and smoking. Two SNPs located in GHRL, rs4684677 (Gln90Leu) and rs696217 (Leu72Met), and one in GHSR, rs2948694, were genotyped in a subset (n = 4161) of a Finnish population-based cohort, the Genetics of Sexuality and Aggression project. The effect of these SNPs on AUDIT scores and smoking was investigated using linear and logistic regressions, respectively. We found that the minor allele of the rs2948694 SNP was nominally associated with higher AUDIT scores (P = 0.0204, recessive model) and smoking (P = 0.0002, dominant model). Furthermore, post hoc analyses showed that this risk allele was also associated with increased likelihood of having high level of alcohol problems as determined by AUDIT scores ≥ 16 (P = 0.0043, recessive model). These convergent findings lend further support for the hypothesized involvement of ghrelin signaling in addictive disorders. © 2015 Society for the Study of Addiction.

  1. Reliability, precision, and measurement in the context of data from ability tests, surveys, and assessments

    International Nuclear Information System (INIS)

    Fisher, W P Jr; Elbaum, B; Coulter, A

    2010-01-01

    Reliability coefficients indicate the proportion of total variance attributable to differences among measures separated along a quantitative continuum by a testing, survey, or assessment instrument. Reliability is usually considered to be influenced by both the internal consistency of a data set and the number of items, though textbooks and research papers rarely evaluate the extent to which these factors independently affect the data in question. Probabilistic formulations of the requirements for unidimensional measurement separate consistency from error by modelling individual response processes instead of group-level variation. The utility of this separation is illustrated via analyses of small sets of simulated data, and of subsets of data from a 78-item survey of over 2,500 parents of children with disabilities. Measurement reliability ultimately concerns the structural invariance specified in models requiring sufficient statistics, parameter separation, unidimensionality, and other qualities that historically have made quantification simple, practical, and convenient for end users. The paper concludes with suggestions for a research program aimed at focusing measurement research more on the calibration and wide dissemination of tools applicable to individuals, and less on the statistical study of inter-variable relations in large data sets.
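
    The statement that reliability depends on both internal consistency and the number of items can be illustrated with the classical standardized-alpha (Spearman-Brown) relation, sketched below. This is the conventional group-level formula, not the probabilistic measurement models the paper itself advocates; the function name and the input values are hypothetical.

```python
def standardized_alpha(n_items, mean_inter_item_r):
    """Standardized coefficient alpha from the number of items and the
    mean inter-item correlation (Spearman-Brown form)."""
    k, r = n_items, mean_inter_item_r
    return k * r / (1.0 + (k - 1.0) * r)


# Holding consistency fixed (mean r = 0.3, a made-up value), reliability
# rises with test length alone.
by_length = {k: standardized_alpha(k, 0.3) for k in (5, 10, 20, 40)}
```

    Because the two inputs are confounded in a single coefficient, a high value can reflect many weakly related items as easily as a few strongly related ones.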

  2. Relationship Between Jumping Ability, Agility and Sprint Performance of Elite Young Basketball Players: A Field-Test Approach

    Directory of Open Access Journals (Sweden)

    Abbas Asadi

    2016-05-01

    DOI: http://dx.doi.org/10.5007/1980-0037.2016v18n2p177   The purpose of this study was to determine the relationships between sprint, agility and jump performance of elite young basketball players. Sixteen elite national level young male basketball players participated in this study. The jumping ability of each player was determined using countermovement jump (CMJ) and broad long jump (BLJ). The agility T test (TT) and Illinois agility test (IAT) were used to determine agility, and 20-m sprint time was also measured to determine sprint performance. The results of Pearson product-moment correlation analysis indicated a moderate correlation between training age and IAT (r = -0.57; p = 0.021). Strong correlations were found between CMJ and BLJ (r = 0.71; p = 0.002) and between TT and IAT (r = 0.70; p = 0.002). Similarly, 20-m sprint time was strongly correlated with CMJ (r = -0.61; p = 0.011), BLJ (r = -0.76; p = 0.001), TT (r = 0.77; p = 0.001), and IAT (r = 0.68; p = 0.003). In addition, CMJ was strongly correlated with TT (r = -0.60; p = 0.013) and IAT (r = -0.64; p = 0.007), and BLJ was strongly correlated with TT (r = -0.85; p = 0.001) and IAT (r = -0.76; p = 0.001). The findings of the present study indicated significant correlations between sprint and agility, between jumping ability and sprint performance, and between jumping ability and agility performance in basketball players. Therefore, the results suggest that sprint, agility and jumping ability share common physiological and biomechanical determinants.
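
    For readers who want to see how such coefficients are obtained, the sketch below computes a Pearson product-moment correlation and the t statistic usually used to test it. The function names are hypothetical and the study's raw data are not available here, so the only study values used are the reported r = -0.57 and the sample size of 16.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def t_statistic(r, n):
    """t statistic for testing r against zero, with n - 2 degrees of freedom."""
    return r * math.sqrt((n - 2) / (1.0 - r * r))


# Reported correlation between training age and IAT, with 16 players.
t_training_age_iat = t_statistic(-0.57, 16)
```

    With 16 players the test has 14 degrees of freedom, so even the moderate correlations in this record rest on a small sample.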

  3. Test-retest reliability and minimal detectable change scores for sit-to-stand-to-sit tests, the six-minute walk test, the one-leg heel-rise test, and handgrip strength in people undergoing hemodialysis.

    Science.gov (United States)

    Segura-Ortí, Eva; Martínez-Olmos, Francisco José

    2011-08-01

    Determining the relative and absolute reliability of outcomes of physical performance tests for people undergoing hemodialysis is necessary to discriminate between the true effects of exercise interventions and the inherent variability of this cohort. The aims of this study were to assess the relative reliability of sit-to-stand-to-sit tests (the STS-10, which measures the time [in seconds] required to complete 10 full stands from a sitting position, and the STS-60, which measures the number of repetitions achieved in 60 seconds), the Six-Minute Walk Test (6MWT), the one-leg heel-rise test, and the handgrip strength test and to calculate minimal detectable change (MDC) scores in people undergoing hemodialysis. This study was a prospective, nonexperimental investigation. Thirty-nine people undergoing hemodialysis at 2 clinics in Spain were contacted. Study participants performed the STS-10 (n=37), the STS-60 (n=37), and the 6MWT (n=36). At one of the settings, the participants also performed the one-leg heel-rise test (n=21) and the handgrip strength test (n=12) on both the right and the left sides. Participants attended 2 testing sessions 1 to 2 weeks apart. High intraclass correlation coefficients (≥.88) were found for all tests, suggesting good relative reliability. The MDC scores at 90% confidence intervals were as follows: 8.4 seconds for the STS-10, 4 repetitions for the STS-60, 66.3 m for the 6MWT, 3.4 kg for handgrip strength (force-generating capacity), 3.7 repetitions for the one-leg heel-rise test with the right leg, and 5.2 repetitions for the one-leg heel-rise test with the left leg. Limitations: A limited sample of patients was used in this study. The STS-10, STS-60, 6MWT, one-leg heel-rise test, and handgrip strength test are reliable outcome measures. The MDC scores at 90% confidence intervals for these tests will help to determine whether a change is due to error or to an intervention.
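
    The abstract does not say how the MDC scores were computed. A conventional approach, sketched below as an assumption, derives the standard error of measurement (SEM) from the between-subject standard deviation and the intraclass correlation coefficient, then scales it by the z value for the chosen confidence level and by the square root of 2 to account for two measurement occasions. The function names and the input values are hypothetical and are not taken from the study.

```python
import math


def standard_error_of_measurement(sd, icc):
    """SEM = SD * sqrt(1 - ICC)."""
    return sd * math.sqrt(1.0 - icc)


def minimal_detectable_change(sd, icc, z=1.645):
    """MDC = z * SEM * sqrt(2); z = 1.645 corresponds to 90% confidence."""
    return z * standard_error_of_measurement(sd, icc) * math.sqrt(2.0)


# Made-up inputs: a between-subject SD of 10 units and an ICC of 0.75.
sem_example = standard_error_of_measurement(10.0, 0.75)
mdc90_example = minimal_detectable_change(10.0, 0.75)
```

    A measured change smaller than the MDC cannot be distinguished from measurement error at the stated confidence level.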

  4. Use of Automated Scoring in Spoken Language Assessments for Test Takers with Speech Impairments. Research Report. ETS RR-17-42

    Science.gov (United States)

    Loukina, Anastassia; Buzick, Heather

    2017-01-01

    This study is an evaluation of the performance of automated speech scoring for speakers with documented or suspected speech impairments. Given that the use of automated scoring of open-ended spoken responses is relatively nascent and there is little research to date that includes test takers with disabilities, this small exploratory study focuses…

  5. Test-retest reliability of the Battery for the Assessment of Auditory Sensorimotor and Timing Abilities (BAASTA).

    Science.gov (United States)

    Bégel, Valentin; Verga, Laura; Benoit, Charles-Etienne; Kotz, Sonja A; Bella, Simone Dalla

    2018-04-27

    Perceptual and sensorimotor timing skills can be comprehensively assessed with the Battery for the Assessment of Auditory Sensorimotor and Timing Abilities (BAASTA). The battery has been used for testing rhythmic skills in healthy adults and patient populations (e.g., with Parkinson disease), showing sensitivity to timing and rhythm deficits. Here we assessed the test-retest reliability of the BAASTA in 20 healthy adults. Participants were tested twice with the BAASTA, implemented on a tablet interface, with a 2-week interval. They completed 4 perceptual tasks, namely, duration discrimination, anisochrony detection with tones and music, and the Beat Alignment Test (BAT). Moreover, they completed motor tasks via finger tapping, including unpaced and paced tapping with tones and music, synchronization-continuation, and adaptive tapping to a sequence with a tempo change. Despite high variability among individuals, the results showed stable test-retest reliability in most tasks. A slight but significant improvement from test to retest was found in tapping with music, which may reflect a learning effect. In general, the BAASTA was found to be a reliable tool for evaluating timing and rhythm skills. Copyright © 2018 Elsevier Masson SAS. All rights reserved.

  6. Predictive value of grade point average (GPA), Medical College Admission Test (MCAT), internal examinations (Block) and National Board of Medical Examiners (NBME) scores on Medical Council of Canada qualifying examination part I (MCCQE-1) scores.

    Science.gov (United States)

    Roy, Banibrata; Ripstein, Ira; Perry, Kyle; Cohen, Barry

    2016-01-01

    To determine whether the pre-medical Grade Point Average (GPA), Medical College Admission Test (MCAT), Internal examinations (Block) and National Board of Medical Examiners (NBME) scores are correlated with and predict the Medical Council of Canada Qualifying Examination Part I (MCCQE-1) scores. Data from 392 admitted students in the graduating classes of 2010-2013 at University of Manitoba (UofM), College of Medicine was considered. Pearson's correlation to assess the strength of the relationship, multiple linear regression to estimate MCCQE-1 score and stepwise linear regression to investigate the amount of variance were employed. Complete data from 367 (94%) students were studied. The MCCQE-1 had a moderate-to-large positive correlation with NBME scores and Block scores but a low correlation with GPA and MCAT scores. The multiple linear regression model gives a good estimate of the MCCQE-1 (R² = 0.604). Stepwise regression analysis demonstrated that 59.2% of the variation in the MCCQE-1 was accounted for by the NBME, but only 1.9% by the Block exams, and negligible variation came from the GPA and the MCAT. Amongst all the examinations used at UofM, the NBME is most closely correlated with MCCQE-1.

  7. Linear-rank testing of a non-binary, responder-analysis, efficacy score to evaluate pharmacotherapies for substance use disorders.

    Science.gov (United States)

    Holmes, Tyson H; Li, Shou-Hua; McCann, David J

    2016-11-23

    The design of pharmacological trials for management of substance use disorders is shifting toward outcomes of successful individual-level behavior (abstinence or no heavy use). While binary success/failure analyses are common, McCann and Li (CNS Neurosci Ther 2012; 18: 414-418) introduced "number of beyond-threshold weeks of success" (NOBWOS) scores to avoid dichotomized outcomes. NOBWOS scoring employs an efficacy "hurdle" with values reflecting duration of success. Here, we evaluate NOBWOS scores rigorously. Formal analysis of the mathematical structure of NOBWOS scores is followed by simulation studies spanning diverse conditions to assess the operating characteristics of five linear-rank tests on NOBWOS scores. Simulations include assessment of Fisher's exact test applied to the hurdle component. On average, statistical power was approximately equal for the five linear-rank tests. Under none of the conditions examined did Fisher's exact test exhibit greater statistical power than any of the linear-rank tests. These linear-rank tests provide good Type I and Type II error control for comparing distributions of NOBWOS scores between groups (e.g. active vs. placebo). All methods were applied to re-analyses of data from four clinical trials of differing lengths and substances of abuse. These linear-rank tests agreed across all trials in rejecting (or not) their null (equality of distributions) at ≤ 0.05. © The Author(s) 2016.

  8. Raising test scores vs. teaching higher order thinking (HOT): senior science teachers' views on how several concurrent policies affect classroom practices

    Science.gov (United States)

    Zohar, Anat; Alboher Agmon, Vered

    2018-04-01

    This study investigates how senior science teachers viewed the effects of a Raising Test Scores policy and its implementation on instruction of higher order thinking (HOT), and on teaching thinking to students with low academic achievements.

  9. Testing the ability of a proposed geotechnical based method to evaluate the liquefaction potential analysis subjected to earthquake vibrations

    Science.gov (United States)

    Abbaszadeh Shahri, A.; Behzadafshar, K.; Esfandiyari, B.; Rajablou, R.

    2010-12-01

    During earthquakes, a number of earth dams have been severely damaged or have suffered major displacements as a result of liquefaction, so computer modeling can provide a reliable tool for predicting the response of a dam foundation to earthquakes. Such modeling can be used in the design of new dams or in safety assessments of existing ones. In this paper, on the basis of field and laboratory tests and a combination of several software packages, a seismic geotechnical analysis procedure is proposed and verified by comparison with computer model tests and with field and laboratory experience. Verification or validation of the analyses relies on the ability of the applied computer codes. To check the efficiency of the proposed framework, the procedure is applied, using the Silakhor earthquake (2006, Ms 6.1), to the Korzan earth dam in Hamedan Province, Iran, to analyze liquefaction and estimate the safety factor. The critical point of this study is the design and development by the authors of a computer code named “Abbas Converter”, which has a graphical user interface and operates as a logic connector function that can compute and model soil profiles. The results confirm the ability of the generated code to evaluate soil behavior under earthquake excitation. The code also facilitates this kind of study more than previous approaches and overcomes the problems encountered.

  10. Can the ability to adapt to exercise be considered a talent-and if so, can we test for it?

    Science.gov (United States)

    Pickering, Craig; Kiely, John

    2017-11-29

    Talent identification (TI) is a popular and hugely important topic within sports performance, with an ever-increasing amount of resources dedicated to unveiling the next sporting star. However, at present, most TI processes appear to select high-performing individuals at the present point in time, as opposed to identifying those individuals with the greatest capacity to improve. This represents a potential inefficiency within the TI process, reducing its effectiveness. In this article, we discuss whether the ability to adapt favorably, and with a large magnitude, to physical training can be considered a talent, testing it against proposed criteria. We also discuss whether, if such an ability can be considered a talent, being able to test for it as part of the TI process would be advantageous. Given that such a capacity is partially heritable, driven by genetic variation between individuals that mediate the adaptive response, we also explore whether the information gained from genetic profiling can be used to identify those with the greatest capacity to improve. Although there are some ethical hurdles which must be considered, the use of genetic information to identify those individuals with the greatest capacity appears to hold promise and may improve both the efficiency and effectiveness of contemporary TI programmes.

  11. 'Mechanical restraint-confounders, risk, alliance score': testing the clinical validity of a new risk assessment instrument.

    Science.gov (United States)

    Deichmann Nielsen, Lea; Bech, Per; Hounsgaard, Lise; Alkier Gildberg, Frederik

    2017-08-01

    Unstructured risk assessment, as well as confounders (underlying reasons for the patient's risk behaviour and alliance), risk behaviour, and parameters of alliance, have been identified as factors that prolong the duration of mechanical restraint among forensic mental health inpatients. To clinically validate a new, structured short-term risk assessment instrument called the Mechanical Restraint-Confounders, Risk, Alliance Score (MR-CRAS), with the intended purpose of supporting the clinicians' observation and assessment of the patient's readiness to be released from mechanical restraint. The content and layout of MR-CRAS and its user manual were evaluated using face validation by forensic mental health clinicians, content validation by an expert panel, and pilot testing within two, closed forensic mental health inpatient units. The three sub-scales (Confounders, Risk, and a parameter of Alliance) showed excellent content validity. The clinical validations also showed that MR-CRAS was perceived and experienced as a comprehensible, relevant, comprehensive, and useable risk assessment instrument. MR-CRAS contains 18 clinically valid items, and the instrument can be used to support the clinical decision-making regarding the possibility of releasing the patient from mechanical restraint. The present three studies have clinically validated a short MR-CRAS scale that is currently being psychometrically tested in a larger study.

  12. The ability of modified star excursion balance test to differentiate between women athletes with and without chronic ankle instability

    Directory of Open Access Journals (Sweden)

    Asma Razeghi

    2016-05-01

    Full Text Available The Star Excursion Balance Test (SEBT) is a functional clinical test that is widely used to assess dynamic balance in patients with ankle injuries. Since the ability of this test to detect impairments between athletes with and without chronic ankle instability (CAI) is not clear, the aim of the present study was to determine whether the modified SEBT could detect reach deficits in patients with unilateral CAI. A convenience sample of thirty elite and sub-elite women athletes was selected and assigned to two groups: a CAI group (mean ± SD: age 25 ± 3.5 years; height 1.68 ± 0.09 m; weight 62.7 ± 7.3 kg) and healthy controls (mean ± SD: age 26 ± 4.2 years; height 1.69 ± 0.05 m; weight 62.7 ± 7.3 kg). Dynamic balance was measured with the modified SEBT on both limbs of each participant. The independent-samples t-test was used for both between-group and within-group inter-limb comparisons. There was no significant difference between the two groups in any direction of the modified SEBT for either limb. No significant inter-limb differences were observed within either group. The modified SEBT may not be sensitive enough to differentiate between athletes with and without CAI. Other factors such as ankle range of motion, muscle strength and pain intensity should be considered for better interpretation of the SEBT results.

  13. Level of intrauterine cocaine exposure and neuropsychological test scores in preadolescence: subtle effects on auditory attention and narrative memory.

    Science.gov (United States)

    Beeghly, Marjorie; Rose-Jacobs, Ruth; Martin, Brett M; Cabral, Howard J; Heeren, Timothy C; Frank, Deborah A

    2014-01-01

    Neuropsychological processes such as attention and memory contribute to children's higher-level cognitive and language functioning and predict academic achievement. The goal of this analysis was to evaluate whether level of intrauterine cocaine exposure (IUCE) alters multiple aspects of preadolescents' neuropsychological functioning assessed using a single age-referenced instrument, the NEPSY: A Developmental Neuropsychological Assessment (NEPSY) (Korkman et al., 1998), after controlling for relevant covariates. Participants included 137 term 9.5-year-old children from low-income urban backgrounds (51% male, 90% African American/Caribbean) from an ongoing prospective longitudinal study. Level of IUCE was assessed in the newborn period using infant meconium and maternal report. 52% of the children had IUCE (65% with lighter IUCE, and 35% with heavier IUCE), and 48% were unexposed. Infants with Fetal Alcohol Syndrome, HIV seropositivity, or intrauterine exposure to illicit substances other than cocaine and marijuana were excluded. At the 9.5-year follow-up visit, trained examiners masked to IUCE and background variables evaluated children's neuropsychological functioning using the NEPSY. The association between level of IUCE and NEPSY outcomes was evaluated in a series of linear regressions controlling for intrauterine exposure to other substances and relevant child, caregiver, and demographic variables. Results indicated that level of IUCE was associated with lower scores on the Auditory Attention and Narrative Memory tasks, both of which require auditory information processing and sustained attention for successful performance. However, results did not follow the expected ordinal, dose-dependent pattern. Children's neuropsychological test scores were also altered by a variety of other biological and psychosocial factors. Copyright © 2014 Elsevier Inc. All rights reserved.

  14. Level of Intrauterine Cocaine Exposure and Neuropsychological Test Scores in Preadolescence: Subtle Effects on Auditory Attention and Narrative Memory

    Science.gov (United States)

    Beeghly, Marjorie; Rose-Jacobs, Ruth; Martin, Brett M.; Cabral, Howard J.; Heeren, Timothy C.; Frank, Deborah A.

    2014-01-01

    Neuropsychological processes such as attention and memory contribute to children's higher-level cognitive and language functioning and predict academic achievement. The goal of this analysis was to evaluate whether level of intrauterine cocaine exposure (IUCE) alters multiple aspects of preadolescents' neuropsychological functioning assessed using a single age-referenced instrument, the NEPSY: A Developmental Neuropsychological Assessment (NEPSY) [71], after controlling for relevant covariates. Participants included 137 term 9.5-year-old children from low-income urban backgrounds (51% male, 90% African American/Caribbean) from an ongoing prospective longitudinal study. Level of IUCE was assessed in the newborn period using infant meconium and maternal report. 52% of the children had IUCE (65% with lighter IUCE, and 35% with heavier IUCE), and 48% were unexposed. Infants with Fetal Alcohol Syndrome, HIV seropositivity, or intrauterine exposure to illicit substances other than cocaine and marijuana were excluded. At the 9.5-year follow-up visit, trained examiners masked to IUCE and background variables evaluated children's neuropsychological functioning using the NEPSY. The association between level of IUCE and NEPSY outcomes was evaluated in a series of linear regressions controlling for intrauterine exposure to other substances and relevant child, caregiver, and demographic variables. Results indicated that level of IUCE was associated with lower scores on the Auditory Attention and Narrative Memory tasks, both of which require auditory information processing and sustained attention for successful performance. However, results did not follow the expected ordinal, dose-dependent pattern. Children's neuropsychological test scores were also altered by a variety of other biological and psychosocial factors. PMID:24978115

  15. A single-centre cohort study of National Early Warning Score (NEWS) and near patient testing in acute medical admissions.

    Science.gov (United States)

    Abbott, Tom E F; Torrance, Hew D T; Cron, Nicholas; Vaid, Nidhi; Emmanuel, Julian

    2016-11-01

    The utility of an early warning score may be improved when used with near patient testing. However, this has not yet been investigated for the National Early Warning Score (NEWS). We hypothesised that the combination of NEWS and blood gas variables (lactate, glucose or base excess) was more strongly associated with clinical outcome than NEWS alone. This was a prospective cohort study of adult medical admissions to a single centre over 20 days. Blood gas results and physiological observations were recorded at admission. NEWS was calculated retrospectively and combined with the biomarkers in multivariable logistic regression models. The primary outcome was a composite of mortality or critical care escalation within 2 days of hospital admission. The secondary outcome was hospital length of stay. After accounting for missing data, 15 patients out of 322 (4.7%) died or were escalated to the critical care unit. The median length of stay was 4 (IQR 7) days. When combined with lactate or base excess, NEWS was associated with the primary outcome (OR 1.18, p=0.01 and OR 1.13, p=0.03). However, NEWS alone was more strongly associated with the primary outcome measure (OR 1.46, pglucose was not associated with the primary outcome. Neither NEWS nor any combination of NEWS and a biomarker was associated with hospital length of stay. Admission NEWS is more strongly associated with death or critical care unit admission within 2 days of hospital admission than combinations of NEWS and blood-gas derived biomarkers. Copyright © 2016 European Federation of Internal Medicine. Published by Elsevier B.V. All rights reserved.

  16. Student Perceptions of Sectional CT/MRI Use in Teaching Veterinary Anatomy and the Correlation with Visual Spatial Ability: A Student Survey and Mental Rotations Test.

    Science.gov (United States)

    Delisser, Peter J; Carwardine, Darren

    2017-11-29

    Diagnostic imaging technology is becoming more advanced and widely available to veterinary patients with the growing popularity of veterinary-specific computed tomography (CT) and magnetic resonance imaging (MRI). Veterinary students must, therefore, be familiar with these technologies and understand the importance of sound anatomic knowledge for interpretation of the resultant images. Anatomy teaching relies heavily on visual perception of structures and their function. In addition, visual spatial ability (VSA) positively correlates with anatomy test scores. We sought to assess the impact of including more diagnostic imaging, particularly CT/MRI, in the teaching of veterinary anatomy on the students' perceived level of usefulness and ease of understanding content. Finally, we investigated survey answers' relationship to the students' inherent baseline VSA, measured by a standard Mental Rotations Test. Students viewed diagnostic imaging as a useful inclusion that provided clear links to clinical relevance, thus improving the students' perceived benefits in its use. Use of CT and MRI images was not viewed as more beneficial, more relevant, or more useful than the use of radiographs. Furthermore, students felt that the usefulness of CT/MRI inclusion was mitigated by the lack of prior formal instruction on the basics of CT/MRI image generation and interpretation. To be of significantly greater use, addition of learning resources labeling relevant anatomy in tomographical images would improve utility of this novel teaching resource. The present study failed to find any correlation between student perceptions of diagnostic imaging in anatomy teaching and their VSA.

  17. An Argument against Using Standardized Test Scores for Placement of International Undergraduate Students in English as a Second Language (ESL) Courses

    Science.gov (United States)

    Kokhan, Kateryna

    2013-01-01

    Development and administration of institutional ESL placement tests require a great deal of financial and human resources. Due to a steady increase in the number of international students studying in the United States, some US universities have started to consider using standardized test scores for ESL placement. The English Placement Test (EPT)…

  18. Towards reporting standards for neuropsychological study results: A proposal to minimize communication errors with standardized qualitative descriptors for normalized test scores.

    Science.gov (United States)

    Schoenberg, Mike R; Rum, Ruba S

    2017-11-01

    Rapid, clear and efficient communication of neuropsychological results is essential to benefit patient care. Errors in communication are a leading cause of medical errors; nevertheless, there remains a lack of consistency in how neuropsychological scores are communicated. A major limitation in the communication of neuropsychological results is the inconsistent use of qualitative descriptors for standardized test scores and the use of vague terminology. A PubMed search from 1 Jan 2007 to 1 Aug 2016 was conducted to identify guidelines or consensus statements for the description and reporting of qualitative terms to communicate neuropsychological test scores. The review found the use of confusing and overlapping terms to describe various ranges of percentile standardized test scores. In response, we propose a simplified set of qualitative descriptors for normalized test scores (Q-Simple) as a means to reduce errors in communicating test results. The Q-Simple qualitative terms are: 'very superior', 'superior', 'high average', 'average', 'low average', 'borderline' and 'abnormal/impaired'. A case example illustrates the proposed Q-Simple qualitative classification system to communicate neuropsychological results for neurosurgical planning. The Q-Simple qualitative descriptor system is intended as a means to improve and standardize communication of standardized neuropsychological test scores. Research is needed to further evaluate neuropsychological communication errors. Conveying the clinical implications of neuropsychological results in a manner that minimizes risk for communication errors is a quintessential component of evidence-based practice. Copyright © 2017 Elsevier B.V. All rights reserved.

  19. Quantification of Emphysema with a Three-Dimensional Chest CT Scan: Correlation with the Visual Emphysema Scoring on Chest CT, Pulmonary Function Tests and Dyspnea Severity

    Energy Technology Data Exchange (ETDEWEB)

    Park, Hyun Jeong; Hwang, Jung Hwa [Dept. of Radiology, Soonchunhyang University Seoul Hospital, Seoul (Korea, Republic of)

    2011-09-15

    We wanted to prospectively evaluate the correlation of the quantification of emphysema using 3D CT densitometry with the visual emphysema score, pulmonary function tests (PFT) and the dyspnea score in patients with chronic obstructive pulmonary disease (COPD). Non-enhanced chest CT with 3D reconstruction was performed in 28 men with COPD (age 54-88 years). With histogram analysis, the total lung volume, mean lung density and proportion of low attenuation lung volume below predetermined thresholds were measured. The CT parameters were compared with the visual emphysema score, the PFT and the dyspnea score. A low attenuation lung volume below -950 HU was well correlated with the DLco and FEV1/FVC. A low attenuation lung volume below -950 HU and -930 HU was correlated with the visual emphysema score. A low attenuation lung volume below -950 HU was correlated with the dyspnea score, although the correlations between the other CT parameters and the dyspnea score were not significant. Objective quantification of emphysema using 3D CT densitometry was correlated with the visual emphysema score. A low attenuation lung volume below -950 HU was correlated with the DLco, the FEV1/FVC and the dyspnea score.
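
    The histogram measures named in this abstract (mean lung density and the proportion of low attenuation lung volume below a threshold) can be illustrated with a minimal sketch. It assumes the segmented lung is available as a flat list of voxel values in Hounsfield units and that all voxels have equal volume; the function names and the sample values are hypothetical and are not taken from the study.

```python
from statistics import mean


def low_attenuation_fraction(hu_values, threshold_hu=-950):
    """Fraction of lung voxels whose attenuation is strictly below threshold_hu.

    Assumes every voxel has the same volume, so the voxel fraction equals
    the volume fraction. The strict "<" comparison is an assumption.
    """
    values = list(hu_values)
    if not values:
        raise ValueError("no voxels supplied")
    below = sum(1 for v in values if v < threshold_hu)
    return below / len(values)


def mean_lung_density(hu_values):
    """Mean attenuation (HU) over all supplied lung voxels."""
    values = list(hu_values)
    if not values:
        raise ValueError("no voxels supplied")
    return mean(values)


# Hypothetical voxel values, for illustration only.
sample_voxels = [-1000, -960, -940, -900]
fraction_950 = low_attenuation_fraction(sample_voxels, -950)
fraction_930 = low_attenuation_fraction(sample_voxels, -930)
```

    Evaluating the same voxel list at both -950 HU and -930 HU mirrors the two thresholds mentioned in the abstract; real densitometry software would first segment the lungs and may weight voxels by their physical size.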

  20. Quantification of Emphysema with a Three-Dimensional Chest CT Scan: Correlation with the Visual Emphysema Scoring on Chest CT, Pulmonary Function Tests and Dyspnea Severity

    International Nuclear Information System (INIS)

    Park, Hyun Jeong; Hwang, Jung Hwa

    2011-01-01

    We wanted to prospectively evaluate the correlation of the quantification of emphysema using 3D CT densitometry with the visual emphysema score, pulmonary function tests (PFT) and the dyspnea score in patients with chronic obstructive pulmonary disease (COPD). Non-enhanced chest CT with 3D reconstruction was performed in 28 men with COPD (age 54-88 years). With histogram analysis, the total lung volume, mean lung density and proportion of low attenuation lung volume below predetermined thresholds were measured. The CT parameters were compared with the visual emphysema score, the PFT and the dyspnea score. A low attenuation lung volume below -950 HU was well correlated with the DLco and FEV1/FVC. A low attenuation lung volume below -950 HU and -930 HU was correlated with the visual emphysema score. A low attenuation lung volume below -950 HU was correlated with the dyspnea score, although the correlations between the other CT parameters and the dyspnea score were not significant. Objective quantification of emphysema using 3D CT densitometry was correlated with the visual emphysema score. A low attenuation lung volume below -950 HU was correlated with the DLco, the FEV1/FVC and the dyspnea score.

  1. Reformulation of the Children's Eating Attitudes Test (ChEAT): factor structure and scoring method in a non-clinical population.

    Science.gov (United States)

    Anton, S D; Han, H; Newton, R L; Martin, C K; York-Crowe, E; Stewart, T M; Williamson, D A

    2006-12-01

    The primary aims of this study were to empirically test the factor structure of the Children's Eating Attitudes Test (ChEAT) through both exploratory and confirmatory factor analyses and to interpret the factor structure of the ChEAT within the context of a new scoring method. The ChEAT was administered to 728 children in the 2nd through 6th grades (from five schools) at two different time points. Exactly half the students were male and half were female. To the best of our knowledge, this is the first study to empirically test the merits of an alternative 6-point scoring system as compared to the traditionally used 4-point scoring system. With the new scoring procedure, the skewness for all factor scores decreased, which resulted in increased variance in the item scores, as well as the total ChEAT score. Since the internal consistency of two factors in a recently proposed model was not acceptable (ChEAT reported by previous investigations. Intercorrelations among the factors suggested three higher order constructs. These findings indicate that the ChEAT subscales may be sufficiently stable to allow use in non-clinical samples of children.

  2. Development and pilot testing of a questionnaire to determine the ability and willingness of health personnel accompanying perinatal bereavement

    Directory of Open Access Journals (Sweden)

    Mª José Domínguez Santarén

    2013-01-01

    Full Text Available Introduction. The care that parents receive around the time of a loss has a huge impact on their perception of what happened and on their ability to cope. Good care cannot remove the pain and devastation that the loss of a pregnancy or the death of a baby can bring, but it can promote healing. Methodology. Creation and pilot study of a questionnaire to determine the ability and willingness to provide perinatal bereavement support among staff in the hospitalization and delivery room services in Zaragoza and Jaca who care for couples experiencing a perinatal death. Statistical analysis. A qualitative analysis was made of the difficulties and limitations that staff face in providing this support. Psychometric tests were conducted to determine the reliability and validity of the questionnaire by calculating Cronbach's alpha and the intraclass correlation coefficient (ICC). For the analysis of construct validity, we performed a principal components factor analysis (PCFA) with Varimax rotation. Results. The qualitative analysis of open-ended responses indicates a lack of knowledge about this type of mourning and of social and communication tools, which often precludes effective accompaniment. We obtained a Cronbach's alpha of 0.835 for the overall questionnaire, which indicates high internal consistency or coherence among the items, and a relatively high ICC, which indicates good stability over time (p < 0.001). With appropriate modifications, the questionnaire could assess the ability and willingness of workers.
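
    For readers unfamiliar with the reliability statistic reported here, the following is a minimal sketch of how Cronbach's alpha is computed from item-level responses, using the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name and the toy responses are hypothetical; the questionnaire data from the study are not reproduced.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Uses sample variances (n - 1 denominator) for both the items and the
    per-respondent total scores.
    """
    if len(responses) < 2:
        raise ValueError("need at least two respondents")
    k = len(responses[0])
    if k < 2 or any(len(row) != k for row in responses):
        raise ValueError("need at least two items and equal-length rows")
    # Transpose so that each tuple holds every respondent's score on one item.
    items = list(zip(*responses))
    sum_item_variances = sum(variance(item) for item in items)
    total_variance = variance([sum(row) for row in responses])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")
    return k / (k - 1) * (1 - sum_item_variances / total_variance)


# Hypothetical responses from three people to a two-item scale.
toy_responses = [[1, 2], [2, 1], [3, 3]]
toy_alpha = cronbach_alpha(toy_responses)
```

    Values closer to 1 indicate that the items vary together; a figure such as the 0.835 reported above is conventionally read as high internal consistency.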

  3. Parental expectations, physical punishment, and violence among adolescents who score positive on a psychosocial screening test in primary care.

    Science.gov (United States)

    Ohene, Sally-Ann; Ireland, Marjorie; McNeely, Clea; Borowsky, Iris Wagman

    2006-02-01

    We sought to examine the relationship between perceived and stated parental expectations regarding adolescents' use of violence, parental use of physical punishment as discipline, and young adolescents' violence-related attitudes and involvement. Surveys were completed by 134 youth and their parents attending 8 pediatric practices. All youth were 10 to 15 years of age and had scored positive on a psychosocial screening test. Multivariate analyses revealed that perceived parental disapproval of the use of violence was associated with a more prosocial attitude toward interpersonal peer violence and a decreased likelihood of physical fighting by the youth. Parental report of whether they would advise their child to use violence in a conflict situation (stated parental expectations) was not associated with the adolescents' attitudes toward interpersonal peer violence, intentions to fight, physical fighting, bullying, or violence victimization. Parental use of corporal punishment as a disciplining method was inversely associated with a prosocial attitude toward interpersonal peer violence among the youth and positively correlated with youths' intentions to fight and fighting, bullying, and violence victimization. Perceived parental disapproval of the use of violence may be an important protective factor against youth involvement in violence, and parental use of physical punishment is associated with both violence perpetration and victimization among youth. Parents should be encouraged to clearly communicate to their children how to resolve conflicts without resorting to violence and to model these skills themselves by avoiding the use of physical punishment.

  4. Correlation between the concentration of serum polychlorinated biphenyls (PCBs) in pregnant cynomolgus monkeys and their offspring's behavioral scores in eye-contact test and finger maze learning test

    Energy Technology Data Exchange (ETDEWEB)

    Negishi, T. [Aoyama Gakuin Univ., Kanagawa (Japan); Takasuga, T. [Shimadzu Techno-Research Inc., Kyoto (Japan); Kawasaki, K. [Hoshi Univ., Tokyo (Japan); Kuroda, Y. [CREST Japan Science and Technology Corp., Saitama (Japan); Yoshikawa, Y. [The Univ. of Tokyo (Japan)

    2004-09-15

    A recent review suggested that pre- or perinatal exposure of developing fetuses to dioxins, widespread environmental contaminants such as polychlorinated biphenyls (PCBs), induces irreversible abnormalities in the functions of the central nervous system (CNS) in humans. These chemicals can be transferred to the fetus and neonate transplacentally and lactationally in rhesus monkeys. Several studies have also reported adverse effects of PCBs on CNS development and on behavior in rodents and monkeys. In the present study, we show preliminary data on the correlation between the serum concentrations of PCBs in pregnant cynomolgus monkeys (Macaca fascicularis) and the scores of their offspring on two behavioral tests, the eye-contact test and the four-step finger maze test, which evaluate awareness of a human observer and learning ability, respectively. This experimental surveillance system using non-human primates would be useful for predicting the risk of PCB exposure in human fetuses because of the similarities between cynomolgus monkeys and humans with regard to reproduction, developmental parameters, and other features.

  5. What Specific Science Abilities and Skills Are Romanian Students Developing during Primary Education? A Comparison with the Abilities Tested by the TIMSS 2011 Inquiry

    Science.gov (United States)

    Ciascai, Liliana; Dulama, Maria-Eliza

    2013-01-01

    The results of Romanian students at international comparative TIMSS and PISA tests have constantly proven to be unsatisfactory. The present paper aims at analyzing the school syllabi "Mathematics and Environment exploration", "Environmental Education" and "Natural Sciences" studied during primary education in Romania…

  6. Test for antioxidant ability by scavenging long-lived mutagenic radicals in mammalian cells and by blood test with intentional radicals: an application of gallic acid

    International Nuclear Information System (INIS)

    Kumagai, Jun; Kawaura, Tomoko; Miyazaki, Tetsuo; Prost, Michel; Prost, Emmanuelle; Watanabe, Masami; Quetin-Leclercq, J.Joeelle

    2003-01-01

    The antioxidant ability of gallic acid (GA) is determined both by electron spin resonance measurement of long-lived radicals produced in γ-ray irradiated Syrian golden hamster embryo cells with GA and by hemolysis measurement with GA when blood cells are exposed to radicals. The scavenging properties of GA are determined by the reaction rate constant with long-lived mutagenic radicals in the cells, while the blood test allows analysis of the global effects of this compound: radical scavenger + metal ion chelator + regeneration of intra- and extra-cellular antioxidant

  7. Beta-Test Data On An Assessment Of Textbook Problem Solving Ability: An Argument For Right/Wrong Grading?

    Science.gov (United States)

    Cummings, Karen; Marx, Jeffrey D.

    2010-10-01

    We have developed an assessment of students' ability to solve standard textbook style problems and are currently engaged in the validation and revision process. The assessment covers the topics of force and motion, conservation of momentum and conservation of energy at a level consistent with most calculus-based, introductory physics courses. This tool is discussed in more detail in an accompanying paper by Marx and Cummings. [1] Here we present preliminary beta-test data collected at four schools during the 2009/2010 academic year. Data include both pre- and post-instruction results for introductory physics courses as well as results for physics majors in later years. In addition, we present evidence that right/wrong grading may well be a perfectly acceptable grading procedure for a course-level assessment of this type.

  8. Improving personality facet scores with multidimensional computer adaptive testing: an illustration with the Neo Pi-R

    NARCIS (Netherlands)

    Makransky, Guido; Mortensen, Erik Lykke; Glas, Cornelis A.W.

    2013-01-01

    Narrowly defined personality facet scores are commonly reported and used for making decisions in clinical and organizational settings. Although these facets are typically related, scoring is usually carried out for a single facet at a time. This method can be ineffective and time consuming when

  9. Acute effects of two different initial heart rates on testing the Repeated Sprint Ability in young soccer players.

    Science.gov (United States)

    Ruscello, B; Briotti, G; Tozzo, N; Partipilo, F; Taraborelli, M; Zeppetella, A; Padulo, J; D'Ottavio, S

    2015-10-01

    The aim of this paper was to investigate the acute effects of two different initial heart rate intensities when testing repeated sprint ability (RSA) performance in young soccer players. Since there are many kinds of pre-match warm-ups, we chose to take as an absolute indicator of internal load the heart rate reached at the end of two different warm-up protocols (60 vs. 90% HRmax) and to compare the respective RSA performances. The RSA tests were performed on fifteen male soccer players (age: 17.9±1.5 years) with two sets of ten shuttle-sprints (15+15 m) with a 1:3 exercise to rest ratio, on different days (randomized order) with different HR% (60 & 90% HRmax). In order to compare the different sprint performances a Fatigue Index (FI%) was computed, while the blood lactate concentrations (BLa-) were measured before and after testing, to compare metabolic demand. Significant differences among trials within each set (Psoccer player operates during a real match. This background may be partially reproduced by warm-up protocols that, by duration and metabolic commitment, can conveniently reproduce the physiological conditions encountered in a real game (e.g. HRmax≈85-95%; BLa->4 mmol/L(-1)).

  10. Investigating the Value of Section Scores for the "TOEFL iBT"® Test. "TOEFL iBT"® Research Report. TOEFL iBT-21. ETS Research Report RR-13-35

    Science.gov (United States)

    Sawaki, Yasuyo; Sinharay, Sandip

    2013-01-01

    This study investigates the value of reporting the reading, listening, speaking, and writing section scores for the "TOEFL iBT"® test, focusing on 4 related aspects of the psychometric quality of the TOEFL iBT section scores: reliability of the section scores, dimensionality of the test, presence of distinct score profiles, and the…

  11. Teachers' perceptions of the effectiveness of an urban health sciences curriculum in closing the Black-White test score gap: A participatory case study

    Science.gov (United States)

    Prince, Joan Marie

    1999-12-01

    Over the past years, progress in Black academic achievement, particularly in the area of science, has generally slowed or ceased. According to the 1994 NAEP assessment, twelfth-grade Black students are performing at the level of White eighth-grade students in the discipline of science (Department of Education, 1996). These students, in their last year of required schooling, are about to graduate, yet they lag at least four years behind their white counterparts in science achievement. Despite the establishment and implementation of numerous science intervention programs, Black students still suffer from a disparate gap in standardized test score achievement. The purpose of this research is to investigate teachers' perceptions of the effectiveness of an urban sciences intervention tool that was designed to assist in narrowing the Black-White science academic achievement gap. Specifically, what factors affect teachers' personal sense of instructional efficacy, and how does this translate into their outcome expectancy for student academic success? A multiple-case, replicative design, grounded in descriptive theory, was selected for the study. Multiple sources of evidence were queried to provide robust findings. These sources included a validated health sciences self-efficacy instrument, an interview protocol, a classroom observation, and a review of archival material that included case study participants' personnel files and meeting minutes. A cross-comparative analytic approach was selected for interpretation (Yin, 1994). Findings indicate that teachers attribute the success or failure of educational intervention tools in closing the Black-White test score gap to a variety of internal and external factors. These factors included a perceived lack of both monetary and personal support by the school leadership, as well as a perceived lack of parental involvement which impacted negatively on student achievement patterns. 
The case study participants displayed a depressed

  12. Unexplained Graft Dysfunction after Heart Transplantation—Role of Novel Molecular Expression Test Score and QTc-Interval: A Case Report

    Directory of Open Access Journals (Sweden)

    Khurram Shahzad

    2010-01-01

    Full Text Available In the current era of immunosuppressive medications there is an increased observed incidence of graft dysfunction in the absence of known histological criteria of rejection after heart transplantation. A noninvasive molecular expression diagnostic test was developed and validated to rule out histological acute cellular rejection. In this paper we present, for the first time, the longitudinal pattern of changes in this novel diagnostic test score along with the QTc-interval in a patient who was admitted with unexplained graft dysfunction. The patient presented with graft failure with negative findings on all known criteria of rejection, including acute cellular rejection, antibody-mediated rejection and cardiac allograft vasculopathy. The molecular expression test score showed a gradual increase and the QTc-interval showed gradual prolongation with the gradual decline in graft function. This paper exemplifies that in patients presenting with unexplained graft dysfunction, GEP test score and QTc-interval correlate with the changes in the graft function.

  13. Why is Mini-Mental state examination performance correlated with estimated premorbid cognitive ability?

    Science.gov (United States)

    Dykiert, D; Der, G; Starr, J M; Deary, I J

    2016-09-01

    Tests requiring the pronunciation of irregular words are used to estimate premorbid cognitive ability in patients with clinical diagnoses, and prior cognitive ability in normal ageing. However, scores on these word-reading tests correlate with scores on the Mini-Mental State Examination (MMSE), a widely used screening test for possible cognitive pathology. This study aimed to test whether the word-reading tests' correlations with MMSE scores in healthy older people are explained by childhood IQ or education. Wechsler Test of Adult Reading (WTAR), National Adult Reading Test (NART), MMSE scores and information about education were obtained from 1024 70-year-olds, for whom childhood intelligence test scores were available. WTAR and NART were positively correlated with the MMSE (r ≈ 0.40, p < 0.001). The shared variance of WTAR and NART with MMSE was significantly attenuated by ~70% after controlling for childhood intelligence test scores. Education explained little additional variance in the association between the reading tests and the MMSE. MMSE, which is often used to index cognitive impairment, is associated with prior cognitive ability. MMSE score is related to scores on WTAR and NART largely due to their shared association with prior ability. Obtained MMSE scores should be interpreted in the context of prior ability (or WTAR/NART score as its proxy).
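
    The arithmetic behind the reported attenuation can be sketched as follows. This is a minimal illustration, assuming that "shared variance" means the squared correlation and that the reported r ≈ 0.40 and ~70% attenuation are applied directly; the function names are hypothetical and not taken from the study.

```python
def shared_variance(r: float) -> float:
    """Proportion of variance two measures share, taken as r squared."""
    return r * r


def attenuated_shared_variance(r: float, attenuation: float) -> float:
    """Shared variance remaining after removing a fraction `attenuation` of it."""
    if not 0.0 <= attenuation <= 1.0:
        raise ValueError("attenuation must be between 0 and 1")
    return shared_variance(r) * (1.0 - attenuation)


# Values reported in the abstract (both approximate).
r_reading_mmse = 0.40
before = shared_variance(r_reading_mmse)                  # about 16% of variance
after = attenuated_shared_variance(r_reading_mmse, 0.70)  # about 5% of variance
print(f"shared variance before: {before:.3f}, after control: {after:.3f}")
```

    Under these assumptions, roughly 16% of variance is shared before controlling for childhood intelligence and roughly 5% afterwards.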

  14. Automated Scoring of Short-Answer Open-Ended GRE® Subject Test Items. ETS GRE® Board Research Report No. 04-02. ETS RR-08-20

    Science.gov (United States)

    Attali, Yigal; Powers, Don; Freedman, Marshall; Harrison, Marissa; Obetz, Susan

    2008-01-01

    This report describes the development, administration, and scoring of open-ended variants of GRE® Subject Test items in biology and psychology. These questions were administered in a Web-based experiment to registered examinees of the respective Subject Tests. The questions required a short answer of 1-3 sentences, and responses were automatically…

  15. A Case for Adjusting Subjectively Rated Scores in the Advanced Placement Tests. Program Statistics Research. Technical Report No. 94-5.

    Science.gov (United States)

    Longford, Nicholas T.

    A case is presented for adjusting the scores for free response items in the Advanced Placement (AP) tests. Using information about the rating process from the reliability studies, administrations of the AP test for three subject areas, psychology, computer science, and English language and composition, are analyzed. In the reliability studies, 299…

  16. Fracture predictive ability of physical performance tests and history of falls in elderly women: a 10-year prospective study.

    Science.gov (United States)

    Wihlborg, A; Englund, M; Åkesson, K; Gerdhem, P

    2015-08-01

    In a large cohort of elderly women followed for 10 years, we found that balance, gait speed, and self-reported history of fall independently predicted fracture. These clinical risk factors are easily evaluated and therefore advantageous in a clinical setting. They would improve fracture risk assessment and thereby also fracture prevention. The aim of this study was to identify additional risk factors for osteoporosis-related fracture by investigating the fracture predictive ability of physical performance tests and self-reported history of falls. In the population-based Osteoporosis Prospective Risk Assessment study (OPRA), 1044 women were recruited at the age of 75 and followed for 10 years. At inclusion, knee extension force, standing balance, gait speed, and bone mineral density (BMD) were examined. Falls during the year before investigation were assessed by questionnaire. Cox proportional hazards regression analysis was used to determine fracture hazard ratios (HR) with BMD, history of fracture, BMI, smoking habits, bisphosphonate, vitamin D, glucocorticoid, and alcohol use as covariates. Continuous variables were standardized and HR shown for each standard deviation change. Of all women, 427 (41%) sustained at least one fracture during the 10-year follow-up. Failing the balance test had an HR of 1.98 (1.18-3.32) for hip fracture. Each standard deviation decrease in gait speed was associated with an HR of 1.37 (1.14-1.64) for hip fracture. Previous fall had an HR of 1.30 (1.03-1.65) for any fracture; 1.39 (1.08-1.79) for any osteoporosis-related fracture; and 1.60 (1.03-2.48) for distal forearm fracture. Knee extension force did not show fracture predictability. The balance test, gait speed test, and self-reported history of fall all hold independent fracture predictability. Consideration of these clinical risk factors for fracture would improve the fracture risk assessment and subsequently also fracture prevention.

  17. Use of Verbal Descriptors, Thermal Scores and Electrical Pulp Testing Scores as Predictors of Tooth Pain Before and After Application of Benzocaine Gels into Cavities of Teeth with Pulpitis

    Science.gov (United States)

    Gangarosa, Louis P.; Ciarlone, Alfred E.; Neaverth, Elmer J.; Johnston, Carey A.; Snowden, J. Douglas; Thompson, William O.

    1989-01-01

    A double-blind pilot study was conducted on 27 consenting human volunteers who had irreversible pulpitis associated with persistent toothache pain from open carious lesions. Formulations tested contained either 0, 10%, or 20% benzocaine and were identified only by a numbered code. Before the experiment started, a small amount of a known 5% benzocaine gel was placed for 1 minute on the tongue of each patient to assure a sensation of numbness within the oral cavity. Then the test tooth was washed with a gentle stream of warm water and dried with gauze. A randomly selected test medication was placed into the open cavity and around the gingival margins for 5 minutes. Pre- and posttreatment tests were conducted at the following timed intervals: 0, 5, 15, 30, 45, 60, 75 and 90 minutes. The tests included degree of pain (rated: 0 = none, 1 = mild, 2 = moderate, 3 = severe); electrical pulp testing (EPT) by a modified, voltage-ramping instrument; and ice water testing (0.5 mL directed quickly onto sound enamel of the tooth and rated: 0 to 4, with 4 being intolerable). After testing, or when pain returned to baseline, endodontic procedures were performed. There was a significant increase (p pulpitis and control teeth, 3) there were no correlations between direction of EPT scores and pain relief, 4) cold water testing was a good predictor of whether or not a tooth had pulpitis, and 5) changes in cold water testing scores after treatment could not be correlated to relief of pain according to verbal descriptors. The effectiveness of benzocaine in relieving toothache pain verifies previous studies; however, a difference between 10% and 20% benzocaine could not be demonstrated probably because of two factors: 1) the present experiment had a small sample size, and 2) there was no direct measurement of duration of local anesthesia. PMID:2490060

  18. Expanding Talent Search Procedures by Including Measures of Spatial Ability: CTY's Spatial Test Battery

    Science.gov (United States)

    Stumpf, Heinrich; Mills, Carol J.; Brody, Linda E.; Baxley, Philip G.

    2013-01-01

    The importance of spatial ability for success in a variety of domains, particularly in science, technology, engineering, and mathematics (STEM), is widely acknowledged. Yet, students with high spatial ability are rarely identified, as Talent Searches for academically talented students focus on identifying high mathematical and verbal abilities.…

  19. Increasing the reliability of the fluid/crystallized difference score from the Kaufman Adolescent and Adult Intelligence Test with reliable component analysis.

    Science.gov (United States)

    Caruso, J C

    2001-06-01

    The unreliability of difference scores is a well documented phenomenon in the social sciences and has led researchers and practitioners to interpret differences cautiously, if at all. In the case of the Kaufman Adult and Adolescent Intelligence Test (KAIT), the unreliability of the difference between the Fluid IQ and the Crystallized IQ is due to the high correlation between the two scales. The consequences of the lack of precision with which differences are identified are wide confidence intervals and unpowerful significance tests (i.e., large differences are required to be declared statistically significant). Reliable component analysis (RCA) was performed on the subtests of the KAIT in order to address these problems. RCA is a new data reduction technique that results in uncorrelated component scores with maximum proportions of reliable variance. Results indicate that the scores defined by RCA have discriminant and convergent validity (with respect to the equally weighted scores) and that differences between the scores, derived from a single testing session, were more reliable than differences derived from equal weighting for each age group (11-14 years, 15-34 years, 35-85+ years). This reliability advantage results in narrower confidence intervals around difference scores and smaller differences required for statistical significance.
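
    Why a high correlation between two scales makes their difference unreliable can be illustrated with the classical test theory formula for the reliability of a difference score. This is a sketch of that general formula under the assumption of equal score variances; it is not the reliable component analysis (RCA) procedure described in the abstract, and the reliabilities and correlations below are hypothetical, not KAIT values.

```python
def difference_score_reliability(r_xx: float, r_yy: float, r_xy: float) -> float:
    """Classical reliability of the difference X - Y, assuming equal variances.

    r_xx, r_yy: reliabilities of the two scales.
    r_xy: correlation between the two scales.
    """
    if r_xy >= 1.0:
        raise ValueError("r_xy must be below 1")
    return (0.5 * (r_xx + r_yy) - r_xy) / (1.0 - r_xy)


# Hypothetical values: two highly reliable scales (0.95 each).
moderately_correlated = difference_score_reliability(0.95, 0.95, 0.50)
highly_correlated = difference_score_reliability(0.95, 0.95, 0.75)
print(f"r_xy = 0.50 -> {moderately_correlated:.2f}")
print(f"r_xy = 0.75 -> {highly_correlated:.2f}")
```

    With the same scale reliabilities, raising the inter-scale correlation lowers the reliability of the difference, which is the problem the abstract attributes to the Fluid and Crystallized IQ scales.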

  20. Visual scoring of non-cavitated caries lesions and clinical trial efficiency, testing xylitol in caries active adults

    Science.gov (United States)

    Brown, JP; Amaechi, BT; Bader, JD; Gilbert, GH; Makhija, SK; Lozano-Pineda, J; Leo, MC; Chen, C; Vollmer, WM

    2013-01-01

    Objectives: To better understand the effectiveness of xylitol in caries prevention in adults, and to attempt improved clinical trial efficiency. Methods: As part of the Xylitol for Adult Caries Trial (X-ACT), non-cavitated and cavitated caries lesions were assessed in subjects who were experiencing the disease. The trial was a test of the effectiveness of 5 grams/day of xylitol, consumed by dissolving in the mouth five 1 gram lozenges spaced across each day, compared with a sucralose placebo. For this analysis, seeking trial efficiency, 538 subjects aged 21–80, with complete data for four dental examinations, were selected from the 691 randomized into the three-year trial, conducted at three sites. Acceptable inter- and intra-examiner reliability before and during the trial was quantified using the kappa statistic. Results: The mean annualized non-cavitated plus cavitated lesion transition scores in coronal and root surfaces, from sound to carious, favoured xylitol over placebo during the three cumulative periods of 12, 24, and 33 months, but these clinically and statistically non-significant differences declined in magnitude over time. Restricting the present assessment to those subjects with a higher baseline lifetime caries experience showed possible but inconsistent benefit. Conclusions: There was no clear and clinically relevant preventive effect of xylitol on caries in adults with adequate fluoride exposure when non-cavitated plus cavitated lesions were assessed. This conformed to the X-ACT trial result assessing cavitated lesions. Including non-cavitated lesion assessment in this full-scale, placebo-controlled, multisite, randomized, double-blinded clinical trial in adults experiencing dental caries did not achieve added trial efficiency or demonstrate practical benefit of xylitol. Trial Registration: ClinicalTrials.Gov NCT00393055. PMID:24205951

  1. Visual scoring of non cavitated caries lesions and clinical trial efficiency, testing xylitol in caries-active adults.

    Science.gov (United States)

    Brown, John P; Amaechi, Bennett T; Bader, James D; Gilbert, Gregg H; Makhija, Sonia K; Lozano-Pineda, Juanita; Leo, Michael C; Chen, Chuhe; Vollmer, William M

    2014-06-01

    To better understand the effectiveness of xylitol in caries prevention in adults and to attempt improved clinical trial efficiency. As part of the Xylitol for Adult Caries Trial (X-ACT), non cavitated and cavitated caries lesions were assessed in subjects who were experiencing the disease. The trial was a test of the effectiveness of 5 g/day of xylitol, consumed by dissolving in the mouth five 1 g lozenges spaced across each day, compared with a sucralose placebo. For this analysis, seeking trial efficiency, 538 subjects aged 21-80, with complete data for four dental examinations, were selected from the 691 randomized into the 3-year trial, conducted at three sites. Acceptable inter- and intra-examiner reliability before and during the trial was quantified using the kappa statistic. The mean annualized noncavitated plus cavitated lesion transition scores in coronal and root surfaces, from sound to carious favoured xylitol over placebo, during the three cumulative periods of 12, 24, and 33 months, but these clinically and statistically nonsignificant differences declined in magnitude over time. Restricting the present assessment to those subjects with a higher baseline lifetime caries experience showed possible but inconsistent benefit. There was no clear and clinically relevant preventive effect of xylitol on caries in adults with adequate fluoride exposure when non cavitated plus cavitated lesions were assessed. This conformed to the X-ACT trial result assessing cavitated lesions. Including non cavitated lesion assessment in this full-scale, placebo-controlled, multisite, randomized, double-blinded clinical trial in adults experiencing dental caries did not achieve added trial efficiency or demonstrate practical benefit of xylitol. ClinicalTrials.Gov NCT00393055. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  2. Survey of Opinions on the Primacy of "g" and Social Consequences of Ability Testing: A Comparison of Expert and Non-Expert Views

    Science.gov (United States)

    Reeve, Charlie L.; Charles, Jennifer E.

    2008-01-01

    The current study examines the views of experts in the science of mental abilities about the primacy and uniqueness of "g" and the social implications of ability testing, and compares their responses to the views of a group of non-expert psychologists. Results indicate expert consensus that "g" is an important, non-trivial determinant (or at least…

  3. Reverse shock index multiplied by Glasgow Coma Scale score (rSIG) is a simple measure with high discriminant ability for mortality risk in trauma patients: an analysis of the Japan Trauma Data Bank.

    Science.gov (United States)

    Kimura, Akio; Tanaka, Noriko

    2018-04-11

    The shock index (SI), defined as heart rate (HR) divided by systolic blood pressure (SBP), is reported to be a more sensitive marker of shock than traditional vital signs alone. In previous literature, use of the reverse shock index (rSI), taken as SBP divided by HR, is recommended instead of SI for hospital triage. Among traumatized patients aged > 55 years, SI multiplied by age (SIA) might provide better prediction of early post-injury mortality. Separately, the Glasgow Coma Scale (GCS) score has been shown to be a very strong predictor. When considering these points together, rSI multiplied by GCS score (rSIG) or rSIG divided by age (rSIG/A) could provide even better prediction of in-hospital mortality. This retrospective, multicenter study used data from 168,517 patients registered in the Japan Trauma Data Bank for the period 2006-2015. We calculated areas under receiver operating characteristic curves (AUROCs) to measure the discriminant ability by comparing those of SI (or rSI), SIA, rSIG, and rSIG/A for in-hospital mortality and for 24-h blood transfusion. The highest AUROC, 0.901 (0.894-0.908) for in-hospital mortality in younger patients (aged < 55 years), was seen for rSIG. In older patients (aged ≥ 55 years), the AUROC of rSIG/A, 0.845 (0.840-0.850), was highest for in-hospital mortality. However, the difference between rSIG and rSIG/A was slight and did not seem to be clinically important. rSIG also had the highest AUROC of 0.745 (0.741-0.749) for 24-h blood transfusion. rSIG ((SBP/HR) × GCS score) is easy to calculate without the need for additional information, charts or equipment, and can be a more reliable triage tool for identifying risk levels in trauma patients.
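
    The indices defined in this abstract can be computed as in the sketch below. The formulas follow the definitions given in the text (SI = HR/SBP, rSI = SBP/HR, SIA = SI × age, rSIG = rSI × GCS, rSIG/A = rSIG / age); the function names, the input checks, and the example patient are illustrative assumptions, and no triage cut-off is implied because the abstract does not report one.

```python
def shock_index(hr: float, sbp: float) -> float:
    """SI: heart rate divided by systolic blood pressure."""
    return hr / sbp


def reverse_shock_index(hr: float, sbp: float) -> float:
    """rSI: systolic blood pressure divided by heart rate."""
    return sbp / hr


def shock_index_age(hr: float, sbp: float, age: float) -> float:
    """SIA: shock index multiplied by age."""
    return shock_index(hr, sbp) * age


def rsig(hr: float, sbp: float, gcs: int) -> float:
    """rSIG: reverse shock index multiplied by the Glasgow Coma Scale score."""
    if not 3 <= gcs <= 15:
        raise ValueError("GCS score must be between 3 and 15")
    return reverse_shock_index(hr, sbp) * gcs


def rsig_over_age(hr: float, sbp: float, gcs: int, age: float) -> float:
    """rSIG/A: rSIG divided by age."""
    return rsig(hr, sbp, gcs) / age


# Hypothetical patient: HR 120 bpm, SBP 90 mmHg, GCS 10, age 60 years.
example_rsig = rsig(120, 90, 10)
example_rsig_a = rsig_over_age(120, 90, 10, 60)
print(example_rsig, example_rsig_a)
```

    Because rSI and GCS both fall as the patient's condition worsens, lower rSIG values correspond to higher risk in this formulation.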

  4. Testing the hypothesis on cognitive evolution of modern humans' learning ability: current status of past-climatic approaches.

    Science.gov (United States)

    Yoneda, Minoru; Abe-Ouchi, Ayako; Kawahata, Hodaka; Yokoyama, Yusuke; Oguchi, Takashi

    2014-05-01

    The impact of climate change on human evolution has been an important and debated topic for many years. Since 2010, we have been involved in a general joint project entitled "Replacement of Neanderthal by Modern Humans: Testing Evolutional Models of Learning", which is based on a theoretical prediction that cognitive ability related to individual and social learning divided the fates of ancient humans in the very unstable Late Pleistocene climate. This model predicts that human populations which experienced a series of environmental changes would have a higher rate of individual learners, and detailed reconstructions of global climate change based on ice cores and stalagmites have reported frequent and drastic changes. However, we want to understand the difference between anatomically modern humans, who survived, and the other archaic, extinct humans, including European Neanderthals and Asian Denisovans. For this purpose, globally synchronized change is not informative; what is required is the regional difference in the amplitude and impact of climate change. Hence, we invited a geophysicist using a Global Circulation Model to reconstruct the climatic distribution and temporal change on a continental scale. At the same time, geochemists and geographers are constructing a database of local climate changes recorded in different proxies. Finally, archaeologists and anthropologists have tried to interpret the emergence and disappearance of human species in Europe and Asia on the reconstructed past climate maps using tools such as the eco-cultural niche model. Our project will show the regional differences in climate change and related archaeological events, and their impact on the evolution of the learning ability of modern humans.

  5. Screening Cellulolytic Bacteria from the Digestive Tract Snail (Achatina fulica) and Test the Ability of Cellulase Activity

    Directory of Open Access Journals (Sweden)

    Wijanarka Wijanarka

    2016-11-01

    Full Text Available This study measured the level of cellulase produced by bacteria isolated from the digestive tract of the snail (Achatina fulica). Bacteria were isolated based on their ability to grow on CMC media. The purpose of this study was to determine the cellulase activity of the cellulolytic bacteria. Six bacterial isolates were identified as cellulolytic: KE-B1, KE-B2, KE-B3, KE-B4, KE-B5, and KE-B6. Isolate KE-B6 was the best isolate and was subsequently grown on production media to determine its growth pattern and enzyme activity. Cell growth was measured by inoculating a 22-hour-old starter culture into liquid CMC production medium. Cellulase activity was measured by the DNS method. The results showed that the new bacterial isolate KE-B6 had the highest activity, 0.4539 U/mL, with a growth rate (µ) of 0.377/hour and a generation time (g) of 1.84 hours. The cellulase-producing bacteria identified in this research are expected to be easy, inexpensive, and efficient to use. The enzyme can be used as a biolytic enzyme and is expected to replace expensive commercial enzymes. The biolytic enzyme can be applied to strain improvement (protoplast fusion). How to Cite: Wijanarka, W., Kusdiyantini, E. & Parman, S. (2016). Screening Cellulolytic Bacteria from the Digestive Tract Snail (Achatina fulica) and Test the Ability of Cellulase Activity. Biosaintifika: Journal of Biology & Biology Education, 8(3), 386-392.
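
    The reported growth rate and generation time are consistent with the standard exponential-growth relation g = ln 2 / µ. The sketch below checks that relation; it assumes µ is a specific growth rate per hour, and the function name is illustrative rather than taken from the paper.

```python
import math


def generation_time(mu_per_hour: float) -> float:
    """Doubling time in hours for exponential growth with specific rate mu."""
    if mu_per_hour <= 0:
        raise ValueError("growth rate must be positive")
    return math.log(2) / mu_per_hour


# Growth rate reported for isolate KE-B6.
g_hours = generation_time(0.377)
print(f"generation time: {g_hours:.2f} hours")
```

    Rounded to two decimals this reproduces the 1.84 hours given in the abstract.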

  6. Test del PWC 170 adaptado para determinar la capacidad de trabajo especial en beisbolistas escolares / Adapted PWC 170 test to determine special work abilities in school baseball players

    Directory of Open Access Journals (Sweden)

    Sadiel López-Leal

    2016-07-01

    Full Text Available The PWC 170 test was adapted, following Karpman's method modified for batting, in order to determine special work capacity during this exercise in baseball players of the 9-10 years category from the Julio Antonio Mella sports area, Camagüey city, Cuba. Fifteen athletes were studied. The methods used were a literature review on the process of monitoring athlete preparation and on sports medical control, interviews with coaches, and class observation. Application of the test yielded a high correlation between both experiences and good stability, as well as positive assessments of all indicators by the coaches. The test is therefore considered effective for objectively evaluating special work capacity in school baseball players from Camagüey.

  7. Emotion Recognition Ability Test Using JACFEE Photos: A Validity/Reliability Study of a War Veterans' Sample and Their Offspring.

    Science.gov (United States)

    Castro-Vale, Ivone; Severo, Milton; Carvalho, Davide; Mota-Cardoso, Rui

    2015-01-01

    Emotion recognition is very important for social interaction. Several mental disorders influence facial emotion recognition. War veterans and their offspring are subject to an increased risk of developing psychopathology. Emotion recognition is an important aspect that needs to be addressed in this population. To our knowledge, no test exists that is validated for use with war veterans and their offspring. The current study aimed to validate the JACFEE photo set to study facial emotion recognition in war veterans and their offspring. The JACFEE photo set was presented to 135 participants, comprised of 62 male war veterans and 73 war veterans' offspring. The participants identified the facial emotion presented from amongst the possible seven emotions that were tested for: anger, contempt, disgust, fear, happiness, sadness, and surprise. A loglinear model was used to evaluate whether the agreement between the intended and the chosen emotions was higher than the expected. Overall agreement between chosen and intended emotions was 76.3% (Cohen kappa = 0.72). The agreement ranged from 63% (sadness expressions) to 91% (happiness expressions). The reliability by emotion ranged from 0.617 to 0.843 and the overall JACFEE photo set Cronbach alpha was 0.911. The offspring showed higher agreement when compared with the veterans (RR: 41.52 vs 12.12, p < 0.001), which confirms the construct validity of the test. The JACFEE set of photos showed good validity and reliability indices, which makes it an adequate instrument for researching emotion recognition ability in the study sample of war veterans and their respective offspring.
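
    The relation between the reported overall agreement (76.3%) and Cohen's kappa (0.72) can be sketched with the usual chance-corrected agreement formula, kappa = (p_o - p_e) / (1 - p_e). The chance agreement used here, 1/7 for seven equally likely emotions, is an assumption made for illustration; the study would have derived it from the observed response frequencies, which the abstract does not give.

```python
def cohen_kappa(p_observed: float, p_expected: float) -> float:
    """Chance-corrected agreement from observed and expected agreement."""
    if p_expected >= 1.0:
        raise ValueError("expected agreement must be below 1")
    return (p_observed - p_expected) / (1.0 - p_expected)


# Observed agreement from the abstract; chance agreement is an assumed
# uniform 1/7 across the seven candidate emotions.
kappa = cohen_kappa(0.763, 1.0 / 7.0)
print(f"kappa = {kappa:.2f}")
```

    Under this uniform-chance assumption the result rounds to the reported 0.72.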

  8. Emotion Recognition Ability Test Using JACFEE Photos: A Validity/Reliability Study of a War Veterans' Sample and Their Offspring.

    Directory of Open Access Journals (Sweden)

    Ivone Castro-Vale

    Full Text Available Emotion recognition is very important for social interaction. Several mental disorders influence facial emotion recognition. War veterans and their offspring are subject to an increased risk of developing psychopathology. Emotion recognition is an important aspect that needs to be addressed in this population. To our knowledge, no test exists that is validated for use with war veterans and their offspring. The current study aimed to validate the JACFEE photo set to study facial emotion recognition in war veterans and their offspring. The JACFEE photo set was presented to 135 participants, comprised of 62 male war veterans and 73 war veterans' offspring. The participants identified the facial emotion presented from amongst the possible seven emotions that were tested for: anger, contempt, disgust, fear, happiness, sadness, and surprise. A loglinear model was used to evaluate whether the agreement between the intended and the chosen emotions was higher than the expected. Overall agreement between chosen and intended emotions was 76.3% (Cohen kappa = 0.72). The agreement ranged from 63% (sadness expressions) to 91% (happiness expressions). The reliability by emotion ranged from 0.617 to 0.843 and the overall JACFEE photo set Cronbach alpha was 0.911. The offspring showed higher agreement when compared with the veterans (RR: 41.52 vs 12.12, p < 0.001), which confirms the construct validity of the test. The JACFEE set of photos showed good validity and reliability indices, which makes it an adequate instrument for researching emotion recognition ability in the study sample of war veterans and their respective offspring.

  9. A Comparison of Item Selection Procedures Using Different Ability Estimation Methods in Computerized Adaptive Testing Based on the Generalized Partial Credit Model

    Science.gov (United States)

    Ho, Tsung-Han

    2010-01-01

    Computerized adaptive testing (CAT) provides a highly efficient alternative to the paper-and-pencil test. By selecting items that match examinees' ability levels, CAT not only can shorten test length and administration time but it can also increase measurement precision and reduce measurement error. In CAT, maximum information (MI) is the most…
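
    Maximum information (MI) item selection under the generalized partial credit model can be sketched as below. This is a generic illustration, not the procedure evaluated in the study: it assumes the common GPCM parameterisation with a scaling constant of 1, uses the standard result that item information equals the squared discrimination times the variance of the category score, and all item parameters and names are hypothetical.

```python
import math


def gpcm_probs(theta, a, steps):
    """Category probabilities for categories 0..m of one GPCM item.

    a: discrimination; steps: step difficulties b_1..b_m.
    """
    z = [0.0]
    for b in steps:
        z.append(z[-1] + a * (theta - b))
    top = max(z)  # subtract the maximum for numerical stability
    expz = [math.exp(v - top) for v in z]
    total = sum(expz)
    return [v / total for v in expz]


def gpcm_information(theta, a, steps):
    """Item information: a^2 times the variance of the category score."""
    probs = gpcm_probs(theta, a, steps)
    mean = sum(k * p for k, p in enumerate(probs))
    second = sum(k * k * p for k, p in enumerate(probs))
    return a * a * (second - mean * mean)


def select_max_info(theta, items, administered):
    """Index of the unused item with the most information at theta."""
    candidates = [i for i in range(len(items)) if i not in administered]
    return max(candidates, key=lambda i: gpcm_information(theta, *items[i]))


# Hypothetical item bank: (discrimination, step difficulties).
bank = [(0.5, [0.0]), (1.5, [0.0, 0.5]), (1.0, [3.0])]
first_pick = select_max_info(0.0, bank, administered=set())
second_pick = select_max_info(0.0, bank, administered={first_pick})
print(first_pick, second_pick)
```

    In an adaptive test, theta would be re-estimated after each response and the selection repeated with the updated estimate.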

  10. Prediction of mortality using on-line, self-reported health data: empirical test of the RealAge score.

    Directory of Open Access Journals (Sweden)

    William R Hobbs

    Full Text Available OBJECTIVE: We validate an online, personalized mortality risk measure called "RealAge" assigned to 30 million individuals over the past 10 years. METHODS: 188,698 RealAge survey respondents were linked to California Department of Public Health death records using a one-way cryptographic hash of first name, last name, and date of birth. 1,046 were identified as deceased. We used Cox proportional hazards models and receiver operating characteristic (ROC) curves to estimate the relative scales and predictive accuracies of chronological age, the RealAge score, and the Framingham ATP-III score for hard coronary heart disease (HCHD) in this data. To address concerns about selection and to examine possible heterogeneity, we compared the results by time to death at registration, underlying cause of death, and relative health among users. RESULTS: The RealAge score is accurately scaled (hazard ratios: age 1.076; RealAge-age 1.084) and more accurate than chronological age (age c-statistic: 0.748; RealAge c-statistic: 0.847) in predicting mortality from hard coronary heart disease following survey completion. The score is more accurate than the Framingham ATP-III score for hard coronary heart disease (c-statistic: 0.814), perhaps because self-reported cholesterol levels are relatively uninformative in the RealAge user sample. RealAge predicts deaths from malignant neoplasms, heart disease, and external causes. The score does not predict malignant neoplasm deaths when restricted to users with no smoking history, no prior cancer diagnosis, and no indicated health interest in cancer (p-value 0.820). CONCLUSION: The RealAge score is a valid measure of mortality risk in its user population.
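
    Record linkage through a one-way hash of identifying fields, as described in the METHODS, can be sketched as follows. The abstract does not state the hash algorithm, the field normalisation, or whether a salt was used, so SHA-256, the lowercase/trim normalisation, the field separator, and all names and sample records here are assumptions for illustration only.

```python
import hashlib


def record_key(first_name: str, last_name: str, date_of_birth: str) -> str:
    """One-way hash of normalised identifying fields (SHA-256 assumed)."""
    parts = [first_name, last_name, date_of_birth]
    normalised = "|".join(p.strip().lower() for p in parts)
    return hashlib.sha256(normalised.encode("utf-8")).hexdigest()


def link_records(survey_rows, death_rows):
    """Return survey rows whose hashed key also appears in the death records."""
    death_keys = {record_key(*row) for row in death_rows}
    return [row for row in survey_rows if record_key(*row) in death_keys]


# Hypothetical records: (first name, last name, date of birth).
survey = [("Ada", "Lovelace", "1815-12-10"), ("Alan", "Turing", "1912-06-23")]
deaths = [(" ada ", "LOVELACE", "1815-12-10")]
matched = link_records(survey, deaths)
print(matched)
```

    Only the hashed keys need to be exchanged between data holders, so names and birth dates are never compared in clear text.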

  11. The Implementation of Role-Playing Model in Principles of Finance Accounting Learning to Improve Students’ Enjoyment and Students’ Test Scores

    Directory of Open Access Journals (Sweden)

    L. Saptono

    2010-01-01

    Full Text Available This study is classroom action research. Its goal was to improve students' enjoyment level and test scores by implementing the role-playing method. The research was conducted in the Accounting Education Study Program of Sanata Dharma University in the odd semester of the 2010/2011 academic year. The participants were divided into two classes: the first class received the treatment, while the second class was the control class. The results showed an improvement in students' enjoyment level and test scores in the class that implemented the role-playing method.

  12. Might the Rorschach be a projective test after all? Social projection of an undesired trait alters Rorschach Oral Dependency scores.

    Science.gov (United States)

    Bornstein, Robert F

    2007-06-01

    The degree to which projection plays a role in Rorschach (Rorschach, 1921/1942) responding remains controversial, in part because extant data have yielded inconclusive results. In this investigation, I examined the impact of social projection on Rorschach Oral Dependency (ROD) scores using methods adapted from social cognition research. In Study 1, I prescreened 85 college students (40 women and 45 men) with the ROD scale and a widely used self-report measure of dependency, the Interpersonal Dependency Inventory (IDI; Hirschfeld et al., 1977). Results show that informing participants who scored low on the IDI that they were in fact highly dependent led to significant increases in ROD scores; I did not obtain parallel ROD increases for participants who scored high on the IDI or for participants who received low-dependent feedback. In Study 2, I examined a separate sample of 80 prescreened college students (40 women and 40 men) and showed that providing low self-report participants an opportunity to attribute dependency to a fictional target person prior to Rorschach responding attenuated the impact of high-dependent feedback on ROD scores. These results suggest that projection played a role in at least one domain of Rorschach responding. I discuss theoretical, clinical, and empirical implications of these results.

  13. Walking ability in patients with glioblastoma: prognostic value of the Berg Balance Scale and the 10 meter walk test.

    Science.gov (United States)

    Liljehult, Monique Mesot; Buus, Lise; Liljehult, Jacob; Rasmussen, Birthe Krogh

    2017-11-01

    Primary brain tumors frequently cause considerable functional impairments and the survival time when diagnosed with glioblastoma is 14.6 months. The aim of this study was to examine whether baseline postural control and walking ability in patients with glioblastoma could predict long-term walking ability and 1-year mortality. Data were gathered from prospective recordings in a brain cancer database supplemented by retrospective review of electronic patient records. We included 109 patients with glioblastoma, 47 women and 62 men, with a mean age of 65 years. At admission, 84 patients were tested with the Berg Balance Scale and 57 were tested with the 10 meter walk test. Binary logistic regression analysis showed no statistical significance in favour of the 10 meter walk test. The Berg Balance Scale showed an ability to predict walking ability 4-8 months after admission. The risk of dying within a year was 6.9 times higher in patients who lost their ability to walk within 4-8 months of the first admission. This study showed that the Berg Balance Scale has some ability to predict the loss of walking ability 4-8 months after admission. This could be an important indicator for pinpointing patients most in need of more intensive specialized neurorehabilitation efforts early in the disease course.

  14. Genetic Performance and General Combining Ability of Oil Palm Deli dura x AVROS pisifera Tested on Inland Soils

    Science.gov (United States)

    Noh, A.; Rafii, M. Y.; Saleh, G.; Kushairi, A.; Latif, M. A.

    2012-01-01

    The performance of 11 oil palm AVROS (Algemene Vereniging van Rubberplanters ter Oostkust van Sumatra) pisiferas was evaluated based on their 40 dura x pisifera (DxP) progenies tested on inland soils, predominantly of Serdang Series. Fresh fruit bunch (FFB) yield of each pisifera ranged from 121.93 to 143.9 kg palm−1 yr−1 with a trial mean of 131.62 kg palm−1 yr−1. Analysis of variance (ANOVA) showed low genetic variability among pisifera parents for most of the characters, indicating uniformity of the pisifera population. This was anticipated, as the AVROS pisiferas were derived from a small population and were inbred materials. However, some of the pisiferas showed good general combining ability (GCA) for certain important economic traits. Three pisiferas (P1 (0.174/247), P3 (0.174/498), P11 (0.182/308)) were identified as having good GCA for FFB yield, while pisiferas P1 (0.174/247), P10 (0.182/348), and P11 (0.182/308) were good combiners for oil-to-bunch ratio (O/B). The narrow genetic base of these materials was the main obstacle in breeding and population improvement. However, efforts have been made to introgress this material with the vast oil palm germplasm collections of MPOB to rectify the problem. PMID:22701095

  15. Genetic Performance and General Combining Ability of Oil Palm Deli dura x AVROS pisifera Tested on Inland Soils

    Directory of Open Access Journals (Sweden)

    A. Noh

    2012-01-01

    The performance of 11 oil palm AVROS (Algemene Vereniging van Rubberplanters ter Oostkust van Sumatra) pisiferas was evaluated based on their 40 dura x pisifera (DxP) progenies tested on inland soils, predominantly of Serdang Series. Fresh fruit bunch (FFB) yield of each pisifera ranged from 121.93 to 143.9 kg palm−1 yr−1 with a trial mean of 131.62 kg palm−1 yr−1. Analysis of variance (ANOVA) showed low genetic variability among pisifera parents for most of the characters, indicating uniformity of the pisifera population. This was anticipated, as the AVROS pisiferas were derived from a small population and were inbred materials. However, some of the pisiferas showed good general combining ability (GCA) for certain important economic traits. Three pisiferas (P1 (0.174/247), P3 (0.174/498), P11 (0.182/308)) were identified as having good GCA for FFB yield, while pisiferas P1 (0.174/247), P10 (0.182/348), and P11 (0.182/308) were good combiners for oil-to-bunch ratio (O/B). The narrow genetic base of these materials was the main obstacle in breeding and population improvement. However, efforts have been made to introgress this material with the vast oil palm germplasm collections of MPOB to rectify the problem.

  16. Parkinson's disease and driving ability

    Science.gov (United States)

    Singh, Rajiv; Pentland, Brian; Hunter, John; Provan, Frances

    2007-01-01

    Objectives: To explore the driving problems associated with Parkinson's disease (PD) and to ascertain whether any clinical features or tests predict driver safety. Methods: The driving ability of 154 individuals with PD referred to a driving assessment centre was determined by a combination of clinical tests, reaction times on a test rig and an in-car driving test. Results: The majority of cases (104, 66%) were able to continue driving, although 46 individuals required an automatic transmission and 10 others needed car modifications. Ability to drive was predicted by the severity of physical disease, age, presence of other associated medical conditions, particularly dementia, duration of disease, brake reaction, time on a test rig and score on a driving test (all p […] automatic transmission. A combination of clinical tests and in-car driving assessment will establish safety to drive, and a number of clinical correlates can be shown to predict the likely outcome and may assist in the decision process. This is the largest series of consecutive patients seen at a driving assessment centre reported to date, and the first to devise a scoring system for on-road driving assessment. PMID:17178820

  17. GalaxyDock BP2 score: a hybrid scoring function for accurate protein-ligand docking

    Science.gov (United States)

    Baek, Minkyung; Shin, Woong-Hee; Chung, Hwan Won; Seok, Chaok

    2017-07-01

    Protein-ligand docking is a useful tool for providing atomic-level understanding of protein functions in nature and design principles for artificial ligands or proteins with desired properties. The ability to identify the true binding pose of a ligand to a target protein among numerous possible candidate poses is an essential requirement for successful protein-ligand docking. Many previously developed docking scoring functions were trained to reproduce experimental binding affinities and were also used for scoring binding poses. However, in this study, we developed a new docking scoring function, called GalaxyDock BP2 Score, by directly training the scoring power of binding poses. This function is a hybrid of physics-based, empirical, and knowledge-based score terms that are balanced to strengthen the advantages of each component. The performance of the new scoring function exhibits significant improvement over existing scoring functions in decoy pose discrimination tests. In addition, when the score is used with the GalaxyDock2 protein-ligand docking program, it outperformed other state-of-the-art docking programs in docking tests on the Astex diverse set, the Cross2009 benchmark set, and the Astex non-native set. GalaxyDock BP2 Score and GalaxyDock2 with this score are freely available at http://galaxy.seoklab.org/softwares/galaxydock.html.

  18. Emotional Intelligence Abilities and Traits in Different Career Paths

    Science.gov (United States)

    Kafetsios, Konstantinos; Maridaki-Kassotaki, Aikaterini; Zammuner, Vanda L.; Zampetakis, Leonidas A.; Vouzas, Fotios

    2009-01-01

    Two studies tested hypotheses about differences in emotional intelligence (EI) abilities and traits between followers of different career paths. Compared to their social science peers, science students had higher scores in adaptability and general mood traits measured with the Emotion Quotient Inventory, but lower scores in strategic EI abilities…

  19. The predictive ability of different customer feedback metrics for retention

    NARCIS (Netherlands)

    de Haan, Evert; Verhoef, Peter C.; Wiesel, Thorsten

    This study systematically compares different customer feedback metrics (CFMs) - namely customer satisfaction, the Net Promoter Score, and the Customer Effort Score - to test their ability to predict retention across a wide range of industries. We classify the CFMs according to a time focus (past,

  20. The Evolution of the Black-White Test Score Gap in Grades K-3: The Fragility of Results. NBER Working Paper No. 17960

    Science.gov (United States)

    Bond, Timothy N.; Lang, Kevin

    2012-01-01

    Although both economists and psychometricians typically treat them as interval scales, test scores are reported using ordinal scales. Using the Early Childhood Longitudinal Study and the Children of the National Longitudinal Survey, we examine the effect of order-preserving scale transformations on the evolution of the black-white reading test…

  1. Test Score Equating Using Discrete Anchor Items versus Passage-Based Anchor Items: A Case Study Using "SAT"® Data. Research Report. ETS RR-14-14

    Science.gov (United States)

    Liu, Jinghua; Zu, Jiyun; Curley, Edward; Carey, Jill

    2014-01-01

    The purpose of this study is to investigate the impact of discrete anchor items versus passage-based anchor items on observed score equating using empirical data. This study compares an "SAT"® critical reading anchor that contains more discrete items proportionally, compared to the total tests to be equated, to another anchor that…

  2. Using Logistic Regression for Validating or Invalidating Initial Statewide Cut-Off Scores on Basic Skills Placement Tests at the Community College Level

    Science.gov (United States)

    Secolsky, Charles; Krishnan, Sathasivam; Judd, Thomas P.

    2013-01-01

    The community colleges in the state of New Jersey went through a process of establishing statewide cut-off scores for English and mathematics placement tests. The colleges wanted to communicate to secondary schools a consistent preparation that would be necessary for enrolling in Freshman Composition and College Algebra at the community college…

  3. Does the Computer-Assisted Remedial Mathematics Program at Kearny High School Lead to Improved Scores on the N.J. Early Warning Test?

    Science.gov (United States)

    Schalago-Schirm, Cynthia

    Eighth-grade students in New Jersey take the Early Warning Test (EWT), which involves reading, writing, and mathematics. Students with EWT scores below the state level of competency take a remedial mathematics course that provides students with computer-assisted instruction (2 days per week) as well as regular classroom instruction (3 days per…

  4. An Integrated Model of Academic Self-Concept Development: Academic Self-Concept, Grades, Test Scores, and Tracking over 6 Years

    Science.gov (United States)

    Marsh, Herbert W.; Pekrun, Reinhard; Murayama, Kou; Arens, A. Katrin; Parker, Philip D.; Guo, Jiesi; Dicke, Theresa

    2018-01-01

    Our newly proposed integrated academic self-concept model integrates 3 major theories of academic self-concept formation and developmental perspectives into a unified conceptual and methodological framework. Relations among math self-concept (MSC), school grades, test scores, and school-level contextual effects over 6 years, from the end of…

  5. Raising Test Scores vs. Teaching Higher Order Thinking (HOT): Senior Science Teachers' Views on How Several Concurrent Policies Affect Classroom Practices

    Science.gov (United States)

    Zohar, Anat; Alboher Agmon, Vered

    2018-01-01

    Purpose: This study investigates how senior science teachers viewed the effects of a Raising Test Scores policy and its implementation on instruction of higher order thinking (HOT), and on teaching thinking to students with low academic achievements. Background: The study was conducted in the context of three concurrent policies advocating: (a)…

  6. Comparative evaluation of chest radiography, low-field MRI, the Shwachman-Kulczycki score and pulmonary function tests in patients with cystic fibrosis

    International Nuclear Information System (INIS)

    Anjorin, Angela; Vogl, Thomas J.; Schmidt, Helga; Posselt, Hans-Georg; Smaczny, Christina; Ackermann, Hanns; Deimling, Michael; Abolmaali, Nasreddin

    2008-01-01

    The aim of this study was to investigate whether the parenchymal lung damage in patients suffering from cystic fibrosis (CF) can be equivalently quantified by the Chrispin-Norman (CN) scores determined with low-field magnetic resonance imaging (MRI) and conventional chest radiography (CXR). Both scores were correlated with pulmonary function tests (PFT) and the Shwachman-Kulczycki method (SKM). To evaluate the comparability of MRI and CXR for different states of the disease, all scores were applied to patients divided into three age groups. Seventy-three CF patients (mean SKM score: 62 ± 8) with a median age (range) of 14 years (7-32) were included. The mean CN scores determined with both imaging methods were comparable (CXR: 12.1 ± 4.7; MRI: 12.0 ± 4.5) and showed high correlation (P < 0.05, R = 0.97). Only weak correlations were found between imaging, PFT, and SKM. Both imaging modalities revealed significantly more severe disease expression with age, while PFT and SKM failed to detect early signs of disease. We conclude that imaging of the lung in CF patients is capable of detecting subtle and early parenchymal destruction before lung function or clinical scoring is affected. Furthermore, low-field MRI revealed high consistency with chest radiography and may be used for a thorough follow-up while avoiding radiation exposure. (orig.)

  7. Igualación equipercentil del Examen de Habilidades y Conocimientos Básicos (EXHCOBA). [Equipercentile equating of the Basic Ability and Knowledge Test (EXHCOBA)]

    Directory of Open Access Journals (Sweden)

    Norma Larrazolo

    2006-07-01

    Equipercentile equating is a statistical procedure in which students' raw scores on two different versions of the same test are considered equated if they correspond to the same percentile rank. A curve is presented to describe the differences in difficulty from version to version of a test. This work aimed to estimate the equipercentile equating values, by academic content area, of the Basic Ability and Knowledge Test (EXHCOBA, by its Spanish acronym), which the Autonomous University of Baja California (UABC) uses as a student selection test. This norm-referenced test has excellent quality standards, a high level of technological development, several supporting reliability and validity studies, and other good psychometric parameters. The equating parameters were estimated by applying the analytic method described by Kolen and Brennan (1995), with the random groups procedure used by UABC to collect data. Results show that equating was effective in adjusting four statistical moments (mean, standard deviation, skewness, and kurtosis) of the frequency distributions of EXHCOBA versions 3 and 4 compared with version 2, by content area, producing equal score distributions. Nevertheless, irregularities appeared at the ends of the curves, which suggests the need for a smoothing procedure. Spanish abstract (translated): Equipercentile equating is a statistical method in which the raw scores of two versions of a test are considered equated if they correspond to the same percentile rank in a group of examinees. In equipercentile equating, a curve is presented to describe the differences in difficulty from version to version. The objective of this work was to estimate the unsmoothed equipercentile equating of versions 3 and 4 with version 2, by content area, of the Basic Ability and Knowledge Test (EXHCOBA) that the Autonomous University of Baja California (UABC) uses to select applicants, a test that has an excellent level of quality
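
    The equipercentile idea described in this abstract can be sketched in a few lines. This is a simplified illustration only, not the analytic method of Kolen and Brennan (1995) that the authors applied: it assumes mid-point percentile ranks and linear interpolation between integer scores, with no smoothing. The function names and both score lists are hypothetical.

```python
def percentile_rank(scores, x):
    """Percent of examinees scoring below x, plus half of those scoring exactly x."""
    n = len(scores)
    below = sum(1 for s in scores if s < x)
    at = sum(1 for s in scores if s == x)
    return 100.0 * (below + 0.5 * at) / n


def equipercentile_equate(x, form_x, form_y, max_score):
    """Return the form-Y score whose percentile rank matches that of x on form X."""
    target = percentile_rank(form_x, x)
    # Percentile rank of every possible integer score on form Y.
    ranks = [percentile_rank(form_y, y) for y in range(max_score + 1)]
    if target <= ranks[0]:
        return 0.0
    for y in range(1, max_score + 1):
        if ranks[y] >= target:
            lo, hi = ranks[y - 1], ranks[y]
            # Linear interpolation between the two bracketing form-Y scores.
            return (y - 1) + (target - lo) / (hi - lo)
    return float(max_score)


# Hypothetical raw scores: form Y is uniformly one point easier than form X.
form_x = [1, 2, 2, 3, 3, 3, 4, 4, 5]
form_y = [2, 3, 3, 4, 4, 4, 5, 5, 6]

print(equipercentile_equate(3, form_x, form_y, max_score=6))  # 4.0
```

    In this toy data every form-Y score sits one point above its form-X counterpart, so a 3 on form X equates to 4.0 on form Y. With real data, sparse counts at the extremes make the interpolated curve jagged, which is the kind of irregularity that motivates smoothing.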

  8. Talking to Score: Impression Management in L2 Oral Assessment and the Co-Construction of a Test Discourse Genre

    Science.gov (United States)

    Luk, Jasmine

    2010-01-01

    In recent years, the emphasis in second language (L2) oral proficiency assessment has shifted from linguistic accuracy to discourse strategies such as the ability to initiate, respond, and negotiate meaning. This has resulted in a growing interest in the discourse analysis of students' performance in different oral proficiency assessment formats.…

  9. Comparing long-term results of PASAT and SDMT scores in relation to neuropsychological testing in multiple sclerosis

    NARCIS (Netherlands)

    Sonder, J.M.; Burggraaff, J.; Knol, D.L.; Polman, C.H.; Uitdehaag, B.M.J.

    2014-01-01

    Background and objectives: The Symbol Digit Modalities Test (SDMT) shows advantages over the Paced Auditory Serial Addition Test (PASAT) as a cognitive test in patients with multiple sclerosis (MS). To determine which of these tests is most valid and reliable over time as an indicator of the

  10. A multimedia situational judgment test with a constructed-response item format: Its relationship with personality, cognitive ability, job experience, and academic performance

    NARCIS (Netherlands)

    Oostrom, J.K.; Born, M.Ph.; Serlie, A.W.; Van der Molen, H.T.

    2011-01-01

    Advances in computer technology have created opportunities for the development of a multimedia situational test in which responses are filmed with a webcam. This paper examined the relationship of a so-called webcam test with personality, cognitive ability, job experience, and academic performance.

  11. Sensitivity and specificity of the minimal chair height standing ability test: a simple and affordable fall-risk screening instrument.

    Science.gov (United States)

    Reider, Nadia C; Naylor, Patti-Jean; Gaul, Catherine

    2015-01-01

    Fall-risk screening instruments have been underutilized in clinical settings because of their lengthy administration time, need for cumbersome equipment, and lack of validation. The primary objective of this study was to assess the validity (sensitivity and specificity) of the Minimal Chair Height Standing Ability Test (MCHSAT). The secondary objective was to develop guidelines to provide physical therapists with best-practice recommendations that can easily be implemented in clinical practice. A retrospective cohort study design was used in which falling history, major medical conditions, cognitive status (Mini-Mental State Examination), and level of independence (Independent Activities of Daily Living) were obtained for 167 community-dwelling older adults (mean age = 83.6 ± 7.3 years), residents of British Columbia, Canada. Participants' MCHSAT performance was assessed using a chair whose seat height was modifiable by increments of 5 cm, starting at 47 cm and lowering after each successful attempt. Sensitivity and specificity of the MCHSAT at each chair height were calculated and plotted as a receiver operating characteristic curve. A model to identify participants with a history of falls was developed using forward logistic regression (Wald). Mean MCHSAT performance (cm) was significantly better for participants without a history of falls (30.3 cm, 95% CI: 28.1-32.5 cm) than for those with a history of falls (37.7 cm, 95% CI: 35.5-40.0 cm) and was the single risk factor associated with fall status (β = 1.087, P […] history of falls was 34 cm (AUC = 0.72, 95% CI: 0.63-0.82). At this threshold, sensitivity and specificity values were 75% and 62%, respectively. Using 34 cm as the optimal cut-off, the MCHSAT correctly identified 75% of participants with a history of falls and 62% of participants without a history of falls. This provides evidence that the MCHSAT is a valid screening tool for use with an older Canadian population. As a simple and inexpensive testing instrument
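
    The sensitivity and specificity reported at a chair-height cut-off can be computed as in the following minimal sketch. It assumes that a participant screens positive when the minimal chair height reached is at or above the cut-off, which the abstract does not state explicitly; the data are invented for illustration and are not the study's.

```python
def sensitivity_specificity(heights_cm, has_fallen, cutoff_cm):
    """Return (sensitivity, specificity) for a 'height >= cutoff' screening rule."""
    tp = fn = tn = fp = 0
    for height, fell in zip(heights_cm, has_fallen):
        flagged = height >= cutoff_cm  # assumed rule: higher minimal height = higher risk
        if fell and flagged:
            tp += 1
        elif fell:
            fn += 1
        elif flagged:
            fp += 1
        else:
            tn += 1
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical minimal chair heights (cm) and fall histories for eight participants.
heights = [42, 37, 37, 32, 27, 32, 37, 27]
fallen = [True, True, True, True, False, False, False, False]

sens, spec = sensitivity_specificity(heights, fallen, cutoff_cm=34)
print(sens, spec)  # 0.75 0.75
```

    Repeating the calculation at each available chair height and plotting sensitivity against 1 − specificity yields the receiver operating characteristic curve the abstract refers to.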

  12. Associations between cadmium exposure and neurocognitive test scores in a cross-sectional study of US adults

    OpenAIRE

    Ciesielski, Timothy; Bellinger, David C.; Schwartz, Joel David; Hauser, Russ B.; Wright, Robert O.

    2013-01-01

    Background: Low-level environmental cadmium exposure and neurotoxicity has not been well studied in adults. Our goal was to evaluate associations between neurocognitive exam scores and a biomarker of cumulative cadmium exposure among adults in the Third National Health and Nutrition Examination Survey (NHANES III). Methods: NHANES III is a nationally representative cross-sectional survey of the U.S. population conducted between 1988 and 1994. We analyzed data from a subset of participants, ag...

  13. Preliminary testing of the reliability and feasibility of SAGE: a system to measure and score engagement with and use of research in health policies and programs.

    Science.gov (United States)

    Makkar, Steve R; Williamson, Anna; D'Este, Catherine; Redman, Sally

    2017-12-19

    Few measures of research use in health policymaking are available, and the reliability of such measures has yet to be evaluated. A new measure called the Staff Assessment of Engagement with Evidence (SAGE) incorporates an interview that explores policymakers' research use within discrete policy documents and a scoring tool that quantifies the extent of policymakers' research use based on the interview transcript and analysis of the policy document itself. We aimed to conduct a preliminary investigation of the usability, sensitivity, and reliability of the scoring tool in measuring research use by policymakers. Nine experts in health policy research and two independent coders were recruited. Each expert used the scoring tool to rate a random selection of 20 interview transcripts, and each independent coder rated 60 transcripts. The distribution of scores among experts was examined, and interrater reliability was then tested within and between the experts and independent coders. Average- and single-measure reliability coefficients were computed for each SAGE subscale. Experts' scores ranged from the limited to the extensive scoring bracket for all subscales. Experts as a group also exhibited at least a fair level of interrater agreement across all subscales. Single-measure reliability was at least fair except for three subscales: Relevance Appraisal, Conceptual Use, and Instrumental Use. Average- and single-measure reliability among independent coders was good to excellent for all subscales. Finally, reliability between experts and independent coders was fair to excellent for all subscales. Among experts, the scoring tool was comprehensible, usable, and sensitive enough to discriminate between documents with varying degrees of research use. In addition, the scoring tool yielded scores with good reliability among the independent coders. There was greater variability among experts, although as a group, the tool was fairly reliable. The alignment between experts' and independent

  14. Gifted Students' Self-Perceptions of Ability in Specific Subject Domains: Factor Structure and Relationship with Above-Level Test Scores

    Science.gov (United States)

    Swiatek, Mary Ann

    2005-01-01

    Current self-concept theories suggest a multi-dimensional construct, with domain-specific self-concepts hierarchically related to global self-concept. The academic domain may be comprised of subject-specific domains that are related to performance in corresponding areas. Here, gifted students' responses to questions about how they compare with…

  15. Fall risk screening in the elderly: A comparison of the minimal chair height standing ability test and 5-repetition sit-to-stand test.

    Science.gov (United States)

    Reider, Nadia; Gaul, Catherine

    2016-01-01

    Successfully identifying older adults with a high risk of falling can be complicated, time consuming and not feasible in daily medical practice. This study compared the effectiveness of the Minimal Chair Height Standing Ability Test (MCHSAT) and the 5-repetition sit-to-stand test (5R-STS) as fall-risk screening instruments for the elderly. A total of 167 community-dwelling older adults (mean age = 83.6 ± 7.3 years) were interviewed for demographics, fall history, cognition, and mobility status. MCHSAT performance was assessed using a chair whose seat height was modifiable by increments of 5 cm, starting at 47 cm and lowering after each successful attempt. 5R-STS performance was assessed by recording the time it took to rise and sit back down five consecutive times from a chair 47 cm high. Receiver Operating Characteristic (ROC) curves and Area under the Curve (AUC) were calculated for each test as well as for sub-groups of participants classified based on medical comorbidities (e.g. cardiac disease/stroke, lower limb arthritis). The MCHSAT and 5R-STS were equally effective fall-risk screening instruments for the overall population (AUC (95% CI) = 0.72 (0.63-0.82) and 0.73 (0.64-0.81), respectively). The 5R-STS was more effective than the MCHSAT for participants suffering from lower limb arthritis (AUC (95% CI) = 0.81 (0.70-0.92) and 0.71 (0.58-0.85), respectively), while the opposite was true for participants with a history of cardiac disease or stroke (AUC (95% CI) = 0.59 (0.44-0.80) and 0.65 (0.47-0.84), respectively). Due to their simplicity and quick administration time, the MCHSAT and 5R-STS are equally suitable for implementation in clinical settings. Copyright © 2016. Published by Elsevier Ireland Ltd.
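
    The AUC values compared in this abstract can be read as the probability that a randomly chosen faller has a worse test result than a randomly chosen non-faller. A minimal sketch of that pairwise (Mann-Whitney) calculation follows; the sit-to-stand times are hypothetical, and the study's own software and confidence-interval method are not described in the abstract.

```python
def auc(pos_scores, neg_scores):
    """Fraction of (positive, negative) pairs ranked correctly; ties count as half."""
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


# Hypothetical 5R-STS times in seconds (longer time assumed to indicate higher risk).
fallers = [18.0, 15.5, 14.0, 12.0]
non_fallers = [13.0, 11.0, 10.5, 15.5]

print(auc(fallers, non_fallers))  # 0.78125
```

    An AUC of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation, which is why the two instruments' values of 0.72 and 0.73 are described as equally effective.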

  16. Interpreting force concept inventory scores: Normalized gain and SAT scores

    Directory of Open Access Journals (Sweden)

    Jeffrey J. Steinert

    2007-05-01

    Preinstruction SAT scores and normalized gains (G) on the force concept inventory (FCI) were examined for individual students in interactive engagement (IE) courses in introductory mechanics at one high school (N=335) and one university (N=292), and strong, positive correlations were found for both populations (r=0.57 and r=0.46, respectively). These correlations are likely due to the importance of cognitive skills and abstract reasoning in learning physics. The larger correlation coefficient for the high school population may be a result of the much shorter time interval between taking the SAT and studying mechanics, because the SAT may provide a more current measure of abilities when high school students begin the study of mechanics than it does for college students, who begin mechanics years after the test is taken. In prior research a strong correlation between FCI G and scores on Lawson’s Classroom Test of Scientific Reasoning for students from the same two schools was observed. Our results suggest that, when interpreting class average normalized FCI gains and comparing different classes, it is important to take into account the variation of students’ cognitive skills, as measured either by the SAT or by Lawson’s test. While Lawson’s test is not commonly given to students in most introductory mechanics courses, SAT scores provide a readily available alternative means of taking account of students’ reasoning abilities. Knowing the students’ cognitive level before instruction also allows one to alter instruction or to use an intervention designed to improve students’ cognitive level.
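
    The abstract does not spell out how G is computed. The sketch below assumes the usual definition of normalized gain, the fraction of the possible improvement actually achieved, G = (post − pre) / (100 − pre) with scores in percent, and pairs it with a plain Pearson correlation. All student data are invented for illustration.

```python
from math import sqrt


def normalized_gain(pre_pct, post_pct):
    """Gain achieved as a fraction of the maximum possible gain (scores in percent)."""
    if pre_pct >= 100:
        raise ValueError("normalized gain is undefined for a perfect pretest score")
    return (post_pct - pre_pct) / (100.0 - pre_pct)


def pearson_r(xs, ys):
    """Pearson product-moment correlation coefficient."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Hypothetical students: SAT score, FCI pretest %, FCI posttest %.
sat = [900, 1000, 1100, 1200]
pre = [20, 30, 30, 40]
post = [40, 58, 65, 82]

gains = [normalized_gain(a, b) for a, b in zip(pre, post)]
print(gains)                  # [0.25, 0.4, 0.5, 0.7]
print(pearson_r(sat, gains))  # correlation between SAT score and individual G
```

    Dividing by the room left to improve is what allows students or classes with different pretest scores to be compared on the same scale.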

  17. Interpreting force concept inventory scores: Normalized gain and SAT scores

    Directory of Open Access Journals (Sweden)

    Vincent P. Coletta

    2007-05-01

    Preinstruction SAT scores and normalized gains (G) on the force concept inventory (FCI) were examined for individual students in interactive engagement (IE) courses in introductory mechanics at one high school (N=335) and one university (N=292), and strong, positive correlations were found for both populations (r=0.57 and r=0.46, respectively). These correlations are likely due to the importance of cognitive skills and abstract reasoning in learning physics. The larger correlation coefficient for the high school population may be a result of the much shorter time interval between taking the SAT and studying mechanics, because the SAT may provide a more current measure of abilities when high school students begin the study of mechanics than it does for college students, who begin mechanics years after the test is taken. In prior research a strong correlation between FCI G and scores on Lawson’s Classroom Test of Scientific Reasoning for students from the same two schools was observed. Our results suggest that, when interpreting class average normalized FCI gains and comparing different classes, it is important to take into account the variation of students’ cognitive skills, as measured either by the SAT or by Lawson’s test. While Lawson’s test is not commonly given to students in most introductory mechanics courses, SAT scores provide a readily available alternative means of taking account of students’ reasoning abilities. Knowing the students’ cognitive level before instruction also allows one to alter instruction or to use an intervention designed to improve students’ cognitive level.

  18. Development of a psychological test to measure ability-based emotional intelligence in the Indonesian workplace using an item response theory.

    Science.gov (United States)

    Fajrianthi; Zein, Rizqy Amelia

    2017-01-01

    This study aimed to develop an emotional intelligence (EI) test that is suitable for the Indonesian workplace context. The Airlangga Emotional Intelligence Test (Tes Kecerdasan Emosi Airlangga [TKEA]) was designed to measure three EI domains: 1) emotional appraisal, 2) emotional recognition, and 3) emotional regulation. TKEA consisted of 120 items with 40 items for each subset. TKEA was developed based on the Situational Judgment Test (SJT) approach. To ensure its psychometric qualities, categorical confirmatory factor analysis (CCFA) and item response theory (IRT) were applied to test its validity and reliability. The study was conducted on 752 participants, and the results showed that the test information function (TIF) was 3.414 (ability level = 0) for subset 1, 12.183 for subset 2 (ability level = -2), and 2.398 for subset 3 (ability level = -2). It is concluded that TKEA performs very well in measuring individuals with a low level of EI ability. It is worth noting that TKEA is currently at the development stage; therefore, in this study, we investigated TKEA's item analysis and dimensionality test of each TKEA subset.
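
    A test information function sums the information each item contributes at a given ability level, so a peak at a negative ability level means the test measures low-ability examinees most precisely. The abstract does not say which IRT model was fitted; the sketch below assumes a two-parameter logistic (2PL) model, and the item parameters are hypothetical.

```python
from math import exp


def p_correct(theta, a, b):
    """2PL probability of a keyed response at ability theta (a: discrimination, b: difficulty)."""
    return 1.0 / (1.0 + exp(-a * (theta - b)))


def item_information(theta, a, b):
    """Fisher information of one 2PL item: a^2 * P * (1 - P)."""
    p = p_correct(theta, a, b)
    return a * a * p * (1.0 - p)


def total_information(theta, items):
    """Test information: the sum of item informations at theta."""
    return sum(item_information(theta, a, b) for a, b in items)


# Hypothetical (a, b) pairs for three easy items, all with difficulty below zero.
items = [(1.5, -2.5), (2.0, -2.0), (1.2, -1.5)]

for theta in (-2.0, 0.0):
    print(theta, round(total_information(theta, items), 3))
```

    Because every hypothetical item has a difficulty below zero, the summed information is larger at an ability of −2 than at 0, the same pattern the abstract reports for subsets 2 and 3.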

  19. Development of a psychological test to measure ability-based emotional intelligence in the Indonesian workplace using an item response theory

    Directory of Open Access Journals (Sweden)

    Fajrianthi

    2017-11-01

    Fajrianthi (1), Rizqy Amelia Zein (2). (1) Department of Industrial and Organizational Psychology, (2) Department of Personality and Social Psychology, Faculty of Psychology, Universitas Airlangga, Surabaya, East Java, Indonesia. Abstract: This study aimed to develop an emotional intelligence (EI) test that is suitable to the Indonesian workplace context. Airlangga Emotional Intelligence Test (Tes Kecerdasan Emosi Airlangga [TKEA]) was designed to measure three EI domains: 1) emotional appraisal, 2) emotional recognition, and 3) emotional regulation. TKEA consisted of 120 items with 40 items for each subset. TKEA was developed based on the Situational Judgment Test (SJT) approach. To ensure its psychometric qualities, categorical confirmatory factor analysis (CCFA) and item response theory (IRT) were applied to test its validity and reliability. The study was conducted on 752 participants, and the results showed that test information function (TIF) was 3.414 (ability level = 0) for subset 1, 12.183 for subset 2 (ability level = -2), and 2.398 for subset 3 (level of ability = -2). It is concluded that TKEA performs very well to measure individuals with a low level of EI ability. It is worth to note that TKEA is currently at the development stage; therefore, in this study, we investigated TKEA’s item analysis and dimensionality test of each TKEA subset. Keywords: categorical confirmatory factor analysis, emotional intelligence, item response theory

  20. Evaluation of the validity of osteoporosis and fracture risk assessment tools (IOF One Minute Test, SCORE, and FRAX) in postmenopausal Palestinian women.

    Science.gov (United States)

    Kharroubi, Akram; Saba, Elias; Ghannam, Ibrahim; Darwish, Hisham

    2017-12-01

    Simple self-assessment tools are needed to identify women at high risk of developing osteoporosis. In this study, tools like the IOF One Minute Test, Fracture Risk Assessment Tool (FRAX), and Simple Calculated Osteoporosis Risk Estimation (SCORE) were found to be valid for Palestinian women. The threshold for predicting women at risk for each tool was estimated. The purpose of this study is to evaluate the validity of the updated IOF (International Osteoporosis Foundation) One Minute Osteoporosis Risk Assessment Test, FRAX, and SCORE, as well as age alone, to detect the risk of developing osteoporosis in postmenopausal Palestinian women. Three hundred eighty-two women 45 years and older were recruited, including 131 women with osteoporosis and 251 controls, following bone mineral density (BMD) measurement; 287 completed the questionnaires of the different risk assessment tools. Receiver operating characteristic (ROC) curves were evaluated for each tool using BMD as the gold standard for osteoporosis. The area under the ROC curve (AUC) was highest for FRAX calculated with BMD for predicting hip fractures (0.897), followed by FRAX for major fractures (0.826), with cut-off values >1.5 and >7.8%, respectively. The IOF One Minute Test AUC (0.629) was the lowest among the tested tools but showed sufficient accuracy for predicting the risk of developing osteoporosis, with a cut-off value of >4 total yes answers out of 18 questions. The SCORE test and age alone were also good predictors of the risk of developing osteoporosis. According to the ROC curve for age, women ≥64 years had a higher risk of developing osteoporosis. A higher percentage of women with low BMD (T-score ≤-1.5) or osteoporosis (T-score ≤-2.5) was found among women who were not exposed to the sun, who had menopause before the age of 45 years, or who had a lower body mass index (BMI) compared to controls. Women who often fall had lower BMI and approximately 27% of the recruited postmenopausal