WorldWideScience

Sample records for applying item response

  1. Applying Item Response Theory methods to design a learning progression-based science assessment

    Science.gov (United States)

    Chen, Jing

    Learning progressions are used to describe how students' understanding of a topic progresses over time and to classify the progress of students into steps or levels. This study applies Item Response Theory (IRT) based methods to investigate how to design learning progression-based science assessments. The research questions of this study are: (1) how to use items in different formats to classify students into levels on the learning progression, (2) how to design a test to give good information about students' progress through the learning progression of a particular construct, and (3) what characteristics of test items support their use for assessing students' levels. Data used for this study were collected from 1500 elementary and secondary school students during 2009-2010. The written assessment was developed in several formats, such as Constructed Response (CR), Ordered Multiple Choice (OMC), and Multiple True or False (MTF) items. The following are the main findings from this study. The OMC, MTF and CR items might measure different components of the construct. A single construct explained most of the variance in students' performances. However, additional dimensions in terms of item format can explain a certain amount of the variance in student performance, so additional dimensions need to be considered when we want to capture the differences in students' performances on different types of items targeting the understanding of the same underlying progression. Items in each item format need to be improved in certain ways to classify students more accurately into the learning progression levels. This study establishes some general steps that can be followed to design other learning progression-based tests as well. For example, first, the boundaries between levels on the IRT scale can be defined by using the means of the item thresholds across a set of good items. Second, items in multiple formats can be selected to achieve the information criterion at all
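    The first design step in this record (level boundaries taken as the means of item thresholds across a set of items) can be sketched roughly as follows. This is only an illustration: the function names, the list-of-lists layout, and the threshold values are assumptions made for the example, not the study's data or software.

```python
from bisect import bisect_right
from statistics import mean


def level_boundaries(item_thresholds):
    """Average the k-th threshold across items to get the k-th level boundary.

    item_thresholds: one list of thresholds per item, on the IRT (logit)
    scale, all of the same length (hypothetical layout for this sketch).
    """
    n_thresholds = len(item_thresholds[0])
    if any(len(t) != n_thresholds for t in item_thresholds):
        raise ValueError("every item must have the same number of thresholds")
    return [mean(t[k] for t in item_thresholds) for k in range(n_thresholds)]


def classify(theta, boundaries):
    """Return the 1-based level whose interval contains the ability estimate.

    Assumes the boundaries are in increasing order.
    """
    return bisect_right(boundaries, theta) + 1


# Made-up thresholds for three items, each separating four levels.
thresholds = [
    [-1.2, 0.1, 1.4],
    [-0.8, 0.3, 1.0],
    [-1.0, -0.1, 1.2],
]
boundaries = level_boundaries(thresholds)
levels = [classify(theta, boundaries) for theta in (-2.0, 0.0, 0.5, 2.0)]
```

    With these made-up numbers the boundaries fall near -1.0, 0.1 and 1.2 on the logit scale, so the four example abilities land in four different levels.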

  2. Generalizability theory and item response theory

    NARCIS (Netherlands)

    Glas, C.A.W.; Eggen, T.J.H.M.; Veldkamp, B.P.

    2012-01-01

    Item response theory is usually applied to items with a selected-response format, such as multiple choice items, whereas generalizability theory is usually applied to constructed-response tasks assessed by raters. However, in many situations, raters may use rating scales consisting of items with a s

  3. Automated Item Selection Using Item Response Theory.

    Science.gov (United States)

    Stocking, Martha L.; And Others

    This paper presents a new heuristic approach to interactive test assembly that is called the successive item replacement algorithm. This approach builds on the work of W. J. van der Linden (1987) and W. J. van der Linden and E. Boekkooi-Timminga (1989) in which methods of mathematical optimization are combined with item response theory to…

  4. Thermal response based item identification.

    Energy Technology Data Exchange (ETDEWEB)

    Smith, M. K. (Morag K.); Hypes, P. A. (Philip A.); Bracken, D. S. (David S.)

    2001-01-01

    One of the most difficult problems in nondestructive assay (NDA) of nuclear materials is identifying the chemical form of the nuclear material and the surrounding matrix. Recent work analyzing the calorimeter response of sources embedded in a variety of matrices has led to a possible solution to this problem. The wide range of thermal time constants exhibited by typical matrix materials makes it possible to differentiate between materials, based on time constants extracted from the measured response. Potential applications include simple item identification, item fingerprinting as part of shipper-receiver measurements, and distinguishing between Pu metal and Pu oxide as required under certain proposed attribute measurements. The results of applying this technique to a variety of items will be presented and discussed.

  5. Thermal response based item identification

    International Nuclear Information System (INIS)

    One of the most difficult problems in nondestructive assay (NDA) of nuclear materials is identifying the chemical form of the nuclear material and the surrounding matrix. Recent work analyzing the calorimeter response of sources embedded in a variety of matrices has led to a possible solution to this problem. The wide range of thermal time constants exhibited by typical matrix materials makes it possible to differentiate between materials, based on time constants extracted from the measured response. Potential applications include simple item identification, item fingerprinting as part of shipper-receiver measurements, and distinguishing between Pu metal and Pu oxide as required under certain proposed attribute measurements. The results of applying this technique to a variety of items will be presented and discussed.

  6. Measuring Student Learning with Item Response Theory

    Science.gov (United States)

    Lee, Young-Jin; Palazzo, David J.; Warnakulasooriya, Rasil; Pritchard, David E.

    2008-01-01

    We investigate short-term learning from hints and feedback in a Web-based physics tutoring system. Both the skill of students and the difficulty and discrimination of items were determined by applying item response theory (IRT) to the first answers of students who are working on for-credit homework items in an introductory Newtonian physics…

  7. Item response theory applied to the Beck Depression Inventory

    Directory of Open Access Journals (Sweden)

    Stela Maris de Jezus Castro

    2010-09-01

    The Beck Depression Inventory (BDI), a scale that measures the latent trait of intensity of depressive symptoms, can be assessed using Item Response Theory (IRT). This study used the Graded Response Model (GRM) to assess the intensity of depressive symptoms in 4,025 individuals who responded to the BDI, in order to make efficient use of the information available through the different aspects that this methodology makes possible. The model was fitted in the PARSCALE software. We identified 13 BDI items in which at least one response category was not more likely than the others to be chosen, so these items had to be recategorized. The items with the greatest discriminating power relate to sadness, pessimism, sense of failure, dissatisfaction, self-dislike, indecision, and difficulty working. The most severe items are those related to weight loss, social withdrawal, and suicidal ideas. The group of 202 individuals with the highest intensity of depressive symptoms was 74% women, and almost 84% had a diagnosis of some psychiatric disorder. The results highlight some of the many gains that come from using IRT in the analysis of latent traits.
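    As a rough illustration of the graded response model named in this record, the sketch below computes category probabilities from cumulative logistic curves and then flags categories that are never the single most likely response anywhere on an ability grid. The parameter values, the grid, and the function names are assumptions for the example; the study itself fitted the model in PARSCALE, and its exact recategorization criterion may differ.

```python
import math


def logistic(x):
    return 1.0 / (1.0 + math.exp(-x))


def grm_category_probs(theta, a, b):
    """Category probabilities under Samejima's graded response model.

    a: item discrimination; b: ordered thresholds b_1 < ... < b_m.
    P(X >= k) = logistic(a * (theta - b_k)); each category probability is
    the difference of adjacent cumulative probabilities.
    Returns m + 1 probabilities for categories 0..m.
    """
    cumulative = [1.0] + [logistic(a * (theta - bk)) for bk in b] + [0.0]
    return [cumulative[k] - cumulative[k + 1] for k in range(len(b) + 1)]


def never_modal_categories(a, b, grid):
    """List categories that are not the most likely one at any grid point."""
    modal = set()
    for theta in grid:
        probs = grm_category_probs(theta, a, b)
        modal.add(max(range(len(probs)), key=probs.__getitem__))
    return [k for k in range(len(b) + 1) if k not in modal]


grid = [i / 10 for i in range(-40, 41)]  # ability values from -4 to 4

# Well-separated thresholds: every category is modal somewhere.
probs = grm_category_probs(theta=0.0, a=1.5, b=[-1.0, 0.0, 1.0])
well_separated = never_modal_categories(1.5, [-1.0, 0.0, 1.0], grid)

# Thresholds very close together: the middle category is never modal.
crowded = never_modal_categories(1.0, [-0.1, 0.1], grid)
```

    In the second, hypothetical item the two thresholds nearly coincide, so the middle category is squeezed out; this is the kind of pattern that could motivate merging categories.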

  8. A Mixed Effects Randomized Item Response Model

    Science.gov (United States)

    Fox, J.-P.; Wyrick, Cheryl

    2008-01-01

    The randomized response technique ensures that individual item responses, denoted as true item responses, are randomized before observing them and so-called randomized item responses are observed. A relationship is specified between randomized item response data and true item response data. True item response data are modeled with a (non)linear…

  9. Mokken scale analysis of mental health and well-being questionnaire item responses: a non-parametric IRT method in empirical research for applied health researchers

    Directory of Open Access Journals (Sweden)

    Stochl Jan

    2012-06-01

    Background: Mokken scaling techniques are a useful tool for researchers who wish to construct unidimensional tests or use questionnaires that comprise multiple binary or polytomous items. The stochastic cumulative scaling model offered by this approach is ideally suited when the intention is to score an underlying latent trait by simple addition of the item response values. In our experience, the Mokken model appears to be less well-known than, for example, the (related) Rasch model, but is seeing increasing use in contemporary clinical research and public health. Mokken's method is a generalisation of Guttman scaling that can assist in the determination of the dimensionality of tests or scales, and enables consideration of reliability, without reliance on Cronbach's alpha. This paper provides a practical guide to the application and interpretation of this non-parametric item response theory method in empirical research with health and well-being questionnaires. Methods: Scalability of data from (1) a cross-sectional health survey (the Scottish Health Education Population Survey) and (2) a general population birth cohort study (the National Child Development Study) illustrate the method and modeling steps for dichotomous and polytomous items respectively. The questionnaire data analyzed comprise responses to the 12-item General Health Questionnaire, under the binary recoding recommended for screening applications, and the ordinal/polytomous responses to the Warwick-Edinburgh Mental Well-being Scale. Results and conclusions: After an initial analysis example in which we select items by phrasing (six positively versus six negatively worded items), we show that all items from the 12-item General Health Questionnaire (GHQ-12), when binary scored, were scalable according to the double monotonicity model, in two short scales comprising six items each (Bech's "well-being" and "distress" clinical scales). An illustration of ordinal item analysis
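    Scalability in Mokken analysis is commonly summarized with Loevinger's H coefficient. The record above gives no formula, so the sketch below is an assumption-laden illustration for binary items only: it uses the covariance form of H (the sum of observed inter-item covariances divided by the sum of their maxima given the item means), with a hypothetical function name and toy data.

```python
from itertools import combinations


def scale_h(data):
    """Loevinger's H for a set of dichotomous (0/1) items.

    data: list of respondent rows, each a list of 0/1 item scores.
    For a pair of binary items with proportions p_i and p_j, the largest
    possible joint proportion is min(p_i, p_j), so the maximum covariance
    is min(p_i, p_j) - p_i * p_j.
    """
    n = len(data)
    k = len(data[0])
    p = [sum(row[i] for row in data) / n for i in range(k)]
    observed = 0.0
    maximum = 0.0
    for i, j in combinations(range(k), 2):
        p_ij = sum(row[i] * row[j] for row in data) / n
        observed += p_ij - p[i] * p[j]
        maximum += min(p[i], p[j]) - p[i] * p[j]
    if maximum == 0:
        raise ValueError("H is undefined when items have no variance")
    return observed / maximum


# A perfect Guttman pattern: no respondent passes a harder item
# while failing an easier one.
guttman = [
    [0, 0, 0],
    [1, 0, 0],
    [1, 1, 0],
    [1, 1, 1],
]
h_guttman = scale_h(guttman)

# Adding two respondents who break the ordering lowers H.
h_with_errors = scale_h(guttman + [[0, 0, 1], [0, 1, 0]])
```

    A perfect Guttman pattern gives H = 1, and responses that violate the item ordering pull H down. Polytomous items, as in the ordinal analyses this record describes, need a more general formulation that is not shown here.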

  10. Ramsay-Curve Item Response Theory for the Three-Parameter Logistic Item Response Model

    Science.gov (United States)

    Woods, Carol M.

    2008-01-01

    In Ramsay-curve item response theory (RC-IRT), the latent variable distribution is estimated simultaneously with the item parameters of a unidimensional item response model using marginal maximum likelihood estimation. This study evaluates RC-IRT for the three-parameter logistic (3PL) model with comparisons to the normal model and to the empirical…

  11. Extending item response theory to online homework

    Science.gov (United States)

    Kortemeyer, Gerd

    2014-06-01

    Item response theory (IRT) becomes an increasingly important tool when analyzing "big data" gathered from online educational venues. However, the mechanism was originally developed in traditional exam settings, and several of its assumptions are infringed upon when deployed in the online realm. For a large-enrollment physics course for scientists and engineers, the study compares outcomes from IRT analyses of exam and homework data, and then proceeds to investigate the effects of each confounding factor introduced in the online realm. It is found that IRT yields the correct trends for learner ability and meaningful item parameters, yet overall agreement with exam data is moderate. It is also found that learner ability and item discrimination are robust over a wide range with respect to model assumptions and introduced noise. Item difficulty is also robust, but over a narrower range.

  12. Extending Item Response Theory to Online Homework

    CERN Document Server

    Kortemeyer, Gerd

    2014-01-01

    Item Response Theory becomes an increasingly important tool when analyzing "Big Data" gathered from online educational venues. However, the mechanism was originally developed in traditional exam settings, and several of its assumptions are infringed upon when deployed in the online realm. For a large-enrollment physics course for scientists and engineers, the study compares outcomes from IRT analyses of exam and homework data, and then proceeds to investigate the effects of each confounding factor introduced in the online realm. It is found that IRT yields the correct trends for learner ability and meaningful item parameters, yet overall agreement with exam data is moderate. It is also found that learner ability and item discrimination are robust over wide ranges with respect to model assumptions and introduced noise; item difficulty is less so.

  13. A Bifactor Multidimensional Item Response Theory Model for Differential Item Functioning Analysis on Testlet-Based Items

    Science.gov (United States)

    Fukuhara, Hirotaka; Kamata, Akihito

    2011-01-01

    A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into…

  14. Groups of persons and groups of items in nonparametric item response theory

    NARCIS (Netherlands)

    Molenaar, IW; Yanai, H; Okada, A; Shigemasu, K; Kano, Y; Meulman, JJ

    2003-01-01

    In standard applications of Item Response Theory (IRT), n exchangeable persons have responded to k exchangeable items. Subgroups are distinguished among neither persons nor items. This paper reviews methods and results for situations where it is meaningful to consider subgroups (of persons, of items,

  15. An Investigation of Multiple-Response-Option Multiple-Choice Items: Item Performance and Processing Demands.

    Science.gov (United States)

    Huntley, Renee M.; Plake, Barbara S.

    The combinational-format item (CFI)--multiple-choice item with combinations of alternatives presented as response choices--was studied to determine whether CFIs were different from regular multiple-choice items in item characteristics or in cognitive processing demands. Three undergraduate Foundations of Education classes (consisting of a total of…

  16. Item response theory

    Directory of Open Access Journals (Sweden)

    Eutalia Aparecida Candido de Araujo

    2009-12-01

    The concern with measuring psychological traits is an old one, and many studies and proposed methods have been developed to achieve this goal. Among these proposals, Item Response Theory (IRT) stands out; it originally emerged to address limitations of Classical Test Theory, which is still widely used today in the measurement of psychological traits. The main point of IRT is that it takes each item into account individually rather than emphasizing the total scores; therefore, the findings depend not only on the test or questionnaire but on each item that composes it. This article presents this theory, which revolutionized measurement theory.

  17. Analyzing Force Concept Inventory with Item Response Theory

    CERN Document Server

    Wang, Jing

    2010-01-01

    Item Response Theory (IRT) is a popular assessment method used in education measurement, which builds on an assumption of a probability framework connecting students' innate ability and their actual performances on test items. The model transforms students' raw test scores through a nonlinear regression process into a scaled proficiency rating, which can be used to compare results obtained with different test questions. IRT also provides a theoretical approach to address ceiling effect and guessing. We applied IRT to analyze the Force Concept Inventory (FCI). The data was collected from 2802 students taking intro level mechanics courses at The Ohio State University. The data was analyzed with a 3-parameter item response model for multiple choice questions. We describe the procedures of the analysis and discuss the results and the interpretations. The analysis outcomes are compiled to provide a detailed IRT measurement metric of the FCI, which can be easily referenced and used by teachers and researchers for a...

  18. Evaluation of Anxiety Sensitivity among Daily Adult Smokers using Item Response Theory Analysis

    OpenAIRE

    Zvolensky, Michael J.; Strong, David; Bernstein, Amit; Vujanovic, Anka A.; Marshall, Erin C.

    2008-01-01

    The present investigation applied Item Response Theory (IRT) methodology to the 16-item Anxiety Sensitivity Index (ASI; Reiss, Peterson, Gursky, & McNally, 1986) for a sample of 475 daily adult smokers (52% women; mean age = 26.9, SD = 11.1, Range = 18 – 65). Using nonparametric item response analysis, all 16 ASI items were evaluated. Evaluation of the Option Characteristic Curves for each item revealed 4 poorly discriminating ASI items (1: “It is important not to appear nervous;” 5: “It is impor...

  19. An item response curves analysis of the Force Concept Inventory

    Science.gov (United States)

    Morris, Gary A.; Harshman, Nathan; Branum-Martin, Lee; Mazur, Eric; Mzoughi, Taha; Baker, Stephen D.

    2012-09-01

    Several years ago, we introduced the idea of item response curves (IRC), a simplistic form of item response theory (IRT), to the physics education research community as a way to examine item performance on diagnostic instruments such as the Force Concept Inventory (FCI). We noted that a full-blown analysis using IRT would be a next logical step, which several authors have since taken. In this paper, we show that our simple approach not only yields similar conclusions in the analysis of the performance of items on the FCI to the more sophisticated and complex IRT analyses but also permits additional insights by characterizing both the correct and incorrect answer choices. Our IRC approach can be applied to a variety of multiple-choice assessments but, as applied to a carefully designed instrument such as the FCI, allows us to probe student understanding as a function of ability level through an examination of each answer choice. We imagine that physics teachers could use IRC analysis to identify prominent misconceptions and tailor their instruction to combat those misconceptions, fulfilling the FCI authors' original intentions for its use. Furthermore, the IRC analysis can assist test designers to improve their assessments by identifying nonfunctioning distractors that can be replaced with distractors attractive to students at various ability levels.
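    An item response curve of the kind this record describes can be sketched as the fraction of students choosing each answer option at each ability level. The sketch below uses the raw total score as the ability proxy, which is an assumption of this example, as are the function name, the string-per-student data layout, and the toy responses.

```python
from collections import Counter, defaultdict


def item_response_curves(responses, key, item):
    """Fraction of students choosing each option of one item, by total score.

    responses: one string per student, one character per item (the option
    chosen); key: string of correct options; item: 0-based item index.
    Returns {total_score: {option: fraction}} with scores in ascending order.
    """
    counts = defaultdict(Counter)
    for answers in responses:
        score = sum(chosen == correct for chosen, correct in zip(answers, key))
        counts[score][answers[item]] += 1
    return {
        score: {opt: n / sum(c.values()) for opt, n in c.items()}
        for score, c in sorted(counts.items())
    }


# Toy data: five students, three items, correct answers "ABC".
key = "ABC"
responses = ["ABC", "ABA", "BBC", "BBA", "CAB"]
curves = item_response_curves(responses, key, item=0)
```

    Plotting each option's fraction against score, incorrect options included, gives the curves. A distractor whose fraction stays near zero at every score would be a candidate for replacement, in the spirit of the record's remark about nonfunctioning distractors.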

  20. Evaluating Item Discrimination Power of WHOQOL-BREF from an Item Response Model Perspectives

    Science.gov (United States)

    Lin, Ting Hsiang; Yao, Grace

    2009-01-01

    Quality of life (QOL) has become an important component of health. By using the methodology of psychometric theory, we examine the item properties of the WHOQOL-BREF. Samejima's graded response model with natural metrics of the logistic response function was fitted. The results showed items with negative natures were less discriminating. Items…

  1. Cognitive processes in self-report responses: tests of item context effects in work attitude measures.

    Science.gov (United States)

    Harrison, D A; McLaughlin, M E

    1993-02-01

    Much applied research relies on multi-item, self-report instruments. Drawing from recent cognitive theories, it was hypothesized that the items preceding a self-report item, its item context, can generate cognitive carryover and prompt context-consistent responses. These hypotheses were tested in 2 investigations: a field experiment involving 431 employees of a nonprofit urban hospital and a laboratory replication involving 245 undergraduate business students who held full- or part-time jobs. In both studies, evaluatively neutral items were placed in specially arranged blocks of uniformly positive, uniformly negative, or randomly mixed items on 3 modified Job Descriptive Index scales. Responses to the neutral items differed across the 3 forms, but scale-level psychometric properties remained unchanged. The implications of these item- and scale-level results for a variety of self-report measures in organizations are discussed. PMID:8449851

  2. An Item Response Model for Characterizing Test Compromise.

    Science.gov (United States)

    Segall, Daniel O.

    2002-01-01

    Developed an item response model for characterizing test compromise that enables the estimation of item preview and score-gain distributions. In the approach, model parameters and posterior distributions are estimated by Markov Chain Monte Carlo procedures. Simulation study results suggest that when at least some test items are known to be…

  3. Using response times for item selection in adaptive testing

    NARCIS (Netherlands)

    Linden, van der Wim J.

    2008-01-01

    Response times on items can be used to improve item selection in adaptive testing provided that a probabilistic model for their distribution is available. In this research, the author used a hierarchical modeling framework with separate first-level models for the responses and response times and a s

  4. Generalized Fiducial Inference for Binary Logistic Item Response Models.

    Science.gov (United States)

    Liu, Yang; Hannig, Jan

    2016-06-01

    Generalized fiducial inference (GFI) has been proposed as an alternative to likelihood-based and Bayesian inference in mainstream statistics. Confidence intervals (CIs) can be constructed from a fiducial distribution on the parameter space in a fashion similar to those used with a Bayesian posterior distribution. However, no prior distribution needs to be specified, which renders GFI more suitable when no a priori information about model parameters is available. In the current paper, we apply GFI to a family of binary logistic item response theory models, which includes the two-parameter logistic (2PL), bifactor and exploratory item factor models as special cases. Asymptotic properties of the resulting fiducial distribution are discussed. Random draws from the fiducial distribution can be obtained by the proposed Markov chain Monte Carlo sampling algorithm. We investigate the finite-sample performance of our fiducial percentile CI and two commonly used Wald-type CIs associated with maximum likelihood (ML) estimation via Monte Carlo simulation. The use of GFI in high-dimensional exploratory item factor analysis was illustrated by the analysis of a set of the Eysenck Personality Questionnaire data. PMID:26769340

  5. Analysis of Item Response and Differential Item Functioning of Alcohol Expectancies in Middle School Youth

    OpenAIRE

    McCarthy, Denis M.; Pedersen, Sarah L.; D'Amico, Elizabeth J.

    2009-01-01

    Drinking behavior in pre-adolescence is a significant predictor of both short and long-term negative consequences. This study examined the psychometric properties of one known risk factor for drinking in this age group, alcohol expectancies, within an Item Response Theory framework. In a sample of middle school youth (N = 1273), we tested differential item functioning (DIF) in positive and negative alcohol expectancies across grade, gender, and ethnicity. MIMIC model analyses tested differenc...

  6. Obtaining a common scale for item response theory item parameters using separate versus concurrent estimation in the common-item equating design

    NARCIS (Netherlands)

    Hanson, Bradley A.; Beguin, Anton A.

    2002-01-01

    Item response theory item parameters can be estimated using data from a common-item equating design either separately for each form or concurrently across forms. This paper reports the results of a simulation study of separate versus concurrent item parameter estimation. Using simulated data from a

  7. Semiparametric Item Response Functions in the Context of Guessing

    Science.gov (United States)

    Falk, Carl F.; Cai, Li

    2016-01-01

    We present a logistic function of a monotonic polynomial with a lower asymptote, allowing additional flexibility beyond the three-parameter logistic model. We develop a maximum marginal likelihood-based approach to estimate the item parameters. The new item response model is demonstrated on math assessment data from a state, and a computationally…

  8. Item Response Theory Modeling of the Philadelphia Naming Test

    Science.gov (United States)

    Fergadiotis, Gerasimos; Kellough, Stacey; Hula, William D.

    2015-01-01

    Purpose: In this study, we investigated the fit of the Philadelphia Naming Test (PNT; Roach, Schwartz, Martin, Grewal, & Brecher, 1996) to an item-response-theory measurement model, estimated the precision of the resulting scores and item parameters, and provided a theoretical rationale for the interpretation of PNT overall scores by relating…

  9. Practical Guide to Conducting an Item Response Theory Analysis

    Science.gov (United States)

    Toland, Michael D.

    2014-01-01

    Item response theory (IRT) is a psychometric technique used in the development, evaluation, improvement, and scoring of multi-item scales. This pedagogical article provides the necessary information needed to understand how to conduct, interpret, and report results from two commonly used ordered polytomous IRT models (Samejima's graded…

  10. A Bayesian Semiparametric Item Response Model with Dirichlet Process Priors

    Science.gov (United States)

    Miyazaki, Kei; Hoshino, Takahiro

    2009-01-01

    In Item Response Theory (IRT), item characteristic curves (ICCs) are illustrated through logistic models or normal ogive models, and the probability that examinees give the correct answer is usually a monotonically increasing function of their ability parameters. However, since only limited patterns of shapes can be obtained from logistic models…

  11. Analyzing force concept inventory with item response theory

    Science.gov (United States)

    Wang, Jing; Bao, Lei

    2010-10-01

    Item response theory is a popular assessment method used in education. It rests on the assumption of a probability framework that relates students' innate ability and their performance on test questions. Item response theory transforms students' raw test scores into a scaled proficiency score, which can be used to compare results obtained with different test questions. The scaled score also addresses the issues of ceiling effects and guessing, which commonly exist in quantitative assessment. We used item response theory to analyze the force concept inventory (FCI). Our results show that item response theory can be useful for analyzing physics concept surveys such as the FCI and produces results about the individual questions and student performance that are beyond the capability of classical statistics. The theory yields detailed measurement parameters regarding the difficulty, discrimination features, and probability of correct guess for each of the FCI questions.
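    The three item parameters mentioned in this record (difficulty, discrimination, and probability of a correct guess) are the parameters of the standard three-parameter logistic (3PL) model. The sketch below shows that response function with illustrative values; the parameter values are not taken from the FCI analysis, and the optional scaling constant (often written D = 1.7) is omitted.

```python
import math


def p_correct_3pl(theta, a, b, c):
    """Probability of a correct response under the 3PL model.

    theta: student ability; a: discrimination; b: difficulty;
    c: lower asymptote (probability of a correct guess).
    P = c + (1 - c) / (1 + exp(-a * (theta - b)))
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


# Illustrative item: moderately discriminating, slightly hard,
# with a 20% chance of guessing correctly.
a, b, c = 1.2, 0.5, 0.2

p_at_difficulty = p_correct_3pl(theta=b, a=a, b=b, c=c)   # halfway between c and 1
p_low_ability = p_correct_3pl(theta=-10.0, a=a, b=b, c=c)  # approaches c
p_high_ability = p_correct_3pl(theta=10.0, a=a, b=b, c=c)  # approaches 1
```

    When ability equals the item difficulty, the probability sits halfway between the guessing floor and 1. The floor is what lets the model account for guessing on multiple-choice questions.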

  12. Inconsistent Student Responses in TIMSS Questionnaire Items on Mathematics Lessons

    Directory of Open Access Journals (Sweden)

    Selda Yıldırım

    2009-12-01

    This study investigated consistency among Turkish students' responses to TIMSS 2007 questionnaire items on the frequency of certain activities in mathematics classrooms. In Turkey, 4476 students from 143 schools participated in the study. Analyses revealed inconsistencies in student responses, as indicated by a high proportion of within-class variance components. That is, students in the same class reported differing frequencies for certain classroom activities, showing that some factors had an effect on individuals' perceptions. Further analyses showed that students at different levels of mathematics achievement reported differently on the frequency of classroom activities, and that precise items were answered more consistently than items containing vague terms. Using factor scores instead of individual item responses contributed to the consistency of responses within classes, but only to a small extent. Based on the findings, this study also provided implications for questionnaire design.

  13. An item factor analysis and item response theory-based revision of the Everyday Discrimination Scale.

    Science.gov (United States)

    Stucky, Brian D; Gottfredson, Nisha C; Panter, A T; Daye, Charles E; Allen, Walter R; Wightman, Linda F

    2011-04-01

    The Everyday Discrimination Scale (EDS), a widely used measure of daily perceived discrimination, is purported to be unidimensional, to function well among African Americans, and to have adequate construct validity. Two separate studies and data sources were used to examine and cross-validate the psychometric properties of the EDS. In Study 1, an exploratory factor analysis was conducted on a sample of African American law students (N = 589), providing strong evidence of local dependence, or nuisance multidimensionality within the EDS. In Study 2, a separate nationally representative community sample (N = 3,527) was used to model the identified local dependence in an item factor analysis (i.e., bifactor model). Next, item response theory (IRT) calibrations were conducted to obtain item parameters. A five-item, revised-EDS was then tested for gender differential item functioning (in an IRT framework). Based on these analyses, a summed score to IRT-scaled score translation table is provided for the revised-EDS. Our results indicate that the revised-EDS is unidimensional, with minimal differential item functioning, and retains predictive validity consistent with the original scale.

  14. Techniques Applied to the Authentication of Gold Jewellery Items

    International Nuclear Information System (INIS)

    Malleable and ductile, and found free in nature in alluvial deposits as nuggets and pellets, gold could easily be hammered and rolled to form simple objects such as plaques and beads since the 5th millennium BC. The evolution of the goldsmith's skill brought many changes to goldsmithing. Fabricated alloys of the required quality and type replaced the natural alloys of gold, where silver and copper are present at rather variable concentrations. A gold alloy could therefore be produced with the colour and the mechanical and physical properties necessary to the final purpose of the item. (author)

  15. An NCME Instructional Module on Item-Fit Statistics for Item Response Theory Models

    Science.gov (United States)

    Ames, Allison J.; Penfield, Randall D.

    2015-01-01

    Drawing valid inferences from item response theory (IRT) models is contingent upon a good fit of the data to the model. Violations of model-data fit have numerous consequences, limiting the usefulness and applicability of the model. This instructional module provides an overview of methods used for evaluating the fit of IRT models. Upon completing…

  16. Fighting bias with statistics: Detecting gender differences in responses to items on a preschool science assessment

    Science.gov (United States)

    Greenberg, Ariela Caren

    Differential item functioning (DIF) and differential distractor functioning (DDF) are methods used to screen for item bias (Camilli & Shepard, 1994; Penfield, 2008). Using an applied empirical example, this mixed-methods study examined the congruency and relationship of DIF and DDF methods in screening multiple-choice items. Data for Study I were drawn from item responses of 271 female and 236 male low-income children on a preschool science assessment. Item analyses employed a common statistical approach, the Mantel-Haenszel log-odds ratio (MH-LOR), to detect DIF in dichotomously scored items (Holland & Thayer, 1988), and extended the approach to identify DDF (Penfield, 2008). Findings demonstrated that using the MH-LOR to detect DIF and DDF supported the theoretical relationship that the magnitude and form of DIF are dependent on the DDF effects, and demonstrated the advantages of studying DIF and DDF in multiple-choice items. A total of 4 items with DIF and DDF and 5 items with only DDF were detected. Study II incorporated an item content review, an important but often overlooked and under-published step of DIF and DDF studies (Camilli & Shepard). Interviews with 25 female and 22 male low-income preschool children and an expert review helped to interpret the DIF and DDF results and their comparison, and determined that a content review process of studied items can reveal reasons for potential item bias that are often congruent with the statistical results. Patterns emerged and are discussed in detail. The quantitative and qualitative analyses were conducted in an applied framework of examining the validity of the preschool science assessment scores for evaluating science programs serving low-income children; however, the techniques can be generalized for use with measures across various disciplines of research.
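    As a rough illustration of the Mantel-Haenszel log-odds ratio used for DIF screening, the sketch below pools 2x2 tables across matched score levels. The function name, the tuple layout, and the choice of which group counts as reference or focal are assumptions made for this example, not details taken from the study.

```python
import math

def mantel_haenszel_log_odds(strata):
    """Mantel-Haenszel common log-odds ratio for one dichotomous item.

    strata: iterable of (a, b, c, d) counts, one tuple per matched
    total-score level, where
      a = reference group correct, b = reference group incorrect,
      c = focal group correct,     d = focal group incorrect.
    """
    num = 0.0
    den = 0.0
    for a, b, c, d in strata:
        n = a + b + c + d
        if n == 0:
            continue  # an empty score level carries no information
        num += a * d / n
        den += b * c / n
    if num == 0 or den == 0:
        raise ValueError("common odds ratio is undefined for these tables")
    return math.log(num / den)
```

    A value near zero suggests no DIF after matching on total score; under this layout a positive value means the item favours the reference group. The DDF extension would apply the same statistic to each distractor rather than to the correct option.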

  17. Item response analysis of the Positive and Negative Syndrome Scale

    Directory of Open Access Journals (Sweden)

    Lindenmayer Jean-Pierre

    2007-11-01

    Background: Statistical models based on item response theory were used to examine (a) the performance of individual Positive and Negative Syndrome Scale (PANSS) items and their options, (b) the effectiveness of various subscales to discriminate among individual differences in symptom severity, and (c) the appropriateness of cutoff scores recently recommended by Andreasen and her colleagues (2005) to establish symptom remission. Methods: Option characteristic curves were estimated using a nonparametric item response model to examine the probability of endorsing each of 7 options within each of 30 PANSS items as a function of standardized, overall symptom severity. Our data were baseline PANSS scores from 9205 patients with schizophrenia or schizoaffective disorder who were enrolled between 1995 and 2003 in either a large, naturalistic, observational study or else in 1 of 12 randomized, double-blind, clinical trials comparing olanzapine to other antipsychotic drugs. Results: Our analyses show that the majority of items forming the Positive and Negative subscales of the PANSS perform very well. We also identified key areas for improvement or revision in items and options within the General Psychopathology subscale. The Positive and Negative subscale scores are not only more discriminating of individual differences in symptom severity than the General Psychopathology subscale score, but are also more efficient on average than the 30-item total score. Of the 8 items recently recommended to establish symptom remission, 1 performed markedly differently from the 7 others and should either be deleted or rescored, requiring that patients achieve a lower score of 2 (rather than 3) to signal remission. Conclusion: This first item response analysis of the PANSS supports its sound psychometric properties; most PANSS items were either very good or good at assessing overall severity of illness. These analyses did identify some items which might be further improved...

  18. MODERATING ABILITY OF ITEM RESPONSE THEORY THROUGH PRIOR GUESSING PARAMETER

    Directory of Open Access Journals (Sweden)

    Siow Hoo Leong

    2013-01-01

    A psycho-technology approach to discouraging guessing in multiple-choice items is to reduce the a priori guessing probability of an item. This study proposes a psychometric framework based on Item Response Theory (IRT) to model the effect of having various a priori guessing probabilities across different items. A prior guessing parameter is proposed to serve as a moderator of the ability parameter in the two-parameter logistic IRT model. The results show that the proposed prior guessing parameter successfully moderates the ability parameters of subjects with different degrees of guessing. However, the prior guessing parameter is insensitive when the performance pattern is mixed within a testlet but similar across testlets with different a priori guessing probabilities.

  19. The 12-item World Health Organization Disability Assessment Schedule II (WHO-DAS II): a nonparametric item response analysis

    Directory of Open Access Journals (Sweden)

    Fernandez Ana

    2010-05-01

    Background: Previous studies have analyzed the psychometric properties of the World Health Organization Disability Assessment Schedule II (WHO-DAS II) using classical omnibus measures of scale quality. These analyses are sample dependent and do not model item responses as a function of the underlying trait level. The main objective of this study was to examine the effectiveness of the WHO-DAS II items and their options in discriminating between changes in the underlying disability level by means of item response analyses. We also explored differential item functioning (DIF) in men and women. Methods: The participants were 3615 adult general practice patients from 17 regions of Spain, with a first diagnosed major depressive episode. The 12-item WHO-DAS II was administered by the general practitioners during the consultation. We used a non-parametric item response method (kernel smoothing) implemented with the TestGraf software to examine the effectiveness of each item (item characteristic curves) and their options (option characteristic curves) in discriminating between changes in the underlying disability level. We examined composite DIF to determine whether women had a higher probability than men of endorsing each item. Results: Item response analyses indicated that the twelve items forming the WHO-DAS II perform very well. All items were determined to provide good discrimination across varying standardized levels of the trait. The items also had option characteristic curves that showed good discrimination, given that each increasing option became more likely than the previous as a function of increasing trait level. No gender-related DIF was found on any of the items. Conclusions: All WHO-DAS II items were very good at assessing overall disability. Our results supported the appropriateness of the weights assigned to response option categories and showed an absence of gender differences in item functioning.
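    The kernel-smoothing idea behind an option characteristic curve can be sketched as a Nadaraya-Watson regression of an option indicator on a trait proxy. This is a minimal sketch, not TestGraf's implementation: the Gaussian kernel, the default bandwidth, and the use of a precomputed standardized trait proxy are assumptions, and all names are hypothetical.

```python
import math

def option_characteristic_curve(trait, chose_option, grid, bandwidth=0.3):
    """Kernel-smoothed estimate of P(option chosen | trait level).

    trait: one trait proxy per respondent (for example a standardized
           total score; how the proxy is built is left to the caller).
    chose_option: 1 if that respondent chose the option, else 0.
    grid: trait levels at which the curve is evaluated.
    """
    curve = []
    for t in grid:
        # Gaussian kernel weight of each respondent relative to level t.
        weights = [math.exp(-0.5 * ((x - t) / bandwidth) ** 2) for x in trait]
        total = sum(weights)
        if total == 0.0:
            curve.append(float("nan"))  # no respondents near this level
            continue
        curve.append(sum(w * y for w, y in zip(weights, chose_option)) / total)
    return curve
```

    Estimating one such curve per response option, on a shared grid, gives the family of curves whose ordering the abstract describes: a well-behaved item shows each higher option becoming more likely as the trait level increases.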

  20. Developing a Critical Thinking Test Utilising Item Response Theory (Pengembangan Tes Berpikir Kritis dengan Pendekatan Item Response Theory)

    Directory of Open Access Journals (Sweden)

    Fajrianthi Fajrianthi

    2016-06-01

    This study aimed to develop a valid and reliable instrument for assessing critical thinking that can be used in both educational and work settings in Indonesia. The research followed the test development stages of Hambleton and Jones (1993). The test blueprint and item writing were based on the concepts of the Watson-Glaser Critical Thinking Appraisal (WGCTA). In the WGCTA, critical thinking consists of five dimensions: Inference, Recognition of Assumptions, Deduction, Interpretation, and Evaluation of Arguments. The test was tried out on 1,453 participants in employee selection tests in Surabaya, Gresik, Tuban, Bojonegoro, and Rembang. The dichotomous data were analyzed using a two-parameter IRT model, the parameters being item discrimination and item difficulty. The analysis was carried out with the statistical program Mplus version 6.11. Before the IRT analysis, the assumptions were tested: unidimensionality, local independence, and the Item Characteristic Curve (ICC). Analysis of the 68 items yielded 15 items with fairly good discrimination and item difficulty ranging from -4 to 2.448. The small number of good-quality items was attributed to weaknesses in selecting subject matter experts in the field of critical thinking and in the choice of scoring method. Keywords: test development, critical thinking, item response theory

  1. An NCME Instructional Module on Latent DIF Analysis Using Mixture Item Response Models

    Science.gov (United States)

    Cho, Sun-Joo; Suh, Youngsuk; Lee, Woo-yeol

    2016-01-01

    The purpose of this ITEMS module is to provide an introduction to differential item functioning (DIF) analysis using mixture item response models. The mixture item response models for DIF analysis involve comparing item profiles across latent groups, instead of manifest groups. First, an overview of DIF analysis based on latent groups, called…

  2. Morphological Contributions to Adolescent Word Reading: An Item Response Approach

    Science.gov (United States)

    Goodwin, Amanda P.; Gilbert, Jennifer K.; Cho, Sun-Joo

    2013-01-01

    The current study uses a crossed random-effects item response model to simultaneously examine both reader and word characteristics and interactions between them that predict the reading of 39 morphologically complex words for 221 middle school students. Results suggest that a reader's ability to read a root word (e.g., "isolate") predicts that…

  3. Item Response Theory in the context of Improving Student Reasoning

    Science.gov (United States)

    Goddard, Chase; Davis, Jeremy; Pyper, Brian

    2011-10-01

    We are interested to see if Item Response Theory can help to better inform the development of reasoning ability in introductory physics. A first pass through our latest batch of data from the Heat and Temperature Conceptual Evaluation, the Lawson Classroom Test of Scientific Reasoning, and the Epistemological Beliefs About Physics Survey may help in this effort.

  4. Students' Proficiency Scores within Multitrait Item Response Theory

    Science.gov (United States)

    Scott, Terry F.; Schumayer, Daniel

    2015-01-01

    In this paper we present a series of item response models of data collected using the Force Concept Inventory. The Force Concept Inventory (FCI) was designed to poll the Newtonian conception of force viewed as a multidimensional concept, that is, as a complex of distinguishable conceptual dimensions. Several previous studies have developed…

  5. A Speeded Item Response Model with Gradual Process Change

    Science.gov (United States)

    Goegebeur, Yuri; De Boeck, Paul; Wollack, James A.; Cohen, Allan S.

    2008-01-01

    An item response theory model for dealing with test speededness is proposed. The model consists of two random processes, a problem solving process and a random guessing process, with the random guessing gradually taking over from the problem solving process. The involved change point and change rate are considered random parameters in order to…

  6. Bad Questions: An Essay Involving Item Response Theory

    Science.gov (United States)

    Thissen, David

    2016-01-01

    David Thissen, a professor in the Department of Psychology and Neuroscience, Quantitative Program at the University of North Carolina, has consulted and served on technical advisory committees for assessment programs that use item response theory (IRT) over the past couple decades. He has come to the conclusion that there are usually two purposes…

  7. Functionally unidimensional item response models for multivariate binary data

    DEFF Research Database (Denmark)

    Ip, Edward; Molenberghs, Geert; Chen, Shyh-Huei;

    2013-01-01

    The problem of fitting unidimensional item response models to potentially multidimensional data has been extensively studied. The focus of this article is on response data that have a strong dimension but also contain minor nuisance dimensions. Fitting a unidimensional model to such multidimensio... 2 issues: (a) can a proposed nonlinear projection track the functional dimension well, and (b) what are the biases in the ability estimate and the associated standard error when estimating the functional dimension? To investigate the second issue, the nonlinear projection is used as an evaluative...

  8. Item response modeling: an evaluation of the children's fruit and vegetable self-efficacy questionnaire

    Science.gov (United States)

    Perceived self-efficacy (SE) for eating fruit and vegetables (FV) is a key variable mediating FV change in interventions. This study applies item response modeling (IRM) to a fruit, juice and vegetable self-efficacy questionnaire (FVSEQ) previously validated with classical test theory (CTT) procedur...

  9. An Item Response Theory Analysis of the Community of Inquiry Scale

    Science.gov (United States)

    Horzum, Mehmet Baris; Uyanik, Gülden Kaya

    2015-01-01

    The aim of this study is to examine the validity and reliability of the Community of Inquiry Scale, commonly used in online learning, by means of Item Response Theory. For this purpose, Community of Inquiry Scale version 14 was applied to 1,499 students of a distance education center's online learning programs at a Turkish state university via the internet.…

  10. Using Response Time to Detect Item Preknowledge in Computer-Based Licensure Examinations

    Science.gov (United States)

    Qian, Hong; Staniewska, Dorota; Reckase, Mark; Woo, Ada

    2016-01-01

    This article addresses the issue of how to detect item preknowledge using item response time data in two computer-based large-scale licensure examinations. Item preknowledge is indicated by an unexpectedly short response time and a correct response. Two samples were used for detecting item preknowledge for each examination. The first sample was from…

  11. Dimensionality Assessment Using the Full-Information Item Bifactor Analysis for Graded Response Data: An Illustration with the State Metacognitive Inventory

    Science.gov (United States)

    Immekus, Jason C.; Imbrie, P. K.

    2008-01-01

    Dimensionality assessment using the full-information item bifactor model for graded response data is provided. The model applies to data in which each item relates to a general factor and one group factor. Specifically, alternative model specification within item response theory (IRT) is shown to test a scale's factor structure. For illustrative…

  12. Empirical Differences in Omission Tendency and Reading Ability in PISA: An Application of Tree-Based Item Response Models

    Science.gov (United States)

    Okumura, Taichi

    2014-01-01

    This study examined the empirical differences between the tendency to omit items and reading ability by applying tree-based item response (IRTree) models to the Japanese data of the Programme for International Student Assessment (PISA) held in 2009. For this purpose, existing IRTree models were expanded to contain predictors and to handle…

  13. Marginal Maximum Likelihood Estimation of Item Response Models in R

    Directory of Open Access Journals (Sweden)

    Matthew S. Johnson

    2007-02-01

    Item response theory (IRT) models are a class of statistical models used by researchers to describe the response behaviors of individuals to a set of categorically scored items. The most common IRT models can be classified as generalized linear fixed- and/or mixed-effect models. Although IRT models appear most often in the psychological testing literature, researchers in other fields have successfully utilized IRT-like models in a wide variety of applications. This paper discusses the three major methods of estimation in IRT and develops R functions utilizing the built-in capabilities of the R environment to find the marginal maximum likelihood estimates of the generalized partial credit model. The currently available R package ltm is also discussed.
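    The quantity that marginal maximum likelihood maximizes can be sketched as follows. This is an illustration only, and it swaps in a simpler setting than the paper's: a dichotomous two-parameter logistic model rather than the generalized partial credit model, Python rather than R, and a fixed equally spaced grid rather than any particular quadrature rule. The function and argument names are hypothetical.

```python
import math

def marginal_log_likelihood(responses, a, b, n_points=41, lo=-4.0, hi=4.0):
    """Log marginal likelihood of 0/1 response patterns under a 2PL model.

    Ability is integrated out against a standard normal prior that is
    approximated on a fixed grid of n_points nodes between lo and hi.
    responses: list of patterns, each a list of 0/1 item scores.
    a, b: item discriminations and difficulties.
    """
    step = (hi - lo) / (n_points - 1)
    nodes = [lo + k * step for k in range(n_points)]
    dens = [math.exp(-0.5 * q * q) for q in nodes]
    total = sum(dens)
    weights = [d / total for d in dens]  # prior weights summing to 1
    loglik = 0.0
    for pattern in responses:
        marginal = 0.0
        for q, w in zip(nodes, weights):
            like = 1.0
            for u, aj, bj in zip(pattern, a, b):
                p = 1.0 / (1.0 + math.exp(-aj * (q - bj)))
                like *= p if u == 1 else 1.0 - p
            marginal += w * like  # likelihood averaged over ability
        loglik += math.log(marginal)
    return loglik
```

    An estimation routine would pass this function (negated) to a numerical optimizer over `a` and `b`, or use an EM scheme; the item parameters are estimated without estimating each respondent's ability.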

  14. Item Response Theory with Covariates (IRT-C): Assessing Item Recovery and Differential Item Functioning for the Three-Parameter Logistic Model

    Science.gov (United States)

    Tay, Louis; Huang, Qiming; Vermunt, Jeroen K.

    2016-01-01

    In large-scale testing, the use of multigroup approaches is limited for assessing differential item functioning (DIF) across multiple variables as DIF is examined for each variable separately. In contrast, the item response theory with covariate (IRT-C) procedure can be used to examine DIF across multiple variables (covariates) simultaneously. To…

  15. Network Security Risk Assessment Based on Item Response Theory

    OpenAIRE

    Fangwei Li; Qing Huang; Jiang Zhu; Zhuxun Peng

    2015-01-01

    Because the traditional risk assessment method is one-sided and struggles to reflect the real network situation, a risk assessment method based on Item Response Theory (IRT) is put forward for network security. First, novel algorithms for calculating the threat of an attack and the probability of a successful attack are proposed by combining the IRT model with the Service Security Level. Second, the service weight of importance is calculated by the three-demarcation analytic hi...

  16. Evaluating social influence relations: an item-response-modeling approach

    OpenAIRE

    Schwenk, Gero

    2009-01-01

    The subject of this paper is the measurement of social influence in social networks. The theoretical point of departure is twofold. First, the focus is on cognitive processing of perceived influence. Second, three distinct dimensions of social influence are considered: persuasion, authority and coercion. Combining these considerations with Item Response Theory methods, questionnaire-type measurement instruments are proposed. These instruments are employed in a closed network case study where applicab...

  17. Item Response Theory Analysis and Differential Item Functioning across Age, Gender and Country of a Short Form of the Advanced Progressive Matrices

    Science.gov (United States)

    Chiesi, Francesca; Ciancaleoni, Matteo; Galli, Silvia; Morsanyi, Kinga; Primi, Caterina

    2012-01-01

    Item Response Theory (IRT) models were applied to investigate the psychometric properties of the Arthur and Day's Advanced Progressive Matrices-Short Form (APM-SF; 1994) [Arthur and Day (1994). "Development of a short form for the Raven Advanced Progressive Matrices test." "Educational and Psychological Measurement, 54," 395-403] in order to test…

  18. An investigation of emotional intelligence measures using item response theory.

    Science.gov (United States)

    Cho, Seonghee; Drasgow, Fritz; Cao, Mengyang

    2015-12-01

    This study investigated the psychometric properties of 3 frequently administered emotional intelligence (EI) scales (Wong and Law Emotional Intelligence Scale [WLEIS], Schutte Self-Report Emotional Intelligence Test [SEIT], and Trait Emotional Intelligence Questionnaire [TEIQue]), which were developed on the basis of different theoretical frameworks (i.e., ability EI and mixed EI). By conducting item response theory (IRT) analyses, the authors examined the item parameters and compared the fits of 2 response process models (i.e., dominance model and ideal point model) for these scales with data from a sample of 355 undergraduates recruited from the subject pool. Several important findings were obtained. First, the EI scales seem better able to differentiate individuals at low trait levels than high trait levels. Second, a dominance model showed better model fit to the self-report ability EI scale (WLEIS) and also fit better with most subfactors of the SEIT, except for the mood regulation/optimism factor. Both dominance and ideal point models fit a self-report mixed EI scale (TEIQue). Our findings suggest (a) the EI scales should be revised to include more items at moderate and higher trait levels; and (b) the nature of the EI construct should be considered during the process of scale development. PMID:25961137

  19. The challenges of fitting an item response theory model to the Social Anhedonia Scale.

    Science.gov (United States)

    Reise, Steven P; Horan, William P; Blanchard, Jack J

    2011-05-01

    This study explored the application of latent variable measurement models to the Social Anhedonia Scale (SAS; Eckblad, Chapman, Chapman, & Mishlove, 1982), a widely used and influential measure in schizophrenia-related research. Specifically, we applied unidimensional and bifactor item response theory (IRT) models to data from a community sample of young adults (n = 2,227). Ordinal factor analyses revealed that identifying a coherent latent structure in the 40-item SAS data was challenging due to (a) the presence of multiple small content clusters (e.g., doublets); (b) modest relations between those clusters, which, in turn, implies a general factor of only modest strength; (c) items that shared little variance with the majority of items; and (d) cross-loadings in bifactor solutions. Consequently, we conclude that SAS responses cannot be modeled accurately by either unidimensional or bifactor IRT models. Although the application of a bifactor model to a reduced 17-item set met with better success, significant psychometric and substantive problems remained. Results highlight the challenges of applying latent variable models to scales that were not originally designed to fit these models.

  20. Bookmark locations and item response model selection in the presence of local item dependence.

    Science.gov (United States)

    Skaggs, Gary

    2007-01-01

    The bookmark standard setting procedure is a popular method for setting performance standards on state assessment programs. This study reanalyzed data from an application of the bookmark procedure to a passage-based test that used the Rasch model to create the item ordered booklet. Several problems were noted in this implementation of the bookmark procedure, including disagreement among the SMEs about the correct order of items in the bookmark booklet, performance level descriptions of the passing standard being based on passage difficulty as well as item difficulty, and the presence of local item dependence within reading passages. Bookmark item locations were recalculated for the IRT three-parameter model and the multidimensional bifactor model. The results showed that the order of item locations was very similar for all three models when items of high difficulty and low discrimination were excluded. However, the items whose positions were the most discrepant between models were not the items that the SMEs disagreed about the most in the original standard setting. The choice of latent trait model did not address problems of item order disagreement. Implications for the use of the bookmark method in the presence of local item dependence are discussed.

  2. Adult Attachment Ratings (AAR): an item response theory analysis.

    Science.gov (United States)

    Pilkonis, Paul A; Kim, Yookyung; Yu, Lan; Morse, Jennifer Q

    2014-01-01

    The Adult Attachment Ratings (AAR) include 3 scales for anxious, ambivalent attachment (excessive dependency, interpersonal ambivalence, and compulsive care-giving), 3 for avoidant attachment (rigid self-control, defensive separation, and emotional detachment), and 1 for secure attachment. The scales include items (ranging from 6-16 in their original form) scored by raters using a 3-point format (0 = absent, 1 = present, and 2 = strongly present) and summed to produce a total score. Item response theory (IRT) analyses were conducted with data from 414 participants recruited from psychiatric outpatient, medical, and community settings to identify the most informative items from each scale. The IRT results allowed us to shorten the scales to 5-item versions that are more precise and easier to rate because of their brevity. In general, the effective range of measurement for the scales was 0 to +2 SDs for each of the attachment constructs; that is, from average to high levels of attachment problems. Evidence for convergent and discriminant validity of the scales was investigated by comparing them with the Experiences of Close Relationships-Revised (ECR-R) scale and the Kobak Attachment Q-sort. The best consensus among self-reports on the ECR-R, informant ratings on the ECR-R, and expert judgments on the Q-sort and the AAR emerged for anxious, ambivalent attachment. Given the good psychometric characteristics of the scale for secure attachment, however, this measure alone might provide a simple alternative to more elaborate procedures for some measurement purposes. Conversion tables are provided for the 7 scales to facilitate transformation from raw scores to IRT-calibrated (theta) scores. PMID:24033268

  3. Students' proficiency scores within multitrait item response theory

    Science.gov (United States)

    Scott, Terry F.; Schumayer, Daniel

    2015-12-01

    In this paper we present a series of item response models of data collected using the Force Concept Inventory. The Force Concept Inventory (FCI) was designed to poll the Newtonian conception of force viewed as a multidimensional concept, that is, as a complex of distinguishable conceptual dimensions. Several previous studies have developed single-trait item response models of FCI data; however, we feel that multidimensional models are also appropriate given the explicitly multidimensional design of the inventory. The models employed in the research reported here vary in both the number of fitting parameters and the number of underlying latent traits assumed. We calculate several model information statistics to ensure adequate model fit and to determine which of the models provides the optimal balance of information and parsimony. Our analysis indicates that all item response models tested, from the single-trait Rasch model through to a model with ten latent traits, satisfy the standard requirements of fit. However, analysis of model information criteria indicates that the five-trait model is optimal. We note that an earlier factor analysis of the same FCI data also led to a five-factor model. Furthermore the factors in our previous study and the traits identified in the current work match each other well. The optimal five-trait model assigns proficiency scores to all respondents for each of the five traits. We construct a correlation matrix between the proficiencies in each of these traits. This correlation matrix shows strong correlations between some proficiencies, and strong anticorrelations between others. We present an interpretation of this correlation matrix.
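    Comparing models by an information criterion, as described above, trades goodness of fit against the number of parameters. The abstract does not say which criteria were computed, so the sketch below assumes the two most common ones, AIC and BIC; the function names and the `fits` layout are hypothetical.

```python
import math

def aic(log_likelihood, n_params):
    """Akaike information criterion (lower is better)."""
    return 2 * n_params - 2 * log_likelihood

def bic(log_likelihood, n_params, n_respondents):
    """Bayesian information criterion (lower is better)."""
    return n_params * math.log(n_respondents) - 2 * log_likelihood

def best_model(fits, n_respondents):
    """Name of the model with the lowest BIC.

    fits: dict mapping a model name to (log_likelihood, n_params).
    """
    return min(fits, key=lambda name: bic(fits[name][0], fits[name][1], n_respondents))
```

    A model with more latent traits always fits at least as well in log-likelihood terms, so the penalty term is what allows an intermediate model, such as the five-trait model reported here, to come out ahead of both simpler and richer alternatives.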

  4. Using the Nominal Response Model to Evaluate Response Category Discrimination in the PROMIS Emotional Distress Item Pools

    Science.gov (United States)

    Preston, Kathleen; Reise, Steven; Cai, Li; Hays, Ron D.

    2011-01-01

    The authors used a nominal response item response theory model to estimate category boundary discrimination (CBD) parameters for items drawn from the Emotional Distress item pools (Depression, Anxiety, and Anger) developed in the Patient-Reported Outcomes Measurement Information Systems (PROMIS) project. For polytomous items with ordered response…

  5. Modeling Local Item Dependence in Cloze and Reading Comprehension Test Items Using Testlet Response Theory

    Science.gov (United States)

    Baghaei, Purya; Ravand, Hamdollah

    2016-01-01

    In this study the magnitudes of local dependence generated by cloze test items and reading comprehension items were compared and their impact on parameter estimates and test precision was investigated. An advanced English as a foreign language reading comprehension test containing three reading passages and a cloze test was analyzed with a…

  6. [Unfolding item response model using best-worst scaling].

    Science.gov (United States)

    Ikehara, Kazuya

    2015-02-01

    In attitude measurement and sensory tests, the unfolding model is typically used. In this model, response probability is formulated by the distance between the person and the stimulus. In this study, we proposed an unfolding item response model using best-worst scaling (BWU model), in which a person chooses the best and worst stimulus among repeatedly presented subsets of stimuli. We also formulated an unfolding model using best scaling (BU model), and compared the accuracy of estimates between the BU and BWU models. A simulation experiment showed that the BWU model performed much better than the BU model in terms of bias and root mean square errors of estimates. With reference to Usami (2011), the proposed models were applied to actual data to measure attitudes toward tardiness. Results indicated high similarity between stimuli estimates generated with the proposed models and those of Usami (2011).

  7. Analysis of Multiple Partially Ordered Responses to Belief Items with Don't Know Option.

    Science.gov (United States)

    Ip, Edward H; Chen, Shyh-Huei; Quandt, Sara A

    2016-06-01

    Understanding beliefs, values, and preferences of patients is a tenet of contemporary health sciences. This application was motivated by the analysis of multiple partially ordered set (poset) responses from an inventory on layman beliefs about diabetes. The partially ordered set arises because of two features in the data: first, the response options contain a Don't Know (DK) option, and second, there were two consecutive occasions of measurement. As predicted by the common sense model of illness, beliefs about diabetes were not necessarily stable across the two measurement occasions. Instead of analyzing the two occasions separately, we studied the joint responses across the occasions as a poset response. Few analytic methods exist for data structures other than ordered or nominal categories. Poset responses are routinely collapsed and then analyzed as either rank ordered or nominal data, leading to the loss of nuanced information that might be present within poset categories. In this paper we developed a general class of item response models for analyzing the poset data collected from the Common Sense Model of Diabetes Inventory. The inferential object of interest is the latent trait that indicates congruence of belief with the biomedical model. To apply an item response model to the poset diabetes inventory, we proved that a simple coding algorithm circumvents the requirement of writing new codes such that standard IRT software could be directly used for the purpose of item estimation and individual scoring. Simulation experiments were used to examine parameter recovery for the proposed poset model. PMID:25479822

  9. An Item Response Theory Analysis of the Community of Inquiry Scale

    Directory of Open Access Journals (Sweden)

    Mehmet Barış Horzum

    2015-04-01

    The aim of this study is to examine the validity and reliability of the Community of Inquiry Scale, commonly used in online learning, by means of Item Response Theory. For this purpose, Community of Inquiry Scale version 14 was administered via the internet to 1,499 students in the online learning programs of a distance education center at a Turkish state university. The collected data were analyzed with a statistical software package in three respects: checking model assumptions, checking model-data fit, and item analysis. Item and test features of the scale were examined by means of the Graded Response Model (GRM). To use this IRT model, the assumptions were first tested on the data gathered from the 1,499 participants, and then model-data fit was examined. Following affirmative results from these checks, all data were analyzed using the GRM. As a result of the study, the Community of Inquiry Scale adapted to Turkish by Horzum (in press) is found to be reliable and valid by means of Classical Test Theory and Item Response Theory.

  10. Influence of Item Direction on Student Responses in Attitude Assessment.

    Science.gov (United States)

    Campbell, Noma Jo; Grissom, Stephen

    To investigate the effects of wording in attitude test items, a five-point Likert-type rating scale was administered to 173 undergraduate education majors. The test measured attitudes toward college and self, and contained 38 positively-worded items. Thirty-eight negatively-worded items were also written to parallel the positive statements.…

  11. Limits on Log Odds Ratios for Unidimensional Item Response Theory Models

    Science.gov (United States)

    Haberman, Shelby J.; Holland, Paul W.; Sinharay, Sandip

    2007-01-01

    Bounds are established for log odds ratios (log cross-product ratios) involving pairs of items for item response models. First, expressions for bounds on log odds ratios are provided for one-dimensional item response models in general. Then, explicit bounds are obtained for the Rasch model and the two-parameter logistic (2PL) model. Results are…

  12. Reevaluation of the Amsterdam Inventory for Auditory Disability and Handicap Using Item Response Theory

    Science.gov (United States)

    Hospers, J. Mirjam Boeschen; Smits, Niels; Smits, Cas; Stam, Mariska; Terwee, Caroline B.; Kramer, Sophia E.

    2016-01-01

    Purpose: We reevaluated the psychometric properties of the Amsterdam Inventory for Auditory Disability and Handicap (AIADH; Kramer, Kapteyn, Festen, & Tobi, 1995) using item response theory. Item response theory describes item functioning along an ability continuum. Method: Cross-sectional data from 2,352 adults with and without hearing…

  13. Self efficacy for fruit, vegetable and water intakes: Expanded and abbreviated scales from item response modeling analyses

    Directory of Open Access Journals (Sweden)

    Cullen Karen W

    2010-03-01

    Objective: To improve an existing measure of fruit and vegetable intake self efficacy by including items that varied on levels of difficulty, and testing a corresponding measure of water intake self efficacy. Design: Cross sectional assessment. Items were modified to have easy, moderate and difficult levels of self efficacy. Classical test theory and item response modeling were applied. Setting: One middle school at each of seven participating sites (Houston TX, Irvine CA, Philadelphia PA, Pittsburgh PA, Portland OR, rural NC, and San Antonio TX). Subjects: 714 6th grade students. Results: Adding items to reflect level (low, medium, high) of self efficacy for fruit and vegetable intake achieved scale reliability and validity comparable to existing scales, but the distribution of items across the latent variable did not improve. Selecting items from among clusters of items at similar levels of difficulty along the latent variable resulted in an abbreviated scale with psychometric characteristics comparable to the full scale, except for reliability. Conclusions: The abbreviated scale can reduce participant burden. Additional research is necessary to generate items that better distribute across the latent variable. Additional items may need to tap confidence in overcoming more diverse barriers to dietary intake.

  14. Item Response Modeling of Presence-Severity Items: Application to Measurement of Patient-Reported Outcomes

    Science.gov (United States)

    Liu, Ying; Verkuilen, Jay

    2013-01-01

    The Presence-Severity (P-S) format refers to a compound item structure in which a question is first asked to check the presence of the particular event in question. If the respondent provides an affirmative answer, a follow-up is administered, often about the frequency, density, severity, or impact of the event. Despite the popularity of the P-S…

  15. Statistical Tests of Conditional Independence between Responses and/or Response Times on Test Items

    Science.gov (United States)

    van der Linden, Wim J.; Glas, Cees A. W.

    2010-01-01

    Three plausible assumptions of conditional independence in a hierarchical model for responses and response times on test items are identified. For each of the assumptions, a Lagrange multiplier test of the null hypothesis of conditional independence against a parametric alternative is derived. The tests have closed-form statistics that are easy to…

  16. Analyzing Multiple-Choice Questions by Model Analysis and Item Response Curves

    Science.gov (United States)

    Wattanakasiwich, P.; Ananta, S.

    2010-07-01

    In physics education research, the main goal is to improve physics teaching so that most students understand physics conceptually and are able to apply concepts in solving problems. Therefore many multiple-choice instruments were developed to probe students' conceptual understanding in various topics. Two techniques, model analysis and item response curves, were used to analyze students' responses to the Force and Motion Conceptual Evaluation (FMCE). For this study, FMCE data from more than 1000 students at Chiang Mai University were collected over the past three years. With model analysis, we can obtain students' alternative knowledge and the probabilities for students to use such knowledge in a range of equivalent contexts. The model analysis consists of two algorithms: concentration factor and model estimation. This paper only presents results from using the model estimation algorithm to obtain a model plot. The plot helps to identify whether a class model state is in the misconception region or not. An item response curve (IRC), derived from item response theory, is a plot of the percentage of students selecting a particular choice versus their total score. Pros and cons of both techniques are compared and discussed.
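
    The item response curve described in this abstract can be sketched in a few lines. The following is a minimal illustration, not the authors' procedure: the function name `item_response_curve` and the six toy responses are hypothetical, and the sketch only tabulates, for one item, the percentage of students at each total score who picked each choice.

```python
from collections import defaultdict

def item_response_curve(responses, total_scores, choices):
    """For one item, return {choice: {total_score: percent selecting it}}.

    responses    -- the choice each student selected on this item
    total_scores -- each student's total test score (same order)
    choices      -- the answer options to tabulate
    """
    counts = defaultdict(lambda: defaultdict(int))  # score -> choice -> n
    totals = defaultdict(int)                       # score -> n students
    for choice, score in zip(responses, total_scores):
        counts[score][choice] += 1
        totals[score] += 1
    curve = {c: {} for c in choices}
    for score in sorted(totals):
        for c in choices:
            curve[c][score] = 100.0 * counts[score][c] / totals[score]
    return curve

# Hypothetical toy data: six students, one item with options A, B, C.
responses = ["A", "B", "A", "C", "B", "B"]
total_scores = [1, 1, 2, 2, 3, 3]
irc = item_response_curve(responses, total_scores, choices="ABC")
for choice, points in irc.items():
    print(choice, points)
```

    Plotting each choice's percentages against total score gives one curve per option; a distractor whose curve stays high at high total scores would be worth a closer look.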

  17. A Pearson-Type-VII Item Response Model for Assessing Person Fluctuation

    Science.gov (United States)

    Ferrando, Pere J.

    2007-01-01

    Using Lumsden's Thurstonian fluctuation model as a starting point, this paper attempts to develop a unidimensional item response theory model intended for binary personality items. Under some additional assumptions, a new model is obtained in which the item characteristic curves are defined by a cumulative Pearson-Type-VII distribution, and the…

  18. Item response theory and the measurement of psychiatric constructs: some empirical and conceptual issues and challenges.

    Science.gov (United States)

    Reise, S P; Rodriguez, A

    2016-07-01

    Item response theory (IRT) measurement models are now commonly used in educational, psychological, and health-outcomes measurement, but their impact in the evaluation of measures of psychiatric constructs remains limited. Herein we present two, somewhat contradictory, theses. The first is that, when skillfully applied, IRT has much to offer psychiatric measurement in terms of scale development, psychometric analysis, and scoring. The second argument, however, is that psychiatric measurement presents some unique challenges to the application of IRT - challenges that may not be easily addressed by application of conventional IRT models and methods. These challenges include, but are not limited to, the modeling of conceptually narrow constructs and their associated limited item pools, and unipolar constructs where the expected latent trait distribution is highly skewed. PMID:27056796

  19. Adult Attachment Ratings (AAR): An Item Response Theory Analysis

    OpenAIRE

    Pilkonis, Paul A.; Kim, Yookyung; Yu, Lan; Morse, Jennifer Q.

    2013-01-01

    The Adult Attachment Ratings (AAR) include 3 scales for anxious, ambivalent attachment (excessive dependency, interpersonal ambivalence, and compulsive care-giving), 3 for avoidant attachment (rigid self-control, defensive separation, and emotional detachment), and 1 for secure attachment. The scales include items (ranging from 6–16 in their original form) scored by raters using a 3-point format (0 = absent, 1 = present, and 2 = strongly present) and summed to produce a total score. Item resp...

  20. An Item Response Theory Analysis of the Community of Inquiry Scale

    OpenAIRE

    Mehmet Barış Horzum; Gülden Kaya Uyanık

    2015-01-01

    The aim of this study is to examine validity and reliability of Community of Inquiry Scale commonly used in online learning by the means of Item Response Theory. For this purpose, Community of Inquiry Scale version 14 is applied on 1,499 students of a distance education center’s online learning programs at a Turkish state university via internet. The collected data is analyzed by using a statistical software package. Research data is analyzed in three aspects, which are checking model assumpt...

  1. Dimensionality of the UWES-17: An item response modelling analysis

    Directory of Open Access Journals (Sweden)

    Deon P. de Bruin

    2013-03-01

    Orientation: Questionnaires, particularly the Utrecht Work Engagement Scale (UWES-17), are an almost standard method by which to measure work engagement. Conflicting evidence regarding the dimensionality of the UWES-17 has led to confusion regarding the interpretation of scores. Research purpose: The main focus of this study was to use the Rasch model to provide insight into the dimensionality of the UWES-17, and to assess whether work engagement should be interpreted as one single overall score, three separate scores, or a combination. Motivation for the study: It is unclear whether a summative score is more representative of work engagement or whether scores are more meaningful when interpreted for each dimension separately. Previous work relied on confirmatory factor analysis; the potential of item response models has not been tapped. Research design: A quantitative cross-sectional survey design approach was used. Participants, 2429 employees of a South African Information and Communication Technology (ICT) company, completed the UWES-17. Main findings: Findings indicate that work engagement should be treated as a unidimensional construct: individual scores should be interpreted in a summative manner, giving a single global score. Practical/managerial implications: Users of the UWES-17 may interpret a single, summative score for work engagement. Findings of this study should also contribute towards standardising UWES-17 scores, allowing meaningful comparisons to be made. Contribution/value-add: The findings will benefit researchers, organisational consultants and managers. Clarity on dimensionality and interpretation of work engagement will assist researchers in future studies. Managers and consultants will be able to make better-informed decisions when using work engagement data.

  2. The Role of Psychometric Modeling in Test Validation: An Application of Multidimensional Item Response Theory

    Science.gov (United States)

    Schilling, Stephen G.

    2007-01-01

    In this paper the author examines the role of item response theory (IRT), particularly multidimensional item response theory (MIRT) in test validation from a validity argument perspective. The author provides justification for several structural assumptions and interpretations, taking care to describe the role he believes they should play in any…

  3. Secondary Psychometric Examination of the Dimensional Obsessive-Compulsive Scale: Classical Testing, Item Response Theory, and Differential Item Functioning.

    Science.gov (United States)

    Thibodeau, Michel A; Leonard, Rachel C; Abramowitz, Jonathan S; Riemann, Bradley C

    2015-12-01

    The Dimensional Obsessive-Compulsive Scale (DOCS) is a promising measure of obsessive-compulsive disorder (OCD) symptoms but has received minimal psychometric attention. We evaluated the utility and reliability of DOCS scores. The study included 832 students and 300 patients with OCD. Confirmatory factor analysis supported the originally proposed four-factor structure. DOCS total and subscale scores exhibited good to excellent internal consistency in both samples (α = .82 to α = .96). Patient DOCS total scores reduced substantially during treatment (t = 16.01, d = 1.02). DOCS total scores discriminated between students and patients (sensitivity = 0.76, 1 - specificity = 0.23). The measure did not exhibit gender-based differential item functioning as tested by Mantel-Haenszel chi-square tests. Expected response options for each item were plotted as a function of item response theory and demonstrated that DOCS scores incrementally discriminate OCD symptoms ranging from low to extremely high severity. Incremental differences in DOCS scores appear to represent unbiased and reliable differences in true OCD symptom severity. PMID:25422521
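
    As a rough illustration of the Mantel-Haenszel approach to differential item functioning mentioned in this abstract, the sketch below computes the common odds ratio and the continuity-corrected chi-square for a dichotomized item. It is an assumption-laden toy, not the study's analysis: the function name `mantel_haenszel`, the dichotomous (endorsed / not endorsed) coding, and the example tables are all hypothetical, and respondents are assumed to be matched into strata by total score.

```python
def mantel_haenszel(strata):
    """Mantel-Haenszel statistics for one dichotomous item.

    strata -- list of (a, b, c, d) 2x2 tables, one per matched score level:
              a, b = reference group endorsed / not endorsed
              c, d = focal group endorsed / not endorsed
    Returns (common odds ratio, continuity-corrected chi-square, 1 df).
    Assumes at least one stratum has b * c > 0 and nonzero variance.
    """
    num = den = 0.0
    sum_a = sum_expected = sum_var = 0.0
    for a, b, c, d in strata:
        t = a + b + c + d
        if t < 2:
            continue  # a stratum this small carries no information
        num += a * d / t
        den += b * c / t
        sum_a += a
        sum_expected += (a + b) * (a + c) / t
        sum_var += (a + b) * (c + d) * (a + c) * (b + d) / (t * t * (t - 1))
    odds_ratio = num / den
    chi_square = (abs(sum_a - sum_expected) - 0.5) ** 2 / sum_var
    return odds_ratio, chi_square

# Hypothetical strata: identical endorsement rates in both groups (no DIF) ...
or_null, chi_null = mantel_haenszel([(10, 10, 10, 10), (15, 5, 15, 5)])
# ... versus higher endorsement in the reference group at every score level.
or_dif, chi_dif = mantel_haenszel([(18, 2, 10, 10), (19, 1, 12, 8)])
print(or_null, chi_null)
print(or_dif, chi_dif)
```

    A chi-square below the 1-df critical value of about 3.84 (at the 0.05 level) is consistent with no group-based DIF on that item, which is the pattern the abstract reports for gender.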

  4. A model of hippocampal spiking responses to items during learning of a context-dependent task

    Directory of Open Access Journals (Sweden)

    Florian Raudies

    2014-09-01

    Single unit recordings in the rat hippocampus have demonstrated shifts in the specificity of spiking activity during learning of a contextual item-reward association task. In this task, rats received reward for responding to different items dependent upon the context an item appeared in, but not dependent upon the location an item appeared at. Initially, neurons in the rat hippocampus primarily show firing based on place, but as the rat learns the task this firing becomes more selective for items. We simulated this effect using a simple circuit model with discrete inputs driving spiking activity representing place and item, followed sequentially by a discrete representation of the motor actions involving a response to an item (digging for food) or the movement to a different item (movement to a different pot for food). We implemented spiking replay in the network representing neural activity observed during sharp-wave ripple events, and modified synaptic connections based on a simple representation of spike-timing dependent synaptic plasticity. This simple network was able to consistently learn the context-dependent responses, and transitioned from dominant coding of place to a gradual increase in specificity to items consistent with analysis of the experimental data. In addition, the model showed an increase in specificity toward context. The increase of selectivity in the model is accompanied by an increase in binariness of the synaptic weights for cells that are part of the functional network.

  5. Use of NON-PARAMETRIC Item Response Theory to develop a shortened version of the Positive and Negative Syndrome Scale (PANSS)

    Directory of Open Access Journals (Sweden)

    Khan Anzalee

    2011-11-01

    Background: Nonparametric item response theory (IRT) was used to examine (a) the performance of the 30 Positive and Negative Syndrome Scale (PANSS) items and their options (levels of severity), (b) the effectiveness of various subscales to discriminate among differences in symptom severity, and (c) the development of an abbreviated PANSS (Mini-PANSS) based on IRT and a method to link scores to the original PANSS. Methods: Baseline PANSS scores from 7,187 patients with Schizophrenia or Schizoaffective disorder who were enrolled between 1995 and 2005 in psychopharmacology trials were obtained. Option characteristic curves (OCCs) and Item Characteristic Curves (ICCs) were constructed to examine the probability of rating each of seven options within each of 30 PANSS items as a function of subscale severity, and summed-score linking was applied to items selected for the Mini-PANSS. Results: The majority of items forming the Positive and Negative subscales (i.e. 19 items) performed very well and discriminated better along symptom severity compared to the General Psychopathology subscale. Six of the seven Positive Symptom items, six of the seven Negative Symptom items, and seven out of the 16 General Psychopathology items were retained for inclusion in the Mini-PANSS. Summed score linking and linear interpolation were able to produce a translation table for comparing total subscale scores of the Mini-PANSS to total subscale scores on the original PANSS. Results show scores on the subscales of the Mini-PANSS can be linked to scores on the original PANSS subscales, with very little bias. Conclusions: The study demonstrated the utility of non-parametric IRT in examining the item properties of the PANSS and in allowing selection of items for an abbreviated PANSS scale. The comparisons between the 30-item PANSS and the Mini-PANSS revealed that the shorter version is comparable to the 30-item PANSS, but when applying IRT, the Mini-PANSS is also a good indicator of

  6. Origin of the Scaling Constant "d" = 1.7 in Item Response Theory.

    Science.gov (United States)

    Camilli, Gregory

    1994-01-01

    Describes the scaling constant "d" = 1.702, used in Item Response Theory, which minimizes the maximum difference between the normal and logistic distribution functions. Recapitulates the theoretical and numerical derivation of "d" given by D. Haley (1952). (SLD)
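
    The minimax property described in this record can be checked numerically. The sketch below is only an illustrative grid search, not Haley's or Camilli's derivation: the helper names, the search interval 1.600 to 1.800, and the grid resolution are all arbitrary assumptions. It looks for the value of "d" that minimizes the largest absolute gap between the standard normal distribution function and the logistic function evaluated at d times x.

```python
import math

def normal_cdf(x):
    """Standard normal distribution function, via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def logistic_cdf(x):
    """Standard logistic distribution function."""
    return 1.0 / (1.0 + math.exp(-x))

def max_gap(d, hi=6.0, steps=2000):
    """Largest |Phi(x) - logistic(d * x)| on a grid over [0, hi].

    Both functions are symmetric about (0, 0.5), so x >= 0 suffices.
    """
    worst = 0.0
    for i in range(steps + 1):
        x = hi * i / steps
        worst = max(worst, abs(normal_cdf(x) - logistic_cdf(d * x)))
    return worst

# Assumed search grid: d = 1.600, 1.601, ..., 1.800.
candidates = [1.600 + 0.001 * k for k in range(201)]
best_d = min(candidates, key=max_gap)
print(round(best_d, 3), round(max_gap(best_d), 4))
```

    Under these assumptions the search should land close to the 1.702 quoted in the record, with a maximum gap just under 0.01, which is why a logistic model rescaled by "d" is treated as a close stand-in for the normal ogive.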

  7. Numerical Differentiation Methods for Computing Error Covariance Matrices in Item Response Theory Modeling: An Evaluation and a New Proposal

    Science.gov (United States)

    Tian, Wei; Cai, Li; Thissen, David; Xin, Tao

    2013-01-01

    In item response theory (IRT) modeling, the item parameter error covariance matrix plays a critical role in statistical inference procedures. When item parameters are estimated using the EM algorithm, the parameter error covariance matrix is not an automatic by-product of item calibration. Cai proposed the use of Supplemented EM algorithm for…

  8. Item Response Theory Analyses of the Cambridge Face Memory Test (CFMT)

    OpenAIRE

    Cho, Sun-Joo; Wilmer, Jeremy; Herzmann, Grit; McGugin, Rankin; Fiset, Daniel; Van Gulick, Ana E.; Ryan, Katie; Gauthier, Isabel

    2015-01-01

    We evaluated the psychometric properties of the Cambridge face memory test (CFMT; Duchaine & Nakayama, 2006). First, we assessed the dimensionality of the test with a bi-factor exploratory factor analysis (EFA). This EFA analysis revealed a general factor and three specific factors clustered by targets of CFMT. However, the three specific factors appeared to be minor factors that can be ignored. Second, we fit a unidimensional item response model. This item response model showed that the CFMT...

  9. Re-evaluating a vision-related quality of life questionnaire with item response theory (IRT) and differential item functioning (DIF) analyses

    Directory of Open Access Journals (Sweden)

    Knol Dirk L

    2011-09-01

    Background: For the Low Vision Quality Of Life questionnaire (LVQOL) it is unknown whether the psychometric properties are satisfactory when an item response theory (IRT) perspective is considered. This study evaluates some essential psychometric properties of the LVQOL questionnaire in an IRT model, and investigates differential item functioning (DIF). Methods: Cross-sectional data were used from an observational study among visually-impaired patients (n = 296). Calibration was performed for every dimension of the LVQOL in the graded response model. Item goodness-of-fit was assessed with the S-X2 test. DIF was assessed on relevant background variables (i.e. age, gender, visual acuity, eye condition, rehabilitation type and administration type) with likelihood-ratio tests for DIF. The magnitude of DIF was interpreted by assessing the largest difference in expected scores between subgroups. Measurement precision was assessed by presenting test information curves; reliability with the index of subject separation. Results: All items of the LVQOL dimensions fitted the model. There was significant DIF on several items. For two items the maximum difference between expected scores exceeded one point, and DIF was found on multiple relevant background variables. Item 1 'Vision in general' from the "Adjustment" dimension and item 24 'Using tools' from the "Reading and fine work" dimension were removed. Test information was highest for the "Reading and fine work" dimension. Indices for subject separation ranged from 0.83 to 0.94. Conclusions: The items of the LVQOL showed satisfactory item fit to the graded response model; however, two items were removed because of DIF. The adapted LVQOL with 21 items is DIF-free and therefore seems highly appropriate for use in heterogeneous populations of visually impaired patients.

  10. Compensatory and non-compensatory multidimensional randomized item response models

    NARCIS (Netherlands)

    Fox, J.P.; Entink, R.K.; Avetisyan, M.

    2014-01-01

    Randomized response (RR) models are often used for analysing univariate randomized response data and measuring population prevalence of sensitive behaviours. There is much empirical support for the belief that RR methods improve the cooperation of the respondents. Recently, RR models have been exten

  11. Item response theory analyses of the Cambridge Face Memory Test (CFMT).

    Science.gov (United States)

    Cho, Sun-Joo; Wilmer, Jeremy; Herzmann, Grit; McGugin, Rankin Williams; Fiset, Daniel; Van Gulick, Ana E; Ryan, Kaitlin F; Gauthier, Isabel

    2015-06-01

    We evaluated the psychometric properties of the Cambridge Face Memory Test (CFMT; Duchaine & Nakayama, 2006). First, we assessed the dimensionality of the test with a bifactor exploratory factor analysis (EFA). This EFA analysis revealed a general factor and 3 specific factors clustered by targets of CFMT. However, the 3 specific factors appeared to be minor factors that can be ignored. Second, we fit a unidimensional item response model. This item response model showed that the CFMT items could discriminate individuals at different ability levels and covered a wide range of the ability continuum. We found the CFMT to be particularly precise for a wide range of ability levels. Third, we implemented item response theory (IRT) differential item functioning (DIF) analyses for each gender group and 2 age groups (age ≤ 20 vs. age > 21). This DIF analysis suggested little evidence of consequential differential functioning on the CFMT for these groups, supporting the use of the test to compare older to younger, or male to female, individuals. Fourth, we tested for a gender difference on the latent facial recognition ability with an explanatory item response model. We found a significant but small gender difference on the latent ability for face recognition, which was higher for women than men by 0.184, at age mean 23.2, controlling for linear and quadratic age effects. Finally, we discuss the practical considerations of the use of total scores versus IRT scale scores in applications of the CFMT.

  12. Explanatory multidimensional multilevel random item response model: an application to simultaneous investigation of word and person contributions to multidimensional lexical representations.

    Science.gov (United States)

    Cho, Sun-Joo; Gilbert, Jennifer K; Goodwin, Amanda P

    2013-10-01

    This paper presents an explanatory multidimensional multilevel random item response model and its application to reading data with multilevel item structure. The model includes multilevel random item parameters that allow consideration of variability in item parameters at both item and item group levels. Item-level random item parameters were included to model unexplained variance remaining when item related covariates were used to explain variation in item difficulties. Item group-level random item parameters were included to model dependency in item responses among items having the same item stem. Using the model, this study examined the dimensionality of a person's word knowledge, termed lexical representation, and how aspects of morphological knowledge contributed to lexical representations for different persons, items, and item groups.

  13. Linking Outcomes from Peabody Picture Vocabulary Test Forms Using Item Response Models

    Science.gov (United States)

    Hoffman, Lesa; Templin, Jonathan; Rice, Mabel L.

    2012-01-01

    Purpose: The present work describes how vocabulary ability as assessed by 3 different forms of the Peabody Picture Vocabulary Test (PPVT; Dunn & Dunn, 1997) can be placed on a common latent metric through item response theory (IRT) modeling, by which valid comparisons of ability between samples or over time can then be made. Method: Responses from…

  14. Measuring organizational effectiveness in information and communication technology companies using item response theory.

    Science.gov (United States)

    Trierweiller, Andréa Cristina; Peixe, Blênio César Severo; Tezza, Rafael; Pereira, Vera Lúcia Duarte do Valle; Pacheco, Waldemar; Bornia, Antonio Cezar; de Andrade, Dalton Francisco

    2012-01-01

    The aim of this paper is to measure the effectiveness of Information and Communication Technology (ICT) organizations from the point of view of the manager, using Item Response Theory (IRT). There is a need to verify the effectiveness of these organizations, which are normally associated with complex, dynamic, and competitive environments. In the academic literature, there is disagreement surrounding the concept of organizational effectiveness and its measurement. A construct was elaborated based on dimensions of effectiveness to guide the construction of the questionnaire items, which were submitted to specialists for evaluation. The approach proved viable for measuring the organizational effectiveness of ICT companies from the point of view of a manager using the Two-Parameter Logistic Model (2PLM) of IRT. This modeling permits us to evaluate the quality and properties of each item and to place items and respondents on a single scale, which is not possible when using other similar tools.
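
    For readers unfamiliar with the Two-Parameter Logistic Model named in this abstract, the sketch below shows its standard form: the probability of endorsing an item depends on the respondent's latent level theta, the item discrimination a, and the item location b, all on one scale. The function names and the example parameter values are hypothetical and are not taken from the study.

```python
import math

def p_2pl(theta, a, b):
    """2PL probability of endorsing an item.

    theta -- respondent's latent trait level
    a     -- item discrimination (slope)
    b     -- item location / difficulty, on the same scale as theta
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def item_information(theta, a, b):
    """Fisher information of a 2PL item at theta: a^2 * P * (1 - P)."""
    p = p_2pl(theta, a, b)
    return a * a * p * (1.0 - p)

# Hypothetical item: discrimination 1.2, location 0.5.
a, b = 1.2, 0.5
for theta in (-2.0, 0.5, 2.0):
    print(theta, round(p_2pl(theta, a, b), 3),
          round(item_information(theta, a, b), 3))
```

    Because b and theta share a scale, an item is most informative for respondents whose theta is near its b, and a larger a makes that peak higher; this is the sense in which items and respondents are placed on a single scale.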

  15. mirt: A Multidimensional Item Response Theory Package for the R Environment

    Directory of Open Access Journals (Sweden)

    R. Philip Chalmers

    2012-05-01

    Item response theory (IRT) is widely used in assessment and evaluation research to explain how participants respond to item level stimuli. Several R packages can be used to estimate the parameters in various IRT models, the most flexible being the ltm (Rizopoulos 2006), eRm (Mair and Hatzinger 2007), and MCMCpack (Martin, Quinn, and Park 2011) packages. However, these packages have limitations in that ltm and eRm can only analyze unidimensional IRT models effectively, and the exploratory multidimensional extensions available in MCMCpack require prior understanding of Bayesian estimation convergence diagnostics and are computationally intensive. Most importantly, multidimensional confirmatory item factor analysis methods have not been implemented in any R package. The mirt package was created for estimating multidimensional item response theory parameters for exploratory and confirmatory models by using maximum-likelihood methods. The Gauss-Hermite quadrature method used in traditional EM estimation (e.g., Bock and Aitkin 1981) is presented for exploratory item response models as well as for confirmatory bifactor models (Gibbons and Hedeker 1992). Exploratory and confirmatory models are estimated by a stochastic algorithm described by Cai (2010a,b). Various program comparisons are presented and future directions for the package are discussed.

  16. Cognitive Diagnostic Models for Tests with Multiple-Choice and Constructed-Response Items

    Science.gov (United States)

    Kuo, Bor-Chen; Chen, Chun-Hua; Yang, Chih-Wei; Mok, Magdalena Mo Ching

    2016-01-01

    Traditionally, teachers evaluate students' abilities via their total test scores. Recently, cognitive diagnostic models (CDMs) have begun to provide information about the presence or absence of students' skills or misconceptions. Nevertheless, CDMs are typically applied to tests with multiple-choice (MC) items, which provide less diagnostic…

  17. Are vocabulary tests measurement invariant between age groups? An item response analysis of three popular tests.

    Science.gov (United States)

    Fox, Mark C; Berry, Jane M; Freeman, Sara P

    2014-12-01

    Relatively high vocabulary scores of older adults are generally interpreted as evidence that older adults possess more of a common ability than younger adults. Yet, this interpretation rests on empirical assumptions about the uniformity of item-response functions between groups. In this article, we test item response models of differential responding against datasets containing younger-, middle-aged-, and older-adult responses to three popular vocabulary tests (the Shipley, Ekstrom, and WAIS-R) to determine whether members of different age groups who achieve the same scores have the same probability of responding in the same categories (e.g., correct vs. incorrect) under the same conditions. Contrary to the null hypothesis of measurement invariance, datasets for all three tests exhibit substantial differential responding. Members of different age groups who achieve the same overall scores exhibit differing response probabilities in relation to the same items (differential item functioning) and appear to approach the tests in qualitatively different ways that generalize across items. Specifically, younger adults are more likely than older adults to leave items unanswered for partial credit on the Ekstrom, and to produce 2-point definitions on the WAIS-R. Yet, older adults score higher than younger adults, consistent with most reports of vocabulary outcomes in the cognitive aging literature. In light of these findings, the most generalizable conclusion to be drawn from the cognitive aging literature on vocabulary tests is simply that older adults tend to score higher than younger adults, and not that older adults possess more of a common ability.

  18. Applied orienting response research: some examples.

    Science.gov (United States)

    Tremayne, P; Barry, R J

    1990-01-01

    The development of orienting response (OR) theory has not been accompanied by many applications of the concept--most research still appears to be lab-based and "pure," rather than "applied." We present some examples from our own work in which the OR perspective has been applied in a wider context. These cover the exploration of processing deficits in autistic children, aspects of the "repression" of anxiety in elite athletes, and the locus of alcohol effects. Such applications of the OR concept in real-life situations seem a logical and, indeed, necessary step in the evolution of this area of psychophysiology.

  19. Computer Response Time Measurements of Mood, Fatigue and Symptom Scale Items: Implications for Scale Response Time Uses.

    Science.gov (United States)

    Ryman, David H.; And Others

    1988-01-01

    Describes study conducted with U.S. Marine Corps enlisted personnel to measure response time to computer-administered questionnaire items, and to evaluate how measurement of response time might be useful in various research areas. Topics addressed include mood states; the occurrence of straight lining; and experimental effects of sleep loss and…

  20. Discussion of David Thissen's Bad Questions: An Essay Involving Item Response Theory

    Science.gov (United States)

    Wainer, Howard

    2016-01-01

    The usual role of a discussant is to clarify and correct the paper being discussed, but in this case, the author, Howard Wainer, generally agrees with everything David Thissen says in his essay, "Bad Questions: An Essay Involving Item Response Theory." This essay expands on David Thissen's statement that there are typically two principal…

  1. The Shortened Raven Standard Progressive Matrices: Item Response Theory-Based Psychometric Analyses and Normative Data

    Science.gov (United States)

    Van der Elst, Wim; Ouwehand, Carolijn; van Rijn, Peter; Lee, Nikki; Van Boxtel, Martin; Jolles, Jelle

    2013-01-01

    The purpose of the present study was to evaluate the psychometric properties of a shortened version of the Raven Standard Progressive Matrices (SPM) under an item response theory framework (the one- and two-parameter logistic models). The shortened Raven SPM was administered to N = 453 cognitively healthy adults aged between 24 and 83 years. The…

  2. The Value of Item Response Theory in Clinical Assessment: A Review

    Science.gov (United States)

    Thomas, Michael L.

    2011-01-01

    Item response theory (IRT) and related latent variable models represent modern psychometric theory, the successor to classical test theory in psychological assessment. Although IRT has become prevalent in the measurement of ability and achievement, its contributions to clinical domains have been less extensive. Applications of IRT to clinical…

  3. An Item Response Theory Analysis of the Mathematics Teaching Efficacy Beliefs Instrument

    Science.gov (United States)

    Kieftenbeld, Vincent; Natesan, Prathiba; Eddy, Colleen

    2011-01-01

    The mathematics teaching efficacy beliefs of preservice elementary teachers have been the subject of several studies. A widely used measure in these studies is the Mathematics Teaching Efficacy Beliefs Instrument (MTEBI). The present study provides a detailed analysis of the psychometric properties of the MTEBI using Bayesian item response theory.…

  4. Three Essays on Teacher Education Programs and Test-Takers' Response Times on Test Items

    Science.gov (United States)

    Qian, Hong

    2013-01-01

    This dissertation includes three essays: one essay focuses on the effect of teacher preparation programs on teacher knowledge while the other two focus on test-takers' response times on test items. Essay One addresses the problem of how opportunities to learn in teacher preparation programs influence future elementary mathematics teachers'…

  5. Using Item Response Theory to Evaluate Measurement Precision of Selection Tests at the French Pilot Training

    NARCIS (Netherlands)

    Veldhuis, M.; Matton, N.; Vautier, S.

    2012-01-01

    In pilot selection settings, decisions are often based on cutoff scores. In item response theory the measurement precision of a test score can be evaluated by its degree of information. We investigated whether the maximum of test information corresponded to the cutoff zone for 10 cognitive ability t

  6. Measuring Integration of Information and Communication Technology in Education: An Item Response Modeling Approach

    Science.gov (United States)

    Peeraer, Jef; Van Petegem, Peter

    2012-01-01

    This research describes the development and validation of an instrument to measure integration of Information and Communication Technology (ICT) in education. After literature research on definitions of integration of ICT in education, a comparison is made between the classical test theory and the item response modeling approach for the…

  7. Optimal and Most Exact Confidence Intervals for Person Parameters in Item Response Theory Models

    Science.gov (United States)

    Doebler, Anna; Doebler, Philipp; Holling, Heinz

    2013-01-01

    The common way to calculate confidence intervals for item response theory models is to assume that the standardized maximum likelihood estimator for the person parameter [theta] is normally distributed. However, this approximation is often inadequate for short and medium test lengths. As a result, the coverage probabilities fall below the given…
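
The normal-approximation interval that this abstract calls the common approach can be sketched as follows. This is a minimal illustration assuming a two-parameter logistic model; the function names and item parameters are hypothetical and are not taken from the paper, and the paper's own optimal intervals are not reproduced here.

```python
import math
from statistics import NormalDist


def item_information(theta, a, b):
    """Fisher information of one 2PL item at ability theta."""
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)


def wald_interval(theta_hat, items, level=0.95):
    """Normal-approximation interval: theta_hat +/- z / sqrt(test information)."""
    info = sum(item_information(theta_hat, a, b) for a, b in items)
    se = 1.0 / math.sqrt(info)
    z = NormalDist().inv_cdf(0.5 + level / 2.0)
    return theta_hat - z * se, theta_hat + z * se


# Hypothetical (discrimination, difficulty) pairs, for illustration only.
short_test = [(1.2, -1.0), (1.0, 0.0), (0.8, 0.5), (1.5, 1.0), (1.1, -0.5)]
long_test = short_test * 8

print("5 items: ", wald_interval(0.3, short_test))
print("40 items:", wald_interval(0.3, long_test))
```

The interval narrows as items are added because test information accumulates across items. The sketch does not check coverage, which is the property the abstract reports as inadequate for short and medium tests.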

  8. Mokken scale analysis : Between the Guttman scale and parametric item response theory

    NARCIS (Netherlands)

    van Schuur, Wijbrandt H.

    2003-01-01

    This article introduces a model of ordinal unidimensional measurement known as Mokken scale analysis. Mokken scaling is based on principles of Item Response Theory (IRT) that originated in the Guttman scale. I compare the Mokken model with both Classical Test Theory (reliability or factor analysis)

  9. Assessing Model Data Fit of Unidimensional Item Response Theory Models in Simulated Data

    Science.gov (United States)

    Kose, Ibrahim Alper

    2014-01-01

    The purpose of this paper is to give an example of how to assess the model-data fit of unidimensional IRT models in simulated data. Also, the present research aims to explain the importance of fit and the consequences of misfit by using simulated data sets. Responses of 1000 examinees to a dichotomously scoring 20 item test were simulated with 25…

  10. Comparison of Fixed-Item and Response-Sensitive Versions of an Online Tutorial

    Science.gov (United States)

    Grant, Lyle K.; Courtoreille, Marni

    2007-01-01

    This study is a comparison of 2 versions of an Internet-based tutorial that teaches the behavior-analysis concept of positive reinforcement. A fixed-item group of students studied a version of the tutorial that included 14 interactive examples and nonexamples of the concept. A response-sensitive group of students studied a different version of the…

  11. Using Item Response Theory to Assess Changes in Student Performance Based on Changes in Question Wording

    Science.gov (United States)

    Schurmeier, Kimberly D.; Atwood, Charles H.; Shepler, Carrie G.; Lautenschlager, Gary J.

    2010-01-01

    Five years of longitudinal data for general chemistry student assessments at the University of Georgia have been analyzed using item response theory (IRT). Our analysis indicates that minor changes in question wording on exams can make significant differences in student performance on assessment questions. This analysis encompasses data from over…

  12. A modular approach for item response theory modeling with the R package flirt.

    Science.gov (United States)

    Jeon, Minjeong; Rijmen, Frank

    2016-06-01

    The new R package flirt is introduced for flexible item response theory (IRT) modeling of psychological, educational, and behavior assessment data. flirt integrates a generalized linear and nonlinear mixed modeling framework with graphical model theory. The graphical model framework allows for efficient maximum likelihood estimation. The key feature of flirt is its modular approach to facilitate convenient and flexible model specifications. Researchers can construct customized IRT models by simply selecting various modeling modules, such as parametric forms, number of dimensions, item and person covariates, person groups, link functions, etc. In this paper, we describe major features of flirt and provide examples to illustrate how flirt works in practice.

  13. Applicability of Item Response Theory to the Korean Nurses' Licensing Examination

    Directory of Open Access Journals (Sweden)

    Geum-Hee Jeong

    2005-06-01

    To test the applicability of item response theory (IRT) to the Korean Nurses' Licensing Examination (KNLE), item analysis was performed after testing the unidimensionality and goodness-of-fit. The results were compared with those based on classical test theory. The results of the 330-item KNLE administered to 12,024 examinees in January 2004 were analyzed. Unidimensionality was tested using DETECT and the goodness-of-fit was tested using WINSTEPS for the Rasch model and Bilog-MG for the two-parameter logistic model. Item analysis and ability estimation were done using WINSTEPS. Using DETECT, Dmax ranged from 0.1 to 0.23 for each subject. The mean square value of the infit and outfit values of all items using WINSTEPS ranged from 0.1 to 1.5, except for one item in pediatric nursing, which scored 1.53. Of the 330 items, 218 (42.7%) were misfit using the two-parameter logistic model of Bilog-MG. The correlation coefficients between the difficulty parameter using the Rasch model and the difficulty index from classical test theory ranged from 0.9039 to 0.9699. The correlation between the ability parameter using the Rasch model and the total score from classical test theory ranged from 0.9776 to 0.9984. Therefore, the KNLE results satisfy unidimensionality and goodness-of-fit for the Rasch model. The KNLE should be a good sample for analysis according to the IRT Rasch model, so further research using IRT is possible.

  14. Modeling the World Health Organization Disability Assessment Schedule II using non-parametric item response models.

    Science.gov (United States)

    Galindo-Garre, Francisca; Hidalgo, María Dolores; Guilera, Georgina; Pino, Oscar; Rojo, J Emilio; Gómez-Benito, Juana

    2015-03-01

    The World Health Organization Disability Assessment Schedule II (WHO-DAS II) is a multidimensional instrument developed for measuring disability. It comprises six domains (getting around, self-care, getting along with others, life activities and participation in society). The main purpose of this paper is the evaluation of the psychometric properties for each domain of the WHO-DAS II with parametric and non-parametric Item Response Theory (IRT) models. A secondary objective is to assess whether the WHO-DAS II items within each domain form a hierarchy of invariantly ordered severity indicators of disability. A sample of 352 patients with a schizophrenia spectrum disorder is used in this study. The 36-item WHO-DAS II was administered during the consultation. Partial Credit and Mokken scale models are used to study the psychometric properties of the questionnaire. The psychometric properties of the WHO-DAS II scale are satisfactory for all the domains. However, we identify a few items that do not discriminate satisfactorily between different levels of disability and cannot be invariantly ordered in the scale. In conclusion, the WHO-DAS II can be used to assess overall disability in patients with schizophrenia, but some domains are too general to assess functionality in these patients because they contain items that are not applicable to this pathology. PMID:25524862

  16. Unidimensional Item Factor Analysis: A Comparison of Categorical Confirmatory Factor Analysis and Item Response Theory

    Institute of Scientific and Technical Information of China (English)

    刘红云; 李美娟; 骆方; 李小山

    2012-01-01

    item factor loading and of item discrimination parameter was influenced by the size of the overall factor loading (discrimination). (5) The distribution of the item thresholds affected the precision of the parameter estimates, and item discrimination was the parameter most sensitive to the threshold. (6) On the whole, the precision of item parameter estimates in the SEM framework was higher than that in the IRT framework. Both structural equation modeling (SEM) and item response theory (IRT) can be used for factor analysis of dichotomous item responses. In this case, the measurement models of both approaches are formally equivalent. They were refined within and across different disciplines, and have made complementary contributions to central measurement problems encountered in almost all empirical social science research fields. The authors conclude with considerations for categorical item factor analysis and give some advice for applied researchers.

  17. On the Relationship between Classical Test Theory and Item Response Theory: From One to the Other and Back

    Science.gov (United States)

    Raykov, Tenko; Marcoulides, George A.

    2016-01-01

    The frequently neglected and often misunderstood relationship between classical test theory and item response theory is discussed for the unidimensional case with binary measures and no guessing. It is pointed out that popular item response models can be directly obtained from classical test theory-based models by accounting for the discrete…

  18. Estimating Ordinal Reliability for Likert-Type and Ordinal Item Response Data: A Conceptual, Empirical, and Practical Guide

    Science.gov (United States)

    Gadermann, Anne M.; Guhn, Martin; Zumbo, Bruno D.

    2012-01-01

    This paper provides a conceptual, empirical, and practical guide for estimating ordinal reliability coefficients for ordinal item response data (also referred to as Likert, Likert-type, ordered categorical, or rating scale item responses). Conventionally, reliability coefficients, such as Cronbach's alpha, are calculated using a Pearson…

  19. Sample Size Requirements for Estimation of Item Parameters in the Multidimensional Graded Response Model.

    Science.gov (United States)

    Jiang, Shengyu; Wang, Chun; Weiss, David J

    2016-01-01

    Likert types of rating scales in which a respondent chooses a response from an ordered set of response options are used to measure a wide variety of psychological, educational, and medical outcome variables. The most appropriate item response theory model for analyzing and scoring these instruments when they provide scores on multiple scales is the multidimensional graded response model (MGRM). A simulation study was conducted to investigate the variables that might affect item parameter recovery for the MGRM. Data were generated based on different sample sizes, test lengths, and scale intercorrelations. Parameter estimates were obtained through the flexMIRT software. The quality of parameter recovery was assessed by the correlation between true and estimated parameters as well as bias and root-mean-square-error. Results indicated that for the vast majority of cases studied a sample size of N = 500 provided accurate parameter estimates, except for tests with 240 items when 1000 examinees were necessary to obtain accurate parameter estimates. Increasing sample size beyond N = 1000 did not increase the accuracy of MGRM parameter estimates. PMID:26903916
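
The three recovery criteria named in this abstract (correlation, bias, and root-mean-square error between true and estimated parameters) can be computed as in the sketch below. The function name and the parameter values are hypothetical; they are not results from the study, and the study's own estimates came from flexMIRT rather than from code like this.

```python
import math


def recovery_summary(true_values, estimates):
    """Return bias, RMSE and Pearson correlation between true and estimated parameters."""
    n = len(true_values)
    errors = [e - t for t, e in zip(true_values, estimates)]
    bias = sum(errors) / n
    rmse = math.sqrt(sum(err * err for err in errors) / n)

    mean_t = sum(true_values) / n
    mean_e = sum(estimates) / n
    cov = sum((t - mean_t) * (e - mean_e) for t, e in zip(true_values, estimates))
    var_t = sum((t - mean_t) ** 2 for t in true_values)
    var_e = sum((e - mean_e) ** 2 for e in estimates)
    correlation = cov / math.sqrt(var_t * var_e)  # assumes both lists vary

    return {"bias": bias, "rmse": rmse, "correlation": correlation}


# Hypothetical generating and recovered discrimination parameters.
true_a = [1.0, 1.5, 2.0, 0.5]
estimated_a = [1.1, 1.4, 2.2, 0.6]
print(recovery_summary(true_a, estimated_a))
```

Bias captures systematic over- or underestimation, RMSE combines bias and variability, and the correlation shows whether the ordering of parameters is preserved, so the three are usually reported together.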

  20. KernSmoothIRT: An R Package for Kernel Smoothing in Item Response Theory

    Directory of Open Access Journals (Sweden)

    Angelo Mazza

    2014-06-01

    Item response theory (IRT) models are a class of statistical models used to describe the response behaviors of individuals to a set of items having a certain number of options. They are adopted by researchers in social science, particularly in the analysis of performance or attitudinal data, in psychology, education, medicine, marketing and other fields where the aim is to measure latent constructs. Most IRT analyses use parametric models that rely on assumptions that often are not satisfied. In such cases, a nonparametric approach might be preferable; nevertheless, few software implementations support it. To address this gap, this paper presents the R package KernSmoothIRT. It implements kernel smoothing for the estimation of option characteristic curves, and adds several plotting and analytical tools to evaluate the whole test/questionnaire, the items, and the subjects. In order to show the package's capabilities, two real datasets are used, one employing multiple-choice responses, and the other scaled responses.

  1. Sample Size Requirements for Estimation of Item Parameters in the Multidimensional Graded Response Model

    Directory of Open Access Journals (Sweden)

    Shengyu Jiang

    2016-02-01

    Likert types of rating scales in which a respondent chooses a response from an ordered set of response options are used to measure a wide variety of psychological, educational, and medical outcome variables. The most appropriate item response theory model for analyzing and scoring these instruments when they provide scores on multiple scales is the multidimensional graded response model (MGRM). A simulation study was conducted to investigate the variables that might affect item parameter recovery for the MGRM. Data were generated based on different sample sizes, test lengths, and scale intercorrelations. Parameter estimates were obtained through the flexMIRT software. The quality of parameter recovery was assessed by the correlation between true and estimated parameters as well as bias and root-mean-square-error. Results indicated that for the vast majority of cases studied a sample size of N = 500 provided accurate parameter estimates, except for tests with 240 items when 1,000 examinees were necessary to obtain accurate parameter estimates. Increasing sample size beyond N = 1,000 did not increase the accuracy of MGRM parameter estimates.

  2. An Evaluation of the Brief Symptom Inventory-18 Using Item Response Theory: Which Items Are Most Strongly Related to Psychological Distress?

    Science.gov (United States)

    Meijer, Rob R.; de Vries, Rivka M.; van Bruggen, Vincent

    2011-01-01

    The psychometric structure of the Brief Symptom Inventory-18 (BSI-18; Derogatis, 2001) was investigated using Mokken scaling and parametric item response theory. Data of 487 outpatients, 266 students, and 207 prisoners were analyzed. Results of the Mokken analysis indicated that the BSI-18 formed a strong Mokken scale for outpatients and…

  3. An Evaluation of the Brief Symptom Inventory-18 Using Item Response Theory : Which Items Are Most Strongly Related to Psychological Distress?

    NARCIS (Netherlands)

    Meijer, Rob R.; de Vries, Rivka M.; van Bruggen, Vincent

    2011-01-01

    The psychometric structure of the Brief Symptom Inventory-18 (BSI-18; Derogatis, 2001) was investigated using Mokken scaling and parametric item response theory. Data of 487 outpatients, 266 students, and 207 prisoners were analyzed. Results of the Mokken analysis indicated that the BSI-18 formed a

  4. An evaluation of the Brief Symptom Inventory-18 using item response theory: which items are most strongly related to psychological distress?

    NARCIS (Netherlands)

    Meijer, Rob R.; Vries, de Rivka M.; Bruggen, van Vincent

    2011-01-01

    The psychometric structure of the Brief Symptom Inventory–18 (BSI-18; Derogatis, 2001) was investigated using Mokken scaling and parametric item response theory. Data of 487 outpatients, 266 students, and 207 prisoners were analyzed. Results of the Mokken analysis indicated that the BSI-18 formed a

  5. Re-evaluating a vision-related quality of life questionnaire with item response theory (IRT) and differential item functioning (DIF) analyses.

    NARCIS (Netherlands)

    Nispen, R.M.A. van; Knol, D.L.; Langelaan, M.; Rens, G.H.M.B. van

    2011-01-01

    Background: For the Low Vision Quality Of Life questionnaire (LVQOL) it is unknown whether the psychometric properties are satisfactory when an item response theory (IRT) perspective is considered. This study evaluates some essential psychometric properties of the LVQOL questionnaire in an IRT model

  6. Racial/Ethnic Differences in Responses to the Everyday Discrimination Scale: A Differential Item Functioning Analysis

    OpenAIRE

    Lewis, Tené T.; Yang, Frances M; Jacobs, Elizabeth A.; Fitchett, George

    2012-01-01

    The authors examined the impact of race/ethnicity on responses to the Everyday Discrimination Scale, one of the most widely used discrimination scales in epidemiologic and public health research. Participants were 3,295 middle-aged US women (African-American, Caucasian, Chinese, Hispanic, and Japanese) from the Study of Women’s Health Across the Nation (SWAN) baseline examination (1996–1997). Multiple-indicator, multiple-cause models were used to examine differential item functioning (DIF) on...

  7. Nature, nurture, and item response theory: a psychometric approach to behaviour genetics

    OpenAIRE

    Schwabe, Inga

    2016-01-01

    This dissertation discusses a number of psychometric issues that require special attention in the analysis of genetically-informative data, such as data on twins. These include heterogeneous measurement error, scaling and scale transformation, and harmonization of phenotypes. It is shown how ignoring these issues can result in spurious findings of genotype by environment interaction. Multilevel item response theory models are proposed that can help solve these problems.

  8. Mokken scale analysis: Between the Guttman scale and parametric item response theory

    OpenAIRE

    van Schuur, Wijbrandt H.

    2003-01-01

    This article introduces a model of ordinal unidimensional measurement known as Mokken scale analysis. Mokken scaling is based on principles of Item Response Theory (IRT) that originated in the Guttman scale. I compare the Mokken model with both Classical Test Theory (reliability or factor analysis) and parametric IRT models (especially with the one-parameter logistic model known as the Rasch model). Two nonparametric probabilistic versions of the Mokken model are described: the model of Monot...

  9. The diagnostic utility of separation anxiety disorder symptoms: an item response theory analysis.

    Science.gov (United States)

    Cooper-Vince, Christine E; Emmert-Aronson, Benjamin O; Pincus, Donna B; Comer, Jonathan S

    2014-01-01

    At present, it is not clear whether the current definition of separation anxiety disorder (SAD) is the optimal classification of developmentally inappropriate, severe, and interfering separation anxiety in youth. Much remains to be learned about the relative contributions of individual SAD symptoms for informing diagnosis. Two-parameter logistic Item Response Theory analyses were conducted on the eight core SAD symptoms in an outpatient anxiety sample of treatment-seeking children (N = 359, 59.3% female, M Age = 11.2) and their parents to determine the diagnostic utility of each of these symptoms. Analyses considered values of item threshold, which characterizes the SAD severity level at which each symptom has a 50% chance of being endorsed, and item discrimination, which characterizes how well each symptom distinguishes individuals with higher and lower levels of SAD. Distress related to separation and fear of being alone without major attachment figures showed the strongest discrimination properties and the lowest thresholds for being endorsed. In contrast, worry about harm befalling attachment figures showed the poorest discrimination properties, and nightmares about separation showed the highest threshold for being endorsed. Distress related to separation demonstrated crossing differential item functioning associated with age: at lower separation anxiety levels, excessive fear at separation was more likely to be endorsed for children ≥9 years, whereas at higher levels this symptom was more likely to be endorsed by children <9 years. Implications are discussed for optimizing the taxonomy of SAD in youth.
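
The roles of item threshold and item discrimination described in this abstract can be illustrated with the standard two-parameter logistic response function. This is a sketch only: the function name and the parameter values below are hypothetical and are not estimates from the study.

```python
import math


def prob_endorse(theta, a, b):
    """2PL probability that a respondent at severity `theta` endorses an item
    with discrimination `a` and threshold `b`."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical parameters: one sharply discriminating, low-threshold symptom
# and one weakly discriminating, high-threshold symptom.
low_threshold_item = {"a": 2.0, "b": -0.5}
high_threshold_item = {"a": 0.8, "b": 1.5}

for theta in (-1.0, 0.0, 1.0, 2.0):
    p_low = prob_endorse(theta, **low_threshold_item)
    p_high = prob_endorse(theta, **high_threshold_item)
    print(f"theta={theta:+.1f}  low-threshold: {p_low:.2f}  high-threshold: {p_high:.2f}")
```

At `theta == b` the endorsement probability is exactly one half, which matches the abstract's definition of the threshold, and a larger `a` makes the curve rise more steeply around that point.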

  10. Reading ability and print exposure: item response theory analysis of the author recognition test.

    Science.gov (United States)

    Moore, Mariah; Gordon, Peter C

    2015-12-01

    In the author recognition test (ART), participants are presented with a series of names and foils and are asked to indicate which ones they recognize as authors. The test is a strong predictor of reading skill, and this predictive ability is generally explained as occurring because author knowledge is likely acquired through reading or other forms of print exposure. In this large-scale study (1,012 college student participants), we used item response theory (IRT) to analyze item (author) characteristics in order to facilitate identification of the determinants of item difficulty, provide a basis for further test development, and optimize scoring of the ART. Factor analysis suggested a potential two-factor structure of the ART, differentiating between literary and popular authors. Effective and ineffective author names were identified so as to facilitate future revisions of the ART. Analyses showed that the ART is a highly significant predictor of the time spent encoding words, as measured using eyetracking during reading. The relationship between the ART and time spent reading provided a basis for implementing a higher penalty for selecting foils, rather than the standard method of ART scoring (names selected minus foils selected). The findings provide novel support for the view that the ART is a valid indicator of reading volume. Furthermore, they show that frequency data can be used to select items of appropriate difficulty, and that frequency data from corpora based on particular time periods and types of texts may allow adaptations of the test for different populations. PMID:25410405
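
The standard ART scoring rule mentioned in this abstract (names selected minus foils selected) and a variant with a heavier foil penalty can be sketched as below. The function name, the placeholder item labels, and the penalty weight of 2.0 are assumptions made for illustration; the abstract does not state the penalty the authors adopted.

```python
def art_score(selected, real_authors, foil_penalty=1.0):
    """Score one participant: correct author selections minus penalized foil selections.

    foil_penalty=1.0 reproduces the standard rule (names selected minus foils selected).
    Any selected name that is not in `real_authors` is treated as a foil.
    """
    hits = len(selected & real_authors)
    false_alarms = len(selected - real_authors)
    return hits - foil_penalty * false_alarms


# Placeholder labels standing in for real author names and foils.
real_authors = {"author_1", "author_2", "author_3", "author_4"}
selected = {"author_1", "author_2", "author_4", "foil_1"}

print("standard score:", art_score(selected, real_authors))
print("heavier foil penalty:", art_score(selected, real_authors, foil_penalty=2.0))
```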

  11. Deep brain stimulation and responsiveness of the Persian version of Parkinson's disease questionnaire with 39-items.

    Directory of Open Access Journals (Sweden)

    Gholam Ali Shahidi

    2014-12-01

    Assessment of quality of life (QOL) as an outcome measure after deep brain stimulation (DBS) surgery in patients with Parkinson's disease (PD) needs a valid, reliable and responsive instrument. The aim of the current study was to determine the responsiveness of the validated Persian version of the PD questionnaire with 39 items (PDQ-39) after DBS surgery in patients with PD. Eleven patients with PD who were candidates for DBS operation between May 2012 and June 2013 were assessed. The PDQ-39 and the short-form questionnaire with 36 items (SF-36) were used. The standardized response mean (SRM) was used to assess the responsiveness of the PDQ-39. Mean age was 51.8 (8.8) and all but one of the patients were male (10 patients). Mean duration of the disease was 8.7 (2.1) years. Eight patients were categorized as moderate using the Hoehn and Yahr (H and Y) classification. All patients had a better H and Y score compared with the baseline evaluation (3.09 vs. 0.79). The SRM was above 0.70 for all domains, which indicates a large responsiveness for the PDQ-39. The Persian version of the PDQ-39 has acceptable responsiveness and could be used as an outcome measure to evaluate the effect of therapies on PD.
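
The standardized response mean used in this abstract is conventionally the mean of the individual change scores divided by their standard deviation. The sketch below assumes that definition; the function name and the scores are hypothetical and are not data from the study.

```python
from statistics import mean, stdev


def standardized_response_mean(before, after):
    """SRM = mean(change) / sample standard deviation of change, with change = after - before."""
    changes = [a - b for b, a in zip(before, after)]
    return mean(changes) / stdev(changes)


# Hypothetical domain scores for five patients (lower = better quality of life).
baseline = [40.0, 55.0, 35.0, 60.0, 45.0]
follow_up = [30.0, 41.0, 27.0, 47.0, 36.0]

srm = standardized_response_mean(baseline, follow_up)
print(f"SRM = {srm:.2f}")
```

The sign of the SRM only reflects the direction of scoring; its absolute value is what gets compared with a cutoff such as the 0.70 quoted in the abstract.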

  12. Validation of Sustainable Development Practices Scale Using the Bayesian Approach to Item Response Theory

    Directory of Open Access Journals (Sweden)

    Martin Hernani Merino

    2014-12-01

    There has been growing recognition of the importance of creating performance measurement tools for the economic, social and environmental management of micro and small enterprises (MSEs). In this context, this study aims to validate an instrument to assess perceptions of sustainable development practices by MSEs by means of a Graded Response Model (GRM) with a Bayesian approach to Item Response Theory (IRT). The results, based on a sample of 506 university students in Peru, suggest that a valid measurement instrument was achieved. At the end of the paper, methodological and managerial contributions are presented.

  13. Guidelines for ensuring socially responsible public procurement : Case city of Espoo, procurement of textile items

    OpenAIRE

    Laukkanen-Kolesnikova, Heini

    2016-01-01

    In modern society, it is essential for companies and municipalities to ensure that all players in the supply chain are acting in a socially responsible way. The purpose of this study was to find out how the city of Espoo can ensure social responsibility in its purchases. The focus of the study was on a procurement of textile items carried out by Espoo during the year 2015. The research for this thesis was based on several qualitative data including the researcher’s observations about...

  14. ltm: An R Package for Latent Variable Modeling and Item Response Analysis

    Directory of Open Access Journals (Sweden)

    Dimitris Rizopoulos

    2006-11-01

    The R package ltm has been developed for the analysis of multivariate dichotomous and polytomous data using latent variable models, under the Item Response Theory approach. For dichotomous data the Rasch, the Two-Parameter Logistic, and Birnbaum's Three-Parameter models have been implemented, whereas for polytomous data Samejima's Graded Response model is available. Parameter estimates are obtained under marginal maximum likelihood using the Gauss-Hermite quadrature rule. The capabilities and features of the package are illustrated using two real data examples.
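
    The three dichotomous models named above are nested, which a short sketch can make concrete. This is plain Python for illustration only and is not the ltm package's R interface; the function and parameter names are assumptions. Here `a` is discrimination, `b` is difficulty and `c` is the lower asymptote (guessing).

```python
import math


def irf_3pl(theta: float, a: float = 1.0, b: float = 0.0, c: float = 0.0) -> float:
    """Probability of a correct response under the three-parameter logistic model.

    Setting c = 0 gives the two-parameter logistic model; additionally
    fixing a to a common value (1.0 here) gives the Rasch model.
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


theta = 0.5
p_rasch = irf_3pl(theta, b=0.0)                # Rasch: a = 1, c = 0
p_2pl = irf_3pl(theta, a=1.8, b=0.0)           # 2PL: item-specific slope
p_3pl = irf_3pl(theta, a=1.8, b=0.0, c=0.2)    # 3PL: adds a guessing floor
```

    With the same `a` and `b`, the 3PL probability is never below the 2PL probability, because the guessing parameter raises the lower asymptote from 0 to `c`.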

  15. Gender differences in posttraumatic stress symptoms among OEF/OIF veterans: an item response theory analysis.

    Science.gov (United States)

    King, Matthew W; Street, Amy E; Gradus, Jaimie L; Vogt, Dawne S; Resick, Patricia A

    2013-04-01

    Establishing whether men and women tend to express different symptoms of posttraumatic stress in reaction to trauma is important for both etiological research and the design of assessment instruments. Use of item response theory (IRT) can reveal how symptom reporting varies by gender and help determine if estimates of symptom severity for men and women are equally reliable. We analyzed responses to the PTSD Checklist (PCL) from 2,341 U.S. military veterans (51% female) who completed deployments in support of operations in Afghanistan and Iraq (Operation Enduring Freedom/Operation Iraqi Freedom [OEF/OIF]), and tested for differential item functioning by gender with an IRT-based approach. Among men and women with the same overall posttraumatic stress severity, women tended to report more frequent concentration difficulties and distress from reminders whereas men tended to report more frequent nightmares, emotional numbing, and hypervigilance. These item-level gender differences were small (on average d = 0.05), however, and had little impact on PCL measurement precision or expected total scores. For practical purposes, men's and women's severity estimates had similar reliability. This provides evidence that men and women veterans demonstrate largely similar profiles of posttraumatic stress symptoms following exposure to military-related stressors, and some theoretical perspectives suggest this may hold in other traumatized populations.

  16. A New Item Response Theory Model for Open-Ended Online Homework with Multiple Allowed Attempts

    CERN Document Server

    Gönülateş, Emre

    2015-01-01

    Item Response Theory (IRT) was originally developed in traditional exam settings, and it has been shown that the model does not readily transfer to formative assessment in the form of online homework. We investigate whether this is mostly due to learner traits that do not become apparent in exam settings, namely random guessing due to lack of diligence or dedication, and copying work from other students or resources. Both of these traits mask the true ability of the learner, which is the only trait considered in most mainstream unidimensional IRT models. We find that introducing these traits indeed allows a better assessment of the learners' true ability, as well as a better gauge of the quality of assessment items. Correspondence of the model traits to self-reported behavior is investigated and confirmed. We find that of these two traits, copying answers has a larger influence on initial homework attempts than random guessing.

  17. A Teoria da Resposta ao Item: possíveis contribuições aos estudos em marketing The Item Response Theory: possible contributions to marketing studies

    Directory of Open Access Journals (Sweden)

    Danielle Ramos de Miranda Pereira

    2011-01-01

    The widespread use of multidimensional scales by marketing researchers motivated this article, whose purpose is to discuss the application of Item Response Theory (IRT) and to present to the field a method that has proved very effective in the estimation of behavioral constructs. The article therefore presents a discussion of IRT, highlighting its advances over Classical Test Theory (CTT) and its traditional applications in psychometrics and educational assessment. To verify its applicability in marketing studies, a practical application of IRT was conducted in a study involving a scale already widely used by researchers: the market orientation scale (MkTor scale) proposed by Narver and Slater (1990). The results showed that, although the proposed IRT model can be considered satisfactory for application in the context of market orientation, many challenges remain for new studies, such as constructing a scale with a practical interpretation that indicates what it means for a company to have a level of maturity associated with a given construct. The concluding remarks emphasize that the article's main contribution to marketing studies is the presentation of an alternative method for estimating constructs more accurately and for evaluating the quality of scale items.

  18. Mild to severe social fears: ranking types of feared social situations using item response theory.

    Science.gov (United States)

    Crome, Erica; Baillie, Andrew

    2014-06-01

    Social anxiety disorder is one of the most common mental disorders, and is associated with long term impairment, distress and vulnerability to secondary disorders. Certain types of social fears are more common than others, with public speaking fears typically the most prevalent in epidemiological surveys. The distinction between performance- and interaction-based fears has been the focus of long-standing debate in the literature, with evidence that performance-based fears may reflect milder presentations of social anxiety. This study aims to explicitly test whether different types of social fears differ in underlying social anxiety severity using item response theory techniques. Different types of social fears were assessed using items from three different structured diagnostic interviews in four different epidemiological surveys in the United States (n=2261, n=5411) and Australia (n=1845, n=1497); and ranked using 2-parameter logistic item response theory models. Overall, patterns of underlying severity indicated by different fears were consistent across the four samples with items functioning across a range of social anxiety. Public performance fears and speaking at meetings/classes indicated the lowest levels of social anxiety, with increasing severity indicated by situations such as being assertive or attending parties. Fears of using public bathrooms or eating, drinking or writing in public reflected the highest levels of social anxiety. Understanding differences in the underlying severity of different types of social fears has important implications for the underlying structure of social anxiety, and may also enhance the delivery of social anxiety treatment at a population level. PMID:24873885

  19. Psychometric Examination of an Inventory of Self-Efficacy for the Holland Vocational Themes Using Item Response Theory

    Science.gov (United States)

    Turner, Brandon M.; Betz, Nancy E.; Edwards, Michael C.; Borgen, Fred H.

    2010-01-01

    The psychometric properties of measures of self-efficacy for the six themes of Holland's theory were examined using item response theory. Item and scale quality were compared across levels of the trait continuum; all the scales were highly reliable but differentiated better at some levels of the continuum than others. Applications for adaptive…

  20. Estimation of a Ramsay-Curve Item Response Theory Model by the Metropolis-Hastings Robbins-Monro Algorithm

    Science.gov (United States)

    Monroe, Scott; Cai, Li

    2014-01-01

    In Ramsay curve item response theory (RC-IRT) modeling, the shape of the latent trait distribution is estimated simultaneously with the item parameters. In its original implementation, RC-IRT is estimated via Bock and Aitkin's EM algorithm, which yields maximum marginal likelihood estimates. This method, however, does not produce the…

  1. Scaling Users' Perceptions of Library Service Quality Using Item Response Theory: A LibQUAL+ [TM] Study

    Science.gov (United States)

    Wei, Youhua; Thompson, Bruce; Cook, C. Colleen

    2005-01-01

    LibQUAL+[TM] data to date have not been subjected to the modern measurement theory called polytomous item response theory (IRT). The data interpreted here were collected from 42,090 participants who completed the "American English" version of the 22 core LibQUAL+[TM] items, and 12,552 participants from Australia and Europe who completed the…

  2. Potential application of item-response theory to interpretation of medical codes in electronic patient records

    Directory of Open Access Journals (Sweden)

    Dregan Alex

    2011-12-01

    Background: Electronic patient records are generally coded using extensive sets of codes but the significance of the utilisation of individual codes may be unclear. Item response theory (IRT) models are used to characterise the psychometric properties of items included in tests and questionnaires. This study asked whether the properties of medical codes in electronic patient records may be characterised through the application of item response theory models. Methods: Data were provided by a cohort of 47,845 participants from 414 family practices in the UK General Practice Research Database (GPRD) with a first stroke between 1997 and 2006. Each eligible stroke code, out of a set of 202 OXMIS and Read codes, was coded as either recorded or not recorded for each participant. A two parameter IRT model was fitted using marginal maximum likelihood estimation. Estimated parameters from the model were considered to characterise each code with respect to the latent trait of stroke diagnosis. The location parameter is referred to as a calibration parameter, while the slope parameter is referred to as a discrimination parameter. Results: There were 79,874 stroke code occurrences available for analysis. Utilisation of codes varied between family practices with intraclass correlation coefficients of up to 0.25 for the most frequently used codes. IRT analyses were restricted to 110 Read codes. Calibration and discrimination parameters were estimated for 77 (70%) codes that were endorsed for 1,942 stroke patients. Parameters were not estimated for the remaining more frequently used codes. Discrimination parameter values ranged from 0.67 to 2.78, while calibration parameter values ranged from 4.47 to 11.58. The two parameter model gave a better fit to the data than either the one- or three-parameter models. However, high chi-square values for about a fifth of the stroke codes were suggestive of poor item fit. Conclusion: The application of item response

  3. Using item response theory to explore the psychometric properties of extended matching questions examination in undergraduate medical education

    Directory of Open Access Journals (Sweden)

    Lawton Gemma

    2005-03-01

    Background: As assessment has been shown to direct learning, it is critical that the examinations developed to test clinical competence in medical undergraduates are valid and reliable. The use of extended matching questions (EMQ) has been advocated to overcome some of the criticisms of using multiple-choice questions to test factual and applied knowledge. Methods: We analysed the results from the Extended Matching Questions Examination taken by 4th year undergraduate medical students in the academic year 2001 to 2002. Rasch analysis was used to examine whether the set of questions used in the examination mapped on to a unidimensional scale, the degree of difficulty of questions within and between the various medical and surgical specialties and the pattern of responses within individual questions to assess the impact of the distractor options. Results: Analysis of a subset of items and of the full examination demonstrated internal construct validity and the absence of bias on the majority of questions. Three main patterns of response selection were identified. Conclusion: Modern psychometric methods based upon the work of Rasch provide a useful approach to the calibration and analysis of EMQ undergraduate medical assessments. The approach allows for a formal test of the unidimensionality of the questions and thus the validity of the summed score. Given the metric calibration which follows fit to the model, it also allows for the establishment of item banks to facilitate continuity and equity in exam standards.

  4. Measuring Consumers’ Environmental Responsibility: A Synthesis of Constructs and Measurement Scale Items

    Directory of Open Access Journals (Sweden)

    K. M. R. Taufique

    2014-04-01

    It is universally true that consumption is central to all production. Without proper management, production and consumption are likely to be the main sources of environmental problems. This reality calls for consumers to be environmentally responsible in their consumption behavior. The objective of this paper is to prepare a synthesis of all the possible factors and measurement scale items to be used for assessing consumers’ environmental responsibility. For this synthesis, all major works in the field were thoroughly reviewed. The paper arrives at a total of six parameters: knowledge and awareness, attitude, green consumer value, emotional affinity toward nature, willingness to act, and environment-related past behavior. This tentative yet inclusive set of parameters is thought to be useful for guiding the design of large-scale future empirical research aimed at developing a dependable, inclusive set of parameters to test consumers’ environmental responsibility. A conceptual model and possible measurement items are proposed for further empirical research.

  5. Which person variables predict how people benefit from True-False over Constructed Response items?

    Directory of Open Access Journals (Sweden)

    Stella Bollmann

    2015-06-01

    The aim of this study was to investigate the variable Benefit from TF, which we assumed to be additionally measured when using True-False instead of Constructed Response tests. Subjects who benefit from True-False have an advantage over other subjects in answering Multiple Choice or True-False exams. We expected it to be related to partial knowledge and examined its relation to other personal abilities and traits in a total of n = 106 psychology students. They completed a statistics exam in Constructed Response and True-False format, and benefit items were defined as those to which the associated constructed response answer was not correct. Additionally, verbal intelligence and Big 5 measures were obtained. Results confirm the existence of the person variable Benefit from TF and its relation to partial knowledge. Furthermore, benefiters differed from others in conscientiousness and openness to experience variables. However, contrary to expectations, they did not differ in verbal IQ.

  6. Harmonization of Neuroticism and Extraversion phenotypes across inventories and cohorts in the Genetics of Personality Consortium: an application of Item Response Theory

    DEFF Research Database (Denmark)

    van den Berg, S. M.; de Moor, M. H. M.; McGue, Matt;

    2014-01-01

    Mega- or meta-analytic studies (e.g. genome-wide association studies) are increasingly used in behavior genetics. An issue in such studies is that phenotypes are often measured by different instruments across study cohorts, requiring harmonization of measures so that more powerful fixed effect meta-analyses can be employed. Within the Genetics of Personality Consortium, we demonstrate for two clinically relevant personality traits, Neuroticism and Extraversion, how Item-Response Theory (IRT) can be applied to map item data from different inventories to the same underlying constructs. Personality item... -analysis of six twin cohorts, total N = 29,496 and 29,501 twin pairs, respectively) with a significant part of the heritability due to non-additive genetic factors. For Extraversion, these genetic factors qualitatively differ across sexes. We showed that our IRT method can lead to a large increase in sample size...

  7. An item response theory analysis of self-report measures of adult attachment.

    Science.gov (United States)

    Fraley, R C; Waller, N G; Brennan, K A

    2000-02-01

    Self-report measures of adult attachment are typically scored in ways (e.g., averaging or summing items) that can lead to erroneous inferences about important theoretical issues, such as the degree of continuity in attachment security and the differential stability of insecure attachment patterns. To determine whether existing attachment scales suffer from scaling problems, the authors conducted an item response theory (IRT) analysis of 4 commonly used self-report inventories: Experiences in Close Relationships scales (K. A. Brennan, C. L. Clark, & P. R. Shaver, 1998), Adult Attachment Scales (N. L. Collins & S. J. Read, 1990), Relationship Styles Questionnaire (D. W. Griffin & K. Bartholomew, 1994) and J. Simpson's (1990) attachment scales. Data from 1,085 individuals were analyzed using F. Samejima's (1969) graded response model. The authors' findings indicate that commonly used attachment scales can be improved in a number of important ways. Accordingly, the authors show how IRT techniques can be used to develop new attachment scales with desirable psychometric properties.
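
    As background for the graded response model mentioned above, the sketch below shows how category probabilities arise as differences between adjacent cumulative logistic curves. The function name and the parameter values are illustrative assumptions, not estimates from the study.

```python
import math


def grm_category_probs(theta: float, a: float, thresholds: list[float]) -> list[float]:
    """Category probabilities under Samejima's graded response model.

    thresholds must be strictly increasing; K - 1 thresholds define
    K ordered response categories. P(X >= k) follows a 2PL-style curve,
    and each category probability is the difference between adjacent
    cumulative probabilities.
    """
    cumulative = (
        [1.0]
        + [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
        + [0.0]
    )
    return [cumulative[k] - cumulative[k + 1] for k in range(len(cumulative) - 1)]


# Hypothetical 4-category item evaluated at an average trait level.
probs = grm_category_probs(theta=0.0, a=1.0, thresholds=[-1.0, 0.0, 1.0])
```

    The probabilities sum to 1 by construction, and with increasing thresholds every category probability is non-negative.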

  8. Incorporating Mobility in Growth Modeling for Multilevel and Longitudinal Item Response Data.

    Science.gov (United States)

    Choi, In-Hee; Wilson, Mark

    2016-01-01

    Multilevel data often cannot be represented by the strict form of hierarchy typically assumed in multilevel modeling. A common example is the case in which subjects change their group membership in longitudinal studies (e.g., students transfer schools; employees transition between different departments). In this study, cross-classified and multiple membership models for multilevel and longitudinal item response data (CCMM-MLIRD) are developed to incorporate such mobility, focusing on students' school change in large-scale longitudinal studies. Furthermore, we investigate the effect of incorrectly modeling school membership in the analysis of multilevel and longitudinal item response data. Two types of school mobility are described, and corresponding models are specified. Results of the simulation studies suggested that appropriate modeling of the two types of school mobility using the CCMM-MLIRD yielded good recovery of the parameters and improvement over models that did not incorporate mobility properly. In addition, the consequences of incorrectly modeling the school effects on the variance estimates of the random effects and the standard errors of the fixed effects depended upon mobility patterns and model specifications. Two sets of large-scale longitudinal data are analyzed to illustrate applications of the CCMM-MLIRD for each type of school mobility.

  9. Development and psychometric properties of the Suicidality of Adolescent Screening Scale (SASS) using Multidimensional Item Response Theory.

    Science.gov (United States)

    Sukhawaha, Supattra; Arunpongpaisal, Suwanna; Hurst, Cameron

    2016-09-30

    Suicide prevention in adolescents by early detection using screening tools to identify high suicidal risk is a priority. Our objective was to build a multidimensional scale, the "Suicidality of Adolescent Screening Scale (SASS)", to identify adolescents at risk of suicide. An initial pool of items was developed by using in-depth interviews, focus groups and a literature review. Initially, 77 items were administered to 307 adolescents and analyzed using exploratory Multidimensional Item Response Theory (MIRT) to remove unnecessary items. A subsequent exploratory factor analysis revealed 35 items that grouped into 4 factors: Stressors, Pessimism, Suicidality and Depression. To confirm this structure, a new sample of 450 adolescents was collected and confirmatory MIRT factor analysis was performed. The resulting scale was shown to be both construct valid and able to discriminate well between adolescents who had and had not previously attempted suicide. PMID:27450746

  10. Understanding the Relation between Attitude Involvement and Response Latitude Using Item Response Theory

    Science.gov (United States)

    Lake, Christopher J.; Withrow, Scott; Zickar, Michael J.; Wood, Nicole L.; Dalal, Dev K.; Bochinski, Joseph

    2013-01-01

    Adapting the original latitude of acceptance concept to Likert-type surveys, response latitudes are defined as the range of graded response options a person is willing to endorse. Response latitudes were expected to relate to attitude involvement such that high involvement was linked to narrow latitudes (the result of selective, careful…

  11. Mathematical literacy examination items and student errors: An analysis of English Second Language students’ responses

    Directory of Open Access Journals (Sweden)

    Pamela Vale

    2013-04-01

    Mathematical literacy is a real-world practical attribute, yet students write a high-stakes examination in order to pass the subject Mathematical Literacy in the National Certificates (Vocational) (NC(V)). In these examinations, all sources of information are contextualised in language. It can be effortful for English second language students to decode text. The deliberate processing that is required saturates working memory and prevents these students from optimally engaging in problem solving. In this study, 15 items from an NC(V) Level 4 Mathematical Literacy examination are selected, as well as 15 student responses to each of these questions. From these responses, those which are incorrect are analysed to determine whether the error is due to insufficient mathematical literacy or a lack of English language proficiency. These results are used as an indication as to whether the examination is fair and valid for this group of students.

  12. Item Response Theory Analyses of Adult Self-Ratings of the ADHD Symptoms in the Current Symptoms Scale

    Science.gov (United States)

    Gomez, Rapson

    2011-01-01

    The graded response model, which is based on item response theory, was used to evaluate the psychometric properties of adult self-ratings (N = 852) of the attention deficit/hyperactivity disorder inattention, hyperactivity, and impulsivity symptoms presented in the Current Symptoms Scale. This scale has four ordered response categories. The…

  13. Evaluation of diagnostic criteria for night eating syndrome using item response theory analysis.

    Science.gov (United States)

    Allison, Kelly C; Engel, Scott G; Crosby, Ross D; de Zwaan, Martina; O'Reardon, John P; Wonderlich, Stephen A; Mitchell, James E; West, Delia Smith; Wadden, Thomas A; Stunkard, Albert J

    2008-12-01

    Uniform diagnostic criteria for the night eating syndrome (NES), a disorder characterized by a delay in the circadian pattern of eating, have not been established. Proposed criteria for NES were evaluated using item response theory (IRT) analysis. Six studies yielded 1,481 Night Eating Questionnaires which were coded to reflect the presence/absence of five night eating symptoms. Symptoms were evaluated based on the clinical usefulness of their diagnostic information and on the assumptions of IRT analysis (unidimensionality, monotonicity, local item independence, correct model specification), using a two parameter logistic (2PL) IRT model. Reports of (1) nocturnal eating and/or evening hyperphagia, (2) initial insomnia, and (3) night awakenings showed high precision in discriminating those with night eating problems, while morning anorexia and delayed morning meal provided little additional information. IRT is a useful tool for evaluating the diagnostic criteria of psychiatric disorders and can be used to evaluate potential diagnostic criteria of NES empirically. Behavioral factors were identified as useful discriminators of NES. Future work should also examine psychological factors in conjunction with those identified here. PMID:18928902
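
    The statement that some symptoms "provided little additional information" has a concrete meaning under the 2PL model: an item's Fisher information at trait level theta is a² · P(theta) · (1 − P(theta)). The sketch below uses made-up parameter values to contrast a sharply discriminating item with a weak one; none of the numbers come from the study.

```python
import math


def item_information_2pl(theta: float, a: float, b: float) -> float:
    """Fisher information of a 2PL item at trait level theta."""
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)


# Information peaks at theta = b, where P = 0.5, so the peak equals a^2 / 4.
strong_item_peak = item_information_2pl(theta=1.0, a=2.0, b=1.0)
weak_item_peak = item_information_2pl(theta=1.0, a=0.5, b=1.0)
```

    Because the peak scales with the square of the discrimination parameter, a low-discrimination symptom contributes little to measurement precision anywhere on the trait continuum.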

  14. Deconstructing the architecture of alcohol abuse and dependence symptoms in a community sample of late adolescent and emerging adult women: an item response approach.

    Science.gov (United States)

    Duncan, Alexis E; Agrawal, Arpana; Bucholz, Kathleen K; Sartor, Carolyn E; Madden, Pamela A F; Heath, Andrew C

    2011-07-01

    The objective of this study was to examine the underlying factorial architecture of lifetime DSM-IV alcohol use disorder (AUD) criteria in a population-based sample of adolescent and emerging adult female twins who had ever used alcohol (n=2832; aged 18-25 years), and to determine whether thresholds and factor loadings differed by age. Item response modeling was applied to DSM-IV AUD criteria. Compound criteria (e.g., persistent desire or unsuccessful attempts to quit or cut down) were included as separate items. Of the remaining 16 items, tolerance and use despite physical problems were the most and least commonly endorsed items, respectively. Underlying the items was a single factor representing liability to AUDs. Factor loadings ranged from 0.67 for blackouts to 0.90 for time spent using/recovering from effects. Some items assessing different DSM-IV criteria had very similar measurement characteristics, while others assessing the same criterion showed markedly different thresholds and factor loadings. Compared to that of women aged 21-25 years, the threshold for hazardous use was higher in women aged 18-20 years, but lower for used longer than intended and persistent desire to cut down. After accounting for threshold differences, no variations in discrimination across age groups were observed. In agreement with the extant literature, our findings indicate that the factorial structure of AUD is unidimensional, with no support for the abuse/dependence distinction. Individual components of compound criteria may differ in measurement properties; therefore pooling information from such divergent items will reduce information about the AUD construct. PMID:21306836
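
    Factor loadings and thresholds such as those reported above can be re-expressed as IRT discrimination and difficulty parameters. The sketch below uses the common conversion for a one-factor model of binary items, assuming a normal-ogive (probit) link and a standardized latent response; the abstract does not state which parameterization the authors used, so this is an assumption, and the threshold value in the example is hypothetical.

```python
import math


def loading_to_irt(loading: float, threshold: float) -> tuple[float, float]:
    """Convert a standardized factor loading and threshold to (a, b).

    Assumes a one-factor normal-ogive model:
        a = loading / sqrt(1 - loading^2)
        b = threshold / loading
    The result is on the normal-ogive metric; multiplying a by about 1.7
    gives an approximate logistic-metric discrimination.
    """
    a = loading / math.sqrt(1.0 - loading ** 2)
    b = threshold / loading
    return a, b


# The two extreme loadings from the abstract, paired with a made-up threshold.
a_low, _ = loading_to_irt(0.67, threshold=1.0)
a_high, _ = loading_to_irt(0.90, threshold=1.0)
```

    The conversion is nonlinear, so the gap between loadings of 0.67 and 0.90 corresponds to a much larger relative gap in discrimination.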

  15. Disaster Preparedness and Response: Applied Exposure Science

    Science.gov (United States)

    In 2007, the ISEA, predecessor to ISES, held a special roundtable to discuss lessons learned for exposure science during and following environmental disasters, especially the 9/11 attacks and Hurricane Katrina. Since then, environmental agencies have been involved in responses to...

  16. Evaluation of Buss-Perry Aggression Questionnaire with item response theory (IRT)

    Directory of Open Access Journals (Sweden)

    Dinić Bojana

    2012-01-01

    The aim of this research was to examine the psychometric properties of the Buss-Perry Aggression Questionnaire (AQ) in a Serbian sample, using the IRT model for graded responses. The AQ contains four subscales: Physical aggression, Verbal aggression, Hostility and Anger. The sample included 1272 participants of both genders, aged 18 to 68 years, with an average age of 31.39 (SD = 12.63) years. Results of the IRT analysis suggested that the subscales were more informative in the range of above-average scores, namely for participants with higher levels of aggressiveness. The exception was the Hostility subscale, which was informative across a wider range of the trait. On the other hand, this subscale contains two items which violate the assumption of homogeneity. Implications for the measurement of aggressiveness are discussed.

  17. The Retrospect and Prospect of Non-parametric Item Response Theory

    Institute of Scientific and Technical Information of China (English)

    陈婧; 康春花; 钟晓玲

    2013-01-01

    Compared with parametric item response theory, non-parametric item response theory provides a theoretical framework that fits practical settings more closely. Current research on non-parametric item response theory focuses mainly on parameter estimation methods and their comparison and on verifying data-model fit, while applied research concentrates on scale revision and on the analysis of personality data and differential item functioning. Non-parametric cognitive diagnostic theory, which developed from cognitive diagnostic theory, further highlights the advantages of the approach in application. Future research should focus more on practical applications of non-parametric item response theory, and non-parametric cognitive diagnostic theory also deserves attention, so that the advantages of non-parametric methods in practice can be fully realized.

  19. Using Open-Response Fraction Items to Explore the Relationship between Instructional Modalities and Students' Solution Strategies

    Science.gov (United States)

    Shumway, Jessica F.; Moyer-Packenham, Patricia S.; Baker, Joseph M.; Westenskow, Arla; Anderson-Pence, Katie L.; Tucker, Stephen I.; Boyer-Thurgood, Jennifer; Jordan, Kerry E.

    2016-01-01

    The purpose of this study was to explore the relationship between instructional modality used for teaching fractions and third- and fourth-grade students' responses and strategies to open-response fraction items. The participants were 155 third-grade and 200 fourth-grade students from 17 public school classrooms. Students within each class were…

  20. Item Response Theory Analyses of the Parent and Teacher Ratings of the DSM-IV ADHD Rating Scale

    Science.gov (United States)

    Gomez, Rapson

    2008-01-01

    The graded response model (GRM), which is based on item response theory (IRT), was used to evaluate the psychometric properties of the inattention and hyperactivity/impulsivity symptoms in an ADHD rating scale. To accomplish this, parents and teachers completed the DSM-IV ADHD Rating Scale (DARS; Gomez et al., "Journal of Child Psychology and…

  1. Use of Item Response Theory to Examine a Cardiovascular Health Knowledge Measure for Adolescents with Elevated Blood Pressure

    Directory of Open Access Journals (Sweden)

    Stephanie L. Fitzpatrick

    2012-10-01

    The purpose of this study was to assess the psychometric properties of a cardiovascular health knowledge measure for adolescents using item response theory. The measure was developed in the context of a cardiovascular lifestyle intervention for adolescents with elevated blood pressure. The sample consisted of 167 adolescents (mean age = 16.2 years) who completed the Cardiovascular Health Knowledge Assessment (CHKA), a 34-item multiple-choice test, at baseline and post-intervention. The CHKA was unidimensional, and internal consistency was .65 at pretest and .74 at posttest. Rasch analysis results indicated that at pretest the items targeted adolescents with variable levels of health knowledge. However, based on results at posttest, additional hard items are needed to account for the increase in level of cardiovascular health knowledge at post-intervention. Change in knowledge scores was examined using Rasch analysis. Findings indicated there was significant improvement in health knowledge over time [t(119) = -10.3, p < .0001]. In summary, the CHKA appears to contain items that are good approximations of the construct cardiovascular health knowledge and items that target adolescents with moderate levels of knowledge. DOI: 10.2458/azu_jmmss.v3i1.16111

  2. Analysis of Culture-Specific Items and Translation Strategies Applied in Translating Jalal Al-Ahmad's "By the Pen"

    Science.gov (United States)

    Daghoughi, Shekoufeh; Hashemian, Mahmood

    2016-01-01

    Due to differences across languages, meanings and concepts vary across different languages, too. The most obvious points of difference between languages appear in their literature and their culture-specific items (CSIs), which lead to complexities when transferring meanings and concepts from one language into another. To overcome the complexities…

  3. Proposal of a tool to assess the satisfaction of bank customers using Item Response Theory

    Directory of Open Access Journals (Sweden)

    Alceu Balbim Junior

    2011-01-01

    This paper presents a measurement instrument for assessing the satisfaction of bank customers using Item Response Theory (IRT). Organizations that seek to remain competitive are constantly making an effort to satisfy their customers. Several studies have reported on the relationship between perceived quality, satisfaction, and loyalty. Satisfaction can be assessed through the quality perceived by customers, and the development of assessment tools should address specific features of the activity in question. Based on articles that assess the satisfaction of bank customers, this study proposes an assessment tool consisting of 29 items. The items were administered to 240 customers to assess their satisfaction with the bank they deal with most. Using Item Response Theory, the item parameters and the information curve were identified. Analysis of the items' degree of discrimination indicated that all items are appropriate. The information curve showed the interval over which the instrument gives the best estimates of satisfaction level. The study reports the mean satisfaction level of the sample and the concentration of customers at the different satisfaction levels of the scale.

  4. Bifactor and Item Response Theory Analyses of Interviewer Report Scales of Cognitive Impairment in Schizophrenia

    Science.gov (United States)

    Reise, Steven P.; Ventura, Joseph; Keefe, Richard S. E.; Baade, Lyle E.; Gold, James M.; Green, Michael F.; Kern, Robert S.; Mesholam-Gately, Raquelle; Nuechterlein, Keith H.; Seidman, Larry J.; Bilder, Robert

    2011-01-01

    A psychometric analysis of 2 interview-based measures of cognitive deficits was conducted: the 21-item Clinical Global Impression of Cognition in Schizophrenia (CGI-CogS; Ventura et al., 2008), and the 20-item Schizophrenia Cognition Rating Scale (SCoRS; Keefe et al., 2006), which were administered on 2 occasions to a sample of people with…

  5. Interpreting gains and losses in conceptual test using Item Response Theory

    CERN Document Server

    Lamine, Brahim

    2015-01-01

    Conceptual tests are widely used by physics instructors to assess students' conceptual understanding and compare teaching methods. It is common to look at changes in students' answers between a pre-test and a post-test to quantify a transition in students' conceptions. This is often done by looking at the proportion of incorrect answers in the pre-test that change to correct answers in the post-test (the gain) and the proportion of correct answers that change to incorrect answers (the loss). By comparing theoretical predictions to experimental data on the Force Concept Inventory, we show that Item Response Theory (IRT) predicts the observed gains and losses fairly well. We then use IRT to quantify students' changes in a test-retest situation when no learning occurs and show that (i) up to 25% of total answers can change due to the non-deterministic nature of students' answers and that (ii) gains and losses can range from 0% to 100%. Still using IRT, we highlight the conditions tha...

  6. An Analysis of Cross Racial Identity Scale Scores Using Classical Test Theory and Rasch Item Response Models

    Science.gov (United States)

    Sussman, Joshua; Beaujean, A. Alexander; Worrell, Frank C.; Watson, Stevie

    2013-01-01

    Item response models (IRMs) were used to analyze Cross Racial Identity Scale (CRIS) scores. Rasch analysis scores were compared with classical test theory (CTT) scores. The partial credit model demonstrated a high goodness of fit and correlations between Rasch and CTT scores ranged from 0.91 to 0.99. CRIS scores are supported by both methods.…

  7. Negative affectivity and social inhibition in cardiovascular disease: evaluating type-D personality and its assessment using item response theory

    NARCIS (Netherlands)

    Emons, Wilco H.M.; Meijer, Rob R.; Denollet, Johan

    2007-01-01

    Objective: Individuals with increased levels of both negative affectivity (NA) and social inhibition (SI)—referred to as type-D personality—are at increased risk of adverse cardiac events. We used item response theory (IRT) to evaluate NA, SI, and type-D personality as measured by the DS14. The obje

  8. Development of an Abbreviated Social Phobia and Anxiety Inventory (SPAI) Using Item Response Theory: The SPAI-23

    Science.gov (United States)

    Roberson-Nay, Roxann; Strong, David R.; Nay, William T.; Beidel, Deborah C.; Turner, Samuel M.

    2007-01-01

    An abbreviated version of the Social Phobia and Anxiety Inventory (SPAI) was developed using methods based in nonparametric item response theory. Participants included a nonclinical sample of 1,482 undergraduates (52% female, mean age = 19.4 years) as well as a clinical sample of 105 individuals (56% female, mean age = 36.4 years) diagnosed with…

  9. Taking the Missing Propensity into Account When Estimating Competence Scores: Evaluation of Item Response Theory Models for Nonignorable Omissions

    Science.gov (United States)

    Köhler, Carmen; Pohl, Steffi; Carstensen, Claus H.

    2015-01-01

    When competence tests are administered, subjects frequently omit items. These missing responses pose a threat to correctly estimating the proficiency level. Newer model-based approaches aim to take nonignorable missing data processes into account by incorporating a latent missing propensity into the measurement model. Two assumptions are typically…

  10. Increasing the Number of Replications in Item Response Theory Simulations: Automation through SAS and Disk Operating System

    Science.gov (United States)

    Gagne, Phill; Furlow, Carolyn; Ross, Terris

    2009-01-01

    In item response theory (IRT) simulation research, it is often necessary to use one software package for data generation and a second software package to conduct the IRT analysis. Because this can substantially slow down the simulation process, it is sometimes offered as a justification for using very few replications. This article provides…

  11. The Effect of Using Different Weights for Multiple-Choice and Free-Response Item Sections

    Science.gov (United States)

    Hendrickson, Amy; Patterson, Brian; Melican, Gerald

    2008-01-01

    Presented at the Annual National Council on Measurement in Education (NCME) in New York in March 2008. This presentation explores how different item weighting can affect the effective weights, validity coefficients and test reliability of composite scores among test takers.

  12. Improving the Reliability of Student Scores from Speeded Assessments: An Illustration of Conditional Item Response Theory Using a Computer-Administered Measure of Vocabulary

    Science.gov (United States)

    Petscher, Yaacov; Mitchell, Alison M.; Foorman, Barbara R.

    2016-01-01

    A growing body of literature suggests that response latency, the amount of time it takes an individual to respond to an item, may be an important factor to consider when using assessment data to estimate the ability of an individual. Considering that tests of passage and list fluency are being adapted to a computer administration format, it is possible that accounting for individual differences in response times may be an increasingly feasible option to strengthen the precision of individual scores. The present research evaluated the differential reliability of scores when using classical test theory and item response theory as compared to a conditional item response model which includes response time as an item parameter. Results indicated that the precision of student ability scores increased by an average of 5 % when using the conditional item response model, with greater improvements for those who were average or high ability. Implications for measurement models of speeded assessments are discussed.

  13. Small Sample Estimation in Dichotomous Item Response Models: Effect of Priors Based on Judgmental Information on the Accuracy of Item Parameter Estimates. LSAC Research Report Series.

    Science.gov (United States)

    Swaminathan, Hariharan; Hambleton, Ronald K.; Sireci, Stephen G.; Xing, Dehui; Rizavi, Saba M.

    The primary objective of this study was to investigate how incorporating prior information improves estimation of item parameters in two small samples. The factors that were investigated were sample size and the type of prior information. To investigate the accuracy with which item parameters in the Law School Admission Test (LSAT) are estimated,…

  14. Item Characteristic Curve Estimation of Signal Detection Theory-Based Personality Data: A Two-Stage Approach to Item Response Modeling.

    Science.gov (United States)

    Williams, Kevin M.; Zumbo, Bruno D.

    2003-01-01

    Developed an item characteristic curve estimation of signal detection theory based personality data. Results for 266 college students taking the Overclaiming Questionnaire (D. Paulhus and N. Bruce, 1990) suggest that this method is a reasonable approach to describing item functioning and that there are advantages to this method over traditional…

  15. Applying Systems Design and Item Response Theory to the Problem of Measuring Information Literacy Skills.

    Science.gov (United States)

    O'Connor, Lisa G.; Radcliff, Carolyn J.; Gedeon, Julie A.

    2002-01-01

    Reports on the development of the Standardized Assessment of Information Literacy Skills (SAILS) at Kent State University (Ohio) for programmatic-level assessment of information literacy skills. Once validated, the instrument will be used to assess entry skills upon admission and longitudinally to ascertain whether there is significant change in…

  16. Applying the Nominal Response Model within a Longitudinal Framework to Construct the Positive Family Relationships Scale

    Science.gov (United States)

    Preston, Kathleen Suzanne Johnson; Parral, Skye N.; Gottfried, Allen W.; Oliver, Pamella H.; Gottfried, Adele Eskeles; Ibrahim, Sirena M.; Delany, Danielle

    2015-01-01

    A psychometric analysis was conducted using the nominal response model under the item response theory framework to construct the Positive Family Relationships scale. Using data from the Fullerton Longitudinal Study, this scale was constructed within a long-term longitudinal framework spanning middle childhood through adolescence. Items tapping…

  17. Item analysis of single-peaked response data : the psychometric evaluation of bipolar measurement scales

    NARCIS (Netherlands)

    Polak, Maaike Geertruida

    2011-01-01

    The thesis explains the fundamental difference between unipolar and bipolar measurement scales for psychological characteristics. We explore the use of correspondence analysis (CA), a technique that is similar to principal component analysis and is available in SAS and SPSS, to select items that tog

  18. A Substantive Process Analysis of Responses to Items from the Multistate Bar Examination

    Science.gov (United States)

    Bonner, Sarah M.; D'Agostino, Jerome V.

    2012-01-01

    We investigated examinees' cognitive processes while they solved selected items from the Multistate Bar Exam (MBE), a high-stakes professional certification examination. We focused on ascertaining those mental processes most frequently used by examinees, and the most common types of errors in their thinking. We compared the relationships between…

  19. Partially Compensatory Multidimensional Item Response Theory Models: Two Alternate Model Forms

    Science.gov (United States)

    DeMars, Christine E.

    2016-01-01

    Partially compensatory models may capture the cognitive skills needed to answer test items more realistically than compensatory models, but estimating the model parameters may be a challenge. Data were simulated to follow two different partially compensatory models, a model with an interaction term and a product model. The model parameters were…

  20. Comparisons across depression assessment instruments in adolescence and young adulthood: An Item Response Theory study using two linking methods

    OpenAIRE

    Olino, Thomas M.; Yu, Lan; McMakin, Dana L.; Forbes, Erika E.; John R. Seeley; Lewinsohn, Peter M.; Pilkonis, Paul A.

    2013-01-01

    Item response theory (IRT) methods allow for comparing the utility of instruments based on the range and precision of severity assessed by each instrument. As adolescents and young adults can display rapid increases in depressive symptoms, there is a crucial need to sensitively assess mild elevations of symptoms (as an index of initial risk) and moderate-severe symptoms (as an indicator of treatment disposition). We compare the information assessed by the Beck Depression Inventory (BDI) to th...

  1. World Health Organization Quality-of-Life Scale (WHOQOL-BREF): Analyses Of Their Item Response Theory Properties Based On The Graded Responses Model

    Directory of Open Access Journals (Sweden)

    Shahrum Vahedi

    2010-11-01

    Objective: This study used Item Response Theory (IRT) to examine the psychometric properties of Health-Related Quality-of-Life. Method: This investigation is a descriptive-analytic study. Subjects were 370 undergraduate students of nursing and midwifery who were selected from Tabriz University of Medical Sciences. All participants were asked to complete the Farsi version of WHOQOL-BREF. Samejima's graded response model was used for the analyses. Results: The results revealed that the discrimination parameters for all items in the four scales were low to moderate. The threshold parameters showed adequate representation of the relevant traits from low to the mean trait level. With the exception of items 15, 18, 24 and 26, all other items showed low item information function values, and thus relatively high reliability from low trait levels to moderate levels. Conclusions: The results of this study indicate that although there was general support for the psychometric properties of the WHOQOL-BREF from an IRT perspective, this measure can be further improved. IRT analyses provided useful measurement information and proved to be a better methodological approach for enhancing our knowledge of the functionality of WHOQOL-BREF.

  2. World Health Organization Quality-of-Life Scale (WHOQOL-BREF): Analyses of Their Item Response Theory Properties Based on the Graded Responses Model

    Science.gov (United States)

    2010-01-01

    Objective: This study used Item Response Theory (IRT) to examine the psychometric properties of Health-Related Quality-of-Life. Method: This investigation is a descriptive-analytic study. Subjects were 370 undergraduate students of nursing and midwifery who were selected from Tabriz University of Medical Sciences. All participants were asked to complete the Farsi version of WHOQOL-BREF. Samejima's graded response model was used for the analyses. Results: The results revealed that the discrimination parameters for all items in the four scales were low to moderate. The threshold parameters showed adequate representation of the relevant traits from low to the mean trait level. With the exception of items 15, 18, 24 and 26, all other items showed low item information function values, and thus relatively high reliability from low trait levels to moderate levels. Conclusions: The results of this study indicate that although there was general support for the psychometric properties of the WHOQOL-BREF from an IRT perspective, this measure can be further improved. IRT analyses provided useful measurement information and proved to be a better methodological approach for enhancing our knowledge of the functionality of WHOQOL-BREF. PMID:22952508
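
    As a rough illustration of the graded response model named in the record above, the sketch below computes category probabilities for one polytomous item as differences of adjacent cumulative logistic curves. The function names and the parameter values (discrimination a = 1.3 and four thresholds) are hypothetical and are not taken from the study.

```python
import math


def cumulative_prob(theta, a, b):
    """P(response in category k or higher) as a logistic curve in theta."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def grm_category_probs(theta, a, thresholds):
    """Category probabilities for one item under a graded response model.

    thresholds must be strictly increasing; an item with m thresholds
    has m + 1 ordered response categories.
    """
    # Boundary curves: P(X >= lowest) = 1 and P(X > highest) = 0.
    cum = [1.0] + [cumulative_prob(theta, a, b) for b in thresholds] + [0.0]
    # Each category probability is the gap between adjacent cumulative curves.
    return [cum[k] - cum[k + 1] for k in range(len(cum) - 1)]


# Hypothetical 5-category item (values chosen for illustration only).
EXAMPLE_PROBS = grm_category_probs(theta=0.0, a=1.3, thresholds=[-2.0, -0.5, 0.5, 2.0])
```

    A low discrimination value flattens the cumulative curves, so the category probabilities change slowly with the trait level and the item carries little information.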

  3. A responsible agenda for applied linguistics: Confessions of a philosopher

    Directory of Open Access Journals (Sweden)

    Albert Weideman

    2011-08-01

    When we undertake academic, disciplinary work, we rely on philosophical starting points. Several straightforward illustrations of this can be found in the history of applied linguistics. It is evident from the history of our field that various historically influential approaches to our discipline base themselves upon different academic confessions. This paper examines the effects of basing our applied linguistic work on the idea that applied linguistics is a discipline concerned with design. Such a characterisation does justice to both modernist and postmodernist emphases in applied linguistics. Conceptualisations of applied linguistics that came with the proposals for communicative language teaching (CLT) some thirty to forty years ago propelled the discipline squarely into postmodern times. To account for this, we need to develop a theory of applied linguistics which shows what constitutive and regulative conditions exist for doing applied linguistic designs. A responsible agenda for applied linguistics today has as its first responsibility to free the users of its designs from toil and drudgery, as well as from becoming victims of fashion, ideology or theory. Secondly, it should design solutions to language problems in such a way that the technical imagination of the designer is not restricted but supported by theory and empirical investigation, and that the productive pedagogical fantasy of the implementers of such plans is set free. Thirdly, it must seek to become accountable by designing theoretically and socially defensible solutions to language problems, solutions that relieve some of the suffering, pain, poverty and injustice in our world.

  4. Development of new physical activity and sedentary behavior change self-efficacy questionnaires using item response modeling

    Directory of Open Access Journals (Sweden)

    Venditti Elizabeth

    2009-03-01

    Background: Theoretically, increased levels of physical activity self-efficacy (PASE) should lead to increased physical activity, but few studies have reported this effect among youth. This failure may be at least partially attributable to measurement limitations. In this study, Item Response Modeling (IRM) was used to develop new physical activity and sedentary behavior change self-efficacy scales. The validity of the new scales was compared with accelerometer assessments of physical activity and sedentary behavior. Methods: New PASE and sedentary behavior change (TV viewing, computer video game use, and telephone use) self-efficacy items were developed. The scales were completed by 714 6th-grade students in seven US cities. A limited number of participants (83) also wore an accelerometer for five days and provided at least 3 full days of complete data. The new scales were analyzed using Classical Test Theory (CTT) and IRM; a reduced set of items was produced with IRM and correlated with accelerometer counts per minute and minutes of sedentary, light and moderate to vigorous activity per day after school. Results: The PASE items discriminated between high and low levels of PASE. Full and reduced scales were weakly correlated (r = 0.18) with accelerometer counts per minute after school for boys, with comparable associations for girls. Weaker correlations were observed between PASE and minutes of moderate to vigorous activity (r = 0.09 – 0.11). The uni-dimensionality of the sedentary scales was established by both exploratory factor analysis and the fit of items to the underlying variable, and reliability was assessed across the length of the underlying variable with some limitations. The reduced sedentary behavior scales had poor reliability. The full scales were moderately correlated with light intensity physical activity after school (r = 0.17 to 0.33) and sedentary behavior (r = -0.29 to -0.12) among the boys, but not for girls. Conclusion: New...

  5. Testing whether the DSM-5 personality disorder trait model can be measured with a reduced set of items: An item response theory investigation of the Personality Inventory for DSM-5.

    Science.gov (United States)

    Maples, Jessica L; Carter, Nathan T; Few, Lauren R; Crego, Cristina; Gore, Whitney L; Samuel, Douglas B; Williamson, Rachel L; Lynam, Donald R; Widiger, Thomas A; Markon, Kristian E; Krueger, Robert F; Miller, Joshua D

    2015-12-01

    The fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5) includes an alternative model of personality disorders (PDs) in Section III, consisting in part of a pathological personality trait model. To date, the 220-item Personality Inventory for DSM-5 (PID-5; Krueger, Derringer, Markon, Watson, & Skodol, 2012) is the only extant self-report instrument explicitly developed to measure this pathological trait model. The present study used item response theory-based analyses in a large sample (n = 1,417) to investigate whether a reduced set of 100 items could be identified from the PID-5 that could measure the 25 traits and 5 domains. This reduced set of PID-5 items was then tested in a community sample of adults currently receiving psychological treatment (n = 109). Across a wide range of criterion variables including NEO PI-R domains and facets, DSM-5 Section II PD scores, and externalizing and internalizing outcomes, the correlational profiles of the original and reduced versions of the PID-5 were nearly identical (rICC = .995). These results provide strong support for the hypothesis that an abbreviated set of PID-5 items can be used to reliably, validly, and efficiently assess these personality disorder traits. The ability to assess the DSM-5 Section III traits using only 100 items has important implications in that it suggests these traits could still be measured in settings in which assessment-related resources (e.g., time, compensation) are limited.

  6. Testing whether the DSM-5 personality disorder trait model can be measured with a reduced set of items: An item response theory investigation of the Personality Inventory for DSM-5.

    Science.gov (United States)

    Maples, Jessica L; Carter, Nathan T; Few, Lauren R; Crego, Cristina; Gore, Whitney L; Samuel, Douglas B; Williamson, Rachel L; Lynam, Donald R; Widiger, Thomas A; Markon, Kristian E; Krueger, Robert F; Miller, Joshua D

    2015-12-01

    The fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5) includes an alternative model of personality disorders (PDs) in Section III, consisting in part of a pathological personality trait model. To date, the 220-item Personality Inventory for DSM-5 (PID-5; Krueger, Derringer, Markon, Watson, & Skodol, 2012) is the only extant self-report instrument explicitly developed to measure this pathological trait model. The present study used item response theory-based analyses in a large sample (n = 1,417) to investigate whether a reduced set of 100 items could be identified from the PID-5 that could measure the 25 traits and 5 domains. This reduced set of PID-5 items was then tested in a community sample of adults currently receiving psychological treatment (n = 109). Across a wide range of criterion variables including NEO PI-R domains and facets, DSM-5 Section II PD scores, and externalizing and internalizing outcomes, the correlational profiles of the original and reduced versions of the PID-5 were nearly identical (rICC = .995). These results provide strong support for the hypothesis that an abbreviated set of PID-5 items can be used to reliably, validly, and efficiently assess these personality disorder traits. The ability to assess the DSM-5 Section III traits using only 100 items has important implications in that it suggests these traits could still be measured in settings in which assessment-related resources (e.g., time, compensation) are limited. PMID:25844534

  7. Using item response theory to study the convergent and discriminant validity of three questionnaires measuring cigarette dependence.

    Science.gov (United States)

    Courvoisier, Delphine; Etter, Jean-François

    2008-09-01

    To determine whether the Cigarette Dependence Scale, the Fagerström Test for Nicotine Dependence, and the Nicotine Dependence Syndrome Scale (NDSS) reliably and correctly assessed both weakly and severely dependent individuals, the authors collected data via Internet from 2,435 current smokers, from 2004 to 2007. They used a 2-parameter item response model to determine the difficulty and discrimination of each question and used correlations between latent scores to assess convergent and discriminant validity. The reliability of all scales was close to or exceeded .70. Both the Cigarette Dependence Scale and the Fagerström Test for Nicotine Dependence had 1 misfitting item. Each NDSS scale had at least 2 misfitting items. The information curve of each of the questionnaires peaked between -2 and 2 and was low at both extremes. All questionnaires had adequate reliability and were more informative for a medium level of the underlying cigarette dependence continuum than for both extremes of this continuum. The correlations between latent scores indicated good convergent validity between questionnaires and low discriminant validity between NDSS subscales, except for Tolerance. This result suggests that nicotine dependence may not be composed of 5 dimensions but may be unidimensional and distinct from reduced sensitivity to the effects of smoking (Tolerance). PMID:18778132

  8. Item Response Theory Analysis of Two Questionnaire Measures of Arthritis-Related Self-Efficacy Beliefs from Community-Based US Samples

    Directory of Open Access Journals (Sweden)

    Thelma J. Mielenz

    2010-01-01

    Using item response theory (IRT), we examined the Rheumatoid Arthritis Self-efficacy scale (RASE) collected from a People with Arthritis Can Exercise RCT (346 participants) and 2 subscales of the Arthritis Self-efficacy scale (ASE) collected from an Active Living Every Day (ALED) RCT (354 participants) to determine which one better identifies low arthritis self-efficacy in community-based adults with arthritis. The item parameters were estimated in Multilog using the graded response model. The 2 ASE subscales are adequately explained by one factor. There was evidence for 2 locally dependent item pairs; two items from these pairs were removed when we reran the model. The exploratory factor analysis results for RASE showed a multifactor solution which led to a 9-factor solution. In order to perform IRT analysis, one item from each of the 9 subfactors was selected. Both scales were effective at measuring a range of arthritis SE.

  9. Development of the Knee Quality of Life (KQoL-26) 26-item questionnaire: data quality, reliability, validity and responsiveness

    Directory of Open Access Journals (Sweden)

    Atwell Chris

    2008-07-01

    Background: This article describes the development and validation of a self-reported questionnaire, the KQoL-26, that is based on the views of patients with a suspected ligamentous or meniscal injury of the knee and that assesses the impact of their knee problem on the quality of their lives. Methods: Patient interviews and focus groups were used to derive questionnaire content. The instrument was assessed for data quality, reliability, validity, and responsiveness using data from a randomised trial and patient survey about general practitioners' use of Magnetic Resonance Imaging for patients with a suspected ligamentous or meniscal injury. Results: Interview and focus group data produced a 40-item questionnaire designed for self-completion. 559 trial patients and 323 survey patients responded to the questionnaire. Following principal components analysis and Rasch analysis, 26 items were found to contribute to three scales of knee-related quality of life: physical functioning, activity limitations, and emotional functioning. Item-total correlations ranged from 0.60–0.82. Cronbach's alpha and test-retest reliability estimates were 0.91–0.94 and 0.80–0.93 respectively. Hypothesised correlations with the Lysholm Knee Scale, EQ-5D, SF-36 and knee symptom questions were evidence for construct validity. The instrument produced highly significant change scores for 65 trial patients indicating that their knee was a little or somewhat better at six months. The new instrument had higher effect sizes (range 0.86–1.13) and responsiveness statistics (range 1.50–2.13) than the EQ-5D and SF-36. Conclusion: The KQoL-26 has good evidence for internal reliability, test-retest reliability, validity and responsiveness, and is recommended for use in randomised trials and other evaluative studies of patients with a suspected ligamentous or meniscal injury.

  10. Item Banking with Embedded Standards

    Science.gov (United States)

    MacCann, Robert G.; Stanley, Gordon

    2009-01-01

    An item banking method that does not use Item Response Theory (IRT) is described. This method provides a comparable grading system across schools that would be suitable for low-stakes testing. It uses the Angoff standard-setting method to obtain item ratings that are stored with each item. An example of such a grading system is given, showing how…
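
    The abstract does not spell out its grading system, so the following is only a sketch of the standard Angoff calculation it builds on: each judge estimates the probability that a minimally competent candidate answers an item correctly, and the cut score for a test form is the sum of the mean ratings of its items. The item identifiers and ratings are hypothetical.

```python
def angoff_cut_score(ratings):
    """Sum of mean judge ratings over the items on a test form.

    ratings: dict mapping item id -> list of judges' probability estimates
    that a minimally competent candidate answers the item correctly.
    """
    return sum(sum(judges) / len(judges) for judges in ratings.values())


# Hypothetical ratings stored alongside three banked items
form_ratings = {
    "q1": [0.5, 0.7],
    "q2": [0.8, 0.8],
    "q3": [0.25, 0.75],
}
cut = angoff_cut_score(form_ratings)  # expected raw-score cut for this form
```

    Because the ratings travel with the items, a cut score of this kind can in principle be recomputed for any form assembled from the bank, without fitting an IRT model.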

  11. Performance of the Family Satisfaction with the End-of-Life Care (FAMCARE) measure in an ethnically diverse cohort: Psychometric analyses using item response theory

    OpenAIRE

    Teresi, Jeanne A.; Ornstein, Katherine; Ocepek-Welikson, Katja; Ramirez, Mildred; Siu, Albert

    2013-01-01

    The Family Satisfaction with End-of-Life Care (FAMCARE) has been used widely among caregivers to individuals with cancer. The aim of this study was to evaluate the psychometric properties of this measure using item response theory (IRT).

  12. Factors affecting study efficiency and item non-response in health surveys in developing countries: the Jamaica national healthy lifestyle survey

    Directory of Open Access Journals (Sweden)

    Bennett Franklyn

    2007-02-01

    Background: Health surveys provide important information on the burden and secular trends of risk factors and disease. Several factors, including survey and item non-response, can affect data quality. There are few reports from developing countries on efficiency, validity and the impact of item non-response. This report examines factors associated with item non-response and study efficiency in a national health survey in a developing Caribbean island. Methods: A national sample of participants aged 15–74 years was selected in a multi-stage sampling design accounting for 4 health regions and 14 parishes, using enumeration districts as primary sampling units. Means and proportions of the variables of interest were compared between various categories. Non-response was defined as failure to provide an analyzable response. Linear and logistic regression models accounting for sample design and post-stratification weighting were used to identify independent correlates of recruitment efficiency and item non-response. Results: We recruited 2012 15–74 year-olds (66.2% females) at a response rate of 87.6%, with significant variation between regions (80.9% to 97.6%; p ... Conclusion: Informative health surveys are possible in developing countries. While survey response rates may be satisfactory, item non-response was high in respect of income and sexual practice. In contrast to developed countries, non-response to questions on income is higher and has different correlates. These findings can inform future surveys.

  13. Differential sensitivity theory applied to movement of maxima responses. [LMFBR]

    Energy Technology Data Exchange (ETDEWEB)

    Maudlin, P.J.; Parks, C.V.; Cacuci, D.G.

    1981-01-01

    Differential sensitivity theory (DST) is a recently developed methodology to evaluate response derivatives dR/dα by using adjoint functions which correspond to the differentiated (with respect to an arbitrary parameter α) linear or nonlinear physical system of equations. However, for many problems, where responses of importance are local maxima such as peak temperature, power, or heat flux, changes in the phase space location of the peak itself are of interest. This summary will present the DST procedure for predicting phase space shifts of maxima responses as applied to the MELT-III fast reactor safety code. An FFTF protected transient involving a $.23/s ramp reactivity insertion with scram on high power was selected for investigation.

  14. Uncertainties in the Item Parameter Estimates and Robust Automated Test Assembly

    Science.gov (United States)

    Veldkamp, Bernard P.; Matteucci, Mariagiulia; de Jong, Martijn G.

    2013-01-01

    Item response theory parameters have to be estimated, and because of the estimation process, they do have uncertainty in them. In most large-scale testing programs, the parameters are stored in item banks, and automated test assembly algorithms are applied to assemble operational test forms. These algorithms treat item parameters as fixed values,…

  15. Magnetic response to applied electrostatic field in external magnetic field

    Energy Technology Data Exchange (ETDEWEB)

    Adorno, T.C. [Universidade de Sao Paulo, Instituto de Fisica, Caixa Postal 66318, Sao Paulo, SP (Brazil); University of Florida, Department of Physics, Gainesville, FL (United States); Gitman, D.M. [Universidade de Sao Paulo, Instituto de Fisica, Caixa Postal 66318, Sao Paulo, SP (Brazil); Tomsk State University, Department of Physics, Tomsk (Russian Federation); Shabad, A.E. [P. N. Lebedev Physics Institute, Moscow (Russian Federation)

    2014-04-15

    We show, within QED and other possible nonlinear theories, that a static charge localized in a finite domain of space becomes a magnetic dipole, if it is placed in an external (constant and homogeneous) magnetic field in the vacuum. The magnetic moment is quadratic in the charge, depends on its size and is parallel to the external field, provided the charge distribution is at least cylindrically symmetric. This magneto-electric effect is a nonlinear response of the magnetized vacuum to an applied electrostatic field. Referring to the simple example of a spherically symmetric applied field, the nonlinearly induced current and its magnetic field are found explicitly throughout the space; the pattern of the lines of force is depicted, both inside and outside the charge, which resembles that of a standard solenoid of classical magnetostatics. (orig.)

  16. Magnetic response to applied electrostatic field in external magnetic field

    CERN Document Server

    Adorno, T C; Shabad, A E

    2014-01-01

    We show, within QED and other possible nonlinear theories, that a static charge localized in a finite domain of space becomes a magnetic dipole, if it is placed in an external (constant and homogeneous) magnetic field in the vacuum. The magnetic moment is quadratic in the charge, depends on its size and is parallel to the external field, provided the charge distribution is at least cylindrically symmetric. This magneto-electric effect is a nonlinear response of the magnetized vacuum to an applied electrostatic field. Referring to a simple example of a spherically-symmetric applied field, the nonlinearly induced current and its magnetic field are found explicitly throughout the space, the pattern of lines of force is depicted, both inside and outside the charge, which resembles that of a standard solenoid of classical magnetostatics.

  17. Item response theory analysis of the Amyotrophic Lateral Sclerosis Functional Rating Scale-Revised in the Pooled Resource Open-Access ALS Clinical Trials Database.

    Science.gov (United States)

    Bacci, Elizabeth D; Staniewska, Dorota; Coyne, Karin S; Boyer, Stacey; White, Leigh Ann; Zach, Neta; Cedarbaum, Jesse M

    2016-01-01

    Our objective was to examine dimensionality and item-level performance of the Amyotrophic Lateral Sclerosis Functional Rating Scale-Revised (ALSFRS-R) across time using classical and modern test theory approaches. Confirmatory factor analysis (CFA) and Item Response Theory (IRT) analyses were conducted using data from patients with amyotrophic lateral sclerosis (ALS) Pooled Resources Open-Access ALS Clinical Trials (PRO-ACT) database with complete ALSFRS-R data (n = 888) at three time-points (Time 0, Time 1 (6-months), Time 2 (1-year)). Results demonstrated that in this population of 888 patients, mean age was 54.6 years, 64.4% were male, and 93.7% were Caucasian. The CFA supported a 4* individual-domain structure (bulbar, gross motor, fine motor, and respiratory domains). IRT analysis within each domain revealed misfitting items and overlapping item response category thresholds at all time-points, particularly in the gross motor and respiratory domain items. Results indicate that many of the items of the ALSFRS-R may sub-optimally distinguish among varying levels of disability assessed by each domain, particularly in patients with less severe disability. Measure performance improved across time as patient disability severity increased. In conclusion, modifications to select ALSFRS-R items may improve the instrument's specificity to disability level and sensitivity to treatment effects. PMID:26473473

  18. An autoregressive growth model for longitudinal item analysis.

    Science.gov (United States)

    Jeon, Minjeong; Rabe-Hesketh, Sophia

    2016-09-01

    A first-order autoregressive growth model is proposed for longitudinal binary item analysis where responses to the same items are conditionally dependent across time given the latent traits. Specifically, the item response probability for a given item at a given time depends on the latent trait as well as the response to the same item at the previous time, or the lagged response. An initial conditions problem arises because there is no lagged response at the initial time period. We handle this problem by adapting solutions proposed for dynamic models in panel data econometrics. Asymptotic and finite sample power for the autoregressive parameters are investigated. The consequences of ignoring local dependence and the initial conditions problem are also examined for data simulated from a first-order autoregressive growth model. The proposed methods are applied to longitudinal data on Korean students' self-esteem. PMID:26645083
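
    One way to write down the dependence the abstract describes, offered as a sketch only: a Rasch-type item response function with a lagged-response term. The parameterization and every symbol here (person j, item i, time t, latent trait \theta_{jt}, item difficulty \beta_i, autoregressive coefficient \gamma) are assumptions made for illustration, not notation taken from the paper.

```latex
\operatorname{logit} P\left(y_{ijt} = 1 \mid \theta_{jt},\, y_{ij,t-1}\right)
  = \theta_{jt} - \beta_i + \gamma\, y_{ij,t-1}, \qquad t \ge 2 .
```

    At t = 1 there is no y_{ij,0} to condition on, which is the initial conditions problem mentioned above; setting \gamma = 0 recovers a model with no dependence on the previous response.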

  19. Psychometric Properties of the Children's Depression Inventory: An Item Response Theory Analysis across Age in a Nonclinical, Longitudinal, Adolescent Sample

    Science.gov (United States)

    Lee, Young-Sun; Krishnan, Anita; Park, Yoon Soo

    2012-01-01

    The purpose of this study was to investigate psychometric properties of the Children's Depression Inventory within a nonclinical and longitudinal sample (8th and 12th grades). Using the Rasch rating scale, most items represented one dimension. There was adequate separation among items and no overlap between ranges of item difficulties with latent…

  20. Culturally Sensitive Depression Assessment for Chinese American Immigrants: Development of a Comprehensive Measure and a Screening Scale Using an Item Response Approach

    OpenAIRE

    Wong, Rose; Wu, Rufina; Guo, Carmen; Lam, Julia K.; Snowden, Lonnie R.

    2011-01-01

    The present mixed methods study developed a comprehensive measure and a screening scale of depression for Chinese American immigrants by combining an emic approach with item response analysis. Clinical participants were immigrants diagnosed by licensed clinicians who worked in the community. Qualitative interviews with clinicians and clinical participants (N = 63) supported the definition of the construct of depression—which guided scale development—and a 47-item pilot scale. Clinical and com...

  1. Difference in method of administration did not significantly impact item response

    DEFF Research Database (Denmark)

    Bjorner, Jakob B; Rose, Matthias; Gandek, Barbara;

    2014-01-01

    , fatigue, and depression) were completed by 923 adults (age 18-89) with chronic obstructive pulmonary disease, depression, or rheumatoid arthritis. In a randomized cross-over design, subjects answered one form by interactive voice response (IVR) technology, paper questionnaire (PQ), personal digital...

  2. Students' Initial Impressions of Teaching Effectiveness: An Analysis of Structured Response Items.

    Science.gov (United States)

    Hayward, Pamela A.

    Because an instructor's first interaction with students on the first day of class can determine the success of those to follow, it is important to explore what happens in a classroom setting before offering prescriptions on how to best handle situations in that setting. To understand how students' responses to specific attributes related to…

  3. Guide to good practices for the development of test items

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1997-01-01

    While the methodology used in developing test items can vary significantly, to ensure quality examinations, test items should be developed systematically. Test design and development is discussed in the DOE Guide to Good Practices for Design, Development, and Implementation of Examinations. This guide is intended to be a supplement by providing more detailed guidance on the development of specific test items. This guide addresses the development of written examination test items primarily. However, many of the concepts also apply to oral examinations, both in the classroom and on the job. This guide is intended to be used as guidance for the classroom and laboratory instructor or curriculum developer responsible for the construction of individual test items. This document focuses on written test items, but includes information relative to open-reference (open book) examination test items, as well. These test items have been categorized as short-answer, multiple-choice, or essay. Each test item format is described, examples are provided, and a procedure for development is included. The appendices provide examples for writing test items, a test item development form, and examples of various test item formats.

  4. The Elements of Item Response Theory and its Framework in Analyzing Introductory Astronomy College Student Misconceptions. I. Galaxies

    CERN Document Server

    Favia, Andrej; Thorpe, Geoffrey L

    2013-01-01

    This is the first in a series of papers that analyze college student beliefs in realms where common astronomy misconceptions are prevalent. Data was collected through administration of an inventory distributed at the end of an introductory college astronomy course. In this paper, we present the basic mathematics of item response theory (IRT), and then we use it to explore concepts related to galaxies. We show how IRT determines the difficulty of each galaxy topic under consideration. We find that the concept of galaxy spatial distribution presents the greatest challenge to students of all the galaxy topics. We also find and present the most logical sequence to teach galaxy topics as a function of the audience's age.

  5. Lessons from the Ebola Outbreak: Action Items for Emerging Infectious Disease Preparedness and Response.

    Science.gov (United States)

    Jacobsen, Kathryn H; Aguirre, A Alonso; Bailey, Charles L; Baranova, Ancha V; Crooks, Andrew T; Croitoru, Arie; Delamater, Paul L; Gupta, Jhumka; Kehn-Hall, Kylene; Narayanan, Aarthi; Pierobon, Mariaelena; Rowan, Katherine E; Schwebach, J Reid; Seshaiyer, Padmanabhan; Sklarew, Dann M; Stefanidis, Anthony; Agouris, Peggy

    2016-03-01

    As the Ebola outbreak in West Africa wanes, it is time for the international scientific community to reflect on how to improve the detection of and coordinated response to future epidemics. Our interdisciplinary team identified key lessons learned from the Ebola outbreak that can be clustered into three areas: environmental conditions related to early warning systems, host characteristics related to public health, and agent issues that can be addressed through the laboratory sciences. In particular, we need to increase zoonotic surveillance activities, implement more effective ecological health interventions, expand prediction modeling, support medical and public health systems in order to improve local and international responses to epidemics, improve risk communication, better understand the role of social media in outbreak awareness and response, produce better diagnostic tools, create better therapeutic medications, and design better vaccines. This list highlights research priorities and policy actions the global community can take now to be better prepared for future emerging infectious disease outbreaks that threaten global public health and security. PMID:26915507

  6. The Children's Behavior Questionnaire very short scale: psychometric properties and development of a one-item temperament scale.

    Science.gov (United States)

    Sleddens, Ester F C; Hughes, Sheryl O; O'Connor, Teresia M; Beltran, Alicia; Baranowski, Janice C; Nicklas, Theresa A; Baranowski, Tom

    2012-02-01

    Little research has been conducted on the psychometrics of the very short scale (36 items) of the Children's Behavior Questionnaire, and no one-item temperament scale has been tested for use in applied work. In this study, 237 United States caregivers completed a survey to define their child's behavioral patterns (i.e., Surgency, Negative Affectivity, Effortful Control) using both scales. Psychometrics of the 36-item Children's Behavior Questionnaire were examined using classical test theory, principal factor analysis, and item response modeling. Classical test theory analysis demonstrated adequate internal consistency and factor analysis confirmed a three-factor structure. Potential improvements to the measure were identified using item response modeling. A one-item (three response categories) temperament scale was validated against the three temperament factors of the 36-item scale. The temperament response categories correlated with the temperament factors of the 36-item scale, as expected. The one-item temperament scale may be applicable for clinical use.

  7. Maximizing measurement efficiency of behavior rating scales using Item Response Theory: An example with the Social Skills Improvement System - Teacher Rating Scale.

    Science.gov (United States)

    Anthony, Christopher J; DiPerna, James C; Lei, Pui-Wa

    2016-04-01

    Measurement efficiency is an important consideration when developing behavior rating scales for use in research and practice. Although most published scales have been developed within a Classical Test Theory (CTT) framework, Item Response Theory (IRT) offers several advantages for developing scales that maximize measurement efficiency. The current study provides an example of using IRT to maximize rating scale efficiency with the Social Skills Improvement System - Teacher Rating Scale (SSIS - TRS), a measure of student social skills frequently used in practice and research. Based on IRT analyses, 27 items from the Social Skills subscales and 14 items from the Problem Behavior subscales of the SSIS - TRS were identified as maximally efficient. In addition to maintaining similar content coverage to the published version, these sets of maximally efficient items demonstrated similar psychometric properties to the published SSIS - TRS.

  8. RATING CREATION FOR PROFESSIONAL EDUCATIONAL ORGANIZATIONS BASED ON THE ITEM RESPONSE THEORY

    Directory of Open Access Journals (Sweden)

    N. E. Erganova

    2016-01-01

    The aim of the investigation is to theoretically justify and describe the approval of a measurement of the level of provision of educational services, the quality of education, and the rating of vocational educational organizations. Methods. The research methodology rests on the systems approach, on research into the schematization and modeling of pedagogical objects, and on the theory of measurement of latent variables. The main research methods are analysis, synthesis, comparative analysis, and statistical processing of research results. Results. The paper gives a short comparative analysis of the potential of the qualitative approach and the strengths of the theory of latent variables in evaluating the quality of education and the ratings of the investigated object. A technique for measuring the level of educational service provision when creating a rating of professional educational organizations is set out. Scientific novelty. The pedagogical possibilities of the theory of measurement of latent variables are investigated, and the principles for creating ratings of professional educational organizations are outlined. Practical significance. An operational construct of the latent variable «quality of education» for secondary professional education (SPE), approved in the Perm Territory, is developed; it can serve as a basis for forming similar constructs when rating professional educational organizations in other regions.

  9. An item response theory analysis of DSM-IV diagnostic criteria for personality disorders: findings from the national epidemiologic survey on alcohol and related conditions.

    Science.gov (United States)

    Harford, Thomas C; Chen, Chiung M; Saha, Tulshi D; Smith, Sharon M; Hasin, Deborah S; Grant, Bridget F

    2013-01-01

    The purpose of this study was to evaluate the psychometric properties of DSM-IV symptom criteria for assessing personality disorders (PDs) in a national population and to compare variations in proposed symptom coding for social and/or occupational dysfunction. Data were obtained from a total sample of 34,653 respondents from Waves 1 and 2 of the National Epidemiologic Survey on Alcohol and Related Conditions (NESARC). For each personality disorder, confirmatory factor analysis (CFA) established a 1-factor latent factor structure for the respective symptom criteria. A 2-parameter item response theory (IRT) model was applied to the symptom criteria for each PD to assess the probabilities of symptom item endorsements across different values of the underlying trait (latent factor). Findings were compared with a separate IRT model using an alternative coding of symptom criteria that requires distress/impairment to be related to each criterion. The CFAs yielded a good fit for a single underlying latent dimension for each PD. Findings from the IRT indicated that DSM-IV PD symptom criteria are clustered in the moderate to severe range of the underlying latent dimension for each PD and are peaked, indicating high measurement precision only within a narrow range of the underlying trait and lower measurement precision at lower and higher levels of severity. Compared with the NESARC symptom coding, the IRT results for the alternative symptom coding are shifted toward the more severe range of the latent trait but generally have lower measurement precision for each PD. The IRT findings provide support for a reliable assessment of each PD for both NESARC and alternative coding for distress/impairment. The use of symptom dysfunction for each criterion, however, raises a number of issues and implications for the DSM-5 revision currently proposed for Axis II disorders (American Psychiatric Association, 2010).
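
    The abstract's points about endorsement probabilities and "peaked" measurement precision can be illustrated with the textbook two-parameter logistic (2PL) model. This is a minimal sketch under assumed notation (discrimination a, severity/location b, latent trait theta), with hypothetical parameter values rather than estimates from the study.

```python
import math


def p_endorse(theta, a, b):
    """2PL probability of endorsing an item at latent trait level theta."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def item_information(theta, a, b):
    """Fisher information of a 2PL item: a^2 * P * (1 - P)."""
    p = p_endorse(theta, a, b)
    return a * a * p * (1.0 - p)


# Hypothetical criterion located in the more severe range (b = 1.0)
a, b = 2.0, 1.0
info_at_location = item_information(1.0, a, b)   # maximum, equals a^2 / 4
info_low_trait = item_information(-1.0, a, b)    # far from b, much smaller
```

    Information is highest where theta equals b and falls off on both sides, so a set of criteria whose b values cluster in a narrow, severe range measures precisely there and poorly elsewhere, which is the pattern the abstract reports.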

  10. Developmental changes in reading do not alter the development of visual processing skills: an application of explanatory item response models in grades K-2.

    Science.gov (United States)

    Santi, Kristi L; Kulesz, Paulina A; Khalaf, Shiva; Francis, David J

    2015-01-01

    Visual processing has been widely studied in regard to its impact on a student's ability to read. A less researched area is the role of reading in the development of visual processing skills. A cohort-sequential, accelerated-longitudinal design was utilized with 932 kindergarten, first, and second grade students to examine the impact of reading acquisition on the processing of various types of visual discrimination and visual motor test items. Students were assessed four times per year on a variety of reading measures and reading precursors and two popular measures of visual processing over a 3-year period. Explanatory item response models were used to examine the roles of person and item characteristics on changes in visual processing abilities and changes in item difficulties over time. Results showed different developmental patterns for five types of visual processing test items, but most importantly failed to show consistent effects of learning to read on changes in item difficulty. Thus, the present study failed to find support for the hypothesis that learning to read alters performance on measures of visual processing. Rather, visual processing and reading ability improved together over time with no evidence to suggest cross-domain influences from reading to visual processing. Results are discussed in the context of developmental theories of visual processing and brain-based research on the role of visual skills in learning to read. PMID:25717311

  11. Developmental changes in reading do not alter the development of visual processing skills: An application of explanatory item response models in grades K-2

    Directory of Open Access Journals (Sweden)

    Kristi L Santi

    2015-02-01

    Visual processing has been widely studied in regard to its impact on a student’s ability to read. A less researched area is the role of reading in the development of visual processing skills. A cohort-sequential, accelerated-longitudinal design was utilized with 932 kindergarten, first, and second grade students to examine the impact of reading acquisition on the processing of various types of visual discrimination and visual motor test items. Students were assessed four times per year on a variety of reading measures and reading precursors and two popular measures of visual processing over a three-year period. Explanatory item response models were used to examine the roles of person and item characteristics on changes in visual processing abilities and changes in item difficulties over time. Results showed different developmental patterns for five types of visual processing test items, but most importantly failed to show consistent effects of learning to read on changes in item difficulty. Thus, the present study failed to find support for the hypothesis that learning to read alters performance on measures of visual processing. Rather, visual processing and reading ability improved together over time with no evidence to suggest cross-domain influences from reading to visual processing. Results are discussed in the context of developmental theories of visual processing and brain-based research on the role of visual skills in learning to read.

  12. Calibrating the Medical Council of Canada’s Qualifying Examination Part I using an integrated item response theory framework: a comparison of models and designs

    Science.gov (United States)

    2016-01-01

    Purpose: The aim of this research was to compare different methods of calibrating multiple choice question (MCQ) and clinical decision making (CDM) components for the Medical Council of Canada’s Qualifying Examination Part I (MCCQEI) based on item response theory. Methods: Our data consisted of test results from 8,213 first time applicants to MCCQEI in spring and fall 2010 and 2011 test administrations. The data set contained several thousand multiple choice items and several hundred CDM cases. Four dichotomous calibrations were run using BILOG-MG 3.0. All 3 mixed item format (dichotomous MCQ responses and polytomous CDM case scores) calibrations were conducted using PARSCALE 4. Results: The 2-PL model had identical numbers of items with chi-square values at or below a Type I error rate of 0.01 (83/3,499 or 0.02). In all 3 polytomous models, whether the MCQs were either anchored or concurrently run with the CDM cases, results suggest very poor fit. All IRT abilities estimated from dichotomous calibration designs correlated very highly with each other. IRT-based pass-fail rates were extremely similar, not only across calibration designs and methods, but also with regard to the actual reported decision to candidates. The largest difference noted in pass rates was 4.78%, which occurred between the mixed format concurrent 2-PL graded response model (pass rate= 80.43%) and the dichotomous anchored 1-PL calibrations (pass rate= 85.21%). Conclusion: Simpler calibration designs with dichotomized items should be implemented. The dichotomous calibrations provided better fit of the item response matrix than more complex, polytomous calibrations. PMID:26883811

  13. Calibrating the Medical Council of Canada’s Qualifying Examination Part I using an integrated item response theory framework: a comparison of models and designs

    Directory of Open Access Journals (Sweden)

    Andre F. De Champlain

    2016-01-01

    Purpose: The aim of this research was to compare different methods of calibrating multiple choice question (MCQ) and clinical decision making (CDM) components for the Medical Council of Canada’s Qualifying Examination Part I (MCCQEI) based on item response theory. Methods: Our data consisted of test results from 8,213 first time applicants to MCCQEI in spring and fall 2010 and 2011 test administrations. The data set contained several thousand multiple choice items and several hundred CDM cases. Four dichotomous calibrations were run using BILOG-MG 3.0. All 3 mixed item format (dichotomous MCQ responses and polytomous CDM case scores) calibrations were conducted using PARSCALE 4. Results: The 2-PL model had identical numbers of items with chi-square values at or below a Type I error rate of 0.01 (83/3,499 or 0.02). In all 3 polytomous models, whether the MCQs were either anchored or concurrently run with the CDM cases, results suggest very poor fit. All IRT abilities estimated from dichotomous calibration designs correlated very highly with each other. IRT-based pass-fail rates were extremely similar, not only across calibration designs and methods, but also with regard to the actual reported decision to candidates. The largest difference noted in pass rates was 4.78%, which occurred between the mixed format concurrent 2-PL graded response model (pass rate= 80.43%) and the dichotomous anchored 1-PL calibrations (pass rate= 85.21%). Conclusion: Simpler calibration designs with dichotomized items should be implemented. The dichotomous calibrations provided better fit of the item response matrix than more complex, polytomous calibrations.

  14. Psychopathy in adolescent offenders: an item response theory study of the antisocial process screening device-self report and the Psychopathy Checklist: Youth Version.

    Science.gov (United States)

    Dillard, Crystal L; Salekin, Randall T; Barker, Edward D; Grimes, Ross D

    2013-04-01

    Few studies have examined the item functioning of youth psychopathy measures or compared the functioning of clinician and self-report based indices. Even fewer studies have made these comparisons in both male and female adolescent samples. The present study examined the applicability of items from two psychopathy measures, the Antisocial Process Screening Device (APSD; Frick, P. J., & Hare, R. D., 2001, The Antisocial Process Screening Device. Toronto, Ontario, Canada: Multi-Health Systems) and Psychopathy Checklist: Youth Version (PCL:YV; Forth, A. E., Kosson, D. S., & Hare, R. D., 2003, The Psychopathy Checklist: Youth Version. Toronto, Ontario, Canada: Multi-Health Systems), to adolescent boys and girls who had come into contact with the law. Item Response Theory was used to test item functioning of the two psychopathy indices. Examination of the Item Response Theory trace lines indicated that the APSD and the PCL:YV have both highly discriminating and poorly discriminating items and that the measures differ in the regions of psychopathy they cover. The PCL:YV is particularly effective at assessing interpersonal and affective features of psychopathy and to a lesser extent, lifestyle and antisocial features. The APSD appears to be effective at assessing narcissism and impulsivity but not callousness. In addition, the items most discriminating of the underlying construct of psychopathy for males and females demonstrate some important differences. These findings suggest that the measures may tap different underlying elements of the same overlaying construct. This may account for modest correlations between the measures. The findings suggest that clinicians should be aware of the regions that each measure best taps and also suggest that continued refinement and revisions to the youth psychopathy measures may be required. PMID:22686465

  15. Desenvolvimento de uma escala para medir o potencial empreendedor utilizando a Teoria da Resposta ao Item (TRI) / Development of a scale to measure the entrepreneurial potential using the Item Response Theory (IRT)

    Directory of Open Access Journals (Sweden)

    Luciano Ricardo Rath Alves

    2011-01-01

    Several variables are related to the development of entrepreneurial activities. An important one among them is the entrepreneurial agent. This study is one of many that contribute to the understanding of the entrepreneurial agent. In its line of thought, it upholds the idea that the entrepreneur has characteristics and personality traits that stand out from the general population and that are favorable to the success of the entrepreneurship. This study aims at developing a measurement scale for entrepreneurial potential using the Item Response Theory. The items were generated by Santos (2008) based on a theoretical model… The two-parameter logistic IRT model was used. Parameter estimates were obtained from a sample of 764 people who responded to an instrument composed of 103 items. The test information and standard error curves, together with a qualitative interpretation of the scale levels, made it possible to determine the most appropriate range for using the instrument. The results showed that the scale is best suited to assessing individuals with low to moderately high entrepreneurial potential. It is therefore suggested that new items be added to the instrument to measure and interpret even higher levels. Item Response Theory allows new items to be calibrated to measure entrepreneurs with high entrepreneurial potential while making use of the data already obtained.

  16. A Practical Guide to Check the Consistency of Item Response Patterns in Clinical Research Through Person-Fit Statistics: Examples and a Computer Program.

    Science.gov (United States)

    Meijer, Rob R; Niessen, A Susan M; Tendeiro, Jorge N

    2016-02-01

    Although there are many studies devoted to person-fit statistics to detect inconsistent item score patterns, most studies are difficult to understand for nonspecialists. The aim of this tutorial is to explain the principles of these statistics for researchers and clinicians who are interested in applying these statistics. In particular, we first explain how invalid test scores can be detected using person-fit statistics; second, we provide the reader practical examples of existing studies that used person-fit statistics to detect and to interpret inconsistent item score patterns; and third, we discuss a new R-package that can be used to identify and interpret inconsistent score patterns.

  17. Computerized Adaptive Testing with Item Clones. Research Report.

    Science.gov (United States)

    Glas, Cees A. W.; van der Linden, Wim J.

    To reduce the cost of item writing and to enhance the flexibility of item presentation, items can be generated by item-cloning techniques. An important consequence of cloning is that it may cause variability on the item parameters. Therefore, a multilevel item response model is presented in which it is assumed that the item parameters of a…

  18. Measuring Student Involvement: A Comparison of Classical Test Theory and Item Response Theory in the Construction of Scales from Student Surveys

    Science.gov (United States)

    Sharkness, Jessica; DeAngelo, Linda

    2011-01-01

    This study compares the psychometric utility of Classical Test Theory (CTT) and Item Response Theory (IRT) for scale construction with data from higher education student surveys. Using 2008 Your First College Year (YFCY) survey data from the Cooperative Institutional Research Program at the Higher Education Research Institute at UCLA, two scales…

  19. Innovative application of a multidimensional item response model in assessing the influence of social desirability on the pseudo-relationship between self-efficacy and behavior

    Science.gov (United States)

    This study examined multidimensional item response theory (MIRT) modeling to assess social desirability (SocD) influences on self-reported physical activity self-efficacy (PASE) and fruit and vegetable self-efficacy (FVSE). The observed sample included 473 Houston-area adolescent males (10–14 years)...

  20. Investigating the Population Sensitivity Assumption of Item Response Theory True-Score Equating across Two Subgroups of Examinees and Two Test Formats

    Science.gov (United States)

    von Davier, Alina A.; Wilson, Christine

    2008-01-01

    Dorans and Holland (2000) and von Davier, Holland, and Thayer (2003) introduced measures of the degree to which an observed-score equating function is sensitive to the population on which it is computed. This article extends the findings of Dorans and Holland and of von Davier et al. to item response theory (IRT) true-score equating methods that…

  1. An item response theory analysis of Harter's Self-Perception Profile for Children or why strong clinical scales should be distrusted

    NARCIS (Netherlands)

    Egberink, Iris J. L.; Meijer, Rob R.

    2011-01-01

    The authors investigated the psychometric properties of the subscales of the Self-Perception Profile for Children with item response theory (IRT) models using a sample of 611 children. Results from a nonparametric Mokken analysis and a parametric IRT approach for boys (n = 268) and girls (n = 343) w

  2. Waste minimization concepts applied to oil spill response

    International Nuclear Information System (INIS)

    Lessons learned from past US oil spill response histories show that prudent waste management principles have not been a primary consideration in making decisions for tactical response to major open-water oil spills. Contingency planners (government and industry) consistently choose a mechanical response strategy usually resulting in significant shoreline impact and waste generation (secondary pollution from response actions). Generally, the Environmental Protection Agency's waste minimization hierarchy is not used when managing a major open-water oil spill, subsequent cleanup of oiled shorelines, response to oiled wildlife, and final disposal of oily waste. Contingency plans do not adequately weigh the ecological ramifications from response-generated waste and response-generated pollution when deciding how to protect the environment. This paper shows how the EPA's waste minimization hierarchy should be used during all phases of an oil spill response: strategic planning, tactical planning, and response execution

  3. Understanding and quantifying cognitive complexity level in mathematical problem solving items

    Directory of Open Access Journals (Sweden)

    Susan E. Embretson

    2008-09-01

    The linear logistic test model (LLTM; Fischer, 1973) has been applied to a wide variety of new tests. When the LLTM application involves item complexity variables that are both theoretically interesting and empirically supported, several advantages can result. These advantages include elaborating construct validity at the item level, defining variables for test design, predicting parameters of new items, item banking by sources of complexity and providing a basis for item design and item generation. However, despite the many advantages of applying LLTM to test items, it has been applied less often to understand the sources of complexity for large-scale operational test items. Instead, previously calibrated item parameters are modeled using regression techniques because raw item response data often cannot be made available. In the current study, both LLTM and regression modeling are applied to mathematical problem solving items from a widely used test. The findings from the two methods are compared and contrasted for their implications for continued development of ability and achievement tests based on mathematical problem solving items.
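
    The core idea of the LLTM can be sketched in a few lines: the Rasch difficulty of an item is modeled as a weighted sum of its complexity features. The sketch below assumes that standard formulation; the feature weights, feature counts, and function names are hypothetical and are not values from the study.

```python
import math


def lltm_difficulty(q_row, eta, c=0.0):
    """Item difficulty as a linear combination of complexity features.

    q_row: how often each complexity feature occurs in the item.
    eta:   basic parameters (difficulty contribution of each feature).
    c:     normalization constant.
    """
    return sum(q * e for q, e in zip(q_row, eta)) + c


def rasch_probability(theta, b):
    """Rasch probability of a correct response for ability theta, difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


# Hypothetical weights for three complexity features.
ETA = [0.5, 1.0, 0.25]

if __name__ == "__main__":
    new_item = [1, 0, 2]  # feature counts for an item not yet calibrated
    b = lltm_difficulty(new_item, ETA)
    print(b, rasch_probability(0.0, b))
```

    Once the feature weights are estimated, the same function predicts the difficulty of a new item from its features alone, which is what makes the approach useful for item generation.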

  4. Dynamic and Comprehensive Item Selection Strategies for Computerized Adaptive Testing Based on Graded Response Model / 多级评分计算机化自适应测验动态综合选题策略

    Institute of Scientific and Technical Information of China (English)

    罗芬; 丁树良; 王晓庆

    2012-01-01

    Item selection strategy (ISS) is a core component of Computerized Adaptive Testing (CAT). Polytomous items can provide more information about an examinee than dichotomous items, and adopting polytomously scored items in tests is a research direction of CAT. The most widely used ISS is the maximum Fisher information (MFI) criterion, which raises concerns about the cost-efficiency of pool utilization and poses security risks for CAT programs. Chang & Ying (1999) and Chang, Qian, & Ying (2001) proposed two alternative item selection procedures based on dichotomous models, the a-stratified method (a-STR) and the a-stratified with b blocking method (b-STR), with the goal of remedying the problems of item overexposure and item underexposure produced by MFI. However, a-STR and b-STR are static because the items are stratified according to the information given at the beginning of the test. Based on the graded response model (GRM), a technique for reducing the dimensionality of the difficulty (or step) parameters was recently employed to construct some ISSs. The limitation of this dimension reduction technique is that it loses a lot of information. Thus, in order to improve on MFI, two new item selection methods are proposed based on the GRM: (1) modifying the technique for reducing the dimensionality of the difficulty (or step) parameters by integrating interval estimation; (2) implementing dynamic a-STR and dynamic b-STR methods in the testing process. On the one hand, these new ISSs can avoid and remedy the limitations of MFI and make good use of the advantages of the Fisher information function (FIF); the FIF condenses all item parameters and ability parameters, so it is by nature a comprehensive tool for all parameters. On the other hand, the new ISSs employ the property that the FIF represents the inverse of the variance of the ability estimate. Let £ be the square root of the reciprocal of the Fisher information, d be the absolute deviation between the
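
    For orientation, the baseline MFI criterion that this abstract sets out to improve can be sketched for the graded response model as follows. This is only the plain MFI rule, not the dynamic a-STR or b-STR methods proposed by the authors; the item pool and function names are hypothetical, and thresholds are assumed to be strictly increasing.

```python
import math


def _logistic(z):
    return 1.0 / (1.0 + math.exp(-z))


def _cumulative(theta, a, thresholds):
    """Boundary probabilities P*(k), padded with 1 and 0 at the ends."""
    return [1.0] + [_logistic(a * (theta - b)) for b in thresholds] + [0.0]


def grm_category_probs(theta, a, thresholds):
    """Category probabilities: differences of adjacent boundary probabilities."""
    cum = _cumulative(theta, a, thresholds)
    return [cum[k] - cum[k + 1] for k in range(len(cum) - 1)]


def grm_item_information(theta, a, thresholds):
    """Fisher information of a GRM item: sum over categories of P'_k^2 / P_k."""
    cum = _cumulative(theta, a, thresholds)
    info = 0.0
    for k in range(len(cum) - 1):
        p = cum[k] - cum[k + 1]
        dp = a * (cum[k] * (1 - cum[k]) - cum[k + 1] * (1 - cum[k + 1]))
        info += dp * dp / p
    return info


def select_mfi(theta, pool, administered):
    """Pick the unused item with maximum information at the current theta."""
    candidates = [i for i in range(len(pool)) if i not in administered]
    return max(candidates, key=lambda i: grm_item_information(theta, *pool[i]))


# Hypothetical pool of (discrimination, ordered thresholds) pairs.
POOL = [(1.0, [-2.0, -1.0]), (1.5, [-0.5, 0.5]), (1.0, [1.5, 2.5])]

if __name__ == "__main__":
    print(select_mfi(0.0, POOL, set()))
```

    The standard error of the ability estimate is then approximately one over the square root of the accumulated information, which is the property the abstract refers to.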

  5. A New Extension of the Binomial Error Model for Responses to Items of Varying Difficulty in Educational Testing and Attitude Surveys.

    Directory of Open Access Journals (Sweden)

    James A Wiley

    We put forward a new item response model which is an extension of the binomial error model first introduced by Keats and Lord. Like the binomial error model, the basic latent variable can be interpreted as a probability of responding in a certain way to an arbitrarily specified item. For a set of dichotomous items, this model gives predictions that are similar to other single parameter IRT models (such as the Rasch model) but has certain advantages in more complex cases. The first is that in specifying a flexible two-parameter Beta distribution for the latent variable, it is easy to formulate models for randomized experiments in which there is no reason to believe that either the latent variable or its distribution vary over randomly composed experimental groups. Second, the elementary response function is such that extensions to more complex cases (e.g., polychotomous responses, unfolding scales) are straightforward. Third, the probability metric of the latent trait allows tractable extensions to cover a wide variety of stochastic response processes.
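
    As background, the classical binomial error model with a Beta-distributed latent probability gives a beta-binomial distribution for the number of keyed responses. The sketch below shows only that classical starting point, not the extension to items of varying difficulty proposed in the paper; the parameter values and function names are hypothetical.

```python
import math


def log_beta(a, b):
    """Logarithm of the Beta function B(a, b)."""
    return math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)


def beta_binomial_pmf(x, n, alpha, beta):
    """P(X = x) for n items when the latent probability is Beta(alpha, beta).

    P(X = x) = C(n, x) * B(x + alpha, n - x + beta) / B(alpha, beta)
    """
    log_ratio = log_beta(x + alpha, n - x + beta) - log_beta(alpha, beta)
    return math.comb(n, x) * math.exp(log_ratio)


if __name__ == "__main__":
    n, alpha, beta = 10, 2.0, 3.0  # hypothetical test length and Beta parameters
    pmf = [beta_binomial_pmf(x, n, alpha, beta) for x in range(n + 1)]
    print([round(p, 4) for p in pmf])
```

    With `alpha = beta = 1` the latent probability is uniform and every total score is equally likely, which is a convenient sanity check.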

  6. Developing a Numerical Ability Test for Students of Education in Jordan: An Application of Item Response Theory

    Science.gov (United States)

    Abed, Eman Rasmi; Al-Absi, Mohammad Mustafa; Abu shindi, Yousef Abdelqader

    2016-01-01

    The purpose of the present study is to develop a test to measure the numerical ability of students of education. The sample of the study consisted of 504 students from 8 universities in Jordan. The final draft of the test contains 45 items distributed among 5 dimensions. The results revealed that acceptable psychometric properties of the test;…

  7. Item response theory was used to shorten EORTC QLQ-C30 scales for use in palliative care

    NARCIS (Netherlands)

    M.A. Petersen; M. Groenvold; N. Aaronson; J. Blazeby; Y. Brandberg; A. de Graeff; P. Fayers; E. Hammerlid; M. Sprangers; G. Velikova; J.B. Bjorner

    2006-01-01

    Background and Objective: The goal was to develop a shortened version of the EORTC QLQ-C30 for use in palliative care. We wanted to keep as few items as possible in each scale while still being able to compare results with studies using the original scales. We examined the possibilities of shortenin

  8. The D-Optimality Item Selection Criterion in the Early Stage of CAT: A Study with the Graded Response Model

    Science.gov (United States)

    Passos, Valeria Lima; Berger, Martijn P. F.; Tan, Frans E. S.

    2008-01-01

    During the early stage of computerized adaptive testing (CAT), item selection criteria based on Fisher's information often produce less stable latent trait estimates than the Kullback-Leibler global information criterion. Robustness against early stage instability has been reported for the D-optimality criterion in a polytomous CAT with the…

  9. Quality of life in the Danish general population--normative data and validity of WHOQOL-BREF using Rasch and item response theory models

    DEFF Research Database (Denmark)

    Noerholm, V; Groenvold, M; Watt, T;

    2004-01-01

    BACKGROUND: The main objective of this study was to investigate the construct validity of the WHOQOL-BREF by use of Rasch and Item Response Theory models and to examine the stability of the model across high/low scoring individuals, gender, education, and depressive illness. Furthermore, the obje... population. The response rate was 68.5%, and the sample reported here contained 1101 respondents: 578 women and 519 men (four respondents did not indicate their genders). RESULTS: Each of the four domains of the WHOQOL-BREF scale fitted a two-parameter IRT model, but did not fit the Rasch model. Due to multidimensionality, the total score of 26 items fitted neither model. Regression analysis was carried out, showing a level of explained variance of between 10 and 14%. The mean scores of the WHOQOL-BREF are reported as normative data for the general Danish population. CONCLUSION: The profile of the four WHOQOL...

  10. Applying Bayesian belief networks in rapid response situations

    Energy Technology Data Exchange (ETDEWEB)

    Gibson, William L [Los Alamos National Laboratory]; Leishman, Deborah A. [Los Alamos National Laboratory]; Van Eeckhout, Edward [Los Alamos National Laboratory]

    2008-01-01

    The authors have developed an enhanced Bayesian analysis tool called the Integrated Knowledge Engine (IKE) for monitoring and surveillance. The enhancements are suited for Rapid Response Situations where decisions must be made based on uncertain and incomplete evidence from many diverse and heterogeneous sources. The enhancements extend the probabilistic results of the traditional Bayesian analysis by (1) better quantifying uncertainty arising from model parameter uncertainty and uncertain evidence, (2) optimizing the collection of evidence to reach conclusions more quickly, and (3) allowing the analyst to determine the influence of the remaining evidence that cannot be obtained in the time allowed. These extended features give the analyst and decision maker a better comprehension of the adequacy of the acquired evidence and hence the quality of the hurried decisions. They also describe two example systems where the above features are highlighted.

  11. Teoria de Resposta ao Item na análise de uma prova de estatística em universitários / Item Response Theory to analyze a statistics test in university students

    Directory of Open Access Journals (Sweden)

    Claudette Maria Medeiros Vendramini

    2005-12-01

    This study aimed to apply Item Response Theory to the analysis of the 15 multiple-choice questions of a statistics test presented in the form of statistical graphs or tables. The participants were 413 university students, selected by convenience from two private higher education institutions, predominantly from the Psychology course (91.5%). The students were 80% female and mostly daytime students (69.8%), aged 16 to 53 years, with a mean age of 24.4 and a standard deviation of 7.4. The test is predominantly unidimensional, and the items are best fitted by the three-parameter logistic model. The discrimination, difficulty, and biserial correlation indexes show acceptable values. The results show the difficulties students have with mathematical and statistical concepts, difficulties that have been observed in other studies from elementary education onward. It is suggested that these concepts be treated in more depth in higher education.

  13. Natural History of Dependency in the Elderly: A 24-Year Population-Based Study Using a Longitudinal Item Response Theory Model.

    Science.gov (United States)

    Edjolo, Arlette; Proust-Lima, Cécile; Delva, Fleur; Dartigues, Jean-François; Pérès, Karine

    2016-02-15

    We aimed to describe the hierarchical structure of Instrumental Activities of Daily Living (IADL) and basic Activities of Daily Living (ADL) and trajectories of dependency before death in an elderly population using item response theory methodology. Data were obtained from a population-based French cohort study, the Personnes Agées QUID (PAQUID) Study, of persons aged ≥65 years at baseline in 1988 who were recruited from 75 randomly selected areas in Gironde and Dordogne. We evaluated IADL and ADL data collected at home every 2-3 years over a 24-year period (1988-2012) for 3,238 deceased participants (43.9% men). We used a longitudinal item response theory model to investigate the item sequence of 11 IADL and ADL combined into a single scale and functional trajectories adjusted for education, sex, and age at death. The findings confirmed the earliest losses in IADL (shopping, transporting, finances) at the partial limitation level, and then an overlapping of concomitant IADL and ADL, with bathing and dressing being the earliest ADL losses, and finally total losses for toileting, continence, eating, and transferring. Functional trajectories were sex-specific, with a benefit of high education that persisted until death in men but was only transient in women. An in-depth understanding of this sequence provides an early warning of functional decline for better adaptation of medical and social care in the elderly. PMID:26825927

  14. Cross-validation study using item response theory: the health-related quality of life for eating disorders questionnaire-short version.

    Science.gov (United States)

    Bilbao, Amaia; Las Hayas, Carlota; Forero, Carlos G; Padierna, Angel; Martin, Josune; Quintana, José M

    2014-08-01

    The Health-Related Quality of Life for Eating Disorder-Short questionnaire is one of the most suitable existing instruments for measuring quality of life in patients with eating disorders. The objective of the study was to evaluate its reliability, validity, and responsiveness in a cohort of 377 patients. A comprehensive validation process was performed, including confirmatory factor analysis and a graded response model, and assessments of reliability and responsiveness at 1 year of follow-up. The confirmatory factor analysis confirmed the two second-order latent traits, social maladjustment, and mental health and functionality. The graded response model results showed that all items were good for discriminating their respective latent traits. Cronbach's alpha coefficients were high, and responsiveness parameters showed moderate changes. In conclusion, this short questionnaire has good psychometric properties. Its simplicity and ease of application further enhance its acceptability and usefulness in clinical research and trials, as well as in routine practice. PMID:24235177

  15. Item Difficulty Modeling of Paragraph Comprehension Items

    Science.gov (United States)

    Gorin, Joanna S.; Embretson, Susan E.

    2006-01-01

    Recent assessment research joining cognitive psychology and psychometric theory has introduced a new technology, item generation. In algorithmic item generation, items are systematically created based on specific combinations of features that underlie the processing required to correctly solve a problem. Reading comprehension items have been more…

  16. Item response theory in the production of indicators of socioeconomic metropolitan region of Maringá, Paraná State, Brazil - doi: 10.4025/actascitechnol.v34i4.10478

    Directory of Open Access Journals (Sweden)

    Vanessa Rufino da Silva

    2012-10-01

    This study aimed to identify and produce, through Item Response Theory (IRT) models, a socioeconomic indicator based on the items observed in the 2000 Census, following the methodology of Soares (2005). In the IRT methodology, this indicator is treated as a latent variable and is obtained through the construction of specific models and scales, which makes it possible to measure the variable; according to Andrade et al. (2000), IRT analyzes each item that composes the measuring instrument. In this case the instrument consists of binary (dichotomous) items that assess the possession of certain domestic comfort assets. The characteristics of each item were analyzed, such as its discrimination and the income necessary for the possession of a given asset. It was concluded that a trustworthy questionnaire for constructing a socioeconomic index of the metropolitan region of Maringá can be built with 13 items.

  17. Making Life Easier with Effort: Basic Findings and Applied Research on Response Effort.

    Science.gov (United States)

    Friman, Patrick C.; Poling, Alan

    1995-01-01

    This paper summarizes basic research on response effort in diverse applied areas including deceleration of aberrant behavior, attention deficit-hyperactivity disorder, oral habits, littering, and problem solving. The paper concludes that response effort as an independent variable has potent effects, and research exploring the applied benefits of…

  18. Fuzzy clustering applied to a demand response model in a smart grid contingency scenario

    OpenAIRE

    Pereira, Rita; Fagundes, Andre; Melicio, Rui; Mendes, Victor; Figueiredo, Joao; Martins, Joao; Quadrado, Jose

    2014-01-01

    This paper focuses on the analysis of a demand response model in a smart grid context, considering a contingency scenario. A fuzzy clustering technique is applied to the developed demand response model, and an analysis is performed for the contingency scenario. Model considerations and architecture are described. The developed demand response model aims to support consumers' decisions regarding their consumption needs and possible economic benefits.

  19. Identifying the ‘red flags’ for unhealthy weight control among adolescents: Findings from an item response theory analysis of a national survey

    Directory of Open Access Journals (Sweden)

    Utter Jennifer

    2012-08-01

    Background: Weight control behaviors are common among young people and are associated with poor health outcomes. Yet clinicians rarely ask young people about their weight control; this may be due to uncertainty about which questions to ask, specifically around whether certain weight loss strategies are healthy or unhealthy or about which weight loss behaviors are more likely to lead to adverse outcomes. Thus, the aims of the current study are: to confirm, using item response theory analysis, that the underlying latent constructs of healthy and unhealthy weight control exist; to determine the ‘red flag’ weight loss behaviors that may discriminate unhealthy from healthy weight loss; to determine the relationships between healthy and unhealthy weight loss and mental health; and to examine how weight control may vary among demographic groups. Methods: Data were collected as part of a national health and wellbeing survey of secondary school students in New Zealand (n = 9,107) in 2007. Item response theory analyses were conducted to determine the underlying constructs of weight control behaviors and the behaviors that discriminate unhealthy from healthy weight control. Results: The current study confirms that there are two underlying constructs of weight loss behaviors which can be described as healthy and unhealthy weight control. Unhealthy weight control was positively correlated with depressive mood. Fasting and skipping meals for weight loss had the lowest item thresholds on the unhealthy weight control continuum, indicating that they act as ‘red flags’ and warrant further discussion in routine clinical assessments. Conclusions: Routine assessments of weight control strategies by clinicians are warranted, particularly for screening for meal skipping and fasting for weight loss as these behaviors appear to ‘flag’ behaviors that are associated with poor mental wellbeing.

  20. How we know it hurts: item analysis of written narratives reveals distinct neural responses to others' physical pain and emotional suffering.

    Directory of Open Access Journals (Sweden)

    Emile Bruneau

    People are often called upon to witness, and to empathize with, the pain and suffering of others. In the current study, we directly compared neural responses to others' physical pain and emotional suffering by presenting participants (n = 41) with 96 verbal stories, each describing a protagonist's physical and/or emotional experience, ranging from neutral to extremely negative. A separate group of participants rated "how much physical pain", and "how much emotional suffering" the protagonist experienced in each story, as well as how "vivid and movie-like" the story was. Although ratings of Pain, Suffering and Vividness were positively correlated with each other across stories, item-analyses revealed that each scale was correlated with activity in distinct brain regions. Even within regions of the "Shared Pain network" identified using a separate data set, responses to others' physical pain and emotional suffering were distinct. More broadly, item analyses with continuous predictors provided a high-powered method for identifying brain regions associated with specific aspects of complex stimuli - like verbal descriptions of physical and emotional events.

  1. Modifying parents peer attachment scale with item response theory / 用项目反应理论修订父母同伴依恋量表

    Institute of Scientific and Technical Information of China (English)

    臧运洪; 赵守盈; 陈维; 潘运; 张禹

    2012-01-01

    The item discrimination, difficulty, and information function peak parameters of item response theory were used to revise the parent and peer attachment scale produced by Armsden and Greenberg (1991), so that the revised scale can more accurately survey the attachment status of Chinese junior middle school students. SPSS 15.0 was used to manage the data, MULTILOG 7.03 to estimate the parameters, and AMOS 4.0 to verify the revised scale. The results are as follows: (1) The parent and peer attachment scale is unidimensional, so it can be revised with item response theory. (2) The item discrimination (a) and difficulty (b) values of the new scale fall within a reasonable range. (3) The test information peak function of the new scale is smaller, and the scale has higher reliability. The new father and peer attachment scales each contain two factors: trust and communication. The new mother attachment scale contains the same factors as the original scale: trust, communication, and alienation. In formal administration, the revised scale can effectively survey the attachment status of Chinese Miao junior middle school students.

  2. Ranking Popular Items By Naive Bayes Algorithm

    Directory of Open Access Journals (Sweden)

    Shiramshetty Gouthami

    2012-03-01

    The problem of ranking popular items is attracting increasing interest from a number of research areas, and several algorithms have been proposed for this task. The problem of ranking and suggesting items arises in diverse applications, including interactive computational systems that help people leverage social information; technically, these are called social navigation systems. Such systems help individuals with their performance and decision making when selecting items. Popular items are ranked and suggested based on each individual's responses. Individual feedback might be obtained by displaying a set of suggested items, where the selection of items is based on the preference of the individual. The aim is to suggest popular items while rapidly learning the true popularity ranking of the items. The difficulty is that suggesting popular items to users can reinforce the popularity of some items and distort the resulting ranking of others. The problem of ranking and suggesting items therefore affects many applications, including suggestions and search query suggestions for social tagging systems. In this paper we propose the Naïve Bayes algorithm for ranking and suggesting popular items.

  3. Obtaining Some Degree of Correspondence Between Unequatable Scores: A Comparison of Item Response Theory and Equipercentile Equating Methods.

    Science.gov (United States)

    Yen, Wendy M.

    Test scores that are not perfectly reliable cannot be strictly equated unless they are strictly parallel. This fact implies that tau equivalence can be lost if an equipercentile equating is applied to observed scores that are not strictly parallel. Thirty-six simulated data sets are produced to simulate equating tests with different difficulties…

  4. An Item Response Theory Analysis of the Mindful Attention Awareness Scale

    Institute of Scientific and Technical Information of China (English)

    赵守盈; 石艳梅; 郭海辉

    2014-01-01

    The Mindful Attention Awareness Scale (MAAS), developed by Brown & Ryan (2003), is one of the most commonly used measures of mindful attention. Item discrimination, threshold, and the peak of the information function were analyzed within item response theory in order to revise the scale. Data collected from a sample of Chinese primary and middle school teachers were analyzed with MULTILOG 7.03. The results suggested that the MAAS exhibited promising psychometric properties, satisfied the unidimensionality hypothesis, and is sufficiently accurate to survey the mindfulness of Chinese teachers. The conclusions were as follows: (1) The item discrimination (a) and threshold (b) parameters of the scale were satisfactory. (2) Six items carried very high information, together accounting for nearly 70% of the total test information, which suggests that they can form a short form; the peak of the test information function of this six-item short scale was quite high, and the scale had reasonable reliability. (3) The indicators of the confirmatory factor analysis (CFA) met the requirements.
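    As a rough illustration of what discrimination a, threshold b, and the information peak refer to, the sketch below uses the dichotomous two-parameter logistic (2PL) model. This is a simplifying assumption: the abstract does not state which model was fitted in MULTILOG, and a rating scale such as the MAAS would more likely call for a polytomous model. The function names and item parameters are hypothetical and are not taken from the study.

```python
import math

def p_2pl(theta, a, b):
    """Probability of endorsing a dichotomous item under the 2PL model."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def item_information(theta, a, b):
    """Fisher information of a 2PL item at ability theta: a^2 * P * (1 - P)."""
    p = p_2pl(theta, a, b)
    return a * a * p * (1.0 - p)

def information_peak(a, b):
    """A 2PL information curve peaks at theta = b with height a^2 / 4."""
    return b, a * a / 4.0

# Hypothetical (a, b) pairs, chosen only to show the effect of discrimination.
items = [(1.8, -0.5), (0.6, 0.0), (1.2, 1.0)]
for a, b in items:
    location, height = information_peak(a, b)
    print(f"a={a:.1f} b={b:+.1f} peak at theta={location:+.1f} height={height:.3f}")
```

    Under this model, items with larger a contribute taller and narrower information curves, which is why a handful of highly discriminating items can account for most of a scale's total information.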

  5. Item Analysis of Short Test in Educational Testing: Comparative Study on Parameter and Non-parameter Item Response Theory

    Institute of Scientific and Technical Information of China (English)

    何壮; 袁淑莉; 赵守盈

    2012-01-01

    Topic-specific tests and short tests are a major form of item writing in educational testing. Such tests can be analyzed with both parametric and nonparametric item response theory. This study applied the Rasch model and the Mokken model to a comprehensive liberal-arts geography paper taken by senior three students and compared the results. Winsteps and Xcalibre were used for the Rasch analysis, yielding difficulty, information, and differential item functioning parameters. MSP was used for the Mokken analysis, yielding proportions correct and homogeneity coefficients. Comparing the two sets of results led to four conclusions: (1) Ordering items by proportion correct under nonparametric item response theory agreed with ordering them by difficulty under parametric item response theory. (2) A few items that failed to meet the parametric criteria were still useful for improving test quality and should not be deleted. (3) For dimensionality testing and item screening, the nonparametric criteria were stricter than the parametric criteria. (4) The differential item functioning results of the two approaches agreed.
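    The two Mokken statistics named in the abstract, proportion correct and the homogeneity coefficient, can be sketched for dichotomous items as follows. This is a minimal illustration of Loevinger's scale coefficient H, assuming 0/1 scored responses; it is not a description of how MSP computes its output, and the function names and toy data are hypothetical.

```python
from itertools import combinations

def proportion_correct(responses):
    """Proportion of test takers answering each item correctly."""
    n = len(responses)
    k = len(responses[0])
    return [sum(row[j] for row in responses) / n for j in range(k)]

def scale_h(responses):
    """Loevinger's H: summed inter-item covariances over their maximum
    possible value given the item marginals."""
    n = len(responses)
    p = proportion_correct(responses)
    observed = 0.0
    maximum = 0.0
    for i, j in combinations(range(len(p)), 2):
        joint = sum(row[i] * row[j] for row in responses) / n
        observed += joint - p[i] * p[j]
        # With fixed marginals, the joint proportion cannot exceed min(p_i, p_j).
        maximum += min(p[i], p[j]) - p[i] * p[j]
    if maximum == 0.0:
        raise ValueError("H is undefined when items have no variance")
    return observed / maximum

# Toy data with a perfect Guttman pattern (rows are test takers).
guttman = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
print(proportion_correct(guttman))
print(scale_h(guttman))
```

    A perfect Guttman pattern gives H = 1, and values near 0 indicate items that do not order test takers consistently.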

  6. An Effective Multimedia Item Shell Design for Individualized Education: The Crome Project

    Directory of Open Access Journals (Sweden)

    Irene Cheng

    2008-01-01

    Full Text Available There are several advantages to creating multimedia item types and applying computer-based adaptive testing in education. First is the capability to motivate learning by making the learners feel more engaged and in an interactive environment. Second is a better concept representation, which is not possible in conventional multiple-choice tests. Third is the advantage of individualized curriculum design, rather than a curriculum designed for an average student. Fourth is a good choice of the next question, associated with the appropriate difficulty level based on a student's response to the current question. However, many issues need to be addressed when achieving these goals, including: (a) the large number of item types required to represent the current multiple-choice questions in multimedia formats, (b) the criterion used to determine the difficulty level of a multimedia question item, and (c) the methodology applied to the question selection process for individual students. In this paper, we propose a multimedia item shell design that not only reduces the number of item types required, but also computes difficulty level of an item automatically. The concept of question seed is introduced to make content creation more cost-effective. The proposed item shell framework facilitates efficient communication between user responses at the client, and the scoring agents integrated with a student ability assessor at the server. We also describe approaches for automatically estimating difficulty level of questions, and discuss preliminary evaluation of multimedia item types by students.

  7. Principles and procedures of considering item sequence effects in the development of calibrated item pools: Conceptual analysis and empirical illustration

    Directory of Open Access Journals (Sweden)

    Safir Yousfi

    2012-12-01

    Full Text Available Item responses can be context-sensitive. Consequently, composing test forms flexibly from a calibrated item pool requires considering potential context effects. This paper focuses on context effects that are related to the item sequence. It is argued that sequence effects are not necessarily a violation of item response theory but that item response theory offers a powerful tool to analyze them. If sequence effects are substantial, test forms cannot be composed flexibly on the basis of a calibrated item pool, which precludes applications like computerized adaptive testing. In contrast, minor sequence effects do not thwart applications of calibrated item pools. Strategies to minimize the detrimental impact of sequence effects on item parameters are discussed and integrated into a nomenclature that addresses the major features of item calibration designs. An example of an item calibration design demonstrates how this nomenclature can guide the process of developing a calibrated item pool.

  8. Losing Items in the Psychogeriatric Nursing Home

    Directory of Open Access Journals (Sweden)

    J. van Hoof PhD

    2016-09-01

    Full Text Available Introduction: Losing items is a time-consuming occurrence in nursing homes that is ill described. An explorative study was conducted to investigate which items got lost by nursing home residents, and how this affects the residents and family caregivers. Method: Semi-structured interviews and card sorting tasks were conducted with 12 residents with early-stage dementia and 12 family caregivers. Thematic analysis was applied to the outcomes of the sessions. Results: The participants stated that numerous personal items and assistive devices get lost in the nursing home environment, which had various emotional, practical, and financial implications. Significant amounts of time are spent on trying to find items, varying from 1 hr up to a couple of weeks. Numerous potential solutions were identified by the interviewees. Discussion: Losing items often goes together with limitations to the participation of residents. Many family caregivers are reluctant to replace lost items, as these items may get lost again.

  9. Analyzing effects of aperture size and applied voltage on the response time

    Science.gov (United States)

    Kim, YooKwang; Lee, Jin Su; Won, Yong Hyub

    2016-03-01

    The electrowetting lens is a promising technique for non-mechanical varifocal lenses because of its fast response time, wide diopter range, and other advantages. Although papers on electrowetting are actively published, none has clearly defined the relationship among electrowetting parameters, especially in the AC-driven case. Analysis of AC voltage driving is needed because AC electrowetting has many advantages, such as low hysteresis and short settling time. In this experiment we confirmed that the response time depends on aperture size and applied voltage. Response time was measured for lens apertures of 200-1000 µm and applied voltages of 0-70 V at a frequency of 1 kHz. The experimental data were compared with simulation results from COMSOL Multiphysics under the same conditions, and the two agreed well. As voltage increases, the overshoot height becomes higher, so oscillation and settling time become longer. Conversely, as aperture size decreases, the surface tension at the lens wall is transmitted more effectively to the center region of the meniscus, so there is less oscillation and a shorter settling time. As a result, for a 500 µm aperture no more than 30 V should be applied to ensure a 1 ms response time, whereas for a 200 µm aperture the voltage limit disappears.

  10. Response of self-assembly for magnetite nanocrystal in magnetic fluid under an applied magnetic field

    Institute of Scientific and Technical Information of China (English)

    Yun Zou; Yiyou Nie; Ziyun Di; Dongchen Zhang; Minghuang Sang; Xianfeng Chen

    2008-01-01

    The response time and transmittivity of the magnetic fluid (MF) for different concentrations at room temperature were investigated in this letter. The volume fraction of the investigated sample ranged from 0.44% to 6.47%. It was found that the transmittivity decreased with increasing concentration under a given magnetic field, and the evolution time was changed with different concentrations. Moreover, the light intensity decreased rapidly at the beginning and then became stable when the magnetic field was applied.

  11. The Professional Context as a Predictor for Response Distortion in the Adaption-Innovation Inventory--An Investigation Using Mixture Distribution Item Response Theory Models

    Science.gov (United States)

    Fischer, Sebastian; Freund, Philipp Alexander

    2014-01-01

    The Adaption-Innovation Inventory (AII), originally developed by Kirton (1976), is a widely used self-report instrument for measuring problem-solving styles at work. The present study investigates how scores on the AII are affected by different response styles. Data are collected from a combined sample (N = 738) of students, employees, and…

  12. Response of vetch, lentil, chickpea and red pea to pre- or post-emergence applied herbicides

    Directory of Open Access Journals (Sweden)

    I. Vasilakoglou

    2013-09-01

    Full Text Available Broad-leaved weeds constitute a serious problem in the production of winter legumes, but few selective herbicides controlling these weeds have been registered in Europe. Four field experiments were conducted in 2009/10 and repeated in 2010/11 in Greece to study the response of common vetch (Vicia sativa L.), lentil (Lens culinaris Medik.), chickpea (Cicer arietinum L.) and red pea (Lathyrus cicera L.) to several rates of the herbicides pendimethalin, S-metolachlor, S-metolachlor plus terbuthylazine and flumioxazin applied pre-emergence, as well as imazamox applied post-emergence. Phytotoxicity, crop height, total weight and seed yield were evaluated during the experiments. The results of this study suggest that common vetch, lentil, chickpea and red pea differed in their responses to the herbicides tested. Pendimethalin at 1.30 kg ha-1, S-metolachlor at 0.96 kg ha-1 and flumioxazin at 0.11 kg ha-1 used as pre-emergence applied herbicides provided the least phytotoxicity to legumes. Pendimethalin at 1.98 kg ha-1 and both rates of S-metolachlor plus terbuthylazine provided the greatest common lambsquarters (Chenopodium album L.) control. Imazamox at 0.03 to 0.04 kg ha-1 could also be used as early post-emergence applied herbicide in common vetch and red pea without any significant detrimental effect.

  13. Research Computer Adaptive Testing Algorithms Based on Item Response Theory

    Institute of Scientific and Technical Information of China (English)

    刘锋; 郭维威

    2014-01-01

    With the continuous development of educational testing theory, computerized adaptive testing has been widely studied and applied. Building on item response theory, this paper uses the three-parameter logistic model to propose an efficient computerized adaptive testing algorithm that improves the efficiency and accuracy of testing and estimates the ability level of candidates.
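    To make the three-parameter logistic model concrete, the sketch below computes the response probability and item information, and then picks the next item by maximum information at the current ability estimate. The maximum-information rule is an assumption used for illustration: the abstract does not say which selection rule the authors adopted. The item pool and function names are hypothetical, and no scaling constant D is applied.

```python
import math

def p_3pl(theta, a, b, c):
    """Probability of a correct response: c + (1 - c) * logistic(a * (theta - b))."""
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

def info_3pl(theta, a, b, c):
    """Fisher information of a 3PL item at ability theta."""
    p = p_3pl(theta, a, b, c)
    return a * a * ((p - c) / (1.0 - c)) ** 2 * (1.0 - p) / p

def next_item(theta, pool, administered):
    """Index of the unused item that is most informative at theta."""
    candidates = [i for i in range(len(pool)) if i not in administered]
    return max(candidates, key=lambda i: info_3pl(theta, *pool[i]))

# Hypothetical pool of (discrimination a, difficulty b, guessing c) triples.
pool = [(1.0, -1.0, 0.2), (1.5, 0.0, 0.2), (1.2, 1.5, 0.25)]
print(next_item(0.0, pool, set()))
print(next_item(0.0, pool, {1}))
```

    In a full adaptive test this selection step would alternate with re-estimating theta after each response, which is not shown here.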

  14. Postural responses to anterior and posterior perturbations applied to the upper trunk of standing human subjects.

    Science.gov (United States)

    Colebatch, James G; Govender, Sendhil; Dennis, Danielle L

    2016-02-01

    This study concerned the effects of brisk perturbations applied to the shoulders of standing subjects to displace them either forwards or backwards, our aim being to characterise the responses to these disturbances. Subjects stood on a force platform, and acceleration was measured at the level of C7, the sacrum and both tibial tuberosities. Surface EMG was measured from soleus (SOL), tibialis anterior (TA), the hamstrings (HS), quadriceps (QUAD), rectus abdominis (RA) and lumbar paraspinal (PS) muscles. Trials were recorded for each of four conditions: subjects' eyes open (reference) or closed and on a firm (reference) or compliant surface. Observations were also made of voluntary postural reactions to a tap over the deltoid. Anterior perturbations (mean C7 acceleration 251.7 mg) evoked activity within the dorsal muscles (SOL, HS, PS) with a similar latency to voluntary responses to shoulder tapping. Responses to posterior perturbations (mean C7 acceleration -240.4 mg) were more complex beginning, on average, at shorter latency than voluntary activity (median TA 78.0 ms). There was activation of TA, QUAD and SOL associated with initial forward acceleration of the lower legs. The EMG responses consisted of an initial phasic discharge followed by a more prolonged one. These responses differ from the pattern of automatic postural responses that follow displacements at the level of the ankles, and it is unlikely that proprioceptive afferents excited by ankle movement had a role in the initial responses. Vision and surface properties had only minor effects. Perturbations of the upper trunk evoke stereotyped compensatory postural responses for each direction of perturbation. For posterior perturbations, EMG onset occurs earlier than for voluntary responses. PMID:26487178

  15. Gibbs sampling method in multidimensional two parameter logistic item response model

    Institute of Scientific and Technical Information of China (English)

    付志慧; 李斌

    2014-01-01

    Item response theory (IRT) rests on three assumptions: unidimensionality, local independence, and monotonicity. However, these assumptions have shortcomings that need to be addressed. Research shows that fitting a unidimensional model to multidimensional data increases measurement error and leads to incorrect inferences about students' ability. For this reason, researchers have extended unidimensional IRT to multidimensional IRT from different perspectives. Because the multidimensional model has more parameters to estimate, traditional methods such as marginal maximum likelihood and Bayes modal estimation are difficult to apply. The Gibbs sampler, however, has great potential as an efficient and versatile estimation procedure in item response theory. In this article, based on a data augmentation scheme using the Gibbs sampler, we propose a Bayesian procedure for estimating the multidimensional two-parameter logistic model (2PLM). With the introduction of latent variables, the full conditional distributions are tractable, and consequently Gibbs sampling is easy to implement for any prior assumptions.
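    The abstract does not give the model equation. One common way to write a multidimensional two-parameter logistic model, offered here as an assumption about notation rather than as the authors' own formula, is:

```latex
P(Y_{ij} = 1 \mid \boldsymbol{\theta}_i)
  = \frac{\exp\left(\mathbf{a}_j^{\top}\boldsymbol{\theta}_i - b_j\right)}
         {1 + \exp\left(\mathbf{a}_j^{\top}\boldsymbol{\theta}_i - b_j\right)}
```

    Here Y_ij is the 0/1 response of person i to item j, theta_i is that person's vector of latent abilities, a_j is the item's vector of discrimination parameters (one per dimension), and b_j is a scalar intercept-type difficulty parameter. When theta_i has a single dimension, the expression reduces to a unidimensional 2PL model in slope-intercept form.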

  16. Item Response Theory Analysis of a brief instrument for assessing antisocial behaviors

    Directory of Open Access Journals (Sweden)

    Hauck Filho, Nelson

    2014-01-01

    Full Text Available Antisocial behaviors are common to several psychopathological conditions, including personality disorders (e.g., antisocial and narcissistic) and mood disorders (e.g., bipolar disorder). Until now, however, there has been an important gap in the Brazilian context regarding the brief assessment of antisocial behaviors in adults from non-prison settings. The present study therefore aimed to construct a brief instrument for use in research and screening in the general adult population, and to analyze it using Item Response Theory. Analyses of the responses of 204 university students (mean age = 23.56 years; SD = 7.70; 60.6% women) to a set of items allowed 13 items with excellent psychometric properties to be retained. These items proved to assess a general factor of antisociality, interpretable as a propensity for antagonism, non-cooperation, and aggression across a variety of social contexts. Limitations of the study are discussed at the end.

  17. Applying Total Physical Response(TPR)Theory to Teaching Chinese Children English

    Institute of Scientific and Technical Information of China (English)

    张院院

    2015-01-01

    It has become fashionable in our society for young learners aged 6 or even younger to take part in foreign language learning. According to Second Language Acquisition theories, starting a foreign language in childhood facilitates learning. Children need a teaching method that conforms to their psychological and physical characteristics. The American psychologist James Asher developed Total Physical Response, which advocates learning through physical actions. He believes that children should learn a foreign language happily and confidently, much as they acquire their mother tongue. However, Total Physical Response cannot be applied effectively in teaching because of children's instincts and characteristics. A strategy that takes advantage of children's characteristics and manages their behavior in class would yield more satisfying teaching results.

  20. Academic freedom and the professional responsibilities of applied ethicists: a comment on Minerva.

    Science.gov (United States)

    Dawson, Angus; Herington, Jonathan

    2014-05-01

    Academic freedom is an important good, but it comes with several responsibilities. In this commentary we seek to do two things. First, we argue against Francesca Minerva's view of academic freedom as presented in her article 'New threats to academic freedom' on a number of grounds. We reject the nature of the absolutist moral claim to free speech for academics implicit in the article; we reject the elitist role for academics as truth-seekers explicit in her view; and we reject a possible more moderate re-construction of her view based on the harm/offence distinction. Second, we identify some of the responsibilities of applied ethicists, and illustrate how they recommend against allowing for anonymous publication of research. Such a proposal points to the wider perils of a public discourse which eschews the calm and careful discussion of ideas. PMID:24724542

  1. Evaluation of energy response of neutron rem monitor applied to high-energy accelerator facilities

    Energy Technology Data Exchange (ETDEWEB)

    Nakane, Yoshihiro; Harada, Yasunori; Sakamoto, Yukio [Japan Atomic Energy Research Inst., Tokai, Ibaraki (Japan). Tokai Research Establishment] [and others

    2003-03-01

    A neutron rem monitor was newly developed for applying to the high-intensity proton accelerator facility (J-PARC) that is under construction as a joint project between the Japan Atomic Energy Research Institute and the High Energy Accelerator Research Organization. To measure the dose rate accurately for wide energy range of neutrons from thermal to high-energy region, the neutron rem monitor was fabricated by adding a lead breeder layer to a conventional neutron rem monitor. The energy response of the monitor was evaluated by using neutron transport calculations for the energy range from thermal to 150 MeV. For verifying the results, the response was measured at neutron fields for the energy range from thermal to 65 MeV. The comparisons between the energy response and dose conversion coefficients show that the newly developed neutron rem monitor has a good performance in energy response up to 150 MeV, suggesting that the present study offered prospects of a practical fabrication of the rem monitor applicable to the high intensity proton accelerator facility. (author)

  2. Sugarcane Yield Response to Furrow-Applied Organic Amendments on Sand Soils

    Directory of Open Access Journals (Sweden)

    J. Mabry McCray

    2015-01-01

    Full Text Available Organic amendments have been shown to increase sugarcane yield on sand soils in Florida. These soils have very low water and nutrient-holding capacities because of the low content of organic matter, silt, and clay. Because of high costs associated with broadcast application, this field study was conducted to determine sugarcane yield response to furrow application of two organic amendments on sand soils. One experiment compared broadcast application (226 m3 ha−1) of mill mud and yard waste compost, furrow application (14, 28, and 56 m3 ha−1) of these materials, and no amendment. Another experiment compared furrow applications (28 and 56 m3 ha−1) of mill mud and yard waste compost with no amendment. There were significant yield (t sucrose ha−1) responses to broadcast and furrow-applied mill mud, but responses to furrow applications were not consistent across sites. There were no significant yield responses to yard waste compost, suggesting that higher rates or repeated applications of this amendment will be required to achieve results comparable to mill mud. Results also suggest that enhancing water and nutrient availability in the entire volume of the root zone with broadcast incorporation of organic amendments is the more effective approach for low organic matter sands.

  3. Methodological issues regarding power of classical test theory (CTT) and item response theory (IRT)-based approaches for the comparison of patient-reported outcomes in two groups of patients - a simulation study

    Directory of Open Access Journals (Sweden)

    Boyer François

    2010-03-01

    Full Text Available Abstract. Background: Patient-Reported Outcomes (PRO) are increasingly used in clinical and epidemiological research. Two main types of analytical strategies can be found for these data: classical test theory (CTT), based on the observed scores, and models coming from Item Response Theory (IRT). However, whether IRT or CTT would be the most appropriate method to analyse PRO data remains unknown. The statistical properties of CTT and IRT, regarding power and corresponding effect sizes, were compared. Methods: Two-group cross-sectional studies were simulated for the comparison of PRO data using IRT or CTT-based analysis. For IRT, different scenarios were investigated according to whether item or person parameters were assumed to be known (for item parameters, known to a certain extent, with precision ranging from good to poor) or unknown and therefore estimated. The powers obtained with IRT or CTT were compared, and the parameters having the strongest impact on them were identified. Results: When person parameters were assumed to be unknown and item parameters to be either known or not, the powers achieved using IRT or CTT were similar and always lower than the expected power using the well-known sample size formula for normally distributed endpoints. The number of items had a substantial impact on power for both methods. Conclusion: Without any missing data, IRT and CTT seem to provide comparable power. The classical sample size formula for CTT seems to be adequate under some conditions but is not appropriate for IRT. In IRT, it seems important to take account of the number of items to obtain an accurate formula.
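    The abstract refers to a well-known sample size formula for normally distributed endpoints without writing it out. The sketch below assumes it is the usual two-sided, two-group normal approximation, n per group = 2 (z_{1-alpha/2} + z_{1-beta})^2 / d^2, where d is the standardized effect size. That reading is an assumption, and the function names are hypothetical.

```python
import math
from statistics import NormalDist

_STD_NORMAL = NormalDist()

def expected_power(effect_size, n_per_group, alpha=0.05):
    """Approximate power of a two-sided two-group comparison of means
    (normal approximation, equal group sizes, opposite tail ignored)."""
    z_alpha = _STD_NORMAL.inv_cdf(1.0 - alpha / 2.0)
    shift = effect_size * math.sqrt(n_per_group / 2.0)
    return _STD_NORMAL.cdf(shift - z_alpha)

def required_n_per_group(effect_size, power=0.80, alpha=0.05):
    """Smallest group size reaching the target power under the same approximation."""
    z_alpha = _STD_NORMAL.inv_cdf(1.0 - alpha / 2.0)
    z_beta = _STD_NORMAL.inv_cdf(power)
    return math.ceil(2.0 * ((z_alpha + z_beta) / effect_size) ** 2)

n = required_n_per_group(0.5)
print(n, round(expected_power(0.5, n), 3))
```

    The formula has no term for the number of items or for measurement error, which may help explain why the simulated powers reported in the abstract fell below the expected power.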

  4. Choice of optimal item response model for analysis of self-report questionnaire

    Institute of Scientific and Technical Information of China (English)

    周晶; 郭庆科

    2005-01-01

    log-likelihood with the fewest items whose mean square residual error was greater than 2, and the amount of test information was greater than that provided by the one-parameter model and no less than that of the three-parameter model. Therefore, the two-parameter logistic model was the best for the 2-point scoring model. However, the measurement precision of the two-parameter logistic model was lower than that of the multi-grade response model. CONCLUSION: When 2-point items are adopted in a self-report questionnaire, the 2-parameter logistic model can be applied, but not the 1- or 3-parameter logistic models. When the questionnaire uses items that have more than 2 response grades, the measurement precision can be better than that of 2-point data. Merging the response options of items may lower measurement precision.

  5. Modeling Item Response Times with a Two-State Mixture Model: A New Approach to Measuring Speededness. Law School Admission Council Computerized Testing Report. LSAC Research Report Series.

    Science.gov (United States)

    Schnipke, Deborah L.; Scrams, David J.

    Speededness refers to the extent to which time limits affect test takers' performance. With regard to the Law School Admission Test (LSAT), speededness is currently measured by calculating the proportion of test takers who do not reach each item on the test. These proportions typically increase slightly toward the end of the test, indicating that…

  6. Optimal Test Design with Rule-Based Item Generation

    Science.gov (United States)

    Geerlings, Hanneke; van der Linden, Wim J.; Glas, Cees A. W.

    2013-01-01

    Optimal test-design methods are applied to rule-based item generation. Three different cases of automated test design are presented: (a) test assembly from a pool of pregenerated, calibrated items; (b) test generation on the fly from a pool of calibrated item families; and (c) test generation on the fly directly from calibrated features defining…

  7. Asymmetric Response of Ferroelastic Domain-Wall Motion under Applied Bias.

    Science.gov (United States)

    Jablonski, Michael L; Liu, Shi; Winkler, Christopher R; Damodaran, Anoop R; Grinberg, Ilya; Martin, Lane W; Rappe, Andrew M; Taheri, Mitra L

    2016-02-10

    The switching of domains in ferroelectric and multiferroic materials plays a central role in their application to next-generation computer systems, sensing applications, and memory storage. A detailed understanding of the response to electric fields and the switching behavior in the presence of complex domain structures and extrinsic effects (e.g., defects and dislocations) is crucial for the design of improved ferroelectrics. In this work, in situ transmission electron microscopy is coupled with atomistic molecular dynamics simulations to explore the response of 71° ferroelastic domain walls in BiFeO3 with various orientations under applied electric-field excitation. We observe that 71° domain walls can have intrinsically asymmetric responses to opposing biases. In particular, when the electric field has a component normal to the domain wall, forward and backward domain-wall velocities can be dramatically different for equal and opposite fields. Additionally, the presence of defects and dislocations can strongly affect the local switching behaviors through pinning or nucleation of the domain walls. These results offer insight for controlled ferroelastic domain manipulation via electric-field engineering. PMID:26695346

  8. Sliding Wear Response of a Bronze Bushing: Influence of Applied Load and Test Environment

    Science.gov (United States)

    Prasad, B. K.

    2012-10-01

    This investigation pertains to the examination of the sliding wear behavior of a leaded-tin bronze bushing under the conditions of varying applied loads and test environments against a steel shaft. The test environment was changed by adding 5% of solid lubricants like talc and lead to an oil lubricant separately as well as in combination; the fraction of the two (solid) lubricants within the solid lubricant mixture was varied in the range of 25-75% in the latter case. The wear performance of the bushing was characterized in terms of the wear rate, frictional heating, and friction coefficient. The increasing load led to deterioration in the wear response, while the addition of the solid lubricant particles produced a reverse effect. Further, an appreciable difference in the wear behavior was not observed when the tests were conducted in the oil plus talc and oil plus lead lubricant mixtures. However, the oil containing lead and talc together brought about a significant improvement in the wear response; best results were obtained in the case of the lubricant mixture consisting of lead and talc together in the ratio of 3:1 in the oil. The observed wear behavior of the samples has been discussed in terms of specific characteristics of various microconstituents. The features of the wear surfaces and subsurface regions further substantiated the wear response and enabled us to understand the operating material removal mechanisms.

  9. Responses of mink to auditory stimuli: Prerequisites for applying the ‘cognitive bias’ approach

    DEFF Research Database (Denmark)

    Svendsen, Pernille Maj; Malmkvist, Jens; Halekoh, Ulrich;

    2012-01-01

    The aim of the study was to determine and validate prerequisites for applying a cognitive (judgement) bias approach to assessing welfare in farmed mink (Neovison vison). We investigated discrimination ability and associative learning ability using auditory cues. The mink (n = 15 females) were...... mink only showed habituation in experiment 2. Regardless of the frequency used (2 and 18 kHz), cues predicting the danger situation initially elicited slower responses compared to those predicting the safe situation but quickly became faster. Using auditory cues as discrimination stimuli for female...... farmed mink in a judgement bias approach would thus appear to be feasible. However several specific issues are to be considered in order to successfully adapt a cognitive bias approach to mink, and these are discussed....

  10. Responses of Greenhouse Tomato and Pepper Yields and Nitrogen Dynamics to Applied Compound Fertilizers

    Institute of Scientific and Technical Information of China (English)

    ZHU Jian-Hua; LI Xiao-Lin; ZHANG Fu-Suo; LI Jun-Liang; P.CHRISTIE

    2004-01-01

    Yield and N uptake of tomato (Lycopersicum esculentum Mill.) and pepper (Capsicum annuum L.) crops in five successive rotations receiving two compound fertilizers (12-12-17 and 21-8-11 N-P2O5-K2O) were studied to determine 1) crop responses, 2) dynamics of NO3-N and NH4-N in different soil layers, 3) N balance and 4) system-level N efficiencies. Five treatments (2 fertilizers, 2 fertilizer rates and a control), each with three replicates, were arranged in the study. The higher N fertilizer rate, 300 kg N ha-1 (versus 150 kg N ha-1), returned higher vegetable fruit yields and total aboveground N uptake, with the largest crop responses occurring for the low-N fertilizer (12-12-17) applied at 300 kg N ha-1 rather than with the high-N fertilizer (21-8-11). Ammonium-N in the top 90 cm of the soil profile declined during the experiment, while nitrate-N remained at a similar level throughout the experiment with the lower rate of fertilizer N. At the higher rate of N fertilizer there was a continuous NO3-N accumulation of over 800 kg N ha-1. About 200 kg N ha-1 was applied with irrigation to each crop using NO3-contaminated groundwater. In general, about 50% of the total N input was recovered from all treatments. Pepper, relative to tomato, used N more efficiently with smaller N losses, but the crops utilized less than 29% of the fertilizer N over the two and a half-year period. Local agricultural practices maintained high residual soil nutrient status. Thus, optimization of irrigation is required to minimize nitrate leaching and maximize crop N recovery.

  11. A Stepwise Test Characteristic Curve Method to Detect Item Parameter Drift

    Science.gov (United States)

    Guo, Rui; Zheng, Yi; Chang, Hua-Hua

    2015-01-01

    An important assumption of item response theory is item parameter invariance. Sometimes, however, item parameters are not invariant across different test administrations due to factors other than sampling error; this phenomenon is termed item parameter drift. Several methods have been developed to detect drifted items. However, most of the…

  12. Was Kiobel Detrimental to Corporate Social Responsibility? Applying Lessons Learnt From American Exceptionalism

    Directory of Open Access Journals (Sweden)

    Benjamin Thompson

    2014-02-01

    The recent decision in the US Supreme Court Kiobel case applied the presumption against extraterritoriality towards the Alien Tort Statute, decreasing the potential scope of tort actions that can be made against corporations for severe human rights violations. In light of the growing influence of multinational corporations and the lack of any international law regime to regulate corporate wrongdoing, this decision might be seen as a blow against one of the few potential avenues for justice for those victims of corporate human rights violations. The Alien Tort Statute is not a jurisdictional statute that allows for claims under international law but is rather a uniquely American cause of action unconnected to international law. The question remains whether an extension of American law to provide remedies for severe corporate human rights abuses can be justified in the absence of any such remedies existent in international law. This article will attempt to answer this question applying criteria developed by leading scholars in response to American exceptionalism. It will argue that the Kiobel decision, rather than being detrimental to holding corporations accountable, actually addresses many of the negative aspects of extraterritorial litigation whilst preserving some possibility of remedy for victims of severe human rights violations by corporations.

  13. Curriculum, Translation, and Differential Functioning of Measurement and Geometry Items

    Science.gov (United States)

    Emenogu, Barnabas C.; Childs, Ruth A.

    2005-01-01

    A test item exhibits differential item functioning (DIF) if students with the same ability find it differentially difficult. When the item is administered in French and English, differences in language difficulty and meaning are the most likely explanations. However, curriculum differences may also contribute to DIF. The responses of Ontario…

  14. Privacy concerns in responses to sensitive questions: a survey experiment on the influence of numeric codes on unit nonresponse, item nonresponse, and misreporting

    OpenAIRE

    Bader, Felix; Bauer, Johannes; Kroher, Martina; Riordan, Patrick

    2016-01-01

    "Paper-and-pencil surveys are a widely used method for gaining data. Numeric codes printed on the questionnaire are often a prerequisite for the use of scan software, which, in turn, permits a fast and efficient entering of the data from such surveys. However, printed numbers used for optical mark recognition on a questionnaire can provoke concerns about anonymity that may lead to unit nonresponse, item nonresponse, and misreporting. To test this, we conducted an experiment in a mail survey o...

  15. Hormonal and neuromuscular responses to mechanical vibration applied to upper extremity muscles.

    Directory of Open Access Journals (Sweden)

    Riccardo Di Giminiani

    OBJECTIVE: To investigate the acute residual hormonal and neuromuscular responses exhibited following a single session of mechanical vibration applied to the upper extremities among different acceleration loads. METHODS: Thirty male students were randomly assigned to a high vibration group (HVG), a low vibration group (LVG), or a control group (CG). A randomized double-blind, controlled-parallel study design was employed. The measurements and interventions were performed at the Laboratory of Biomechanics of the University of L'Aquila. The HVG and LVG participants were exposed to a series of 20 trials × 10 s of synchronous whole-body vibration (WBV) with a 10-s pause between each trial and a 4-min pause after the first 10 trials. The CG participants assumed an isometric push-up position without WBV. The outcome measures were growth hormone (GH), testosterone, maximal voluntary isometric contraction (MVC) during bench-press, maximal voluntary isometric contraction during handgrip, and electromyography root-mean-square (EMGrms) muscle activity (pectoralis major [PM], triceps brachii [TB], anterior deltoid [DE], and flexor carpi radialis [FCR]). RESULTS: The GH increased significantly over time only in the HVG (P = 0.003). Additionally, the testosterone levels changed significantly over time in the LVG (P = 0.011) and the HVG (P = 0.001). MVC during bench press decreased significantly in the LVG (P = 0.001) and the HVG (P = 0.002). In the HVG, the EMGrms decreased significantly in the TB (P = 0.006) muscle. In the LVG, the EMGrms decreased significantly in the DE (P = 0.009) and FCR (P = 0.006) muscles. CONCLUSION: Synchronous WBV acutely increased GH and testosterone serum concentrations and decreased the MVC and their respective maximal EMGrms activities, which indicated a possible central fatigue effect. Interestingly, only the GH response was dependent on the acceleration with respect to the subjects' responsiveness.

  16. Modelización de una Prueba de Analogías Figurales con la Teoría de Respuesta al Ítem / Modelling Figural Analogies Test with the Item Response Theory

    Directory of Open Access Journals (Sweden)

    G. Diego Blum

    2011-12-01

    The psychometric properties of a Figural Analogies Test are described within the framework of Item Response Theory. Thirty-six 2x2 matrix figures were constructed by using location, distortion and number rules. The sample included 499 psychology students from the University of Buenos Aires, 79% of whom were women. The 3-Parameter Logistic Model was used, obtaining a highly satisfactory global fit at 5% (p = .47). Only 3 items did not fit the model. It had good overall discriminatory power (a: M = 1.02, SD = .33), a medium level of difficulty (b: M = -.03, SD = .63), and the c level was slightly lower than expected with six possible answers (c: M = .14, SD = .05). The conditions for modelling the test and possible disadvantages of the present study are discussed.
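
    As a rough illustration of what the a, b and c parameters mean, the sketch below evaluates a 3-Parameter Logistic item response function at the mean values reported in the abstract. It assumes the plain logistic metric (no D = 1.7 scaling constant), which the abstract does not specify; the function name `p_correct_3pl` is hypothetical and this is not the authors' code.

```python
import math

def p_correct_3pl(theta, a, b, c):
    """Probability of a correct answer under the 3PL model.

    theta: examinee ability, a: discrimination, b: difficulty,
    c: lower asymptote (pseudo-guessing). Assumes no D = 1.7 scaling.
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

# Mean item parameters quoted in the abstract (a, b, c).
A_MEAN, B_MEAN, C_MEAN = 1.02, -0.03, 0.14

if __name__ == "__main__":
    for theta in (-2.0, 0.0, 2.0):
        p = p_correct_3pl(theta, A_MEAN, B_MEAN, C_MEAN)
        print(f"theta={theta:+.1f}  P(correct)={p:.3f}")
```

    For very low abilities the curve flattens out at c, which is why a mean c of .14 can be compared with the 1/6 chance level implied by six answer options.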

  17. A Comparison of Anchor-Item Designs for the Concurrent Calibration of Large Banks of Likert-Type Items

    Science.gov (United States)

    Garcia-Perez, Miguel A.; Alcala-Quintana, Rocio; Garcia-Cueto, Eduardo

    2010-01-01

    Current interest in measuring quality of life is generating interest in the construction of computerized adaptive tests (CATs) with Likert-type items. Calibration of an item bank for use in CAT requires collecting responses to a large number of candidate items. However, the number is usually too large to administer to each subject in the…

  18. An emotional functioning item bank of 24 items for computerized adaptive testing (CAT) was established

    DEFF Research Database (Denmark)

    Petersen, Morten Aa.; Gamper, Eva-Maria; Costantini, Anna;

    2016-01-01

    OBJECTIVE: To improve measurement precision, the European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group is developing an item bank for computerized adaptive testing (CAT) of emotional functioning (EF). The item bank will be within the conceptual framework...... international sample of cancer patients. This included evaluations of dimensionality, item response theory (IRT) model fit, differential item functioning (DIF), and of measurement precision/statistical power. RESULTS: Responses were obtained from 1,023 cancer patients from four countries. The evaluations showed...... that 24 items could be included in a unidimensional IRT model. DIF did not seem to have any significant impact on the estimation of EF. Evaluations indicated that the CAT measure may reduce sample size requirements by up to 50% compared to the QLQ-C30 EF scale without reducing power. CONCLUSION...

  19. The Role of Item Models in Automatic Item Generation

    Science.gov (United States)

    Gierl, Mark J.; Lai, Hollis

    2012-01-01

    Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…

  20. Avertable dose intervention applied in emergency response dose evaluation system for nuclear emergency preparedness in Taiwan

    International Nuclear Information System (INIS)

    In Taiwan, new guides for public protective actions in a nuclear emergency were laid down by the Atomic Energy Council (AEC) of the Executive Yuan, Taiwan, ROC, on July 15th, 2005. The main modification in the guides is that the avertable dose is applied as the intervention level and used to suggest public protective actions. The emergency response dose evaluation system named RPDOSE, which was developed in 2005, was employed in this work to enhance the capability of avertable dose evaluation for the villages in the emergency planning zone (EPZ). The period of the long-term weather forecasting data was extended from 4 to 8 days to satisfy the requirement of avertable dose computing. According to the intervention levels, the RPDOSE system is used to calculate the avertable dose and suggest appropriate public protective actions such as sheltering, evacuation or iodine prophylaxis, as well as the proposed acting times for each village in the EPZ. This system was employed and examined in the annual nuclear emergency exercise of 2008 at the Maanshan nuclear power plant.

  1. Evaluation of Item Candidates: The PROMIS Qualitative Item Review

    OpenAIRE

    DeWalt, Darren A.; Rothrock, Nan; Yount, Susan; Stone, Arthur A.

    2007-01-01

    One of the PROMIS (Patient-Reported Outcome Measurement Information System) network's primary goals is the development of a comprehensive item bank for patient-reported outcomes of chronic diseases. For its first set of item banks, PROMIS chose to focus on pain, fatigue, emotional distress, physical function, and social function. An essential step for the development of an item pool is the identification, evaluation, and revision of extant questionnaire items for the core item pool. In this w...

  2. Consumer satisfaction and item response theory: creating a measurement scale / Avaliação do nível de satisfação de alunos de uma instituição de ensino superior: uma aplicação da teoria da resposta ao item

    Directory of Open Access Journals (Sweden)

    Silvana Ligia Vincenzi Bortolotti

    2012-01-01

    Today, people have increasingly demanded more from the state and enterprises. Consumer satisfaction is not an organizational option, but rather a matter of survival for any institution. The quest for measurement of consumer satisfaction has been ongoing in many areas of research, and researchers have concentrated efforts to demonstrate the psychometric quality of their measurements. However, the techniques employed by these commitments have not kept pace with the advances in psychometric theory and methods. Item Response Theory (IRT) is an approach used for assessing latent traits. It is commonly used in educational and psychological tests and provides additional information beyond that obtained from classic psychometric techniques. This article presents an application of a cumulative item response theory model to measure the extent of students' satisfaction with their courses by creating a measurement scale. The Graded Response Model was used. The results demonstrate the effectiveness of this theory in measuring satisfaction since it places both items and individuals on the same scale. This theory may be valuable in the evaluation of customer satisfaction and many other organizational phenomena. The findings may help the decision maker of an enterprise with the correction of flows, processes, and procedures, and, consequently, it may help generate increased efficiency and effectiveness in daily tasks and in event management business. Finally, the information obtained from the analysis can play a role in the development and/or evaluation of institutional planning. [Translated from the Portuguese abstract:] This work addresses the use of Item Response Theory (IRT) as a tool for evaluating specific organizational aspects. The objective is to apply a cumulative IRT model to create a measure of students' satisfaction with their courses, also evaluating satisfaction with teaching and creating a measurement scale. Widely used in the educational and psychol

  3. SHIPPING OF RADIOACTIVE ITEMS

    CERN Multimedia

    TIS/RP Group

    2001-01-01

    The TIS-RP group informs users that shipping of small radioactive items is normally guaranteed within 24 hours from the time the material is handed in at the TIS-RP service. This time is imposed by the necessary procedures (identification of the radionuclides, determination of dose rate, preparation of the package and related paperwork). Large and massive objects require a longer procedure and will therefore take longer.

  4. Uma proposta de análise de um construto para medição dos fatores críticos da gestão pela qualidade por intermédio da Teoria da Resposta ao Item / A construct analysis proposal for measuring the total quality management critical factors through the item response theory

    Directory of Open Access Journals (Sweden)

    João Welliandre Carneiro Alexandre

    2002-08-01

    In this article we propose the use of Item Response Theory (IRT) models in the analysis of constructs designed to measure Total Quality Management (TQM), as an alternative to Classical Measurement Theory (CMT). The general model for dichotomized items is presented, together with interpretations of the model parameters. The results show that IRT can be a powerful tool in the analysis of TQM practices and of organizational maturity within the quality philosophy.

  5. Guideline Implementation: Prevention of Retained Surgical Items.

    Science.gov (United States)

    Fencl, Jennifer L

    2016-07-01

    A surgical item unintentionally retained in a patient after an operative or other invasive procedure is a serious, preventable medical error with the potential to cause the patient great harm. Perioperative RNs play a key role in preventing retained surgical items (RSIs). The updated AORN "Guideline for prevention of retained surgical items" provides guidance for implementing a consistent, multidisciplinary approach to RSI prevention; accounting for surgical items; preventing retention of device fragments; reconciling count discrepancies; and using adjunct technologies to supplement manual count procedures. This article focuses on key points of the guideline to help perioperative personnel provide optimal care during a procedure. Key points addressed include taking responsibility for RSI prevention as a team; minimizing distractions, noise, and interruptions during counts; using consistent counting methods; reconciling discrepancies; and participating in performance-improvement activities. Perioperative RNs should review the complete guideline for additional information and for guidance in writing and updating policies and procedures. PMID:27350354

  6. Criteria for eliminating items of a Test of Figural Analogies

    Directory of Open Access Journals (Sweden)

    Diego Blum

    2013-12-01

    This paper describes the steps taken to eliminate two of the items in a Test of Figural Analogies (TFA). The main guidelines of psychometric analysis concerning Classical Test Theory (CTT) and Item Response Theory (IRT) are explained. The item elimination process was based on both the study of the CTT difficulty and discrimination index, and the unidimensionality analysis. The a, b, and c parameters of the Three Parameter Logistic Model of IRT were also considered for this purpose, as well as the assessment of each item fitting this model. The unfavourable characteristics of a group of TFA items are detailed, and decisions leading to their possible elimination are discussed.

  7. Cotton response to poultry litter applied by subsurface banding relative to surface broadcasting

    Science.gov (United States)

    Dry poultry litter is typically land-applied by surface broadcasting, a practice that exposes certain litter nutrients to volatilization loss. Applying litter with a new, experimental implement that places the litter in narrow bands below the soil surface may reduce or eliminate such losses but has...

  8. 'Forget me (not)?' - Remembering Forget-Items Versus Un-Cued Items in Directed Forgetting.

    Science.gov (United States)

    Zwissler, Bastian; Schindler, Sebastian; Fischer, Helena; Plewnia, Christian; Kissler, Johanna M

    2015-01-01

    Humans need to be able to selectively control their memories. This capability is often investigated in directed forgetting (DF) paradigms. In item-method DF, individual items are presented and each is followed by either a forget- or remember-instruction. On a surprise test of all items, memory is then worse for to-be-forgotten items (TBF) compared to to-be-remembered items (TBR). This is thought to result mainly from selective rehearsal of TBR, although inhibitory mechanisms also appear to be recruited by this paradigm. Here, we investigate whether the mnemonic consequences of a forget instruction differ from the ones of incidental encoding, where items are presented without a specific memory instruction. Four experiments were conducted where un-cued items (UI) were interspersed and recognition performance was compared between TBR, TBF, and UI stimuli. Accuracy was encouraged via a performance-dependent monetary bonus. Experiments varied the number of items and their presentation speed and used either letter-cues or symbolic cues. Across all experiments, including perceptually fully counterbalanced variants, memory accuracy for TBF was reduced compared to TBR, but better than for UI. Moreover, participants made consistently fewer false alarms and used a very conservative response criterion when responding to TBF stimuli. Thus, the F-cue results in active processing and reduces false alarm rate, but this does not impair recognition memory beyond an un-cued baseline condition, where only incidental encoding occurs. Theoretical implications of these findings are discussed. PMID:26635657

  9. SHIPPING OF RADIOACTIVE ITEMS

    CERN Multimedia

    TIS/RP Group

    2001-01-01

    The TIS-RP group informs users that shipping of small radioactive items is normally guaranteed within 24 hours from the time the material is handed in at the TIS-RP service. This time is imposed by the necessary procedures (identification of the radionuclides, determination of dose rate, preparation of the package and related paperwork). Large and massive objects require a longer procedure and will therefore take longer.

  10. Data Visualization of Item-Total Correlation by Median Smoothing

    Science.gov (United States)

    Yu, Chong Ho; Douglas, Samantha; Lee, Anna; An, Min

    2016-01-01

    This paper aims to illustrate how data visualization could be utilized to identify errors prior to modeling, using an example with multi-dimensional item response theory (MIRT). MIRT combines item response theory and factor analysis to identify a psychometric model that investigates two or more latent traits. While it may seem convenient to…

  11. Effects of Reusing Baseline Volumes of Interest by Applying (Non-)Rigid Image Registration on Positron Emission Tomography Response Assessments

    OpenAIRE

    van Velden, Floris H. P.; Ida A Nissen; Wendy Hayes; Velasquez, Linda M; Hoekstra, Otto S.; Ronald Boellaard

    2014-01-01

    OBJECTIVES: Reusing baseline volumes of interest (VOI) by applying non-rigid and to some extent (local) rigid image registration showed good test-retest variability similar to delineating VOI on both scans individually. The aim of the present study was to compare response assessments and classifications based on various types of image registration with those based on (semi)-automatic tumour delineation. METHODS: Baseline (n = 13), early (n = 12) and late (n = 9) response (after one and three ...

  12. Analyzing the Role of Corporate Social Responsibility on Reputation Building and Image Formation : Case Lapland University of Applied Sciences

    OpenAIRE

    Ghiyaei, Parisa

    2014-01-01

    Today Corporate Social Responsibility (henceforth CSR) is gaining increasing importance among individuals and corporations. CSR is an umbrella term which covers all responsibilities related to the economy, environment, society and ethics. Concepts such as green marketing and sustainability have recently been added under the umbrella of CSR. Lapland University of Applied Sciences (hereinafter Lapland UAS) is interested in building reputation. Therefore, the main objective of the current rese...

  13. Applying international standards and guidelines on corporate social responsibility: An action plan

    NARCIS (Netherlands)

    Cramer, J.M.

    2005-01-01

    How can a company start the process of corporate social responsibility in an international context, thereby making use of diverse standards and guidelines? This question emerged immediately after the start of the programme ‘Corporate social responsibility in international context’

  14. Physics Items and Student's Performance at Enem

    CERN Document Server

    Gonçalves, Wanderley P

    2013-01-01

    The Brazilian National Assessment of Secondary Education (ENEM, Exame Nacional do Ensino Médio) changed in 2009: from a self-assessment of competences at the end of high school to an assessment that allows access to college and student financing. Instead of a single general exam, there are now tests in four areas: Mathematics, Language, Natural Sciences and Social Sciences. A new Reference Matrix was built with components such as cognitive domains, competencies, skills and knowledge objects; the methodological framework has also changed, now using Item Response Theory to provide scores and allow longitudinal comparison of results from different years, providing conditions for monitoring high school quality in Brazil. We present a study on the issues discussed in the Natural Science Test of ENEM over the years 2009, 2010 and 2011. Qualitative variables are proposed to characterize the items, and data from students' responses to Physics items were analysed. The qualitative analysis reveals the characteristics of the ...

  15. Bayesian item selection in constrained adaptive testing using shadow tests

    OpenAIRE

    Bernard P. Veldkamp

    2010-01-01

    Application of Bayesian item selection criteria in computerized adaptive testing might result in improvement of bias and MSE of the ability estimates. The question remains how to apply Bayesian item selection criteria in the context of constrained adaptive testing, where large numbers of specifications have to be taken into account in the item selection process. The Shadow Test Approach is a general purpose algorithm for administering constrained CAT. In this paper it is shown how the approac...

  16. Item Analysis and Differential Item Functioning of a Brief Conduct Problem Screen

    OpenAIRE

    Wu, Johnny; King, Kevin M; Racz, Sarah Jensen; Witkiewitz, Katie; McMahon, Robert J.

    2011-01-01

    Research has shown that boys display higher levels of childhood conduct problems than girls, and Black children display higher levels than White children, but few studies have tested for scalar equivalence of conduct problems across gender and race. The authors conducted a 2-parameter item response theory (IRT) model to examine item characteristics of the Authority Acceptance scale from the Teacher Observation of Classroom Adaptation-Revised (AA-TOCA-R; L. Larsson-Werthamer, S. G. Kellam, & L...

  17. A Comparison of Mantel-Haenszel Differential Item Functioning Parameters. LSAC Research Report Series.

    Science.gov (United States)

    Schnipke, Deborah L.; Roussos, Louis A.; Pashley, Peter J.

    Differential item functioning (DIF) analyses are conducted to investigate how items function in various subgroups. The Mantel-Haenszel (MH) DIF statistic is used at the Law School Admission Council and other testing companies. When item functioning can be well-described in terms of a one- or two-parameter logistic item response theory (IRT) model…
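
    For readers unfamiliar with the statistic, the following is a minimal sketch of the standard Mantel-Haenszel common odds ratio and its transformation to the ETS delta scale (MH D-DIF = -2.35 ln alpha). It is not the procedure of any particular testing company; the function name and the example counts are hypothetical, and each stratum is assumed to hold examinees matched on total score.

```python
import math

def mantel_haenszel_dif(strata):
    """Mantel-Haenszel DIF for one item.

    strata: iterable of (ref_right, ref_wrong, focal_right, focal_wrong)
    counts, one tuple per matched score level.
    Returns (alpha_mh, mh_d_dif). Assumes at least one stratum has
    ref_wrong * focal_right > 0, otherwise the ratio is undefined.
    """
    num = 0.0
    den = 0.0
    for ref_right, ref_wrong, focal_right, focal_wrong in strata:
        n = ref_right + ref_wrong + focal_right + focal_wrong
        if n == 0:
            continue  # empty score level contributes nothing
        num += ref_right * focal_wrong / n
        den += ref_wrong * focal_right / n
    alpha = num / den
    # Negative MH D-DIF means the item is harder for the focal group.
    return alpha, -2.35 * math.log(alpha)

if __name__ == "__main__":
    # Hypothetical counts for three score levels (illustration only).
    example = [(30, 20, 20, 30), (40, 10, 30, 20), (45, 5, 40, 10)]
    alpha, d_dif = mantel_haenszel_dif(example)
    print(f"alpha_MH={alpha:.3f}  MH D-DIF={d_dif:.3f}")
```

    An alpha of 1 (MH D-DIF of 0) indicates that matched reference and focal examinees have the same odds of answering the item correctly.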

  18. IRT Item Parameters and the Reliability and Validity of Pretest, Posttest, and Gain Scores

    Science.gov (United States)

    May, Kim; Jackson, Tameika S.

    2005-01-01

    The effect of different combinations of item response theory (IRT) item parameters (item difficulty, item discrimination, and the guessing probability) on the reliability and construct validity (correlation with the latent trait being measured) of pretest, posttest, and gain scores was analytically examined using the 3-parameter logistic (3PL)…

  19. Intentional Forgetting Reduces Color-Naming Interference: Evidence from Item-Method Directed Forgetting

    Science.gov (United States)

    Lee, Yuh-shiow; Lee, Huang-mou; Fawcett, Jonathan M.

    2013-01-01

    In an item-method-directed forgetting task, Chinese words were presented individually, each followed by an instruction to remember or forget. Colored probe items were presented following each memory instruction requiring a speeded color-naming response. Half of the probe items were novel and unrelated to the preceding study item, whereas the…

  20. New technologies for item monitoring

    International Nuclear Information System (INIS)

    This report responds to the Department of Energy's request that Sandia National Laboratories compare existing technologies against several advanced technologies as they apply to DOE needs to monitor the movement of material, weapons, or personnel for safety and security programs. The authors describe several material control systems, discuss their technologies, suggest possible applications, discuss assets and limitations, and project costs for each system. The following systems are described: WATCH system (Wireless Alarm Transmission of Container Handling); Tag system (an electrostatic proximity sensor); PANTRAK system (Personnel And Material Tracking); VRIS (Vault Remote Inventory System); VSIS (Vault Safety and Inventory System); AIMS (Authenticated Item Monitoring System); EIVS (Experimental Inventory Verification System); Metrox system (canister monitoring system); TCATS (Target Cueing And Tracking System); LGVSS (Light Grid Vault Surveillance System); CSS (Container Safeguards System); SAMMS (Security Alarm and Material Monitoring System); FOIDS (Fiber Optic Intelligence & Detection System); GRADS (Graded Radiation Detection System); and PINPAL (Physical Inventory Pallet).

  1. Response surface method applied to optimization of estradiol permeation in chitosan membranes

    Indian Academy of Sciences (India)

    Luciano Mengatto; María I Cabrera; Julio A Luna

    2012-06-01

    The present work deals with the study of estradiol permeation in chitosan membranes. A fractional factorial design was built for the determination of the main factors affecting estradiol permeation. The independent factors analysed were: concentration of chitosan, concentration of cross-linking agent, cross-linking time and thermal treatment. It was found that concentration of chitosan and cross-linking time significantly affected the response. The effects of thermal treatment and concentration of cross-linking agent were not significant. An optimization process based on response surface methodology was carried out in order to develop a statistical model which describes the relationship between active independent variables and estradiol flux. This model can be used to find out a combination of factor levels during response optimization. Possible options for response optimization are to maximize, minimize or move towards a target value.

  2. Applying international standards and guidelines on corporate social responsibility: An action plan

    OpenAIRE

    Cramer, J.M.

    2005-01-01

    How can a company start the process of corporate social responsibility in an international context, thereby making use of diverse standards and guidelines? This question emerged immediately after the start of the programme ‘Corporate social responsibility in international context’ of the National Initiative for Sustainable Development (NIDO), which runs from January 2003 to August 2005. The objective of this programme is to concretise, in cooperation with 22...

  3. Person Heterogeneity of the BDI-II-C and Its Effects on Dimensionality and Construct Validity: Using Mixture Item Response Models

    Science.gov (United States)

    Wu, Pei-Chen; Huang, Tsai-Wei

    2010-01-01

    This study applied the mixed Rasch model to investigate person heterogeneity of the Beck Depression Inventory-II-Chinese version (BDI-II-C) and its effects on dimensionality and construct validity. Person heterogeneity was reflected by two latent classes that differ qualitatively. Additionally, person heterogeneity adversely affected the…

  4. The Design and Implementation of Computer Adaptive Testing Based on Item Response Theory

    Institute of Scientific and Technical Information of China (English)

    姜霞; 张晖; 李波

    2014-01-01

    Computer adaptive testing based on Item Response Theory can choose questions of appropriate difficulty for each examinee according to his or her ability, so that different examinees receive different papers. It can therefore estimate examinees' abilities faster and more accurately. This paper describes the procedure of computerized adaptive testing. Key technologies such as model selection, item selection strategies and parameter estimation were studied and corresponding solutions were proposed, making possible the design and development of the computerized adaptive testing system.
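
    The item selection step of an adaptive test can be illustrated with a minimal sketch. The abstract does not say which model or selection rule the authors chose, so the two-parameter logistic (2PL) model and the maximum-information rule below are assumptions, and the function names and the example item bank are hypothetical.

```python
import math

def p_correct(theta, a, b):
    """2PL probability of a correct answer for ability theta (assumed model)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def item_information(theta, a, b):
    """Fisher information of a 2PL item at ability theta."""
    p = p_correct(theta, a, b)
    return a * a * p * (1.0 - p)

def select_next_item(theta, bank, administered):
    """Return the index of the unused item with the highest information.

    bank is a list of (a, b) pairs: discrimination and difficulty.
    administered is a set of item indices already shown to the examinee.
    """
    candidates = [i for i in range(len(bank)) if i not in administered]
    return max(candidates, key=lambda i: item_information(theta, *bank[i]))

# Hypothetical three-item bank: easy, medium and hard items.
bank = [(1.0, -2.0), (1.0, 0.0), (1.0, 2.0)]
next_item = select_next_item(0.0, bank, set())
```

    With equal discriminations, the rule picks the item whose difficulty is closest to the current ability estimate, which is why an adaptive test needs fewer items than a fixed paper to reach the same precision.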

  5. Conscientiousness in the workplace : Applying mixture IRT to investigate scalability and predictive validity

    NARCIS (Netherlands)

    Egberink, I.J.L.; Meijer, R.R.; Veldkamp, B.P.

    2010-01-01

    Mixture item response theory (IRT) models have been used to assess multidimensionality of the construct being measured and to detect different response styles for different groups. In this study a mixture version of the graded response model was applied to investigate scalability and predictive vali

  6. Safety Evaluation for Packaging (onsite) T Plant Canyon Items

    Energy Technology Data Exchange (ETDEWEB)

    OBRIEN, J.H.

    2000-07-14

    This safety evaluation for packaging (SEP) evaluates and documents the ability to safely ship mostly unique inventories of miscellaneous T Plant canyon waste items (T-P Items) encountered during the canyon deck clean off campaign. In addition, this SEP addresses contaminated items and material that may be shipped in a strong tight package (STP). The shipments meet the criteria for onsite shipments as specified by Fluor Hanford in HNF-PRO-154, Responsibilities and Procedures for all Hazardous Material Shipments.

  7. Safety Evaluation for Packaging (onsite) T Plant Canyon Items

    International Nuclear Information System (INIS)

    This safety evaluation for packaging (SEP) evaluates and documents the ability to safely ship mostly unique inventories of miscellaneous T Plant canyon waste items (T-P Items) encountered during the canyon deck clean off campaign. In addition, this SEP addresses contaminated items and material that may be shipped in a strong tight package (STP). The shipments meet the criteria for onsite shipments as specified by Fluor Hanford in HNF-PRO-154, Responsibilities and Procedures for all Hazardous Material Shipments

  8. 17 CFR 229.406 - (Item 406) Code of ethics.

    Science.gov (United States)

    2010-04-01

    ... 17 Commodity and Securities Exchanges 2 2010-04-01 2010-04-01 false (Item 406) Code of ethics. 229... 406) Code of ethics. (a) Disclose whether the registrant has adopted a code of ethics that applies to... code of ethics, explain why it has not done so. (b) For purposes of this Item 406, the term code...

  9. Bayesian item selection in constrained adaptive testing using shadow tests

    NARCIS (Netherlands)

    Veldkamp, Bernard P.

    2010-01-01

    Application of Bayesian item selection criteria in computerized adaptive testing might result in improvement of bias and MSE of the ability estimates. The question remains how to apply Bayesian item selection criteria in the context of constrained adaptive testing, where large numbers of specificati

  10. Bayesian Item Selection in Constrained Adaptive Testing Using Shadow Tests

    Science.gov (United States)

    Veldkamp, Bernard P.

    2010-01-01

    Application of Bayesian item selection criteria in computerized adaptive testing might result in improvement of bias and MSE of the ability estimates. The question remains how to apply Bayesian item selection criteria in the context of constrained adaptive testing, where large numbers of specifications have to be taken into account in the item…

  11. Calibrating well-being, quality of life and common mental disorder items: psychometric epidemiology in public mental health research

    Science.gov (United States)

    Böhnke, Jan R.; Croudace, Tim J.

    2016-01-01

    Background: The assessment of ‘general health and well-being’ in public mental health research stimulates debates around relative merits of questionnaire instruments and their items. Little evidence regarding alignment or differential advantages of instruments or items has appeared to date. Aims: Population-based psychometric study of items employed in public mental health narratives. Method: Multidimensional item response theory was applied to General Health Questionnaire (GHQ-12), Warwick-Edinburgh Mental Well-being Scale (WEMWBS) and EQ-5D items (Health Survey for England, 2010–2012; n = 19 290). Results: A bifactor model provided the best account of the data and showed that the GHQ-12 and WEMWBS items assess mainly the same construct. Only one item of the EQ-5D showed relevant overlap with this dimension (anxiety/depression). Findings were corroborated by comparisons with alternative models and cross-validation analyses. Conclusions: The consequences of this lack of differentiation (GHQ-12 v. WEMWBS) for mental health and well-being narratives deserve discussion to enrich debates on priorities in public mental health and its assessment. PMID:26635327

  12. Radiation-therapeutic properties of living and killed antitularensis vaccine applied after polysaccharide modulation of immune response

    International Nuclear Information System (INIS)

    The popular conception of modern immunology that the most reliable remedy against tularensis infection is the living antitularensis vaccine is considered. Similar conception exists in the radiobiology, where the radioresistance enhancement with different biological response modifiers is mainly due to the stimulation of the cell mediated immune protection. The comparative study is performed of the radiotherapeutic features of living and killed antitularensis vaccines, applied after polysaccharide stimulation. It is established that the best effect has been observed with the killed antitularensis vaccine applied 14 days before gamma irradiation (cobalt-60, 6.8 Gy). (author)

  13. Donor impurity states and related optical response in a lateral coupled dot-ring system under applied electric field

    Energy Technology Data Exchange (ETDEWEB)

    Correa, J.D. [Departamento de Ciencias Básicas, Universidad de Medellín, Medellín (Colombia); Mora-Ramos, M.E. [Centro de Investigación en Ciencias, Instituto de Ciencias Básicas y Aplicadas, Universidad Autonoma del Estado de Morelos, Av. Universidad 1001, CP 62209 Cuernavaca, Morelos (Mexico); Duque, C.A., E-mail: cduque@fisica.udea.edu.co [Grupo de Materia Condensada-UdeA, Instituto de Física, Facultad de Ciencias Exactas y Naturales, Universidad de Antioquia UdeA, Calle 70 No. 52-21, Medellín (Colombia)

    2015-09-01

    A study on the effects of an externally applied electric field on the linear optical absorption and relative refractive index change associated with transitions between off-center donor impurity states in a laterally coupled quantum dot-ring system is reported. Electron states are calculated within the effective mass and parabolic band approximations by means of an exact diagonalization procedure. The states and the optical response in each case show significant sensitivity to the geometrical distribution of confining energies as well as to the strength of the applied field.

  14. AQUATIC ANIMAL RESPIRATION AND COUGH RESPONSE APPLIED TO INNOVATIVE ENVIRONMENTAL BIOMONITORING: A BIBLIOGRAPHY

    Science.gov (United States)

    This bibliography encompasses a body of in-depth technical information on the mechanics and physiology of respiration in aquatic animals (vertebrate and invertebrate). In compiling the bibliography, special emphasis was given to identifying studies that deal with responses of thi...

  15. GENDER DIFFERENCES IN APPLYING COMPLIMENTS AND COMPLIMENT RESPONSES IN CHINESE CONTEXT

    Institute of Scientific and Technical Information of China (English)

    OuanLihong

    2004-01-01

    The previous research done by the author shows that there exist significant differences between men and women in their realization patterns of compliments and compliment responses. These differences are reflected in the strategies used in complimenting and responding to compliments. Generally, women tend to use more polite strategies than men do. This article will explore these differences from both social and cultural perspectives.

  16. The household responsibility system and social change in rural Guizhou, China: applying a cohort approach

    NARCIS (Netherlands)

    Yuan, J.

    2010-01-01

    Since the introduction of the Household Responsibility System (HRS) in 1978, Chinese rural households have experienced many changes. The HRS allows farming households to organize their own agricultural production on contracted lands, enabling them to work more efficiently and get more benefits compa

  17. Dose response of selected ion chambers in applied homogeneous transverse and longitudinal magnetic fields

    International Nuclear Information System (INIS)

    Purpose: The magnetic fields of an integrated MR-Linac system will alter the paths of electrons that produce ions in the ionization chambers. The dose response of selected ion chambers is evaluated in the presence of varying transverse and longitudinal magnetic fields. The investigation is useful in calibration of therapeutic x-ray beams associated with MR-Linac systems. Methods: The Monte Carlo code PENELOPE was used to model the irradiation of NE2571 and PR06C ionization chambers in the presence of transverse and longitudinal (with respect to the photon beam) magnetic fields of varying magnitude. The long axis of each chamber was simulated both parallel and perpendicular to the incident photon beam for each magnetic field case. The dose deposited in each chamber for each case was compared to the case with zero magnetic field by means of a ratio. The PR06C chamber's response was measured in the presence of a transverse magnetic field with field strengths ranging from 0.0 to 0.2 T to compare to simulated results. Results: The simulations and measured data show that in the presence of a transverse magnetic field there is a considerable dose response (maximum of 11% near 1.0 T) in the ion chambers investigated, which depends on the magnitude of magnetic field, and relative orientation of the magnetic field, radiation beam, and ion chamber. Measurements made with the PR06C chamber verify these results in the region of measurement. In contrast, a longitudinal magnetic field produces only a slight increase in dose response (2% at 1.5 T) that rises slowly with increasing magnetic field and is seemingly independent of chamber orientation. Response trends were similar for the two ion chambers and relative orientations considered, but slight variations are present from chamber to chamber. Conclusions: Care must be taken when making ion chamber measurements in a transverse magnetic field. Ion chamber responses vary not only with transverse field strength, but with chamber

  18. Dose response of selected solid state detectors in applied homogeneous transverse and longitudinal magnetic fields

    International Nuclear Information System (INIS)

    Purpose: MR-Linac devices under development worldwide will require standard calibration, commissioning, and quality assurance. Solid state radiation detectors are often used for dose profiles and percent depth dose measurements. The dose response of selected solid state detectors is therefore evaluated in varying transverse and longitudinal magnetic fields for this purpose. Methods: The Monte Carlo code PENELOPE was used to model irradiation of a PTW 60003 diamond detector and IBA PFD diode detector in the presence of a magnetic field. The field itself was varied in strength, and oriented both transversely and longitudinally with respect to the incident photon beam. The long axis of the detectors was oriented either parallel or perpendicular to the photon beam. The dose to the active volume of each detector in air was scored, and its ratio to dose with zero magnetic field strength was determined as the “dose response” in magnetic field. Measurements at low fields for both detectors in transverse magnetic fields were taken to evaluate the accuracy of the simulations. Additional simulations were performed in a water phantom to obtain a few representative points for beam profile and percent depth dose measurements. Results: Simulations show significant dose response as a function of magnetic field in transverse field geometries. This response can be near 20% at 1.5 T, and it is highly dependent on the detectors’ relative orientation to the magnetic field, the energy of the photon beam, and detector composition. Measurements at low transverse magnetic fields verify the simulations for both detectors in their relative orientations to radiation beam. Longitudinal magnetic fields, in contrast, show little dose response, rising slowly with magnetic field, and reaching 0.5%–1% at 1.5 T regardless of detector orientation. Water tank and in-air simulation results were the same within simulation uncertainty where lateral electronic equilibrium is present and expectedly

  19. Item analysis of in use multiple choice questions in pharmacology

    Science.gov (United States)

    Kaur, Mandeep; Singla, Shweta; Mahajan, Rajiv

    2016-01-01

    Background: Multiple choice questions (MCQs) are a common method of assessment of medical students. The quality of MCQs is determined by three parameters: difficulty index (DIF I), discrimination index (DI), and distracter efficiency (DE). Objectives: The objective of this study is to assess the quality of MCQs currently in use in pharmacology and discard the MCQs which are not found useful. Materials and Methods: A class test of central nervous system unit was conducted in the Department of Pharmacology. This test comprised 50 MCQs/items and 150 distracters. A correct response to an item was awarded one mark with no negative marking for incorrect response. Each item was analyzed for the three parameters DIF I, DI, and DE. Results: DIF I of 38 (76%) items was in the acceptable range (P = 30–70%), 11 (22%) items were too easy (P > 70%), and 1 (2%) item was too difficult (P < 30%). DI of 31 (62%) items was excellent (d > 0.35), of 12 (24%) items was good (d = 0.20–0.34), and of 7 (14%) items was poor (d < 0.20). A total of 50 items had 150 distracters. Among these, 27 (18%) were nonfunctional distracters (NFDs) and 123 (82%) were functional distracters. Items with one NFD were 11 and with two NFDs were 8. Based on these parameters, 6 items were discarded, 17 were revised, and 27 were kept for subsequent use. Conclusion: Item analysis is a valuable tool as it helps us to retain the valuable MCQs and discard the items which are not useful. It also helps in increasing our skills in test construction and identifies the specific areas of course content which need greater emphasis or clarity. PMID:27563581
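
    The two item-level indices named in this abstract can be sketched as follows. The abstract does not state the formulas, so the conventional upper-group/lower-group definitions are assumed here, and the 27% group size, the function name and the sample data are hypothetical.

```python
def item_analysis(responses, item, group_frac=0.27):
    """Difficulty index (percent) and discrimination index for one item.

    responses[s][i] is 1 if student s answered item i correctly, else 0.
    group_frac is the share of students placed in each of the upper and
    lower groups (0.27 is a common convention, assumed here).
    """
    # Rank students by total score, highest first.
    ranked = sorted(responses, key=sum, reverse=True)
    n_group = max(1, round(len(ranked) * group_frac))
    upper = ranked[:n_group]
    lower = ranked[-n_group:]
    h = sum(student[item] for student in upper)  # correct in upper group
    l = sum(student[item] for student in lower)  # correct in lower group
    difficulty = 100.0 * (h + l) / (2 * n_group)
    discrimination = (h - l) / n_group
    return difficulty, discrimination

# Hypothetical class of six students answering two items.
scores = [[1, 1], [1, 0], [1, 1], [0, 1], [0, 0], [0, 1]]
dif_i, di = item_analysis(scores, item=0, group_frac=0.5)
```

    Under these definitions an item that only high scorers answer correctly has a discrimination index near 1, while an item answered equally often by both groups has an index near 0 and would fall into the "poor" band reported above.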

  20. Site response of heterogeneous natural deposits to harmonic excitation applied to more than 100 case histories

    Science.gov (United States)

    Chenari, Reza Jamshidi; Bostani Taleshani, Shirin Aminzadeh

    2016-06-01

    Variation of shear-wave propagation velocity (SWV) with depth was studied by analyzing more than one hundred actual SWV profiles. Linear, power, and hyperbolic variation schemes were investigated to find the most representative form for naturally occurred alluvial deposits. It was found that hyperbolic (asymptotic) variation dominates the majority of cases and it can be reliably implemented in analytical or analytical-numerical procedures. Site response analyses for a one-layer heterogeneous stratum were conducted to find an equivalent homogeneous alternative which simplifies the analysis procedure but does not compromise the accuracy of the resonance and amplification responses. Harmonic average, arithmetic average and mid-value equivalents are chosen from the literature for investigation. Furthermore, full and partial depth averaging schemes were evaluated and compared in order to verify the validity of current practices which rely upon averaging shallow depths, viz., the first 30 m of the strata. Engineering bedrock concept was discussed and the results were compared.
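
    The averaging schemes compared in this abstract can be illustrated on a layered profile. This is a sketch only: the study works with continuous velocity variation with depth, whereas the discretized layers, the function names and the example profile below are assumptions made for illustration.

```python
def harmonic_average(layers, depth=None):
    """Travel-time (harmonic) average shear-wave velocity in m/s.

    layers is a list of (thickness_m, velocity_m_per_s) from the surface
    down. If depth is given, only the top `depth` metres are averaged
    (partial depth averaging, e.g. the first 30 m).
    """
    total_h = 0.0
    travel_time = 0.0
    for h, v in layers:
        if depth is not None:
            h = min(h, depth - total_h)
            if h <= 0:
                break
        total_h += h
        travel_time += h / v
    return total_h / travel_time

def arithmetic_average(layers):
    """Thickness-weighted arithmetic mean velocity in m/s."""
    total_h = sum(h for h, _ in layers)
    return sum(h * v for h, v in layers) / total_h

# Hypothetical profile: velocity increasing with depth.
profile = [(10.0, 200.0), (20.0, 400.0), (30.0, 800.0)]
v_full = harmonic_average(profile)
v_30 = harmonic_average(profile, depth=30.0)
v_mean = arithmetic_average(profile)
```

    For a profile that stiffens with depth, the harmonic average is lower than the arithmetic one, and averaging only the shallow part lowers it further, which is the kind of difference the full versus partial depth comparison is meant to expose.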

  1. The Umbra Simulation and Integration Framework Applied to Emergency Response Training

    Science.gov (United States)

    Hamilton, Paul Lawrence; Britain, Robert

    2010-01-01

    The Mine Emergency Response Interactive Training Simulation (MERITS) is intended to prepare personnel to manage an emergency in an underground coal mine. The creation of an effective training environment required realistic emergent behavior in response to simulation events and trainee interventions, exploratory modification of miner behavior rules, realistic physics, and incorporation of legacy code. It also required the ability to add rich media to the simulation without conflicting with normal desktop security settings. Our Umbra Simulation and Integration Framework facilitated agent-based modeling of miners and rescuers and made it possible to work with subject matter experts to quickly adjust behavior through script editing, rather than through lengthy programming and recompilation. Integration of Umbra code with the WebKit browser engine allowed the use of JavaScript-enabled local web pages for media support. This project greatly extended the capabilities of Umbra in support of training simulations and has implications for simulations that combine human behavior, physics, and rich media.

  2. A new simulation method for turbines in wake - Applied to extreme response during operation

    DEFF Research Database (Denmark)

    Thomsen, K.; Aagaard Madsen, H.

    2005-01-01

    The work focuses on prediction of load response for wind turbines operating in wind farms using a newly developed aeroelastic simulation method. The traditionally used concept is to adjust the free flow turbulence intensity to account for increased loads in wind farms, a methodology that might be suitable for fatigue load simulation. For extreme response during operation the success of this simplified approach depends significantly on the physical mechanism causing the extremes. If the physical mechanism creating increased loads in wake operation is different from an increased turbulence intensity, the resulting extremes might be erroneous. For blade loads the traditionally used simplified approach works better than for integrated rotor loads, where the instantaneous load gradient across the rotor disc is causing the extreme loads. In the article the new wake simulation approach is illustrated...

  3. Applying Tep Measurements to Assess the Response of Hastelloy to Long Time Aging

    Science.gov (United States)

    Ifergane, S.; Gelbstein, Y.; Dahan, I.; Pinkas, M.; Landau, A.

    2009-03-01

    Hastelloy C-276 service temperature is restricted due to precipitation of the intermetallic compound μ. Time-temperature curves indicate that the highest precipitation rate is obtained at about 870° C. Thermoelectric Power (TEP) measurements were applied to monitor the precipitation kinetics during aging at 870° C. The TEP was found to be well correlated with the amount of μ phase formed during aging and with the reduction in impact energy and ductility. It was demonstrated that TEP measurements could be used to monitor aging of Hastelloy C-276.

  4. Extensions of the Ordered Response Model Applied to Consumer Valuation of New Products

    OpenAIRE

    Das, J.W.M.

    1995-01-01

    In an ordered response model the observed variable is based upon classifying an unobserved variable into one out of a finite number of intervals forming a dissection of the real line (cf. Amemiya, 1981). This model considers the boundaries of the intervals as (unknown) deterministic parameters, the same for every individual. Terza (1985) extended this through the relaxation of the assumed constancy of the boundaries: he allowed the boundaries to be a linear function of observed explanatory va...
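
    The model described here can be written out as a short worked formulation. The notation below is an assumption (the abstract gives none): a latent variable with a linear index, boundaries that cut the real line into J intervals, and boundaries that become linear functions of explanatory variables in the extension attributed to Terza (1985).

```latex
\begin{aligned}
y_i^{*} &= x_i^{\top}\beta + \varepsilon_i, \qquad \varepsilon_i \sim F, \\
y_i &= j \iff \mu_{j-1} < y_i^{*} \le \mu_j, \qquad j = 1,\dots,J, \quad \mu_0 = -\infty,\ \mu_J = +\infty, \\
\Pr(y_i = j \mid x_i) &= F\!\left(\mu_j - x_i^{\top}\beta\right) - F\!\left(\mu_{j-1} - x_i^{\top}\beta\right), \\
\mu_{ij} &= z_i^{\top}\gamma_j \qquad \text{(boundaries varying across individuals)}.
\end{aligned}
```

    In the basic model the boundaries are the same unknown constants for every individual; replacing them with the individual-specific form in the last line relaxes that constancy.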

  5. Improving humanitarian response through an innovative pre-positioning concept : an investigation of how commercial vessels can be used to store and transport relief items

    OpenAIRE

    Wilberg, Kristin Heien; Olafsen, Amund Leinaas

    2013-01-01

    Both the number of natural disasters and the people affected by these disasters have increased substantially during the recent decades. Not only is the frequency higher, but the complexity, severity and magnitude of natural disasters has also increased. This trend, combined with the limited amount of funding provided by donors, has created a critical need for improved humanitarian response systems. Even though logistics has evolved from being seen as a necessary expense to become an important...

  6. Measuring health status in British patients with rheumatoid arthritis: reliability, validity and responsiveness of the short form 36-item health survey (SF-36).

    Science.gov (United States)

    Ruta, D A; Hurst, N P; Kind, P; Hunter, M; Stubbings, A

    1998-04-01

    The objective was to assess the performance of the SF-36 health survey (SF-36) in a sample of patients with rheumatoid arthritis (RA) stratified by functional class. The eight SF-36 subscales and the two summary scales (the physical and mental component scales) were assessed for test-retest reliability, construct validity and responsiveness to self-reported change in health. In 233 patients with RA, the SF-36 scales were: reliable (intra-class correlation coefficients 0.76-0.93); correlated with American College of Rheumatology (ACR) core disease activity measures [Spearman r = -0.12 (erythrocyte sedimentation rate) to -0.89 (Modified Health Assessment Questionnaire)]; and responsive to improvements in health (standardized response means 0.27-0.9). The distribution of scores on four of the eight subscales (physical function, role limitations physical, role limitations emotional and social function) was clearly non-Gaussian. Very marked floor effects were noted with the physical function scale, and both ceiling and floor effects with the other three subscales. The two SF-36 physical and mental component summary scales are reliable, valid and responsive measures of health status in patients with RA. Six of the eight subscales meet standards required for comparing groups of patients, and the physical function and general health scales may be suitable for monitoring individuals. The two scales measuring role limitations have poor measurement characteristics. The SF-36 pain and physical function scales may be suitable for use as patient self-assessed measures of pain and physical function within the ACR core disease activity set. PMID:9619895
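
    The responsiveness statistic quoted in this abstract, the standardized response mean, is conventionally the mean change in score divided by the standard deviation of the change. The sketch below assumes that definition; the function name and the scores are hypothetical, not data from the study.

```python
from statistics import mean, stdev

def standardized_response_mean(baseline, follow_up):
    """Mean of the paired score changes divided by their sample SD."""
    changes = [after - before for before, after in zip(baseline, follow_up)]
    return mean(changes) / stdev(changes)

# Hypothetical scale scores for three patients at two visits.
before = [40.0, 50.0, 60.0]
after = [41.0, 52.0, 63.0]
srm = standardized_response_mean(before, after)
```

    Because the denominator is the variability of the change rather than of the baseline scores, a scale can show a large standardized response mean even when the average improvement is small, provided patients improve by similar amounts.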

  7. Enhancing Radiological Emergency Preparedness and Response in South East Asia Through Applied Training and Capability Development

    International Nuclear Information System (INIS)

    The potential malicious use of high activity radioactive sources remains a security concern for governments and the international community. The Code of Conduct on the Safety and Security of Radioactive Sources recognizes the importance of having in place the expertise, measures and tools to detect, respond to and mitigate the consequences of accidents or malicious acts involving radioactive sources. The Australian Regional Security of Radioactive Sources Project has collaborated with States in South East Asia to enhance their radiological emergency preparedness and response (EPR) capacity to security related incidents involving radioactive sources out of regulatory control. The aim of this collaboration is to improve and maintain the national core technical capabilities to enable an effective and safe response to any security related radiological incident. The main elements of this collaborative approach are: (a) identifying the priority areas for training through needs analysis; (b) strengthening individual professional expertise through a structured approach to training; and (c) enhancing individual agency and national nuclear and radiological EPR arrangements and capabilities. This collaboration has enhanced the sustainable development and implementation of South East Asian States’ national EPR capabilities and arrangements to ensure detection, response and mitigation measures are effective, systematic and well integrated within their national framework. (author)

  8. Counterfeit and Fraudulent Items - Mitigating the risk

    International Nuclear Information System (INIS)

    This presentation (slides) provides an overview of the industry's challenges and activities. Firstly, it outlines the differences between counterfeit, fraudulent, suspect, and also substandard items. Notice is given that items could be found not to meet the standard, but the difference in the intent to deceive with counterfeit and fraudulent items is the critical element. Examples from other industries are used which also rely heavily on the assurance of quality for safety. It also informs that EPRI has just completed a report in October 2009 in coordination with other US government agencies and industry organizations; this report, entitled Counterfeit, Substandard and Fraudulent Items, number 1019163, is available for free on the EPRI web site. As a follow-up to this report, EPRI is developing a CFSI Database; any country interested in a collaborative agreement is invited to use and contribute to the database information. Finally, it stresses the importance of the oversight of contractors, training to raise the awareness of the employees and the inspectors, and having a response plan for identified items

  9. Social Responsibility in Research Practice: Engaging applied scientists with the socio-ethical context of their work

    OpenAIRE

    Schuurbiers, D.

    2010-01-01

    How to encourage researchers to critically reflect on the ethical and social dimensions of their work? That is the central research question of this thesis. It starts from the assumption that the neutrality view of the social responsibility of the researcher – the view that researchers have no business with the social and ethical dimensions of their work – has become untenable, at least as far as applied sciences such as nano- and biotechnology are concerned. Instead, this thesis adopts a bro...

  10. Reorientation response of magnetic microspheres attached to gold electrodes under an applied magnetic field

    Energy Technology Data Exchange (ETDEWEB)

    De Los Santos Valladares, L.; Reeve, R.M.; Mitrelias, T.; Langford, R.M.; Barnes, C.H.W., E-mail: luis_d_v@hotmail.com [Cavendish Laboratory, Department of Physics, University of Cambridge Materials and Structures Laboratory (United Kingdom); Bustamante Dominguez, A. [Laboratorio de Ceramicos y Nanomateriales, Facultad de Ciencias Fisicas, Universidad Nacional Mayor de San Marcos, Lima (Peru); Aguiar, J. Albino [Universidade Federal de Pernambuco (UFPE), Recife, PE (Brazil). Departamento de Fisica; Azuma, Y. [Materials and Structures Laboratory, Tokyo Institute of Technology, Midori-ku, Yokohama (Japan); Majima, Y. [CREST, Japan Science and Technology Agency (JST), Midori-ku, Yokohama (Japan)

    2013-08-15

    In this work, we report the mechanical reorientation of thiolated ferromagnetic microspheres bridging a pair of gold electrodes under an external magnetic field. When an external magnetic field (7 kG) is applied during the measurement of the current-voltage characteristics of a carboxyl ferromagnetic microsphere (4 μm diameter) attached to two gold electrodes by self-assembled monolayers (SAMs) of octane dithiol (C₈H₁₈S₂), the current signal is distorted. Rather than due to magnetoresistance, this effect is caused by a mechanical reorientation of the ferromagnetic sphere, which alters the number of SAMs between the sphere and the electrodes and therefore affects conduction. To study the physical reorientation of the ferromagnetic particles, we measure their hysteresis loops while suspended in a liquid solution. (author)

  11. Response of Triatoma infestans to pour-on cypermethrin applied to chickens under laboratory conditions

    Directory of Open Access Journals (Sweden)

    Ivana Amelotti

    2009-05-01

    This article reports the effects of a pour-on formulation of cypermethrin (6% active ingredient) applied to chickens exposed to Triatoma infestans, the main vector of Chagas disease in rural houses of the Gran Chaco Region of South America. This study was designed as a completely random experiment with three experimental groups and five replicates. Third instar nymphs were fed on chickens treated with 0, 1 and 2 cc of the formulation. Nymphs were allowed to feed on the chickens at different time intervals after the insecticide application. Third-instar nymphs fed on treated chickens showed a higher mortality, took less blood during feeding and had a lower moulting rate. The mortality rate was highest seven days after the insecticide solution application and blood intake was affected until 30 days after the application of the solution.

  12. Individuality of Item Interpretation in Interchangeable ACL Scales

    Science.gov (United States)

    Fiske, Donald W.; Barack, Leonard I.

    1976-01-01

    The diversity among interpretations of single items in personality questionnaires has been noted previously. Using adjectives from the Adjective Check List (ACL), the study sought evidence bearing on these questions: Does such diversity make the responses to an item not comparable across subjects? If so, what are the implications for scores based…

  13. Differential Weighting of Items to Improve University Admission Test Validity

    OpenAIRE

    Eduardo Backhoff Escudero; Felipe Tirado Segura; Norma Larrazolo Reyna

    2001-01-01

    This paper gives an evaluation of different ways to increase university admission test criterion-related validity, by differentially weighting test items. We compared four methods of weighting multiple-choice items of the Basic Skills and Knowledge Examination (EXHCOBA): (1) punishing incorrect responses by a constant factor, (2) weighting incorrect responses, considering the levels of error, (3) weighting correct responses, considering the item’s difficulty, based on the Classic Measur...

  14. A Formulation of the Mantel-Haenszel Differential Item Functioning Parameter with Practical Implications. Statistical Report. LSAC Research Report Series.

    Science.gov (United States)

    Roussos, Louis A.; Schnipke, Deborah L.; Pashley, Peter J.

    The Mantel-Haenszel (MH) differential item functioning (DIF) parameter for uniform DIF is well defined when item responses follow the two-parameter-logistic (2PL) item response function (IRF), but not when they follow the three-parameter-logistic (3PL) IRF, the model typically used with multiple choice items. This research report presents a…
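
    For readers unfamiliar with the MH DIF statistic mentioned in this record, the following is a minimal sketch of the standard Mantel-Haenszel common odds ratio computed over matched score strata, together with the ETS delta-scale transform. It is not the formulation developed in the report; the function names and the counts are hypothetical and chosen only for illustration.

```python
import math


def mantel_haenszel_odds_ratio(strata):
    """Common odds ratio across matched total-score strata.

    Each stratum is a tuple (a, b, c, d):
      a = reference group correct,  b = reference group incorrect,
      c = focal group correct,      d = focal group incorrect.
    """
    num = 0.0
    den = 0.0
    for a, b, c, d in strata:
        n = a + b + c + d
        if n == 0:
            continue  # an empty stratum contributes nothing
        num += a * d / n
        den += b * c / n
    return num / den


def mh_d_dif(alpha):
    """ETS delta-scale transform of the MH common odds ratio."""
    return -2.35 * math.log(alpha)


# Hypothetical counts for three matched strata (illustration only).
strata = [(40, 10, 30, 20), (30, 20, 20, 30), (20, 30, 10, 40)]
alpha = mantel_haenszel_odds_ratio(strata)
delta = mh_d_dif(alpha)
```

    An odds ratio above 1 (a negative delta) indicates that, among examinees matched on total score, the item favors the reference group.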

  15. A price-responsive dispatching strategy for Vehicle-to-Grid: An economic evaluation applied to the case of Singapore

    Science.gov (United States)

    Pelzer, Dominik; Ciechanowicz, David; Aydt, Heiko; Knoll, Alois

    2014-06-01

    Employing electric vehicles as short-term energy storage could improve power system stability and at the same time create a new income source for vehicle owners. In this paper, the economic viability of this concept, referred to as Vehicle-to-Grid, is investigated. For this purpose, a price-responsive charging and dispatching strategy built upon temporally resolved electricity market data is presented. This concept allows vehicle owners to maximize returns by restricting market participation to profitable time periods. As a case study, this strategy is then applied using the example of Singapore. It is shown that an annual loss of S$ 1000 resulting from a non-price-responsive strategy as employed in previous works can be turned into a S$ 130 profit by applying the price-responsive approach. In addition to this scenario, realistic mobility patterns which restrict the temporal availability of vehicles are considered. In this case, profits in the range of S$ 21 to S$ 121 are achievable. Returns in this order of magnitude are not expected to make Vehicle-to-Grid a viable business case; sensitivity analyses, however, show that improved technical parameters could increase profitability. It is further assumed that applying the price-responsive strategy to other national markets may yield significantly greater returns.
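
    The idea of restricting market participation to profitable periods can be illustrated with a toy calculation. This is only a sketch under simplifying assumptions, not the strategy from the paper: the function name, the flat per-kWh cost (standing in for charging and battery wear), and all prices are hypothetical.

```python
def dispatch_profit(prices, energy_kwh, cost_per_kwh, price_responsive=True):
    """Total profit from selling energy_kwh in each market interval.

    prices          -- market price per kWh in each interval (hypothetical)
    cost_per_kwh    -- assumed flat cost of providing one kWh
    price_responsive -- if True, skip intervals where the margin is not positive
    """
    profit = 0.0
    for price in prices:
        margin = price - cost_per_kwh
        if price_responsive and margin <= 0:
            continue  # sit out unprofitable periods
        profit += margin * energy_kwh
    return profit


# Hypothetical interval prices and cost (illustration only).
prices = [0.10, 0.25, 0.08, 0.30]
always_on = dispatch_profit(prices, 2.0, 0.15, price_responsive=False)
responsive = dispatch_profit(prices, 2.0, 0.15, price_responsive=True)
```

    By construction, the price-responsive total can never be lower than the always-on total, because it drops only the intervals with a non-positive margin.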

  16. Monte Carlo simulation of the response functions of CdTe detectors to be applied in x-ray spectroscopy

    International Nuclear Information System (INIS)

    In this work, the energy response functions of a CdTe detector were obtained by Monte Carlo (MC) simulation in the energy range from 5 to 160 keV, using the PENELOPE code. In the response calculations the carrier transport features and the detector resolution were included. The computed energy response function was validated through comparison with experimental results obtained with 241Am and 152Eu sources. In order to investigate the influence of the correction by the detector response at diagnostic energy range, x-ray spectra were measured using a CdTe detector (model XR-100T, Amptek), and then corrected by the energy response of the detector using the stripping procedure. Results showed that the CdTe exhibits good energy response at low energies (below 40 keV), showing only small distortions on the measured spectra. For energies below about 80 keV, the contribution of the escape of Cd- and Te-K x-rays produces significant distortions on the measured x-ray spectra. For higher energies, the most important corrections are the detector efficiency and the carrier trapping effects. The results showed that, after correction by the energy response, the measured spectra are in good agreement with those provided by a theoretical model from the literature. Finally, our results showed that detailed knowledge of the response function and a proper correction procedure are fundamental for achieving more accurate spectra from which quality parameters (i.e., half-value layer and homogeneity coefficient) can be determined.

    Highlights:
    • The response function of a CdTe detector was determined by Monte Carlo simulation.
    • The simulation takes into account all interaction processes, the carrier transport and the Gaussian resolution.
    • The influence of different effects of spectral distortion was investigated.
    • The CdTe detector was applied for x-ray spectroscopy.
    • A proper correction procedure is needed to achieve realistic x-ray spectra.

  17. Dynamic response of a tunable phononic crystal under applied mechanical and magnetic loadings

    Science.gov (United States)

    Bayat, Alireza; Gordaninejad, Faramarz

    2015-06-01

    The dynamic response of a tunable phononic crystal consisting of a porous hyperelastic magnetoelastic elastomer subjected to a macroscopic deformation and an external magnetic field is theoretically investigated. Finite deformations and magnetic induction influence phononic characteristics of the periodic structure through geometrical pattern transformation and material properties. A magnetoelastic energy function is proposed to develop constitutive laws considering large deformations and magnetic induction in the periodic structure. Analytical and finite element methods are utilized to compute the dispersion relation and band structure of the phononic crystal for different cases of deformation and magnetic loadings. It is demonstrated that magnetic induction not only controls the band diagram of the structure but also has a strong effect on preferential directions of wave propagation.

  18. Modelling Vegetation Response to Climate Change in the Upper Danube Subcatchment applying a Biophysical Landsurface Model.

    Science.gov (United States)

    Hank, T.; Mauser, W.

    2009-04-01

    The manifold exchange processes that occur between landsurface and atmosphere are largely determined through the living vegetation cover that dynamically responds to atmospheric conditions such as humidity, temperature or the concentration of carbon dioxide respectively. When dealing with the mapping of biospheric feedbacks on changing climatic conditions, the numerical description of the involved processes represents a helpful tool and reliable instrument for the investigation of the dynamics that are part of these landsurface exchange cycles. A considerable number of current studies concentrates on the modelling of global dynamic reactions of the vegetation cover on changing atmospheric parameters. Nonetheless, questions concerning the regional effects of climate change are getting more and more important for stakeholders and decision makers worldwide. Within the scope of the GLOWA-Danube cooperative project, which is funded by the German Federal Ministry of Education and Research (BMB+F), the physically-based hydrological model PROMET (process of radiation mass and energy transfer) is applied to investigate the consequences of climate change on the regional scale. PROMET largely represents the landsurface component of the DANUBIA decision support system, which has been recently enhanced by an explicit model of photosynthesis. The assimilation model was combined with a model of stomatal conductance and the respective physiological submodels to enable a spatial modelling of active vegetation growth, taking the sensitivity of the photosynthetic apparatus with respect to changing atmospheric conditions into account. The combined model approach was applied to a set of climate scenarios, all tracing the characteristics of the moderate IPCC A1B scenario, but featuring different realizations of this storyline. The meteorology for the scenario runs was generated, using a stochastic method that is based on a statistical analysis and rearrangement of measured

  19. Auditory Musical Reasoning Test (RAu): an initial study with Item Response Theory

    Directory of Open Access Journals (Sweden)

    Fernando Pessotto

    2012-12-01

    This study investigated the internal structure and criterion validity of a test that aims at assessing auditory processing of musical ability (Auditory Musical Reasoning Test, RAu). A total of 162 people of both sexes were evaluated, 56.8% men, aged between 15 and 59 years (M=27.5; SD=9.01). Participants were divided among musicians (N=24), amateurs (N=62) and lay people (N=76) according to the extent of their knowledge of music. Full Information Item Factor Analysis verified the dimensionality of the instrument and also the properties of the items via Item Response Theory (IRT). Furthermore, we sought to identify the ability to discriminate between professional musicians, amateurs and lay people. Data showed evidence that the items measure a major dimension (alpha=.92) with high ability to differentiate groups of musicians, amateurs and lay people, giving a criterion validity coefficient of r=.68. The results indicate positive evidence of reliability and validity for the RAu test.

  20. A Comparison of Four Differential Item Functioning Procedures in the Presence of Multidimensionality

    Science.gov (United States)

    Bastug, Özlem Yesim Özbek

    2016-01-01

    Differential item functioning (DIF), or item bias, is a relatively new concept. It has been one of the most controversial and most studied subjects in measurement theory. DIF occurs when people who have the same ability level but come from different groups have a different probability of a correct response. According to Item Response Theory (IRT),…

  1. Response surface methodology applied to Supercritical Fluid Extraction (SFE) of carotenoids from Persimmon (Diospyros kaki L.).

    Science.gov (United States)

    Zaghdoudi, Khalil; Framboisier, Xavier; Frochot, Céline; Vanderesse, Régis; Barth, Danielle; Kalthoum-Cherif, Jamila; Blanchard, Fabrice; Guiavarc'h, Yann

    2016-10-01

    Supercritical carbon dioxide with ethanol as co-solvent was used to extract carotenoids from persimmon fruits (Diospyros kaki L.). Based on a response surface methodology (RSM), a predicting model describing the effects of CO2 temperature, pressure, flow rate, ethanol percentage and extraction time was set up for each of the four carotenoids of interest. The best extraction yields in our experimental domain were found at 300 bars, 60°C, 25% (w/w) ethanol, 3mL/min flow rate and 30min for xanthophylls (all-trans-lutein, all-trans-zeaxanthin and all-trans-β-cryptoxanthin). The yields were 15.46±0.56, 16.81±1.74 and 33.23±2.91μg/g of persimmon powder for all-trans-lutein, all-trans-zeaxanthin and all-trans-β-cryptoxanthin, respectively. As a non-oxygenated carotenoid, all-trans-β-carotene was better extracted using 100 bars, 40°C, 25% (w/w) ethanol, 1mL/min flow rate and 30min extraction time, with an extraction yield of 11.19±0.47μg/g of persimmon powder. PMID:27132842

  2. Comparative Study of Various E. coli Strains for Biohydrogen Production Applying Response Surface Methodology

    Directory of Open Access Journals (Sweden)

    Péter Bakonyi

    2012-01-01

    The proper strategy to establish efficient hydrogen-producing biosystems is the biochemical and physiological characterization of hydrogen-producing microbes, followed by metabolic engineering to give the strains extraordinary properties and, finally, bioprocess optimization to realize enhanced hydrogen fermentation capability. This paper aims to show the utility of both strain engineering and process optimization through a comparative study of wild-type and genetically modified E. coli strains. The effect of two major operational factors (substrate concentration and pH) on bioH2 production was investigated by experimental design, and response surface methodology (RSM) was used to determine suitable conditions for obtaining maximum yields. The results revealed that by employing the genetically engineered E. coli (DJT 135) strain under optimized conditions (pH: 6.5; formate conc.: 1.25 g/L), 0.63 mol H2/mol formate could be attained, which was 1.5 times higher than the wild-type E. coli (XL1-BLUE), which produced 0.42 mol H2/mol formate (pH: 6.4; formate conc.: 1.3 g/L).

  3. Computerized adaptive testing with item cloning

    NARCIS (Netherlands)

    Glas, Cees A.W.; Linden, van der Wim J.

    2003-01-01

    To increase the number of items available for adaptive testing and reduce the cost of item writing, the use of techniques of item cloning has been proposed. An important consequence of item cloning is possible variability between the item parameters. To deal with this variability, a multilevel item

  4. Selecting Items for Criterion-Referenced Tests.

    Science.gov (United States)

    Mellenbergh, Gideon J.; van der Linden, Wim J.

    1982-01-01

    Three item selection methods for criterion-referenced tests are examined: the classical theory of item difficulty and item-test correlation; the latent trait theory of item characteristic curves; and a decision-theoretic approach for optimal item selection. Item contribution to the standardized expected utility of mastery testing is discussed. (CM)
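
    The first of the three approaches, classical item difficulty and item-test correlation, can be sketched in a few lines. This is an illustrative computation only, not the selection procedure evaluated in the article; the function names and the small 0/1 response matrix are hypothetical, and the item-test correlation is computed against the rest score (total minus the item itself).

```python
from statistics import mean, pstdev


def item_difficulty(responses, item):
    """Classical difficulty: proportion of examinees answering the item correctly."""
    return mean(row[item] for row in responses)


def corrected_item_total_correlation(responses, item):
    """Pearson correlation between an item and the rest score (total minus item)."""
    x = [row[item] for row in responses]
    rest = [sum(row) - row[item] for row in responses]
    sx, sy = pstdev(x), pstdev(rest)
    if sx == 0 or sy == 0:
        return 0.0  # correlation is undefined without variance; report 0
    mx, my = mean(x), mean(rest)
    cov = mean((xi - mx) * (yi - my) for xi, yi in zip(x, rest))
    return cov / (sx * sy)


# Hypothetical scored responses: 6 examinees (rows) by 4 items (columns).
responses = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
    [1, 0, 1, 0],
]
difficulties = [item_difficulty(responses, j) for j in range(4)]
correlations = [corrected_item_total_correlation(responses, j) for j in range(4)]
```

    In classical item selection, items with extreme difficulty values or low item-test correlations are the usual candidates for removal.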

  5. Automatic Association of News Items.

    Science.gov (United States)

    Carrick, Christina; Watters, Carolyn

    1997-01-01

    Discussion of electronic news delivery systems and the automatic generation of electronic editions focuses on the association of related items of different media type, specifically photos and stories. The goal is to be able to determine to what degree any two news items refer to the same news event. (Author/LRW)

  6. Implementing and operating the Hanford Environmental Information System and applying it to the carbon tetrachloride expedited response action

    International Nuclear Information System (INIS)

    To manage waste and perform environmental monitoring and restoration at the 1450-square kilometer (560-square mile) Hanford Site in southeastern Washington State, vast amounts of scientific and technical data are being generated from sampling. This paper provides an overview of the Hanford Environmental Information System (HEIS), a computerized system designed and implemented to manage the Site's environmental sampling data, lessons learned from putting HEIS into operation, and how HEIS is being applied to the carbon tetrachloride expedited response action being performed at the Site

  7. Three controversies over item disclosure in medical licensure examinations.

    Science.gov (United States)

    Park, Yoon Soo; Yang, Eunbae B

    2015-01-01

    In response to views on the public's right to know, there is growing attention to item disclosure - release of items, answer keys, and performance data to the public - in medical licensure examinations and their potential impact on the test's ability to measure competence and select qualified candidates. Recent debates on this issue have sparked legislative action internationally, including South Korea, with prior discussions among North American countries dating back over three decades. The purpose of this study is to identify and analyze three issues associated with item disclosure in medical licensure examinations - 1) fairness and validity, 2) impact on passing levels, and 3) utility of item disclosure - by synthesizing existing literature in relation to standards in testing. Historically, the controversy over item disclosure has centered on fairness and validity. Proponents of item disclosure stress test takers' right to know, while opponents argue from a validity perspective. Item disclosure may bias item characteristics, such as difficulty and discrimination, and has consequences for setting passing levels. To date, there has been limited research on the utility of item disclosure for large scale testing. These issues require ongoing and careful consideration. PMID:26374693

  8. Three controversies over item disclosure in medical licensure examinations

    Directory of Open Access Journals (Sweden)

    Yoon Soo Park

    2015-09-01

    In response to views on the public's right to know, there is growing attention to item disclosure – release of items, answer keys, and performance data to the public – in medical licensure examinations and their potential impact on the test's ability to measure competence and select qualified candidates. Recent debates on this issue have sparked legislative action internationally, including South Korea, with prior discussions among North American countries dating back over three decades. The purpose of this study is to identify and analyze three issues associated with item disclosure in medical licensure examinations – 1) fairness and validity, 2) impact on passing levels, and 3) utility of item disclosure – by synthesizing existing literature in relation to standards in testing. Historically, the controversy over item disclosure has centered on fairness and validity. Proponents of item disclosure stress test takers’ right to know, while opponents argue from a validity perspective. Item disclosure may bias item characteristics, such as difficulty and discrimination, and has consequences for setting passing levels. To date, there has been limited research on the utility of item disclosure for large scale testing. These issues require ongoing and careful consideration.

  9. Item Analysis and Differential Item Functioning of a Brief Conduct Problem Screen

    Science.gov (United States)

    Wu, Johnny; King, Kevin M.; Witkiewitz, Katie; Racz, Sarah Jensen; McMahon, Robert J.

    2012-01-01

    Research has shown that boys display higher levels of childhood conduct problems than girls, and Black children display higher levels than White children, but few studies have tested for scalar equivalence of conduct problems across gender and race. The authors conducted a 2-parameter item response theory (IRT) model to examine item…

  10. The Influence of Item Formats when Locating a Student on a Learning Progression in Science

    Directory of Open Access Journals (Sweden)

    Jing Chen

    2016-07-01

    Learning progressions are used to describe how students’ understanding of a topic progresses over time. This study evaluates the effectiveness of different item formats for placing students into levels along a learning progression for carbon cycling. The item formats investigated were Constructed Response (CR) items and two types of two-tier items: (1) Ordered Multiple-Choice (OMC) followed by CR items and (2) Multiple True or False (MTF) followed by CR items. Our results suggest that estimates of students’ learning progression level based on OMC and MTF responses are moderately predictive of their level based on CR responses. With few exceptions, CR items were effective for differentiating students among learning progression levels. Based on the results, we discuss how to design and best use items in each format to more accurately measure students’ level along learning progressions in science.

  11. The 18 Household Food Security Survey items provide valid food security classifications for adults and children in the Caribbean

    OpenAIRE

    Nunes Cheryl; Gulliford Martin C; Rocke Brian

    2006-01-01

    Background: We tested the properties of the 18 Household Food Security Survey (HFSS) items, and the validity of the resulting food security classifications, in an English-speaking middle-income country. Methods: Survey of primary school children in Trinidad and Tobago. Parents completed the HFSS. Responses were analysed for the 10 adult-referenced items and the eight child-referenced items. Item response theory models were fitted. Item calibrations and subject scores from a one-paramet...

  12. An Empirical Investigation of Methods for Assessing Item Fit for Mixed Format Tests

    Science.gov (United States)

    Chon, Kyong Hee; Lee, Won-Chan; Ansley, Timothy N.

    2013-01-01

    Empirical information regarding performance of model-fit procedures has been a persistent need in measurement practice. Statistical procedures for evaluating item fit were applied to real test examples that consist of both dichotomously and polytomously scored items. The item fit statistics used in this study included the PARSCALE's G[squared],…

  13. Quantitative penetration testing with item response theory

    NARCIS (Netherlands)

    Arnold, Florian; Pieters, Wolter; Stoelinga, Mariëlle

    2014-01-01

    Existing penetration testing approaches assess the vulnerability of a system by determining whether certain attack paths are possible in practice. Thus, penetration testing has so far been used as a qualitative research method. To enable quantitative approaches to security risk management, including

  14. Item response times in computerized adaptive testing

    OpenAIRE

    LUTZ F. HORNKE

    2000-01-01

    Item response times in computerized adaptive tests. Computerized adaptive tests (CATs) provide both scores and item response times. Research on the additional meaning that can be obtained from the information contained in response times is of special interest. Data from 5912 young people who took a computerized adaptive test were available. Previous studies indicate longer response times when...

  15. Thirty years of nonparametric item response theory

    NARCIS (Netherlands)

    Molenaar, W.

    2001-01-01

    Relationships between a mathematical measurement model and its real-world applications are discussed. A distinction is made between large data matrices commonly found in educational measurement and smaller matrices found in attitude and personality measurement. Nonparametric methods are evaluated fo

  16. Quantitative penetration testing with item response theory

    NARCIS (Netherlands)

    Pieters, W.; Arnold, F.; Stoelinga, M.I.A.

    2013-01-01

    Existing penetration testing approaches assess the vulnerability of a system by determining whether certain attack paths are possible in practice. Therefore, penetration testing has thus far been used as a qualitative research method. To enable quantitative approaches to security risk management, in

  17. Nonparametric Cognitive Diagnosis: A Cluster Diagnostic Method Based on Grade Response Items

    Institute of Scientific and Technical Information of China (English)

    康春花; 任平; 曾平飞

    2015-01-01

    Building on the ideas of attribute sum scores and cluster analysis, this study proposes a cluster analysis method suitable for polytomously scored items, and examines how attribute hierarchy structure, sample size, and slip rate affect the method's classification accuracy. The findings were: (1) the method achieved high pattern-level and marginal classification accuracy across all experimental conditions; (2) classification accuracy did not depend on sample size, making the method suitable for small-scale assessments and classroom evaluation; (3) classification accuracy was only slightly affected by the tightness of the attribute hierarchy; (4) the method showed good internal and external validity in practical settings.

    Examinations help students learn more efficiently by filling their learning gaps. To achieve this goal, we have to differentiate, through cognitive diagnostic assessment, students who have mastered a set of attributes measured by the test from those who have not. K-means cluster analysis, being a nonparametric cognitive diagnosis method, requires only the Q-matrix, which reflects the relationship between attributes and items. It does not require parameter estimation, so it is independent of sample size, simple to operate, and easy to understand. Previous research used sum score vectors or capability score vectors as the clustering objects. These methods are suitable only for dichotomous data. Structural response items are, however, the main type used in examinations, particularly as required in recent reforms. On the basis of previous research, this paper puts forward a method to calculate a capability matrix that reflects the mastery level on skills and is applicable to grade response items. Our study included four parts. First, we introduced the K-means cluster diagnosis method, which has been adapted for dichotomous data. Second, we expanded the K-means cluster diagnosis method for grade response data (GRCDM). Third, we investigated the performance of the method introduced in Part Two using a simulation study. Fourth, we investigated the performance of the method in an empirical study. The simulation study focused on three factors. First, the sample size was

  18. The ITER 3D Magnetic Diagnostic Response to Applied n=3 and n=4 RMP's

    Energy Technology Data Exchange (ETDEWEB)

    Lazerson, S A [PPPL

    2014-09-01

    The ITER magnetic diagnostic response to applied n=3 and n=4 RMPs has been calculated for the 15MA scenario. The VMEC code was utilized to calculate free boundary 3D ideal MHD equilibria, where the non-stellarator symmetric terms were included in the calculation. This allows an assessment to be made of the possible boundary displacements due to RMP application in ITER. As the VMEC code assumes a continuous set of nested flux surfaces, the possibility of island and stochastic region formation is ignored. At the start of the current flat-top (L-mode), application of n = 4 RMPs indicates approximately 1 cm peak-to-peak displacements on the low field side of the plasma, while later in the shot (H-mode) perturbations as large as 3 cm are present. Forward modeling of the ITER magnetic diagnostics indicates significant non-axisymmetric plasma response, exceeding 10% of the axisymmetric signal in many of the flux loops. Magnetic field probes seem to indicate a greater robustness to 3D effects but still indicate large sensitivities to 3D effects in a number of sensors. Forward modeling of the diagnostics response to 3D equilibria allows assessment of diagnostics design and control scenarios.

  19. What's in a Topic? Exploring the Interaction between Test-Taker Age and Item Content in High-Stakes Testing

    Science.gov (United States)

    Banerjee, Jayanti; Papageorgiou, Spiros

    2016-01-01

    The research reported in this article investigates differential item functioning (DIF) in a listening comprehension test. The study explores the relationship between test-taker age and the items' language domains across multiple test forms. The data comprise test-taker responses (N = 2,861) to a total of 133 unique items, 46 items of which were…

  20. OSL and Tl response characterization of micro LiF:Mg, Ti dosimeters to be applied to VMAT quality assurance

    Energy Technology Data Exchange (ETDEWEB)

    Bravim, A.; Campos, L. L. [Instituto de Pesquisas Energeticas e Nucleares / CNEN, Av. Lineu Prestes 2242, Cidade Universitaria, 05508-000 Sao Paulo (Brazil); Sakuraba, R. K.; Da Cruz, J. C., E-mail: ambravim@hotmail.com [Sociedade Beneficente Israelita Brasileira - Hospital Albert Einstein, Av. Albert Einstein 627/701, Jardim Leonor, 05652-900 Sao Paulo (Brazil)

    2014-08-15

    VMAT Rapid Arc is a new treatment method that has changed radiotherapy practice, bringing benefits and allowing lower toxicity in the treatment of patients. With this treatment it is possible to minimize the radiation dose to healthy tissues and escalate the dose to the target volume (tumor) (Hall, 1998; Mundt, 2005; Bortfeld, 2006). Quality assurance is essential to verify the operation of all components involved in the process of treatment planning and dose delivery. Several organizations have recommended verifying the patient dose to improve quality in radiotherapy, with a recommended maximum dose uncertainty of ± 5% (ICRU, 1976; AAPM, 1983). This paper aims to evaluate the feasibility of applying LiF:Mg,Ti micro dosimeters as a new method of dosimetry for VMAT Rapid Arc. (Author)

  1. A Classical Test Theory Perspective on LSAT Local Item Dependence. LSAC Research Report Series. Statistical Report.

    Science.gov (United States)

    Reese, Lynda M.

    This study extended prior Law School Admission Council (LSAC) research related to the item response theory (IRT) local item independence assumption into the realm of classical test theory. Initially, results from the Law School Admission Test (LSAT) and two other tests were investigated to determine the approximate state of local item independence…

  2. Item Characteristic Curve Parameters: Effects of Sample Size on Linear Equating.

    Science.gov (United States)

    Ree, Malcom James; Jensen, Harald E.

    By means of computer simulation of test responses, the reliability of item analysis data and the accuracy of equating were examined for hypothetical samples of 250, 500, 1000, and 2000 subjects for two tests with 20 equating items plus 60 additional items on the same scale. Birnbaum's three-parameter logistic model was used for the simulation. The…
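
    Simulating test responses under a three-parameter logistic model, as this record describes, can be sketched as follows. This is a minimal illustration and not the authors' simulation: the function names, the standard normal ability distribution, the seed, and the item parameters are all assumptions, and the scaling constant D = 1.7 that is often included in this model is omitted.

```python
import math
import random


def p_3pl(theta, a, b, c):
    """Probability of a correct response under the three-parameter logistic model.

    a = discrimination, b = difficulty, c = lower asymptote (guessing).
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


def simulate_responses(n_examinees, items, seed=0):
    """Draw abilities from N(0, 1) (an assumption) and score each item 0 or 1."""
    rng = random.Random(seed)
    data = []
    for _ in range(n_examinees):
        theta = rng.gauss(0.0, 1.0)
        row = [1 if rng.random() < p_3pl(theta, a, b, c) else 0
               for a, b, c in items]
        data.append(row)
    return data


# Hypothetical item parameters (a, b, c) for a three-item test.
items = [(1.0, -1.0, 0.2), (1.2, 0.0, 0.2), (0.8, 1.0, 0.2)]
data = simulate_responses(250, items)
```

    Rerunning the simulation with sample sizes of 250, 500, 1000 and 2000 would mirror the design described in the abstract.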

  3. Curriculum and Translation Differential Item Functioning: A Comparison of Two DIF Detection Techniques.

    Science.gov (United States)

    Emenogu, Barnabas; Childs, Ruth A.

    This study investigated the possible impacts of language and curriculum differences on the performance of test items by subpopulations of students. Focusing on Measurement and Geometry items completed by students in French- and English-language schools in Ontario made it possible to explore the differences and to compare the item response theory…

  4. Detecting DIF in Polytomous Items Using MACS, IRT and Ordinal Logistic Regression

    Science.gov (United States)

    Elosua, Paula; Wells, Craig

    2013-01-01

    The purpose of the present study was to compare the Type I error rate and power of two model-based procedures, the mean and covariance structure model (MACS) and the item response theory (IRT), and an observed-score based procedure, ordinal logistic regression, for detecting differential item functioning (DIF) in polytomous items. A simulation…

  5. Bootstrap Standard Errors for Maximum Likelihood Ability Estimates When Item Parameters Are Unknown

    Science.gov (United States)

    Patton, Jeffrey M.; Cheng, Ying; Yuan, Ke-Hai; Diao, Qi

    2014-01-01

    When item parameter estimates are used to estimate the ability parameter in item response models, the standard error (SE) of the ability estimate must be corrected to reflect the error carried over from item calibration. For maximum likelihood (ML) ability estimates, a corrected asymptotic SE is available, but it requires a long test and the…

  6. Selection of unidimensional scales from a multidimensional item bank in the polytomous Mokken IRT model

    NARCIS (Netherlands)

    Hemker, BT; Sijtsma, Klaas; Molenaar, Ivo W

    1995-01-01

    An automated item selection procedure for selecting unidimensional scales of polytomous items from multidimensional datasets is developed for use in the context of the Mokken item response theory model of monotone homogeneity (Mokken & Lewis, 1982). The selection procedure is directly based on the s

  7. Item Vector Plots for the Multidimensional Three-Parameter Logistic Model

    Science.gov (United States)

    Bryant, Damon; Davis, Larry

    2011-01-01

    This brief technical note describes how to construct item vector plots for dichotomously scored items fitting the multidimensional three-parameter logistic model (M3PLM). As multidimensional item response theory (MIRT) shows promise of being a very useful framework in the test development life cycle, graphical tools that facilitate understanding…

  8. The Asymptotic Distribution of Ability Estimates: Beyond Dichotomous Items and Unidimensional IRT Models

    Science.gov (United States)

    Sinharay, Sandip

    2015-01-01

    The maximum likelihood estimate (MLE) of the ability parameter of an item response theory model with known item parameters was proved to be asymptotically normally distributed under a set of regularity conditions for tests involving dichotomous items and a unidimensional ability parameter (Klauer, 1990; Lord, 1983). This article first considers…

  9. Applying Physically Representative Watershed Modelling to Assess Peak and Low Flow Response to Timber Harvest: Application for Watershed Assessments

    Science.gov (United States)

    MacDonald, R. J.; Anderson, A.; Silins, U.; Craig, J. R.

    2014-12-01

    Forest harvesting, insects, disease, wildfire, and other disturbances can combine with climate change to cause unknown changes to the amount and timing of streamflow from critical forested watersheds. Southern Alberta forest and alpine areas provide downstream water supply for agriculture and water utilities that supply approximately two thirds of the Alberta population. This project uses datasets from intensely monitored study watersheds and hydrological model platforms to extend our understanding of how disturbances and climate change may impact various aspects of the streamflow regime that are of importance to downstream users. The objectives are 1) to use the model output of watershed response to disturbances to inform assessments of forested watersheds in the region, and 2) to investigate the use of a new flexible modelling platform as a tool for detailed watershed assessments and hypothesis testing. Here we applied the RAVEN hydrological modelling framework to quantify changes in key hydrological processes driving peak and low flows in a headwater catchment along the eastern slopes of the Canadian Rocky Mountains. The model was applied to simulate the period from 2006 to 2011 using data from the Star Creek watershed in southwestern Alberta. The representation of relevant hydrological processes was verified using snow survey, meteorological, and vegetation data collected through the Southern Rockies Watershed Project. Timber harvest scenarios were developed to estimate the effects of cut levels ranging from 20 to 100% over a range of elevations, slopes, and aspects. We quantified changes in the timing and magnitude of low flow and high flow events during the 2006 to 2011 period. Future work will assess changes in the probability of low and high flow events using a long-term meteorological record. This modelling framework enables relevant processes at the watershed scale to be accounted for in a physically robust and computationally efficient manner. Hydrologic

  10. Het nut van item respons theorie bij de constructie en evaluatie van niet-cognitieve instrumenten voor selectie en assessment binnen organisaties. : (The usefulness of item response theory for the construction and evaluation of noncognitive tests in personnel selection and assessment.)

    NARCIS (Netherlands)

    Egberink, Iris J. L.; Meijer, Rob R.

    2012-01-01

    In this article we discuss the use of IRT for the development and application of noncognitive measures in personnel selection and career development. We introduce the basic principles of IRT and we discuss the usefulness of IRT to evaluate the quality of items and tests to assess the measurement pre

  11. Relation between relative growth rate, endogenous gibberellins, and the response to applied gibberellic acid for Plantago major.

    Science.gov (United States)

    Dijkstra, P; Reegen, H; Kuiper, P J

    1990-08-01

    Relationships between relative growth rate (RGR), endogenous gibberellin (GA) concentration and the response to application of gibberellic acid (GA(3)) were studied for two inbred lines of Plantago major L., which differed in RGR. A4, the fast-growing inbred line, had a higher free GA concentration than the slow-growing W9, as analyzed by enzyme immunoassay. GA(3) application increased total plant weight and RGR(3) particularly for the slow-growing line. Chlorophyll a content and photosynthetic activity per unit leaf area were decreased, while transpiration rate was unaffected by GA(3) application. The increase in RGR by GA(3) application was associated with an increased leaf weight ratio; specific leaf area and percentage of dry matter in the leaves were only temporarily affected. Root respiration rate per unit dry weight was unaffected. The correlation between low RGR, low GA concentration and high responsiveness to applied GA(3) supports the contention that gibberellins are involved in the regulation of RGR. However, the transient influence of GA(3) application on some growth components suggests the involvement of other regulatory factors in addition to GA.

  12. Better assessment of physical function: item improvement is neglected but essential

    OpenAIRE

    Bruce, Bonnie; Fries, James F; Ambrosini, Debbie; Lingala, Bharathi; Gandek, Barbara; Rose, Matthias; Ware, John E.

    2009-01-01

    Introduction: Physical function is a key component of patient-reported outcome (PRO) assessment in rheumatology. Modern psychometric methods, such as Item Response Theory (IRT) and Computerized Adaptive Testing, can materially improve measurement precision at the item level. We present the qualitative and quantitative item-evaluation process for developing the Patient Reported Outcomes Measurement Information System (PROMIS) Physical Function item bank. Methods: The process was stepwise: we sea...

  13. Psychometric latent response models

    OpenAIRE

    Maris, E.

    1995-01-01

    In this paper, some psychometric models will be presented that belong to the larger class of latent response models (LRMs). First, LRMs are introduced by means of an application in the field of componential item response theory (Embretson, 1980, 1984). Second, a general definition of LRMs (not specific for the psychometric subclass) is given. Third, some more psychometric LRMs, and examples of how they can be applied, are presented. Fourth, a method for obtaining maximum likelihood (ML) and som...

  14. Estimating the Nominal Response Model Under Nonnormal Conditions

    OpenAIRE

    Preston, KSJ; Reise, SP

    2014-01-01

    The nominal response model (NRM), a much understudied polytomous item response theory (IRT) model, provides researchers the unique opportunity to evaluate within-item category distinctions. Polytomous IRT models, such as the NRM, are frequently applied to psychological assessments representing constructs that are unlikely to be normally distributed in the population. Unfortunately, models estimated using estimation software with the MML/EM algorithm frequently employ a set of normal quadratu...

  15. Increasing Active Student Responding in a University Applied Behavior Analysis Course: The Effect of Daily Assessment and Response Cards on End of Week Quiz Scores

    Science.gov (United States)

    Malanga, Paul R.; Sweeney, William J.

    2008-01-01

    The study compared the effects of daily assessment and response cards on average weekly quiz scores in an introduction to applied behavior analysis course. An alternating treatments design (Kazdin 1982, "Single-case research designs." New York: Oxford University Press; Cooper et al. 2007, "Applied behavior analysis." Upper Saddle River:…

  16. Item Generation for Test Development [Book Review].

    Science.gov (United States)

    Papanastasiou, Elena C.

    2003-01-01

    This volume, based on papers presented at a 1998 conference, collects thinking and research on item generation for test development. It includes materials on psychometric and cognitive theory, construct-oriented approaches to item generation, the item generation process, and some applications of item generative principles. (SLD)

  17. Item Type and Gender Differences on the Mental Rotations Test

    Science.gov (United States)

    Voyer, Daniel; Doyle, Randi A.

    2010-01-01

    This study investigated gender differences on the Mental Rotations Test (MRT) as a function of item and response types. Accordingly, 86 male and 109 female undergraduate students completed the MRT without time limits. Responses were coded as reflecting two correct (CC), one correct and one wrong (CW), two wrong (WW), one correct and one blank…

  18. MMPI-2 Item Endorsements in Dissociative Identity Disorder vs. Simulators.

    Science.gov (United States)

    Brand, Bethany L; Chasson, Gregory S; Palermo, Cori A; Donato, Frank M; Rhodes, Kyle P; Voorhees, Emily F

    2016-03-01

    Elevated scores on some MMPI-2 (Minnesota Multiphasic Personality Inventory-2) validity scales are common among patients with dissociative identity disorder (DID), which raises questions about the validity of their responses. Such patients show elevated scores on atypical answers (F), F-psychopathology (Fp), atypical answers in the second half of the test (FB), schizophrenia (Sc), and depression (D) scales, with Fp showing the greatest utility in distinguishing them from coached and uncoached DID simulators. In the current study, we investigated the items on the MMPI-2 F, Fp, FB, Sc, and D scales that were most and least commonly endorsed by participants with DID in our 2014 study and compared these responses with those of coached and uncoached DID simulators. The comparisons revealed that patients with DID most frequently endorsed items related to dissociation, trauma, depression, fearfulness, conflict within family, and self-destructiveness. The coached group more successfully imitated item endorsements of the DID group than did the uncoached group. However, both simulating groups, especially the uncoached group, frequently endorsed items that were uncommonly endorsed by the DID group. The uncoached group endorsed items consistent with popular media portrayals of people with DID being violent, delusional, and unlawful. These results suggest that item endorsement patterns can provide useful information to clinicians making determinations about whether an individual is presenting with DID or feigning. PMID:26944745

  19. Approximate Revenue Maximization with Multiple Items

    OpenAIRE

    Sergiu Hart; Noam Nisan

    2012-01-01

    Myerson's classic result provides a full description of how a seller can maximize revenue when selling a single item. We address the question of revenue maximization in the simplest possible multi-item setting: two items and a single buyer who has independently distributed values for the items, and an additive valuation. In general, the revenue achievable from selling two independent items may be strictly higher than the sum of the revenues obtainable by selling each of them separately. In fa...

  20. Construct, item, and method bias of cognitive and personality tests in South Africa

    Directory of Open Access Journals (Sweden)

    D. Meiring

    2005-10-01

    Bias was studied for two cognitive tests and a personality test at three levels: the construct underlying the test (“construct bias”), method-related aspects such as response sets (“method bias”), and the items (“item bias”). The sample consisted of 13 681 participants who had applied for entry-level jobs in the South African Police Service. The cognitive instruments produced very good construct equivalence and low item bias. However, various scales of the personality questionnaire revealed construct bias in various ethnic groups. The item bias in the personality scales was low. Method bias did not have any impact on the (small) size of the cross-cultural differences in the personality scales. In addition, several personality scales revealed low internal consistencies, notably in the black groups.

  1. Interpretation of differential item functioning analyses using external review

    DEFF Research Database (Denmark)

    Scott, Neil W; Fayers, Peter M; Aaronson, Neil K;

    2010-01-01

    Differential item functioning (DIF) analyses are used to determine whether certain groups respond differently to a particular item of a test or questionnaire; however, these do not explain the reasons for observed response differences. Many studies have used external reviews of items, sometimes using blinded reviewers, to help interpret these results. The authors conducted a literature review of this topic to describe the current usage of external reviews alongside DIF analyses. It concentrated on studies of health-related quality of life instruments, but studies in other fields were also considered. Relatively few examples of blinded item reviews were identified, and these were mostly from educational studies. A case study using blinded bilingual reviewers alongside translation DIF analyses of a health-related quality of life instrument is described. Future researchers should consider...

  2. Identifying predictors of physics item difficulty: A linear regression approach

    Science.gov (United States)

    Mesic, Vanes; Muratovic, Hasnija

    2011-06-01

    Large-scale assessments of student achievement in physics are often approached with an intention to discriminate students based on the attained level of their physics competencies. Therefore, for purposes of test design, it is important that items display an acceptable discriminatory behavior. To that end, it is recommended to avoid extraordinarily difficult and very easy items. Knowing the factors that influence physics item difficulty makes it possible to model the item difficulty even before the first pilot study is conducted. Thus, by identifying predictors of physics item difficulty, we can improve the test-design process. Furthermore, we get additional qualitative feedback regarding the basic aspects of student cognitive achievement in physics that are directly responsible for the obtained, quantitative test results. In this study, we conducted a secondary analysis of data that came from two large-scale assessments of student physics achievement at the end of compulsory education in Bosnia and Herzegovina. Foremost, we explored the concept of “physics competence” and performed a content analysis of 123 physics items that were included within the above-mentioned assessments. Thereafter, an item database was created. Items were described by variables which reflect some basic cognitive aspects of physics competence. For each of the assessments, Rasch item difficulties were calculated in separate analyses. In order to make the item difficulties from different assessments comparable, a virtual test equating procedure had to be implemented. Finally, a regression model of physics item difficulty was created. It has been shown that 61.2% of item difficulty variance can be explained by factors which reflect the automaticity, complexity, and modality of the knowledge structure that is relevant for generating the most probable correct solution, as well as by the divergence of required thinking and interference effects between intuitive and formal physics knowledge

  3. Identifying predictors of physics item difficulty: A linear regression approach

    Directory of Open Access Journals (Sweden)

    Hasnija Muratovic

    2011-06-01

    Large-scale assessments of student achievement in physics are often approached with an intention to discriminate students based on the attained level of their physics competencies. Therefore, for purposes of test design, it is important that items display an acceptable discriminatory behavior. To that end, it is recommended to avoid extraordinarily difficult and very easy items. Knowing the factors that influence physics item difficulty makes it possible to model the item difficulty even before the first pilot study is conducted. Thus, by identifying predictors of physics item difficulty, we can improve the test-design process. Furthermore, we get additional qualitative feedback regarding the basic aspects of student cognitive achievement in physics that are directly responsible for the obtained, quantitative test results. In this study, we conducted a secondary analysis of data that came from two large-scale assessments of student physics achievement at the end of compulsory education in Bosnia and Herzegovina. Foremost, we explored the concept of “physics competence” and performed a content analysis of 123 physics items that were included within the above-mentioned assessments. Thereafter, an item database was created. Items were described by variables which reflect some basic cognitive aspects of physics competence. For each of the assessments, Rasch item difficulties were calculated in separate analyses. In order to make the item difficulties from different assessments comparable, a virtual test equating procedure had to be implemented. Finally, a regression model of physics item difficulty was created. It has been shown that 61.2% of item difficulty variance can be explained by factors which reflect the automaticity, complexity, and modality of the knowledge structure that is relevant for generating the most probable correct solution, as well as by the divergence of required thinking and interference effects between intuitive and formal

  4. Development and community-based validation of eight item banks to assess mental health.

    Science.gov (United States)

    Batterham, Philip J; Sunderland, Matthew; Carragher, Natacha; Calear, Alison L

    2016-09-30

    There is a need for precise but brief screening of mental health problems in a range of settings. The development of item banks to assess depression and anxiety has resulted in new adaptive and static screeners that accurately assess severity of symptoms. However, expansion to a wider array of mental health problems is required. The current study developed item banks for eight mental health problems: social anxiety disorder, panic disorder, post-traumatic stress disorder, obsessive-compulsive disorder, adult attention-deficit hyperactivity disorder, drug use, psychosis and suicidality. The item banks were calibrated in a population-based Australian adult sample (N=3175) by administering large item pools (45-75 items) and excluding items on the basis of local dependence or measurement non-invariance. Item Response Theory parameters were estimated for each item bank using a two-parameter graded response model. Each bank consisted of 19-47 items, demonstrating excellent fit and precision across a range of -1 to 3 standard deviations from the mean. No previous study has developed such a broad range of mental health item banks. The calibrated item banks will form the basis of a new system of static and adaptive measures to screen for a broad array of mental health problems in the community. PMID:27500552
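    To illustrate what a two-parameter graded response model computes for a single item, the following minimal sketch evaluates the category probabilities in the logistic form usually attributed to Samejima. The function name, the discrimination value, and the thresholds are hypothetical and are not taken from the calibrated item banks described above.

```python
import math

def grm_category_probs(theta, a, thresholds):
    """Category probabilities for one item under a graded response model.

    theta: latent trait value of the respondent
    a: item discrimination (assumed value)
    thresholds: ordered category boundaries b_1 < ... < b_{K-1} (assumed values)
    """
    # Cumulative probabilities P(X >= k) for k = 1..K-1, logistic form
    cum = [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
    # Pad with P(X >= 0) = 1 and P(X >= K) = 0, then take adjacent differences
    bounds = [1.0] + cum + [0.0]
    return [bounds[k] - bounds[k + 1] for k in range(len(bounds) - 1)]

# Hypothetical four-category item answered by a respondent at theta = 0.5
probs = grm_category_probs(theta=0.5, a=1.8, thresholds=[-1.0, 0.0, 1.2])
```

    The returned list has one probability per response category and sums to 1; ordered thresholds keep every category probability positive.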

  5. Detecting Gender Bias Through Test Item Analysis

    Science.gov (United States)

    González-Espada, Wilson J.

    2009-03-01

    Many physical science and physics instructors might not be trained in pedagogically appropriate test construction methods. This could lead to test items that do not measure what they are intended to measure. A subgroup of these items might show bias against some groups of students. This paper describes how the author became aware of potentially biased items against females in his examinations, which led to the exploration of fundamental issues related to item validity, gender bias, and differential item functioning, or DIF. A brief discussion of DIF in the context of university courses, as well as practical suggestions to detect possible gender-biased items, follows.

  6. Prediction of true test scores from observed item scores and ancillary data.

    Science.gov (United States)

    Haberman, Shelby J; Yao, Lili; Sinharay, Sandip

    2015-05-01

    In many educational tests which involve constructed responses, a traditional test score is obtained by adding together item scores obtained through holistic scoring by trained human raters. For example, this practice was used until 2008 in the case of GRE® General Analytical Writing and until 2009 in the case of TOEFL® iBT Writing. With use of natural language processing, it is possible to obtain additional information concerning item responses from computer programs such as e-rater®. In addition, available information relevant to examinee performance may include scores on related tests. We suggest application of standard results from classical test theory to the available data to obtain best linear predictors of true traditional test scores. In performing such analysis, we require estimation of variances and covariances of measurement errors, a task which can be quite difficult in the case of tests with limited numbers of items and with multiple measurements per item. As a consequence, a new estimation method is suggested based on samples of examinees who have taken an assessment more than once. Such samples are typically not random samples of the general population of examinees, so that we apply statistical adjustment methods to obtain the needed estimated variances and covariances of measurement errors. To examine practical implications of the suggested methods of analysis, applications are made to GRE General Analytical Writing and TOEFL iBT Writing. Results obtained indicate that substantial improvements are possible both in terms of reliability of scoring and in terms of assessment reliability.

  7. Selection of Common Items as an Unrecognized Source of Variability in Test Equating: A Bootstrap Approximation Assuming Random Sampling of Common Items

    Science.gov (United States)

    Michaelides, Michalis P.; Haertel, Edward H.

    2014-01-01

    The standard error of equating quantifies the variability in the estimation of an equating function. Because common items for deriving equated scores are treated as fixed, the only source of variability typically considered arises from the estimation of common-item parameters from responses of samples of examinees. Use of alternative, equally…

  8. Development of the Assessment Items of Debris Flow Using the Delphi Method

    Science.gov (United States)

    Byun, Yosep; Seong, Joohyun; Kim, Mingi; Park, Kyunghan; Yoon, Hyungkoo

    2016-04-01

    In recent years, typhoons and localized extreme rainfall caused by abnormal climate conditions have increased in Korea. Accordingly, debris flow is becoming one of the most dangerous natural disasters. This study aimed to develop assessment items that can be used for damage investigations of debris flow. The Delphi method was applied to classify the realms of the assessment items. As a result, 29 assessment items, classified into 6 groups, were determined.

  9. The Relationship between Item Context Characteristics and Student Performance: The Case of the 2006 and 2009 PISA Science Items

    Science.gov (United States)

    Ruiz-Primo, Maria Araceli; Li, Min

    2015-01-01

    Background: A long-standing premise in test design is that contextualizing test items makes them concrete, less demanding, and more conducive to determining whether students can apply or transfer their knowledge. Purpose: We assert that despite decades of study and experience, much remains to be learned about how to construct effective and fair…

  10. Decoding dynamic brain patterns from evoked responses: A tutorial on multivariate pattern analysis applied to time-series neuroimaging data

    OpenAIRE

    Grootswagers, Tijl; Wardle, Susan G.; Carlson, Thomas A.

    2016-01-01

    Multivariate pattern analysis (MVPA) or brain decoding methods have become standard practice in analysing fMRI data. Although decoding methods have been extensively applied in Brain Computing Interfaces (BCI), these methods have only recently been applied to time-series neuroimaging data such as MEG and EEG to address experimental questions in Cognitive Neuroscience. In a tutorial-style review, we describe a broad set of options to inform future time-series decoding studies from a Cognitive N...

  11. Item Overlap Correlations: Definitions, Interpretations, and Implications.

    Science.gov (United States)

    Hsu, Louis M.

    1994-01-01

    Item overlap coefficient (IOC) formulas are discussed, providing six warnings about their calculation and interpretation and some explanations of why item overlap influences the Minnesota Multiphasic Personality Inventory and the Millon Clinical Multiaxial Inventory factor structures. (SLD)

  12. Effects of statistical models and items difficulties on making trait-level inferences: A simulation study

    Directory of Open Access Journals (Sweden)

    Nelson Hauck Filho

    2014-12-01

    Researchers dealing with the task of estimating locations of individuals on continuous latent variables may rely on several statistical models described in the literature. However, weighing costs and benefits of using one specific model over alternative models depends on empirical information that is not always clearly available. Therefore, the aim of this simulation study was to compare the performance of seven popular statistical models in providing adequate latent trait estimates in conditions of items difficulties targeted at the sample mean or at the tails of the latent trait distribution. Results suggested an overall tendency of models to provide more accurate estimates of true latent scores when using items targeted at the sample mean of the latent trait distribution. Rating Scale Model, Graded Response Model, and Weighted Least Squares Mean- and Variance-adjusted Confirmatory Factor Analysis yielded the most reliable latent trait estimates, even when applied to inadequate items for the sample distribution of the latent variable. These findings have important implications concerning some popular methodological practices in Psychology and related areas.

  13. Monte Carlo simulation of the response functions of CdTe detectors to be applied in X-rays spectroscopy

    Energy Technology Data Exchange (ETDEWEB)

    Tomal, A. [Universidade Federale de Goias, Instituto de Fisica, Campus Samambaia, 74001-970, Goiania, (Brazil); Lopez G, A. H.; Santos, J. C.; Costa, P. R., E-mail: alessandra_tomal@yahoo.com.br [Universidade de Sao Paulo, Instituto de Fisica, Rua du Matao Travessa R. 187, Cidade Universitaria, 05508-090 Sao Paulo (Brazil)

    2014-08-15

    In this work, the energy response functions of a CdTe detector were obtained by Monte Carlo simulation in the energy range from 5 to 150 keV, using the Penelope code. The simulated response functions included the finite detector resolution and the carrier transport. The simulated energy response matrix was validated through comparison with experimental results obtained for radioactive sources. In order to investigate the influence of the correction by the detector response in the diagnostic energy range, x-ray spectra were measured using a CdTe detector (model Xr-100-T, Amptek), and then corrected by the energy response of the detector using the stripping procedure. Results showed that the CdTe detector exhibits good energy response at low energies (below 40 keV), showing only small distortions on the measured spectra. For energies below about 70 keV, the contribution of the escape of Cd- and Te-K x-rays produces significant distortions on the measured x-ray spectra. For higher energies, the most important corrections are the detector efficiency and the carrier trapping effects. The results showed that, after correction by the energy response, the measured spectra are in good agreement with those provided by different models from the literature. Finally, our results showed that detailed knowledge of the response function and a proper correction procedure are fundamental for achieving more accurate spectra from which several quality parameters (i.e. half-value layer, effective energy and mean energy) can be determined. (Author)

  14. Applying the Reader-Response Theory to Literary Texts in EFL-Pre-Service Teachers' Initial Education

    Science.gov (United States)

    Garzón, Eliana; Castañeda-Peña, Harold

    2015-01-01

    This article presents the pedagogical implementation of the reader-response theory in a class of English as a foreign language with language pre-service teachers as they experience the reading of two short stories. The research took place over a 16 week period in which students kept a portfolio of their written responses to the stories.…

  15. Investigating factors affecting students’ performance to PISA Science items

    Directory of Open Access Journals (Sweden)

    V. Hatzinikita

    2008-01-01

    The present paper aims to investigate, on the one hand, the extent to which PISA Science items validly assess the knowledge and skills of 15 year-old Greek students, while, on the other hand, to examine the effect of the following factors: student’s gender, scientific processes and contexts (situations) on the students’ performance in these PISA items. The research used a paper-and-pencil test with published PISA Science items, conducted individual semi-structured interviews with 15 year-old students and finally marked the students’ responses according to the PISA marking guide. The basic finding resulting from the data analysis is that the paper-and-pencil test with the PISA Science items does not tend, unlike the interview, to effectively record the Greek students’ Science knowledge and skills. Moreover, the analysis revealed that the performance of students in the PISA Science items (paper-and-pencil test and interview) tends to be independent of the student’s gender and to depend on the context in which the knowledge and processes are assessed. Additionally, the possible correlation between the students’ performance and the factor of scientific processes seems to depend on the setting in which the students provide their responses (paper-and-pencil test or interview).

  16. Item Analysis in Introductory Economics Testing.

    Science.gov (United States)

    Tinari, Frank D.

    1979-01-01

    Computerized analysis of multiple choice test items is explained. Examples of item analysis applications in the introductory economics course are discussed with respect to three objectives: to evaluate learning; to improve test items; and to help improve classroom instruction. Problems, costs and benefits of the procedures are identified. (JMD)

  17. Real and Artificial Differential Item Functioning

    Science.gov (United States)

    Andrich, David; Hagquist, Curt

    2012-01-01

    The literature in modern test theory on procedures for identifying items with differential item functioning (DIF) among two groups of persons includes the Mantel-Haenszel (MH) procedure. Generally, it is not recognized explicitly that if there is real DIF in some items which favor one group, then as an artifact of this procedure, artificial DIF…
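    For readers unfamiliar with the Mantel-Haenszel procedure named above, the following minimal sketch computes the MH common odds ratio for one studied item across matched score levels, together with the ETS delta-scale transformation (MH D-DIF = -2.35 ln alpha). The function name and all counts are hypothetical; the sketch does not reproduce the authors' analysis of artificial DIF.

```python
import math

def mantel_haenszel_odds_ratio(strata):
    """Common odds ratio across matched score levels for one studied item.

    strata: list of (A, B, C, D) counts per score level, where
    A/B = reference group correct/incorrect and
    C/D = focal group correct/incorrect (all counts are made up).
    """
    num = 0.0
    den = 0.0
    for a, b, c, d in strata:
        n = a + b + c + d
        if n == 0:
            continue  # skip empty score levels
        num += a * d / n
        den += b * c / n
    return num / den

# Hypothetical 2x2 tables at three total-score levels
strata = [(20, 30, 15, 35), (40, 20, 30, 30), (45, 5, 40, 10)]
alpha_mh = mantel_haenszel_odds_ratio(strata)
# ETS delta metric: negative values indicate the item favors the reference group
mh_d_dif = -2.35 * math.log(alpha_mh)
```

    An odds ratio near 1 (MH D-DIF near 0) indicates no detectable DIF for the item after matching on total score.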

  18. An Investigation of Item Fit Statistics for Mixed IRT Models

    Science.gov (United States)

    Chon, Kyong Hee

    2009-01-01

    The purpose of this study was to investigate procedures for assessing model fit of IRT models for mixed format data. In this study, various IRT model combinations were fitted to data containing both dichotomous and polytomous item responses, and the suitability of the chosen model mixtures was evaluated based on a number of model fit procedures.…

  19. A Comparison of Item Fit Statistics for Mixed IRT Models

    Science.gov (United States)

    Chon, Kyong Hee; Lee, Won-Chan; Dunbar, Stephen B.

    2010-01-01

    In this study we examined procedures for assessing model-data fit of item response theory (IRT) models for mixed format data. The model fit indices used in this study include PARSCALE's G², Orlando and Thissen's S-X² and S-G², and Stone's χ²* and G²*. To investigate the…

  20. IRT-Estimated Reliability for Tests Containing Mixed Item Formats

    Science.gov (United States)

    Shu, Lianghua; Schwarz, Richard D.

    2014-01-01

    As a global measure of precision, item response theory (IRT) estimated reliability is derived for four coefficients (Cronbach's α, Feldt-Raju, stratified α, and marginal reliability). Models with different underlying assumptions concerning test-part similarity are discussed. A detailed computational example is presented for the targeted…

  1. Design patterns for digital item types in higher education

    NARCIS (Netherlands)

    Draaijer, S.; Hartog, R.J.M.

    2007-01-01

    A set of design patterns for digital item types has been developed in response to challenges identified in various projects by teachers in higher education. The goal of the projects in question was to design and develop formative and summative tests, and to develop interactive learning material in t

  2. The Application of Strength of Association Statistics to the Item Analysis of an In-Training Examination in Diagnostic Radiology.

    Science.gov (United States)

    Diamond, James J.; McCormick, Janet

    1986-01-01

    Using item responses from an in-training examination in diagnostic radiology, the application of a strength of association statistic to the general problem of item analysis is illustrated. Criteria for item selection, general issues of reliability, and error of measurement are discussed. (Author/LMO)

  3. The Impact of Test Dimensionality, Common-Item Set Format, and Scale Linking Methods on Mixed-Format Test Equating

    Science.gov (United States)

    Öztürk-Gübes, Nese; Kelecioglu, Hülya

    2016-01-01

    The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…

  4. 48 CFR 52.223-9 - Estimate of Percentage of Recovered Material Content for EPA-Designated Items.

    Science.gov (United States)

    2010-10-01

    ... recovered material content for EPA-designated item(s) delivered and/or used in contract performance... responsible for the performance of this contract and hereby certify that the percentage of recovered material content for EPA-designated items met the applicable contract specifications or other...

  5. A new method for synthesizing radiation dose-response data from multiple trials applied to prostate cancer

    DEFF Research Database (Denmark)

    Diez, Patricia; Vogelius, Ivan S; Bentzen, Søren M

    2010-01-01

    A new method is presented for synthesizing dose-response data for biochemical control of prostate cancer according to study design (randomized vs. nonrandomized) and risk group (low vs. intermediate-high)....

  6. Monte Carlo simulation of the response functions of CdTe detectors to be applied in x-ray spectroscopy.

    Science.gov (United States)

    Tomal, A; Santos, J C; Costa, P R; Lopez Gonzales, A H; Poletti, M E

    2015-06-01

    In this work, the energy response functions of a CdTe detector were obtained by Monte Carlo (MC) simulation in the energy range from 5 to 160 keV, using the PENELOPE code. The carrier transport features and the detector resolution were included in the response calculations. The computed energy response function was validated through comparison with experimental results obtained with ²⁴¹Am and ¹⁵²Eu sources. In order to investigate the influence of the correction by the detector response in the diagnostic energy range, x-ray spectra were measured using a CdTe detector (model XR-100T, Amptek), and then corrected by the energy response of the detector using the stripping procedure. Results showed that the CdTe detector exhibits good energy response at low energies (below 40 keV), showing only small distortions in the measured spectra. For energies below about 80 keV, the escape of Cd- and Te-K x-rays produces significant distortions in the measured x-ray spectra. For higher energies, the most important corrections are the detector efficiency and the carrier trapping effects. The results showed that, after correction by the energy response, the measured spectra are in good agreement with those provided by a theoretical model from the literature. Finally, our results showed that detailed knowledge of the response function and a proper correction procedure are fundamental for achieving more accurate spectra from which quality parameters (i.e., half-value layer and homogeneity coefficient) can be determined. PMID:25599872

  7. Applications of NLP Techniques to Computer-Assisted Authoring of Test Items for Elementary Chinese

    Science.gov (United States)

    Liu, Chao-Lin; Lin, Jen-Hsiang; Wang, Yu-Chun

    2010-01-01

    The authors report an implemented environment for computer-assisted authoring of test items and provide a brief discussion about the applications of NLP techniques for computer assisted language learning. Test items can serve as a tool for language learners to examine their competence in the target language. The authors apply techniques for…

  8. Are Faculty Predictions or Item Taxonomies Useful for Estimating the Outcome of Multiple-Choice Examinations?

    Science.gov (United States)

    Kibble, Jonathan D.; Johnson, Teresa

    2011-01-01

    The purpose of this study was to evaluate whether multiple-choice item difficulty could be predicted either by a subjective judgment by the question author or by applying a learning taxonomy to the items. Eight physiology faculty members teaching an upper-level undergraduate human physiology course consented to participate in the study. The…

  9. 32 CFR 507.17 - Procurement and wear of heraldic items.

    Science.gov (United States)

    2010-07-01

    ... 32 National Defense 3 2010-07-01 2010-07-01 true Procurement and wear of heraldic items. 507.17... AUTHORITIES AND PUBLIC RELATIONS MANUFACTURE AND SALE OF DECORATIONS, MEDALS, BADGES, INSIGNIA, COMMERCIAL USE... Procurement and wear of heraldic items. (a) The provisions of this part do not apply to contracts awarded...

  10. A Method on the Item Investment Risk Interval Decision-making of Processing Ranking Style

    Institute of Scientific and Technical Information of China (English)

    CHEN Li-wen

    2002-01-01

    In this paper, on the basis of the defects of risk-type and uncertainty-type decisions, the concept of the item investment probability scheduling type of decision is given, and a linear programming model and its solution are worked out. The feasibility of a probability scheduling type of item investment plan is studied by applying the properties of interval arithmetic.

  11. 78 FR 26790 - Summary of Responses To Request for Information (RFI): Opportunities To Apply a Department of...

    Science.gov (United States)

    2013-05-08

    ... content. Respondents also recommended that the TXT4Tots messages be linked to additional sources of... Information (RFI): Opportunities To Apply a Department of Health and Human Services Message Library To Advance...-based messages on nutrition and physical activity targeted to parents, caregivers, and health...

  12. Expected Equating Error Resulting from Incorrect Handling of Item Parameter Drift among the Common Items

    Science.gov (United States)

    Miller, G. Edward; Fitzpatrick, Steven J.

    2009-01-01

    Incorrect handling of item parameter drift during the equating process can result in equating error. If the item parameter drift is due to construct-irrelevant factors, then inclusion of these items in the estimation of the equating constants can be expected to result in equating error. On the other hand, if the item parameter drift is related to…

  13. Instructional Topics in Educational Measurement (ITEMS) Module: Using Automated Processes to Generate Test Items

    Science.gov (United States)

    Gierl, Mark J.; Lai, Hollis

    2013-01-01

    Changes to the design and development of our educational assessments are resulting in the unprecedented demand for a large and continuous supply of content-specific test items. One way to address this growing demand is with automatic item generation (AIG). AIG is the process of using item models to generate test items with the aid of computer…

  14. Item Modeling Concept Based on Multimedia Authoring

    Directory of Open Access Journals (Sweden)

    Janez Stergar

    2008-09-01

    In this paper a modern item design framework for computer-based assessment, built on the Flash authoring environment, is introduced. Question design is discussed, with emphasis on the multimedia authoring environment used for item modeling. Item type templates are a structured means of collecting and storing item information that can be used to improve the efficiency and security of the innovative item design process. Templates can modernize the item design and enhance and speed up the development process. Along with content creation, multimedia has vast potential for use in innovative testing. The introduced item design template is based on a taxonomy of innovative items, which have great potential for expanding the content areas and construct coverage of an assessment. The presented item design approach is based on two GUIs: one for question design based on the implemented item design templates and one for user interaction tracking/retrieval. The concept of user interfaces based on Flash technology is discussed, as well as the implementation of the item design forms with multimedia authoring. An innovative method for user interaction storage/retrieval, based on PHP extending Flash capabilities in the proposed framework, is also introduced.

  15. Improved Maximal Length Frequent Item Set Mining

    Directory of Open Access Journals (Sweden)

    P.C.S.Nagendra setty

    2012-09-01

    Association rule mining is one of the most important techniques in data mining, with a wide range of applications. It aims at searching for interesting relationships among items in large data sets and discovers association rules. The importance of association rule mining is increasing with the demand for finding frequent patterns in large data sources. The exploitation of frequent item sets has been restricted by the large number of generated frequent item sets and the high computational cost in real-world applications. To avoid these problems, maximum length frequent item sets can be used in generating association rules. The maximum length frequent item sets can be efficiently discovered on very large data sets. Existing research offers the LFIMiner and MaxLFI algorithms for generating maximum length frequent item sets. Here we propose a new algorithm called FPMAX for generating maximum length frequent item sets that uses a lattice graph data structure.
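
    To make the notion of a maximum length frequent item set concrete, the following is a minimal brute-force sketch. It is not the FPMAX, LFIMiner or MaxLFI algorithm named in the abstract, and it is only practical for very small data; the function name, the absolute support threshold and the toy transactions are assumptions introduced for illustration.

```python
from itertools import combinations


def max_length_frequent_itemsets(transactions, min_support):
    """Brute-force search (illustrative only, not FPMAX/LFIMiner/MaxLFI).

    Returns (k, itemsets): the largest size k for which some item set
    occurs in at least `min_support` transactions, together with all
    frequent item sets of that size. Returns (0, []) if none exists.
    """
    transactions = [frozenset(t) for t in transactions]
    items = sorted(set().union(*transactions)) if transactions else []
    longest = max((len(t) for t in transactions), default=0)

    # Try the largest candidate size first and stop at the first size
    # that yields at least one frequent item set.
    for k in range(longest, 0, -1):
        frequent = []
        for candidate in combinations(items, k):
            c = frozenset(candidate)
            support = sum(1 for t in transactions if c <= t)
            if support >= min_support:
                frequent.append(c)
        if frequent:
            return k, frequent
    return 0, []


# Hypothetical toy data, not taken from the paper.
toy = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}]
size_at_2, sets_at_2 = max_length_frequent_itemsets(toy, min_support=2)
size_at_3, sets_at_3 = max_length_frequent_itemsets(toy, min_support=3)
```

    Raising the support threshold shrinks the maximum length, which is why only the longest frequent item sets need to be reported rather than every frequent subset.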

  16. A comparison of response spectrum and direct integration analysis methods as applied to a nuclear component support structure (U)

    International Nuclear Information System (INIS)

    Seismic qualification of Class I nuclear components is accomplished using a variety of analytical methods. This paper compares the results of time history dynamic analyses of a heat exchanger support structure using response spectrum and time history direct integration analysis methods. Dynamic analysis is performed on the detailed component models using the two methods. A linear elastic model is used for both the response spectrum and direct integration methods. A nonlinear model, which includes friction and nonlinear springs, is analyzed using time history input by direct integration. The loads from the three cases are compared.

  17. An iterative method applied to optimize the design of PIN photodiodes for enhanced radiation tolerance and maximum light response

    Energy Technology Data Exchange (ETDEWEB)

    Cedola, A.P., E-mail: ariel.cedola@ing.unlp.edu.a [Grupo de Estudio de Materiales y Dispositivos Electronicos (GEMyDE), Dpto. Electrotecnia, Facultad de Ingenieria, Universidad Nacional de La Plata, 48 y 116, C.C. 91, La Plata 1900, Buenos Aires (Argentina); Cappelletti, M.A. [Grupo de Estudio de Materiales y Dispositivos Electronicos (GEMyDE), Dpto. Electrotecnia, Facultad de Ingenieria, Universidad Nacional de La Plata, 48 y 116, C.C. 91, La Plata 1900, Buenos Aires (Argentina); Casas, G. [Grupo de Estudio de Materiales y Dispositivos Electronicos (GEMyDE), Dpto. Electrotecnia, Facultad de Ingenieria, Universidad Nacional de La Plata, 48 y 116, C.C. 91, La Plata 1900, Buenos Aires (Argentina); Universidad Nacional de Quilmes, Roque Saenz Pena 352, Bernal 1876, Buenos Aires (Argentina); Peltzer y Blanca, E.L. [Grupo de Estudio de Materiales y Dispositivos Electronicos (GEMyDE), Dpto. Electrotecnia, Facultad de Ingenieria, Universidad Nacional de La Plata, 48 y 116, C.C. 91, La Plata 1900, Buenos Aires (Argentina); Instituto de Fisica de Liquidos y Sistemas Biologicos (IFLYSIB), CONICET - UNLP - CIC, La Plata 1900, Buenos Aires (Argentina)

    2011-02-11

    An iterative method based on numerical simulations was developed to enhance the proton radiation tolerance and the responsivity of Si PIN photodiodes. The method allows calculation of the optimal values of the intrinsic layer thickness and the incident light wavelength as a function of the light intensity and the maximum proton fluence to be supported by the device. These results minimize the effects of radiation on the total reverse current of the photodiode and maximize its response to light. The implementation of the method is useful in the design of devices whose operation point should not suffer variations due to radiation.

  18. Social Responsibility in Research Practice: Engaging applied scientists with the socio-ethical context of their work

    NARCIS (Netherlands)

    Schuurbiers, D.

    2010-01-01

    How to encourage researchers to critically reflect on the ethical and social dimensions of their work? That is the central research question of this thesis. It starts from the assumption that the neutrality view of the social responsibility of the researcher – the view that researchers have no busin

  19. A Comparison of Sales Response Predictions From Demand Models Applied to Store-Level versus Panel Data

    NARCIS (Netherlands)

    Andrews, Rick L.; Currim, Imran S.; Leeflang, Peter S. H.

    2011-01-01

    In order to generate sales promotion response predictions, marketing analysts estimate demand models using either disaggregated (consumer-level) or aggregated (store-level) scanner data. Comparison of predictions from these demand models is complicated by the fact that models may accommodate differe

  20. Applying dynamic parameters to predict hemodynamic response to volume expansion in spontaneously breathing patients with septic shock.

    Science.gov (United States)

    Lanspa, Michael J; Grissom, Colin K; Hirshberg, Eliotte L; Jones, Jason P; Brown, Samuel M

    2013-02-01

    Volume expansion is a mainstay of therapy in septic shock, although its effect is difficult to predict using conventional measurements. Dynamic parameters, which vary with respiratory changes, appear to predict hemodynamic response to fluid challenge in mechanically ventilated, paralyzed patients. Whether they predict response in patients who are free from mechanical ventilation is unknown. We hypothesized that dynamic parameters would be predictive in patients not receiving mechanical ventilation. This is a prospective, observational, pilot study. Patients with early septic shock who were not receiving mechanical ventilation received 10-mL/kg volume expansion (VE) at their treating physician's discretion after initial resuscitation in the emergency department. We used transthoracic echocardiography to measure vena cava collapsibility index and aortic velocity variation before VE. We used a pulse contour analysis device to measure stroke volume variation (SVV). Cardiac index was measured immediately before and after VE using transthoracic echocardiography. Hemodynamic response was defined as an increase in cardiac index of 15% or greater. Fourteen patients received VE, five of whom demonstrated a hemodynamic response. Vena cava collapsibility index and SVV were predictive (area under the curve = 0.83, 0.92, respectively). Optimal thresholds were calculated: vena cava collapsibility index, 15% or greater (positive predictive value, 62%; negative predictive value, 100%; P = 0.03); SVV, 17% or greater (positive predictive value, 100%; negative predictive value, 82%; P = 0.03). Aortic velocity variation was not predictive. Vena cava collapsibility index and SVV predict hemodynamic response to fluid challenge in patients with septic shock who are not mechanically ventilated. Optimal thresholds differ from those described in mechanically ventilated patients.
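
    The predictive values quoted above follow from a 2x2 table of test result against hemodynamic response. The abstract does not report the cell counts, so the counts in this sketch are hypothetical: they are simply one set of values consistent with 14 patients, 5 responders and the reported percentages, assuming every patient had both measurements. The function and variable names are likewise illustrative.

```python
def predictive_values(tp, fp, tn, fn):
    """Return (PPV, NPV) from the four cells of a 2x2 diagnostic table."""
    ppv = tp / (tp + fp)  # fraction of test-positives who responded
    npv = tn / (tn + fn)  # fraction of test-negatives who did not respond
    return ppv, npv


# Hypothetical counts (NOT reported in the abstract), chosen so that
# tp + fn = 5 responders and fp + tn = 9 non-responders.
vcci_counts = dict(tp=5, fp=3, tn=6, fn=0)  # vena cava collapsibility index >= 15%
svv_counts = dict(tp=3, fp=0, tn=9, fn=2)   # stroke volume variation >= 17%

vcci_ppv, vcci_npv = predictive_values(**vcci_counts)
svv_ppv, svv_npv = predictive_values(**svv_counts)

print(f"VCCI: PPV={vcci_ppv:.3f}, NPV={vcci_npv:.3f}")
print(f"SVV:  PPV={svv_ppv:.3f}, NPV={svv_npv:.3f}")
```

    With so few patients, moving a single patient between cells shifts these percentages by many points, which fits the authors' description of the work as a pilot study.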

  1. Differential item functioning in Patient Reported Outcomes Measurement Information System® (PROMIS®) Physical Functioning short forms: Analyses across ethnically diverse groups

    OpenAIRE

    Jones, Richard N.; Tommet, Doug; Ramirez, Mildred; Jensen, Roxanne; Teresi, Jeanne A.

    2016-01-01

    We analyzed physical functioning short form items derived from the PROMIS® item bank (PF16) using data from more than 5,000 recently diagnosed, ethnically diverse cancer patients. Our goal was to determine if the short form items demonstrated evidence of differential item functioning (DIF) according to sociodemographic characteristics in this clinical sample. We evaluated responses for evidence of unidimensionality, local independence (given a single common factor), differential item functi...

  2. Influence Applied Potential on the Formation of Self-Organized ZnO Nanorod Film and Its Photoelectrochemical Response

    Directory of Open Access Journals (Sweden)

    Nur Azimah Abd Samad

    2016-01-01

    The present paper reports on the facile formation of a ZnO nanorod photocatalyst electrodeposited on Zn foil for the production of hydrogen gas via water photoelectrolysis. Based on the results, ZnO nanorod films were successfully grown via electrochemical deposition in an optimum electrolyte set of 0.5 mM zinc chloride and 0.1 M potassium chloride at a pH of 5-6 and an electrochemical deposition temperature of around 70 °C. The study was also conducted at a very low stirring rate with different applied potentials. Applied potential was one of the crucial aspects in the formation of self-organized ZnO nanorod film via control of the field-assisted dissolution and field-assisted deposition rates during the electrochemical deposition process. Interestingly, a low applied potential of 1 V during electrochemical deposition produced a high aspect ratio and density of self-organized ZnO nanorod distribution on the Zn substrate, with an average diameter and length of ~37.9 nm and ~249.5 nm, respectively. This film exhibited a high photocurrent density that reached 17.8 mA/cm² under ultraviolet illumination and 12.94 mA/cm² under visible illumination. This behaviour was attributed to the faster transport of photogenerated electron/hole pairs in the nanorod’s one-dimensional wall surface, which prevented backward reactions and further reduced the number of recombination centres.

  4. Understanding emotional responses to breast/ovarian cancer genetic risk assessment: an applied test of a cognitive theory of emotion.

    Science.gov (United States)

    Phelps, Ceri; Bennett, Paul; Brain, Kate

    2008-10-01

    This study explored whether Smith and Lazarus' (1990, 1993) cognitive theory of emotion could predict emotional responses to an emotionally ambiguous real-life situation. Questionnaire data were collected from 145 women upon referral for cancer genetic risk assessment. These indicated a mixed emotional reaction of both positive and negative emotions to the assessment. Hierarchical regression analyses revealed that the hypothesised models explained between 20% and 33% of the variance of anxiety, hope and gratitude scores, but only 10% of the variance for challenge scores. For the previously unmodelled emotion of relief, 31% of the variance was explained by appraisals and core relational themes. The findings help explain why emotional responses to cancer genetic risk assessment vary and suggest that improving the accuracy of individuals' beliefs and expectations about the assessment process may help subsequent adaptation to risk information. PMID:18942008

  5. GIS analysis to apply theoretical Minimal Model on glacier flow line and assess glacier response in climate change scenarios

    OpenAIRE

    Moretti, M.; Mattavelli, M.; De Amicis, Mattia; Maggi, V.

    2014-01-01

    The development of theoretical work on glacier dynamics has given rise to the construction of mathematical models to assess glacier response in climate change scenarios. Glaciers are sentinels of climate conditions, and the Project of Interest NextData will favour the production of new data on present and past climatic variability and future climate projections, as well as new assessments of the impact of climate change on the environment. The aim of this specific research program is to develo...

  6. A hybrid multiple criteria decision analysis framework for corporate social responsibility implementation applied to an extractive industry case study

    OpenAIRE

    Poplawska, Jolanta; Labib, Ashraf; Reed, Debbie

    2015-01-01

    Integration of Corporate Social Responsibility (CSR) into a company’s mainstream strategy is a complex task. Practical implementation of CSR requires analysis of both the external and internal environments to determine the prospects and challenges significantly influencing integration of sustainability into business strategy. In order to overcome limitations of single multi criteria decision analysis (MCDA) models, this article proposes a hybrid integrated framework combining cognitive mappin...

  7. Applying Disruptive Preference Test Protocols to Increase the Number of "No Preference" Responses in the Placebo Pair, Using Chinese Consumers.

    Science.gov (United States)

    Xia, Yixun; Zhong, Fang; O'Mahony, Michael

    2016-09-01

    One form of paired preference test protocol requires consumers to assess 2 pairs of products. One is the target pair under consideration, while the other is a putatively identical pair, named the "placebo pair", which is presented as a control. Counterintuitively, the majority of consumers report preferences when presented with the placebo pair. Their response frequencies are hypothesized to be those of consumers having "no preference" and are compared with the response frequencies elicited by a target pair, to determine whether the target pair elicits significant preferences. The primary goal of this paper was to study the robustness of 2 new so-called disruptive protocols that reduced the proportion of consumers who reported preferences when assessing a putatively identical pair of products. For this task, the tests were performed in a different language, in a different country, using different products from before. The results showed that the proportion of consumers reporting preferences for the placebo pair was reduced, confirming earlier work. Also, comparison of d' values showed a lack of significant overall differences between the placebo and target pairs, while chi-squared analyses indicated significant differences in the response frequencies. This indicated that the sample was segmented into 2 balanced groups with opposing preferences.

  8. 49 CFR 375.207 - What items must be in my advertisements?

    Science.gov (United States)

    2010-10-01

    ... 49 Transportation 5 2010-10-01 2010-10-01 false What items must be in my advertisements? 375.207... Services to My Customers General Responsibilities § 375.207 What items must be in my advertisements? (a) You and your agents must publish and use only truthful, straightforward, and honest advertisements....

  9. From items to syndromes in the Hypomania Checklist (HCL-32): Psychometric validation and clinical validity analysis

    DEFF Research Database (Denmark)

    Bech, P; Christensen, E M; Vinberg, M;

    2011-01-01

    -taking/irritable behaviour in the HCL-32. Using the Bech-Rafaelsen Mania Scale as index of clinical validity a shorter version was developed. Item response theory analysis was used to evaluate whether the total score of the HCL-32 was sufficient to measure subthreshold bipolarity. The short 13-item Mood Disorder...

  10. Bounds on Quantiles in the Presence of Full and Partial Item Nonresponse

    NARCIS (Netherlands)

    Vazquez-Alvarez, R.; Melenberg, B.; van Soest, A.H.O.

    1999-01-01

    Microeconomic surveys are usually subject to the problem of item nonresponse, typically associated with variables like income and wealth, where confidentiality and/or lack of accurate information can affect the response behavior of the individual. Follow up categorical questions can reduce item nonr

  11. 75 FR 57287 - Notice of Intent to Repatriate a Cultural Item: Oshkosh Public Museum, Oshkosh, WI

    Science.gov (United States)

    2010-09-20

    ... National Park Service Notice of Intent to Repatriate a Cultural Item: Oshkosh Public Museum, Oshkosh, WI... repatriate a cultural item in the possession of the Oshkosh Public Museum, Oshkosh, WI, that meets the... determinations in this notice are the sole responsibility of the museum, institution, or Federal agency that...

  12. Parent Ratings of ADHD Symptoms: Generalized Partial Credit Model Analysis of Differential Item Functioning across Gender

    Science.gov (United States)

    Gomez, Rapson

    2012-01-01

    Objective: Generalized partial credit model, which is based on item response theory (IRT), was used to test differential item functioning (DIF) for the "Diagnostic and Statistical Manual of Mental Disorders" (4th ed.), inattention (IA), and hyperactivity/impulsivity (HI) symptoms across boys and girls. Method: To accomplish this, parents completed…

  13. A Paradox in the Study of the Benefits of Test-Item Review

    Science.gov (United States)

    van der Linden, Wim J.; Jeon, Minjeong; Ferrara, Steve

    2011-01-01

    According to a popular belief, test takers should trust their initial instinct and retain their initial responses when they have the opportunity to review test items. More than 80 years of empirical research on item review, however, has contradicted this belief and shown minor but consistently positive score gains for test takers who changed…

  14. A Simulation Study of the Effects of Ability Range Restriction on IRT Item Bias Detection Procedures.

    Science.gov (United States)

    Lautenschlager, Gary J.; Park, Dong-Gun

    The effects of variations in degree of range restriction and different subgroup sample sizes on the validity of several item bias detection procedures based on Item Response Theory (IRT) were investigated in a simulation study. The degree of range restriction for each of two subpopulations was varied by cutting the specified subpopulation ability…

  15. IRT Item Parameter Recovery with Marginal Maximum Likelihood Estimation Using Loglinear Smoothing Models

    Science.gov (United States)

    Casabianca, Jodi M.; Lewis, Charles

    2015-01-01

    Loglinear smoothing (LLS) estimates the latent trait distribution while making fewer assumptions about its form and maintaining parsimony, thus leading to more precise item response theory (IRT) item parameter estimates than standard marginal maximum likelihood (MML). This article provides the expectation-maximization algorithm for MML estimation…

  16. Parallel Matrix Factorization for Binary Response

    CERN Document Server

    Khanna, Rajiv; Agarwal, Deepak; Chen, Beechung

    2012-01-01

    Predicting user affinity to items is an important problem in applications like content optimization, computational advertising, and many more. While bilinear random effect models (matrix factorization) provide state-of-the-art performance when minimizing RMSE through a Gaussian response model on explicit ratings data, applying them to imbalanced binary response data presents additional challenges that we carefully study in this paper. Data in many applications usually consist of users' implicit responses that are often binary -- clicking an item or not; the goal is to predict click rates, which are often combined with other measures to calculate utilities to rank items at runtime in recommender systems. Because of their implicit nature, such data are usually much larger than explicit rating data and often have an imbalanced distribution with a small fraction of click events, making accurate click rate prediction difficult. In this paper, we address two problems. First, we show previous techniques to estimate bi...

  17. Applying Central Composite Design and Response Surface Methodology to Optimize Growth and Biomass Production of Haemophilus influenzae Type b

    Science.gov (United States)

    Momen, Seyed Bahman; Siadat, Seyed Davar; Akbari, Neda; Ranjbar, Bijan; Khajeh, Khosro

    2016-01-01

    Background: Haemophilus influenzae type b (Hib) is the leading cause of bacterial meningitis, otitis media, pneumonia, cellulitis, bacteremia, and septic arthritis in infants and young children. The Hib capsule contains the major virulence factor, and is composed of polyribosyl ribitol phosphate (PRP) that can induce immune system response. Vaccines consisting of Hib capsular polysaccharide (PRP) conjugated to a carrier protein are effective in the prevention of the infections. However, due to costly processes in PRP production, these vaccines are too expensive. Objectives: To enhance biomass, in this research we focused on optimizing Hib growth with respect to physical factors such as pH, temperature, and agitation by using a response surface methodology (RSM). Materials and Methods: We employed a central composite design (CCD) and a response surface methodology to determine the optimum cultivation conditions for growth and biomass production of H. influenzae type b. The treatment factors investigated were initial pH, agitation, and temperature, using shaking flasks. After Hib cultivation and determination of dry biomass, analysis of experimental data was performed by the RSM-CCD. Results: The model showed that temperature and pH had an interactive effect on Hib biomass production. The dry biomass produced in shaking flasks was about 5470 mg/L, which was under an initial pH of 8.5, at 250 rpm and 35 °C. Conclusions: We found CCD and RSM very effective in optimizing Hib culture conditions, and Hib biomass production was greatly influenced by pH and incubation temperature. Therefore, optimization of the growth factors to maximize Hib production can lead to 1) an increase in bacterial biomass and PRP productions, 2) lower vaccine prices, 3) vaccination of more susceptible populations, and 4) lower risk of Hib infections.

  18. Current Situations and Future Perspectives of Applying Corporate Responsibility in Finland : Cases: Alma Media, Finnair, Finnvera, Metso, UPM

    OpenAIRE

    Bui, Nghiem Dac Vinh

    2012-01-01

    Corporate Responsibility (CR) has become a very popular concept in business life. Stakeholders use CR to monitor and evaluate a company's operation(s). Companies use CR as a method of risk management, strategic marketing, and even in attracting new investors. The emergence of CR worldwide has been fast and obvious. Within 50 years, CR has developed from a small initiative into an important part of every business. In Finland, the concept of CR was also adopted and utilised. This pape...

  19. Segmenting and targeting American university students to promote responsible alcohol use: a case for applying social marketing principles.

    Science.gov (United States)

    Deshpande, Sameer; Rundle-Thiele, Sharyn

    2011-10-01

    The current study contributes to the social marketing literature in the American university binge-drinking context in three innovative ways. First, it profiles drinking segments by "values" and "expectancies" sought from behaviors. Second, the study compares segment values and expectancies of two competing behaviors, that is, binge drinking and participation in alternative activities. Third, the study compares the influence of a variety of factors on both behaviors in each segment. Finally, based on these findings and feedback from eight university alcohol prevention experts, appropriate strategies to promote responsible alcohol use for each segment are proposed. PMID:22054026

  20. Response of a particle in a one-dimensional lattice to an applied force: Dynamics of the effective mass

    CERN Document Server

    Duque-Gomez, Federico

    2012-01-01

    We study the behaviour of the expectation value of the acceleration of a particle in a one-dimensional periodic potential when an external homogeneous force is suddenly applied. The theory is formulated in terms of modified Bloch states that include the interband mixing induced by the force. This approach allows us to understand the behaviour of the wavepacket, which responds with a mass that is initially the bare mass, and subsequently oscillates around the value predicted by the effective mass. If Zener tunneling can be neglected, the expression obtained for the acceleration of the particle is valid over timescales of the order of a Bloch oscillation, which are of interest for experiments with cold atoms in optical lattices. We discuss how these oscillations can be tuned in an optical lattice for experimental detection.

  1. Measuring prescribing: the shortcomings of the item.

    OpenAIRE

    Bogle, S. M.; Harris, C. M.

    1994-01-01

    OBJECTIVES--To assess the validity of the item as a measure of the volume of a drug prescribed; and to investigate the possibility that higher quantities per item are prescribed for patients who are not exempt from the prescription charge. DESIGN--Five substudies. For the first, a frequency distribution was derived of the different quantities per item of 10 commonly used drugs prescribed by 20 randomly selected practices in each of five family health service authority areas. For the second, t...

  2. Periradicular Tissue Responses to Biologically Active Molecules or MTA When Applied in Furcal Perforation of Dogs' Teeth

    Directory of Open Access Journals (Sweden)

    Anna Zairi

    2012-01-01

    Full Text Available The aim of this study was the comparative evaluation of inflammatory reactions and tissue responses to four growth factors, or mineral trioxide aggregate (MTA), or a zinc-oxide-eugenol-based cement (IRM) as controls, when used for the repair of furcal perforations in dogs’ teeth. Results showed significantly higher inflammatory cell response in the transforming growth factor β1 (TGFβ1) and zinc-oxide-eugenol-based cement (IRM) groups and higher rates of epithelial proliferation in the TGFβ1, basic fibroblast growth factor (bFGF), and insulin growth factor-I (IGF-I) groups compared to the MTA. Significantly higher rates of bone formation were found in the control groups compared to the osteogenic protein-1 (OP-1). Significantly higher rates of cementum formation were observed in the IGF-I and bFGF groups compared to the IRM. None of the biologically active molecules can be suggested for repairing furcal perforations, despite the fact that growth factors exerted a clear stimulatory effect on cementum formation and inhibited collagen capsule formation. MTA exhibited better results than the growth factors.

  3. Assessment of Water Quality in Coastal Environments of Mohammedia Applying Responses of Biochemical Biomarkers in the Brown Mussel Perna perna

    Directory of Open Access Journals (Sweden)

    Laila El Jourmi

    2012-01-01

    Full Text Available The present work aims to assess the marine environment quality in Mohammedia, using the response of biochemical biomarkers in the brown mussel Perna perna. The biomarkers selected in this work are glutathione S-transferase (GST) as a phase II enzyme and acetylcholinesterase (AChE) activity as a neurotoxicity marker. Oxidative stress is evaluated using catalase (CAT), a well-known antioxidant enzyme, and malondialdehyde (MDA) accumulation as a marker of oxidation of membrane phospholipids through lipid peroxidation. Finally, metallothionein (MT) is used as a stress protein. Our data indicated that CAT and GST activities and MDA and MT concentrations in whole mussel bodies are significantly higher (p < 0.05) in mussels collected at the polluted site than in specimens sampled from the control site. In contrast, AChE activity was significantly inhibited in mussels from the polluted site compared to the control value. The multi-marker results confirm that mussels from Mohammedia have been exposed to a polluted environment.

  4. Wavelet prism decomposition analysis applied to CARS spectroscopy: a tool for accurate and quantitative extraction of resonant vibrational responses.

    Science.gov (United States)

    Kan, Yelena; Lensu, Lasse; Hehl, Gregor; Volkmer, Andreas; Vartiainen, Erik M

    2016-05-30

    We propose an approach, based on wavelet prism decomposition analysis, for correcting experimental artefacts in a coherent anti-Stokes Raman scattering (CARS) spectrum. This method allows estimating and eliminating a slowly varying modulation error function in the measured normalized CARS spectrum and yields a corrected CARS line-shape. The main advantage of the approach is that the spectral phase and amplitude corrections are avoided in the retrieved Raman line-shape spectrum, thus significantly simplifying the quantitative reconstruction of the sample's Raman response from a normalized CARS spectrum in the presence of experimental artefacts. Moreover, the approach obviates the need for assumptions about the modulation error distribution and the chemical composition of the specimens under study. The method is quantitatively validated on normalized CARS spectra recorded for equimolar aqueous solutions of D-fructose, D-glucose, and their disaccharide combination sucrose. PMID:27410113

  5. From concepts to lexical items.

    Science.gov (United States)

    Bierwisch, M; Schreuder, R

    1992-03-01

    In this paper we address the question how in language production conceptual structures are mapped onto lexical items. First we describe the lexical system in a fairly abstract way. Such a system consists of, among other things, a fixed set of basic lexical entries characterized by four groups of information: phonetic form, grammatical features, argument structure, and semantic form. A crucial assumption of the paper is that the meaning in a lexical entry has a complex internal structure composed of more primitive elements (decomposition). Some aspects of argument structure and semantic form and their interaction are discussed with respect to the issue of synonymy. We propose two different mappings involved in lexical access. One maps conceptual structures to semantic forms, and the other maps semantic forms to conceptual structures. Both mappings are context dependent and are many-to-many mappings. We present an elaboration of Levelt's (1989) model in which these processes interact with the grammatical encoder and the mental lexicon. Then we address the consequences of decomposition for processing models, especially the nature of the input of lexical access and the time course. Processing models that use the activation metaphor may have difficulties accounting for certain phenomena where a certain lemma triggers not one, but two or more word forms that have to be produced with other word forms in between.

  6. 41 CFR 101-30.701-1 - Item reduction study.

    Science.gov (United States)

    2010-07-01

    ... 41 Public Contracts and Property Management 2 2010-07-01 2010-07-01 true Item reduction study. 101....7-Item Reduction Program § 101-30.701-1 Item reduction study. Item reduction study means the study... so identified, a replacement item shall be proposed. The result of item reduction studies...

  7. 41 CFR 101-30.701-2 - Item standardization code.

    Science.gov (United States)

    2010-07-01

    ... 41 Public Contracts and Property Management 2 2010-07-01 2010-07-01 true Item standardization code....7-Item Reduction Program § 101-30.701-2 Item standardization code. Item standardization code (ISC) means a code assigned an item in the supply system which identifies the item as authorized...

  8. An Item Analysis and Validity Investigation of Bender Visual Motor Gestalt Test Score Items

    Science.gov (United States)

    Lambert, Nadine M.

    1971-01-01

    This investigation attempted to demonstrate the utility of standard item analysis procedures for selecting the most reliable and valid items for scoring Bender Visual Motor Gestalt Test records. (Author)

  9. Quantitative Analysis of Complex Multiple-Choice Items in Science Technology and Society: Item Scaling

    Directory of Open Access Journals (Sweden)

    Ángel Vázquez Alonso

    2005-05-01

    Full Text Available The scarce attention to assessment and evaluation in science education research has been especially harmful for Science-Technology-Society (STS) education, due to the dialectic, tentative, value-laden, and controversial nature of most STS topics. To overcome the methodological pitfalls of the STS assessment instruments used in the past, an empirically developed instrument (VOSTS, Views on Science-Technology-Society) has been suggested. Some methodological proposals, namely the multiple response models and the computing of a global attitudinal index, were suggested to improve the item implementation. The final step of these methodological proposals requires the categorization of STS statements. This paper describes the process of categorization through a scaling procedure ruled by a panel of experts, acting as judges, according to the body of knowledge from history, epistemology, and sociology of science. The statement categorization allows for the sound foundation of STS items, which is useful in educational assessment and science education research, and may also increase teachers’ self-confidence in the development of the STS curriculum for science classrooms.

  10. Sharing the cost of redundant items

    DEFF Research Database (Denmark)

    Hougaard, Jens Leth; Moulin, Hervé

    2014-01-01

    are network connectivity problems when an existing (possibly inefficient) network must be maintained. We axiomatize a family of cost ratios based on simple liability indices, one for each agent and for each item, measuring the relative worth of this item across agents, and generating cost allocation rules...

  11. 38 CFR 3.1606 - Transportation items.

    Science.gov (United States)

    2010-07-01

    ... 38 Pensions, Bonuses, and Veterans' Relief 1 2010-07-01 2010-07-01 false Transportation items. 3... Burial Benefits § 3.1606 Transportation items. The transportation costs of those persons who come within... shipment. (6) Cost of transportation by common carrier including amounts paid as Federal taxes. (7) Cost...

  12. Item versus System Learning: Explaining Free Variation.

    Science.gov (United States)

    Ellis, Rod

    1999-01-01

    Provides an explanation for the existence of free variation in learner language. Argues that interlanguage is best conceptualized as sets of loose lexical networks that are gradually reorganized into a system or systems. Free variation arises when learners add items to those they have already acquired and before they analyze these items and…

  13. Applying the LLTM for the determination of children’s cognitive age-acceleration function

    Directory of Open Access Journals (Sweden)

    Klaus D. Kubinger

    2011-06-01

    Full Text Available The paper uses Item Response Theory (IRT) for modeling and hypothesis testing of children’s cognitive age-acceleration function within the calibration and standardization of an intelligence test. For this, Fischer’s linear logistic test model (LLTM; Fischer, 1973, 2005) is applied. However, instead of decomposing the item difficulty parameters of the Rasch model into certain hypothesized elementary parameters, as originally intended, we suggest decomposing the person parameter in the same way. That is, there is a decomposition into a testee’s basic ability parameter and an age-leveled effect due to the developmental stage of the age group in question. For convenience, we simply interchange testees and items in order to facilitate parameter estimation and model testing; the Rasch model is, of course, totally symmetric with respect to testees and items. By doing so, all findings in the context of the LLTM apply; in particular, pertinent program packages are at our disposal. In order to examine the feasibility of the suggested approach, an empirical example is given. An analogy test with eight items, administered to more than 300 testees aged between 6 and 16, was analyzed. As a matter of fact, the logistic acceleration function proved to fit the data well and best.
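
    The decomposition described in this record can be written out as follows. The first two lines are the standard Rasch model and the standard LLTM constraint on item difficulties; the third line is only a sketch of the person-side analogue, and its notation (a basic ability term plus an effect for the testee's age group g(v)) is an assumption made here, not notation taken from the paper.

```latex
% Rasch model: probability that testee v solves item i
P(X_{vi} = 1 \mid \theta_v, \beta_i)
  = \frac{\exp(\theta_v - \beta_i)}{1 + \exp(\theta_v - \beta_i)}

% LLTM: item difficulty as a weighted sum of elementary parameters \eta_j,
% with known weights q_{ij} and a normalization constant c
\beta_i = \sum_{j=1}^{p} q_{ij}\,\eta_j + c

% Person-side analogue (assumed notation): basic ability \theta_v^{(0)}
% plus an age-level effect \delta for the age group g(v) of testee v
\theta_v = \theta_v^{(0)} + \delta_{g(v)}
```

    Because testees and items enter the Rasch model symmetrically (only the difference between the two parameters matters), swapping their roles lets the third line be estimated with the same machinery as the second.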

  14. How Important Are Items on a Student Evaluation? A Study of Item Salience

    Science.gov (United States)

    Hills, Stacey Barlow; Naegle, Natali; Bartkus, Kenneth R.

    2009-01-01

    Although student evaluations of teaching (SETs) have been the subject of numerous research studies, the salience of SET items to students has not been examined. In the present study, the authors surveyed 484 students from a large public university. The authors suggest that not all items are viewed equally and that measures of item salience can…

  15. Designing P-Optimal Item Pools in Computerized Adaptive Tests with Polytomous Items

    Science.gov (United States)

    Zhou, Xuechun

    2012-01-01

    Current CAT applications consist of predominantly dichotomous items, and CATs with polytomously scored items are limited. To ascertain the best approach to polytomous CAT, a significant amount of research has been conducted on item selection, ability estimation, and impact of termination rules based on polytomous IRT models. Few studies…

  16. Studies on the response of resistive-wall modes to applied magnetic perturbations in the EXTRAP T2R reversed field pinch

    Science.gov (United States)

    Gregoratto, D.; Drake, J. R.; Yadikin, D.; Liu, Y. Q.; Paccagnella, R.; Brunsell, P. R.; Bolzonella, T.; Marchiori, G.; Cecconello, M.

    2005-09-01

    Arrays of magnetic coils and sensors in the EXTRAP T2R [P. R. Brunsell et al., Plasma Phys. Controlled Fusion 43 1457 (2001)] reversed-field pinch have been used to investigate the plasma response to an applied resonant magnetic perturbation in the range of the resistive-wall modes (RWMs). Measured RWM growth rates agree with predictions of a cylindrical ideal-plasma model. The linear growth of low-n marginally stable RWMs is related to the so-called resonant-field amplification due to a dominant |n| = 2 machine error field of about 2 G. The dynamics of the m = 1 RWMs interacting with the applied field produced by the coils can be accurately described by a two-pole system. Estimated poles and residues are given with sufficient accuracy by the cylindrical model with a thin continuous wall.

  17. Calibration of the PROMIS physical function item bank in Dutch patients with rheumatoid arthritis.

    Directory of Open Access Journals (Sweden)

    Martijn A H Oude Voshaar

    Full Text Available OBJECTIVE: To calibrate the Dutch-Flemish version of the PROMIS physical function (PF) item bank in patients with rheumatoid arthritis (RA) and to evaluate cross-cultural measurement equivalence with US general population and RA data. METHODS: Data were collected from RA patients enrolled in the Dutch DREAM registry. An incomplete longitudinal anchored design was used where patients completed all 121 items of the item bank over the course of three waves of data collection. Item responses were fit to a generalized partial credit model adapted for longitudinal data and the item parameters were examined for differential item functioning (DIF) across country, age, and sex. RESULTS: In total, 690 patients participated in the study at time point 1 (T2, N = 489; T3, N = 311). The item bank could be successfully fitted to a generalized partial credit model, with the number of misfitting items falling within acceptable limits. Seven items demonstrated DIF for sex, while 5 items showed DIF for age in the Dutch RA sample. Twenty-five (20%) items were flagged for cross-cultural DIF compared to the US general population. However, the impact of observed DIF on total physical function estimates was negligible. DISCUSSION: The results of this study showed that the PROMIS PF item bank adequately fit a unidimensional IRT model which provides support for applications that require invariant estimates of physical function, such as computer adaptive testing and targeted short forms. More studies are needed to further investigate the cross-cultural applicability of the US-based PROMIS calibration and standardized metric.
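
    For readers unfamiliar with the generalized partial credit model mentioned in this record, the following minimal sketch computes its category probabilities for a single item. It shows the textbook cross-sectional form only; the longitudinal adaptation used in the study is not reproduced, and the function name and the example parameter values are hypothetical.

```python
import math


def gpcm_probabilities(theta, discrimination, thresholds):
    """Category probabilities of one item under the generalized partial credit model.

    theta          -- latent trait level of the respondent
    discrimination -- item slope parameter (a)
    thresholds     -- step parameters b_1..b_m; the item has m + 1 categories
    """
    # Cumulative sums of a * (theta - b_j); the empty sum for category 0 is 0.
    cumulative = [0.0]
    for b in thresholds:
        cumulative.append(cumulative[-1] + discrimination * (theta - b))
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(cumulative)
    weights = [math.exp(c - peak) for c in cumulative]
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical five-category item evaluated at theta = 0.5.
example = gpcm_probabilities(0.5, 1.2, [-1.5, -0.5, 0.4, 1.3])
```

    With a single threshold the expression reduces to the two-parameter logistic model, which is a convenient sanity check when implementing it.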

  18. Reliability, Validity, and Predictive Utility of the 25-Item Criminogenic Cognitions Scale (CCS).

    Science.gov (United States)

    Tangney, June Price; Stuewig, Jeffrey; Furukawa, Emi; Kopelovich, Sarah; Meyer, Patrick; Cosby, Brandon

    2012-10-01

    Theory, research, and clinical reports suggest that moral cognitions play a role in initiating and sustaining criminal behavior. The 25-item Criminogenic Cognitions Scale (CCS) was designed to tap 5 dimensions: Notions of Entitlement; Failure to Accept Responsibility; Short-Term Orientation; Insensitivity to Impact of Crime; and Negative Attitudes Toward Authority. Results from 552 jail inmates support the reliability, validity, and predictive utility of the measure. The CCS was linked to criminal justice system involvement, self-report measures of aggression, impulsivity, and lack of empathy. Additionally, the CCS was associated with violent criminal history, antisocial personality, and clinicians' ratings of risk for future violence and psychopathy (PCL:SV). Furthermore, criminogenic thinking upon incarceration predicted subsequent official reports of inmate misconduct during incarceration. CCS scores varied somewhat by gender and race. Research and applied uses of CCS are discussed. PMID:24072946

  19. Effect of Applied Potential on the Formation of Self-Organized TiO2 Nanotube Arrays and Its Photoelectrochemical Response

    Directory of Open Access Journals (Sweden)

    Chin Wei Lai

    2011-01-01

    Full Text Available Self-organized TiO2 nanotube arrays have been fabricated by anodization of Ti foil in an electrochemical bath consisting of 1 M of glycerol with 0.5 wt% of NH4F. The effects of applied potential on the resulting nanotubes were illustrated. Among all of the applied potentials, 30 V resulted in TiO2 nanotube arrays with the highest uniformity and aspect ratio, with a tube length of approximately 1 μm and a pore size of 85 nm. TiO2 nanotube arrays were amorphous in the as-anodized condition. The anatase phase was observed after annealing at 400 °C in air atmosphere. The effect of crystallization and effective surface area of TiO2 nanotube arrays in connection with the photoelectrochemical response was reported. Photoelectrochemical response under illumination was enhanced by using the annealed TiO2 nanotube arrays, which have a larger effective surface area to promote more photoinduced electrons.

  20. Comparisons of methamphetamine psychotic and schizophrenic symptoms: a differential item functioning analysis.

    Science.gov (United States)

    Srisurapanont, Manit; Arunpongpaisal, Suwanna; Wada, Kiyoshi; Marsden, John; Ali, Robert; Kongsakon, Ronnachai

    2011-06-01

    The concept of negative symptoms in methamphetamine (MA) psychosis (e.g., poverty of speech, flattened affect, and loss of drive) is still uncertain. This study aimed to use differential item functioning (DIF) statistical techniques to differentiate the severity of psychotic symptoms between MA psychotic and schizophrenic patients. Data of MA psychotic and schizophrenic patients were those of the participants in the WHO Multi-Site Project on Methamphetamine-Induced Psychosis (or WHO-MAIP study) and the Risperidone Long-Acting Injection in Thai Schizophrenic Patients (or RLAI-Thai study), respectively. To confirm the unidimensionality of psychotic syndromes, we applied exploratory and confirmatory factor analyses (EFA and CFA) to the eight items of the Manchester scale. We conducted the DIF analysis of psychotic symptoms observed in both groups by using nonparametric kernel-smoothing techniques of item response theory. A DIF composite index of 0.30 or greater indicated a difference in symptom severity. The analyses included the data of 168 MA psychotic participants and the baseline data of 169 schizophrenic patients. For both data sets, the EFA and CFA suggested a three-factor model of the psychotic symptoms, including negative syndrome (poverty of speech, psychomotor retardation and flattened/incongruous affect), positive syndrome (delusions, hallucinations and incoherent speech) and anxiety/depression syndrome (anxiety and depression). The DIF composite indexes comparing the severity differences of all eight psychotic symptoms were lower than 0.3. The results suggest that, at the same level of syndrome severity (i.e., negative, positive, and anxiety/depression syndromes), the severity of psychotic symptoms, including the negative ones, observed in MA psychotic and schizophrenic patients is almost the same. PMID:21277930

  1. The PROMIS Physical Function item bank was calibrated to a standardized metric and shown to improve measurement efficiency

    DEFF Research Database (Denmark)

    Rose, Matthias; Bjørner, Jakob; Gandek, Barbara;

    2014-01-01

    of 16,065 adults answered item subsets (n>2,200/item) on the Internet, with oversampling of the chronically ill. Classical test and item response theory methods were used to evaluate 149 PROMIS PF items plus 10 Short Form-36 and 20 Health Assessment Questionnaire-Disability Index items. A graded....... In simulations, a 10-item computerized adaptive test (CAT) eliminated floor and decreased ceiling effects, achieving higher measurement precision than any comparable length static tool across four SDs of the measurement range. Improved psychometric properties were transferred to the CAT's superior ability...... to identify differences between age and disease groups. CONCLUSION: The item bank provides a common metric and can improve the measurement of PF by facilitating the standardization of patient-reported outcome measures and implementation of CATs for more efficient PF assessments over a larger range....

  2. Translation of Transylvanian Culture-Specific Items into English

    Directory of Open Access Journals (Sweden)

    Kémenes Árpád

    2015-12-01

    Full Text Available The present paper focuses on some difficulties encountered during the translation of culture-specific items in Zsuzsa Tapodi’s articles Links between the East and the West: Historical Bonds between the Hungarians and the Balkan Peoples and Hungarian Ethnographic Region in Romania, published in the May 2014 issue of Carmina Balcanica. As far as the theoretical framework adopted in this study is concerned, the terminology on translation strategies relies on the taxonomy developed by Aixelá (1996), while the classification of culture-specific items has been influenced by Dimitriu (2002) and Yılmaz-Gümüş (2012). The study provides a definition of the term ‘culture-specific item’, considers the target readers’ awareness of source-language culture, and presents a number of translation strategies applied to mediate culture-bound information between the source and target cultures.

  3. Indirect associations between multiple items and a mining algorithm

    Institute of Scientific and Technical Information of China (English)

    Ni Min; Xu Xiaofei; Deng Shengchun

    2005-01-01

    Indirect association is a high-level relationship between items and frequent itemsets in data. Current research approaches to indirect association mining are limited to indirect association between item pairs, which discovers too many rules from a dataset. A formal definition of indirect association between multiple items is presented, along with an algorithm, SET-NIA, for mining this kind of indirect association based on the anti-monotonicity of indirect associations and a frequent item-pair support matrix. While the rules found contain the same information as the rules found by algorithms that mine indirect association between item pairs, this notion saves space in storing the rules and makes them easier for humans to understand and apply. Experiments conducted on two real-world datasets show that SET-NIA can effectively find fewer rules than existing algorithms that mine indirect association between item pairs; the experimental results also show that SET-NIA performs better than existing algorithms.

  4. Ion beam studies of archaeological gold jewellery items

    Science.gov (United States)

    Demortier, G.

    1996-06-01

    Analytical work on material of archaeological interest performed at LARN mainly concerns gold jewellery, with an emphasis on solders on the artefacts and on gold plating or copper depletion gilding. PIXE and RBS, but also PIGE and NRA, have been applied to a large variety of items. On the basis of elemental analysis, we have identified typical workmanship of ancient goldsmiths in various regions of the world: finely decorated Mesopotamian items, Hellenistic and Byzantine craftsmanship, cloisonne of the Merovingian period, depletion gilding on Pre-Colombian tumbaga. This paper is a summary of the work performed at LARN during the last ten years. Criteria for the proper use of PIXE for quantitative analysis of non-homogeneous ancient artefacts, presented at the 12th IBA conference in 1995, are also briefly discussed.

  5. Ion beam studies of archaeological gold jewellery items

    International Nuclear Information System (INIS)

    Analytical work on material of archaeological interest performed at LARN mainly concerns gold jewellery, with an emphasis on solders on the artefacts and on gold plating or copper depletion gilding. PIXE and RBS, but also PIGE and NRA, have been applied to a large variety of items. On the basis of elemental analysis, we have identified typical workmanship of ancient goldsmiths in various regions of the world: finely decorated Mesopotamian items, Hellenistic and Byzantine craftsmanship, cloisonne of the Merovingian period, depletion gilding on Pre-Colombian tumbaga. This paper is a summary of the work performed at LARN during the last ten years. Criteria for the proper use of PIXE for quantitative analysis of non-homogeneous ancient artefacts, presented at the 12th IBA conference in 1995, are also briefly discussed. (orig.)

  6. Ion beam studies of archaeological gold jewellery items

    Energy Technology Data Exchange (ETDEWEB)

    Demortier, G. [Facultes Universitaires Notre-Dame de la Paix, Namur (Belgium). Lab. d'Analyses par Reactions Nucleaires]

    1996-06-01

    Analytical work on material of archaeological interest performed at LARN mainly concerns gold jewellery, with an emphasis on solders on the artefacts and on gold plating or copper depletion gilding. PIXE and RBS, but also PIGE and NRA, have been applied to a large variety of items. On the basis of elemental analysis, we have identified typical workmanship of ancient goldsmiths in various regions of the world: finely decorated Mesopotamian items, Hellenistic and Byzantine craftsmanship, cloisonne of the Merovingian period, depletion gilding on Pre-Colombian tumbaga. This paper is a summary of the work performed at LARN during the last ten years. Criteria for the proper use of PIXE for quantitative analysis of non-homogeneous ancient artefacts, presented at the 12th IBA conference in 1995, are also briefly discussed. (orig.).

  7. Psychometric evaluation and predictive validity of Ryff's psychological well-being items in a UK birth cohort sample of women

    OpenAIRE

    Wadsworth Michael EJ; Kuh Diana; Huppert Felicia A; Ploubidis George B; Abbott Rosemary A; Croudace Tim J

    2006-01-01

    Abstract Background Investigations of the structure of psychological well-being items are useful for advancing knowledge of what dimensions define psychological well-being in practice. Ryff has proposed a multidimensional model of psychological well-being and her questionnaire items are widely used but their latent structure and factorial validity remains contentious. Methods We applied latent variable models for factor analysis of ordinal/categorical data to a 42-item version of Ryff's psych...

  8. Responses of Rapid Viscoanalyzer Profile and Other Rice Grain Qualities to Exogenously Applied Plant Growth Regulators under High Day and High Night Temperatures.

    Science.gov (United States)

    Fahad, Shah; Hussain, Saddam; Saud, Shah; Hassan, Shah; Chauhan, Bhagirath Singh; Khan, Fahad; Ihsan, Muhammad Zahid; Ullah, Abid; Wu, Chao; Bajwa, Ali Ahsan; Alharby, Hesham; Amanullah; Nasim, Wajid; Shahzad, Babar; Tanveer, Mohsin; Huang, Jianliang

    2016-01-01

    High-temperature stress degrades the grain quality of rice; nevertheless, the exogenous application of plant growth regulators (PGRs) might alleviate the negative effects of high temperatures. In the present study, we investigated the responses of rice grain quality to exogenously applied PGRs under high day temperatures (HDT) and high night temperatures (HNT) under controlled conditions. Four different combinations of ascorbic acid (Vc), alpha-tocopherol (Ve), brassinosteroids (Br), methyl jasmonates (MeJA) and triazoles (Tr) were exogenously applied to two rice cultivars (IR-64 and Huanghuazhan) prior to the high-temperature treatment. A nothing-applied control (NAC) was included for comparison. The results demonstrated that high-temperature stress was detrimental for grain appearance and milling qualities and that both HDT and HNT reduced the grain length, grain width, grain area, head rice percentage and milled rice percentage but increased the chalkiness percentage and percent area of endosperm chalkiness in both cultivars compared with ambient temperature (AT). Significantly higher grain breakdown, set back, consistence viscosity and gelatinization temperature, and significantly lower peak, trough and final viscosities were observed under high-temperature stress compared with AT. Thus, HNT was more devastating for grain quality than HDT. The exogenous application of PGRs ameliorated the adverse effects of high temperature in both rice cultivars, and Vc+Ve+MeJA+Br was the best combination for both cultivars under high-temperature stress.

  9. Responses of Rapid Viscoanalyzer Profile and Other Rice Grain Qualities to Exogenously Applied Plant Growth Regulators under High Day and High Night Temperatures

    Science.gov (United States)

    Fahad, Shah; Hussain, Saddam; Saud, Shah; Hassan, Shah; Chauhan, Bhagirath Singh; Khan, Fahad; Ihsan, Muhammad Zahid; Ullah, Abid; Wu, Chao; Bajwa, Ali Ahsan; Alharby, Hesham; Amanullah; Nasim, Wajid; Shahzad, Babar; Tanveer, Mohsin; Huang, Jianliang

    2016-01-01

    High-temperature stress degrades the grain quality of rice; nevertheless, the exogenous application of plant growth regulators (PGRs) might alleviate the negative effects of high temperatures. In the present study, we investigated the responses of rice grain quality to exogenously applied PGRs under high day temperatures (HDT) and high night temperatures (HNT) under controlled conditions. Four different combinations of ascorbic acid (Vc), alpha-tocopherol (Ve), brassinosteroids (Br), methyl jasmonates (MeJA) and triazoles (Tr) were exogenously applied to two rice cultivars (IR-64 and Huanghuazhan) prior to the high-temperature treatment. A Nothing Applied Control (NAC) was included for comparison. The results demonstrated that high-temperature stress was detrimental to grain appearance and milling qualities and that both HDT and HNT reduced the grain length, grain width, grain area, head rice percentage and milled rice percentage but increased the chalkiness percentage and percent area of endosperm chalkiness in both cultivars compared with ambient temperature (AT). Significantly higher grain breakdown, setback, consistence viscosity and gelatinization temperature, and significantly lower peak, trough and final viscosities were observed under high-temperature stress compared with AT. HNT was more devastating for grain quality than HDT. The exogenous application of PGRs ameliorated the adverse effects of high temperature in both rice cultivars, and Vc+Ve+MeJA+Br was the best combination for both cultivars under high-temperature stress. PMID:27472200

  10. Blooms' separation of the final exam of Engineering Mathematics II: Item reliability using Rasch measurement model

    Science.gov (United States)

    Fuaad, Norain Farhana Ahmad; Nopiah, Zulkifli Mohd; Tawil, Norgainy Mohd; Othman, Haliza; Asshaari, Izamarlina; Osman, Mohd Hanif; Ismail, Nur Arzilah

    2014-06-01

    In engineering studies and research, mathematics is one of the main elements used to express physical, chemical and engineering laws. It is therefore essential for engineering students to have strong knowledge of the fundamentals of mathematics in order to apply that knowledge to real-life problems. However, previous results of the Mathematics Pre-Test show that engineering students lack fundamental knowledge in certain topics in mathematics. For this reason, apart from improving the methods of teaching and learning, studies on the construction of questions (items) should also be emphasized. The purpose of this study is to assist lecturers in the process of item development, to monitor the separation of items based on Bloom's Taxonomy, and to measure the reliability of the items themselves using the Rasch Measurement Model as a tool. Using the Rasch Measurement Model, the final exam questions of Engineering Mathematics II (Linear Algebra) for semester 2 of session 2012/2013 were analysed, and the results detail the extent to which the content of each item provides useful information about students' ability. The study reveals that the items used in the Engineering Mathematics II (Linear Algebra) final exam are well constructed, but the separation of the items raises concern and needs further attention, as there is a big gap between items at several levels of Bloom's cognitive skills.

  11. A Strategy for Optimizing Item-Pool Management

    NARCIS (Netherlands)

    Ariel, Adelaide; Linden, van der Wim J.; Veldkamp, Bernard P.

    2006-01-01

    Item-pool management requires a balancing act between the input of new items into the pool and the output of tests assembled from it. A strategy for optimizing item-pool management is presented that is based on the idea of a periodic update of an optimal blueprint for the item pool to tune item prod

  12. The Analysis of The Multiple Choice Item

    Institute of Scientific and Technical Information of China (English)

    曹吴惠

    2009-01-01

    In this article, the author analyzes in detail the advantages, disadvantages and construction of the multiple-choice item in examinations. On this basis, the author also explores some aspects the teacher should pay attention to when setting an examination paper.

  13. Evaluating commercial-grade replacement items

    International Nuclear Information System (INIS)

    A nuclear power plant uses thousands of replacement items during its lifetime. When many suppliers abandoned their quality assurance programs during the plant construction slowdown of the early 1980s, an EPRI-sponsored utility group called NCIG developed guidelines to help utilities evaluate replacement items themselves. However, the need became clear for greater collaboration. Thus EPRI established the Joint Utility Task Group (JUTG) to pool utilities' resources and develop criteria for dedicating commercial-grade items for safety-related nuclear applications. So far, the JUTG has developed a generic evaluation process and completed technical evaluations of 56 replacement items. The resulting information is available in an on-line database through EPRINET

  14. NHRIC (National Health Related Items Code)

    Data.gov (United States)

    U.S. Department of Health & Human Services — The National Health Related Items Code (NHRIC) is a system for identification and numbering of marketed device packages that is compatible with other numbering...

  15. Basic Stand Alone Carrier Line Items PUF

    Data.gov (United States)

    U.S. Department of Health & Human Services — This release contains the Basic Stand Alone (BSA) Carrier Line Items Public Use Files (PUF) with information from Medicare Carrier claims. The CMS BSA Carrier Line...

  16. Development of the Oxford Participation and Activities Questionnaire: constructing an item pool

    Directory of Open Access Journals (Sweden)

    Kelly L

    2015-05-01

    Laura Kelly, Crispin Jenkinson, Sarah Dummett, Jill Dawson, Ray Fitzpatrick, David Morley. Health Services Research Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK. Purpose: The Oxford Participation and Activities Questionnaire is a patient-reported outcome measure in development that is grounded on the World Health Organization International Classification of Functioning, Disability, and Health (ICF). The study reported here aimed to inform and generate an item pool for the new measure, which is specifically designed for the assessment of participation and activity in patients experiencing a range of health conditions. Methods: Items were informed through in-depth interviews conducted with 37 participants spanning a range of conditions. Interviews aimed to identify how their condition impacted their ability to participate in meaningful activities. Conditions included arthritis, cancer, chronic back pain, diabetes, motor neuron disease, multiple sclerosis, Parkinson's disease, and spinal cord injury. Transcripts were analyzed using the framework method. Statements relating to ICF themes were recast as questionnaire items and shown for review to an expert panel. Cognitive debrief interviews (n=13) were used to assess items for face and content validity. Results: ICF themes relevant to activities and participation in everyday life were explored, and a total of 222 items formed the initial item pool. This item pool was refined by the research team and 28 generic items were mapped onto all nine chapters of the ICF construct, detailing activity and participation. Cognitive interviewing confirmed the questionnaire instructions, items, and response options were acceptable to participants. Conclusion: Using a clear conceptual basis to inform item generation, 28 items have been identified as suitable to undergo further psychometric testing. A large-scale postal survey will follow in order to refine the instrument further and

  17. Developing a Model for Optimizing Inventory of Repairable Items at Single Operating Base

    OpenAIRE

    Le, Tin

    2016-01-01

    The EOQ model is widely used in inventory management. However, EOQ models have many disadvantages, especially when applied to manage repairable items. To deal with high-cost, repairable items, Craig C. Sherbrooke introduced a model in his book “Optimal Inventory Modeling of Systems: Multi-Echelon Techniques”. The focus of this research is to implement and develop a program that executes the single-site inventory model for repairable items. The model helps to significantl...
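
    For background on the baseline that this abstract contrasts with Sherbrooke's approach, the following is a minimal sketch of the classic economic order quantity formula, Q* = sqrt(2DS/H). It is not the program developed in the thesis and does not implement the repairable-item model; the function and parameter names are assumptions chosen for illustration.

```python
import math


def economic_order_quantity(annual_demand, order_cost, holding_cost_per_unit):
    """Classic EOQ: Q* = sqrt(2 * D * S / H).

    annual_demand          D, units demanded per year (hypothetical name)
    order_cost             S, fixed cost per order placed
    holding_cost_per_unit  H, cost of holding one unit for one year
    """
    if annual_demand < 0 or order_cost < 0:
        raise ValueError("demand and order cost must be non-negative")
    if holding_cost_per_unit <= 0:
        raise ValueError("holding cost must be positive")
    return math.sqrt(2.0 * annual_demand * order_cost / holding_cost_per_unit)


if __name__ == "__main__":
    # Illustrative numbers only; they do not come from the abstract.
    print(economic_order_quantity(1200, 100, 6))
```

    The formula assumes steady demand and items that are consumed rather than repaired and returned to stock, which is one reason it fits repairable items poorly.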

  18. Cognitive interviewing methodology in the development of a pediatric item bank: a patient reported outcomes measurement information system (PROMIS) study

    Directory of Open Access Journals (Sweden)

    DeWalt Darren A

    2009-01-01

    Background: The evaluation of patient-reported outcomes (PROs) in health care has seen greater use in recent years, and methods to improve the reliability and validity of PRO instruments are advancing. This paper discusses the cognitive interviewing procedures employed by the Patient Reported Outcomes Measurement Information System (PROMIS) pediatrics group for the purpose of developing a dynamic, electronic item bank for field testing with children and adolescents using novel computer technology. The primary objective of this study was to conduct cognitive interviews with children and adolescents to gain feedback on items measuring physical functioning, emotional health, social health, fatigue, pain, and asthma-specific symptoms. Methods: A total of 88 cognitive interviews were conducted with 77 children and adolescents across two sites on 318 items. From this initial item bank, 25 items were deleted and 35 were revised and underwent a second round of cognitive interviews. A total of 293 items were retained for field testing. Results: Children as young as 8 years of age were able to comprehend the majority of items, response options, directions, and the recall period, and to identify problems with language that was difficult for them to understand. Cognitive interviews indicated issues with item comprehension on several items, which led to alternative wording for these items. Conclusion: Children ages 8–17 years were able to comprehend most item stems and response options in the present study. Field testing with the resulting items and response options is presently being conducted as part of the PROMIS Pediatric Item Bank development process.

  19. Techniques for reducing error in the calorimetric measurement of low wattage items

    Energy Technology Data Exchange (ETDEWEB)

    Sedlacek, W.A.; Hildner, S.S.; Camp, K.L.; Cremers, T.L.

    1993-08-01

    The increased need for the measurement of low wattage items with production calorimeters has required the development of techniques to maximize the precision and accuracy of the calorimeter measurements. An error model for calorimetry measurements is presented. This model is used as a basis for optimizing calorimetry measurements through baseline interpolation. The method was applied to the heat measurement of over 100 items and the results compared to chemistry assay and mass spectroscopy.

  20. An analysis of the differential item function through Mantel-Haenszel, SIBTEST and Logistic Regression Methods

    Directory of Open Access Journals (Sweden)

    Süleyman Demir

    2014-04-01

    This study performs a Differential Item Functioning (DIF) analysis in terms of gender and culture on the items in the PISA 2009 mathematics literacy sub-test. The DIF analyses were done through the Mantel-Haenszel, Logistic Regression and SIBTEST methods. The data for the gender variable were collected from the responses given by 332 students to the items in the mathematics literacy sub-test during the administration of the 5th booklet in the PISA 2009 application, whereas the data for the culture variable were collected through the application of the 5th booklet in Turkey, Germany, Finland and the United States in the PISA 2009 application. The DIF analysis according to gender found 4 items favoring male students and only one item favoring female students. The DIF analysis according to culture identified 16 items showing DIF between Turkish and German students, 14 items between Turkish and Finnish students, and 18 items between Turkish and United States students.
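
    As an illustration of the first of the three methods named above, the following is a minimal sketch of the Mantel-Haenszel common odds ratio for one item, computed over score strata, together with the ETS delta transformation (-2.35 times the natural log of the odds ratio). The data layout and function names are assumptions; the study's own software and settings are not described in the abstract.

```python
import math


def mantel_haenszel_odds_ratio(strata):
    """Common odds ratio across matched ability strata for a single item.

    strata: iterable of (a, b, c, d) counts per total-score level, where
      a = reference group correct,  b = reference group incorrect,
      c = focal group correct,      d = focal group incorrect.
    A value near 1.0 suggests no uniform DIF on this item.
    """
    numerator = 0.0
    denominator = 0.0
    for a, b, c, d in strata:
        n = a + b + c + d
        if n == 0:
            continue  # empty stratum contributes nothing
        numerator += a * d / n
        denominator += b * c / n
    if denominator == 0:
        raise ValueError("odds ratio undefined: no discordant counts")
    return numerator / denominator


def mh_delta(odds_ratio):
    """ETS delta scale: negative values indicate the item favors the reference group."""
    return -2.35 * math.log(odds_ratio)


if __name__ == "__main__":
    # Hypothetical counts for two score strata; not data from the study.
    example = [(30, 10, 20, 20), (40, 5, 35, 10)]
    alpha = mantel_haenszel_odds_ratio(example)
    print(alpha, mh_delta(alpha))
```

    Matching on total score is what separates DIF from a plain difference in group means: examinees are compared only with others at the same score level.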

  1. The Role of Different Cognitive Components in the Prediction of the Figural Reasoning Test's Item Difficulty

    Institute of Scientific and Technical Information of China (English)

    李中权; 王力; 张厚粲; 周仁来

    2011-01-01

    Figural reasoning tests (as represented by Raven's tests) are widely applied as effective measures of fluid intelligence in recruitment and personnel selection. However, several studies have revealed that those tests are no longer appropriate because of high item exposure rates. Computerized automatic item generation (AIG) has gradually been recognized as a promising technique for handling item exposure. Understanding sources of item variation constitutes the initial stage of computerized AIG, that is, searching for the underlying processing components and the stimuli that significantly influence those components. Some studies have explored sources of item variation, but so far there are no consistent results. This study investigated the relation between item difficulties and stimulus factors (e.g., familiarity of figures, abstraction of attributes, perceptual organization, and memory load) and determined the relative importance of those factors in predicting item difficulties. Eight sets of figural reasoning tests (each set containing 14 items imitating items from Raven's Advanced Progressive Matrices, APM) were constructed by manipulating the familiarity of figures, the degree of abstraction of attributes, the perceptual organization, and the types and number of rules. Using an anchor-test design, these tests were administered via the internet to 6323 participants, with 10 items drawn from the APM as anchor items; thus, each participant completed 14 items from one set and 10 anchor items within half an hour. To prevent participants from using a response elimination strategy, we presented the item stem first, then the alternatives in turn, and asked participants to determine which alternative was the best. DIMTEST analyses were conducted on the participants' responses on each of the eight tests. Results showed that the items measure a single dimension on each test. A likelihood ratio test indicated that the data fit the two-parameter logistic model (2PL) best. Items were
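
    The two-parameter logistic (2PL) model mentioned above can be sketched as follows. This is a generic textbook form, not the authors' estimation code; the names theta (ability), a (discrimination) and b (difficulty) follow common IRT notation and are assumptions here.

```python
import math


def p_correct_2pl(theta, a, b):
    """2PL item response function: P(correct) = 1 / (1 + exp(-a * (theta - b)))."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def item_information_2pl(theta, a, b):
    """Fisher information of a 2PL item at ability theta: a^2 * P * (1 - P)."""
    p = p_correct_2pl(theta, a, b)
    return a * a * p * (1.0 - p)


if __name__ == "__main__":
    # Hypothetical item: discrimination 1.2, difficulty 0.5.
    for theta in (-2.0, 0.0, 0.5, 2.0):
        print(theta, round(p_correct_2pl(theta, 1.2, 0.5), 3))
```

    Setting a to the same value for every item reduces this to the one-parameter (Rasch-type) form, which is why a likelihood ratio test can compare the two nested models.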

  2. Gender differences in National Assessment of Educational Progress science items: What does "I don't know" really mean?

    Science.gov (United States)

    Linn, Marcia C.; de Benedictis, Tina; Delucchi, Kevin; Harris, Abigail; Stage, Elizabeth

    The National Assessment of Educational Progress Science Assessment has consistently revealed small gender differences on science content items but not on science inquiry items. This assessment differs from others in that respondents can choose I don't know rather than guessing. This paper examines explanations for the gender differences including (a) differential prior instruction, (b) differential response to uncertainty and use of the I don't know response, (c) differential response to figurally presented items, and (d) different attitudes towards science. Of these possible explanations, the first two received support. Females are more likely to use the I don't know response, especially for items with physical science content or masculine themes such as football. To ameliorate this situation we need more effective science instruction and more gender-neutral assessment items.

  3. Evaluating the Mathematics Interest Inventory Using Item Response Theory: Differential Item Functioning across Gender and Ethnicities

    Science.gov (United States)

    Wei, Tianlan; Chesnut, Steven R.; Barnard-Brak, Lucy; Stevens, Tara; Olivárez, Arturo, Jr.

    2014-01-01

    As the United States has begun to lag behind other developed countries in performance on mathematics and science, researchers have sought to explain this with theories of teaching, knowledge, and motivation. We expand this examination by further analyzing a measure of interest that has been linked to student performance in mathematics and…

  4. Use of Robust z in Detecting Unstable Items in Item Response Theory Models

    Science.gov (United States)

    Huynh, Huynh; Meyer, Patrick

    2010-01-01

    The first part of this paper describes the use of the robust z[subscript R] statistic to link test forms using the Rasch (or one-parameter logistic) model. The procedure is then extended to the two-parameter and three-parameter logistic and two-parameter partial credit (2PPC) models. A real set of data was used to illustrate the extension. The…
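
    The abstract is truncated and does not give the formula, so the following is only a sketch of one common formulation of a robust z for item stability: each common item's parameter difference between two administrations is centered on the median and scaled by 0.74 times the interquartile range. The quartile method, the cutoff value and all names are assumptions for illustration.

```python
import statistics


def robust_z(differences):
    """Robust z per item: (d - median(d)) / (0.74 * IQR(d)).

    differences: per-item difference in a parameter estimate (for example
    Rasch difficulty) between two test administrations.
    0.74 * IQR approximates the standard deviation under normality.
    """
    if len(differences) < 4:
        raise ValueError("need at least four common items")
    median = statistics.median(differences)
    # Quartile convention is an assumption; other conventions shift the IQR slightly.
    q1, _, q3 = statistics.quantiles(differences, n=4, method="inclusive")
    iqr = q3 - q1
    if iqr == 0:
        raise ValueError("IQR is zero; robust z is undefined")
    return [(d - median) / (0.74 * iqr) for d in differences]


def flag_unstable(differences, cutoff=1.645):
    """Indices of items whose |robust z| exceeds an illustrative cutoff."""
    return [i for i, z in enumerate(robust_z(differences)) if abs(z) > cutoff]


if __name__ == "__main__":
    # Hypothetical difficulty shifts; the last item drifts strongly.
    shifts = [0.0, 0.1, -0.1, 0.05, -0.05, 2.0]
    print(flag_unstable(shifts))
```

    Using the median and IQR keeps a single drifting item from inflating the scale used to judge all the others.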

  5. Single-Item Measurement of Suicidal Behaviors: Validity and Consequences of Misclassification.

    Directory of Open Access Journals (Sweden)

    Alexander J Millner

    Suicide is a leading cause of death worldwide. Although research has made strides in better defining suicidal behaviors, there has been less focus on accurate measurement. Currently, the widespread use of self-report, single-item questions to assess suicide ideation, plans and attempts may contribute to measurement problems and misclassification. We examined the validity of single-item measurement and the potential for statistical errors. Over 1,500 participants completed an online survey containing single-item questions regarding a history of suicidal behaviors, followed by questions with more precise language, multiple response options and narrative responses to examine the validity of single-item questions. We also conducted simulations to test whether common statistical tests are robust against the degree of misclassification produced by the use of single items. We found that 11.3% of participants who endorsed a single-item suicide attempt measure engaged in behavior that would not meet the standard definition of a suicide attempt. Similarly, 8.8% of those who endorsed a single-item measure of suicide ideation endorsed thoughts that would not meet standard definitions of suicide ideation. Statistical simulations revealed that this level of misclassification substantially decreases statistical power and increases the likelihood of false conclusions from statistical tests. Providing a wider range of response options for each item reduced the misclassification rate by approximately half. Overall, the use of single-item, self-report questions to assess the presence of suicidal behaviors leads to misclassification, increasing the likelihood of statistical decision errors. Improving the measurement of suicidal behaviors is critical to increase understanding and prevention of suicide.

  6. An Application of Reverse Engineering to Automatic Item Generation: A Proof of Concept Using Automatically Generated Figures

    Science.gov (United States)

    Lorié, William A.

    2013-01-01

    A reverse engineering approach to automatic item generation (AIG) was applied to a figure-based publicly released test item from the Organisation for Economic Cooperation and Development (OECD) Programme for International Student Assessment (PISA) mathematical literacy cognitive instrument as part of a proof of concept. The author created an item…

  7. Structural changes in single membranes in response to an applied transmembrane electric potential revealed by time-resolved neutron/X-ray interferometry

    International Nuclear Information System (INIS)

    Highlights: ► Time-resolved (or transient) neutron/X-ray reflectivity. ► Neutron/X-ray reflectivity enhanced by interferometric techniques. ► Electric potential induced changes in a hybrid lipid bilayer membrane. ► Electric potential induced changes in a voltage-sensor protein membrane. - Abstract: The profile structure of a hybrid lipid bilayer, tethered to the surface of an inorganic substrate and fully hydrated with a bulk aqueous medium in an electrochemical cell, was investigated as a function of the applied transbilayer electric potential via time-resolved neutron reflectivity, enhanced by interferometry. Significant, and fully reversible structural changes were observed in the distal half (with respect to the substrate surface) of the hybrid bilayer comprised of a zwitterionic phospholipid in response to a +100 mV potential with respect to 0 mV. These arise presumably due to reorientation of the electric dipole present in the polar headgroup of the phospholipid and its resulting effect on the thickness of the phospholipid’s hydrocarbon chain layer within the hybrid bilayer’s profile structure. The profile structure of the voltage-sensor domain from a voltage-gated ion channel protein within a phospholipid bilayer membrane, tethered to the surface of an inorganic substrate and fully hydrated with a bulk aqueous medium in an electrochemical cell, was also investigated as a function of the applied transmembrane electric potential via time-resolved X-ray reflectivity, enhanced by interferometry. Significant, fully-reversible, and different structural changes in the protein were detected in response to ±100 mV potentials with respect to 0 mV. The approach employed is that typical of transient spectroscopy, shown here to be applicable to both neutron and X-ray reflectivity of thin films

  8. Gender differential item functioning on a national field-specific test: The case of PhD entrance exam of TEFL in Iran

    OpenAIRE

    Alireza Ahmadi; Ali Darabi Bazvand

    2016-01-01

    Differential Item Functioning (DIF) exists when examinees of equal ability from different groups have different probabilities of successful performance in a certain item. This study examined gender differential item functioning across the PhD Entrance Exam of TEFL (PEET) in Iran, using both logistic regression (LR) and one-parameter item response theory (1-p IRT) models. The PEET is a national test consisting of a centralized written examination designed to provide information on the eligibil...

  9. Cultural Consensus Theory: Aggregating Continuous Responses in a Finite Interval

    Science.gov (United States)

    Batchelder, William H.; Strashny, Alex; Romney, A. Kimball

    Cultural consensus theory (CCT) consists of cognitive models for aggregating responses of "informants" to test items about some domain of their shared cultural knowledge. This paper develops a CCT model for items requiring bounded numerical responses, e.g. probability estimates, confidence judgments, or similarity judgments. The model assumes that each item generates a latent random representation in each informant, with mean equal to the consensus answer and variance depending jointly on the informant and the location of the consensus answer. The manifest responses may reflect biases of the informants. Markov Chain Monte Carlo (MCMC) methods were used to estimate the model, and simulation studies validated the approach. The model was applied to an existing cross-cultural dataset involving native Japanese and English speakers judging the similarity of emotion terms. The results sharpened earlier studies that showed that both cultures appear to have very similar cognitive representations of emotion terms.

  10. [XS-DIF: program for analysis of Differential Item Functioning in Excel].

    Science.gov (United States)

    Ordóñez, Xavier G; Romero, Sonia J

    2007-02-01

    XS-DIF is a program for detection of Differential Item Functioning (DIF) using Item Response Theory (IRT). It calculates Lord's Chi-Square, Raju's Signed Area and Unsigned Area, and Kim and Cohen's Closed-interval signed area and Closed-interval unsigned area. XS-DIF was designed to be executed in Excel 2000 and can analyze up to 100 items. It is useful to support data analysis of research projects and in detection and teaching processes in DIF.

  11. A sampling and classification item selection approach with content balancing.

    Science.gov (United States)

    Chen, Pei-Hua

    2015-03-01

    Existing automated test assembly methods typically employ constrained combinatorial optimization. Constructing forms sequentially based on an optimization approach usually results in unparallel forms and requires heuristic modifications. Methods based on a random search approach have the major advantage of producing parallel forms sequentially without further adjustment. This study incorporated a flexible content-balancing element into the statistical perspective item selection method of the cell-only method (Chen et al. in Educational and Psychological Measurement, 72(6), 933-953, 2012). The new method was compared with a sequential interitem distance weighted deviation model (IID WDM) (Swanson & Stocking in Applied Psychological Measurement, 17(2), 151-166, 1993), a simultaneous IID WDM, and a big-shadow-test mixed integer programming (BST MIP) method to construct multiple parallel forms based on matching a reference form item-by-item. The results showed that the cell-only method with content balancing and the sequential and simultaneous versions of IID WDM yielded results comparable to those obtained using the BST MIP method. The cell-only method with content balancing is computationally less intensive than the sequential and simultaneous versions of IID WDM. PMID:24610145

  12. Dynamic characteristics of laser Doppler flowmetry signals obtained in response to a local and progressive pressure applied on diabetic and healthy subjects

    Science.gov (United States)

    Humeau, Anne; Koitka, Audrey; Abraham, Pierre; Saumet, Jean-Louis; L'Huillier, Jean-Pierre

    2004-09-01

    In the biomedical field, the laser Doppler flowmetry (LDF) technique is a non-invasive method to monitor skin perfusion. On the skin of healthy humans, LDF signals present a significant transient increase in response to a local and progressive pressure application. This vasodilatory reflex response may have important implications for cutaneous pathologies involved in various neurological diseases and in the pathophysiology of decubitus ulcers. The present work analyses the dynamic characteristics of these signals on young type 1 diabetic patients, and on healthy age-matched subjects. To obtain accurate dynamic characteristic values, a de-noising wavelet-based algorithm is first applied to LDF signals. All the de-noised signals are then normalised to the same value. The blood flow peak and the time to reach this peak are then calculated on each computed signal. The results show that a large vasodilation is present on signals of healthy subjects. The mean peak occurs at a pressure of 3.2 kPa approximately. However, a vasodilation of limited amplitude appears on type 1 diabetic patients. The maximum value is visualised, on the average, when the pressure is 1.1 kPa. The inability for diabetic patients to increase largely their cutaneous blood flow may bring explanations to foot ulcers.

  13. The effect of Trier Social Stress Test (TSST) on item and associative recognition of words and pictures in healthy participants

    Directory of Open Access Journals (Sweden)

    Jonathan Guez

    2016-04-01

    Psychological stress, induced by the Trier Social Stress Test (TSST), has repeatedly been shown to alter memory performance. Although factors influencing memory performance such as stimulus nature (verbal/pictorial) and emotional valence have been extensively studied, results on whether stress impairs or improves memory are still inconsistent. This study aimed at exploring the effect of TSST on item versus associative memory for neutral, verbal, and pictorial stimuli. 48 healthy subjects were recruited; 24 participants were randomly assigned to the TSST group and the remaining 24 participants were assigned to the control group. Stress reactivity was measured by psychological (subjective state anxiety ratings) and physiological (Galvanic skin response recording) measurements. Subjects performed an item-association memory task for both stimulus types (words, pictures) simultaneously, before and after the stress/non-stress manipulation. The results showed that memory recognition for pictorial stimuli was higher than for verbal stimuli. Memory for both words and pictures was impaired following TSST; while the source for this impairment was specific to associative recognition in pictures, a more general deficit was observed for verbal material, as expressed in decreased recognition for both items and associations following TSST. Response latency analysis indicated that the TSST manipulation decreased response time but at the cost of memory accuracy. We conclude that stress does not uniformly affect memory; rather it interacts with the task’s cognitive load and stimulus type. Applying the current study results to patients diagnosed with disorders associated with traumatic stress, our findings in healthy subjects under acute stress provide further support for our assertion that patients’ impaired memory originates in poor recollection processing following depletion of attentional resources.

  14. Performance-based evaluation of commercial-grade items

    International Nuclear Information System (INIS)

    For the last decade, regulatory expectations for the procurement process for nuclear safety-related commercial-grade items (CGIs) have increased. These changes are driven by US Nuclear Regulatory Commission (NRC) concern for fraudulent or misrepresented parts and significant marketplace changes. The industry responded to these concerns by developing improved procurement programs that changed the detail to which parts were specified and received and provided for verification of attributes that were critical to successful performance of safety-related function(s). Like its counterparts, Duquesne Light Company (DLCo) began applying enhanced program requirements to procurements initiated after September 1, 1989, in order to meet the industry's January 1, 1990, commitment

  15. Software for MUF evaluating in item nuclear material accounting

    International Nuclear Information System (INIS)

    Nuclear material accounting is a key measure for nuclear safeguards. This paper presents software for MUF evaluation in item nuclear material accounting. It is composed of several modules, including an input module, a data processing module, a data inquiry module, a data printing module and a system settings module. It can be used to check the variance of the measurement and to estimate the confidence interval according to the MUF value. To ensure data security, a multi-user management function was implemented in the software. (authors)
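
    The abstract does not describe the software's calculations, so the following is only a sketch of the standard material balance behind a MUF (material unaccounted for) value, with a confidence interval under an assumed normal approximation. The function names, the z value of 1.96 and the sample figures are all assumptions for illustration.

```python
def material_unaccounted_for(beginning_inventory, receipts, shipments, ending_inventory):
    """Material balance: MUF = beginning inventory + receipts - shipments - ending inventory."""
    return beginning_inventory + receipts - shipments - ending_inventory


def muf_confidence_interval(muf_value, sigma_muf, z=1.96):
    """Two-sided interval MUF +/- z * sigma, assuming normally distributed measurement error.

    sigma_muf is the standard deviation of the MUF obtained by propagating
    measurement variances; how that is done is outside this sketch.
    """
    if sigma_muf < 0:
        raise ValueError("sigma must be non-negative")
    half_width = z * sigma_muf
    return (muf_value - half_width, muf_value + half_width)


if __name__ == "__main__":
    # Hypothetical quantities in kilograms; not values from the paper.
    muf = material_unaccounted_for(100.0, 20.0, 15.0, 104.0)
    low, high = muf_confidence_interval(muf, 0.5)
    print(muf, low, high)
    # If the interval contains zero, the MUF is consistent with measurement error alone.
    print(low <= 0.0 <= high)
```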

  16. Clinical relevance of single item quality of life indicators in cancer clinical trials

    OpenAIRE

    Bernhard, J.; Sullivan, M.; Hürny, C; Coates, A S; Rudenstam, C-M

    2001-01-01

    We investigated the hypothesis that global single-item quality-of-life indicators are less precise for specific treatment effects (discriminant validity) than multi-item scales but similarly efficient for overall treatment comparisons and changes over time (responsiveness) because they reflect the summation of the individual meaning and importance of various factors. Linear analogue self-assessment (LASA) indicators for physical well-being, mood and coping were compared with the Hospital Anxi...

  17. Developmental growth in students' concept of energy: Analysis of selected items from the TIMSS database (Volume 42, Issue 5, May 2005)

    Science.gov (United States)

    Liu, Xiufeng; McKeough, Anne

    2005-05-01

    The aim of this study was to develop a model of students' energy concept development. Applying Case's (1985, 1992) structural theory of cognitive development, we hypothesized that students' concept of energy undergoes a series of transitions, corresponding to systematic increases in working memory capacity. The US national sample from the Third International Mathematics and Science Study (TIMSS) database was used to test our hypothesis. Items relevant to the energy concept in the TIMSS test booklets for three populations were identified. Item difficulty from Rasch modeling was used to test the hypothesized developmental sequence, and percentage of students' correct responses was used to test the correspondence between students' age/grade level and level of the energy concepts. The analysis supported our hypothesized sequence of energy concept development and suggested mixed effects of maturation and schooling on energy concept development. Further, the results suggest that curriculum and instruction design take into consideration the developmental progression of students' concept of energy.
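
    To illustrate how Rasch item difficulties can be compared against a hypothesized developmental sequence, here is a minimal sketch. It uses the standard Rasch formula for the probability of a correct response; the grouping of items into levels, the difficulty values, and the function names are hypothetical and are not taken from the study.

```python
import math


def rasch_probability(ability, difficulty):
    """Rasch model: probability of a correct response for a person with the
    given ability on an item with the given difficulty (both in logits)."""
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))


def follows_hypothesized_order(difficulties_by_level):
    """True if the mean item difficulty strictly increases from each
    hypothesized developmental level to the next."""
    means = [sum(level) / len(level) for level in difficulties_by_level]
    return all(lower < higher for lower, higher in zip(means, means[1:]))


# Hypothetical item difficulties (logits), grouped by hypothesized level.
levels = [
    [-1.2, -0.8],  # level 1 items
    [0.1, 0.3],    # level 2 items
    [1.0, 1.4],    # level 3 items
]
print(follows_hypothesized_order(levels))
print(round(rasch_probability(0.0, levels[2][0]), 3))
```

    Comparing level means is only one simple check; the study may have used a different criterion to test the sequence.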

  18. Ventajas de los Modelos Politómicos de Teoría de Respuesta al ítem en la Medición de Actitudes Sociales: El Análisis de un Caso / Advantages of Polytomous Models of Item Response Theory in Measuring Social Attitudes: A Case Study

    Directory of Open Access Journals (Sweden)

    Rodrigo Asún

    2008-11-01

    Despite its advantages in psychometric research, Item Response Theory (IRT) has not been regularly used in the measurement of psychological constructs or social attitudes. This study used a scale of intolerance to demonstrate the usefulness of working with polytomous IRT models, compared with Classical Test Theory, for studying attitudes. It concludes that the advantages of polytomous IRT models lie not in how people are scaled or in the estimation of item parameters, but in the richer information they provide about the psychometric properties of an instrument, information that can be used to improve the instrument.
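
    The abstract does not say which polytomous model was fitted. As an illustration only, the sketch below computes category probabilities under Samejima's graded response model, one commonly used polytomous IRT model for ordered attitude items. The parameter values and the function name are hypothetical.

```python
import math


def grm_category_probabilities(ability, discrimination, thresholds):
    """Graded response model: probability of each ordered response category.

    thresholds must be in increasing order; K - 1 thresholds give K categories.
    """
    # Cumulative probabilities P(X >= k): 1 for the lowest category,
    # a logistic curve at each threshold, and 0 beyond the highest category.
    cumulative = [1.0]
    for b in thresholds:
        cumulative.append(1.0 / (1.0 + math.exp(-discrimination * (ability - b))))
    cumulative.append(0.0)
    # Each category probability is the difference of adjacent cumulative values.
    return [cumulative[k] - cumulative[k + 1] for k in range(len(thresholds) + 1)]


# Hypothetical four-category attitude item (e.g. strongly disagree .. strongly agree).
probs = grm_category_probabilities(ability=0.5, discrimination=1.5,
                                   thresholds=[-1.0, 0.0, 1.0])
print([round(p, 3) for p in probs])
```

    Plotting these probabilities across a range of ability values gives the category response curves, one source of the item-level diagnostic information the abstract refers to.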

  19. Technical evaluation process for specifying replacement items

    International Nuclear Information System (INIS)

    As nuclear power plants continue in the transition to a strictly operating and maintenance mode, the engineering involvement required to specify and procure replacement parts and components continues to increase. The evaluation process for a replacement item has one simple goal: to specify an effective set of technical and quality requirements that will result in the procurement of an item meeting the plant design basis. The evaluation process is depicted in a simplified block diagram showing the elements of engineering evaluation from a functional perspective. These elements are the following: (1) need for a technical evaluation; (2) component and part classification; (3) failure mode and effect analysis (FMEA); (4) critical characteristics for design determination; (5) like-for-like or alternate item evaluation; and (6) technical and quality requirements determination.

  20. Are Inferential Reading Items More Susceptible to Cultural Bias than Literal Reading Items?

    Science.gov (United States)

    Banks, Kathleen

    2012-01-01

    The purpose of this article is to illustrate a seven-step process for determining whether inferential reading items were more susceptible to cultural bias than literal reading items. The seven-step process was demonstrated using multiple-choice data from the reading portion of a reading/language arts test for fifth and seventh grade Hispanic,…