WorldWideScience

Sample records for nonreversed ces-d items

  1. Validating a shortened depression scale (10 item CES-D among HIV-positive people in British Columbia, Canada.

    Directory of Open Access Journals (Sweden)

    Wendy Zhang

    Full Text Available OBJECTIVE: To establish the reliability and validity of a shortened (10-item depression scale used among HIV-positive patients enrolled in the Drug Treatment Program in British Columbia, Canada. METHODS: The 10-item CES-D (Center for Epidemiologic Studies Depression Scale was examined among 563 participants who initiated antiretroviral therapy (ART between August 1, 1996 and June 30, 2002. Internal consistency of the scale was measured by Cronbach's alpha. Using the original CES-D 20 as primary criteria, comparisons were made using the Kappa statistic. Predictive accuracy of CES-D 10 was assessed by calculating sensitivity, specificity, positive predictive values and negative predictive values. Factor analysis was also performed to determine if the CES-D 10 contained the same factors of positive and negative affect found in the original development of the CES-D. RESULTS: The correlation between the original and the shortened scale is very high (Spearman correlation coefficient  =0.97 (P<0.001. Internal consistency reliability coefficients of the CES-D 10 were satisfactory (Cronbach α=0.88. The CES-D 10 showed comparable accuracy to the original CES-D 20 in classifying participants with depressive symptoms (Kappa=0.82, P<0.001. Sensitivity of CES-D 10 was 91%; specificity was 92%; and positive predictive value was 92%. Factor analysis demonstrates that CES-D 10 contains the same underlying factors of positive and negative affect found in the original development of the CES-D 20. CONCLUSION: The 10-item CES-D is a comparable tool to measure depressive symptoms among HIV-positive research participants.

  2. Test of item-response bias in the CES-D scale. experience from the New Haven EPESE study.

    Science.gov (United States)

    Cole, S R; Kawachi, I; Maller, S J; Berkman, L F

    2000-03-01

    We present results of item-response bias analyses of the exogenous variables age, gender, and race for all items from the Center for Epidemiologic Studies Depression (CES-D) scale using data (N = 2340) from the New Haven component of the Established Populations for Epidemiologic Studies of the Elderly (EPESE). The proportional odds of blacks responding higher on the CES-D items "people are unfriendly" and "people dislike me" were 2.29 (95% confidence interval: 1.74, 3.02) and 2.96 (95% confidence interval: 2.15, 4.07) times that of whites matched on overall depressive symptoms, respectively. In addition, the proportional odds of women responding higher on the CES-D item "crying spells" were 2.14 (95% confidence interval: 1.60, 2.82) times that of men matched on overall depressive symptoms. Our data indicate the CES-D would have greater validity among this diverse group of older men and women after removal of the crying item and two interpersonal items.

  3. Curtailment and Stochastic Curtailment to Shorten the CES-D

    Science.gov (United States)

    Finkelman, Matthew D.; Smits, Niels; Kim, Wonsuk; Riley, Barth

    2012-01-01

    The Center for Epidemiologic Studies-Depression (CES-D) scale is a well-known self-report instrument that is used to measure depressive symptomatology. Respondents who take the full-length version of the CES-D are administered a total of 20 items. This article investigates the use of curtailment and stochastic curtailment (SC), two sequential…

  4. The CES-D as a Measure of Psychological Distress Among International Students: Measurement and Structural Invariance Across Gender.

    Science.gov (United States)

    Suh, Hanna; van Nuenen, Marieke; Rice, Kenneth G

    2017-10-01

    Detecting psychological distress among international students can be challenging given diverse languages, cultural backgrounds, and lack of refined measurement properties of measures tailored to international students. Despite the challenges, ensuring that a psychological distress measure works effectively has considerable potential value for assessment purposes. The current study evaluates the measurement properties of a short 10-item version of Radloff's Center for Epidemiologic Studies Depression Scale (CES-D). Grounded in long-standing evidence on gender differences in depressive symptoms, specific attention was given to examining measurement invariance of the CES-D Short-form across women and men. Based on a large, two-cohort sample of international students ( N = 468), and through multiple analyses evaluating factor structure and measurement invariance, we derived an even briefer, seven-item single-factor form of the CES-D (CES-D Short-form International) that can be used with international students.

  5. A longitudinal evaluation of the Center for Epidemiologic Studies-Depression scale (CES-D in a Rheumatoid Arthritis Population using Rasch Analysis

    Directory of Open Access Journals (Sweden)

    Tennant Alan

    2007-07-01

    Full Text Available Abstract Background The aim of this study was to test the internal validity of the total Center for Epidemiologic Studies-Depression (CES-D scale using Rasch analysis in a rheumatoid arthritis (RA population. Methods CES-D was administered to 157 patients with RA over three time points within a 12 month period. Rasch analysis was applied using RUMM2020 software to assess the overall fit of the model, the response scale used, individual item fit, differential item functioning (DIF and person separation. Results Pooled data across three time points was shown to fit the Rasch model with removal of seven items from the original 20-item CES-D scale. It was necessary to rescore the response format from four to three categories in order to improve the scale's fit. Two items demonstrated some DIF for age and gender but were retained within the 13-item CES-D scale. A new cut point for depression score of 9 was found to correspond to the original cut point score of 16 in the full CES-D scale. Conclusion This Rasch analysis of the CES-D in a longstanding RA cohort resulted in the construction of a modified 13-item scale with good internal validity. Further validation of the modified scale is recommended particularly in relation to the new cut point for depression.

  6. An assessment of the measurement equivalence of English and French versions of the Center for Epidemiologic Studies Depression (CES-D Scale in systemic sclerosis.

    Directory of Open Access Journals (Sweden)

    Vanessa C Delisle

    Full Text Available Center for Epidemiologic Studies Depression (CES-D Scale scores in English- and French-speaking Canadian systemic sclerosis (SSc patients are commonly pooled in analyses, but no studies have evaluated the metric equivalence of the English and French CES-D. The study objective was to examine the metric equivalence of the CES-D in English- and French-speaking SSc patients.The CES-D was completed by 1007 English-speaking and 248 French-speaking patients from the Canadian Scleroderma Research Group Registry. Confirmatory factor analysis (CFA was used to assess the factor structure in both samples. The Multiple-Indicator Multiple-Cause (MIMIC model was utilized to assess differential item functioning (DIF.A two-factor model (Positive and Negative affect showed excellent fit in both samples. Statistically significant, but small-magnitude, DIF was found for 3 of 20 CES-D items, including items 3 (Blues, 10 (Fearful, and 11 (Sleep. Prior to accounting for DIF, French-speaking patients had 0.08 of a standard deviation (SD lower latent scores for the Positive factor (95% confidence interval [CI]-0.25 to 0.08 and 0.09 SD higher scores (95% CI-0.07 to 0.24 for the Negative factor than English-speaking patients. After DIF correction, there was no change on the Positive factor and a non-significant increase of 0.04 SD on the Negative factor for French-speaking patients (difference = 0.13 SD, 95% CI-0.03 to 0.28.The English and French versions of the CES-D, despite minor DIF on several items, are substantively equivalent and can be used in studies that combine data from English- and French-speaking Canadian SSc patients.

  7. Factorial and Discriminant Validity of the Center for Epidemiological Studies Depression (CES-D) Scale.

    Science.gov (United States)

    Orme, John G.; And Others

    1986-01-01

    Examined the factorial and discriminant validity of the Center for Epidemiological Studies Depression (CES-D) scale for 116 parents participating in family support programs. Factorial validity was adequate, and results indicated a moderate correlation between the CES-D and self-esteem and state anxiety. However, a high correlation was obtained…

  8. Correlations between the BDI and CES-D in a Sample of Adolescent Mothers.

    Science.gov (United States)

    Wilcox, Holly; Field, Tiffany; Prodromidis, Margarita; Scafidi, Frank

    1998-01-01

    The adequacy of the Beck Depression Inventory (BDI) and Center for Epidemiological Studies-Depression (CES-D) as screening instruments for adolescent depression is examined. Both are correlated with the Diagnostic Interview Schedule for Children, a clinical measure. BDI correlates more highly with Major Depression subscale, CES-D to Dysthymia…

  9. Validation of the turkish version of the centre for epidemiologic studies depression scale (ces-d in patients with type 2 diabetes mellitus

    Directory of Open Access Journals (Sweden)

    Karşıdağ Kubilay

    2011-07-01

    Full Text Available Abstract Background Depression is a common co-morbid health problem in patients with diabetes that is underrecognised. Current international guidelines recommend screening for depression in patients with diabetes. Yet, few depression screening instruments have been validated for use in this particular group of patients. Aim of the present study was to investigate the psychometric properties of the Turkish version of the Centre for Epidemiologic Studies Depression Scale (CES-D in patients with type 2 diabetes. Methods A sample of 151 Turkish outpatients with type 2 diabetes completed the CES-D, the World Health Organization-Five Well-Being Index (WHO-5, and the Problem Areas in Diabetes scale (PAID. Explanatory factor analyses, various correlations and Cronbach's alpha were investigated to test the validity and reliability of the CES-D in Turkish diabetes outpatients. Results The original four-factor structure proposed by Radloff was not confirmed. Explanatory factor analyses revealed a two-factor structure representing two subscales: (1 depressed mood combined with somatic symptoms of depression and (2 positive affect. However, one item showed insufficient factor loadings. Cronbach's alpha of the total score was high (0.88, as were split-half coefficients (0.77-0.90. The correlation of the CES-D with the WHO-5 was the strongest (r = -0.70, and supported concurrent validity. Conclusion The CES-D appears to be a valid measure for the assessment of depression in Turkish diabetes patients. Future studies should investigate its sensitivity and specificity as well as test-retest reliability.

  10. 10项流调中心抑郁自评量表在中国中老人群中的信效度%Reliability and Validity of 10-item CES-D among Middle Aged and Older Adults in China

    Institute of Scientific and Technical Information of China (English)

    黄庆波; 王晓华; 陈功

    2015-01-01

    目的:分析10项CES-D问卷在中老年人群中的信效度,以及该量表在中年和老年群体中的信效度差异。方法:使用中国健康与养老追踪调查2011年数据,样本人群年龄45岁以上,共14931人;首先,运用区分度分析量表各条目对不同水平被试的区分能力;其次,分别对整个人群、老年人群和中年人群进行内部一致性检验,并在3个不同因子结构的假定下(单因子结构;二因子结构:积极因子和消极因子;三因子结构:积极因子、消极因子和躯体因子)分别对3个人群进行验证性因子分析,并利用多组验证性因子分析检验两因子结构模型在中年人群和老年人群中的恒等性。结果:区分度分析发现,在总分百分位数27%的最高组和最低组之间各项目得分具有显著的差异,在整个人群中的科隆巴赫系数为0.815,两因子结构对整个样本人群拟合较好;在中年人群和老年人群中,内部一致性系数分别为0.814、0.813;两因子结构模型在4个嵌套模型的对比检验中显示,△CFI、△RMSEA和△SRMR的差异均满足横等性最低临界值要求。结论:10项CES-D量表具有较高的信效度,两因子结构模型在中年人群和老年人群具有恒等性,量表具有良好的区分效度,能有效地测量中老年人群的抑郁水平。%Objective:To examine the reliability and validity of 10-item of Center for Epidemiologic Studies Depression Scale among older adults and test the scale invariance for middle aged and older a-dults.Methods:Data were derived from 2011 wave of China Health and Retirement Longitudinal Study and 14931 individuals with 45 or more years old.First,item discrimination analysis was used to test the discriminative ability among different people with various level;Second,among the entire sample,older a-dults and middle aged adults,internal consistency analysis was conducted to test

  11. Use of the Center for Epidemiologic Studies Depression (CES-D) Scale in Korea.

    Science.gov (United States)

    Cho, M J; Kim, K H

    1998-05-01

    We translated the Center for Epidemiologic Studies Depression (CES-D) Scale into Korean, paying careful attention to culturally different modes of expression of depressive feelings and thoughts. The final translated version (CES-D-K) was administered to 164 psychiatric patients and 464 normal subjects residing in the community. Reliability, validity, and the optimal cutoff point of this scale were estimated, including ROC analysis. The CES-D-K was reliable and valid for the Korean population. Two optimal cutoff points were suggested: 24/25, the point which best corresponded to the clinical diagnosis of depression, and 20/21, which most effectively detects and covers depressive symptoms during screening. The higher cutoff points than those in Western countries may be due to different ways of expression affect, especially the suppression of positive affect, in cultures based on Confucian ethics.

  12. Measurement equivalence of the CES-D 8 depression-scale among the ageing population in eleven European countries.

    Science.gov (United States)

    Missinne, Sarah; Vandeviver, Christophe; Van de Velde, Sarah; Bracke, Piet

    2014-07-01

    Depression is one of the most prevalent mental disorders in later life. However, despite considerable research attention, great confusion remains regarding the association between ageing and depression. There is doubt as to whether a depression scale performs identically for different age groups and countries. Although measurement equivalence is a crucial prerequisite for valid comparisons across age groups and countries, it has not been established for the eight-item version of the Centre for Epidemiological Studies Depression Scale (CES-D8). Using multi-group confirmatory factor analysis, we assess configural, metric, and scalar measurement equivalence across two age groups (50-64 years of age and 65 or older) in eleven European countries, employing data from the Survey of Health, Ageing, and Retirement (SHARE). Results indicate that the construct of depression is comparable across age and country groups, allowing the substantive interpretation of correlates and mean levels of depressive symptoms.

  13. Cross-language measurement equivalence of the Center for Epidemiologic Studies Depression (CES-D scale in systemic sclerosis: a comparison of Canadian and Dutch patients.

    Directory of Open Access Journals (Sweden)

    Linda Kwakkenbos

    Full Text Available OBJECTIVES: Increasingly, medical research involves patients who complete outcomes in different languages. This occurs in countries with more than one common language, such as Canada (French/English or the United States (Spanish/English, as well as in international multi-centre collaborations, which are utilized frequently in rare diseases such as systemic sclerosis (SSc. In order to pool or compare outcomes, instruments should be measurement equivalent (invariant across cultural or linguistic groups. This study provides an example of how to assess cross-language measurement equivalence by comparing the Center for Epidemiologic Studies Depression (CES-D scale between English-speaking Canadian and Dutch SSc patients. METHODS: The CES-D was completed by 922 English-speaking Canadian and 213 Dutch SSc patients. Confirmatory factor analysis (CFA was used to assess the factor structure in both samples. The Multiple-Indicator Multiple-Cause (MIMIC model was utilized to assess the amount of differential item functioning (DIF. RESULTS: A two-factor model (positive and negative affect showed excellent fit in both samples. Statistically significant, but small-magnitude, DIF was found for 3 of 20 items on the CES-D. The English-speaking Canadian sample endorsed more feeling-related symptoms, whereas the Dutch sample endorsed more somatic/retarded activity symptoms. The overall estimate in depression scores between English and Dutch was not influenced substantively by DIF. CONCLUSIONS: CES-D scores from English-speaking Canadian and Dutch SSc patients can be compared and pooled without concern that measurement differences may substantively influence results. The importance of assessing cross-language measurement equivalence in rheumatology studies prior to pooling outcomes obtained in different languages should be emphasized.

  14. Factorial validity and invariance of the center for epidemiologic studies depression (CES-D) scale in a sample of black and white adolescent girls.

    Science.gov (United States)

    Hales, Derek P; Dishman, Rod K; Motl, Robert W; Addy, Cheryl L; Pfeiffer, Karin A; Pate, Russell R

    2006-01-01

    Meaningful comparison of depression symptoms requires that the measurement instrument has equivalent measurement properties among racial and ethnic groups. We tested the factorial validity and invariance of the Center for Epidemiologic Studies-Depression (CES-D) Scale among Black (n=610) and White (n=452) adolescent girls in the 12th grade. The invariance analyses were conducted by using LISREL 8.70 with maximum likelihood estimation and Satorra-Bentler scaled chi-square statistics and standard errors. The hypothesized second-order model (first-order factors: depressed affect, somatic and retarded activity, interpersonal, and positive affect; second-order factor: depression) demonstrated good overall fit in both groups. Comparison of nested models supported the between-group invariance of the overall factor structure, first- and second-order factor loadings, first-order factor variances, and the second-order factor variance. Item uniquenesses were not invariant. Our results support the hypothesis that a meaningful comparison of composite CES-D scores can be made between Black and White girls in the 12th grade.

  15. [Confirmatory factor analysis of the short French version of the Center for Epidemiological Studies of Depression Scale (CES-D10) in adolescents].

    Science.gov (United States)

    Cartierre, N; Coulon, N; Demerval, R

    2011-09-01

    Screening depressivity among adolescents is a key public health priority. In order to measure the severity of depressive symptomatology, a four-dimensional 20 items scale called "Center for Epidemiological Studies-Depression Scale" (CES-D) was developed. A shorter 10-item version was developed and validated (Andresen et al.). For this brief version, several authors supported a two-factor structure - Negative and Positive affect - but the relationship between the two reversed-worded items of the Positive affect factor could be better accounted for by correlated errors. The aim of this study is triple: firstly to test a French version of the CES-D10 among adolescents; secondly to test the relevance of a one-dimensional structure by considering error correlation for Positive affect items; finally to examine the extent to which this structural model is invariant across gender. The sample was composed of 269 French middle school adolescents (139 girls and 130 boys, mean age: 13.8, SD=0.65). Confirmatory Factorial Analyses (CFA) using the LISREL 8.52 were conducted in order to assess the adjustment to the data of three factor models: a one-factor model, a two-factor model (Positive and Negative affect) and a one-factor model with specification of correlated errors between the two reverse-worded items. Then, multigroup analysis was conducted to test the scale invariance for girls and boys. Internal consistency of the CES-D10 was satisfying for the adolescent sample (α=0.75). The best fitting model is the one-factor model with correlated errors between the two items of the previous Positive affect factor (χ(2)/dl=2.50; GFI=0.939; CFI=0.894; RMSEA=0.076). This model presented a better statistical fit to the data than the one-factor model without error correlation: χ(2)(diff) (1)=22.14, pfactor model with correlated errors was analyzed across separate samples of girls and boys. The model explains the data somewhat better for boys than for girls. The model's overall χ(2

  16. Lifting—A nonreversible Markov chain Monte Carlo algorithm

    Science.gov (United States)

    Vucelja, Marija

    2016-12-01

    Markov chain Monte Carlo algorithms are invaluable tools for exploring stationary properties of physical systems, especially in situations where direct sampling is unfeasible. Common implementations of Monte Carlo algorithms employ reversible Markov chains. Reversible chains obey detailed balance and thus ensure that the system will eventually relax to equilibrium, though detailed balance is not necessary for convergence to equilibrium. We review nonreversible Markov chains, which violate detailed balance and yet still relax to a given target stationary distribution. In particular cases, nonreversible Markov chains are substantially better at sampling than the conventional reversible Markov chains with up to a square root improvement in the convergence time to the steady state. One kind of nonreversible Markov chain is constructed from the reversible ones by enlarging the state space and by modifying and adding extra transition rates to create non-reversible moves. Because of the augmentation of the state space, such chains are often referred to as lifted Markov Chains. We illustrate the use of lifted Markov chains for efficient sampling on several examples. The examples include sampling on a ring, sampling on a torus, the Ising model on a complete graph, and the one-dimensional Ising model. We also provide a pseudocode implementation, review related work, and discuss the applicability of such methods.

  17. Factor Structure and Psychometric Properties of the Center for Epidemiologic Studies Depression Scale (CES-D) in Older Populations with and without Cognitive Impairment

    Science.gov (United States)

    Ros, L.; Latorre, J. M.; Aguilar, M. J.; Serrano, J. P.; Navarro, B.; Ricarte, J. J.

    2011-01-01

    The CES-D is widely used for the assessment of depressive symptoms in the adult population. However, few studies have been performed to assess the utility of this scale in an older population with cognitive impairment. The factor structure of the Spanish version of the CES-D was examined in an observational, cross sectional study in 623 older…

  18. Validation of the Turkish version of the Centre for Epidemiologic Studies Depression Scale (CES-D) in patients with type 2 diabetes mellitus

    DEFF Research Database (Denmark)

    Lehmann, Vicky; Makine, Ceylan; Karşıdağ, Cagatay

    2011-01-01

    group of patients. Aim of the present study was to investigate the psychometric properties of the Turkish version of the Centre for Epidemiologic Studies Depression Scale (CES-D) in patients with type 2 diabetes. METHODS: A sample of 151 Turkish outpatients with type 2 diabetes completed the CES......-D, the World Health Organization-Five Well-Being Index (WHO-5), and the Problem Areas in Diabetes scale (PAID). Explanatory factor analyses, various correlations and Cronbach's alpha were investigated to test the validity and reliability of the CES-D in Turkish diabetes outpatients. RESULTS: The original four....... Future studies should investigate its sensitivity and specificity as well as test-retest reliability....

  19. The evaluation of the Center for Epidemiologic Studies Depression (CES-D) scale : Depressed and Positive Affect in cancer patients and healthy reference subjects

    NARCIS (Netherlands)

    Schroevers, MJ; Sanderman, R; van Sonderen, E; Ranchor, AV

    2000-01-01

    This study examined the reliability and validity of a two-factor structure of the Center for Epidemiologic Studies Depression (CES-D) scale. The study was conducted in a large group of cancer patients (n = 475) and a matched reference group (n = 255). Both groups filled in a questionnaire at two

  20. Association between the Center for Epidemiologic Studies Depression Scale (CES-D and mortality in a community sample: An artifact of the somatic complaints factor?

    Directory of Open Access Journals (Sweden)

    Jeremy W. Pettit

    2008-01-01

    Full Text Available La mayoría de las investigaciones sobre la asociación entre la depresión y la mortalidad no han examinado distintos grupos de síntomas depresivos. Este estudio ex post facto examinó que aspectos de la depresión explican su asociación con la mortalidad. La Escala de Depresión del Centro de Estudios Epidemiológicos (CES-D fue administrada a 3.867 residentes comunitarios. El riesgo de mortalidad como función del estado depresivo y de cada uno de los 4 factores de la CES-D fue estimado con el modelo de azar proporcional de Cox. Los participantes deprimidos (CES-D > 16 tuvieron un riesgo elevado de mortalidad (HR 1,23, 95% CI 1,03-1,49 después de la corrección de variables sociodemográficos. Quejas somáticas fue el único factor que predijo la mortalidad (HR 1,19, 95% CI 1,03-1,38. Después de excluir Quejas somáticas, la CES-D no predijo la mortalidad (HR 0,98, 95% CI 0,79-1,21. La asociación entre los síntomas depresivos de la CES-D y la mortalidad parece ser una función del factor Quejas somáticas. Es posible que la asociación entre los síntomas depresivos no somáticos y la mortalidad no sea tan robusta como indican los hallazgos anteriores.

  1. Avaliação da sintomatologia depressiva de mulheres no climatério com a escala de rastreamento populacional para depressão CES-D Depressive symptoms in climacteric women evaluated by the Center for Epidemiological Studies Depression Scale

    Directory of Open Access Journals (Sweden)

    Rita de Cássia Leite Fernandes

    2008-12-01

    Full Text Available INTRODUÇÃO: O objetivo deste estudo foi avaliar a sintomatologia depressiva em mulheres climatéricas com a escala de depressão CES-D (Center for Epidemiological Studies Depression Scale, do National Institute of Mental Health (EUA. MÉTODO: Estudo transversal com 151 mulheres entre 40 e 65 anos de idade, usuárias de serviço de ginecologia geral em unidade de atenção básica à saúde no Rio de Janeiro. Aplicou-se a escala CES-D e um questionário estruturado para a obtenção de dados sociodemográficos, clínicos e ginecológicos. O nível de corte > 15 pontos na CES-D foi considerado como indicativo de quadro depressivo. RESULTADOS: A média de pontuação da amostra foi de 9,2 pontos (desvio padrão = 9,0. Os itens mais pontuados da escala foram relativos à insônia, tristeza e desânimo. Não houve associação significativa entre os escores e o período climatérico, características sociodemográficas, clínicas ou ginecológicas, exceto para as mulheres com presença de sintomas psíquicos, histórico depressivo pregresso e uso atual de antidepressivos (p = 0,000. Entre as 32 mulheres (21% com pontuação > 15 na CES-D, 72% referiram episódio depressivo pregresso. Dentre as participantes sem histórico depressivo, as perimenopáusicas apresentaram escores > 15 com maior freqüência. CONCLUSÃO: Essa casuística, oriunda de serviço não-especializado em menopausa ou saúde mental, revelou baixas pontuações médias na escala de sintomas depressivos CES-D, e o item insônia foi o mais pontuado. O histórico de depressão foi fator de associação com a alta pontuação na escala, mas não a fase climatérica em que a mulher se encontrava. A maior freqüência de pontuação acima do nível de corte nas mulheres sem histórico depressivo pregresso que estavam na perimenopausa sugere a maior vulnerabilidade, nessa fase, a episódios depressivos novos.INTRODUCTION: The objective of this study was to evaluate depressive symptoms in

  2. The Factor Structure, Predictors, and Percentile Norms of the Center for Epidemiologic Studies Depression (CES-D Scale in the Dutch-speaking Adult Population of Belgium

    Directory of Open Access Journals (Sweden)

    Qian Wu

    2016-01-01

    Full Text Available The Center of Epidemiologic Studies Depression Scale (CES-D is a commonly used self-report scale to measure depressive symptoms in the general population. In the present study, the Dutch version of the CES-D was administered to a sample of 837 Dutch-speaking adults of Belgium to examine the factor structure of the scale. Using confirmatory factory analysis (CFA, four first-order models and two second-order models were tested, and the second-order factor model with three pairs of correlated error terms provided the best fit to the data. Second, five socio-demographic variables (age, gender, education level, relation status, and family history of depression were included as covariates to the second-order factor model to explore the associations between background characteristics and the latent factor depression using a multiple indicators and multiple causes (MIMIC approach. Age had a significantly negative effect on depression, but the effect was not substantial. Female gender, lower education level, being single or widowed, and having a family history of depression were found to be significant predictors of higher levels of depression symptomatology. Finally, percentile norms on the CES-D raw scores were provided for subgroups of gender by education level for the general Dutch-speaking adult population of Belgium.

  3. The center for epidemiologic studies depression scale: a review with a theoretical and empirical examination of item content and factor structure.

    Directory of Open Access Journals (Sweden)

    R Nicholas Carleton

    Full Text Available BACKGROUND: The Center for Epidemiologic Studies Depression Scale (CES-D; Radloff, 1977 is a commonly used freely available self-report measure of depressive symptoms. Despite its popularity, several recent investigations have called into question the robustness and suitability of the commonly used 4-factor 20-item CES-D model. The goal of the current study was to address these concerns by confirming the factorial validity of the CES-D. METHODS AND FINDINGS: Differential item functioning estimates were used to examine sex biases in item responses, and confirmatory factor analyses were used to assess prior CES-D factor structures and new models heeding current theoretical and empirical considerations. Data used for the analyses included undergraduate (n = 948; 74% women, community (n = 254; 71% women, rehabilitation (n = 522; 53% women, clinical (n =84; 77% women, and National Health and Nutrition Examination Survey (NHANES; n = 2814; 56% women samples. Differential item functioning identified an item as inflating CES-D scores in women. Comprehensive comparison of the several models supported a novel, psychometrically robust, and unbiased 3-factor 14-item solution, with factors (i.e., negative affect, anhedonia, and somatic symptoms that are more in line with current diagnostic criteria for depression. CONCLUSIONS: Researchers and practitioners may benefit from using the novel factor structure of the CES-D and from being cautious in interpreting results from the originally proposed scale. Comprehensive results, implications, and future research directions are discussed.

  4. Psychometric comparison of single-item, short, and comprehensive depression screening measures in Korean young adults.

    Science.gov (United States)

    Kim, Hee-Ju; Abraham, Ivo

    2016-04-01

    Integrating long depression-screening instruments into routine clinical practice and research studies is often impractical, necessitating short-item if not single-item measures with comparable psychometric properties. To examine whether single-item or short depression-screening measures are comparable to a comprehensive screening measure in reliability (i.e., internal consistency and test-retest reliability) and validity (i.e., convergent, concurrent, and predictive validity) in Korean young adults within a Classical Testing Theory framework. A total of 458 students from six nursing colleges in South Korea completed three depression measures: the 20-item Center for Epidemiologic Studies-Depression screening instrument (CES-D; comprehensive measure); the five-item Profile of Mood States-Brief depression subscale (POMS-B depression subscale; short measure); a single-item Likert measure; and a single-item numeric rating scale. Internal consistency reliability was tested by Cronbach's alpha and item-total correlations; test-retest reliability by intraclass correlation coefficient (ICC); convergent validity by correlation with the CES-D; concurrent validity by the correlation with perceived stress level and sleep quality; and predictive validity by receiver operating characteristic curve to predict the two groups with different depression levels. The POMS-B depression subscale was comparable to the comprehensive CES-D scale in internal consistency reliability (alpha=.85); test-retest reliability (ICC=.76); and convergent (r=.81 with CES-D), concurrent (r=.64 with perceived stress level, r=.34 with sleep quality), and predictive validity (area under the curve=.88). The two single-item options were not comparable to the comprehensive CES-D. The short POMS-B depression subscale shows an acceptable balance between practical clinical and research needs and psychometric quality. Copyright © 2015 Elsevier Ltd. All rights reserved.

  5. Metastability of Non-reversible, Mean-Field Potts Model with Three Spins

    Science.gov (United States)

    Landim, C.; Seo, I.

    2016-11-01

    We examine a non-reversible, mean-field Potts model with three spins on a set with N\\uparrow ∞ points. Without an external field, there are three critical temperatures and five different metastable regimes. The analysis can be extended by a perturbative argument to the case of small external fields, and it can be carried out in the case where the external field is in the direction or in the opposite direction to one of the values of the spins. Numerical computations permit to identify other phenomena which are not present in the previous situations.

  6. Comparison of reversible and nonreversible aqueous-soluble lipidized conjugates of salmon calcitonin.

    Science.gov (United States)

    Cheng, Weiqiang; Lim, Lee-Yong

    2008-01-01

    Reversible aqueous lipidization (REAL) at the interdisulfide bond has been shown to improve the deliverability of some peptide drugs. Recently, we developed a nonreversible aqueous lipidization method targeted at the interdisulfide bond of salmon calcitonin (sCT). The resultant derivative had comparable hypocalcemic activity to sCT after subcutaneous injection in rats, despite possessing significantly different biophysical properties. The purpose of this study was to conduct a comparative evaluation of the biophysical properties of the reversible aqueous-soluble lipidized sCT (REAL-sCT) and its corresponding nonreversible aqueous-soluble compound (Mal-sCT) with a view to correlate these properties to the bioactivities of the peptides. REAL-sCT and Mal-sCT were successfully synthesized, purified and identified. Both conjugates showed comparable retention times in a C-18 HPLC column, as well as robust helical structures and aggregation behavior in water, although REAL-sCT was shown by dynamic light scattering experiments to form larger aggregates than Mal-sCT in water. The larger particle size of REAL-sCT correlated with its stronger resistance to degradation by intestinal enzymes. Unlike Mal-sCT, REAL-sCT was rapidly converted to sCT in liver juice; however, the regenerated sCT appeared to degrade at a slower rate than unmodified sCT in the liver juice. Compared with sCT, REAL-sCT after subcutaneous injection as an aqueous solution at a dose of 0.15 mg/kg produced a prolonged hypocalcemic activity that lasted at least 24 h in the rat. Using a novel LC-MS/MS method that was developed for this study, we were able to show concomitant increases in REAL-sCT and sCT plasma concentrations with time, the latter prevailing at 10% the molar concentration of the former. In contrast, sCT was not present in the plasma following the subcutaneous injection of Mal-sCT, although a comparable hypocalcemic activity with shorter duration was observed. Oral administration of REAL

  7. Lesion Analysis of Cortical Regions Associated with the Comprehension of Nonreversible and Reversible Yes/No Questions

    Science.gov (United States)

    Race, David S.; Ochfeld, Elisa; Leigh, Richard; Hillis, Argye E.

    2012-01-01

    We investigated the association between yes/no sentence comprehension and dysfunction in anterior and posterior left-hemisphere cortical regions in acute stroke patients. More specifically, we manipulated whether questions were Nonreversible (e.g., Are limes sour?) or Reversible (e.g., Is a horse larger than a dog?) to investigate the regions…

  8. Mental health (GHQ12; CES-D and attitudes towards the value of work among inmates of a semi-open prison and the long-term unemployed in Luxembourg

    Directory of Open Access Journals (Sweden)

    Le Bihan Etienne

    2008-06-01

    Full Text Available Abstract Aim To analyse the relationships between mental health and employment commitment among prisoners and the long-term unemployed (LTU trying to return to work. Method Fifty-two of 62 male inmates of a semi-open prison (Givenich Penitentiary Centre, the only such unit in Luxembourg, and 69 LTU registered at the Luxembourg Employment Administration completed a questionnaire exploring: 1 mental health (measured by means of scales GHQ12 and CES-D; 2 employment commitment; 3 availability of a support network, self-esteem, empowerment; and 4 socio-demographic characteristics. Results Compared with LTU, inmates were younger, more had work experience (54.9% vs 26.1%, and more were educated to only a low level (71.1% vs 58.0%. The link between employment commitment and mental health in the LTU was the opposite of that seen among the prisoners: the more significant the perceived importance of employment, the worse the mental health (GHQ12 p = 0.003; CES-D p Conclusion The two groups clearly need professional support. Future research should further investigate the link between different forms of professional help and mental health. Randomized controlled trials could be carried out in both groups, with interventions to improve work commitment for prisoners and to help with getting a job for LTU. For those LTU who value employment but cannot find it, the best help may be psychological support.

  9. Changes in G+C content of a neutrally evolving gene under a non-reversible dynamics measured by computer simulations based on experimental evolution data

    Directory of Open Access Journals (Sweden)

    Adriana Brunstein

    2004-01-01

    Full Text Available To evaluate the effects of non-reversibility on compositional base changes and the distribution of branch lengths along a phylogeny, we extended, by means of computer simulations, our previous sequential PCR in vitro evolution experiment. In that study a 18S rRNA gene evolved neutrally for 280 generations and a homogeneous non-stationary model of base substitution based on a non-reversible dynamics was built from the in vitro evolution data to describe the observed pattern of nucleotide substitutions. Here, the process was extended to 840 generations without selection, using the model parameters calculated from the in vitro evolution experiment. We observed that under a non-reversible model the G+C content of the sequences significantly increases when compared to simulations with a reversible model. The values of mean and variance of the branch lengths are reduced under a non-reversible dynamics although they follow a Poisson distribution. We conclude that the major implication of non-reversibility is the overall decrease of branch lengths, although no transition from a stochastic to an ordered process is observed. According to our model the result of this neutral process will be the increase in the G+C content of the descendant sequences with an overall decrease in the frequency of substitutions.

  10. 高校男护生抑郁情况及影响因素的初步分析%An evaluation of status of depression using CES-D and analysis of its influencing factors among male nursing undergraduates

    Institute of Scientific and Technical Information of China (English)

    徐英; 祁娟; 张丕德

    2013-01-01

    目的 了解高校男护理本科生的抑郁情况,并与全国城市同龄人群常模进行比较并初步探讨可能的影响因素,为相关心理干预方法的提出提供依据.方法 采用两阶段抽样的方法抽取两所高校102名男护生作为研究对象,利用流调中心抑郁量表(CES-D)测量抑郁发生情况,同时利用自拟的调查问卷探讨可能的影响因素.结果 男护生的抑郁发生率为14.0%,CES-D得分(11.00±6.37)分,与全国城市同龄人群水平相当.单因素分析提示来自农村、人际关系较差、感觉就业前景较差、自己的兴趣、爱好、需求、想法等得不到父母亲的关注是抑郁发生的危险因素;迸一步进行多因素分析提示,感觉就业前景较差及自己的兴趣、爱好、需求、想法等得不到父母亲的关注是抑郁发生的独立危险因素.结论 针对男护生,需要从家庭和就业前景两方面着手进一步改善其心理健康状况,为护理事业的发展储备合格的力量.%Objective To understand the status of depression and compare with the same age group of Chinese norms of CES-D in urban areas and analyze its potential influencing factors among male nursing undergraduates.Methods In this cross-sectional study,the CES-D and self-made questionnaire were administrated to 102 male nursing undergraduates from two schools of nursing through two-stage random sampling.Results The prevalence of depression was 14.0%,and the score of nursing undergraduates was (11.00±6.37) points,which had no significant difference from the scores of the same age group of Chinese norms in urban areas.Univariate analysis showed that coming from countryside,worse interpersonal relationship,worse career prospect,getting no attention from parents had some significant association with depression.Multivariate analysis showed that worse career prospect and getting no attention from parents associated with lower CES-D scores.Condusions For male nursing

  11. Characteristics of reversible and nonreversible COPD and asthma and COPD overlap syndrome patients: an analysis of salbutamol Easyhaler data.

    Science.gov (United States)

    Müller, Veronika; Gálffy, Gabriella; Orosz, Márta; Kováts, Zsuzsanna; Odler, Balázs; Selroos, Olof; Tamási, Lilla

    2016-01-01

    The choice of inhaler device for bronchodilator reversibility is crucial since suboptimal inhalation technique may influence the result. On the other hand, bronchodilator response also varies from time to time and may depend on patient characteristics. In this study, patients with airway obstruction (forced expiratory volume in 1 second [FEV1]/forced vital capacity [FVC] ratio salbutamol dry powder inhaler (Buventol Easyhaler, Orion Pharma Ltd, Espoo, Finland). Demographic data and patients' perceptions of Easyhaler compared with β2-agonist pressurized metered dose inhalers (pMDIs) were analyzed. American Thoracic Society/European Respiratory Society guideline defined reversibility was found in 21 out of 63 COPD patients and in two out of 12 ACOS patients. Airway obstruction was more severe in COPD patients as compared with controls (mean FEV1 and FEV1% predicted both Psalbutamol was significantly lower in COPD patients compared with asthma controls (Psalbutamol dry powder inhaler. Nonreversible patients with COPD were characterized by higher weight and body mass index, and a higher FEV1/FVC ratio. Most patients preferred Easyhaler compared with a pMDI.

  12. Nonreversible Homoclinic Snaking

    CERN Document Server

    Knobloch, Jürgen; Vielitz, Martin

    2010-01-01

    Homoclinic snaking refers to the sinusoidal snaking continuation curve of homoclinic orbits near a heteroclinic cycle connecting an equilibrium E and a periodic orbit P. Along this curve the homoclinic orbit performs more and more windings about the periodic orbit. Typically this behaviour appears in reversible Hamiltonian systems. Here we discuss this phenomenon in systems without any particular structure. We give a rigorous analytical verification of homoclinic snaking under certain assumptions on the behaviour of the stable and unstable manifolds of E and P. We show how the snaking behaviour depends on the signs of the Floquet multipliers of P. Further we present a nonsnaking scenario. Finally we show numerically that these assumptions are fulfilled in a model equation.

  13. Screening Test Items for Differential Item Functioning

    Science.gov (United States)

    Longford, Nicholas T.

    2014-01-01

    A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…

  14. Pattern analysis of total item score and item response of the Kessler Screening Scale for Psychological Distress (K6) in a nationally representative sample of US adults.

    Science.gov (United States)

    Tomitaka, Shinichiro; Kawasaki, Yohei; Ide, Kazuki; Akutagawa, Maiko; Yamada, Hiroshi; Yutaka, Ono; Furukawa, Toshiaki A

    2017-01-01

    Several recent studies have shown that total scores on depressive symptom measures in a general population approximate an exponential pattern except for the lower end of the distribution. Furthermore, we confirmed that the exponential pattern is present for the individual item responses on the Center for Epidemiologic Studies Depression Scale (CES-D). To confirm the reproducibility of such findings, we investigated the total score distribution and item responses of the Kessler Screening Scale for Psychological Distress (K6) in a nationally representative study. Data were drawn from the National Survey of Midlife Development in the United States (MIDUS), which comprises four subsamples: (1) a national random digit dialing (RDD) sample, (2) oversamples from five metropolitan areas, (3) siblings of individuals from the RDD sample, and (4) a national RDD sample of twin pairs. K6 items are scored using a 5-point scale: "none of the time," "a little of the time," "some of the time," "most of the time," and "all of the time." The pattern of total score distribution and item responses were analyzed using graphical analysis and exponential regression model. The total score distributions of the four subsamples exhibited an exponential pattern with similar rate parameters. The item responses of the K6 approximated a linear pattern from "a little of the time" to "all of the time" on log-normal scales, while "none of the time" response was not related to this exponential pattern. The total score distribution and item responses of the K6 showed exponential patterns, consistent with other depressive symptom scales.

  15. 室温不可逆原电池技术制备BaMoO4多晶薄膜%Room Temperature Preparation of BaMoO4 Polycrystalline Thin Film by a Nonreversible Galvanic Cell Technique

    Institute of Scientific and Technical Information of China (English)

    徐成刚; 赖欣; 崔春华; 毕剑; 高道江

    2009-01-01

    Well-crystallized BaMoO4 polycrystalline films were prepared directly on molybdenum substrate by a nonreversible galvanic cell method without applied current in barium hydroxide aqueous solution at room temperature (25 °C). The microstructures of the prepared films were characterized by using XRD, SEM and Raman spectrometer, respectively; and the formation mechanism of the films under the electrochemical conditions was discussed. The XRD, SEM and Raman measurements results indicate that the as-prepared thin films are a single phase BaMoO4 with a scheelite-type structure and show uniform and homogeneous surfaces; the formation of BaMoO4 film includes anode dissolution-oxidation reaction and solution deposition reaction in the nonreversible galvanic cell. The driving force of the nonreversible galvanic cell reaction is the difference of the electrode potential between anode and cathode of the cell.%室温条件下于Ba(OH)2溶液中采用无外加电流的非可逆原电池技术在钼金属基片上制备了BaMoO4多晶薄膜.采用XRD、SEM和Raman技术表征了制备薄膜的微结构;同时探讨了BaMoO4薄膜的电化学形成机制.XRD,SEM和Raman分析结果表明制备的薄膜表面均匀、致密,是白钨矿结构的四方单相BaMoO4;非可逆原电池条件下BaMoO4薄膜的形成包括阳极氧化溶解反应和溶液沉积反应,原电池反应的驱动力是电池的正极与负极的电势差.

  16. Usefulness of a single-item measure of depression to predict mortality: the GAZEL prospective cohort study

    Science.gov (United States)

    Lefèvre, Thomas; Singh-Manoux, Archana; Stringhini, Silvia; Dugravot, Aline; Lemogne, Cédric; Consoli, Silla M.; Goldberg, Marcel; Zins, Marie

    2012-01-01

    Background: It remains unknown whether short measures of depression perform as well as long measures in predicting adverse outcomes such as mortality. The present study aims to examine the predictive value of a single-item measure of depression for mortality. Methods: A total of 14 185 participants of the GAZEL cohort completed the 20-item Center-for-Epidemiologic-Studies-Depression (CES-D) scale in 1996. One of these items (I felt depressed) was used as a single-item measure of depression. All-cause mortality data were available until 30 September 2009, a mean follow-up period of 12.7 years with a total of 650 deaths. Results: In Cox regression model adjusted for baseline socio-demographic characteristics, a one-unit increase in the single-item score (range 0–3) was associated with a 25% higher risk of all-cause mortality (95% CI: 13–37%, P < 0.001). Further adjustment for health-related behaviours and physical chronic diseases reduced this risk by 36% and 8%, respectively. After adjustment for all these variables, every one-unit increase in the single-item score predicted a 15% increased risk of death (95% CI: 5–27%, P < 0.01). There is also an evidence of a dose–reponse relationship between reponse scores on the single-item measure of depression and mortality. Conclusion: This study shows that a single-item measure of depression is associated with an increased risk of death. Given its simplicity and ease of administration, a very simple single-item measure of depression might be useful for identifying middle-aged adults at risk for elevated depressive symptoms in large epidemiological studies and clinical settings. PMID:21840893

  17. Writing better test items.

    Science.gov (United States)

    Aucoin, Julia W

    2005-01-01

    Professional development specialists have had little opportunity to learn how to write test items to meet the expectations of today's graduate nurse. Schools of nursing have moved away from knowledge-level test items and have had to develop more application and analysis items to prepare graduates for the National Council Licensure Examination (NCLEX). This same type of question can be used effectively to support a competence assessment system and document critical thinking skills.

  18. Evaluation of item candidates: the PROMIS qualitative item review.

    Science.gov (United States)

    DeWalt, Darren A; Rothrock, Nan; Yount, Susan; Stone, Arthur A

    2007-05-01

    One of the PROMIS (Patient-Reported Outcome Measurement Information System) network's primary goals is the development of a comprehensive item bank for patient-reported outcomes of chronic diseases. For its first set of item banks, PROMIS chose to focus on pain, fatigue, emotional distress, physical function, and social function. An essential step for the development of an item pool is the identification, evaluation, and revision of extant questionnaire items for the core item pool. In this work, we also describe the systematic process wherein items are classified for subsequent statistical processing by the PROMIS investigators. Six phases of item development are documented: identification of extant items, item classification and selection, item review and revision, focus group input on domain coverage, cognitive interviews with individual items, and final revision before field testing. Identification of items refers to the systematic search for existing items in currently available scales. Expert item review and revision was conducted by trained professionals who reviewed the wording of each item and revised as appropriate for conventions adopted by the PROMIS network. Focus groups were used to confirm domain definitions and to identify new areas of item development for future PROMIS item banks. Cognitive interviews were used to examine individual items. Items successfully screened through this process were sent to field testing and will be subjected to innovative scale construction procedures.

  19. The Role of Item Models in Automatic Item Generation

    Science.gov (United States)

    Gierl, Mark J.; Lai, Hollis

    2012-01-01

    Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…

  20. SHIPPING OF RADIOACTIVE ITEMS

    CERN Multimedia

    TIS/RP Group

    2001-01-01

    The TIS-RP group informs users that shipping of small radioactive items is normally guaranteed within 24 hours from the time the material is handed in at the TIS-RP service. This time is imposed by the necessary procedures (identification of the radionuclides, determination of dose rate and massive objects require a longer procedure and will therefore take longer.

  1. Item Banking with Embedded Standards

    Science.gov (United States)

    MacCann, Robert G.; Stanley, Gordon

    2009-01-01

    An item banking method that does not use Item Response Theory (IRT) is described. This method provides a comparable grading system across schools that would be suitable for low-stakes testing. It uses the Angoff standard-setting method to obtain item ratings that are stored with each item. An example of such a grading system is given, showing how…

  2. Item Banking with Embedded Standards

    Science.gov (United States)

    MacCann, Robert G.; Stanley, Gordon

    2009-01-01

    An item banking method that does not use Item Response Theory (IRT) is described. This method provides a comparable grading system across schools that would be suitable for low-stakes testing. It uses the Angoff standard-setting method to obtain item ratings that are stored with each item. An example of such a grading system is given, showing how…

  3. SHIPPING OF RADIOACTIVE ITEMS

    CERN Multimedia

    TIS/RP Group

    2001-01-01

    The TIS-RP group informs users that shipping of small radioactive items is normally guaranteed within 24 hours from the time the material is handed in at the TIS-RP service. This time is imposed by the necessary procedures (identification of the radionuclides, determination of dose rate, preparation of the package and related paperwork). Large and massive objects require a longer procedure and will therefore take longer.

  4. Faculty development on item writing substantially improves item quality.

    NARCIS (Netherlands)

    Naeem, N.; Vleuten, C.P.M. van der; Alfaris, E.A.

    2012-01-01

    The quality of items written for in-house examinations in medical schools remains a cause of concern. Several faculty development programs are aimed at improving faculty's item writing skills. The purpose of this study was to evaluate the effectiveness of a faculty development program in item develo

  5. IRT Item Parameter Scaling for Developing New Item Pools

    Science.gov (United States)

    Kang, Hyeon-Ah; Lu, Ying; Chang, Hua-Hua

    2017-01-01

    Increasing use of item pools in large-scale educational assessments calls for an appropriate scaling procedure to achieve a common metric among field-tested items. The present study examines scaling procedures for developing a new item pool under a spiraled block linking design. The three scaling procedures are considered: (a) concurrent…

  6. Item Overexposure in Computerized Classification Tests Using Sequential Item Selection

    Directory of Open Access Journals (Sweden)

    Alan Huebner

    2012-06-01

    Full Text Available Computerized classification tests (CCTs often use sequential item selection which administers items according to maximizing psychometric information at a cut point demarcating passing and failing scores. This paper illustrates why this method of item selection leads to the overexposure of a significant number of items, and the performances of three different methods for controlling maximum item exposure rates in CCTs are compared. Specifically, the Sympson-Hetter, restricted, and item eligibility methods are examined in two studies realistically simulating different types of CCTs and are evaluated based upon criteria including classification accuracy, the number of items exceeding the desired maximum exposure rate, and test overlap. The pros and cons of each method are discussed from a practical perspective.

  7. Bayesian item fit analysis for unidimensional item response theory models.

    Science.gov (United States)

    Sinharay, Sandip

    2006-11-01

    Assessing item fit for unidimensional item response theory models for dichotomous items has always been an issue of enormous interest, but there exists no unanimously agreed item fit diagnostic for these models, and hence there is room for further investigation of the area. This paper employs the posterior predictive model-checking method, a popular Bayesian model-checking tool, to examine item fit for the above-mentioned models. An item fit plot, comparing the observed and predicted proportion-correct scores of examinees with different raw scores, is suggested. This paper also suggests how to obtain posterior predictive p-values (which are natural Bayesian p-values) for the item fit statistics of Orlando and Thissen that summarize numerically the information in the above-mentioned item fit plots. A number of simulation studies and a real data application demonstrate the effectiveness of the suggested item fit diagnostics. The suggested techniques seem to have adequate power and reasonable Type I error rate, and psychometricians will find them promising.

  8. GRE Verbal Analogy Items: Examinee Reasoning on Items.

    Science.gov (United States)

    Duran, Richard P.; And Others

    Information about how Graduate Record Examination (GRE) examinees solve verbal analogy problems was obtained in this study through protocol analysis. High- and low-ability subjects who had recently taken the GRE General Test were asked to "think aloud" as they worked through eight analogy items. These items varied factorially on the…

  9. A Mixed Effects Randomized Item Response Model

    Science.gov (United States)

    Fox, J.-P.; Wyrick, Cheryl

    2008-01-01

    The randomized response technique ensures that individual item responses, denoted as true item responses, are randomized before observing them and so-called randomized item responses are observed. A relationship is specified between randomized item response data and true item response data. True item response data are modeled with a (non)linear…

  10. Computerized adaptive testing with item cloning

    NARCIS (Netherlands)

    Glas, Cornelis A.W.; van der Linden, Willem J.

    2003-01-01

    To increase the number of items available for adaptive testing and reduce the cost of item writing, the use of techniques of item cloning has been proposed. An important consequence of item cloning is possible variability between the item parameters. To deal with this variability, a multilevel item

  11. Item Veto: Dangerous Constitutional Tinkering.

    Science.gov (United States)

    Bellamy, Calvin

    1989-01-01

    In theory, the item veto would empower the President to remove wasteful and unnecessary projects from legislation. Yet, despite its history at the state level, the item veto is a loosely defined concept that may not work well at the federal level. Much more worrisome is the impact on the balance of power. (Author/CH)

  12. Continuous Online Item Calibration: Parameter Recovery and Item Utilization.

    Science.gov (United States)

    Ren, Hao; van der Linden, Wim J; Diao, Qi

    2017-06-01

    Parameter recovery and item utilization were investigated for different designs for online test item calibration. The design was adaptive in a double sense: it assumed both adaptive testing of examinees from an operational pool of previously calibrated items and adaptive assignment of field-test items to the examinees. Four criteria of optimality for the assignment of the field-test items were used, each of them based on the information in the posterior distributions of the examinee's ability parameter during adaptive testing as well as the sequentially updated posterior distributions of the field-test item parameters. In addition, different stopping rules based on target values for the posterior standard deviations of the field-test parameters and the size of the calibration sample were used. The impact of each of the criteria and stopping rules on the statistical efficiency of the estimates of the field-test parameters and on the time spent by the items in the calibration procedure was investigated. Recommendations as to the practical use of the designs are given.

  13. Assessing item fit for unidimensional item response theory models using residuals from estimated item response functions.

    Science.gov (United States)

    Haberman, Shelby J; Sinharay, Sandip; Chon, Kyong Hee

    2013-07-01

    Residual analysis (e.g. Hambleton & Swaminathan, Item response theory: principles and applications, Kluwer Academic, Boston, 1985; Hambleton, Swaminathan, & Rogers, Fundamentals of item response theory, Sage, Newbury Park, 1991) is a popular method to assess fit of item response theory (IRT) models. We suggest a form of residual analysis that may be applied to assess item fit for unidimensional IRT models. The residual analysis consists of a comparison of the maximum-likelihood estimate of the item characteristic curve with an alternative ratio estimate of the item characteristic curve. The large sample distribution of the residual is proved to be standardized normal when the IRT model fits the data. We compare the performance of our suggested residual to the standardized residual of Hambleton et al. (Fundamentals of item response theory, Sage, Newbury Park, 1991) in a detailed simulation study. We then calculate our suggested residuals using data from an operational test. The residuals appear to be useful in assessing the item fit for unidimensional IRT models.

  14. Faculty development on item writing substantially improves item quality.

    Science.gov (United States)

    Naeem, Naghma; van der Vleuten, Cees; Alfaris, Eiad Abdelmohsen

    2012-08-01

    The quality of items written for in-house examinations in medical schools remains a cause of concern. Several faculty development programs are aimed at improving faculty's item writing skills. The purpose of this study was to evaluate the effectiveness of a faculty development program in item development. An objective method was developed and used to assess improvement in faculty's competence to develop high quality test items. This was a quasi experimental study with a pretest-midtest-posttest design. A convenience sample of 51 faculty members participated. Structured checklists were used to assess the quality of test items at each phase of the study. Group scores were analyzed using repeated measures analysis of variance. The results showed a significant increase in participants' mean scores on Multiple Choice Questions, Short Answer Questions and Objective Structured Clinical Examination checklists from pretest to posttest (p development are generally lacking in quality. It also provides evidence of the value of faculty development in improving the quality of items generated by faculty.

  15. Counting Frequencies from Zotero Items

    Directory of Open Access Journals (Sweden)

    Spencer Roberts

    2013-04-01

    Full Text Available In Counting Frequencies you learned how to count the frequency of specific words in a list using python. In this lesson, we will expand on that topic by showing you how to get information from Zotero HTML items, save the content from those items, and count the frequencies of words. It may be beneficial to look over the previous lesson before we begin.

  16. Multilevel Modeling of Item Position Effects

    Science.gov (United States)

    Albano, Anthony D.

    2013-01-01

    In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…

  17. Australian Item Bank Program: Social Science Item Bank.

    Science.gov (United States)

    Australian Council for Educational Research, Hawthorn.

    After vigorous review, editing, and trial testing, this item bank was compiled to help secondary school teachers construct objective tests in the social sciences. Anthropology, economics, ethnic and cultural studies, geography, history, legal studies, politics, and sociology are among the topics represented. The bank consists of multiple choice…

  18. Modelling sequentially scored item responses

    NARCIS (Netherlands)

    Akkermans, W.

    2000-01-01

    The sequential model can be used to describe the variable resulting from a sequential scoring process. In this paper two more item response models are investigated with respect to their suitability for sequential scoring: the partial credit model and the graded response model. The investigation is c

  19. Psychometric Consequences of Subpopulation Item Parameter Drift

    Science.gov (United States)

    Huggins-Manley, Anne Corinne

    2017-01-01

    This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…

  20. Psychometric Consequences of Subpopulation Item Parameter Drift

    Science.gov (United States)

    Huggins-Manley, Anne Corinne

    2017-01-01

    This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…

  1. Unidimensional Interpretations for Multidimensional Test Items

    Science.gov (United States)

    Kahraman, Nilufer

    2013-01-01

    This article considers potential problems that can arise in estimating a unidimensional item response theory (IRT) model when some test items are multidimensional (i.e., show a complex factorial structure). More specifically, this study examines (1) the consequences of model misfit on IRT item parameter estimates due to unintended minor item-level…

  2. Generalizability theory and item response theory

    NARCIS (Netherlands)

    Glas, Cornelis A.W.; Eggen, T.J.H.M.; Veldkamp, B.P.

    2012-01-01

    Item response theory is usually applied to items with a selected-response format, such as multiple choice items, whereas generalizability theory is usually applied to constructed-response tasks assessed by raters. However, in many situations, raters may use rating scales consisting of items with a

  3. Generalizability theory and item response theory

    NARCIS (Netherlands)

    Glas, C.A.W.; Eggen, T.J.H.M.; Veldkamp, B.P.

    2012-01-01

    Item response theory is usually applied to items with a selected-response format, such as multiple choice items, whereas generalizability theory is usually applied to constructed-response tasks assessed by raters. However, in many situations, raters may use rating scales consisting of items with a s

  4. 15 CFR 742.15 - Encryption items.

    Science.gov (United States)

    2010-01-01

    ... 15 Commerce and Foreign Trade 2 2010-01-01 2010-01-01 false Encryption items. 742.15 Section 742... BASED CONTROLS § 742.15 Encryption items. Encryption items can be used to maintain the secrecy of... export and reexport of encryption items. As the President indicated in Executive Order 13026 and in his...

  5. Item-Writing Guidelines for Physics

    Science.gov (United States)

    Regan, Tom

    2015-01-01

    A teacher learning how to write test questions (test items) will almost certainly encounter item-writing guidelines--lists of item-writing do's and don'ts. Item-writing guidelines usually are presented as applicable across all assessment settings. Table I shows some guidelines that I believe to be generally applicable and two will be briefly…

  6. Unidimensional Interpretations for Multidimensional Test Items

    Science.gov (United States)

    Kahraman, Nilufer

    2013-01-01

    This article considers potential problems that can arise in estimating a unidimensional item response theory (IRT) model when some test items are multidimensional (i.e., show a complex factorial structure). More specifically, this study examines (1) the consequences of model misfit on IRT item parameter estimates due to unintended minor item-level…

  7. The impact of item-writing flaws and item complexity on examination item difficulty and discrimination value.

    Science.gov (United States)

    Rush, Bonnie R; Rankin, David C; White, Brad J

    2016-09-29

    Failure to adhere to standard item-writing guidelines may render examination questions easier or more difficult than intended. Item complexity describes the cognitive skill level required to obtain a correct answer. Higher cognitive examination items promote critical thinking and are recommended to prepare students for clinical training. This study evaluated faculty-authored examinations to determine the impact of item-writing flaws and item complexity on the difficulty and discrimination value of examination items used to assess third year veterinary students. The impact of item-writing flaws and item complexity (cognitive level I-V) on examination item difficulty and discrimination value was evaluated on 1925 examination items prepared by clinical faculty for third year veterinary students. The mean (± SE) percent correct (83.3 % ± 17.5) was consistent with target values in professional education, and the mean discrimination index (0.18 ± 0.17) was slightly lower than recommended (0.20). More than one item-writing flaw was identified in 37.3 % of questions. The most common item-writing flaws were awkward stem structure, implausible distractors, longest response is correct, and responses are series of true-false statements. Higher cognitive skills (complexity level III-IV) were required to correctly answer 38.4 % of examination items. As item complexity increased, item difficulty and discrimination values increased. The probability of writing discriminating, difficult examination items decreased when implausible distractors and all of the above were used, and increased if the distractors were comprised of a series of true/false statements. Items with four distractors were not more difficult or discriminating than items with three distractors. Preparation of examination questions targeting higher cognitive levels will increase the likelihood of constructing discriminating items. Use of implausible distractors to complete a five-option multiple choice

  8. Teoria da Resposta ao Item Teoria de la respuesta al item Item response theory

    Directory of Open Access Journals (Sweden)

    Eutalia Aparecida Candido de Araujo

    2009-12-01

    Full Text Available A preocupação com medidas de traços psicológicos é antiga, sendo que muitos estudos e propostas de métodos foram desenvolvidos no sentido de alcançar este objetivo. Entre os trabalhos propostos, destaca-se a Teoria da Resposta ao Item (TRI que, a princípio, veio completar limitações da Teoria Clássica de Medidas, empregada em larga escala até hoje na medida de traços psicológicos. O ponto principal da TRI é que ela leva em consideração o item particularmente, sem relevar os escores totais; portanto, as conclusões não dependem apenas do teste ou questionário, mas de cada item que o compõe. Este artigo propõe-se a apresentar esta Teoria que revolucionou a teoria de medidas.La preocupación con las medidas de los rasgos psicológicos es antigua y muchos estudios y propuestas de métodos fueron desarrollados para lograr este objetivo. Entre estas propuestas de trabajo se incluye la Teoría de la Respuesta al Ítem (TRI que, en principio, vino a completar las limitaciones de la Teoría Clásica de los Tests, ampliamente utilizada hasta hoy en la medida de los rasgos psicológicos. El punto principal de la TRI es que se tiene en cuenta el punto concreto, sin relevar las puntuaciones totales; por lo tanto, los resultados no sólo dependen de la prueba o cuestionario, sino que de cada ítem que lo compone. En este artículo se propone presentar la Teoría que revolucionó la teoría de medidas.The concern with measures of psychological traits is old and many studies and proposals of methods were developed to achieve this goal. Among these proposed methods highlights the Item Response Theory (IRT that, in principle, came to complete limitations of the Classical Test Theory, which is widely used until nowadays in the measurement of psychological traits. The main point of IRT is that it takes into account the item in particular, not relieving the total scores; therefore, the findings do not only depend on the test or questionnaire

  9. Sur deux espèces D'anthessius (Copepoda) des Indes Orientales

    NARCIS (Netherlands)

    Stock, J.H.

    1964-01-01

    Les espèces du genre Anthessius Della Valle, 1880, sont associées de façon très préférentielle à deux classes de Mollusques: les Pélécypodes et les Gastéropodes. Notre connaissance taxonomique du genre progresse actuellement de manière très rapide. Il y a cinq ans seulement, on ne connaissait que 11

  10. Factorial Structure of the CES-D among American Indian Adolescents.

    Science.gov (United States)

    Dick, Rhonda Wiegman; And Others

    1994-01-01

    Examined factor structure of Center for Epidemiologic Studies-Depression Scale among American Indian adolescents attending boarding school. Found that Depressed and Somatic factors were highly correlated and should be collapsed into one factor. No differences in factor structure were found by gender. (Author/NB)

  11. Sharing the cost of redundant items

    DEFF Research Database (Denmark)

    Hougaard, Jens Leth; Moulin, Hervé

    2014-01-01

    are network connectivity problems when an existing (possibly inefficient) network must be maintained. We axiomatize a family cost ratios based on simple liability indices, one for each agent and for each item, measuring the relative worth of this item across agents, and generating cost allocation rules......We ask how to share the cost of finitely many public goods (items) among users with different needs: some smaller subsets of items are enough to serve the needs of each user, yet the cost of all items must be covered, even if this entails inefficiently paying for redundant items. Typical examples...... additive in costs....

  12. Unfair items detection in educational measurement

    CERN Document Server

    Bakman, Yefim

    2012-01-01

    Measurement professionals cannot come to an agreement on the definition of the term 'item fairness'. In this paper a continuous measure of item unfairness is proposed. The more the unfairness measure deviates from zero, the less fair the item is. If the measure exceeds the cutoff value, the item is identified as definitely unfair. The new approach can identify unfair items that would not be identified with conventional procedures. The results are in accord with experts' judgments on the item qualities. Since no assumptions about scores distributions and/or correlations are assumed, the method is applicable to any educational test. Its performance is illustrated through application to scores of a real test.

  13. A Bifactor Multidimensional Item Response Theory Model for Differential Item Functioning Analysis on Testlet-Based Items

    Science.gov (United States)

    Fukuhara, Hirotaka; Kamata, Akihito

    2011-01-01

    A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into…

  14. Using automatic item generation to create multiple-choice test items.

    Science.gov (United States)

    Gierl, Mark J; Lai, Hollis; Turner, Simon R

    2012-08-01

    Many tests of medical knowledge, from the undergraduate level to the level of certification and licensure, contain multiple-choice items. Although these are efficient in measuring examinees' knowledge and skills across diverse content areas, multiple-choice items are time-consuming and expensive to create. Changes in student assessment brought about by new forms of computer-based testing have created the demand for large numbers of multiple-choice items. Our current approaches to item development cannot meet this demand. We present a methodology for developing multiple-choice items based on automatic item generation (AIG) concepts and procedures. We describe a three-stage approach to AIG and we illustrate this approach by generating multiple-choice items for a medical licensure test in the content area of surgery. To generate multiple-choice items, our method requires a three-stage process. Firstly, a cognitive model is created by content specialists. Secondly, item models are developed using the content from the cognitive model. Thirdly, items are generated from the item models using computer software. Using this methodology, we generated 1248 multiple-choice items from one item model. Automatic item generation is a process that involves using models to generate items using computer technology. With our method, content specialists identify and structure the content for the test items, and computer technology systematically combines the content to generate new test items. By combining these outcomes, items can be generated automatically. © Blackwell Publishing Ltd 2012.

  15. Item calibration in incomplete testing designs

    NARCIS (Netherlands)

    Eggen, Theo J.H.M.; Verhelst, Norman D.

    2011-01-01

    This study discusses the justifiability of item parameter estimation in incomplete testing designs in item response theory. Marginal maximum likelihood (MML) as well as conditional maximum likelihood (CML) procedures are considered in three commonly used incomplete designs: random incomplete, multis

  16. Processing Polarity Items: Contrastive Licensing Costs

    Science.gov (United States)

    Saddy, Douglas; Drenhaus, Heiner; Frisch, Stefan

    2004-01-01

    We describe an experiment that investigated the failure to license polarity items in German using event-related brain potentials (ERPs). The results reveal distinct processing reflexes associated with failure to license positive polarity items in comparison to failure to license negative polarity items. Failure to license both negative and…

  17. Towards an authoring system for item construction

    NARCIS (Netherlands)

    Rikers, Jos H.A.N.

    1988-01-01

    The process of writing test items is analyzed, and a blueprint is presented for an authoring system for test item writing to reduce invalidity and to structure the process of item writing. The developmental methodology is introduced, and the first steps in the process are reported. A historical revi

  18. Generalized Full-Information Item Bifactor Analysis

    Science.gov (United States)

    Cai, Li; Yang, Ji Seung; Hansen, Mark

    2011-01-01

    Full-information item bifactor analysis is an important statistical method in psychological and educational measurement. Current methods are limited to single-group analysis and inflexible in the types of item response models supported. We propose a flexible multiple-group item bifactor analysis framework that supports a variety of…

  19. Real and Artificial Differential Item Functioning

    Science.gov (United States)

    Andrich, David; Hagquist, Curt

    2012-01-01

    The literature in modern test theory on procedures for identifying items with differential item functioning (DIF) among two groups of persons includes the Mantel-Haenszel (MH) procedure. Generally, it is not recognized explicitly that if there is real DIF in some items which favor one group, then as an artifact of this procedure, artificial DIF…

  20. Item Response Methods for Educational Assessment.

    Science.gov (United States)

    Mislevy, Robert J.; Rieser, Mark R.

    Multiple matrix sampling (MMS) theory indicates how data may be gathered to most efficiently convey information about levels of attainment in a population, but standard analyses of these data require random sampling of items from a fixed pool of items. This assumption proscribes the retirement of flawed or obsolete items from the pool as well as…

  1. Matrix Sampling of Test Items. ERIC Digest.

    Science.gov (United States)

    Childs, Ruth A.; Jaciw, Andrew P.

    This Digest describes matrix sampling of test items as an approach to achieving broad coverage while minimizing testing time per student. Matrix sampling involves developing a complete set of items judged to cover the curriculum, then dividing the items into subsets and administering one subset to each student. Matrix sampling, by limiting the…

  2. The Multidimensionality of Verbal Analogy Items

    Science.gov (United States)

    Ullstadius, Eva; Carlstedt, Berit; Gustafsson, Jan-Eric

    2008-01-01

    The influence of general and verbal ability on each of 72 verbal analogy test items were investigated with new factor analytical techniques. The analogy items together with the Computerized Swedish Enlistment Battery (CAT-SEB) were given randomly to two samples of 18-year-old male conscripts (n = 8566 and n = 5289). Thirty-two of the 72 items had…

  3. Optimal item pool design for computerized adaptive tests with polytomous items using GPCM

    Directory of Open Access Journals (Sweden)

    Xuechun Zhou

    2014-09-01

    Full Text Available Computerized adaptive testing (CAT is a testing procedure with advantages in improving measurement precision and increasing test efficiency. An item pool with optimal characteristics is the foundation for a CAT program to achieve those desirable psychometric features. This study proposed a method to design an optimal item pool for tests with polytomous items using the generalized partial credit model (G-PCM. It extended a method for approximating optimality with polytomous items being described succinctly for the purpose of pool design. Optimal item pools were generated using CAT simulations with and without practical constraints of content balancing and item exposure control. The performances of the item pools were evaluated against an operational item pool. The results indicated that the item pools designed with stratification based on discrimination parameters performed well with an efficient use of the less discriminative items within the target accuracy levels. The implications for developing item pools are also discussed.

  4. Emergency Power For Critical Items

    Science.gov (United States)

    Young, William R.

    2009-07-01

    Natural disasters, such as hurricanes, floods, tornados, and tsunami, are becoming a greater problem as climate change impacts our environment. Disasters, whether natural or man made, destroy lives, homes, businesses and the natural environment. Such disasters can happen with little or no warning, leaving hundreds or even thousands of people without medical services, potable water, sanitation, communications and electrical services for up to several weeks. In our modern world, the need for electricity has become a necessity. Modern building codes and new disaster resistant building practices are reducing the damage to homes and businesses. Emergency gasoline and diesel generators are becoming common place for power outages. Generators need fuel, which may not be available after a disaster, but Photovoltaic (solar-electric) systems supply electricity without petroleum fuel as they are powered by the sun. Photovoltaic (PV) systems can provide electrical power for a home or business. PV systems can operate as utility interactive or stand-alone with battery backup. Determining your critical load items and sizing the photovoltaic system for those critical items, guarantees their operation in a disaster.

  5. Intact Transition Epitope Mapping (ITEM)

    Science.gov (United States)

    Yefremova, Yelena; Opuni, Kwabena F. M.; Danquah, Bright D.; Thiesen, Hans-Juergen; Glocker, Michael O.

    2017-08-01

    Intact transition epitope mapping (ITEM) enables rapid and accurate determination of protein antigen-derived epitopes by either epitope extraction or epitope excision. Upon formation of the antigen peptide-containing immune complex in solution, the entire mixture is electrosprayed to translate all constituents as protonated ions into the gas phase. There, ions from antibody-peptide complexes are separated from unbound peptide ions according to their masses, charges, and shapes either by ion mobility drift or by quadrupole ion filtering. Subsequently, immune complexes are dissociated by collision induced fragmentation and the ion signals of the "complex-released peptides," which in effect are the epitope peptides, are recorded in the time-of-flight analyzer of the mass spectrometer. Mixing of an antibody solution with a solution in which antigens or antigen-derived peptides are dissolved is, together with antigen proteolysis, the only required in-solution handling step. Simplicity of sample handling and speed of analysis together with very low sample consumption makes ITEM faster and easier to perform than other experimental epitope mapping methods.

  6. Expected linking error resulting from item parameter drift among the common Items on Rasch calibrated tests.

    Science.gov (United States)

    Miller, G Edward; Gesn, Paul Randall; Rotou, Jourania

    2005-01-01

    In state assessment programs that employ Rasch-based common item linking procedures, the linking constant is usually estimated with only those common items not identified as exhibiting item difficulty parameter drift. Since state assessments typically contain a fixed number of items, an item classified as exhibiting parameter drift during the linking process remains on the exam as a scorable item even if it is removed from the common item set. Under the assumption that item parameter drift has occurred for one or more of the common items, the expected effect of including or excluding the "affected" item(s) in the estimation of the linking constant is derived in this article. If the item parameter drift is due solely to factors not associated with a change in examinee achievement, no linking error will (be expected to) occur given that the linking constant is estimated only with the items not identified as "affected"; linking error will (be expected to) occur if the linking constant is estimated with all common items. However, if the item parameter drift is due solely to change in examinee achievement, the opposite is true: no linking error will (be expected to) occur if the linking constant is estimated with all common items; linking error will (be expected to) occur if the linking constant is estimated only with the items not identified as "affected".

  7. Approximation Preserving Reductions among Item Pricing Problems

    Science.gov (United States)

    Hamane, Ryoso; Itoh, Toshiya; Tomita, Kouhei

    When a store sells items to customers, the store wishes to determine the prices of the items to maximize its profit. Intuitively, if the store sells the items with low (resp. high) prices, the customers buy more (resp. less) items, which provides less profit to the store. So it would be hard for the store to decide the prices of items. Assume that the store has a set V of n items and there is a set E of m customers who wish to buy those items, and also assume that each item i ∈ V has the production cost di and each customer ej ∈ E has the valuation vj on the bundle ej ⊆ V of items. When the store sells an item i ∈ V at the price ri, the profit for the item i is pi = ri - di. The goal of the store is to decide the price of each item to maximize its total profit. We refer to this maximization problem as the item pricing problem. In most of the previous works, the item pricing problem was considered under the assumption that pi ≥ 0 for each i ∈ V, however, Balcan, et al. [In Proc. of WINE, LNCS 4858, 2007] introduced the notion of “loss-leader, ” and showed that the seller can get more total profit in the case that pi < 0 is allowed than in the case that pi < 0 is not allowed. In this paper, we derive approximation preserving reductions among several item pricing problems and show that all of them have algorithms with good approximation ratio.

  8. Losing Items in the Psychogeriatric Nursing Home

    Directory of Open Access Journals (Sweden)

    J. van Hoof PhD

    2016-09-01

    Full Text Available Introduction: Losing items is a time-consuming occurrence in nursing homes that is ill described. An explorative study was conducted to investigate which items got lost by nursing home residents, and how this affects the residents and family caregivers. Method: Semi-structured interviews and card sorting tasks were conducted with 12 residents with early-stage dementia and 12 family caregivers. Thematic analysis was applied to the outcomes of the sessions. Results: The participants stated that numerous personal items and assistive devices get lost in the nursing home environment, which had various emotional, practical, and financial implications. Significant amounts of time are spent on trying to find items, varying from 1 hr up to a couple of weeks. Numerous potential solutions were identified by the interviewees. Discussion: Losing items often goes together with limitations to the participation of residents. Many family caregivers are reluctant to replace lost items, as these items may get lost again.

  9. Instructional Topics in Educational Measurement (ITEMS) Module: Using Automated Processes to Generate Test Items

    Science.gov (United States)

    Gierl, Mark J.; Lai, Hollis

    2013-01-01

    Changes to the design and development of our educational assessments are resulting in the unprecedented demand for a large and continuous supply of content-specific test items. One way to address this growing demand is with automatic item generation (AIG). AIG is the process of using item models to generate test items with the aid of computer…

  10. New technologies for item monitoring

    Energy Technology Data Exchange (ETDEWEB)

    Abbott, J.A. [EG & G Energy Measurements, Albuquerque, NM (United States); Waddoups, I.G. [Sandia National Labs., Albuquerque, NM (United States)

    1993-12-01

    This report responds to the Department of Energy`s request that Sandia National Laboratories compare existing technologies against several advanced technologies as they apply to DOE needs to monitor the movement of material, weapons, or personnel for safety and security programs. The authors describe several material control systems, discuss their technologies, suggest possible applications, discuss assets and limitations, and project costs for each system. The following systems are described: WATCH system (Wireless Alarm Transmission of Container Handling); Tag system (an electrostatic proximity sensor); PANTRAK system (Personnel And Material Tracking); VRIS (Vault Remote Inventory System); VSIS (Vault Safety and Inventory System); AIMS (Authenticated Item Monitoring System); EIVS (Experimental Inventory Verification System); Metrox system (canister monitoring system); TCATS (Target Cueing And Tracking System); LGVSS (Light Grid Vault Surveillance System); CSS (Container Safeguards System); SAMMS (Security Alarm and Material Monitoring System); FOIDS (Fiber Optic Intelligence & Detection System); GRADS (Graded Radiation Detection System); and PINPAL (Physical Inventory Pallet).

  11. Reversed item bias: an integrative model.

    Science.gov (United States)

    Weijters, Bert; Baumgartner, Hans; Schillewaert, Niels

    2013-09-01

    In the recent methodological literature, various models have been proposed to account for the phenomenon that reversed items (defined as items for which respondents' scores have to be recoded in order to make the direction of keying consistent across all items) tend to lead to problematic responses. In this article we propose an integrative conceptualization of three important sources of reversed item method bias (acquiescence, careless responding, and confirmation bias) and specify a multisample confirmatory factor analysis model with 2 method factors to empirically test the hypothesized mechanisms, using explicit measures of acquiescence and carelessness and experimentally manipulated versions of a questionnaire that varies 3 item arrangements and the keying direction of the first item measuring the focal construct. We explain the mechanisms, review prior attempts to model reversed item bias, present our new model, and apply it to responses to a 4-item self-esteem scale (N = 306) and the 6-item Revised Life Orientation Test (N = 595). Based on the literature review and the empirical results, we formulate recommendations on how to use reversed items in questionnaires.

  12. Examining item difficulty and response time on perceptual ability test items.

    Science.gov (United States)

    Yang, Chien-Lin; O'Neill, Thomas R; Kramer, Gene A

    2002-01-01

    This study examined item calibration stability in relation to response time and the levels of item difficulty between different response time groups on a sample of 389 examinees responding to six different subtest items of the Perceptual Ability Test (PAT). The results indicated that no Differential Item Functioning (DIF) was found and a significant correlation coefficient of item difficulty was formed between slow and fast responders. Three distinct levels of difficulty emerged among the six subtests across groups. Slow responders spent significantly more time than fast responders on the four most difficult subtests. A positive significant relationship was found between item difficulty and response time across groups on the overall perceptual ability test items. Overall, this study found that: 1) the same underlying construct is being measured across groups, 2) the PAT scores were equally useful across groups, 3) different sources of item difficulty may exist among the six subtests, and 4) more difficult test items may require more time to answer.

  13. From concepts to lexical items.

    Science.gov (United States)

    Bierwisch, M; Schreuder, R

    1992-03-01

    In this paper we address the question how in language production conceptual structures are mapped onto lexical items. First we describe the lexical system in a fairly abstract way. Such a system consists of, among other things, a fixed set of basic lexical entries characterized by four groups of information: phonetic form, grammatical features, argument structure, and semantic form. A crucial assumption of the paper is that the meaning in a lexical entry has a complex internal structure composed of more primitive elements (decomposition). Some aspects of argument structure and semantic form and their interaction are discussed with respect to the issue of synonymy. We propose two different mappings involved in lexical access. One maps conceptual structures to semantic forms, and the other maps semantic forms to conceptual structures. Both mappings are context dependent and are many-to-many mappings. We present an elaboration of Levelt's (1989) model in which these processes interact with the grammatical encoder and the mental lexicon. Then we address the consequences of decomposition for processing models, especially the nature of the input of lexical access and the time course. Processing models that use the activation metaphor may have difficulties accounting for certain phenomena where a certain lemma triggers not one, but two or more word forms that have to be produced with other word forms in between.

  14. 41 CFR 101-30.701-2 - Item standardization code.

    Science.gov (United States)

    2010-07-01

    ... 41 Public Contracts and Property Management 2 2010-07-01 2010-07-01 true Item standardization code....7-Item Reduction Program § 101-30.701-2 Item standardization code. Item standardization code (ISC) means a code assigned an item in the supply system which identifies the item as authorized...

  15. Detection of Differential Item Functioning Using the Lasso Approach

    Science.gov (United States)

    Magis, David; Tuerlinckx, Francis; De Boeck, Paul

    2015-01-01

    This article proposes a novel approach to detect differential item functioning (DIF) among dichotomously scored items. Unlike standard DIF methods that perform an item-by-item analysis, we propose the "LR lasso DIF method": logistic regression (LR) model is formulated for all item responses. The model contains item-specific intercepts,…

  16. An Item Analysis and Validity Investigation of Bender Visual Motor Gestalt Test Score Items

    Science.gov (United States)

    Lambert, Nadine M.

    1971-01-01

    This investigation attempted to demonstrate the utility of standard item analysis procedures for selecting the most reliable and valid items for scoring Bender Visual Motor Gestalt Test test records. (Author)

  17. 76 FR 60474 - Commercial Item Handbook

    Science.gov (United States)

    2011-09-29

    ... Defense Acquisition Regulations System Commercial Item Handbook AGENCY: Defense Acquisition Regulations... Commercial Item Handbook. The purpose of the Handbook is to help acquisition personnel develop sound business... the Handbook. DATES: Comments should be submitted in writing to the address shown below on or before...

  18. Item nonresponse in questionnaire research with children

    NARCIS (Netherlands)

    Hox, J.J.; Borgers, N.

    2001-01-01

    This study investigates the effect of item and person characteristics on item nonresponse, for written questionnaires used with school children. Secondary analyses were done on questionnaire data collected in five distinct studies. To analyze the data, logistic multilevel analysis was used with the

  19. Restricted Interests and Teacher Presentation of Items

    Science.gov (United States)

    Stocco, Corey S.; Thompson, Rachel H.; Rodriguez, Nicole M.

    2011-01-01

    Restricted and repetitive behavior (RRB) is more pervasive, prevalent, frequent, and severe in individuals with autism spectrum disorders (ASDs) than in their typical peers. One subtype of RRB is restricted interests in items or activities, which is evident in the manner in which individuals engage with items (e.g., repetitious wheel spinning),…

  20. Item Calibration in Incomplete Testing Designs

    Science.gov (United States)

    Eggen, Theo J. H. M.; Verhelst, Norman D.

    2011-01-01

    This study discusses the justifiability of item parameter estimation in incomplete testing designs in item response theory. Marginal maximum likelihood (MML) as well as conditional maximum likelihood (CML) procedures are considered in three commonly used incomplete designs: random incomplete, multistage testing and targeted testing designs.…

  1. Grouping of Items in Mobile Web Questionnaires

    Science.gov (United States)

    Mavletova, Aigul; Couper, Mick P.

    2016-01-01

    There is some evidence that a scrolling design may reduce breakoffs in mobile web surveys compared to a paging design, but there is little empirical evidence to guide the choice of the optimal number of items per page. We investigate the effect of the number of items presented on a page on data quality in two types of questionnaires: with or…

  2. 38 CFR 3.1606 - Transportation items.

    Science.gov (United States)

    2010-07-01

    ... 38 Pensions, Bonuses, and Veterans' Relief 1 2010-07-01 2010-07-01 false Transportation items. 3... Burial Benefits § 3.1606 Transportation items. The transportation costs of those persons who come within... shipment. (6) Cost of transportation by common carrier including amounts paid as Federal taxes. (7) Cost...

  3. Interservice Availability of Multiservice Used Items.

    Science.gov (United States)

    1999-05-14

    t PIVT OF DISTRIBUTION STATEMENT A Approved for Public Releaseo r. Distribution Unlimited 19990817 034 INTERSERVICE AVAILABILTY OF MULTISERVICE USED...was initially cataloged or placed in the DoD supply system. As items matured, that is, as recurring item costs, usage rates, and technological

  4. The use of an item response theory-based disability item bank across diseases: accounting for differential item functioning.

    Science.gov (United States)

    Weisscher, Nadine; Glas, Cees A; Vermeulen, Marinus; De Haan, Rob J

    2010-05-01

    There is not a single universally accepted activity of daily living (ADL) instrument available to compare disability assessments across different patient groups. We developed a generic item bank of ADL items using item response theory, the Academic Medical Center Linear Disability Scale (ALDS). When comparing outcomes of the ALDS between patients groups, item characteristics of the ALDS should be comparable across groups. The aim of the study was to assess the differential item functioning (DIF) in a group of patients with various disorders to investigate the comparability across these groups. Cross-sectional, multicenter study including 1,283 in- and outpatients with a variety of disorders and disability levels. The sample was divided in two groups: (1) mainly neurological patients (n=497; vascular medicine, Parkinson's disease and neuromuscular disorders) and (2) patients from internal medicine (n=786; pulmonary diseases, chronic pain, rheumatoid arthritis, and geriatric patients). Eighteen of 72 ALDS items showed statistically significant DIF (P<0.01). However, the DIF could effectively be modeled by the introduction of disease-specific parameters. In the subgroups studied, DIF could be modeled in such a way that the ensemble of the items comprised a scale applicable in both groups.

  5. Item response theory - A first approach

    Science.gov (United States)

    Nunes, Sandra; Oliveira, Teresa; Oliveira, Amílcar

    2017-07-01

    The Item Response Theory (IRT) has become one of the most popular scoring frameworks for measurement data, frequently used in computerized adaptive testing, cognitively diagnostic assessment and test equating. According to Andrade et al. (2000), IRT can be defined as a set of mathematical models (Item Response Models - IRM) constructed to represent the probability of an individual giving the right answer to an item of a particular test. The number of Item Responsible Models available to measurement analysis has increased considerably in the last fifteen years due to increasing computer power and due to a demand for accuracy and more meaningful inferences grounded in complex data. The developments in modeling with Item Response Theory were related with developments in estimation theory, most remarkably Bayesian estimation with Markov chain Monte Carlo algorithms (Patz & Junker, 1999). The popularity of Item Response Theory has also implied numerous overviews in books and journals, and many connections between IRT and other statistical estimation procedures, such as factor analysis and structural equation modeling, have been made repeatedly (Van der Lindem & Hambleton, 1997). As stated before the Item Response Theory covers a variety of measurement models, ranging from basic one-dimensional models for dichotomously and polytomously scored items and their multidimensional analogues to models that incorporate information about cognitive sub-processes which influence the overall item response process. The aim of this work is to introduce the main concepts associated with one-dimensional models of Item Response Theory, to specify the logistic models with one, two and three parameters, to discuss some properties of these models and to present the main estimation procedures.

  6. AN ITEM RESPONSE MODEL WITH SINGLE PEAKED ITEM CHARACTERISTIC CURVES - THE PARELLA MODEL

    NARCIS (Netherlands)

    HOIJTINK, H; MOLENAAR, [No Value

    In this paper an item response model (the PARELLA model) designed specifically for the measurement of attitudes and preferences will be introduced. In contrast with the item response models currently used (e.g. the Rasch model and, the two and three parameter logistic model) the item characteristic

  7. Comparing Methods for Item Analysis: The Impact of Different Item-Selection Statistics on Test Difficulty

    Science.gov (United States)

    Jones, Andrew T.

    2011-01-01

    Practitioners often depend on item analysis to select items for exam forms and have a variety of options available to them. These include the point-biserial correlation, the agreement statistic, the B index, and the phi coefficient. Although research has demonstrated that these statistics can be useful for item selection, no research as of yet has…

  8. Developing and evaluating innovative items for the NCLEX: Part 2, item characteristics and cognitive processing.

    Science.gov (United States)

    Wendt, Anne; Harmes, J Christine

    2009-01-01

    This article is a continuation of the research on the development and evaluation of innovative item formats for the NCLEX examinations that was published in the March/April 2009 edition of Nurse Educator. The authors discuss the innovative item templates and evaluate the statistical characteristics and level of cognitive processing required to answer the examination items.

  9. Ramsay-Curve Item Response Theory for the Three-Parameter Logistic Item Response Model

    Science.gov (United States)

    Woods, Carol M.

    2008-01-01

    In Ramsay-curve item response theory (RC-IRT), the latent variable distribution is estimated simultaneously with the item parameters of a unidimensional item response model using marginal maximum likelihood estimation. This study evaluates RC-IRT for the three-parameter logistic (3PL) model with comparisons to the normal model and to the empirical…

  10. Evaluation of Northwest University, Kano Post-UTME Test Items Using Item Response Theory

    Science.gov (United States)

    Bichi, Ado Abdu; Hafiz, Hadiza; Bello, Samira Abdullahi

    2016-01-01

    High-stakes testing is used for the purposes of providing results that have important consequences. Validity is the cornerstone upon which all measurement systems are built. This study applied the Item Response Theory principles to analyse Northwest University Kano Post-UTME Economics test items. The developed fifty (50) economics test items was…

  11. Measuring student learning with item response theory

    Directory of Open Access Journals (Sweden)

    Young-Jin Lee

    2008-01-01

    Full Text Available We investigate short-term learning from hints and feedback in a Web-based physics tutoring system. Both the skill of students and the difficulty and discrimination of items were determined by applying item response theory (IRT to the first answers of students who are working on for-credit homework items in an introductory Newtonian physics course. We show that after tutoring a shifted logistic item response function with lower discrimination fits the students’ second responses to an item previously answered incorrectly. Student skill decreased by 1.0 standard deviation when students used no tutoring between their (incorrect first and second attempts, which we attribute to “item-wrong bias.” On average, using hints or feedback increased students’ skill by 0.8 standard deviation. A skill increase of 1.9 standard deviation was observed when hints were requested after viewing, but prior to attempting to answer, a particular item. The skill changes measured in this way will enable the use of IRT to assess students based on their second attempt in a tutoring environment.

  12. Item response theory for measurement validity.

    Science.gov (United States)

    Yang, Frances M; Kao, Solon T

    2014-06-01

    Item response theory (IRT) is an important method of assessing the validity of measurement scales that is underutilized in the field of psychiatry. IRT describes the relationship between a latent trait (e.g., the construct that the scale proposes to assess), the properties of the items in the scale, and respondents' answers to the individual items. This paper introduces the basic premise, assumptions, and methods of IRT. To help explain these concepts we generate a hypothetical scale using three items from a modified, binary (yes/no) response version of the Center for Epidemiological Studies-Depression scale that was administered to 19,399 respondents. We first conducted a factor analysis to confirm the unidimensionality of the three items and then proceeded with Mplus software to construct the 2-Parameter Logic (2-PL) IRT model of the data, a method which allows for estimates of both item discrimination and item difficulty. The utility of this information both for clinical purposes and for scale construction purposes is discussed.

  13. Bookmark locations and item response model selection in the presence of local item dependence.

    Science.gov (United States)

    Skaggs, Gary

    2007-01-01

    The bookmark standard setting procedure is a popular method for setting performance standards on state assessment programs. This study reanalyzed data from an application of the bookmark procedure to a passage-based test that used the Rasch model to create the item ordered booklet. Several problems were noted in this implementation of the bookmark procedure, including disagreement among the SMEs about the correct order of items in the bookmark booklet, performance level descriptions of the passing standard being based on passage difficulty as well as item difficulty, and the presence of local item dependence within reading passages. Bookmark item locations were recalculated for the IRT three-parameter model and the multidimensional bifactor model. The results showed that the order of item locations was very similar for all three models when items of high difficulty and low discrimination were excluded. However, the items whose positions were the most discrepant between models were not the items that the SMEs disagreed about the most in the original standard setting. The choice of latent trait model did not address problems of item order disagreement. Implications for the use of the bookmark method in the presence of local item dependence are discussed.

  14. Sequential item pricing for unlimited supply

    OpenAIRE

    2010-01-01

    We investigate the extent to which price updates can increase the revenue of a seller with little prior information on demand. We study prior-free revenue maximization for a seller with unlimited supply of n item types facing m myopic buyers present for k < log n days. For the static (k = 1) case, Balcan et al. [2] show that one random item price (the same on each item) yields revenue within a \\Theta(log m + log n) factor of optimum and this factor is tight. We define the hereditary maximizer...

  15. NHRIC (National Health Related Items Code)

    Data.gov (United States)

    U.S. Department of Health & Human Services — The National Health Related Items Code (NHRIC) is a system for identification and numbering of marketed device packages that is compatible with other numbering...

  16. NHRIC (National Health Related Items Code)

    Data.gov (United States)

    U.S. Department of Health & Human Services — The National Health Related Items Code (NHRIC) is a system for identification and numbering of marketed device packages that is compatible with other numbering...

  17. National Hospice Item Set (HIS) data

    Data.gov (United States)

    U.S. Department of Health & Human Services — This data set includes the national averages (mean) for quality measure scores of Medicare-certified hospice agencies calculated from the Hospice Item Set (HIS) for...

  18. The Analysis of The Multiple Choice Item

    Institute of Scientific and Technical Information of China (English)

    曹吴惠

    2009-01-01

    In this article, the author analyzes in detail the advantages, disadvantages and forming of the multiple choice item in examinations. On its basis, the author also exploree some aspects the teacher should pay attention to while setting an examination paper.

  19. Extending Item Response Theory to Online Homework

    CERN Document Server

    Kortemeyer, Gerd

    2014-01-01

    Item Response Theory becomes an increasingly important tool when analyzing ``Big Data'' gathered from online educational venues. However, the mechanism was originally developed in traditional exam settings, and several of its assumptions are infringed upon when deployed in the online realm. For a large enrollment physics course for scientists and engineers, the study compares outcomes from IRT analyses of exam and homework data, and then proceeds to investigate the effects of each confounding factor introduced in the online realm. It is found that IRT yields the correct trends for learner ability and meaningful item parameters, yet overall agreement with exam data is moderate. It is also found that learner ability and item discrimination is over wide ranges robust with respect to model assumptions and introduced noise, less so than item difficulty.

  20. Basic Stand Alone Carrier Line Items PUF

    Data.gov (United States)

    U.S. Department of Health & Human Services — This release contains the Basic Stand Alone (BSA) Carrier Line Items Public Use Files (PUF) with information from Medicare Carrier claims. The CMS BSA Carrier Line...

  1. Calorimetry of low mass Pu239 items

    Energy Technology Data Exchange (ETDEWEB)

    Cremers, Teresa L [Los Alamos National Laboratory; Sampson, Thomas E [Los Alamos National Laboratory

    2010-01-01

    Calorimetric assay has the reputation of providing the highest precision and accuracy of all nondestructive assay measurements. Unfortunately, non-destructive assay practitioners and measurement consumers often extend, inappropriately, the high precision and accuracy of calorimetric assay to very low mass items. One purpose of this document is to present more realistic expectations for the random uncertainties associated with calorimetric assay for weapons grade plutonium items with masses of 200 grams or less.

  2. New developments in reprocessing semicritical items.

    Science.gov (United States)

    Rutala, William A; Weber, David J

    2013-05-01

    Semicritical medical devices are defined as items that come into contact with mucous membranes or nonintact skin (eg, gastrointestinal endoscopes). Such medical devices require minimally high-level disinfection. Because many of these items are temperature sensitive, low-temperature chemical methods must be used rather than steam sterilization. Strict adherence to current guidelines is required because more outbreaks have been linked to inadequately cleaned or disinfected endoscopes undergoing high-level disinfection than any other medical device.

  3. A Review of Test Item Types

    Science.gov (United States)

    2008-03-06

    for integration with the CAT- ASVAB as it exists now. The CAT-ASVAB currently employs only dichotomously scored items using the 3 parameter logistic ...model ( 3PL ; Lord & Novick, 1968). IRT models appropriate for polytomously scored items (e.g., Muraki, 1997) are available, and mixing of models is not...problematic within the IRT framework per se. Nevertheless, the current CAT-ASVAB infrastructure is configured to work with the 3PL model only, and

  4. Investigating Item Exposure Control Methods in Computerized Adaptive Testing

    Science.gov (United States)

    Ozturk, Nagihan Boztunc; Dogan, Nuri

    2015-01-01

    This study aims to investigate the effects of item exposure control methods on measurement precision and on test security under various item selection methods and item pool characteristics. In this study, the Randomesque (with item group sizes of 5 and 10), Sympson-Hetter, and Fade-Away methods were used as item exposure control methods. Moreover,…

  5. 15 CFR 734.3 - Items subject to the EAR.

    Science.gov (United States)

    2010-01-01

    ... 15 Commerce and Foreign Trade 2 2010-01-01 2010-01-01 false Items subject to the EAR. 734.3... EXPORT ADMINISTRATION REGULATIONS § 734.3 Items subject to the EAR. (a) Except for items excluded in paragraph (b) of this section, the following items are subject to the EAR: (1) All items in the...

  6. An emotional functioning item bank of 24 items for computerized adaptive testing (CAT) was established

    DEFF Research Database (Denmark)

    Petersen, Morten Aa.; Gamper, Eva-Maria; Costantini, Anna

    2016-01-01

    OBJECTIVE: To improve measurement precision, the European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group is developing an item bank for computerized adaptive testing (CAT) of emotional functioning (EF). The item bank will be within the conceptual framework...... of the widely used EORTC Quality of Life questionnaire (QLQ-C30). STUDY DESIGN AND SETTING: On the basis of literature search and evaluations by international samples of experts and cancer patients, 38 candidate items were developed. The psychometric properties of the items were evaluated in a large...... international sample of cancer patients. This included evaluations of dimensionality, item response theory (IRT) model fit, differential item functioning (DIF), and of measurement precision/statistical power. RESULTS: Responses were obtained from 1,023 cancer patients from four countries. The evaluations showed...

  7. Psychometric Changes on Item Difficulty Due to Item Review by Examinees

    Directory of Open Access Journals (Sweden)

    Elena C. Papanastasiou

    2015-01-01

    Full Text Available If good measurement depends in part on the estimation of accurate item characteristics, it is essential that test developers become aware of discrepancies that may exist on the item parameters before and after item review. The purpose of this study was to examine the answer changing patterns of students while taking paper-and-pencil multiple choice exams, and to examine how these changes affect the estimation of item difficulty parameters. The results of this study have shown that item review by examinees does produce some changes to the examinee ability estimates and to the item difficulty parameters. In addition, these effects are more pronounced in shorter tests than in longer tests. In turn, these small changes produce larger effects when estimating the changes in the information values of each student's test score.

  8. Alternate item types: continuing the quest for authentic testing.

    Science.gov (United States)

    Wendt, Anne; Kenny, Lorraine E

    2009-03-01

    Many test developers suggest that multiple-choice items can be used to evaluate critical thinking if the items are focused on measuring higher order thinking ability. The literature supports the use of alternate item types to assess additional competencies, such as higher level cognitive processing and critical thinking, as well as ways to allow examinees to demonstrate their competencies differently. This research study surveyed nurses after taking a test composed of alternate item types paired with multiple-choice items. The participants were asked to provide opinions regarding the items and the item formats. Demographic information was asked. In addition, information was collected as the participants responded to the items. The results of this study reveal that the participants thought that, in general, the items were more authentic and allowed them to demonstrate their competence better than multiple-choice items did. Further investigation into the optimal blend of alternate items and multiple-choice items is needed.

  9. Measuring response styles in Likert items.

    Science.gov (United States)

    Böckenholt, Ulf

    2017-03-01

    The recently proposed class of item response tree models provides a flexible framework for modeling multiple response processes. This feature is particularly attractive for understanding how response styles may affect answers to attitudinal questions. Facilitating the disassociation of response styles and attitudinal traits, item response tree models can provide powerful process tests of how different response formats may affect the measurement of substantive traits. In an empirical study, 3 response formats were used to measure the 2-dimensional Personal Need for Structure traits. Different item response tree models are proposed to capture the response styles for each of the response formats. These models show that the response formats give rise to similar trait measures but different response-style effects. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  10. Physics Items and Student's Performance at Enem

    CERN Document Server

    Gonçalves, Wanderley P

    2013-01-01

    The Brazilian National Assessment of Secondary Education (ENEM, Exame Nacional do Ensino M\\'edio) has changed in 2009: from a self-assessment of competences at the end of high school to an assessment that allows access to college and student financing. From a single general exam, now there are tests in four areas: Mathematics, Language, Natural Sciences and Social Sciences. A new Reference Matrix is build with components as cognitive domains, competencies, skills and knowledge objects; also, the methodological framework has changed, using now Item Response Theory to provide scores and allowing longitudinal comparison of results from different years, providing conditions for monitoring high school quality in Brazil. We present a study on the issues discussed in Natural Science Test of ENEM over the years 2009, 2010 and 2011. Qualitative variables are proposed to characterize the items, and data from students' responses in Physics items were analysed. The qualitative analysis reveals the characteristics of the ...

  11. Validation of Physics Standardized Test Items

    Science.gov (United States)

    Marshall, Jill

    2008-10-01

    The Texas Physics Assessment Team (TPAT) examined the Texas Assessment of Knowledge and Skills (TAKS) to determine whether it is a valid indicator of physics preparation for future course work and employment, and of the knowledge and skills needed to act as an informed citizen in a technological society. We categorized science items from the 2003 and 2004 10th and 11th grade TAKS by content area(s) covered, knowledge and skills required to select the correct answer, and overall quality. We also analyzed a 5000 student sample of item-level results from the 2004 11th grade exam using standard statistical methods employed by test developers (factor analysis and Item Response Theory). Triangulation of our results revealed strengths and weaknesses of the different methods of analysis. The TAKS was found to be only weakly indicative of physics preparation and we make recommendations for increasing the validity of standardized physics testing..

  12. Differential item functioning due to gender between depression and anxiety items among Chilean adolescents.

    Science.gov (United States)

    Bares, Cristina; Andrade, Fernando; Delva, Jorge; Grogan-Kaylor, Andrew; Kamata, Akihito

    2012-07-01

    Although much is known about the higher prevalence of anxiety and depressive disorders among adolescent females, less is known about the differential item endorsement due to gender in items of scales commonly used to measure anxiety and depression. We conducted a study to examine if adolescent males and females from Chile differed on how they endorsed the items of the Youth Self Report (YSR) anxious/depressed problem scale. We used data from a cross-sectional sample consisting of 925 participants (mean age = 14, SD 1.3, 49% females) of low to lower-middle socioeconomic status. A two-parameter logistic (2PL) IRT DIF model was fit. s revealed differential item functioning (DIF) by gender for six of the 13 items, with adolescent females being more likely to endorse a depression item while males were found more likely to endorse anxiety items. Findings suggest that items found in commonly used measures of anxiety and depression symptoms may not equally capture the true levels of these behavioural problems in adolescent males and females. Given the high levels of mental disorders in Chile and the surrounding countries, further attention should be focused on increasing the number of empirical studies examining potential gender differences in the assessment of mental health problems among Latin American populations to better aid our understanding of the phenomenology and determinants of these problems in the region.

  13. Developing Multidimensional Likert Scales Using Item Factor Analysis: The Case of Four-Point Items

    Science.gov (United States)

    Asún, Rodrigo A.; Rdz-Navarro, Karina; Alvarado, Jesús M.

    2016-01-01

    This study compares the performance of two approaches in analysing four-point Likert rating scales with a factorial model: the classical factor analysis (FA) and the item factor analysis (IFA). For FA, maximum likelihood and weighted least squares estimations using Pearson correlation matrices among items are compared. For IFA, diagonally weighted…

  14. Evaluating Item Discrimination Power of WHOQOL-BREF from an Item Response Model Perspectives

    Science.gov (United States)

    Lin, Ting Hsiang; Yao, Grace

    2009-01-01

    Quality of life (QOL) has become an important component of health. By using the methodology of psychometric theory, we examine the item properties of the WHOQOL-BRIEF. Samejima's graded response model with natural metrics of the logistic response function was fitted. The results showed items with negative natures were less discriminating. Items…

  15. Application of Unidimensional Item Response Models to Tests with Items Sensitive to Secondary Dimensions

    Science.gov (United States)

    Zhang, Bo

    2008-01-01

    In this research, the author addresses whether the application of unidimensional item response models provides valid interpretation of test results when administering items sensitive to multiple latent dimensions. Overall, the present study found that unidimensional models are quite robust to the violation of the unidimensionality assumption due…

  16. Are Inferential Reading Items More Susceptible to Cultural Bias than Literal Reading Items?

    Science.gov (United States)

    Banks, Kathleen

    2012-01-01

    The purpose of this article is to illustrate a seven-step process for determining whether inferential reading items were more susceptible to cultural bias than literal reading items. The seven-step process was demonstrated using multiple-choice data from the reading portion of a reading/language arts test for fifth and seventh grade Hispanic,…

  17. Modeling Local Item Dependence in Cloze and Reading Comprehension Test Items Using Testlet Response Theory

    Science.gov (United States)

    Baghaei, Purya; Ravand, Hamdollah

    2016-01-01

    In this study the magnitudes of local dependence generated by cloze test items and reading comprehension items were compared and their impact on parameter estimates and test precision was investigated. An advanced English as a foreign language reading comprehension test containing three reading passages and a cloze test was analyzed with a…

  18. Item Structure as a Determinant of Item Difficulty in Verbal Analogies

    Science.gov (United States)

    Ace, Merle C.; Dawis, Rene V.

    1973-01-01

    Because no previous study was found in which both blank position in the item stem and positional placement of the correct response were studied simultaneously, it was decided to investigate the influence of these two factors, alone and in combination, on the difficulty level of verbal analogy items. (Authors)

  19. Secondary Item Procurement Lead Time Study.

    Science.gov (United States)

    1984-03-01

    NONEhhhhhh % I - 15 III& 1-- -NAION&°I° oI Ai OF .° NAIU REUONF TEST CANA -~~ 7 .. SECONDARY ITEM PROCUREMENT LEAD TIME STUDY __ LOGISTICS SYSTEMS...ASSISTANT SECRETARY OF THE AIR FORCE (RD&L) DIRECTOR, DEFENSE LOGISTICS AGENCY SUBJECT: Secondary Item Procurement Lead Time Study A recent report by the...determination of procurement lead time. A plan for the study is enclosed. In order to achieve the objectives of the procurement lead time study as well as the

  20. MULTI-ITEM FAIR EXCHANGE SCHEME

    Institute of Scientific and Technical Information of China (English)

    Zheng Dong; Chen Kefei

    2002-01-01

    As more business is conduced over the Internet, the fair exchange problem assumes increasing importance. However, the problem of multi-party fair exchange has not been studied as widely as the more fundamental problem of 2-party fair exchange. Recently, Franklin and Tsudik proposed two protocols for n-party multi-item exchange on FC'98, SUCEX-1 and SUCES-2. This paper first gives an attack on the proposed protocol SUCEX-1, then presents two protocols for multi-item exchange, one is an improved protocol of SUCEX-1, another is the extension of protocol SUCEX-2.

  1. MULTI—ITEM FAIR EXCHANGE SCHEME

    Institute of Scientific and Technical Information of China (English)

    ZhengDong; ChenKefei

    2002-01-01

    As more ubsiness is conduced over the Internet,the fair exchange problem assumes increasing importance,However,the problem of multi-party fair exchange has not been studied as widely as the more fundamental problem of 2-party fair exchange,Recently,Franklin and Tsudik proposed two protocols for n-parth multi-item exchange on FC'98,SUCEX-1 and SUCES-2,This paper first gives an attack on the proposed protocol SUCEX-1,then presents two protocols for multi-item exchange,one is an improoved protocol of SUCEX-1,another is the extension of protocol SUCEX-2.

  2. Comparison on Computed Tomography using industrial items

    DEFF Research Database (Denmark)

    Angel, Jais Andreas Breusch; De Chiffre, Leonardo

    2014-01-01

    In a comparison involving 27 laboratories from 8 countries, measurements on two common industrial items, a polymer part and a metal part, were carried out using X-ray Computed Tomography. All items were measured using coordinate measuring machines before and after circulation, with reference...... measurement uncertainties in the range 1.5–5.5 μm, showing a good stability over the 6 months of the circulation. The comparison has shown that CT measurements on the industrial parts used lie in the range 6–53 μm, with maximum values up to 158 μm....

  3. Fluctuation formula for nonreversible dynamics in the thermostated Lorentz gas

    Science.gov (United States)

    Dolowschiák, M.; Kovács, Z.

    2002-12-01

    We investigate numerically the validity of the Gallavotti-Cohen fluctuation formula in the two- and three-dimensional periodic Lorentz gas subjected to constant electric and magnetic fields and thermostated by the Gaussian isokinetic thermostat. The magnetic field breaks the time reversal symmetry, and by choosing its orientation with respect to the lattice, one can have either a generalized reversing symmetry or no reversibility at all. Our results indicate that the scaling property described by the fluctuation formula may be approximately valid for large fluctuations even in the absence of reversibility.

  4. Principles and procedures of considering item sequence effects in the development of calibrated item pools: Conceptual analysis and empirical illustration

    Directory of Open Access Journals (Sweden)

    Safir Yousfi

    2012-12-01

    Full Text Available Item responses can be context-sensitive. Consequently, composing test forms flexibly from a calibrated item pool requires considering potential context effects. This paper focuses on context effects that are related to the item sequence. It is argued that sequence effects are not necessarily a violation of item response theory but that item response theory offers a powerful tool to analyze them. If sequence effects are substantial, test forms cannot be composed flexibly on the basis of a calibrated item pool, which precludes applications like computerized adaptive testing. In contrast, minor sequence effects do not thwart applications of calibrated item pools. Strategies to minimize the detrimental impact of sequence effects on item parameters are discussed and integrated into a nomenclature that addresses the major features of item calibration designs. An example of an item calibration design demonstrates how this nomenclature can guide the process of developing a calibrated item pool.

  5. Soviet Cybernetics: Recent News Items, Number Thirteen.

    Science.gov (United States)

    Holland, Wade B.

    An issue of "Soviet Cybernetics: Recent News Items" consists of English translations of the leading recent Soviet contributions to the study of cybernetics. Articles deal with cybernetics in the 21st Century; the Soviet State Committee on Science and Technology; economic reforms in Rudnev's ministry; an interview with Rudnev; Dnepr-2; Dnepr-2…

  6. 10 CFR 74.55 - Item monitoring.

    Science.gov (United States)

    2010-01-01

    ..., except for reactor components measuring at least one meter in length and weighing in excess of 30... 10 Energy 2 2010-01-01 2010-01-01 false Item monitoring. 74.55 Section 74.55 Energy NUCLEAR REGULATORY COMMISSION (CONTINUED) MATERIAL CONTROL AND ACCOUNTING OF SPECIAL NUCLEAR MATERIAL...

  7. The Fantastic Four of Mathematics Assessment Items

    Science.gov (United States)

    Greenlees, Jane

    2011-01-01

    In this article, the author makes reference to four comic book characters to make the point that together they are a formidable team, but on their own they are vulnerable. She examines the four components of mathematics assessment items and the need for implicit instruction within the classroom for student success. Just like the "Fantastic Four"…

  8. 47 CFR 65.820 - Included items.

    Science.gov (United States)

    2010-10-01

    ... 47 Telecommunication 3 2010-10-01 2010-10-01 false Included items. 65.820 Section 65.820 Telecommunication FEDERAL COMMUNICATIONS COMMISSION (CONTINUED) COMMON CARRIER SERVICES (CONTINUED) INTERSTATE RATE...) Cash working capital. The average amount of investor-supplied capital needed to provide funds for...

  9. Algorithmic test design using classical item parameters

    NARCIS (Netherlands)

    van der Linden, Willem J.; Adema, Jos J.

    1988-01-01

    Two optimalization models for the construction of tests with a maximal value of coefficient alpha are given. Both models have a linear form and can be solved by using a branch-and-bound algorithm. The first model assumes an item bank calibrated under the Rasch model and can be used, for instance, wh

  10. Estimating the Importance of Differential Item Functioning.

    Science.gov (United States)

    Rudas, Tamas; Zwick, Rebecca

    1997-01-01

    The mixture index of fit (T. Rudas et al, 1994) is used to estimate the fraction of a population for which differential item functioning (DIF) occurs, and this approach is compared to the Mantel Haenszel test of DIF. The proposed noniterative procedure provides information about data portions contributing to DIF. (SLD)

  11. Item Feature Effects in Evolution Assessment

    Science.gov (United States)

    Nehm, Ross H.; Ha, Minsu

    2011-01-01

    Despite concerted efforts by science educators to understand patterns of evolutionary reasoning in science students and teachers, the vast majority of evolution education studies have failed to carefully consider or control for item feature effects in knowledge measurement. Our study explores whether robust contextualization patterns emerge within…

  12. Bayesian item selection criteria for adaptive testing

    NARCIS (Netherlands)

    Linden, van der Wim J.

    1996-01-01

    R.J. Owen (1975) proposed an approximate empirical Bayes procedure for item selection in adaptive testing. The procedure replaces the true posterior by a normal approximation with closed-form expressions for its first two moments. This approximation was necessary to minimize the computational comple

  13. Bayesian item selection criteria for adaptive testing

    NARCIS (Netherlands)

    van der Linden, Willem J.

    1996-01-01

    R.J. Owen (1975) proposed an approximate empirical Bayes procedure for item selection in adaptive testing. The procedure replaces the true posterior by a normal approximation with closed-form expressions for its first two moments. This approximation was necessary to minimize the computational comple

  14. Estimating the Importance of Differential Item Functioning.

    Science.gov (United States)

    Rudas, Tamas; Zwick, Rebecca

    1997-01-01

    The mixture index of fit (T. Rudas et al, 1994) is used to estimate the fraction of a population for which differential item functioning (DIF) occurs, and this approach is compared to the Mantel Haenszel test of DIF. The proposed noniterative procedure provides information about data portions contributing to DIF. (SLD)

  15. Item Feature Effects in Evolution Assessment

    Science.gov (United States)

    Nehm, Ross H.; Ha, Minsu

    2011-01-01

    Despite concerted efforts by science educators to understand patterns of evolutionary reasoning in science students and teachers, the vast majority of evolution education studies have failed to carefully consider or control for item feature effects in knowledge measurement. Our study explores whether robust contextualization patterns emerge within…

  16. An Analytical Method of Identifying Biased Test Items.

    Science.gov (United States)

    Plake, Barbara S.; Hoover, H. D.

    1979-01-01

    A follow-up technique is needed to identify items contributing to items-by-groups interaction when using an ANOVA procedure to examine a test for biased items. The method described includes distribution theory for assessing level of significance and is sensitive to items at all difficulty levels. (Author/GSK)

  17. A Process for Reviewing and Evaluating Generated Test Items

    Science.gov (United States)

    Gierl, Mark J.; Lai, Hollis

    2016-01-01

    Testing organization needs large numbers of high-quality items due to the proliferation of alternative test administration methods and modern test designs. But the current demand for items far exceeds the supply. Test items, as they are currently written, evoke a process that is both time-consuming and expensive because each item is written,…

  18. The Licensing of Negative Sensitive Items in Jordanian Arabic

    Science.gov (United States)

    Alsarayreh, Atef

    2012-01-01

    This study investigates the licensing conditions on Negative Sensitive Items (NSIs) in Jordanian Arabic (JA). JA exhibits both types of NSIs that are discussed in the literature: Negative Polarity Items (NPIs) and Negative Concord Items (NCIs). Although these two sets of items seem to form a natural class in the sense that they show certain…

  19. An optimized item-based collaborative filtering recommendation algorithm based on item genre prediction

    Science.gov (United States)

    Zhang, De-Jia

    2009-07-01

    With the fast development of Internet, many systems have emerged in e-commerce applications to support the product recommendation. Collaborative filtering is one of the most promising techniques in recommender systems, providing personalized recommendations to users based on their previously expressed preferences in the form of ratings and those of other similar users. In practice, with the adding of user and item scales, user-item ratings are becoming extremely sparsity and recommender systems utilizing traditional collaborative filtering are facing serious challenges. To address the issue, this paper presents an approach to compute item genre similarity, through mapping each item with a corresponding descriptive genre, and computing similarity between genres as similarity, then make basic predictions according to those similarities to lower sparsity of the user-item ratings. After that, item-based collaborative filtering steps are taken to generate predictions. Compared with previous methods, the presented collaborative filtering employs the item genre similarity can alleviate the sparsity issue in the recommender systems, and can improve accuracy of recommendation.

  20. 17 CFR 260.7a-16 - Inclusion of items, differentiation between items and answers, omission of instructions.

    Science.gov (United States)

    2010-04-01

    ... 17 Commodity and Securities Exchanges 3 2010-04-01 2010-04-01 false Inclusion of items... INDENTURE ACT OF 1939 Formal Requirements § 260.7a-16 Inclusion of items, differentiation between items and... application, statement, or report shall contain all of the items of the form as well as the answers...

  1. Differential Item Functioning Analysis of the Science and Mathematics Items in the University Entrance Examinations in Turkey

    Science.gov (United States)

    Kalaycioglu, Dilara Bakan; Berberoglu, Giray

    2011-01-01

    This study is aimed to detect differential item functioning (DIF) items across gender groups, analyze item content for the possible sources of DIF, and eventually investigate the effect of DIF items on the criterion-related validity of the test scores in the quantitative section of the university entrance examination (UEE) in Turkey. The reason…

  2. Differential Item Functioning on the International Personality Item Poolâ s Neuroticism Scale

    OpenAIRE

    McBride, Nadine LeBarron

    2008-01-01

    As use of the public-domain International Personality Item Pool (IPIP) scales has grown significantly over the past decade (Goldberg, Johnson, Eber, Hogan, Ashton, Cloninger, & Gough, 2006) research on the psychometric properties of the items and scales have become increasingly important. This research study examines the IPIP scale constructed to measure the Five Factor Model (FFM) domain of Neuroticism (as measured by the NEO-PI-R) for occurrences of differential functioning a...

  3. Item Screening in Graphical Loglinear Rasch Models

    DEFF Research Database (Denmark)

    Kreiner, Svend; Christensen, Karl Bang

    2011-01-01

    the initial item analysis may disclose a great deal of spurious and misleading evidence of DIF and local dependence that has to disposed of during the modelling procedure. Like graphical models, graphical loglinear Rasch models possess Markov properties that are useful during the statistical analysis......In behavioural sciences, local dependence and DIF are common, and purification procedures that eliminate items with these weaknesses often result in short scales with poor reliability. Graphical loglinear Rasch models (Kreiner & Christensen, in Statistical Methods for Quality of Life Studies, ed....... by M. Mesbah, F.C. Cole & M.T. Lee, Kluwer Academic, pp. 187–203, 2002) where uniform DIF and uniform local dependence are permitted solve this dilemma by modelling the local dependence and DIF. Identifying loglinear Rasch models by a stepwise model search is often very time consuming, since...

  4. Teoria de la respuesta al item

    OpenAIRE

    2009-01-01

    A preocupação com medidas de traços psicológicos é antiga, sendo que muitos estudos e propostas de métodos foram desenvolvidos no sentido de alcançar este objetivo. Entre os trabalhos propostos, destaca-se a Teoria da Resposta ao Item (TRI) que, a princípio, veio completar limitações da Teoria Clássica de Medidas, empregada em larga escala até hoje na medida de traços psicológicos. O ponto principal da TRI é que ela leva em consideração o item particularmente, sem relevar os escores totais; p...

  5. Global positive polarity items and obligatory exhaustivity

    Directory of Open Access Journals (Sweden)

    Benjamin Spector

    2014-11-01

    Full Text Available I argue for a distinction between two types of positive polarity items (PPIs which has not been recognized so far. While for some PPIs, anti-licensing is a strictly local phenomenon, for other PPIs anti-licensing should be stated as a global condition. I aim to contribute to a principled explanation for the distribution of a significant subset of global PPIs, by relating it to specific semantic properties of the relevant items. More specifically, I argue that PPIs such as soit ... soit ..., quelques and almost trigger obligatory exhaustivity effects and scalar inferences, and that independently motivated constraints regarding the generation of such inferences can account for their distribution. The paper also briefly addresses the case of other global PPIs, e.g., at least, for which a similar account is not straightforwardly available. http://dx.doi.org/10.3765/sp.7.11 BibTeX info

  6. An item factor analysis and item response theory-based revision of the Everyday Discrimination Scale.

    Science.gov (United States)

    Stucky, Brian D; Gottfredson, Nisha C; Panter, A T; Daye, Charles E; Allen, Walter R; Wightman, Linda F

    2011-04-01

    The Everyday Discrimination Scale (EDS), a widely used measure of daily perceived discrimination, is purported to be unidimensional, to function well among African Americans, and to have adequate construct validity. Two separate studies and data sources were used to examine and cross-validate the psychometric properties of the EDS. In Study 1, an exploratory factor analysis was conducted on a sample of African American law students (N = 589), providing strong evidence of local dependence, or nuisance multidimensionality within the EDS. In Study 2, a separate nationally representative community sample (N = 3,527) was used to model the identified local dependence in an item factor analysis (i.e., bifactor model). Next, item response theory (IRT) calibrations were conducted to obtain item parameters. A five-item, revised-EDS was then tested for gender differential item functioning (in an IRT framework). Based on these analyses, a summed score to IRT-scaled score translation table is provided for the revised-EDS. Our results indicate that the revised-EDS is unidimensional, with minimal differential item functioning, and retains predictive validity consistent with the original scale.

  7. NBC Contamination Survivability, Large Item Exteriors

    Science.gov (United States)

    2007-11-02

    Perform these tasks (timed) in the standard garment. .(3) Perform these tasks (timed) in mission-oriented protective posture level 4 (MOPP4). (4...bring the chamber to the environmental conditions specified for the test. Condition the test item until it has equilibrated at 30±50 C. Temperature and...condition that all essential operations can be continued in the lowest protective posture consistent with the mission and threat, and without long-term

  8. Validity of Center for Epidemiologic Studies Depression (CES-D scale in a sample of Iraq and Afghanistan Veterans

    Directory of Open Access Journals (Sweden)

    Ana R Quiñones

    2016-04-01

    Full Text Available Objectives: Optimal depression screening necessitates measurement tools that are valid across varied populations and in the presence of comorbidities. Methods: This study assessed the test properties of two versions of the Center for Epidemiologic Studies Depression scale against psychiatric diagnoses established by the Mini International Neuropsychiatric Interview among a clinical sample of US Veterans deployed during Operations Enduring Freedom, Iraqi Freedom, and New Dawn. Participants (N = 359 recruited from two Department of Veterans Affairs hospitals completed a clinical interview, structured diagnostic interview, and self-reported measures. Results: Based on diagnostic interview and the Diagnostic and Statistical Manual of Mental Disorders 4th Edition criteria, 29.5% of the sample met diagnostic criteria for major depressive disorder and 26.5% met diagnostic criteria for post-traumatic stress disorder. Both Center for Epidemiologic Studies Depression-20 and Center for Epidemiologic Studies Depression-10 scales performed well and almost identically against the Mini International Neuropsychiatric Interview-major depressive disorder in identifying Veterans with major depressive disorder (Center for Epidemiologic Studies Depression-20 area under the Receiver Operating Characteristic curve 91%; Center for Epidemiologic Studies Depression-10 area under the ROC curve 90%. Overall, higher cut points for the Center for Epidemiologic Studies Depression scales performed better in correctly identifying true positives and true negatives for major depressive disorder (Center for Epidemiologic Studies Depression-20 cut point 18+ sensitivity 92% specificity 72%; Center for Epidemiologic Studies Depression-10 cut point 10+ sensitivity 92% specificity 69%. Conclusions: The specificity of the Center for Epidemiologic Studies Depression scales was poor among Veterans with co-occurring post-traumatic stress disorder (13% and 16%. Veterans with post-traumatic stress disorder who have a positive depression screen should have a more thorough assessment of mental health symptoms and comorbidities, rather than immediate diagnosis of and treatment for depression.

  9. CTTITEM: SAS macro and SPSS syntax for classical item analysis.

    Science.gov (United States)

    Lei, Pui-Wa; Wu, Qiong

    2007-08-01

    This article describes the functions of a SAS macro and an SPSS syntax that produce common statistics for conventional item analysis including Cronbach's alpha, item difficulty index (p-value or item mean), and item discrimination indices (D-index, point biserial and biserial correlations for dichotomous items and item-total correlation for polytomous items). These programs represent an improvement over the existing SAS and SPSS item analysis routines in terms of completeness and user-friendliness. To promote routine evaluations of item qualities in instrument development of any scale, the programs are available at no charge for interested users. The program codes along with a brief user's manual that contains instructions and examples are downloadable from suen.ed.psu.edu/-pwlei/plei.htm.

  10. Assessing the Psychometric Properties of Alternative Items for Certification.

    Science.gov (United States)

    Krogh, Mary Anne; Muckle, Timothy

    Alternative items were added as scored items to the National Certification Examination for Nurse Anesthetists (NCE) in 2010. A common concern related to the new items has been their measurement attributes. This study was undertaken to evaluate the psychometric impact of adding these items to the examination. Candidates had a significantly higher ability estimate in alternative items than in multiple choice questions and 6.7 percent of test candidates performed significantly differently in alternative item formats. The ability estimates of multiple choice questions correlated at r = .58. The alternative items took significantly longer time to answer than standard multiple choice questions and discriminated to a higher degree than MCQs. The alternative items exhibited unidimensionality to the same degree as MCQs and the BIC confirmed the Rasch model as acceptable for scoring. The new item types were found to have acceptable attributes for inclusion in the certification program.

  11. Pengendalian Persediaan Primary Items dalam Logistik Konstruksi

    Directory of Open Access Journals (Sweden)

    Lady Lisya

    2016-09-01

    Full Text Available Construction logistics are activities that consist of ordering, storage and transportation of materials of construction projects. Storage material is logistics activity that ensure the availability of materials in project site. Generally, material storage activities have been conducted at the project site. Logistics construction is aimed to support the project activities that the completion schedule has been set. Construction logistics issues is determining the schedule of ordering materials so that the project can be implemented on schedule. The purpose of research is to determine the optimum ordering period for the primary items on the main building structure construction and designing inventory control cards as a mechanism for monitoring procurement of materials. This research has been obtained optimal ordering period for the primary items of main building structure with elements of the work using Fixed Period Requirement method. Inventories were already meet the material requirement of each period. Material management has been conducted based grouping approach as many as 31 groups. In addition, this research has proposed the inventory control cards as an instrument for material procurement monitoring. The implications of inventory control cards are coordinate contracting parties with vendors to plan the replenishment  of materials to meet the work schedule. Further research can be developed with other aspects such as integrated material order system between contractors and vendors to consider the safety stock. In addition, the information system for planning material is an important consideration for construction projects with large scale so that the companies can plan primary items inventory and other materials in the projects completion more easily, quickly and accurately.

  12. A NEW ASSOCIATION RULE MINING BASED ON FREQUENT ITEM SET

    Directory of Open Access Journals (Sweden)

    Ms. Sanober Shaikh

    2011-09-01

    Full Text Available In this paper a new mining algorithm is defined based on frequent item set. Apriori Algorithm scans the database every time when it finds the frequent item set so it is very time consuming and at each step it generates candidate item set. So for large databases it takes lots of space to store candidate item set. The defined algorithm scans the database at the start only once and then makes the undirected item set graph. From this graph by considering minimum support it finds the frequent item set and by considering the minimum confidence it generates the association rule. If database and minimum support is changed, the new algorithm finds the new frequent items by scanning undirected item set graph. That is why it’s executing efficiency is improved distinctly compared to traditional algorithm.

  13. Small-Item Contact Test Method, FY11 Release

    Science.gov (United States)

    2012-07-01

    sections on an item. Often designated in a sampling plan and marked on an item to enable quick reference. • sessile drop : A liquid droplet that is...Full item contamination illustration 25 9. Localized contamination illustration 25 10. Gross sample collection technique for small items 32 Blank...decontaminant performance. contamination set: A specific contamination density, drop volume, and deposition pattern combination used for dose confirmation

  14. Negative affect impairs associative memory but not item memory.

    OpenAIRE

    Bisby, J. A.; Burgess, N.

    2014-01-01

    The formation of associations between items and their context has been proposed to rely on mechanisms distinct from those supporting memory for a single item. Although emotional experiences can profoundly affect memory, our understanding of how it interacts with different aspects of memory remains unclear. We performed three experiments to examine the effects of emotion on memory for items and their associations. By presenting neutral and negative items with background contexts, Experiment 1 ...

  15. Curriculum, Translation, and Differential Functioning of Measurement and Geometry Items

    Science.gov (United States)

    Emenogu, Barnabas C.; Childs, Ruth A.

    2005-01-01

    A test item exhibits differential item functioning (DIF) if students with the same ability find it differentially difficult. When the item is administered in French and English, differences in language difficulty and meaning are the most likely explanations. However, curriculum differences may also contribute to DIF. The responses of Ontario…

  16. 41 CFR 101-27.209-1 - GSA stock items.

    Science.gov (United States)

    2010-07-01

    ...-Management of Shelf-Life Materials § 101-27.209-1 GSA stock items. Shelf-life items that meet the criteria... 41 Public Contracts and Property Management 2 2010-07-01 2010-07-01 true GSA stock items. 101-27.209-1 Section 101-27.209-1 Public Contracts and Property Management Federal Property Management...

  17. The Identification of Radex Properties in Objective Test Items.

    Science.gov (United States)

    Seddon, G. M.; And Others

    1981-01-01

    In a Monte Carlo simulation, a methodology was developed to investigate the existence of radex properties among objective test items. In an experiment with items covering four categories of Bloom's cognitive domain taxonomy, the items did not have the factorial properties of a radex with four levels of complexity. (Author/BW)

  18. Effect of Multiple Testing Adjustment in Differential Item Functioning Detection

    Science.gov (United States)

    Kim, Jihye; Oshima, T. C.

    2013-01-01

    In a typical differential item functioning (DIF) analysis, a significance test is conducted for each item. As a test consists of multiple items, such multiple testing may increase the possibility of making a Type I error at least once. The goal of this study was to investigate how to control a Type I error rate and power using adjustment…

  19. Multistage Computerized Adaptive Testing with Uniform Item Exposure

    Science.gov (United States)

    Edwards, Michael C.; Flora, David B.; Thissen, David

    2012-01-01

    This article describes a computerized adaptive test (CAT) based on the uniform item exposure multi-form structure (uMFS). The uMFS is a specialization of the multi-form structure (MFS) idea described by Armstrong, Jones, Berliner, and Pashley (1998). In an MFS CAT, the examinee first responds to a small fixed block of items. The items comprising…

  20. Stochastic Approximation Methods for Latent Regression Item Response Models

    Science.gov (United States)

    von Davier, Matthias; Sinharay, Sandip

    2010-01-01

    This article presents an application of a stochastic approximation expectation maximization (EM) algorithm using a Metropolis-Hastings (MH) sampler to estimate the parameters of an item response latent regression model. Latent regression item response models are extensions of item response theory (IRT) to a latent variable model with covariates…

  1. A method for designing IRT-based item banks

    NARCIS (Netherlands)

    Boekkooi-Timminga, Ellen

    1990-01-01

    Since 1985 several procedures for computerized test construction using linear programing techniques have been described in the literature. To apply these procedures successfully, suitable item banks are needed. The problem of designing item banks based on item response theory (IRT) is addressed. A p

  2. Higher-Order Item Response Models for Hierarchical Latent Traits

    Science.gov (United States)

    Huang, Hung-Yu; Wang, Wen-Chung; Chen, Po-Hsi; Su, Chi-Ming

    2013-01-01

    Many latent traits in the human sciences have a hierarchical structure. This study aimed to develop a new class of higher order item response theory models for hierarchical latent traits that are flexible in accommodating both dichotomous and polytomous items, to estimate both item and person parameters jointly, to allow users to specify…

  3. The Applicability of Interactive Item Templates in Varied Knowledge Types

    Science.gov (United States)

    Koong, Chorng-Shiuh; Wu, Chi-Ying

    2011-01-01

    A well-edited assessment can enhance student's learning motives. Applicability of items, which includes item content and template, plays a crucial role in authoring a good assessment. Templates in discussion contain not only conventional true & false, multiple choice, completion item and short answer but also of those interactive ones. Methods…

  4. DIF Trees: Using Classification Trees to Detect Differential Item Functioning

    Science.gov (United States)

    Vaughn, Brandon K.; Wang, Qiu

    2010-01-01

    A nonparametric tree classification procedure is used to detect differential item functioning for items that are dichotomously scored. Classification trees are shown to be an alternative procedure to detect differential item functioning other than the use of traditional Mantel-Haenszel and logistic regression analysis. A nonparametric…

  5. Guide to good practices for the development of test items

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1997-01-01

    While the methodology used in developing test items can vary significantly, to ensure quality examinations, test items should be developed systematically. Test design and development is discussed in the DOE Guide to Good Practices for Design, Development, and Implementation of Examinations. This guide is intended to be a supplement by providing more detailed guidance on the development of specific test items. This guide addresses the development of written examination test items primarily. However, many of the concepts also apply to oral examinations, both in the classroom and on the job. This guide is intended to be used as guidance for the classroom and laboratory instructor or curriculum developer responsible for the construction of individual test items. This document focuses on written test items, but includes information relative to open-reference (open book) examination test items, as well. These test items have been categorized as short-answer, multiple-choice, or essay. Each test item format is described, examples are provided, and a procedure for development is included. The appendices provide examples for writing test items, a test item development form, and examples of various test item formats.

  6. Is the New Item Priority Effect an Experimental Artifact?

    Science.gov (United States)

    Andre, Thomas

    This research was directed at determining whether the new item priority (NIP) effect in free recall was a result of an experimental artifact produced by the joint action of the serial position effect and the randomization of items over trials, or a consequence of a strategy of recalling newer items before older ones. In the experiment, subjects…

  7. The Feasibility of Single-Item Measures for Organizational Justice

    Science.gov (United States)

    Jordan, Jeremy S.; Turner, Brian A.

    2008-01-01

    Researchers in a number of disciplines have examined the utility of single-item measures for both affective and cognitive constructs. While these authors have indicated that, under certain circumstances, the use of single-item measures is appropriate, there remains concern regarding the reliability and validity of single-item measures. This study…

  8. The Applicability of Interactive Item Templates in Varied Knowledge Types

    Science.gov (United States)

    Koong, Chorng-Shiuh; Wu, Chi-Ying

    2011-01-01

    A well-edited assessment can enhance student's learning motives. Applicability of items, which includes item content and template, plays a crucial role in authoring a good assessment. Templates in discussion contain not only conventional true & false, multiple choice, completion item and short answer but also of those interactive ones. Methods…

  9. Gating Items: Definition, Significance, and Need for Further Study

    Science.gov (United States)

    Judd, Wallace

    2009-01-01

    Over the past twenty years in performance testing a specific item type with distinguishing characteristics has arisen time and time again. It's been invented independently by dozens of test development teams. And yet this item type is not recognized in the research literature. This article is an invitation to investigate the item type, evaluate…

  10. The Development of Practical Item Analysis Program for Indonesian Teachers

    Science.gov (United States)

    Muhson, Ali; Lestari, Barkah; Supriyanto; Baroroh, Kiromim

    2017-01-01

    Item analysis has essential roles in the learning assessment. The item analysis program is designed to measure student achievement and instructional effectiveness. This study was aimed to develop item-analysis program and verify its feasibility. This study uses a Research and Development (R & D) model. The procedure includes designing and…

  11. A Procedure for Linear Polychotomous Scoring of Test Items

    Science.gov (United States)

    1993-10-01

    associated with the response categories of test items . When tests are scored using these scoring weights, test reliability increases. The new procedure is...program POLY. The example demonstrates how polyweighting can be used to calibrate and score test items drawn from an item bank that is too large to

  12. Primary Science Assessment Item Setters' Misconceptions Concerning Biological Science Concepts

    Science.gov (United States)

    Boo, Hong Kwen

    2007-01-01

    Assessment is an integral and vital part of teaching and learning, providing feedback on progress through the assessment period to both learners and teachers. However, if test items are flawed because of misconceptions held by the question setter, then such test items are invalid as assessment tools. Moreover, such flawed items are also likely to…

  13. Classroom Test Writing: Effects of Item Format on Test Quality.

    Science.gov (United States)

    Torabi-Parizi, Rosa; Campbell, Noma Jo

    1982-01-01

    Investigates the effects of varying the placement of blanks and the number of options available in multiple-choice items on the reliability of fifth-grade students' scores. Results indicate that scores on three-choice item tests were not less reliable than scores on four-choice item tests. A similar finding was found regarding the placement of…

  14. Influence of Item Direction on Student Responses in Attitude Assessment.

    Science.gov (United States)

    Campbell, Noma Jo; Grissom, Stephen

    To investigate the effects of wording in attitude test items, a five-point Likert-type rating scale was administered to 173 undergraduate education majors. The test measured attitudes toward college and self, and contained 38 positively-worded items. Thirty-eight negatively-worded items were also written to parallel the positive statements.…

  15. Analysis of Differential Item Functioning in the NAEP History Assessment.

    Science.gov (United States)

    Zwick, Rebecca; Ercikan, Kadriye

    The Mantel-Haenszel approach for investigating differential item functioning (DIF) was applied to U.S. history items that were administered as part of the National Assessment of Educational Progress (NAEP). DIF analyses were based on the responses of 7,743 students in grade 11. On some items, Blacks, Hispanics, and females performed more poorly…

  16. 41 CFR 101-28.306-6 - Sensitive items.

    Science.gov (United States)

    2010-07-01

    ... for embarrassment of GSA and customer agencies, the level of customer complaints, and control as an accountable item of personal property. Each customer activity shall take all appropriate measures necessary to... 28.3-Customer Supply Centers § 101-28.306-6 Sensitive items. Many items stocked by the CSCs may...

  17. Social Desirability Scale Values of Locus of Control Items

    Science.gov (United States)

    Kestenbaum, Joel M.

    1976-01-01

    Subjects rated each item in Rotter's I-E Scale for its social desirability value. Social desirability scale values (SDSV) of paired items were compared with one another. Results indicate that paired items are not similar in their SDSV, thus enabling subjects to respond on the basis of social desirability. (Author/DEP)

  18. 17 CFR 229.1100 - (Item 1100) General.

    Science.gov (United States)

    2010-04-01

    ... AND CONSERVATION ACT OF 1975-REGULATION S-K Asset-Backed Securities (Regulation AB) § 229.1100 (Item... source of various disclosure items and requirements for “asset-backed securities” filings under the... this Regulation AB, including the definition of “asset-backed security,” are set forth in Item 1101. (b...

  19. Item validity of the Myers-Briggs Type Indicator.

    Science.gov (United States)

    Tzeng, O C; Outcalt, D; Boyer, S L; Ware, R; Landis, D

    1984-06-01

    The present study presents a brief summary of four extensive psychometric analyses of the Myers-Briggs Type Indicator (MBTI) items. Positive empirical evidence supports the MBTI item validity. However, several measurement issues on item construction were raised to caution the future users.

  20. Differential PIXE analysis of Mesoamerican jewelry items

    Science.gov (United States)

    Demortier, G.; Ruvalcaba-Sil, J. L.

    1996-09-01

    Gold jewelry items of Mesoamerican origin (from Peru, Colombia, Mexico, etc,…) are usually cast in Tumbaga: a man-made gold-copper-silver alloy containing a large proportion of copper. In order to give the objects a colour close to that of pure gold, ancient Mesoamerican goldsmiths experimented with a procedure to eliminate less noble metals (like copper and silver) from the surface. RBS may be used to identify a possible enrichment in gold in the most external layer of the items but due to the low capability of this technique to separate scattered particles on gold and silver and due to the low Rutherford cross section for α-particles on copper by comparison with those on gold, the determination of the exact depth depletion of copper cannot be easily reached. Differential PIXE is an appropriate method to achieve this goal. It takes the relative X-ray intensities of Cu and Au lines into account. By varying the incident proton energy, this ratio is modified in a completely different way if the sample is homogeneous or exhibits a layered or depth profile structure.

  1. The Academic Medical Center Linear Disability Score (ALDS) item bank: item response theory analysis in a mixed patient population.

    Science.gov (United States)

    Holman, Rebecca; Weisscher, Nadine; Glas, Cees A W; Dijkgraaf, Marcel G W; Vermeulen, Marinus; de Haan, Rob J; Lindeboom, Robert

    2005-12-29

    Currently, there is a lot of interest in the flexible framework offered by item banks for measuring patient relevant outcomes. However, there are few item banks, which have been developed to quantify functional status, as expressed by the ability to perform activities of daily life. This paper examines the measurement properties of the Academic Medical Center linear disability score item bank in a mixed population. This paper uses item response theory to analyse data on 115 of 170 items from a total of 1002 respondents. These were: 551 (55%) residents of supported housing, residential care or nursing homes; 235 (23%) patients with chronic pain; 127 (13%) inpatients on a neurology ward following a stroke; and 89 (9%) patients suffering from Parkinson's disease. Of the 170 items, 115 were judged to be clinically relevant. Of these 115 items, 77 were retained in the item bank following the item response theory analysis. Of the 38 items that were excluded from the item bank, 24 had either been presented to fewer than 200 respondents or had fewer than 10% or more than 90% of responses in the category 'can carry out'. A further 11 items had different measurement properties for younger and older or for male and female respondents. Finally, 3 items were excluded because the item response theory model did not fit the data. The Academic Medical Center linear disability score item bank has promising measurement characteristics for the mixed patient population described in this paper. Further studies will be needed to examine the measurement properties of the item bank in other populations.

  2. The Academic Medical Center Linear Disability Score (ALDS) item bank: item response theory analysis in a mixed patient population

    Science.gov (United States)

    Holman, Rebecca; Weisscher, Nadine; Glas, Cees AW; Dijkgraaf, Marcel GW; Vermeulen, Marinus; de Haan, Rob J; Lindeboom, Robert

    2005-01-01

    Background Currently, there is a lot of interest in the flexible framework offered by item banks for measuring patient relevant outcomes. However, there are few item banks, which have been developed to quantify functional status, as expressed by the ability to perform activities of daily life. This paper examines the measurement properties of the Academic Medical Center linear disability score item bank in a mixed population. Methods This paper uses item response theory to analyse data on 115 of 170 items from a total of 1002 respondents. These were: 551 (55%) residents of supported housing, residential care or nursing homes; 235 (23%) patients with chronic pain; 127 (13%) inpatients on a neurology ward following a stroke; and 89 (9%) patients suffering from Parkinson's disease. Results Of the 170 items, 115 were judged to be clinically relevant. Of these 115 items, 77 were retained in the item bank following the item response theory analysis. Of the 38 items that were excluded from the item bank, 24 had either been presented to fewer than 200 respondents or had fewer than 10% or more than 90% of responses in the category 'can carry out'. A further 11 items had different measurement properties for younger and older or for male and female respondents. Finally, 3 items were excluded because the item response theory model did not fit the data. Conclusion The Academic Medical Center linear disability score item bank has promising measurement characteristics for the mixed patient population described in this paper. Further studies will be needed to examine the measurement properties of the item bank in other populations. PMID:16381611

  3. The Academic Medical Center Linear Disability Score (ALDS item bank: item response theory analysis in a mixed patient population

    Directory of Open Access Journals (Sweden)

    Vermeulen Marinus

    2005-12-01

    Full Text Available Abstract Background Currently, there is a lot of interest in the flexible framework offered by item banks for measuring patient relevant outcomes. However, there are few item banks, which have been developed to quantify functional status, as expressed by the ability to perform activities of daily life. This paper examines the measurement properties of the Academic Medical Center linear disability score item bank in a mixed population. Methods This paper uses item response theory to analyse data on 115 of 170 items from a total of 1002 respondents. These were: 551 (55% residents of supported housing, residential care or nursing homes; 235 (23% patients with chronic pain; 127 (13% inpatients on a neurology ward following a stroke; and 89 (9% patients suffering from Parkinson's disease. Results Of the 170 items, 115 were judged to be clinically relevant. Of these 115 items, 77 were retained in the item bank following the item response theory analysis. Of the 38 items that were excluded from the item bank, 24 had either been presented to fewer than 200 respondents or had fewer than 10% or more than 90% of responses in the category 'can carry out'. A further 11 items had different measurement properties for younger and older or for male and female respondents. Finally, 3 items were excluded because the item response theory model did not fit the data. Conclusion The Academic Medical Center linear disability score item bank has promising measurement characteristics for the mixed patient population described in this paper. Further studies will be needed to examine the measurement properties of the item bank in other populations.

  4. The 12-item World Health Organization Disability Assessment Schedule II (WHO-DAS II: a nonparametric item response analysis

    Directory of Open Access Journals (Sweden)

    Fernandez Ana

    2010-05-01

    Full Text Available Abstract Background Previous studies have analyzed the psychometric properties of the World Health Organization Disability Assessment Schedule II (WHO-DAS II using classical omnibus measures of scale quality. These analyses are sample dependent and do not model item responses as a function of the underlying trait level. The main objective of this study was to examine the effectiveness of the WHO-DAS II items and their options in discriminating between changes in the underlying disability level by means of item response analyses. We also explored differential item functioning (DIF in men and women. Methods The participants were 3615 adult general practice patients from 17 regions of Spain, with a first diagnosed major depressive episode. The 12-item WHO-DAS II was administered by the general practitioners during the consultation. We used a non-parametric item response method (Kernel-Smoothing implemented with the TestGraf software to examine the effectiveness of each item (item characteristic curves and their options (option characteristic curves in discriminating between changes in the underliying disability level. We examined composite DIF to know whether women had a higher probability than men of endorsing each item. Results Item response analyses indicated that the twelve items forming the WHO-DAS II perform very well. All items were determined to provide good discrimination across varying standardized levels of the trait. The items also had option characteristic curves that showed good discrimination, given that each increasing option became more likely than the previous as a function of increasing trait level. No gender-related DIF was found on any of the items. Conclusions All WHO-DAS II items were very good at assessing overall disability. Our results supported the appropriateness of the weights assigned to response option categories and showed an absence of gender differences in item functioning.

  5. Confirmatory Factor Analysis of the M5-50: An Implementation of the International Personality Item Pool Item Set

    Science.gov (United States)

    Socha, Alan; Cooper, Christopher A.; McCord, David M.

    2010-01-01

    Goldberg's International Personality Item Pool (IPIP; Goldberg, 1999) provides researchers with public-domain, free-access personality measurement scales that are proxies of well-established published scales. One of the more commonly used IPIP sets employs 50 items to measure the 5 broad domains of the 5-factor model, with 10 items per factor. The…

  6. Multidimensional CAT Item Selection Methods for Domain Scores and Composite Scores with Item Exposure Control and Content Constraints

    Science.gov (United States)

    Yao, Lihua

    2014-01-01

    The intent of this research was to find an item selection procedure in the multidimensional computer adaptive testing (CAT) framework that yielded higher precision for both the domain and composite abilities, had a higher usage of the item pool, and controlled the exposure rate. Five multidimensional CAT item selection procedures (minimum angle;…

  7. Multidimensional CAT Item Selection Methods for Domain Scores and Composite Scores with Item Exposure Control and Content Constraints

    Science.gov (United States)

    Yao, Lihua

    2014-01-01

    The intent of this research was to find an item selection procedure in the multidimensional computer adaptive testing (CAT) framework that yielded higher precision for both the domain and composite abilities, had a higher usage of the item pool, and controlled the exposure rate. Five multidimensional CAT item selection procedures (minimum angle;…

  8. Do Images Influence Assessment in Anatomy? Exploring the Effect of Images on Item Difficulty and Item Discrimination

    Science.gov (United States)

    Vorstenbosch, Marc A. T. M.; Klaassen, Tim P. F. M.; Kooloos, Jan G. M.; Bolhuis, Sanneke M.; Laan, Roland F. J. M.

    2013-01-01

    Anatomists often use images in assessments and examinations. This study aims to investigate the influence of different types of images on item difficulty and item discrimination in written assessments. A total of 210 of 460 students volunteered for an extra assessment in a gross anatomy course. This assessment contained 39 test items grouped in…

  9. A Comparison of Anchor-Item Designs for the Concurrent Calibration of Large Banks of Likert-Type Items

    Science.gov (United States)

    Garcia-Perez, Miguel A.; Alcala-Quintana, Rocio; Garcia-Cueto, Eduardo

    2010-01-01

    Current interest in measuring quality of life is generating interest in the construction of computerized adaptive tests (CATs) with Likert-type items. Calibration of an item bank for use in CAT requires collecting responses to a large number of candidate items. However, the number is usually too large to administer to each subject in the…

  10. Line-item veto may pass this month

    Science.gov (United States)

    U.S. House and Senate Republicans have cleared the way for legislation to give the President the power of a line-item-veto in time for the 1998 budget season.The line-item veto, as proposed, would allow the President to remove from appropriations bills selected appropriations items, new entitlement spending, or narrowly targeted tax breaks. Once a line item is cut from the budget, Congress would have to pass a separate appropriations bill for that individual item; that new, separate bill would then be subject to the traditional Presidential veto.

  11. A generalized item response tree model for psychological assessments.

    Science.gov (United States)

    Jeon, Minjeong; De Boeck, Paul

    2016-09-01

    A new item response theory (IRT) model with a tree structure has been introduced for modeling item response processes with a tree structure. In this paper, we present a generalized item response tree model with a flexible parametric form, dimensionality, and choice of covariates. The utilities of the model are demonstrated with two applications in psychological assessments for investigating Likert scale item responses and for modeling omitted item responses. The proposed model is estimated with the freely available R package flirt (Jeon et al., 2014b).

  12. The Development of Practical Item Analysis Program for Indonesian Teachers

    Directory of Open Access Journals (Sweden)

    Ali Muhson

    2017-04-01

    Full Text Available Item analysis has essential roles in the learning assessment. The item analysis program is designed to measure student achievement and instructional effectiveness. This study was aimed to develop item-analysis program and verify its feasibility. This study uses a Research and Development (R & D model. The procedure includes designing and developing a product, validating, and testing the product. The data were collected through documentations, questionnaires, and interviews. This study successfully developed item analysis program, namely AnBuso. It is developed based on classical test theory (CTT. It was practical and applicable for Indonesian teachers to analyse test items

  13. Item Discrimination and Type I Error in the Detection of Differential Item Functioning

    Science.gov (United States)

    Li, Yanju; Brooks, Gordon P.; Johanson, George A.

    2012-01-01

    In 2009, DeMars stated that when impact exists there will be Type I error inflation, especially with larger sample sizes and larger discrimination parameters for items. One purpose of this study is to present the patterns of Type I error rates using Mantel-Haenszel (MH) and logistic regression (LR) procedures when the mean ability between the…

  14. Item Analysis and Differential Item Functioning of a Brief Conduct Problem Screen

    Science.gov (United States)

    Wu, Johnny; King, Kevin M.; Witkiewitz, Katie; Racz, Sarah Jensen; McMahon, Robert J.

    2012-01-01

    Research has shown that boys display higher levels of childhood conduct problems than girls, and Black children display higher levels than White children, but few studies have tested for scalar equivalence of conduct problems across gender and race. The authors conducted a 2-parameter item response theory (IRT) model to examine item…

  15. An NCME Instructional Module on Item-Fit Statistics for Item Response Theory Models

    Science.gov (United States)

    Ames, Allison J.; Penfield, Randall D.

    2015-01-01

    Drawing valid inferences from item response theory (IRT) models is contingent upon a good fit of the data to the model. Violations of model-data fit have numerous consequences, limiting the usefulness and applicability of the model. This instructional module provides an overview of methods used for evaluating the fit of IRT models. Upon completing…

  16. Detecting measurement disturbance effects: the graphical display of item characteristics.

    Science.gov (United States)

    Schumacker, Randall E

    2015-01-01

    Traditional identification of misfitting items in Rasch measurement models have interpreted the Infit and Outfit z standardized statistic. A more recent approach made possible by Winsteps is to specify "group = 0" in the control file and subsequently view the item characteristic curve for each item against the true probability curve. The graphical display reveals whether an item follows the true probability curve or deviates substantially, thus indicating measurement disturbance. Probability of item response and logit ability are easily copied into data vectors in R software then graphed. An example control file, output item data, and subsequent preparation of an overlay graph for misfit items are presented using Winsteps and R software. For comparison purposes the data are also analyzed using a multi-dimensional (MD) mapping procedure.

  17. Feed mechanism and method for feeding minute items

    Science.gov (United States)

    Stringer, Timothy Kent; Yerganian, Simon Scott

    2012-11-06

    A feeding mechanism and method for feeding minute items, such as capacitors, resistors, or solder preforms. The mechanism is adapted to receive a plurality of the randomly-positioned and randomly-oriented extremely small or minute items, and to isolate, orient, and position the items in a specific repeatable pickup location wherefrom they may be removed for use by, for example, a computer-controlled automated assembly machine. The mechanism comprises a sliding shelf adapted to receive and support the items; a wiper arm adapted to achieve a single even layer of the items; and a pushing arm adapted to push the items into the pickup location. The mechanism can be adapted for providing the items with a more exact orientation, and can also be adapted for use in a liquid environment.

  18. Item Response Theory Using Hierarchical Generalized Linear Models

    Directory of Open Access Journals (Sweden)

    Hamdollah Ravand

    2015-03-01

    Full Text Available Multilevel models (MLMs are flexible in that they can be employed to obtain item and person parameters, test for differential item functioning (DIF and capture both local item and person dependence. Papers on the MLM analysis of item response data have focused mostly on theoretical issues where applications have been add-ons to simulation studies with a methodological focus. Although the methodological direction was necessary as a first step to show how MLMs can be utilized and extended to model item response data, the emphasis needs to be shifted towards providing evidence on how applications of MLMs in educational testing can provide the benefits that have been promised. The present study uses foreign language reading comprehension data to illustrate application of hierarchical generalized models to estimate person and item parameters, differential item functioning (DIF, and local person dependence in a three-level model.

  19. Identifying Unbiased Items for Screening Preschoolers for Disruptive Behavior Problems.

    Science.gov (United States)

    Studts, Christina R; Polaha, Jodi; van Zyl, Michiel A

    2016-10-25

    OBJECTIVE : Efficient identification and referral to behavioral services are crucial in addressing early-onset disruptive behavior problems. Existing screening instruments for preschoolers are not ideal for pediatric primary care settings serving diverse populations. Eighteen candidate items for a new brief screening instrument were examined to identify those exhibiting measurement bias (i.e., differential item functioning, DIF) by child characteristics. METHOD : Parents/guardians of preschool-aged children (N = 900) from four primary care settings completed two full-length behavioral rating scales. Items measuring disruptive behavior problems were tested for DIF by child race, sex, and socioeconomic status using two approaches: item response theory-based likelihood ratio tests and ordinal logistic regression. RESULTS : Of 18 items, eight were identified with statistically significant DIF by at least one method. CONCLUSIONS : The bias observed in 8 of 18 items made them undesirable for screening diverse populations of children. These items were excluded from the new brief screening tool.

  20. The effects of relative food item size on optimal tooth cusp sharpness during brittle food item processing.

    Science.gov (United States)

    Berthaume, Michael A; Dumont, Elizabeth R; Godfrey, Laurie R; Grosse, Ian R

    2014-12-01

    Teeth are often assumed to be optimal for their function, which allows researchers to derive dietary signatures from tooth shape. Most tooth shape analyses normalize for tooth size, potentially masking the relationship between relative food item size and tooth shape. Here, we model how relative food item size may affect optimal tooth cusp radius of curvature (RoC) during the fracture of brittle food items using a parametric finite-element (FE) model of a four-cusped molar. Morphospaces were created for four different food item sizes by altering cusp RoCs to determine whether optimal tooth shape changed as food item size changed. The morphospaces were also used to investigate whether variation in efficiency metrics (i.e. stresses, energy and optimality) changed as food item size changed. We found that optimal tooth shape changed as food item size changed, but that all optimal morphologies were similar, with one dull cusp that promoted high stresses in the food item and three cusps that acted to stabilize the food item. There were also positive relationships between food item size and the coefficients of variation for stresses in food item and optimality, and negative relationships between food item size and the coefficients of variation for stresses in the enamel and strain energy absorbed by the food item. These results suggest that relative food item size may play a role in selecting for optimal tooth shape, and the magnitude of these selective forces may change depending on food item size and which efficiency metric is being selected.

  1. Dependability of technical items: Problems of standardization

    Science.gov (United States)

    Fedotova, G. A.; Voropai, N. I.; Kovalev, G. F.

    2016-12-01

    This paper is concerned with problems blown up in the development of a new version of the Interstate Standard GOST 27.002 "Industrial product dependability. Terms and definitions". This Standard covers a wide range of technical items and is used in numerous regulations, specifications, standard and technical documentation. A currently available State Standard GOST 27.002-89 was introduced in 1990. Its development involved a participation of scientists and experts from different technical areas, its draft was debated in different audiences and constantly refined, so it was a high quality document. However, after 25 years of its application it's become necessary to develop a new version of the Standard that would reflect the current understanding of industrial dependability, accounting for the changes taking place in Russia in the production, management and development of various technical systems and facilities. The development of a new version of the Standard makes it possible to generalize on a terminological level the knowledge and experience in the area of reliability of technical items, accumulated over a quarter of the century in different industries and reliability research schools, to account for domestic and foreign experience of standardization. Working on the new version of the Standard, we have faced a number of issues and problems on harmonization with the International Standard IEC 60500-192, caused first of all by different approaches to the use of terms and differences in the mentalities of experts from different countries. The paper focuses on the problems related to the chapter "Maintenance, restoration and repair", which caused difficulties for the developers to harmonize term definitions both with experts and the International Standard, which is mainly related to differences between the Russian concept and practice of maintenance and repair and foreign ones.

  2. The Prediction of TOEFL Reading Comprehension Item Difficulty for Expository Prose Passages for Three Item Types: Main Idea, Inference, and Supporting Idea Items.

    Science.gov (United States)

    Freedle, Roy; Kostin, Irene

    Prediction of the difficulty (equated delta) of a large sample (n=213) of reading comprehension items from the Test of English as a Foreign Language (TOEFL) was studied using main idea, inference, and supporting statement items. A related purpose was to examine whether text and text-related variables play a significant role in predicting item…

  3. Item Response Theory with Covariates (IRT-C): Assessing Item Recovery and Differential Item Functioning for the Three-Parameter Logistic Model

    Science.gov (United States)

    Tay, Louis; Huang, Qiming; Vermunt, Jeroen K.

    2016-01-01

    In large-scale testing, the use of multigroup approaches is limited for assessing differential item functioning (DIF) across multiple variables as DIF is examined for each variable separately. In contrast, the item response theory with covariate (IRT-C) procedure can be used to examine DIF across multiple variables (covariates) simultaneously. To…

  4. Editorial Changes and Item Performance: Implications for Calibration and Pretesting

    Directory of Open Access Journals (Sweden)

    Heather Stoffel

    2014-11-01

    Full Text Available Previous research on the impact of text and formatting changes on test-item performance has produced mixed results. This matter is important because it is generally acknowledged that any change to an item requires that it be recalibrated. The present study investigated the effects of seven classes of stylistic changes on item difficulty, discrimination, and response time for a subset of 65 items that make up a standardized test for physician licensure completed by 31,918 examinees in 2012. One of two versions of each item (original or revised was randomly assigned to examinees such that each examinee saw only two experimental items, with each item being administered to approximately 480 examinees. The stylistic changes had little or no effect on item difficulty or discrimination; however, one class of edits -' changing an item from an open lead-in (incomplete statement to a closed lead-in (direct question -' did result in slightly longer response times. Data for nonnative speakers of English were analyzed separately with nearly identical results. These findings have implications for the conventional practice of repretesting (or recalibrating items that have been subjected to minor editorial changes.

  5. Item analysis of in use multiple choice questions in pharmacology

    Science.gov (United States)

    Kaur, Mandeep; Singla, Shweta; Mahajan, Rajiv

    2016-01-01

    Background: Multiple choice questions (MCQs) are a common method of assessment of medical students. The quality of MCQs is determined by three parameters such as difficulty index (DIF I), discrimination index (DI), and distracter efficiency (DE). Objectives: The objective of this study is to assess the quality of MCQs currently in use in pharmacology and discard the MCQs which are not found useful. Materials and Methods: A class test of central nervous system unit was conducted in the Department of Pharmacology. This test comprised 50 MCQs/items and 150 distracters. A correct response to an item was awarded one mark with no negative marking for incorrect response. Each item was analyzed for three parameters such as DIF I, DI, and DE. Results: DIF of 38 (76%) items was in the acceptable range (P = 30–70%), 11 (22%) items were too easy (P > 70%), and 1 (2%) item was too difficult (P 0.35), of 12 (24%) items was good (d = 0.20–0.34), and of 7 (14%) items was poor (d < 0.20). A total of 50 items had 150 distracters. Among these, 27 (18%) were nonfunctional distracters (NFDs) and 123 (82%) were functional distracters. Items with one NFD were 11 and with two NFDs were 8. Based on these parameters, 6 items were discarded, 17 were revised, and 27 were kept for subsequent use. Conclusion: Item analysis is a valuable tool as it helps us to retain the valuable MCQs and discard the items which are not useful. It also helps in increasing our skills in test construction and identifies the specific areas of course content which need greater emphasis or clarity. PMID:27563581

  6. Efficient Algorithms for Segmentation of Item-Set Time Series

    Science.gov (United States)

    Chundi, Parvathi; Rosenkrantz, Daniel J.

    We propose a special type of time series, which we call an item-set time series, to facilitate the temporal analysis of software version histories, email logs, stock market data, etc. In an item-set time series, each observed data value is a set of discrete items. We formalize the concept of an item-set time series and present efficient algorithms for segmenting a given item-set time series. Segmentation of a time series partitions the time series into a sequence of segments where each segment is constructed by combining consecutive time points of the time series. Each segment is associated with an item set that is computed from the item sets of the time points in that segment, using a function which we call a measure function. We then define a concept called the segment difference, which measures the difference between the item set of a segment and the item sets of the time points in that segment. The segment difference values are required to construct an optimal segmentation of the time series. We describe novel and efficient algorithms to compute segment difference values for each of the measure functions described in the paper. We outline a dynamic programming based scheme to construct an optimal segmentation of the given item-set time series. We use the item-set time series segmentation techniques to analyze the temporal content of three different data sets—Enron email, stock market data, and a synthetic data set. The experimental results show that an optimal segmentation of item-set time series data captures much more temporal content than a segmentation constructed based on the number of time points in each segment, without examining the item set data at the time points, and can be used to analyze different types of temporal data.

  7. Impacts of Newly Acquired Items Within Business Combinations on the Items of the Financial Statements

    Directory of Open Access Journals (Sweden)

    Jana Gláserová

    2016-01-01

    Full Text Available This paper is focused on the operations with the company (business combinations. These are those operations that are associated with the formation or dissolution of companies or reorganization of their ownership structure. They are often referred as equity transactions. In the concept of Czech accounting legislation, these are the purchase, sale, investment (deposit of firms or their parts, and various forms of transformation of enterprises. There are analyzed the accounting practices of recording of these issues under the Czech accounting legislation and International Financial Reporting Standards. Consequently there are identified newly acquired assets and liabilities arising directly in connection with the business combinations. In the conclusion of this paper there are examined the effects of different reporting of newly acquired items in the context of business combinations according to Czech accounting legislation and in accordance with International Financial Reporting Standards on the significant items of balance sheet and profit and loss statement from the material and time point of view.

  8. Hiding Sensitive Association Rules without Altering the Support of Sensitive Item(s)

    CERN Document Server

    Jain, Dhyanendra

    2012-01-01

    Association rule mining is an important data-mining technique that finds interesting association among a large set of data items. Since it may disclose patterns and various kinds of sensitive knowledge that are difficult to find otherwise, it may pose a threat to the privacy of discovered confidential information. Such information is to be protected against unauthorized access. Many strategies had been proposed to hide the information. Some use distributed databases over several sites, data perturbation, clustering, and data distortion techniques. Hiding sensitive rules problem, and still not sufficiently investigated, is the requirement to balance the confidentiality of the disclosed data with the legitimate needs of the user. The proposed approach uses the data distortion technique where the position of the sensitive items is altered but its support is never changed. The size of the database remains the same. It uses the idea of representative rules to prune the rules first and then hides the sensitive rule...

  9. Binary classification of items of interest in a repeatable process

    Science.gov (United States)

    Abell, Jeffrey A.; Spicer, John Patrick; Wincek, Michael Anthony; Wang, Hui; Chakraborty, Debejyo

    2014-06-24

    A system includes host and learning machines in electrical communication with sensors positioned with respect to an item of interest, e.g., a weld, and memory. The host executes instructions from memory to predict a binary quality status of the item. The learning machine receives signals from the sensor(s), identifies candidate features, and extracts features from the candidates that are more predictive of the binary quality status relative to other candidate features. The learning machine maps the extracted features to a dimensional space that includes most of the items from a passing binary class and excludes all or most of the items from a failing binary class. The host also compares the received signals for a subsequent item of interest to the dimensional space to thereby predict, in real time, the binary quality status of the subsequent item of interest.

  10. The basics of item response theory using R

    CERN Document Server

    Baker, Frank B

    2017-01-01

    This graduate-level textbook is a tutorial for item response theory that covers both the basics of item response theory and the use of R for preparing graphical presentation in writings about the theory. Item response theory has become one of the most powerful tools used in test construction, yet one of the barriers to learning and applying it is the considerable amount of sophisticated computational effort required to illustrate even the simplest concepts. This text provides the reader access to the basic concepts of item response theory freed of the tedious underlying calculations. It is intended for those who possess limited knowledge of educational measurement and psychometrics. Rather than presenting the full scope of item response theory, this textbook is concise and practical and presents basic concepts without becoming enmeshed in underlying mathematical and computational complexities. Clearly written text and succinct R code allow anyone familiar with statistical concepts to explore and apply item re...

  11. Differential functioning of Bender Visual-Motor Gestalt Test items.

    Science.gov (United States)

    Sisto, Fermino Fernandes; Dos Santos, Acácia Aparecida Angeli; Noronha, Ana Paula Porto

    2010-02-01

    Differential Item Functioning (DIF) refers to items that do not function the same way for comparable members of different groups. The present study focuses on analyzing and classifying sex-related differential item functioning in the Bender Visual-Motor Gestalt Test. Subjects were 1,052 children attending public schools (513 boys, 539 girls, ages 6-10 years). The protocols were scored using the Bender Graduated Scoring System, which evaluates only the distortion criterion using the Rasch logistic response model. The scoring system fit the Rasch model, although two items were found to be biased by sex. When analyzing differential functioning of items for boys and girls separately, the number of differentially functioning items was equal.

  12. Promoting cold-start items in recommender systems

    CERN Document Server

    Liu, Jin-Hu; Zhang, Zi-Ke; Yang, Zimo; Liu, Chuang; Li, Wei-Min

    2014-01-01

    As one of major challenges, cold-start problem plagues nearly all recommender systems. In particular, new items will be overlooked, impeding the development of new products online. Given limited resources, how to utilize the knowledge of recommender systems and design efficient marketing strategy for new items is extremely important. In this paper, we convert this ticklish issue into a clear mathematical problem based on a bipartite network representation. Under the most widely used algorithm in real e-commerce recommender systems, so-called the item-based collaborative filtering, we show that to simply push new items to active users is not a good strategy. To our surprise, experiments on real recommender systems indicate that to connect new items with some less active users will statistically yield better performance, namely these new items will have more chance to appear in other users' recommendation lists. Further analysis suggests that the disassortative nature of recommender systems contributes to such ...

  13. The Body Appreciation Scale-2: item refinement and psychometric evaluation.

    Science.gov (United States)

    Tylka, Tracy L; Wood-Barcalow, Nichole L

    2015-01-01

    Considered a positive body image measure, the 13-item Body Appreciation Scale (BAS; Avalos, Tylka, & Wood-Barcalow, 2005) assesses individuals' acceptance of, favorable opinions toward, and respect for their bodies. While the BAS has accrued psychometric support, we improved it by rewording certain BAS items (to eliminate sex-specific versions and body dissatisfaction-based language) and developing additional items based on positive body image research. In three studies, we examined the reworded, newly developed, and retained items to determine their psychometric properties among college and online community (Amazon Mechanical Turk) samples of 820 women and 767 men. After exploratory factor analysis, we retained 10 items (five original BAS items). Confirmatory factor analysis upheld the BAS-2's unidimensionality and invariance across sex and sample type. Its internal consistency, test-retest reliability, and construct (convergent, incremental, and discriminant) validity were supported. The BAS-2 is a psychometrically sound positive body image measure applicable for research and clinical settings.

  14. A Reexamination of the Factor Structure of the Center for Epidemiologic Studies Depression Scale: Is a One-Factor Model Plausible?

    Science.gov (United States)

    Edwards, Michael C.; Cheavens, Jennifer S.; Heiy, Jane E.; Cukrowicz, Kelly C.

    2010-01-01

    The Center for Epidemiologic Studies Depression Scale (CES-D) is one of the most widely used measures of depressive symptoms in research today. The original psychometric work in support of the CES-D (Radloff, 1977) described a 4-factor model underlying the 20 items on the scale. Despite a long history of evidence supporting this structure,…

  15. OPTIMIZING HIGHWAY PROFILES FOR INDIVIDUAL COST ITEMS

    Directory of Open Access Journals (Sweden)

    Essam Dabbour

    2013-12-01

    Full Text Available According to the current practice, vertical alignment of a highway segment is usually selected by creating a profile showing the actual ground surface and selecting initial and final grades to minimize the overall cut and fill quantities. Those grades are connected together with a parabolic curve. However, in many highway construction or rehabilitation projects, the cost of cut may be substantially different from that of fill (e.g. in extremely hard soils where blasting is needed to cut the soil. In that case, an optimization process will be needed to minimize the overall cost of cut and fill rather than to minimize their quantities. This paper proposes a nonlinear optimization model to select optimum vertical curve parameters based on individual cost items of cut and fill. The parameters selected by the optimization model include the initial grade, the final grade, the station and elevation of the point of vertical curvature (PVC, and the station and elevation of the point of vertical tangency (PVT. The model is flexible to include any design constraints for particular design problems. Different application examples are provided using the Evolutionary Algorithm in Microsoft Excel’s Solver add-in. The application examples validated the model and demonstrated its advantage of minimizing the overall cost rather than minimizing the overall volume of cut and fill quantities.

  16. Percent Errors in the Estimation of Demand for Secondary Items.

    Science.gov (United States)

    1985-11-01

    percent errors, and the program change factor (PCF) to predict item demana during the procurement *’ leadtime (PROLT) ior the item. The PCF accounts for...type of demand it was. It may"-- have been demanded over two years ago or it may nave been a non-recurring demana . Since CC b only retains two years of...observed distributions could be compared with negative binomial distributions. For each item the computed ratio of actual demana to expected demand was

  17. Safety Evaluation for Packaging (onsite) T Plant Canyon Items

    Energy Technology Data Exchange (ETDEWEB)

    OBRIEN, J.H.

    2000-07-14

    This safety evaluation for packaging (SEP) evaluates and documents the ability to safely ship mostly unique inventories of miscellaneous T Plant canyon waste items (T-P Items) encountered during the canyon deck clean off campaign. In addition, this SEP addresses contaminated items and material that may be shipped in a strong tight package (STP). The shipments meet the criteria for onsite shipments as specified by Fluor Hanford in HNF-PRO-154, Responsibilities and Procedures for all Hazardous Material Shipments.

  18. Assessing the quality of multiple-choice test items.

    Science.gov (United States)

    Clifton, Sandra L; Schriner, Cheryl L

    2010-01-01

    With the focus of nursing education geared toward teaching students to think critically, faculty need to assure that test items require students to use a high level of cognitive processing. To evaluate their examinations, the authors assessed multiple-choice test items on final nursing examinations. The assessment included determining cognitive learning levels and frequency of items among 3 adult health courses, comparing difficulty values with cognitive learning levels, and examining discrimination values and the relationship to distracter performance.

  19. Three controversies over item disclosure in medical licensure examinations

    Directory of Open Access Journals (Sweden)

    Yoon Soo Park

    2015-09-01

    Full Text Available In response to views on public's right to know, there is growing attention to item disclosure – release of items, answer keys, and performance data to the public – in medical licensure examinations and their potential impact on the test's ability to measure competence and select qualified candidates. Recent debates on this issue have sparked legislative action internationally, including South Korea, with prior discussions among North American countries dating over three decades. The purpose of this study is to identify and analyze three issues associated with item disclosure in medical licensure examinations – 1 fairness and validity, 2 impact on passing levels, and 3 utility of item disclosure – by synthesizing existing literature in relation to standards in testing. Historically, the controversy over item disclosure has centered on fairness and validity. Proponents of item disclosure stress test takers’ right to know, while opponents argue from a validity perspective. Item disclosure may bias item characteristics, such as difficulty and discrimination, and has consequences on setting passing levels. To date, there has been limited research on the utility of item disclosure for large scale testing. These issues requires ongoing and careful consideration.

  20. Embodied Valuation: Directional Action is Associated with Item Values.

    Science.gov (United States)

    Green, Danielle J; Harris, Alison; Young, Aleena; Reed, Catherine L

    2017-08-31

    We have a lifetime of experience interacting with objects we value. Although many economic theories represent valuation as a purely cognitive process independent of the sensorimotor system, embodied cognitive theory suggests that our memories for items' value should be linked to actions we use to obtain them. Here we investigated whether the value of real items was associated with specific directional movements toward or away from the body. Participants priced a set of food items to determine their values; they then used directional actions to classify each item as high- or low-value. To determine if value is linked to specific action mappings, movements were referenced either with respect to the object (push toward high-value items; pull away from low-value items) or the self (pull high-value items toward self; push low-value items away). Participants who were assigned (Experiment 1) or chose (Experiment 2) to use an object-referenced action mapping were faster than those using a self-referenced mapping. A control experiment (Experiment 3) using left/right movements found no such difference when action mappings were not toward/away from the body. These results indicate that directional actions towards items are associated with the representation of their value, suggesting an embodied component to economic choice.

  1. [Perceptions on item disclosure for the Korean medical licensing examination].

    Science.gov (United States)

    Yang, Eunbae B

    2015-09-01

    This study analyzed the perceptions of medical students and faculty regarding disclosure of test items on the Korean medical licensing examination. I conducted a survey of medical students from medical colleges and professional medical schools nationwide. Responses were analyzed from 718 participants as well as 69 faculty members who participated in creating the medical licensing examination item sets. Data were analyzed using descriptive statistics and the chi-square test. It is important to maintain test quality and to keep the test items unavailable to the public. There are also concerns among students that disclosure of test items would prompt increasing difficulty of test items (48.3%). Further, few students found it desirable to disclose test items regardless of any considerations (28.5%). The professors, who had experience in designing the test items, also expressed their opposition to test item disclosure (60.9%). It is desirable not to disclose the test items of the Korean medical licensing examination to the public on the condition that students are provided with a sufficient amount of information regarding the examination. This is so that the exam can appropriately identify candidates with the required qualifications.

  2. Conditional recall and the frequency effect in the serial recall task: an examination of item-to-item associativity.

    Science.gov (United States)

    Miller, Leonie M; Roodenrys, Steven

    2012-11-01

    The frequency effect in short-term serial recall is influenced by the composition of lists. In pure lists, a robust advantage in the recall of high-frequency (HF) words is observed, yet in alternating mixed lists, HF and low-frequency (LF) words are recalled equally well. It has been argued that the preexisting associations between all list items determine a single, global level of supportive activation that assists item recall. Preexisting associations between items are assumed to be a function of language co-occurrence; HF-HF associations are high, LF-LF associations are low, and mixed associations are intermediate in activation strength. This account, however, is based on results when alternating lists with equal numbers of HF and LF words were used. It is possible that directional association between adjacent list items is responsible for the recall patterns reported. In the present experiment, the recall of three forms of mixed lists-those with equal numbers of HF and LF items and pure lists-was examined to test the extent to which item-to-item associations are present in serial recall. Furthermore, conditional probabilities were used to examine more closely the evidence for a contribution, since correct-in-position scoring may mask recall that is dependent on the recall of prior items. The results suggest that an item-to-item effect is clearly present for early but not late list items, and they implicate an additional factor, perhaps the availability of resources at output, in the recall of late list items.

  3. Scientific Items in Cyprus Turkish Tales

    Directory of Open Access Journals (Sweden)

    Sevilay Atmaca

    2013-05-01

    Full Text Available Folk literature, which is a production of daily life experiences transferred from mouth to mouth, encompasses many types such as epic, legend, folk poetry, mania, laments, songs, riddles, tales, folk story, anecdote, proverbs, idioms, rhymes, and etc. These products, which represent the oral tradition of the people, transfer numerous information regarding the life experiences, beliefs, and values of their communities to the next generation. Tales are more attractive since they depict extraordinary events and because of these characteristics they are very didactic. These are some reasons why tales have a very significant place in Turkish folk literature. People have documented tales successfully transferring from a nomadic life to a settled one. This study attempts to determine the scientific elements in Cyprus Turkish Tales, as sample in the scope of the Turkish Tales, through content analysis. Researchers in this study will examine the themes in the selected tales such as “Earth and Cosmos”, “Physical Event”, “Matter and Change”, and “Living Organisms and Life”. They will also provide a code-list according to the themes found in the tales. Researchers will do a content analysis in relation to the themes and code-listing activities in “Cyprus Turkish Tales” collected by Mustafa Gökçeoğlu, a well-known Cyprus Turkish folk literature writer. It is seen from the research that living organism and life (f: 25 and matter and change (f: 24 themes have very abundant coded. It is followed with an orderly physical events theme (f: 13 and earth and cosmos theme (f: 6. As a result, it is seen that in samples which are taken from Turkish Cypriot Tales, scientific items are widely used. It is believed that the results of the study will give different perspectives to Turkish Language, literature, science-technology, and classroom teacher providing them with interdisciplinary and rich materials.

  4. Measurement Assurance for End-Item Users

    Science.gov (United States)

    Mimbs, Scott M.

    2008-01-01

    The goal of a Quality Management System (QMS) as specified in ISO 9001 and AS9100 is to assure the end product meets specifications and customer requirements. Measuring devices, often called measuring and test equipments (MTE), provide the evidence of product conformity to the prescribed requirements. Therefore the processes which employ MTE can become a weak link to the overall QMS if proper attention is not given to development and execution of these processes. Traditionally, calibration of MTE is given more focus in industry standards and process control efforts than the equally important proper usage of the same equipment. It is a common complaint of calibration laboratory personnel that MTE users are only interested in "a sticker." If the QMS requires the MTE "to demonstrate conformity of the product," then the quality of the measurement process must be adequate for the task. This leads to an ad hoc definition; measurement assurance is a discipline that assures that all processes, activities, environments, standards, and procedures involved in making a measurement produce a result that can be rigorously evaluated for validity and accuracy. To evaluate that the existing measurement processes are providing an adequate level of quality to support the decisions based upon this measurement data, an understanding of measurement assurance basics is essential. This topic is complimentary to the calibration standard, ANSI/NCSL Z540.3-2006, which targets the calibration of MTE at the organizational level. This paper will discuss general measurement assurance when MTE is used to provide evidence of product conformity, therefore the target audience of this paper is end item users of MTE. A central focus of the paper will be the verification of tolerances and the associated risks, so calibration professionals may find the paper useful in communication with their customers, MTE users.

  5. Creation of New Items and Forms for the Project A Assembling Objects Test

    Science.gov (United States)

    1994-08-01

    correct; D, a measure of item discrimination - the rpbi , between the item score (correct or incorrect) and the total score on the 36 original items...D2 another measure of item discrimination - the rpbis between the item score and the total score on the 18 original items of the same type (marked

  6. Item Purification Does Not Always Improve DIF Detection: A Counterexample with Angoff's Delta Plot

    Science.gov (United States)

    Magis, David; Facon, Bruno

    2013-01-01

    Item purification is an iterative process that is often advocated as improving the identification of items affected by differential item functioning (DIF). With test-score-based DIF detection methods, item purification iteratively removes the items currently flagged as DIF from the test scores to get purified sets of items, unaffected by DIF. The…

  7. Conditional Versus Unconditional Procedures for Sample-Free Item Analysis

    Science.gov (United States)

    Wright, Benjamin D.; Douglas, Graham A.

    1977-01-01

    Two procedures for Rasch, sample-free item calibration are reviewed and compared for accuracy. The theoretically ideal "conditional" procedure is impractical for more than fifteen items. The more practical but biased "unconditional" procedure is discussed in detail. (Author/JKS)

  8. Item-Specific and Interitem Elaboration in Recall and Recognition

    Science.gov (United States)

    1984-07-01

    However, increasing task difficulty, resulting in a more istinctive encoding for items, plays a larger role in recognition. In a similar vein, Belleza ...and his collegues ( Belleza , Cheesman & Reddy 1977) have demonstrated that organizational strategies, such as the alphabet and story mnemonics, aid...specific semantic processing (Craik & Tulving, 1975). Presumably, some item-specific semantic processing occurs prior to or during organization ( Belleza

  9. 26 CFR 1.461-5 - Recurring item exception.

    Science.gov (United States)

    2010-04-01

    ... satisfied, generally accepted accounting principles are an important factor, but are not dispositive. (ii... of accounting may adopt the recurring item exception described in paragraph (b) of this section as method of accounting for one or more types of recurring items incurred by the taxpayer. In the case...

  10. 48 CFR 252.211-7003 - Item identification and valuation.

    Science.gov (United States)

    2010-10-01

    ... in this clause— Automatic identification device means a device, such as a reader or interrogator... 48 Federal Acquisition Regulations System 3 2010-10-01 2010-10-01 false Item identification and... of Provisions And Clauses 252.211-7003 Item identification and valuation. As prescribed in 211.274-6...

  11. Semiparametric Item Response Functions in the Context of Guessing

    Science.gov (United States)

    Falk, Carl F.; Cai, Li

    2016-01-01

    We present a logistic function of a monotonic polynomial with a lower asymptote, allowing additional flexibility beyond the three-parameter logistic model. We develop a maximum marginal likelihood-based approach to estimate the item parameters. The new item response model is demonstrated on math assessment data from a state, and a computationally…

  12. 17 CFR 229.10 - (Item 10) General.

    Science.gov (United States)

    2010-04-01

    ... Corporate governance. Item 503 Prospectus summary, risk factors, and ratio of earnings to fixed charges... term is defined in Item 1101 of Regulation AB (§ 229.1101)), reference may not be made to any document... this part, the term smaller reporting company means an issuer that is not an investment company,...

  13. A nonparametric approach to the analysis of dichotomous item responses

    NARCIS (Netherlands)

    Mokken, R.J.; Lewis, C.

    1982-01-01

    An item response theory is discussed which is based on purely ordinal assumptions about the probabilities that people respond positively to items. It is considered as a natural generalization of both Guttman scaling and classical test theory. A distinction is drawn between construction and evaluatio

  14. Do item-writing flaws reduce examinations psychometric quality?

    Science.gov (United States)

    Pais, João; Silva, Artur; Guimarães, Bruno; Povo, Ana; Coelho, Elisabete; Silva-Pereira, Fernanda; Lourinho, Isabel; Ferreira, Maria Amélia; Severo, Milton

    2016-08-11

    The psychometric characteristics of multiple-choice questions (MCQ) changed when taking into account their anatomical sites and the presence of item-writing flaws (IWF). The aim is to understand the impact of the anatomical sites and the presence of IWF in the psychometric qualities of the MCQ. 800 Clinical Anatomy MCQ from eight examinations were classified as standard or flawed items and according to one of the eight anatomical sites. An item was classified as flawed if it violated at least one of the principles of item writing. The difficulty and discrimination indices of each item were obtained. 55.8 % of the MCQ were flawed items. The anatomical site of the items explained 6.2 and 3.2 % of the difficulty and discrimination parameters and the IWF explained 2.8 and 0.8 %, respectively. The impact of the IWF was heterogeneous, the Writing the Stem and Writing the Choices categories had a negative impact (higher difficulty and lower discrimination) while the other categories did not have any impact. The anatomical site effect was higher than IWF effect in the psychometric characteristics of the examination. When constructing MCQ, the focus should be in the topic/area of the items and only after in the presence of IWF.

  15. Mathematical-programming approaches to test item pool design

    NARCIS (Netherlands)

    Veldkamp, Bernard P.; van der Linden, Willem J.; Ariel, A.

    2002-01-01

    This paper presents an approach to item pool design that has the potential to improve on the quality of current item pools in educational and psychological testing andhence to increase both measurement precision and validity. The approach consists of the application of mathematical programming

  16. Using response times for item selection in adaptive testing

    NARCIS (Netherlands)

    Linden, van der Wim J.

    2008-01-01

    Response times on items can be used to improve item selection in adaptive testing provided that a probabilistic model for their distribution is available. In this research, the author used a hierarchical modeling framework with separate first-level models for the responses and response times and a s

  17. Detecting Local Item Dependence in Polytomous Adaptive Data

    Science.gov (United States)

    Mislevy, Jessica L.; Rupp, Andre A.; Harring, Jeffrey R.

    2012-01-01

    A rapidly expanding arena for item response theory (IRT) is in attitudinal and health-outcomes survey applications, often with polytomous items. In particular, there is interest in computer adaptive testing (CAT). Meeting model assumptions is necessary to realize the benefits of IRT in this setting, however. Although initial investigations of…

  18. Distributed Item Review: Administrator User Guide. Technical Report #1603

    Science.gov (United States)

    Irvin, P. Shawn

    2016-01-01

    The Distributed Item Review (DIR) is a secure and flexible, web-based system designed to present test items to expert reviewers across a broad geographic area for evaluation of important dimensions of quality (e.g., alignment with standards, bias, sensitivity, and student accessibility). The DIR is comprised of essential features that allow system…

  19. Automated addition of Chelex solution to tubes containing trace items

    DEFF Research Database (Denmark)

    Stangegaard, Michael; Hansen, Thomas Møller; Hansen, Anders Johannes;

    2011-01-01

    Extraction of DNA from trace items for forensic genetic DNA typing using a manual Chelex based extraction protocol requires addition of Chelex solution to sample tubes containing trace items. Automated of addition of Chelex solution may be hampered by high viscosity of the solution and fast sedim...

  20. Lower-fat menu items in restaurants satisfy customers.

    Science.gov (United States)

    Fitzpatrick, M P; Chapman, G E; Barr, S I

    1997-05-01

    To evaluate a restaurant-based nutrition program by measuring customer satisfaction with lower-fat menu items and assessing patrons' reactions to the program. Questionnaires to assess satisfaction with menu items were administered to patrons in eight of the nine restaurants that volunteered to participate in the nutrition program. One patron from each participating restaurant was randomly selected for a semistructured interview about nutrition programming in restaurants. Persons dining in eight participating restaurants over a 1-week period (n = 686). Independent samples t tests were used to compare respondents' satisfaction with lower-fat and regular menu items. Two-way analysis of variance tests were completed using overall satisfaction as the dependent variable and menu-item classification (ie, lower fat or regular) and one of eight other menu item and respondent characteristics as independent variables. Qualitative methods were used to analyze interview transcripts. Of 1,127 menu items rated for satisfaction, 205 were lower fat, 878 were regular, and 44 were of unknown classification. Customers were significantly more satisfied with lower-fat than with regular menu items (P satisfaction did not vary by any of the other independent variables. Interview results indicate the importance of restaurant during as an indulgent experience. High satisfaction with lower-fat menu items suggests that customers will support restaurant providing such choices. Dietitians can use these findings to encourage restaurateurs to include lower-fat choices on their menus, and to assure clients that their expectations of being indulged are not incompatible with these choices.

  1. Effects of clinical practice focusing on level-3 OSCE items.

    Science.gov (United States)

    Sakurai, Hiroaki; Kanada, Yoshikiyo; Sugiura, Yoshito; Motoya, Ikuo; Wada, Yosuke; Yamada, Masayuki; Tomita, Masao; Tanabe, Shigeo; Koyama, Soichiro; Teranishi, Toshio; Sawa, Syunji; Okanishi, Tetsuo

    2015-05-01

    [Purpose] This study examined the effects of clinical training focusing on level-3 OSCE (analytical and therapeutic skills) items, and compared the achievement levels of physical (PT) and occupational (OT) therapist students. [Subjects] A total of 282 (165 PT and 117 OT) students enrolled at our university between 2008 and 2010 were studied. [Methods] OSCE scores were compared between before and after clinical training focusing on level-3 OSCE items, and between PT and OT students. [Results] Scores for 5 out of the 6 level-3a items were significantly higher after than before clinical training. Increases in scores of 2 or 3 level-3b and -3c items were also observed after clinical training. There were no marked differences between PT and OT students in scores for level-3a, -3b, and -3c items before clinical training. In contrast, after clinical training, OT students' scores for 3a and 3c items related to dressing were higher than those of PT students, and the latter's scores for 3b items related to transfer were higher than those of the former. [Conclusion] The results suggest level-3 OSCE items are effectively taught during clinical training.

  2. 48 CFR 211.105 - Items peculiar to one manufacturer.

    Science.gov (United States)

    2010-10-01

    ... 48 Federal Acquisition Regulations System 3 2010-10-01 2010-10-01 false Items peculiar to one manufacturer. 211.105 Section 211.105 Federal Acquisition Regulations System DEFENSE ACQUISITION REGULATIONS... Requirements Documents 211.105 Items peculiar to one manufacturer. Follow the publication requirements at...

  3. 48 CFR 811.105 - Items peculiar to one manufacturer.

    Science.gov (United States)

    2010-10-01

    ... 48 Federal Acquisition Regulations System 5 2010-10-01 2010-10-01 false Items peculiar to one manufacturer. 811.105 Section 811.105 Federal Acquisition Regulations System DEPARTMENT OF VETERANS AFFAIRS... 811.105 Items peculiar to one manufacturer. (a) Except as provided in paragraph (b) of this...

  4. Group differences in the heritability of items and test scores

    NARCIS (Netherlands)

    Wicherts, J.M.; Johnson, W.

    2009-01-01

    It is important to understand potential sources of group differences in the heritability of intelligence test scores. On the basis of a basic item response model we argue that heritabilities which are based on dichotomous item scores normally do not generalize from one sample to the next. If groups

  5. Individuality of Item Interpretation in Interchangeable ACL Scales

    Science.gov (United States)

    Fiske, Donald W.; Barack, Leonard I.

    1976-01-01

    The diversity among interpretations of single items in personality questionnaires has been noted previously. Using adjectives from the Adjective Check List (ACL), the study sought evidence bearing on these questions: Does such diversity make the responses to an item not comparable across subjects? If so, what are the implications for scores based…

  6. 32 CFR 643.51 - Additional items concerning leasing.

    Science.gov (United States)

    2010-07-01

    ... 32 National Defense 4 2010-07-01 2010-07-01 true Additional items concerning leasing. 643.51... PROPERTY REAL ESTATE Leases § 643.51 Additional items concerning leasing. In addition to the general and... leasing of Army real estate....

  7. Characterizing Sources of Uncertainty in Item Response Theory Scale Scores

    Science.gov (United States)

    Yang, Ji Seung; Hansen, Mark; Cai, Li

    2012-01-01

    Traditional estimators of item response theory scale scores ignore uncertainty carried over from the item calibration process, which can lead to incorrect estimates of the standard errors of measurement (SEMs). Here, the authors review a variety of approaches that have been applied to this problem and compare them on the basis of their statistical…

  8. 7 CFR 65.220 - Processed food item.

    Science.gov (United States)

    2010-01-01

    ... AGRICULTURAL MARKETING ACT OF 1946 AND THE EGG PRODUCTS INSPECTION ACT (CONTINUED) COUNTRY OF ORIGIN LABELING..., PEANUTS, AND GINSENG General Provisions Definitions § 65.220 Processed food item. Processed food item... further step in the preparation of the product for consumption, would not in itself result in a processed...

  9. Negative Affect Impairs Associative Memory but Not Item Memory

    Science.gov (United States)

    Bisby, James A.; Burgess, Neil

    2014-01-01

    The formation of associations between items and their context has been proposed to rely on mechanisms distinct from those supporting memory for a single item. Although emotional experiences can profoundly affect memory, our understanding of how it interacts with different aspects of memory remains unclear. We performed three experiments to examine…

  10. Dissociation between source and item memory in Parkinson's disease

    Institute of Scientific and Technical Information of China (English)

    Hu Panpan; Li Youhai; Ma Huijuan; Xi Chunhua; Chen Xianwen; Wang Kai

    2014-01-01

    Background Episodic memory includes information about item memory and source memory.Many researches support the hypothesis that these two memory systems are implemented by different brain structures.The aim of this study was to investigate the characteristics of item memory and source memory processing in patients with Parkinson's disease (PD),and to further verify the hypothesis of dual-process model of source and item memory.Methods We established a neuropsychological battery to measure the performance of item memory and source memory.Totally 35 PD individuals and 35 matched healthy controls (HC) were administrated with the battery.Item memory task consists of the learning and recognition of high-frequency national Chinese characters; source memory task consists of the learning and recognition of three modes (character,picture,and image) of objects.Results Compared with the controls,the idiopathic PD patients have been impaired source memory (PD vs.HC:0.65±0.06 vs.0.72±0.09,P=0.001),but not impaired in item memory (PD vs.HC:0.65±0.07 vs.0.67±0.08,P=0.240).Conclusions The present experiment provides evidence for dissociation between item and source memory in PD patients,thereby strengthening the claim that the item or source memory rely on different brain structures.PD patients show poor source memory,in which dopamine plays a critical role.

  11. Elu kui näitemäng / Helju Koger

    Index Scriptorium Estoniae

    Koger, Helju, 1943-

    2007-01-01

    VI kihelkonnapäevadest Juurus. Juuru Mihkli kirikus esines ansambel Resonabilis. Konverentsil räägiti Järlepa mõisast, Anu Allikvee pidas ettekande "August von Kotzebue elu nagu näitemäng" jm. Näitemängu "Pärmi Jaagu unenägu" nägi kohalike asjaarmastajate esituses

  12. Estimating the Reliability of a Test Containing Multiple Item Formats.

    Science.gov (United States)

    Qualls, Audrey L.

    1995-01-01

    Classically parallel, tau-equivalently parallel, and congenerically parallel models representing various degrees of part-test parallelism and their appropriateness for tests composed of multiple item formats are discussed. An appropriate reliability estimate for a test with multiple item formats is presented and illustrated. (SLD)

  13. The Use of Item Analysis for Improvement of Biochemical Teaching

    Science.gov (United States)

    Nagata, Ryoichi

    2004-01-01

    Item analysis was used to find out which biochemical explanations need to be improved in biochemical teaching, not which items are to be discarded, improved, or reusable in biochemical examinations. The analysis revealed the basic facts of which less able students had more misunderstanding than able students. Identifying these basic facts helps…

  14. Bayesian Item Selection in Constrained Adaptive Testing Using Shadow Tests

    Science.gov (United States)

    Veldkamp, Bernard P.

    2010-01-01

    Application of Bayesian item selection criteria in computerized adaptive testing might result in improvement of bias and MSE of the ability estimates. The question remains how to apply Bayesian item selection criteria in the context of constrained adaptive testing, where large numbers of specifications have to be taken into account in the item…

  15. Costs of Matrix Sampling of Test Items. ERIC Digest.

    Science.gov (United States)

    Childs, Ruth A.; Jaciw, Andrew P.

    Matrix sampling of test items, the division of a set of items into different versions of a test form, is used by several large-scale testing programs. This Digest discusses nine categories of costs associated with matrix sampling. These categories are: (1) development costs; (2) materials costs; (3) administration costs; (4) educational costs; (5)…

  16. 14 CFR Section 17 - Objective Classification-Extraordinary Items

    Science.gov (United States)

    2010-01-01

    ... 14 Aeronautics and Space 4 2010-01-01 2010-01-01 false Objective Classification-Extraordinary Items Section 17 Section 17 Aeronautics and Space OFFICE OF THE SECRETARY, DEPARTMENT OF TRANSPORTATION... AIR CARRIERS Profit and Loss Classification Section 17 Objective Classification—Extraordinary Items...

  17. Automated addition of Chelex solution to tubes containing trace items

    DEFF Research Database (Denmark)

    Stangegaard, Michael; Hansen, Thomas Møller; Hansen, Anders Johannes;

    2011-01-01

    Extraction of DNA from trace items for forensic genetic DNA typing using a manual Chelex based extraction protocol requires addition of Chelex solution to sample tubes containing trace items. Automated of addition of Chelex solution may be hampered by high viscosity of the solution and fast...

  18. 17 CFR 229.1002 - (Item 1002) Subject company information.

    Science.gov (United States)

    2010-04-01

    ... ENERGY POLICY AND CONSERVATION ACT OF 1975-REGULATION S-K Mergers and Acquisitions (Regulation M-A) § 229.1002 (Item 1002) Subject company information. (a) Name and address. State the name of the subject... 17 Commodity and Securities Exchanges 2 2010-04-01 2010-04-01 false (Item 1002) Subject...

  19. Effects of Aging and IQ on Item and Associative Memory

    Science.gov (United States)

    Ratcliff, Roger; Thapar, Anjali; McKoon, Gail

    2011-01-01

    The effects of aging and IQ on performance were examined in 4 memory tasks: item recognition, associative recognition, cued recall, and free recall. For item and associative recognition, accuracy and the response time (RT) distributions for correct and error responses were explained by Ratcliff's (1978) diffusion model at the level of individual…

  20. Multidimensional Linking for Tests with Mixed Item Types

    Science.gov (United States)

    Yao, Lihua; Boughton, Keith

    2009-01-01

    Numerous assessments contain a mixture of multiple choice (MC) and constructed response (CR) item types and many have been found to measure more than one trait. Thus, there is a need for multidimensional dichotomous and polytomous item response theory (IRT) modeling solutions, including multidimensional linking software. For example,…

  1. Optimal Bayesian Adaptive Design for Test-Item Calibration

    NARCIS (Netherlands)

    Linden, van der Wim J.; Ren, Hao

    2015-01-01

    An optimal adaptive design for test-item calibration based on Bayesian optimality criteria is presented. The design adapts the choice of field-test items to the examinees taking an operational adaptive test using both the information in the posterior distributions of their ability parameters and the

  2. Textiles and Design Library of Test Items. Volume I.

    Science.gov (United States)

    Smith, Jan, Ed.

    As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items of value from past tests are made available to teachers for the construction of unit tests, term examinations or as a basis for class discussion. Each collection is reviewed for content validity and reliability. The test…

  3. Refine test items for accurate measurement: six valuable tips.

    Science.gov (United States)

    Siroky, Karen; Di Leonardi, Bette Case

    2015-01-01

    Nursing Professional Development (NPD) specialists frequently design test items to assess competence, to measure learning outcomes, and to create active learning experiences. This article presents six valuable tips for improving test items and using test results to strengthen validity of measurement. NPD specialists can readily apply these tips and examples to measure knowledge with greater accuracy.

  4. A lognormal model for response times on test items

    NARCIS (Netherlands)

    van der Linden, Willem J.

    2006-01-01

    A lognormal model for the response times of a person on a set of test items is investigated. The model has a parameter structure analogous to the two-parameter logistic response models in item response theory, with a parameter for the speed of each person as well as parameters for the time intensity

  5. Studies on statistical models for polytomously scored test items

    NARCIS (Netherlands)

    Akkermans, Wies

    1998-01-01

    This dissertation, which is structured as a collection of self-contained papers, will be concerned mainly with di�erences between item response models. The purpose of item response theory (IRT) is estimation of a hypothesized latent variable, such as, for example, intelligence or ability in geograph

  6. 48 CFR 53.212 - Acquisition of commercial items.

    Science.gov (United States)

    2010-10-01

    ... (CONTINUED) CLAUSES AND FORMS FORMS Prescription of Forms 53.212 Acquisition of commercial items. SF 1449 (Rev. 3/2005), Solicitation/Contract/Order for Commercial Items. SF 1449 is prescribed for use in... 48 Federal Acquisition Regulations System 2 2010-10-01 2010-10-01 false Acquisition of...

  7. 17 CFR 229.407 - (Item 407) Corporate governance.

    Science.gov (United States)

    2010-04-01

    ... 17 Commodity and Securities Exchanges 2 2010-04-01 2010-04-01 false (Item 407) Corporate....407 (Item 407) Corporate governance. (a) Director independence. Identify each director and, when the... independent accountant's communications with the audit committee concerning independence, and has...

  8. 17 CFR 229.406 - (Item 406) Code of ethics.

    Science.gov (United States)

    2010-04-01

    ... 17 Commodity and Securities Exchanges 2 2010-04-01 2010-04-01 false (Item 406) Code of ethics. 229... 406) Code of ethics. (a) Disclose whether the registrant has adopted a code of ethics that applies to... code of ethics, explain why it has not done so. (b) For purposes of this Item 406, the term code...

  9. Comparison of Methods for Varying Item Presentation during Noncontingent Reinforcement

    Science.gov (United States)

    Rosales, Rocio; Worsdell, April; Trahan, Maranda

    2010-01-01

    The purpose of the present investigation was to evaluate the relative effectiveness of three item presentation methods during noncontingent reinforcement (NCR). Four individuals with developmental disabilities and problem behavior maintained by automatic reinforcement were recruited for the study. Single-item stimulus preference assessments were…

  10. Visual Items in Tests of Intelligence for Children.

    Science.gov (United States)

    Wyver, Shirley R.; Markham, Roslyn; Hlavacek, Sonia

    1999-01-01

    A study compared the performance of 15 children (ages 5-12) with visual impairments and 15 controls on the Comprehension and Similarities items of the Wechsler Intelligence Scale for Children-Revised. Results indicated the children with visual impairments were disadvantaged by comprehension-type items with high visual content. (CR)

  11. 17 CFR 229.1110 - (Item 1110) Originators.

    Science.gov (United States)

    2010-04-01

    ... AND CONSERVATION ACT OF 1975-REGULATION S-K Asset-Backed Securities (Regulation AB) § 229.1110 (Item... 17 Commodity and Securities Exchanges 2 2010-04-01 2010-04-01 false (Item 1110) Originators. 229.1110 Section 229.1110 Commodity and Securities Exchanges SECURITIES AND EXCHANGE COMMISSION STANDARD...

  12. 17 CFR 229.1120 - (Item 1120) Ratings.

    Science.gov (United States)

    2010-04-01

    ... AND CONSERVATION ACT OF 1975-REGULATION S-K Asset-Backed Securities (Regulation AB) § 229.1120 (Item... rating monitored while the asset-backed securities are outstanding. ... 17 Commodity and Securities Exchanges 2 2010-04-01 2010-04-01 false (Item 1120) Ratings. 229.1120...

  13. 17 CFR 229.1104 - (Item 1104) Sponsors.

    Science.gov (United States)

    2010-04-01

    ... AND CONSERVATION ACT OF 1975-REGULATION S-K Asset-Backed Securities (Regulation AB) § 229.1104 (Item... 17 Commodity and Securities Exchanges 2 2010-04-01 2010-04-01 false (Item 1104) Sponsors. 229.1104 Section 229.1104 Commodity and Securities Exchanges SECURITIES AND EXCHANGE COMMISSION STANDARD...

  14. 48 CFR 1852.246-73 - Human space flight item.

    Science.gov (United States)

    2010-10-01

    ... 48 Federal Acquisition Regulations System 6 2010-10-01 2010-10-01 true Human space flight item. 1852.246-73 Section 1852.246-73 Federal Acquisition Regulations System NATIONAL AERONAUTICS AND SPACE... 1852.246-73 Human space flight item. As prescribed in 1845.370(b), insert the following clause:...

  15. Model Penjadwalan Batch Multi Item dengan Dependent Processing Time

    Directory of Open Access Journals (Sweden)

    Sukoyo Sukoyo

    2010-01-01

    Full Text Available This paper investigates a development of single machine batch scheduling for multi items with dependent processing time. The batch scheduling problem is to determine simultaneously number of batch (N, which item and its size allocated for each batch, and processing sequences of resulting batches. We use total actual flow time as the objective of schedule performance. The multi item batch scheduling problem could be formulated into a biner-integer nonlinear programming model because the number of batch should be in integer value, the allocation of items to resulting batch need binary values, and also there are some non-linearity on objective function and constraint due to the dependent processing time. By applying relaxation on the decision variable of number of batch (N as parameter, a heuristic procedure could be applied to find solution of the single machine batch scheduling problem for multi items.

  16. An autoregressive growth model for longitudinal item analysis.

    Science.gov (United States)

    Jeon, Minjeong; Rabe-Hesketh, Sophia

    2016-09-01

    A first-order autoregressive growth model is proposed for longitudinal binary item analysis where responses to the same items are conditionally dependent across time given the latent traits. Specifically, the item response probability for a given item at a given time depends on the latent trait as well as the response to the same item at the previous time, or the lagged response. An initial conditions problem arises because there is no lagged response at the initial time period. We handle this problem by adapting solutions proposed for dynamic models in panel data econometrics. Asymptotic and finite sample power for the autoregressive parameters are investigated. The consequences of ignoring local dependence and the initial conditions problem are also examined for data simulated from a first-order autoregressive growth model. The proposed methods are applied to longitudinal data on Korean students' self-esteem.

  17. Modeling local item dependence with the hierarchical generalized linear model.

    Science.gov (United States)

    Jiao, Hong; Wang, Shudong; Kamata, Akihito

    2005-01-01

    Local item dependence (LID) can emerge when the test items are nested within common stimuli or item groups. This study proposes a three-level hierarchical generalized linear model (HGLM) to model LID when LID is due to such contextual effects. The proposed three-level HGLM was examined by analyzing simulated data sets and was compared with the Rasch-equivalent two-level HGLM that ignores such a nested structure of test items. The results demonstrated that the proposed model could capture LID and estimate its magnitude. Also, the two-level HGLM resulted in larger mean absolute differences between the true and the estimated item difficulties than those from the proposed three-level HGLM. Furthermore, it was demonstrated that the proposed three-level HGLM estimated the ability distribution variance unaffected by the LID magnitude, while the two-level HGLM with no LID consideration increasingly underestimated the ability variance as the LID magnitude increased.

  18. Quantitative Analysis of Complex Multiple-Choice Items in Science Technology and Society: Item Scaling

    Directory of Open Access Journals (Sweden)

    Ángel Vázquez Alonso

    2005-05-01

    Full Text Available The scarce attention to assessment and evaluation in science education research has been especially harmful for Science-Technology-Society (STS education, due to the dialectic, tentative, value-laden, and controversial nature of most STS topics. To overcome the methodological pitfalls of the STS assessment instruments used in the past, an empirically developed instrument (VOSTS, Views on Science-Technology-Society have been suggested. Some methodological proposals, namely the multiple response models and the computing of a global attitudinal index, were suggested to improve the item implementation. The final step of these methodological proposals requires the categorization of STS statements. This paper describes the process of categorization through a scaling procedure ruled by a panel of experts, acting as judges, according to the body of knowledge from history, epistemology, and sociology of science. The statement categorization allows for the sound foundation of STS items, which is useful in educational assessment and science education research, and may also increase teachers’ self-confidence in the development of the STS curriculum for science classrooms.

  19. Exploring the Relevance of Items in the Communicative Participation Item Bank (CPIB) for Individuals With Hearing Loss

    Science.gov (United States)

    Baylor, Carolyn R.; Birch, Kristen; Yorkston, Kathryn M.

    2017-01-01

    Purpose The Communicative Participation Item Bank (CPIB) was developed to evaluate participation restrictions in communication situations for individuals with speech and language disorders. This study evaluated the potential relevance of CPIB items for individuals with hearing loss. Method Cognitive interviews were conducted with 17 adults with a range of treated and untreated hearing loss, who responded to 46 items. Interviews were continued until saturation was reached and prevalent trends emerged. A focus group was also conducted with 3 experienced audiologists to seek their views on the CPIB. Analysis of data included qualitative and quantitative approaches. Results The majority of the items were applicable to individuals with hearing loss; however, 12 items were identified as potentially not relevant. This was largely attributed to the items' focus on speech production rather than hearing. The results from the focus group were in agreement for a majority of items. Conclusions The next step in validating the CPIB for individuals with hearing loss is a psychometric analysis on a large sample. Possible outcomes could be that the CPIB is considered valid in its entirety or the creation of a new questionnaire or a hearing loss–specific short form with a subset of items is necessary. PMID:28114665

  20. The academic medical center linear disability score (ALDS) item bank: item response theory analysis in a mixed patient population

    NARCIS (Netherlands)

    Holman, Rebecca; Weisscher, Nadine; Glas, Cornelis A.W.; Dijkgraaf, Marcel G.W.; Vermeulen, Martinus; de Haan, Rob J.; Lindeboom, Robert

    2005-01-01

    Background: Currently, there is a lot of interest in the flexible framework offered by item banks for measuring patient relevant outcomes. However, there are few item banks, which have been developed to quantify functional status, as expressed by the ability to perform activities of daily life. This

  1. The use of predicted values for item parameters in item response theory models: An application in intelligence tests

    NARCIS (Netherlands)

    Matteucci, M.; S. Mignani, Prof.; Veldkamp, Bernard P.

    2012-01-01

    In testing, item response theory models are widely used in order to estimate item parameters and individual abilities. However, even unidimensional models require a considerable sample size so that all parameters can be estimated precisely. The introduction of empirical prior information about candi

  2. 41 CFR 101-26.605 - Items other than petroleum products and electronic items available from the Defense Logistics...

    Science.gov (United States)

    2010-07-01

    ... petroleum products and electronic items available from the Defense Logistics Agency. 101-26.605 Section 101... available from the Defense Logistics Agency. Agencies required to use GSA supply sources should also use... Logistics Agency, the catalog will contain only those items in Federal supply classification classes which...

  3. Evaluation of the Magnitude of Differential Item Functioning in Polytomous Items. Program Statistics Research Technical Report No. 94-2.

    Science.gov (United States)

    Zwick, Rebecca; Thayer, Dorothy T.

    Several recent studies have investigated the application of statistical inference procedures to the analysis of differential item functioning (DIF) in test items that are scored on an ordinal scale. Mantel's extension of the Mantel-Haenszel test is a possible hypothesis-testing method for this purpose. The development of descriptive statistics for…

  4. Analysis of differential item functioning in the depression item bank from the Patient Reported Outcome Measurement Information System (PROMIS: An item response theory approach

    Directory of Open Access Journals (Sweden)

    JOSEPH P. EIMICKE

    2009-06-01

    Full Text Available The aims of this paper are to present findings related to differential item functioning (DIF in the Patient Reported Outcome Measurement Information System (PROMIS depression item bank, and to discuss potential threats to the validity of results from studies of DIF. The 32 depression items studied were modified from several widely used instruments. DIF analyses of gender, age and education were performed using a sample of 735 individuals recruited by a survey polling firm. DIF hypotheses were generated by asking content experts to indicate whether or not they expected DIF to be present, and the direction of the DIF with respect to the studied comparison groups. Primary analyses were conducted using the graded item response model (for polytomous, ordered response category data with likelihood ratio tests of DIF, accompanied by magnitude measures. Sensitivity analyses were performed using other item response models and approaches to DIF detection. Despite some caveats, the items that are recommended for exclusion or for separate calibration were "I felt like crying" and "I had trouble enjoying things that I used to enjoy." The item, "I felt I had no energy," was also flagged as evidencing DIF, and recommended for additional review. On the one hand, false DIF detection (Type 1 error was controlled to the extent possible by ensuring model fit and purification. On the other hand, power for DIF detection might have been compromised by several factors, including sparse data and small sample sizes. Nonetheless, practical and not just statistical significance should be considered. In this case the overall magnitude and impact of DIF was small for the groups studied, although impact was relatively large for some individuals.

  5. Analysis of differential item functioning in the depression item bank from the Patient Reported Outcome Measurement Information System (PROMIS): An item response theory approach

    Science.gov (United States)

    Teresi, Jeanne A.; Ocepek-Welikson, Katja; Kleinman, Marjorie; Eimicke, Joseph P.; Crane, Paul K.; Jones, Richard N.; Lai, Jin-shei; Choi, Seung W.; Hays, Ron D.; Reeve, Bryce B.; Reise, Steven P.; Pilkonis, Paul A.; Cella, David

    2009-01-01

    The aims of this paper are to present findings related to differential item functioning (DIF) in the Patient Reported Outcome Measurement Information System (PROMIS) depression item bank, and to discuss potential threats to the validity of results from studies of DIF. The 32 depression items studied were modified from several widely used instruments. DIF analyses of gender, age and education were performed using a sample of 735 individuals recruited by a survey polling firm. DIF hypotheses were generated by asking content experts to indicate whether or not they expected DIF to be present, and the direction of the DIF with respect to the studied comparison groups. Primary analyses were conducted using the graded item response model (for polytomous, ordered response category data) with likelihood ratio tests of DIF, accompanied by magnitude measures. Sensitivity analyses were performed using other item response models and approaches to DIF detection. Despite some caveats, the items that are recommended for exclusion or for separate calibration were “I felt like crying” and “I had trouble enjoying things that I used to enjoy.” The item, “I felt I had no energy,” was also flagged as evidencing DIF, and recommended for additional review. On the one hand, false DIF detection (Type 1 error) was controlled to the extent possible by ensuring model fit and purification. On the other hand, power for DIF detection might have been compromised by several factors, including sparse data and small sample sizes. Nonetheless, practical and not just statistical significance should be considered. In this case the overall magnitude and impact of DIF was small for the groups studied, although impact was relatively large for some individuals. PMID:20336180

  6. Item response theory analysis of the life orientation test-revised: age and gender differential item functioning analyses.

    Science.gov (United States)

    Steca, Patrizia; Monzani, Dario; Greco, Andrea; Chiesi, Francesca; Primi, Caterina

    2015-06-01

    This study is aimed at testing the measurement properties of the Life Orientation Test-Revised (LOT-R) for the assessment of dispositional optimism by employing item response theory (IRT) analyses. The LOT-R was administered to a large sample of 2,862 Italian adults. First, confirmatory factor analyses demonstrated the theoretical conceptualization of the construct measured by the LOT-R as a single bipolar dimension. Subsequently, IRT analyses for polytomous, ordered response category data were applied to investigate the items' properties. The equivalence of the items across gender and age was assessed by analyzing differential item functioning. Discrimination and severity parameters indicated that all items were able to distinguish people with different levels of optimism and adequately covered the spectrum of the latent trait. Additionally, the LOT-R appears to be gender invariant and, with minor exceptions, age invariant. Results provided evidence that the LOT-R is a reliable and valid measure of dispositional optimism.

  7. AVOID BECOMING A VICTIM OF COUNTERFEIT ITEMS

    Energy Technology Data Exchange (ETDEWEB)

    WARRINER RD

    2011-07-13

    In today's globalized economy, we cannot live without imported products. Most people do not realize how thin the safety net of regulation and inspection really is. Less than three percent of imported products receive any form of government inspection prior to sale. Avoid flea markets, street vendors and deep discount stores. The sellers of counterfeit wares know where to market their products. They look for individuals who are hungry for a brand name item but do not want to pay a brand name price for it. The internet provides anonymity to the sellers of counterfeit products. Unlike Europe, U.S. law does not hold internet-marketing organizations, responsible for the quality of the products sold on their websites. These organizations will remove an individual vendor when a sufficient number of complaints are lodged, but they will not take responsibility for the counterfeit products you may have purchased. EBay has a number of counterfeit product guides to help you avoid being a victim of the sellers of these products. Ten percent of all medications taken worldwide are counterfeit. If you do buy medications on-line, be sure that the National Association of Boards of Pharmacy Verified Internet Pharmacy Practice Sites (VIPPS) recommends the pharmacy you choose to use. Inspect all medication purchases and report any change in color, shape, imprinting or odor to your pharmacist. If you take generic medications these attributes may change from one manufacturer to another. Your pharmacist should inform you of any changes when you refill your prescription. If they do not, get clarification prior to taking the medication. Please note that the Federal Drug Administration (FDA) does not regulate supplements. The FDA only steps in when a specific supplement proves to cause physical harm or contains a regulated ingredient. Due to counterfeiting, Underwriters Laboratories (UL) changed their label design three times since 1996. The new gold label should be attached to the cord

  8. Mediate gamma radiation effects on some packaged food items

    Science.gov (United States)

    Inamura, Patricia Y.; Uehara, Vanessa B.; Teixeira, Christian A. H. M.; del Mastro, Nelida L.

    2012-08-01

    For most of prepackaged foods a 10 kGy radiation dose is considered the maximum dose needed; however, the commercially available and practically accepted packaging materials must be suitable for such application. This work describes the application of ionizing radiation on several packaged food items, using 5 dehydrated food items, 5 ready-to-eat meals and 5 ready-to-eat food items irradiated in a 60Co gamma source with a 3 kGy dose. The quality evaluation of the irradiated samples was performed 2 and 8 months after irradiation. Microbiological analysis (bacteria, fungus and yeast load) was performed. The sensory characteristics were established for appearance, aroma, texture and flavor attributes were also established. From these data, the acceptability of all irradiated items was obtained. All ready-to-eat food items assayed like manioc flour, some pâtés and blocks of raw brown sugar and most of ready-to-eat meals like sausages and chicken with legumes were considered acceptable for microbial and sensory characteristics. On the other hand, the dehydrated food items chosen for this study, such as dehydrated bacon potatoes or pea soups were not accepted by the sensory analysis. A careful dose choice and special irradiation conditions must be used in order to achieve sensory acceptability needed for the commercialization of specific irradiated food items.

  9. Item Banking Enables Stand-Alone Measurement of Driving Ability.

    Science.gov (United States)

    Khadka, Jyoti; Fenwick, Eva K; Lamoureux, Ecosse L; Pesudovs, Konrad

    2016-12-01

    To explore whether large item sets, as used in item banking, enable important latent traits, such as driving, to form stand-alone measures. The 88-item activity limitation (AL) domain of the glaucoma module of the Eye-tem Bank was interviewer-administered to patients with glaucoma. Rasch analysis was used to calibrate all items in AL domain on the same interval-level scale and test its psychometric properties. Based on Rasch dimensionality metrics, the AL scale was separated into subscales. These subscales underwent separate Rasch analyses to test whether they could form stand-alone measures. Independence of these measures was tested with Bland and Altman (B&A) Limit of Agreement (LOA). The AL scale was completed by 293 patients (median age, 71 years). It demonstrated excellent precision (3.12). However, Rasch analysis dimensionality metrics indicated that the domain arguably had other dimensions which were driving, luminance, and reading. Once separated, the remaining AL items, driving and luminance subscales, were unidimensional and had excellent precision of 4.25, 2.94, and 2.22, respectively. The reading subscale showed poor precision (1.66), so it was not examined further. The luminance subscale demonstrated excellent agreement (mean bias, 0.2 logit; 95% LOA, -2.2 to 3.3 logit); however, the driving subscale demonstrated poor agreement (mean bias, 1.1 logit; 95% LOA, -4.8 to 7.0 logit) with the AL scale. These findings indicate that driving items in the AL domain of the glaucoma module were perceived and responded to differently from the other AL items, but the reading and luminance items were not. Therefore, item banking enables stand-alone measurement of driving ability in glaucoma.

  10. How did Danish students solve the PISA CBAS items?

    DEFF Research Database (Denmark)

    Sørensen, Helene; Andersen, Annemarie Møller

    2009-01-01

    is not gender-neutral to the same extent as the paper-and-pencil test. Instead of that the items appear to be more difficult for girls than boys in all three countries which went through CBAS. Within the Danish context we have classified the different items and compared the results with patterns in girls......’ and boys’ answers. Twelve items were chosen for focus group interviews with two groups of students – three girls and three boys. The analysis shows that the students need other competencies than in the paper-and-pencil test and another problem solving strategy. In the Danish context this may be one...

  11. Reprocessing semicritical items: Current issues and new technologies.

    Science.gov (United States)

    Rutala, William A; Weber, David J

    2016-05-02

    Semicritical medical devices are defined as items that come into contact with mucous membranes or nonintact skin (eg, gastrointestinal endoscopes, endocavitary probes). Such medical devices minimally require high-level disinfection. Because many of these items are temperature sensitive, low-temperature chemical methods are usually used rather than steam sterilization. Strict adherence to current guidelines is required because more outbreaks have been linked to inadequately cleaned or disinfected endoscopes and other semicritical items undergoing high-level disinfection than any other reusable medical device.

  12. Identifying predictors of physics item difficulty: A linear regression approach

    Directory of Open Access Journals (Sweden)

    Vanes Mesic

    2011-06-01

    Full Text Available Large-scale assessments of student achievement in physics are often approached with an intention to discriminate students based on the attained level of their physics competencies. Therefore, for purposes of test design, it is important that items display an acceptable discriminatory behavior. To that end, it is recommended to avoid extraordinary difficult and very easy items. Knowing the factors that influence physics item difficulty makes it possible to model the item difficulty even before the first pilot study is conducted. Thus, by identifying predictors of physics item difficulty, we can improve the test-design process. Furthermore, we get additional qualitative feedback regarding the basic aspects of student cognitive achievement in physics that are directly responsible for the obtained, quantitative test results. In this study, we conducted a secondary analysis of data that came from two large-scale assessments of student physics achievement at the end of compulsory education in Bosnia and Herzegovina. Foremost, we explored the concept of “physics competence” and performed a content analysis of 123 physics items that were included within the above-mentioned assessments. Thereafter, an item database was created. Items were described by variables which reflect some basic cognitive aspects of physics competence. For each of the assessments, Rasch item difficulties were calculated in separate analyses. In order to make the item difficulties from different assessments comparable, a virtual test equating procedure had to be implemented. Finally, a regression model of physics item difficulty was created. It has been shown that 61.2% of item difficulty variance can be explained by factors which reflect the automaticity, complexity, and modality of the knowledge structure that is relevant for generating the most probable correct solution, as well as by the divergence of required thinking and interference effects between intuitive and formal

  13. Quantum Partial Search of a Database with Several Target Items

    CERN Document Server

    Choi, B S; Choi, Byung-Soo; Korepin, Vladimir E

    2006-01-01

    We consider a database separated into blocks. Blocks containing target items are called target blocks. Blocks without target items are called non-target blocks. We consider a case, when each target block has the same number of target items. We present a fast quantum algorithm, which finds one of the target blocks. Our algorithm is based on Grover-Radhakrishnan algorithm of partial search. We minimize the number of queries to the oracle. We prove that for large size of the blocks the proposed algorithm shows universality and also shows how to use the algorithm for finite blocks.

  14. Identifying predictors of physics item difficulty: A linear regression approach

    Science.gov (United States)

    Mesic, Vanes; Muratovic, Hasnija

    2011-06-01

    Large-scale assessments of student achievement in physics are often approached with an intention to discriminate students based on the attained level of their physics competencies. Therefore, for purposes of test design, it is important that items display an acceptable discriminatory behavior. To that end, it is recommended to avoid extraordinary difficult and very easy items. Knowing the factors that influence physics item difficulty makes it possible to model the item difficulty even before the first pilot study is conducted. Thus, by identifying predictors of physics item difficulty, we can improve the test-design process. Furthermore, we get additional qualitative feedback regarding the basic aspects of student cognitive achievement in physics that are directly responsible for the obtained, quantitative test results. In this study, we conducted a secondary analysis of data that came from two large-scale assessments of student physics achievement at the end of compulsory education in Bosnia and Herzegovina. Foremost, we explored the concept of “physics competence” and performed a content analysis of 123 physics items that were included within the above-mentioned assessments. Thereafter, an item database was created. Items were described by variables which reflect some basic cognitive aspects of physics competence. For each of the assessments, Rasch item difficulties were calculated in separate analyses. In order to make the item difficulties from different assessments comparable, a virtual test equating procedure had to be implemented. Finally, a regression model of physics item difficulty was created. It has been shown that 61.2% of item difficulty variance can be explained by factors which reflect the automaticity, complexity, and modality of the knowledge structure that is relevant for generating the most probable correct solution, as well as by the divergence of required thinking and interference effects between intuitive and formal physics knowledge

  15. P2-19: The Effect of item Repetition on Item-Context Association Depends on the Prior Exposure of Items

    Directory of Open Access Journals (Sweden)

    Hongmi Lee

    2012-10-01

    Full Text Available Previous studies have reported conflicting findings on whether item repetition has beneficial or detrimental effects on source memory. To reconcile such contradictions, we investigated whether the degree of pre-exposure of items can be a potential modulating factor. The experimental procedures spanned two consecutive days. On Day 1, participants were exposed to a set of unfamiliar faces. On Day 2, the same faces presented on the previous day were used again in half of the participants, whereas novel faces were used for the other half. Day 2 procedures consisted of three successive phases: item repetition, source association, and source memory test. In the item repetition phase, half of the face stimuli were repeatedly presented while participants were making male/female judgments. During the source association phase, both the repeated and the unrepeated faces appeared in one of the four locations on the screen. Finally, participants were tested on the location in which a given face was presented during the previous phase and reported the confidence of their memory. Source memory accuracy was measured as the percentage of correct non-guess trials. As results, we found a significant interaction between prior exposure and repetition. Repetition impaired source memory when the items had been pre-exposed on Day 1, while it led to greater accuracy in novel ones. These results show that pre-experimental exposure can modulate the effects of repetition on associative binding between an item and its contextual information, suggesting that pre-existing representation and novelty signal interact to form new episodic memory.

  16. Application of multidimensional item response theory models to longitudinal data

    NARCIS (Netherlands)

    Marvelde, te Janneke M.; Glas, Cees A.W.; Van Landeghem, Georges; Van Damme, Jan

    2006-01-01

    The application of multidimensional item response theory (IRT) models to longitudinal educational surveys where students are repeatedly measured is discussed and exemplified. A marginal maximum likelihood (MML) method to estimate the parameters of a multidimensional generalized partial credit model

  17. Inconsistent Student Responses in TIMSS Questionnaire Items on Mathematics Lessons

    Directory of Open Access Journals (Sweden)

    Selda Yıldırım

    2009-12-01

    Full Text Available This study investigated consistency among Turkish students’ responses to TIMSS 2007 questionnaire items on frequency of certain activities in mathematics classrooms. In Turkey, 4476 students from 143 schools participated in the study. Analyses have revealed the existence of inconsistencies in student responses as indicated by high proportion of within-class variance components. That is, students in same class specified fluctuating frequencies to certain classroom activities, showing that some factors had an affect on perception of individuals. Further analyses showed that students at different levels of mathematics achievement reported differently on frequency of classroom activities, and precise items were answered more consistently compared to items containing vague terms. Using factor scores instead of individual item responses contributed consistency of responses within classes but only to a small extent. Based on the findings, this study also provided implications for questionnaire design.

  18. An Item Factor Analysis of the Mooney Problem Check List

    Science.gov (United States)

    Stewart, David W.; Deiker, Thomas

    1976-01-01

    Explores the factor structure of the Mooney Problem Check List (MPCL) at the junior and senior high school level by undertaking a large obverse factor analysis of item responses in three adolescent criterion groups. (Author/DEP)

  19. MODERATING ABILITY OF ITEM RESPONSE THEORY THROUGH PRIOR GUESSING PARAMETER

    Directory of Open Access Journals (Sweden)

    Siow Hoo Leong

    2013-01-01

    Full Text Available A psycho-technology approach to discouraging guessing in multiple-choice formatted item can be done through reducing the a priori guessing probability of an item. This study proposes a psychometrics framework of Item Response Theory (IRT to model the effect of having various priori guessing probabilities across different items. A prior guessing parameter is proposed to serves as a moderator of the ability parameter in the two parameter logistic IRT. The results show that the proposed prior guessing parameter successfully moderates the ability parameters of the subjects with different degrees of guessing. However, the prior guessing parameter is insensitive when the performance pattern is mixed within the testlet but similar across testlet with different priori guessing probabilities.

  20. Etendatakse 10-aastase kirjamehe näitemäng

    Index Scriptorium Estoniae

    2003-01-01

    Tartu katoliku kooli 3. klassi poisi Mario Raitari näitemäng "Kristoph Silvester von Tenderi lugu", mis on esimene lugu sarjast "Mario Raitari kroonika". Näidendit etendab Linnupuu Lastepereteatri trupp

  1. Item response theory modeling with nonignorable missing data

    NARCIS (Netherlands)

    Pimentel, Jonald L.

    2005-01-01

    This thesis discusses methods to detect nonignorable missing data and methods to adjust for the bias caused by nonignorable missing data, both by introducing a model for the missing data indicator using item response theory (IRT) models.

  2. Etendatakse 10-aastase kirjamehe näitemäng

    Index Scriptorium Estoniae

    2003-01-01

    Tartu katoliku kooli 3. klassi poisi Mario Raitari näitemäng "Kristoph Silvester von Tenderi lugu", mis on esimene lugu sarjast "Mario Raitari kroonika". Näidendit etendab Linnupuu Lastepereteatri trupp

  3. Solving Verbal Analogies: Some Cognitive Components of Intelligence Test Items

    Science.gov (United States)

    Whitely, Susan E.

    1976-01-01

    The results indicate that although relational concepts influence the cognitive aptitudes which are reflected in analogy item performance, success in solving analogies does not depend on individual differences in some major aspects of processing relationships. (Author/DEP)

  4. Separation index and fit items of creative thinking skills assessment

    Directory of Open Access Journals (Sweden)

    Andi Ulfa Tenri Pada

    2016-06-01

    Full Text Available This article discusses the evaluation results of the separation index and fit item of creative thinking skills assessment that supports the conation aspect of prospective biology teachers in Aceh. This assessment consists of 37 items of divergent tasks, which is the application of human physiology courses that support the conation aspects. The participants were selected from the Biology Education Program, Faculty of Teacher Training and Education, Syiah Kuala University. The data were analyzed using the Quest software including the separation index and fit item. The results indicate that the creative thinking skills assessment instrument that supports the conation aspect of prospective biology teachers has a good separation index and all the items fit PCM-1PL.

  5. Basic Stand Alone Medicare DME Line Items PUF

    Data.gov (United States)

    U.S. Department of Health & Human Services — This release contains the Basic Stand Alone (BSA) Durable Medical Equipment (DME) Line Items Public Use Files (PUF) with information from Medicare DME claims. The...

  6. Separation index and fit items of creative thinking skills assessment

    Directory of Open Access Journals (Sweden)

    Andi Ulfa Tenri Pada

    2016-06-01

    Full Text Available This article discusses the evaluation results of the separation index and fit item of creative thinking skills assessment that supports the conation aspect of prospective biology teachers in Aceh. This assessment consists of 37 items of divergent tasks, which is the application of human physiology courses that support the conation aspects. The participants were selected from the Biology Education Program, Faculty of Teacher Training and Education, Syiah Kuala University. The data were analyzed using the Quest software including the separation index and fit item. The results indicate that the creative thinking skills assessment instrument that supports the conation aspect of prospective biology teachers has a good separation index and all the items fit PCM-1PL.

  7. Development of six PROMIS pediatrics proxy-report item banks

    Directory of Open Access Journals (Sweden)

    Irwin Debra E

    2012-02-01

    Full Text Available Abstract Background Pediatric self-report should be considered the standard for measuring patient reported outcomes (PRO among children. However, circumstances exist when the child is too young, cognitively impaired, or too ill to complete a PRO instrument and a proxy-report is needed. This paper describes the development process including the proxy cognitive interviews and large-field-test survey methods and sample characteristics employed to produce item parameters for the Patient Reported Outcomes Measurement Information System (PROMIS pediatric proxy-report item banks. Methods The PROMIS pediatric self-report items were converted into proxy-report items before undergoing cognitive interviews. These items covered six domains (physical function, emotional distress, social peer relationships, fatigue, pain interference, and asthma impact. Caregivers (n = 25 of children ages of 5 and 17 years provided qualitative feedback on proxy-report items to assess any major issues with these items. From May 2008 to March 2009, the large-scale survey enrolled children ages 8-17 years to complete the self-report version and caregivers to complete the proxy-report version of the survey (n = 1548 dyads. Caregivers of children ages 5 to 7 years completed the proxy report survey (n = 432. In addition, caregivers completed other proxy instruments, PedsQL™ 4.0 Generic Core Scales Parent Proxy-Report version, PedsQL™ Asthma Module Parent Proxy-Report version, and KIDSCREEN Parent-Proxy-52. Results Item content was well understood by proxies and did not require item revisions but some proxies clearly noted that determining an answer on behalf of their child was difficult for some items. Dyads and caregivers of children ages 5-17 years old were enrolled in the large-scale testing. The majority were female (85%, married (70%, Caucasian (64% and had at least a high school education (94%. Approximately 50% had children with a chronic health condition, primarily

  8. Comparative Racial Analysis of Enlisted Advancement Exams: Item Differentiation

    Science.gov (United States)

    1977-01-01

    New York: Holt, Rinehart, & Winston, 1963. Henryssen, S. Gathering, analyzing, and using data on test items. In R. L. Thorndike (Ed.), Educational ...validation of test items. Journal of Educational Psychology , 1939, 30, 674-680. (See Appendix) Lawshe, C. H., Jr. A nomograph for estimating the...N. Planning the objective test. In R. L. Thorndike (Ed.), Educational measurement (2nd ed.). Washington, D.C.: American Council on Education , 1971

  9. Magnetometer Response of Commonly Found Munitions Items and Munitions Surrogates

    Science.gov (United States)

    2012-01-12

    Predicted minimum magnetometer anomaly strength for a variety of munitions and surrogate items at a burial depth corresponding to 11x their respective...Response Live Site Demonstrations. The authors would like to thank Craig Murray of Parsons and Stephen Billings of Sky Research for their...variety of munitions and surrogate items at a burial depth corresponding to 11x their respective diameter. The sensor is assumed to be deployed as part

  10. Metadata Embeddings for User and Item Cold-start Recommendations

    OpenAIRE

    Kula, Maciej

    2015-01-01

    I present a hybrid matrix factorisation model representing users and items as linear combinations of their content features' latent factors. The model outperforms both collaborative and content-based models in cold-start or sparse interaction data scenarios (using both user and item metadata), and performs at least as well as a pure collaborative matrix factorisation model where interaction data is abundant. Additionally, feature embeddings produced by the model encode semantic information in...

  11. Differential Weighting of Items to Improve University Admission Test Validity

    OpenAIRE

    Eduardo Backhoff Escudero; Felipe Tirado Segura; Norma Larrazolo Reyna

    2001-01-01

    This paper gives an evaluation of different ways to increase university admission test criterion-related validity, by differentially weighting test items. We compared four methods of weighting multiple-choice items of the Basic Skills and Knowledge Examination (EXHCOBA): (1) punishing incorrect responses by a constant factor, (2) weighting incorrect responses, considering the levels of error, (3) weighting correct responses, considering the item’s difficulty, based on the Classic Measur...

  12. A Case Study on an Item Writing Process: Use of Test Specifications, Nature of Group Dynamics, and Individual Item Writers' Characteristics

    Science.gov (United States)

    Kim, Jiyoung; Chi, Youngshin; Huensch, Amanda; Jun, Heesung; Li, Hongli; Roullion, Vanessa

    2010-01-01

    This article discusses a case study on an item writing process that reflects on our practical experience in an item development project. The purpose of the article is to share our lessons from the experience aiming to demystify item writing process. The study investigated three issues that naturally emerged during the project: how item writers use…

  13. MMPI-2 Item Endorsements in Dissociative Identity Disorder vs. Simulators.

    Science.gov (United States)

    Brand, Bethany L; Chasson, Gregory S; Palermo, Cori A; Donato, Frank M; Rhodes, Kyle P; Voorhees, Emily F

    2016-03-01

    Elevated scores on some MMPI-2 (Minnesota Multiphasic Inventory-2) validity scales are common among patients with dissociative identity disorder (DID), which raises questions about the validity of their responses. Such patients show elevated scores on atypical answers (F), F-psychopathology (Fp), atypical answers in the second half of the test (FB), schizophrenia (Sc), and depression (D) scales, with Fp showing the greatest utility in distinguishing them from coached and uncoached DID simulators. In the current study, we investigated the items on the MMPI-2 F, Fp, FB, Sc, and D scales that were most and least commonly endorsed by participants with DID in our 2014 study and compared these responses with those of coached and uncoached DID simulators. The comparisons revealed that patients with DID most frequently endorsed items related to dissociation, trauma, depression, fearfulness, conflict within family, and self-destructiveness. The coached group more successfully imitated item endorsements of the DID group than did the uncoached group. However, both simulating groups, especially the uncoached group, frequently endorsed items that were uncommonly endorsed by the DID group. The uncoached group endorsed items consistent with popular media portrayals of people with DID being violent, delusional, and unlawful. These results suggest that item endorsement patterns can provide useful information to clinicians making determinations about whether an individual is presenting with DID or feigning. © 2016 American Academy of Psychiatry and the Law.

  14. Promoting cold-start items in recommender systems.

    Science.gov (United States)

    Liu, Jin-Hu; Zhou, Tao; Zhang, Zi-Ke; Yang, Zimo; Liu, Chuang; Li, Wei-Min

    2014-01-01

    As one of the major challenges, cold-start problem plagues nearly all recommender systems. In particular, new items will be overlooked, impeding the development of new products online. Given limited resources, how to utilize the knowledge of recommender systems and design efficient marketing strategy for new items is extremely important. In this paper, we convert this ticklish issue into a clear mathematical problem based on a bipartite network representation. Under the most widely used algorithm in real e-commerce recommender systems, the so-called item-based collaborative filtering, we show that to simply push new items to active users is not a good strategy. Interestingly, experiments on real recommender systems indicate that to connect new items with some less active users will statistically yield better performance, namely, these new items will have more chance to appear in other users' recommendation lists. Further analysis suggests that the disassortative nature of recommender systems contributes to such observation. In a word, getting in-depth understanding on recommender systems could pave the way for the owners to popularize their cold-start products with low costs.

  15. Optimal Bayesian Adaptive Design for Test-Item Calibration.

    Science.gov (United States)

    van der Linden, Wim J; Ren, Hao

    2015-06-01

    An optimal adaptive design for test-item calibration based on Bayesian optimality criteria is presented. The design adapts the choice of field-test items to the examinees taking an operational adaptive test using both the information in the posterior distributions of their ability parameters and the current posterior distributions of the field-test parameters. Different criteria of optimality based on the two types of posterior distributions are possible. The design can be implemented using an MCMC scheme with alternating stages of sampling from the posterior distributions of the test takers' ability parameters and the parameters of the field-test items while reusing samples from earlier posterior distributions of the other parameters. Results from a simulation study demonstrated the feasibility of the proposed MCMC implementation for operational item calibration. A comparison of performances for different optimality criteria showed faster calibration of substantial numbers of items for the criterion of D-optimality relative to A-optimality, a special case of c-optimality, and random assignment of items to the test takers.

  16. Promoting Cold-Start Items in Recommender Systems

    Science.gov (United States)

    Liu, Jin-Hu; Zhou, Tao; Zhang, Zi-Ke; Yang, Zimo; Liu, Chuang; Li, Wei-Min

    2014-01-01

    As one of the major challenges, cold-start problem plagues nearly all recommender systems. In particular, new items will be overlooked, impeding the development of new products online. Given limited resources, how to utilize the knowledge of recommender systems and design efficient marketing strategy for new items is extremely important. In this paper, we convert this ticklish issue into a clear mathematical problem based on a bipartite network representation. Under the most widely used algorithm in real e-commerce recommender systems, the so-called item-based collaborative filtering, we show that to simply push new items to active users is not a good strategy. Interestingly, experiments on real recommender systems indicate that to connect new items with some less active users will statistically yield better performance, namely, these new items will have more chance to appear in other users' recommendation lists. Further analysis suggests that the disassortative nature of recommender systems contributes to such observation. In a word, getting in-depth understanding on recommender systems could pave the way for the owners to popularize their cold-start products with low costs. PMID:25479013

  17. Item Pool Design for an Operational Variable-Length Computerized Adaptive Test

    Science.gov (United States)

    He, Wei; Reckase, Mark D.

    2014-01-01

    For computerized adaptive tests (CATs) to work well, they must have an item pool with sufficient numbers of good quality items. Many researchers have pointed out that, in developing item pools for CATs, not only is the item pool size important but also the distribution of item parameters and practical considerations such as content distribution…

  18. Unidimensional IRT Item Parameter Estimates across Equivalent Test Forms with Confounding Specifications within Dimensions

    Science.gov (United States)

    Matlock, Ki Lynn; Turner, Ronna

    2016-01-01

    When constructing multiple test forms, the number of items and the total test difficulty are often equivalent. Not all test developers match the number of items and/or average item difficulty within subcontent areas. In this simulation study, six test forms were constructed having an equal number of items and average item difficulty overall.…

  19. Using a Linear Regression Method to Detect Outliers in IRT Common Item Equating

    Science.gov (United States)

    He, Yong; Cui, Zhongmin; Fang, Yu; Chen, Hanwei

    2013-01-01

    Common test items play an important role in equating alternate test forms under the common item nonequivalent groups design. When the item response theory (IRT) method is applied in equating, inconsistent item parameter estimates among common items can lead to large bias in equated scores. It is prudent to evaluate inconsistency in parameter…

  20. Using a Linear Regression Method to Detect Outliers in IRT Common Item Equating

    Science.gov (United States)

    He, Yong; Cui, Zhongmin; Fang, Yu; Chen, Hanwei

    2013-01-01

    Common test items play an important role in equating alternate test forms under the common item nonequivalent groups design. When the item response theory (IRT) method is applied in equating, inconsistent item parameter estimates among common items can lead to large bias in equated scores. It is prudent to evaluate inconsistency in parameter…

  1. A Stepwise Test Characteristic Curve Method to Detect Item Parameter Drift

    Science.gov (United States)

    Guo, Rui; Zheng, Yi; Chang, Hua-Hua

    2015-01-01

    An important assumption of item response theory is item parameter invariance. Sometimes, however, item parameters are not invariant across different test administrations due to factors other than sampling error; this phenomenon is termed item parameter drift. Several methods have been developed to detect drifted items. However, most of the…

  2. Few items in the thyroid-related quality of life instrument ThyPRO exhibited differential item functioning

    DEFF Research Database (Denmark)

    Watt, Torquil; Grønvold, Mogens; Hegedüs, Laszlo;

    2014-01-01

    To evaluate the extent of differential item functioning (DIF) within the thyroid-specific quality of life patient-reported outcome measure, ThyPRO, according to sex, age, education and thyroid diagnosis.......To evaluate the extent of differential item functioning (DIF) within the thyroid-specific quality of life patient-reported outcome measure, ThyPRO, according to sex, age, education and thyroid diagnosis....

  3. PENGEMBANGAN TES BERPIKIR KRITIS DENGAN PENDEKATAN ITEM RESPONSE THEORY

    Directory of Open Access Journals (Sweden)

    Fajrianthi Fajrianthi

    2016-06-01

    Full Text Available Penelitian ini bertujuan untuk menghasilkan sebuah alat ukur (tes berpikir kritis yang valid dan reliabel untuk digunakan, baik dalam lingkup pendidikan maupun kerja di Indonesia. Tahapan penelitian dilakukan berdasarkan tahap pengembangan tes menurut Hambleton dan Jones (1993. Kisi-kisi dan pembuatan butir didasarkan pada konsep dalam tes Watson-Glaser Critical Thinking Appraisal (WGCTA. Pada WGCTA, berpikir kritis terdiri dari lima dimensi yaitu Inference, Recognition Assumption, Deduction, Interpretation dan Evaluation of arguments. Uji coba tes dilakukan pada 1.453 peserta tes seleksi karyawan di Surabaya, Gresik, Tuban, Bojonegoro, Rembang. Data dikotomi dianalisis dengan menggunakan model IRT dengan dua parameter yaitu daya beda dan tingkat kesulitan butir. Analisis dilakukan dengan menggunakan program statistik Mplus versi 6.11 Sebelum melakukan analisis dengan IRT, dilakukan pengujian asumsi yaitu uji unidimensionalitas, independensi lokal dan Item Characteristic Curve (ICC. Hasil analisis terhadap 68 butir menghasilkan 15 butir dengan daya beda yang cukup baik dan tingkat kesulitan butir yang berkisar antara –4 sampai dengan 2.448. Sedikitnya jumlah butir yang berkualitas baik disebabkan oleh kelemahan dalam menentukan subject matter experts di bidang berpikir kritis dan pemilihan metode skoring. Kata kunci: Pengembangan tes, berpikir kritis, item response theory   DEVELOPING CRITICAL THINKING TEST UTILISING ITEM RESPONSE THEORY Abstract The present study was aimed to develop a valid and reliable instrument in assesing critical thinking which can be implemented both in educational and work settings in Indonesia. Following the Hambleton and Jones’s (1993 procedures on test development, the study developed the instrument by employing the concept of critical thinking from Watson-Glaser Critical Thinking Appraisal (WGCTA. The study included five dimensions of critical thinking as adopted from the WGCTA: Inference, Recognition

  4. Determining differential item functioning and its effect on the test scores of selected pib indexes, using item response theory techniques

    Directory of Open Access Journals (Sweden)

    Pieter Schaap

    2001-02-01

    Full Text Available The objective of this article is to present the results of an investigation into the item and test characteristics of two tests of the Potential Index Batteries (PIB in terms of differential item functioning (DIP and the effect thereof on test scores of different race groups. The English Vocabulary (Index 12 and Spelling Tests (Index 22 of the PIB were analysed for white, black and coloured South Africans. Item response theory (IRT methods were used to identify items which function differentially for white, black and coloured race groups. Opsomming Die doel van hierdie artikel is om die resultate van n ondersoek na die item- en toetseienskappe van twee PIB (Potential Index Batteries toetse in terme van itemsydigheid en die invloed wat dit op die toetstellings van rassegroepe het, weer te gee. Die Potential Index Batteries (PIB se Engelse Woordeskat (Index 12 en Spellingtoetse (Index 22 is ten opsigte van blanke, swart en gekleurde Suid-Afrikaners ontleed. Itemresponsteorie (IRT is gebruik om items te identifiseer wat as sydig (DIP vir die onderskeie rassegroepe beskou kan word.

  5. Suspect/Counterfeit Items Information Guide for Subcontractors/Suppliers

    Energy Technology Data Exchange (ETDEWEB)

    Tessmar, Nancy D. [Los Alamos National Laboratory; Salazar, Michael J. [Los Alamos National Laboratory

    2012-09-18

    Counterfeiting of industrial and commercial grade items is an international problem that places worker safety, program objectives, expensive equipment, and security at risk. In order to prevent the introduction of Suspect/Counterfeit Items (S/CI), this information sheet is being made available as a guide to assist in the implementation of S/CI awareness and controls, in conjunction with subcontractor's/supplier's quality assurance programs. When it comes to counterfeit goods, including industrial materials, items, and equipment, no market is immune. Some manufactures have been known to misrepresent their products and intentionally use inferior materials and processes to manufacture substandard items, whose properties can significantly cart from established standards and specifications. These substandard items termed by the Department of Energy (DOE) as S/CI, pose immediate and potential threats to the safety of DOE and contractor workers, the public, and the environment. Failure of certain systems and processes caused by an S/CI could also have national security implications at Los Alamos National Laboratory (LANL). Nuclear Safety Rules (federal Laws), DOE Orders, and other regulations set forth requirements for DOE contractors to implement effective controls to assure that items and services meet specified requirements. This includes techniques to implement and thereby minimizing the potential threat of entry of S/CI to LANL. As a qualified supplier of goods or services to the LANL, your company will be required to establish and maintain effective controls to prevent the introduction of S/CI to LANL. This will require that your company warrant that all items (including their subassemblies, components, and parts) sold to LANL are genuine (i.e. not counterfeit), new, and unused, and conform to the requirements of the LANL purchase orders/contracts unless otherwise approved in writing to the Los Alamos National Security (LANS) contract administrator

  6. Significant effects due to rephrasing the Maslach Burnout Inventory's personal accomplishment items.

    NARCIS (Netherlands)

    A.M. Bouman; J. Hoogstraten; H. te Brake

    2002-01-01

    Subjects, 292 Dutch psychology students answering negatively phrased Personal Accomplishment items, reported a more personally competent feeling than subjects answering positively phrased personal accomplishment items. The way of phrasing personal accomplishment items significantly affects the answe

  7. Significant effects due to rephrasing the Maslach Burnout Inventory's personal accomplishment items.

    Science.gov (United States)

    Bouman, Anne Marthe; Te Brake, Hans; Hoogstraten, Johan

    2002-12-01

    Subjects, 292 Dutch psychology students answering negatively phrased Personal Accomplishment items, reported a more personally competent feeling than subjects answering positively phrased personal accomplishment items. The way of phrasing personal accomplishment items significantly affects the answers given.

  8. On revenue maximization for selling multiple independently distributed items.

    Science.gov (United States)

    Li, Xinye; Yao, Andrew Chi-Chih

    2013-07-09

    Consider the revenue-maximizing problem in which a single seller wants to sell k different items to a single buyer, who has independently distributed values for the items with additive valuation. The k = 1 case was completely resolved by Myerson's classical work in 1981, whereas for larger k the problem has been the subject of much research efforts ever since. Recently, Hart and Nisan analyzed two simple mechanisms: selling the items separately, or selling them as a single bundle. They showed that selling separately guarantees at least a c/log2 k fraction of the optimal revenue; and for identically distributed items, bundling yields at least a c/log k fraction of the optimal revenue. In this paper, we prove that selling separately guarantees at least c/log k fraction of the optimal revenue, whereas for identically distributed items, bundling yields at least a constant fraction of the optimal revenue. These bounds are tight (up to a constant factor), settling the open questions raised by Hart and Nisan. The results are valid for arbitrary probability distributions without restrictions. Our results also have implications on other interesting issues, such as monotonicity and randomization of selling mechanisms.

  9. INTERDEPENDENCE OF COST AND QUALITY OF ITEMS OF REAL ESTATE

    Directory of Open Access Journals (Sweden)

    Lukmanova Inessa Galeevna

    2012-10-01

    Calculation of reduction of the overall cost of the real estate item that has improved quality indicators in comparison with the overall cost of the real estate item of satisfactory quality, taken as a benchmark, is made. The nature of interrelation between the quality of building works and maintenance expenses is provided. The overall cost of the item increases alongside with the increase of its quality, therefore the pre-set quality indicator should be defined by taking account of the market conditions, rates charged for building works and payable by buyers, and the amount of building works that sell at a higher price. The indicator of the overall cost of the item of real estate, if forthcoming operational expenses are taken into account, i.e. calculated for the course of the overall life cycle of the item, is essential if the investor is going to maintain the building. Investors often act as sellers of completed buildings; therefore, the product price set at the time when it is offered for sale is of particular importance.

  10. Verbal fluency: Effect of time on item generation

    Directory of Open Access Journals (Sweden)

    Mayra Jacuviske Venegas

    Full Text Available Abstract The distribution of item generation/time in the performance of elderly on verbal fluency (VF remains unknown. Objective: To analyze the number of items, their distribution and impact of the first quartile on the final test result. Methods: 31 individuals performed the tests (average age=74 years; schooling=8.16 years. Results: The number of items produced in the first quartile differed from the other quartiles for both semantic and phonologic VF where 40% of items were produced in the first quartile. No effect of age was found and schooling influenced performance on the first and second quartiles of semantic VF and on the 1st, 2nd and 3rd quartiles of phonemic VF. Discussion: This study contributes with the finding that asymptotic levels are attained prior to the 30 seconds observed in other studies, being reached at the 15-second mark. Furthermore, schooling was found to be associated to the number of items produced in both the first and 2nd quartiles for semantic VF, and in 1st, 2nd and 3rd quartiles for phonemic fluency. Conclusion: The schooling effect was noted both in semantic and executive aspects of VF. The brief form of the VF test may represent a promising tool for clinical evaluation.

  11. ‘Forget me (not?’ – Remembering forget-items versus un-cued items in directed forgetting

    Directory of Open Access Journals (Sweden)

    Bastian eZwissler

    2015-11-01

    Full Text Available Humans need to be able to selectively control their memories. Here, we investigate the underlying processes in item-method directed forgetting and compare the classic active memory cues in this paradigm with a passive instruction. Typically, individual items are presented and each is followed by either a forget- or remember-instruction. On a surprise test of all items, memory is then worse for to-be-forgotten items (TBF compared to to-be-remembered items (TBR. This is thought to result from selective rehearsal of TBR, or from active inhibition of TBF, or from both. However, evidence suggests that if a forget instruction initiates active processing, paradoxical effects may also arise. To investigate the underlying mechanisms, four experiments were conducted where un-cued items (UI were introduced and recognition performance was compared between TBR, TBF and UI stimuli. Accuracy was encouraged via a performance-dependent monetary bonus. Across all experiments, including perceptually fully matched variants, memory accuracy for TBF was reduced compared to TBR, but better than for UI. Moreover, participants used a more conservative response criterion when responding to TBF stimuli. Thus, ironically, the F cue results in active processing, but this does not have inhibitory effects that would impair recognition memory beyond a un-cued baseline condition. This casts doubts on inhibitory accounts of item-method directed forgetting and is also difficult to reconcile with pure selective rehearsal of TBR. While the F-cue does induce active processing, this does not result in particularly successful forgetting. The pattern seems most consistent with the notion of ironic processing.

  12. The effects of violating standard item writing principles on tests and students: the consequences of using flawed test items on achievement examinations in medical education.

    Science.gov (United States)

    Downing, Steven M

    2005-01-01

    The purpose of this research was to study the effects of violations of standard multiple-choice item writing principles on test characteristics, student scores, and pass-fail outcomes. Four basic science examinations, administered to year-one and year-two medical students, were randomly selected for study. Test items were classified as either standard or flawed by three independent raters, blinded to all item performance data. Flawed test questions violated one or more standard principles of effective item writing. Thirty-six to sixty-five percent of the items on the four tests were flawed. Flawed items were 0-15 percentage points more difficult than standard items measuring the same construct. Over all four examinations, 646 (53%) students passed the standard items while 575 (47%) passed the flawed items. The median passing rate difference between flawed and standard items was 3.5 percentage points, but ranged from -1 to 35 percentage points. Item flaws had little effect on test score reliability or other psychometric quality indices. Results showed that flawed multiple-choice test items, which violate well established and evidence-based principles of effective item writing, disadvantage some medical students. Item flaws introduce the systematic error of construct-irrelevant variance to assessments, thereby reducing the validity evidence for examinations and penalizing some examinees.

  13. Item banking to improve, shorten and computerize self-reported fatigue: an illustration of steps to create a core item bank from the FACIT-Fatigue Scale.

    Science.gov (United States)

    Lai, Jin-shei; Cella, David; Chang, Chih-Hung; Bode, Rita K; Heinemann, Allen W

    2003-08-01

    Fatigue is a common symptom among cancer patients and the general population. Due to its subjective nature, fatigue has been difficult to effectively and efficiently assess. Modern computerized adaptive testing (CAT) can enable precise assessment of fatigue using a small number of items from a fatigue item bank. CAT enables brief assessment by selecting questions from an item bank that provide the maximum amount of information given a person's previous responses. This article illustrates steps to prepare such an item bank, using 13 items from the Functional Assessment of Chronic Illness Therapy Fatigue Subscale (FACIT-F) as the basis. Samples included 1022 cancer patients and 1010 people from the general population. An Item Response Theory (IRT)-based rating scale model, a polytomous extension of the Rasch dichotomous model was utilized. Nine items demonstrating acceptable psychometric properties were selected and positioned on the fatigue continuum. The fatigue levels measured by these nine items along with their response categories covered 66.8% of the general population and 82.6% of the cancer patients. Although the operational CAT algorithms to handle polytomously scored items are still in progress, we illustrated how CAT may work by using nine core items to measure level of fatigue. Using this illustration, a fatigue measure comparable to its full-length 13-item scale administration was obtained using four items. The resulting item bank can serve as a core to which will be added a psychometrically sound and operational item bank covering the entire fatigue continuum.

  14. Differential item functioning analysis by applying multiple comparison procedures.

    Science.gov (United States)

    Eusebi, Paolo; Kreiner, Svend

    2015-01-01

    Analysis within a Rasch measurement framework aims at development of valid and objective test score. One requirement of both validity and objectivity is that items do not show evidence of differential item functioning (DIF). A number of procedures exist for the assessment of DIF including those based on analysis of contingency tables by Mantel-Haenszel tests and partial gamma coefficients. The aim of this paper is to illustrate Multiple Comparison Procedures (MCP) for analysis of DIF relative to a variable defining a very large number of groups, with an unclear ordering with respect to the DIF effect. We propose a single step procedure controlling the false discovery rate for DIF detection. The procedure applies for both dichotomous and polytomous items. In addition to providing evidence against a hypothesis of no DIF, the procedure also provides information on subset of groups that are homogeneous with respect to the DIF effect. A stepwise MCP procedure for this purpose is also introduced.

  15. Analyzing Force Concept Inventory with Item Response Theory

    CERN Document Server

    Wang, Jing

    2010-01-01

    Item Response Theory (IRT) is a popular assessment method used in education measurement, which builds on an assumption of a probability framework connecting students' innate ability and their actual performances on test items. The model transforms students' raw test scores through a nonlinear regression process into a scaled proficiency rating, which can be used to compare results obtained with different test questions. IRT also provides a theoretical approach to address ceiling effect and guessing. We applied IRT to analyze the Force Concept Inventory (FCI). The data was collected from 2802 students taking intro level mechanics courses at The Ohio State University. The data was analyzed with a 3-parameter item response model for multiple choice questions. We describe the procedures of the analysis and discuss the results and the interpretations. The analysis outcomes are compiled to provide a detailed IRT measurement metric of the FCI, which can be easily referenced and used by teachers and researchers for a...

  16. An EPQ model with imperfect items using interval grey numbers

    Directory of Open Access Journals (Sweden)

    Erdal Aydemir

    2015-01-01

    Full Text Available The classic economic production quantity (EPQ model has been widely used to determine the optimal production quantity. However, the analysis for finding an EPQ model has many weaknesses which lead many researchers and practitioners to make extensions in several aspects on the original EPQ model. The basic assumption of EPQ model is that 100% of manufactured products are non-defective that is not valid for many production processes generally. The purpose of this paper is to develop an EPQ model with grey demand rate and cost values with maximum backorder level allowed with the good quality items in units under an imperfect production process. The imperfect items are considered to be low quality items which are sold to a particular purchaser at a lower price and, the others are reworked and scrapped. A mathematical model is developed and then an industrial example is presented on the wooden chipboard production process for illustration of the proposed model.

  17. Line-item Budgeting and Film-Production

    DEFF Research Database (Denmark)

    Friis, Ivar; Hansen, Allan

    2015-01-01

    Purpose – This paper aims to explore the role of line-item budgeting in film production in an effort to illustrate the positive effects that budgetary constraints can have on creativity. Design/methodology/approach – Using Elster’s (2000) constraint theory as a basis for the research, this paper...... options from hampering the creative process. Originality/value – The paper contributes to management control research in two ways. By addressing calls to provide more insight into the positive effects management control constraints might have on creativity, this study explores somewhat ignored aspects...... of line-item budgeting, adding greater insight into the interrelations between creativity and control. By exploring the ways in which line-item budgeting might take on the role of pre-commitment advice and devices in the creative process, this paper further exposes the links between accounting constraints...

  18. Matrix Sampling of Items in Large-Scale Assessments

    Directory of Open Access Journals (Sweden)

    Ruth A. Childs

    2003-07-01

    Full Text Available Matrix sampling of items -' that is, division of a set of items into different versions of a test form..-' is used by several large-scale testing programs. Like other test designs, matrixed designs have..both advantages and disadvantages. For example, testing time per student is less than if each..student received all the items, but the comparability of student scores may decrease. Also,..curriculum coverage is maintained, but reporting of scores becomes more complex. In this paper,..matrixed designs are compared with more traditional designs in nine categories of costs:..development costs, materials costs, administration costs, educational costs, scoring costs,..reliability costs, comparability costs, validity costs, and reporting costs. In choosing among test..designs, a testing program should examine the costs in light of its mandate(s, the content of the..tests, and the financial resources available, among other considerations.

  19. Differential Weighting of Items to Improve University Admission Test Validity

    Directory of Open Access Journals (Sweden)

    Eduardo Backhoff Escudero

    2001-05-01

    Full Text Available This paper gives an evaluation of different ways to increase university admission test criterion-related validity, by differentially weighting test items. We compared four methods of weighting multiple-choice items of the Basic Skills and Knowledge Examination (EXHCOBA: (1 punishing incorrect responses by a constant factor, (2 weighting incorrect responses, considering the levels of error, (3 weighting correct responses, considering the item’s difficulty, based on the Classic Measurement Theory, and (4 weighting correct responses, considering the item’s difficulty, based on the Item Response Theory. Results show that none of these methods increased the instrument’s predictive validity, although they did improve its concurrent validity. It was concluded that it is appropriate to score the test by simply adding up correct responses.

  20. Psychometric analysis of the Ten-Item Perceived Stress Scale.

    Science.gov (United States)

    Taylor, John M

    2015-03-01

    Although the 10-item Perceived Stress Scale (PSS-10) is a popular measure, a review of the literature reveals 3 significant gaps: (a) There is some debate as to whether a 1- or a 2-factor model best describes the relationships among the PSS-10 items, (b) little information is available on the performance of the items on the scale, and (c) it is unclear whether PSS-10 scores are subject to gender bias. These gaps were addressed in this study using a sample of 1,236 adults from the National Survey of Midlife Development in the United States II. Based on self-identification, participants were 56.31% female, 77% White, 17.31% Black and/or African American, and the average age was 54.48 years (SD = 11.69). Findings from an ordinal confirmatory factor analysis suggested the relationships among the items are best described by an oblique 2-factor model. Item analysis using the graded response model provided no evidence of item misfit and indicated both subscales have a wide estimation range. Although t tests revealed a significant difference between the means of males and females on the Perceived Helplessness Subscale (t = 4.001, df = 1234, p < .001), measurement invariance tests suggest that PSS-10 scores may not be substantially affected by gender bias. Overall, the findings suggest that inferences made using PSS-10 scores are valid. However, this study calls into question inferences where the multidimensionality of the PSS-10 is ignored. 2015 APA, all rights reserved

  1. An item response theory analysis of the narcissistic personality inventory.

    Science.gov (United States)

    Ackerman, Robert A; Donnellan, M Brent; Robins, Richard W

    2012-01-01

    This research uses item response theory methods to evaluate the Narcissistic Personality Inventory (NPI; Raskin & Terry, 1988). Analyses using the 2-parameter logistic model were conducted on the total score and the Corry, Merritt, Mrug, and Pamp (2008) and Ackerman et al. (2011) subscales for the NPI. In addition to offering precise information about the psychometric properties of the NPI item pool, these analyses generated insights that can be used to develop new measures of the personality constructs embedded within this frequently used inventory.

  2. Face validity of the single work ability item

    DEFF Research Database (Denmark)

    Gupta, Nidhi; Jensen, Bjørn Søvsø; Søgaard, Karen

    2014-01-01

    PURPOSE: The purpose of this study was to investigate the face validity of the self-reported single item work ability with objectively measured heart rate reserve (%HRR) among blue-collar workers. METHODS: We utilized data from 127 blue-collar workers (Female = 53; Male = 74) aged 18-65 years from...... the cross-sectional "New method for Objective Measurements of physical Activity in Daily living (NOMAD)" study. The workers reported their single item work ability and completed an aerobic capacity cycling test and objective measurements of heart rate reserve monitored with Actiheart for 3-4 days...

  3. On random age and remaining lifetime for populations of items

    DEFF Research Database (Denmark)

    Finkelstein, M.; Vaupel, J.

    2015-01-01

    We consider items that are incepted into operation having already a random (initial) age and define the corresponding remaining lifetime. We show that these lifetimes are identically distributed when the age distribution is equal to the equilibrium distribution of the renewal theory. Then we...... develop the population studies approach to the problem and generalize the setting in terms of stationary and stable populations of items. We obtain new stochastic comparisons for the corresponding population ages and remaining lifetimes that can be useful in applications. Copyright (c) 2014 John Wiley...

  4. Logarithmic inventory model with shortage for deteriorating items

    Directory of Open Access Journals (Sweden)

    Khedlekar Uttam Kumar

    2013-01-01

    Full Text Available In this paper, we have modeled a business which starts with shortage of deteriorating items. After a duration, managers have freedom to order the stock of assurance of committed customers. There are many products that follow logarithmic demand pattern, so in this paper we incorporate it with the shortage of items at the beginning. A new model is developed to obtain the optimal solution for such type of market situation and have obtained some valuable results. Numerical examples and simulation study is appended along with managerial insights.

  5. 17 CFR 229.402 - (Item 402) Executive compensation.

    Science.gov (United States)

    2010-04-01

    ... achievement of the registrant's long-term goals, management's exposure to downside equity performance risk... this Item: (i) The term stock means instruments such as common stock, restricted stock, restricted... instruments that do not have option-like features, and the term option means instruments such as stock...

  6. 17 CFR 229.902 - (Item 902) Individual partnership supplements.

    Science.gov (United States)

    2010-04-01

    ... partnership supplements. 229.902 Section 229.902 Commodity and Securities Exchanges SECURITIES AND EXCHANGE... 1934 AND ENERGY POLICY AND CONSERVATION ACT OF 1975-REGULATION S-K Roll-Up Transactions § 229.902 (Item 902) Individual partnership supplements. (a) If two or more entities are proposed to be included...

  7. 75 FR 63695 - Designation of Biobased Items for Federal Procurement

    Science.gov (United States)

    2010-10-18

    ... ethanol led to additional rain forest destruction. The commenter further stated that because more corn was... Agriculture (USDA) is amending the Guidelines for Designating Biobased Products for Federal Procurement, to add eight sections to designate items within which biobased products will be afforded...

  8. Disentangling Sources of Differential Item Functioning in Multilanguage Assessments.

    Science.gov (United States)

    Ercikan, Kadriye

    2002-01-01

    Disentangled sources of differential item functioning (DIF) in a multilanguage assessment for which multiple factors were expected to be causing DIF. Data for the Third International Mathematics and Science study for four countries and two languages (3,000 to 11,000 cases in each comparison group) reveal amounts and sources of DIF. (SLD)

  9. Using Mixed Methods to Interpret Differential Item Functioning

    Science.gov (United States)

    Benítez, Isabel; Padilla, José-Luis; Hidalgo Montesinos, María Dolores; Sireci, Stephen G.

    2016-01-01

    Analysis of differential item functioning (DIF) is often used to determine if cross-lingual assessments are equivalent across languages. However, evidence on the causes of cross-lingual DIF is still evasive. Expert appraisal is a qualitative method useful for obtaining detailed information about problematic elements in the different linguistic…

  10. An item response curves analysis of the Force Concept Inventory

    Science.gov (United States)

    Morris, Gary A.; Harshman, Nathan; Branum-Martin, Lee; Mazur, Eric; Mzoughi, Taha; Baker, Stephen D.

    2012-09-01

    Several years ago, we introduced the idea of item response curves (IRC), a simplistic form of item response theory (IRT), to the physics education research community as a way to examine item performance on diagnostic instruments such as the Force Concept Inventory (FCI). We noted that a full-blown analysis using IRT would be a next logical step, which several authors have since taken. In this paper, we show that our simple approach not only yields similar conclusions in the analysis of the performance of items on the FCI to the more sophisticated and complex IRT analyses but also permits additional insights by characterizing both the correct and incorrect answer choices. Our IRC approach can be applied to a variety of multiple-choice assessments but, as applied to a carefully designed instrument such as the FCI, allows us to probe student understanding as a function of ability level through an examination of each answer choice. We imagine that physics teachers could use IRC analysis to identify prominent misconceptions and tailor their instruction to combat those misconceptions, fulfilling the FCI authors' original intentions for its use. Furthermore, the IRC analysis can assist test designers to improve their assessments by identifying nonfunctioning distractors that can be replaced with distractors attractive to students at various ability levels.

  11. Sales Models For Many Items Using Attribute Data

    NARCIS (Netherlands)

    J.E.M. van Nierop; D. Fok (Dennis); Ph.H.B.F. Franses (Philip Hans)

    2002-01-01

    textabstractSales models are mainly used to analyze markets with a fairly small number of items, obtained after aggregating to the brand level. In practice one may require analyses at a more disaggregate level. For example, brand managers may be interested in a comparison across product attributes.

  12. 17 CFR 229.101 - (Item 101) Description of business.

    Science.gov (United States)

    2010-04-01

    ... this Item, the registrant should take into account both quantitative and qualitative factors such as... the information in quantitative terms may not appear significant to the registrant's business taken as... business. 229.101 Section 229.101 Commodity and Securities Exchanges SECURITIES AND EXCHANGE COMMISSION...

  13. Full-Information Item Bifactor Analysis of Graded Response Data

    Science.gov (United States)

    Gibbons, Robert D.; Bock, R. Darrell; Hedeker, Donald; Weiss, David J.; Segawa, Eisuke; Bhaumik, Dulal K.; Kupfer, David J.; Frank, Ellen; Grochocinski, Victoria J.; Stover, Angela

    2007-01-01

    A plausible factorial structure for many types of psychological and educational tests exhibits a general factor and one or more group or method factors. This structure can be represented by a bifactor model. The bifactor structure results from the constraint that each item has a nonzero loading on the primary dimension and, at most, one of the…

  14. An Alternative Three-Parameter Logistic Item Response Model.

    Science.gov (United States)

    Pashley, Peter J.

    Birnbaum's three-parameter logistic function has become a common basis for item response theory modeling, especially within situations where significant guessing behavior is evident. This model is formed through a linear transformation of the two-parameter logistic function in order to facilitate a lower asymptote. This paper discusses an…

  15. Exploring Crossing Differential Item Functioning by Gender in Mathematics Assessment

    Science.gov (United States)

    Ong, Yoke Mooi; Williams, Julian; Lamprianou, Iasonas

    2015-01-01

    The purpose of this article is to explore crossing differential item functioning (DIF) in a test drawn from a national examination of mathematics for 11-year-old pupils in England. An empirical dataset was analyzed to explore DIF by gender in a mathematics assessment. A two-step process involving the logistic regression (LR) procedure for…

  16. The 10-item Remembered Relationship with Parents (RRP10) scale

    DEFF Research Database (Denmark)

    Denollet, Johan; Smolderen, Kim G E; van den Broek, Krista C

    2007-01-01

    Dysfunctional parenting styles are associated with poor mental and physical health. The 10-item Remembered Relationship with Parents (RRP(10)) scale retrospectively assesses Alienation (dysfunctional communication and intimacy) and Control (overprotection by parents), with an emphasis...... on deficiencies in empathic parenting. We examined the 2-factor structure of the RRP(10) and its relationship with adult depression....

  17. 17 CFR 229.301 - (Item 301) Selected financial data.

    Science.gov (United States)

    2010-04-01

    ... the accounting principles used in its primary financial statements but in such case shall present this... accounting principles and Regulation S-X made pursuant to Rule 4-01 of Regulation S-X (§ 210.4-01 of this... 17 Commodity and Securities Exchanges 2 2010-04-01 2010-04-01 false (Item 301) Selected...

  18. Relationships between the Thurstone and Rasch Approaches to Item Scaling.

    Science.gov (United States)

    Andrich, David

    1978-01-01

    When the logistic function is substituted for the normal, Thurstone's Case V specialization of the law of comparative judgment for paired comparison responses gives an identical equation for the estimation of item scale values, as does the Rasch formulation for direct responses. Comparisons are made. (Author/CTM)

  19. WIPDash: Work Item and People Dashboard for Software Development Teams

    NARCIS (Netherlands)

    Jakobsen, M. R.; Fernandez, R.; Czerwinski, M.; Inkpen, K.; Kulyk, Olga Anatoliyivna; Robertson, G.G.

    2009-01-01

    We present WIPDash, a visualization for software development teams designed to increase group awareness of work items and code base activity. WIPDash was iteratively designed by working with two development teams, using interviews, observations, and focus groups, as well as sketches of the

  20. Chicano Spanish: Cross Hispanic Language Attitudes toward Specific Lexical Items.

    Science.gov (United States)

    Giron, Robert LeRoy

    A study was undertaken to gather attitudes of Spanish-speakers toward specific types of Chicano Spanish dialect lexical items. Reactions were randomly taken from 11 Latin American students who attended Southern Illinois University at Carbondale during the 1975 spring semester; 20 Mexican residents of Ciudad Juarez, Mexico, who attended English as…

  1. Teaching Physical Geography with Toys, Household Items, and Food

    Science.gov (United States)

    Carnahan, Laura; Pankratz, Mary Jo; Alberts, Heike

    2014-01-01

    While many college physical geography instructors already use a wide variety of creative teaching approaches in their classes, others have not yet been exposed to teaching with toys, household items, or food. The goal in this article is to present some ideas for teaching college-level physical geography (weather/climate and geomorphology) for…

  2. IRT-Estimated Reliability for Tests Containing Mixed Item Formats

    Science.gov (United States)

    Shu, Lianghua; Schwarz, Richard D.

    2014-01-01

    As a global measure of precision, item response theory (IRT) estimated reliability is derived for four coefficients (Cronbach's a, Feldt-Raju, stratified a, and marginal reliability). Models with different underlying assumptions concerning test-part similarity are discussed. A detailed computational example is presented for the targeted…

  3. Test Item Linguistic Complexity and Assessments for Deaf Students

    Science.gov (United States)

    Cawthon, Stephanie

    2011-01-01

    Linguistic complexity of test items is one test format element that has been studied in the context of struggling readers and their participation in paper-and-pencil tests. The present article presents findings from an exploratory study on the potential relationship between linguistic complexity and test performance for deaf readers. A total of 64…

  4. Openness/Intellect in a 50-Item IPIP Instrument

    Science.gov (United States)

    Ingram, Paul B.; Boan-Lenzo, Candace; Vuyk, M. Alexandra

    2013-01-01

    The M5-50 is a five-factor theory instrument based on the International Personality Item Pool (IPIP) that has had difficulties with the five-factor model fitting well. The openness domain's factor structure has a history of concerns that might relate to the connected yet distinguishable facets of openness/intellect. This study explored the…

  5. An Investigation of Item Fit Statistics for Mixed IRT Models

    Science.gov (United States)

    Chon, Kyong Hee

    2009-01-01

    The purpose of this study was to investigate procedures for assessing model fit of IRT models for mixed format data. In this study, various IRT model combinations were fitted to data containing both dichotomous and polytomous item responses, and the suitability of the chosen model mixtures was evaluated based on a number of model fit procedures.…

  6. Item difficulty scaling for WAIS-III picture arrangement.

    Science.gov (United States)

    Costello, Raymond M; Connolly, Sean G

    2005-06-01

    Only one study regarding the sequencing of items of the WAIS-III Picture Arrangement subtest was located in a search of published literature. That study of 50 alcohol abusers failed to demonstrate that the items are sequenced in the perfect order of difficulty as suggested by the test publisher. The current study was accomplished to replicate or refute the prior study and to extend findings into related matters. Two laboratories provided four archival samples of 100 cases. Only five items appear properly placed, with one (OPENS) especially misplaced. A new sequence is recommended so that clinicians can administer the test more efficiently and examine errors from a process approach to evaluation. Difficult items were not passed as often as expected by Hispanic respondents. This finding was considered an artifact related to archival convenience sampling and may not be representative as a general finding regarding Hispanic performance until experimental sampling techniques or proper statistical controls can be applied. Statistically controlling for IQ, through analysis of covariance, eliminated ethnicity effects on total score for the PA subtest.

  7. Multilevel Higher-Order Item Response Theory Models

    Science.gov (United States)

    Huang, Hung-Yu; Wang, Wen-Chung

    2014-01-01

    In the social sciences, latent traits often have a hierarchical structure, and data can be sampled from multiple levels. Both hierarchical latent traits and multilevel data can occur simultaneously. In this study, we developed a general class of item response theory models to accommodate both hierarchical latent traits and multilevel data. The…

  8. Sales Models For Many Items Using Attribute Data

    NARCIS (Netherlands)

    J.E.M. van Nierop; D. Fok (Dennis); Ph.H.B.F. Franses (Philip Hans)

    2002-01-01

    textabstractSales models are mainly used to analyze markets with a fairly small number of items, obtained after aggregating to the brand level. In practice one may require analyses at a more disaggregate level. For example, brand managers may be interested in a comparison across product attributes.

  9. A short tutorial on item response theory in rheumatology

    NARCIS (Netherlands)

    Siemons, L.; Krishnan, E.

    2014-01-01

    OBJECTIVES: The aim is to familiarise physicians and researchers with the most important concepts of item response theory (IRT) and with its usefulness for improving test administration and data collection in health care. Special attention is given to the versatility of its use within the rheumatic

  10. No evidence for an item limit in change detection.

    Directory of Open Access Journals (Sweden)

    Shaiyan Keshvari

    Full Text Available Change detection is a classic paradigm that has been used for decades to argue that working memory can hold no more than a fixed number of items ("item-limit models". Recent findings force us to consider the alternative view that working memory is limited by the precision in stimulus encoding, with mean precision decreasing with increasing set size ("continuous-resource models". Most previous studies that used the change detection paradigm have ignored effects of limited encoding precision by using highly discriminable stimuli and only large changes. We conducted two change detection experiments (orientation and color in which change magnitudes were drawn from a wide range, including small changes. In a rigorous comparison of five models, we found no evidence of an item limit. Instead, human change detection performance was best explained by a continuous-resource model in which encoding precision is variable across items and trials even at a given set size. This model accounts for comparison errors in a principled, probabilistic manner. Our findings sharply challenge the theoretical basis for most neural studies of working memory capacity.

  11. Modular Open-Source Software for Item Factor Analysis

    Science.gov (United States)

    Pritikin, Joshua N.; Hunter, Micheal D.; Boker, Steven M.

    2015-01-01

    This article introduces an item factor analysis (IFA) module for "OpenMx," a free, open-source, and modular statistical modeling package that runs within the R programming environment on GNU/Linux, Mac OS X, and Microsoft Windows. The IFA module offers a novel model specification language that is well suited to programmatic generation…

  12. A Scalable Set of ESL Reading Comprehension Items.

    Science.gov (United States)

    Perkins, Kyle

    Guttman implicational scaling techniques were used to identify a unidimensional set of English as a Second Language reading comprehension items. Data were analyzed from 202 students who sat for an institutional administration of the Test of English as a Foreign Language (TOEFL). The examinees who contributed to the scalable set had significantly…

  13. Modular Open-Source Software for Item Factor Analysis

    Science.gov (United States)

    Pritikin, Joshua N.; Hunter, Micheal D.; Boker, Steven M.

    2015-01-01

    This article introduces an item factor analysis (IFA) module for "OpenMx," a free, open-source, and modular statistical modeling package that runs within the R programming environment on GNU/Linux, Mac OS X, and Microsoft Windows. The IFA module offers a novel model specification language that is well suited to programmatic generation…

  14. Interpretation of differential item functioning analyses using external review

    DEFF Research Database (Denmark)

    Scott, Neil W; Fayers, Peter M; Aaronson, Neil K

    2010-01-01

    considered. Relatively few examples of blinded item reviews were identified, and these were mostly from educational studies. A case study using blinded bilingual reviewers alongside translation DIF analyses of a health-related quality of life instrument is described. Future researchers should consider...

  15. 17 CFR 229.1010 - (Item 1010) Financial statements.

    Science.gov (United States)

    2010-04-01

    ... share data, statements of cash flows, and comprehensive income required to be included in the company's... statements. 229.1010 Section 229.1010 Commodity and Securities Exchanges SECURITIES AND EXCHANGE COMMISSION....1010 (Item 1010) Financial statements. (a) Financial information. Furnish the following financial...

  16. Dichotomous decisions based on dichotomously scored items: a case study

    NARCIS (Netherlands)

    Mellenbergh, G.J.; Koppelaar, H.; Linden, van der W.J.

    1977-01-01

    In a course in elementary statistics for psychology students using criterion-referenced achievement tests, the total test score, based on dichotomously scored items, was used for classifying students into those who passed and those who failed. The score on a test is considered as depending on a late

  17. Ancient Item Spoilage Ritual Used in Nomadic Burial Rite

    Directory of Open Access Journals (Sweden)

    Beisenov Arman Z.

    2017-07-01

    Full Text Available The article considers the findings of items in ancient burials which were intentionally spoiled prior to deposition in graves. This tradition was widely spread both in terms of chronology and geography, and therefore cannot be attributed to any individual cultures or regions. The authors present new information on the ritual obtained during an investigation of Borsyk burial mound of the Middle Sarmatian period located in West Kazakhstan. The central grave of barrow 6 contained a heavily damaged bronze cauldron. The grave was looted in antiquity. Individual scattered bones of a human skeleton and minor gold foil adornments from the ceremonial dress of a nobleman were discovered in the grave. The authors suggest that the cauldron was intentionally deformed by the participants of an ancient mortuary and memorial ritual. According to the principal hypothesis concerning the essence of this ritual, spoilage of the items was related to the idea of assign the items with “different” and “transcendent” properties, which resulted from the necessity of burying the owner. Cauldrons played an important role in the life of steppe leaders. The authors assume a sacral nature of the use of cauldrons in the culture of steppe peoples associated with feasts, battles, and sacred hunting. Perhaps, there was a tradition of burying cauldrons together with their owners after spoiling the items in view of the concept of the other world and the role of a heroic leader therein.

  18. A Standardization Approach to Adjusting Pretest Item Statistics.

    Science.gov (United States)

    Chang, Shun-Wen; Hanson, Bradley A.; Harris, Deborah J.

    This study presents and evaluates a method of standardization that may be used by test practitioners to standardize classical item statistics when sample sizes are small. The effectiveness of this standardization approach was compared through simulation with the one-parameter logistic (1PL) and three parameter logistic (3PL) models based on the…

  19. Detecting Differential Item Functioning Using Logistic Regression Procedures.

    Science.gov (United States)

    Swaminathan, Hariharan; Rogers, H. Jane

    1990-01-01

    A logistic regression model for characterizing differential item functioning (DIF) between two groups is presented. A distinction is drawn between uniform and nonuniform DIF in terms of model parameters. A statistic for testing the hypotheses of no DIF is developed, and simulation studies compare it with the Mantel-Haenszel procedure. (Author/TJH)

  20. A Framework for Dimensionality Assessment for Multidimensional Item Response Models

    Science.gov (United States)

    Svetina, Dubravka; Levy, Roy

    2014-01-01

    A framework is introduced for considering dimensionality assessment procedures for multidimensional item response models. The framework characterizes procedures in terms of their confirmatory or exploratory approach, parametric or nonparametric assumptions, and applicability to dichotomous, polytomous, and missing data. Popular and emerging…

  1. Food items and general condition of Hyperopisus bebe occidentalis ...

    African Journals Online (AJOL)

    GREG

    Key words: Food items, Hyperopisus bebe occidentalis, Warri River, condition factor. INTRODUCTION ... as supplement in aquaculture (Malami et al., 2004). Sufficient ... length; hence, the formula was logarithmically transformed for the purpose of ..... River, Nigeria part 11: Seasonal trend in physiochemical limnology. Trop.

  2. 17 CFR 229.507 - (Item 507) Selling security holders.

    Science.gov (United States)

    2010-04-01

    ... holders. 229.507 Section 229.507 Commodity and Securities Exchanges SECURITIES AND EXCHANGE COMMISSION... Provisions § 229.507 (Item 507) Selling security holders. If any of the securities to be registered are to be offered for the account of security holders, name each such security holder, indicate the nature of...

  3. A Method for Severely Constrained Item Selection in Adaptive Testing.

    Science.gov (United States)

    Stocking, Martha L.; Swanson, Len

    1993-01-01

    A method is presented for incorporating a large number of constraints on adaptive item selection in the construction of computerized adaptive tests. The method, which emulates practices of expert test specialists, is illustrated for verbal and quantitative measures. Its foundation is application of a weighted deviations model and algorithm. (SLD)

  4. Professor-Student Rapport Scale: Six Items Predict Student Outcomes

    Science.gov (United States)

    Wilson, Janie H.; Ryan, Rebecca G.

    2013-01-01

    Rapport between students and teachers leads to numerous positive student outcomes, including attitudes toward the teacher and course, student motivation, and perceived learning. The recent development of a Professor-Student Rapport scale offers assessment of this construct. However, a Cronbach's [alpha] of 0.96 indicated item redundancy, and the…

  5. 17 CFR 229.1000 - (Item 1000) Definitions.

    Science.gov (United States)

    2010-04-01

    ... AND CONSERVATION ACT OF 1975-REGULATION S-K Mergers and Acquisitions (Regulation M-A) § 229.1000 (Item... as in § 240.13e-3(a)(3) of this chapter; (f) Subject company means the company or entity...

  6. Some new item selection criteria for adaptive testing

    NARCIS (Netherlands)

    Veerkamp, Wim J.J.; Berger, Martijn P.F.

    1994-01-01

    In this study some alternative item selection criteria for adaptive testing are proposed. These criteria take into account the uncertainty of the ability estimates. A general weighted information criterion is suggested of which the usual maximum information criterion and the suggested alternative cr

  7. A stochastic inventory model with stock dependent demand items

    Directory of Open Access Journals (Sweden)

    Lakdere Benkherouf

    2001-01-01

    Full Text Available In this paper, we propose a new continuous time stochastic inventory model for stock dependent demand items. We then formulate the problem of finding the optimal replenishment schedule that minimizes the total expected discounted costs over an infinite horizon as a Quasi-Variational Inequality (QVI problem. The QVI is shown to have a unique solution under some conditions.

  8. Selecting Test Item Types To Evaluate Library Skills.

    Science.gov (United States)

    Fagan, Jody Condit

    2002-01-01

    This article outlines the advantages and disadvantages of various question types in tests for library classes, including selected-response, constructed-response and alternative-response test items. It examines a test case in which students in a for-credit library course were given a take home quiz with search story problems. Sample "search story"…

  9. Test Item Linguistic Complexity and Assessments for Deaf Students

    Science.gov (United States)

    Cawthon, Stephanie

    2011-01-01

    Linguistic complexity of test items is one test format element that has been studied in the context of struggling readers and their participation in paper-and-pencil tests. The present article presents findings from an exploratory study on the potential relationship between linguistic complexity and test performance for deaf readers. A total of 64…

  10. Differential Item Functioning on Two Tests of EFL Proficiency.

    Science.gov (United States)

    Ryan, Katherine E.; Bachman, Lyle F.

    1992-01-01

    The extent to which items from the Test of English as a Foreign Language and the First Certificate in English function differently for test-takers of equal ability from different native language and curricular backgrounds was investigated. Results suggest a need for methods like logistic regression to examine nonuniform differential item…

  11. Modeling nonignorable missing data processes in item calibration

    NARCIS (Netherlands)

    Glas, Cees A.W.; Pimentel, Jonald L.

    2006-01-01

    In this report, it is shown that the problem of nonignorable missing data in the calibration phase for computerized adaptive testing can be handled by introducing an item response theory (IRT) model for the missing data indicator. In the first simulation study, it is shown that treating data with no

  12. Design Patterns for Digital Item Types in Higher Education

    Science.gov (United States)

    Draaijer, S.; Hartog, R. J. M.

    2007-01-01

    A set of design patterns for digital item types has been developed in response to challenges identified in various projects by teachers in higher education. The goal of the projects in question was to design and develop formative and summative tests, and to develop interactive learning material in the form of quizzes. The subject domains involved…

  13. Item Dependency in an Objective Structured Clinical Examination

    Science.gov (United States)

    Iramaneerat, Cherdsak; Myford, Carol M.; Yudkowsky, Rachel

    2006-01-01

    An Objective Structured Clinical Examination (OSCE) is an assessment approach employed in medical education, in which residents rotate through multiple stations of standardized clinical tasks to evaluate their clinical competence. Because items used to evaluate residents' performance in each OSCE station are linked to the same task and are rated…

  14. 26 CFR 301.6245-1T - Subchapter S items.

    Science.gov (United States)

    2010-04-01

    ... corporate property is a capital asset, section 1231 property, or inventory; whether an item is currently... research activities under section 41; and whether the corporation qualified for the credit for clinical...); (ii) The amount of money distributed to a shareholder; (iii) The fair market value of property...

  15. A Speeded Item Response Model with Gradual Process Change

    Science.gov (United States)

    Goegebeur, Yuri; De Boeck, Paul; Wollack, James A.; Cohen, Allan S.

    2008-01-01

    An item response theory model for dealing with test speededness is proposed. The model consists of two random processes, a problem solving process and a random guessing process, with the random guessing gradually taking over from the problem solving process. The involved change point and change rate are considered random parameters in order to…

  16. 41 CFR 101-26.301-1 - Similar items.

    Science.gov (United States)

    2010-07-01

    ... 41 Public Contracts and Property Management 2 2010-07-01 2010-07-01 true Similar items. 101-26.301-1 Section 101-26.301-1 Public Contracts and Property Management Federal Property Management Regulations System FEDERAL PROPERTY MANAGEMENT REGULATIONS SUPPLY AND PROCUREMENT 26-PROCUREMENT SOURCES...

  17. Computing replenishment cycle policy parameters for a perishable item

    NARCIS (Netherlands)

    Rossi, R.; Tarim, S.A.; Hnich, B.; Prestwich, S.

    2010-01-01

    In many industrial environments there is a significant class of problems for which the perishable nature of the inventory cannot be ignored in developing replenishment order plans. Food is the most salient example of a perishable inventory item. In this work, we consider the periodic-review,

  18. An Inventory Model for Slow Moving Items Subject to Obsolescence

    NARCIS (Netherlands)

    C. Pinçe (Çerağ); R. Dekker (Rommert)

    2009-01-01

    textabstractIn this paper, we consider a continuous review inventory system of a slow moving item for which the demand rate drops to a lower level at a pre-determined time. Inventory system is controlled according to one-for-one replenishment policy with fixed lead time. Adaptation to the lower

  19. Testing Linear Models for Ability Parameters in Item Response Models

    NARCIS (Netherlands)

    Glas, Cees A.W.; Hendrawan, Irene

    2005-01-01

    Methods for testing hypotheses concerning the regression parameters in linear models for the latent person parameters in item response models are presented. Three tests are outlined: A likelihood ratio test, a Lagrange multiplier test and a Wald test. The tests are derived in a marginal maximum like

  20. DIF Analysis for Pretest Items in Computer-Adaptive Testing.

    Science.gov (United States)

    Zwick, Rebecca; And Others

    A simulation study of methods of assessing differential item functioning (DIF) in computer-adaptive tests (CATs) was conducted by Zwick, Thayer, and Wingersky (in press, 1993). Results showed that modified versions of the Mantel-Haenszel and standardization methods work well with CAT data. DIF methods were also investigated for nonadaptive…

  1. A Comparison of Item Fit Statistics for Mixed IRT Models

    Science.gov (United States)

    Chon, Kyong Hee; Lee, Won-Chan; Dunbar, Stephen B.

    2010-01-01

    In this study we examined procedures for assessing model-data fit of item response theory (IRT) models for mixed format data. The model fit indices used in this study include PARSCALE's G[superscript 2], Orlando and Thissen's S-X[superscript 2] and S-G[superscript 2], and Stone's chi[superscript 2*] and G[superscript 2*]. To investigate the…

  2. Differential Item Functioning on the Graduate Management Admission Test.

    Science.gov (United States)

    O'Neill, Kathleen A.; And Others

    The purpose of this study was to identify differentially functioning items on operational administrations of the Graduate Management Admission Test (GMAT) through the use of the Mantel-Haenszel statistic. Retrospective analyses of data collected over 3 years are reported for black/white and female/male comparisons for the Verbal and Quantitative…

  3. 48 CFR 52.244-6 - Subcontracts for Commercial Items.

    Science.gov (United States)

    2010-10-01

    ... clauses in subcontracts for commercial items: (i) 52.203-13, Contractor Code of Business Ethics and... Recovery Act. (iii) 52.219-8, Utilization of Small Business Concerns (MAY 2004) (15 U.S.C. 637(d)(2) and (3... subcontracts to small business concerns) exceeds $650,000 ($1.5 million for construction of any public...

  4. Using SAS PROC MCMC for Item Response Theory Models

    Science.gov (United States)

    Ames, Allison J.; Samonte, Kelli

    2015-01-01

    Interest in using Bayesian methods for estimating item response theory models has grown at a remarkable rate in recent years. This attentiveness to Bayesian estimation has also inspired a growth in available software such as WinBUGS, R packages, BMIRT, MPLUS, and SAS PROC MCMC. This article intends to provide an accessible overview of Bayesian…

  5. Closed loop two-echelon repairable item systems

    NARCIS (Netherlands)

    Spanjers, L.; van Ommeren, Jan C.W.; Zijm, Willem H.M.; Liberopoulos, G.; Papadopoulos, C.T.; Tan, B.; MacGregor Smith, J.; Gershwin, S.B.

    2006-01-01

    In this paper we consider closed loop two-echelon epairable item systems with repair facilities both at a number of local service centers (called bases) and at a central location (the depot). The goal of the system is to maintain a number of production facilities (one at each base) in optimal

  6. Closed loop two-echelon repairable item systems

    NARCIS (Netherlands)

    Spanjers, L.; van Ommeren, Jan C.W.; Zijm, Willem H.M.

    In this paper we consider closed loop two-echelon repairable item systems with repair facilities both at a number of local service centers (called bases) and at a central location (the depot). The goal of the system is to maintain a number of production facilities (one at each base) in optimal

  7. The force within: Recommendations via gravitational attraction between items

    NARCIS (Netherlands)

    V. Kumar (Vikas); S. Bakhshi (Saeideh); L. Kennedy (Lyndon); D.A. Shamma (David)

    2017-01-01

    textabstractRecommendation systems rely on various definitions of similarities. These definitions while having numerous design factors in different domains help identify and recommend relevant content. For example, similarity between users, or items, are measured based on, but not limited to,

  8. Morphological Contributions to Adolescent Word Reading: An Item Response Approach

    Science.gov (United States)

    Goodwin, Amanda P.; Gilbert, Jennifer K.; Cho, Sun-Joo

    2013-01-01

    The current study uses a crossed random-effects item response model to simultaneously examine both reader and word characteristics and interactions between them that predict the reading of 39 morphologically complex words for 221 middle school students. Results suggest that a reader's ability to read a root word (e.g., "isolate") predicts that…

  9. Exposure Control Using Adaptive Multi-Stage Item Bundles.

    Science.gov (United States)

    Luecht, Richard M.

    This paper presents a multistage adaptive testing test development paradigm that promises to handle content balancing and other test development needs, psychometric reliability concerns, and item exposure. The bundled multistage adaptive testing (BMAT) framework is a modification of the computer-adaptive sequential testing framework introduced by…

  10. 17 CFR 229.1106 - (Item 1106) Depositors.

    Science.gov (United States)

    2010-04-01

    ... AND CONSERVATION ACT OF 1975-REGULATION S-K Asset-Backed Securities (Regulation AB) § 229.1106 (Item... continuing duties of the depositor after issuance of the asset-backed securities being registered regarding the asset-backed securities or the pool assets. ...

  11. 17 CFR 229.1117 - (Item 1117) Legal proceedings.

    Science.gov (United States)

    2010-04-01

    ... ENERGY POLICY AND CONSERVATION ACT OF 1975-REGULATION S-K Asset-Backed Securities (Regulation AB) § 229... 17 Commodity and Securities Exchanges 2 2010-04-01 2010-04-01 false (Item 1117) Legal proceedings. 229.1117 Section 229.1117 Commodity and Securities Exchanges SECURITIES AND EXCHANGE COMMISSION...

  12. 17 CFR 229.1108 - (Item 1108) Servicers.

    Science.gov (United States)

    2010-04-01

    ... AND CONSERVATION ACT OF 1975-REGULATION S-K Asset-Backed Securities (Regulation AB) § 229.1108 (Item... to holders of the asset-backed securities, performing work-outs or foreclosures, or other aspect of the servicing of the pool assets or the asset-backed securities upon which the performance of the pool...

  13. 17 CFR 229.1116 - (Item 1116) Tax matters.

    Science.gov (United States)

    2010-04-01

    ... AND CONSERVATION ACT OF 1975-REGULATION S-K Asset-Backed Securities (Regulation AB) § 229.1116 (Item... asset-backed securities transaction under federal income tax laws. (b) The material federal income tax consequences of purchasing, owning and selling the asset-backed securities. If any of the material federal...

  14. 17 CFR 229.1109 - (Item 1109) Trustees.

    Science.gov (United States)

    2010-04-01

    ... AND CONSERVATION ACT OF 1975-REGULATION S-K Asset-Backed Securities (Regulation AB) § 229.1109 (Item... experience serving as a trustee for asset-backed securities transactions involving similar pool assets, if applicable. (c) Describe the trustee's duties and responsibilities regarding the asset-backed securities...

  15. 17 CFR 229.1123 - (Item 1123) Servicer compliance statement.

    Science.gov (United States)

    2010-04-01

    ... 1934 AND ENERGY POLICY AND CONSERVATION ACT OF 1975-REGULATION S-K Asset-Backed Securities (Regulation... 17 Commodity and Securities Exchanges 2 2010-04-01 2010-04-01 false (Item 1123) Servicer compliance statement. 229.1123 Section 229.1123 Commodity and Securities Exchanges SECURITIES AND EXCHANGE...

  16. 17 CFR 229.1101 - (Item 1101) Definitions.

    Science.gov (United States)

    2010-04-01

    ... AND CONSERVATION ACT OF 1975-REGULATION S-K Asset-Backed Securities (Regulation AB) § 229.1101 (Item... information regarding the asset-backed securities being offered and the structure and basic parameters of the...) Factual information regarding the pool assets underlying the asset-backed securities, including...

  17. Analysis of Individual "Test Of Astronomy STandards" (TOAST) Item Responses

    Science.gov (United States)

    Slater, Stephanie J.; Schleigh, Sharon Price; Stork, Debra J.

    2015-01-01

    The development of valid and reliable strategies to efficiently determine the knowledge landscape of introductory astronomy college students is an effort of great interest to the astronomy education community. This study examines individual item response rates from a widely used conceptual understanding survey, the Test Of Astronomy Standards…

  18. DEMAND CATEGORISATION, FORECASTING, AND INVENTORY CONTROL FOR INTERMITTENT DEMAND ITEMS

    Directory of Open Access Journals (Sweden)

    E. Babiloni

    2012-01-01

    Full Text Available

    ENGLISH ABSTRACT: It is commonly assumed that intermittent demand appears randomly, with many periods without demand; but that when it does appear, it tends to be higher than unit size. Basic and well-known forecasting techniques and stock policies perform very poorly with intermittent demand, making new approaches necessary. To select the appropriate inventory management policy, it is important to understand the demand pattern for the items, especially when demand is intermittent. The use of a forecasting method designed for an intermittent demand pattern, such as Croston’s method, is required instead of a simpler and more common approach such as exponential smoothing. The starting point is to establish taxonomic rules to select efficiently the most appropriate forecasting and stock control policy to cope with thousands of items found in real environments. This paper contributes to the state of the art in: (i categorisation of the demand pattern; (ii methods to forecast intermittent demand; and (iii stock control methods for items with intermittent demand patterns. The paper first presents a structured literature review to introduce managers to the theoretical research about how to deal with intermittent demand items in both forecasting and stock control methods, and then it points out some research gaps for future development for the three topics.

    AFRIKAANSE OPSOMMING: Daar word algemeen aanvaar dat intermitterende vraag op toevalswyse voorkom, met verskeie periodes waar daar geen vraag is nie. Wanneer die vraag dan wel materialiseer, oorskry dit dikwels die eenheidsgrootte. Die bekende vooruitskattingstegnieke en voorraadbeleidstellings het min sukses waar intermitterende vraag voorkom, sodat nuwe benaderings nodig is om die problem aan te spreek. Om ‘n geskikte voorraadbestuur-beleid te selekteer, is dit noodsaaklik om die vraagpatroon van die items te verstaan, juis in gevalle van intermitterende patrone. Die gebruik van

  19. 77 FR 23500 - Notice of Intent To Repatriate Cultural Items: Milwaukee Public Museum, Milwaukee, WI

    Science.gov (United States)

    2012-04-19

    ... notice. History and Description of the Cultural Items The five cultural items are a water drum, a... National Park Service Notice of Intent To Repatriate Cultural Items: Milwaukee Public Museum, Milwaukee, WI... consultation with the appropriate Indian tribe, has determined that the cultural items meet the definition...

  20. Reliability and norms for the 10-item self-motivation inventory: The TIGER Study

    Science.gov (United States)

    The Self-Motivation Inventory (SMI) has been shown to be a predictor of exercise dropout. The original SMI of 40 items has been shortened to 10 items and the psychometric qualities of the 10-item SMI are not known. To estimate the reliability of a 10-item SMI and develop norms for an ethnically dive...

  1. Evaluating the Psychometric Characteristics of Generated Multiple-Choice Test Items

    Science.gov (United States)

    Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André

    2016-01-01

    Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…

  2. Evaluating the Psychometric Characteristics of Generated Multiple-Choice Test Items

    Science.gov (United States)

    Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André

    2016-01-01

    Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…

  3. Weighted Maximum-a-Posteriori Estimation in Tests Composed of Dichotomous and Polytomous Items

    Science.gov (United States)

    Sun, Shan-Shan; Tao, Jian; Chang, Hua-Hua; Shi, Ning-Zhong

    2012-01-01

    For mixed-type tests composed of dichotomous and polytomous items, polytomous items often yield more information than dichotomous items. To reflect the difference between the two types of items and to improve the precision of ability estimation, an adaptive weighted maximum-a-posteriori (WMAP) estimation is proposed. To evaluate the performance of…

  4. The Rasch Model and Missing Data, with an Emphasis on Tailoring Test Items.

    Science.gov (United States)

    de Gruijter, Dato N. M.

    Many applications of educational testing have a missing data aspect (MDA). This MDA is perhaps most pronounced in item banking, where each examinee responds to a different subtest of items from a large item pool and where both person and item parameter estimates are needed. The Rasch model is emphasized, and its non-parametric counterpart (the…

  5. A Comparison of Mantel-Haenszel Differential Item Functioning Parameters. LSAC Research Report Series.

    Science.gov (United States)

    Schnipke, Deborah L.; Roussos, Louis A.; Pashley, Peter J.

    Differential item functioning (DIF) analyses are conducted to investigate how items function in various subgroups. The Mantel-Haenszel (MH) DIF statistic is used at the Law School Admission Council and other testing companies. When item functioning can be well-described in terms of a one- or two-parameter logistic item response theory (IRT) model…

  6. A Framework for Examining the Utility of Technology-Enhanced Items

    Science.gov (United States)

    Russell, Michael

    2016-01-01

    Interest in and use of technology-enhanced items has increased over the past decade. Given the additional time required to administer many technology-enhanced items and the increased expense required to develop them, it is important for testing programs to consider the utility of technology-enhanced items. The Technology-Enhanced Item Utility…

  7. Polytomous latent scales for the investigation of the ordering of items

    NARCIS (Netherlands)

    Ligtvoet, R.; van der Ark, L.A.; Bergsma, W. P.; Sijtsma, K.

    2011-01-01

    We propose three latent scales within the framework of nonparametric item response theory for polytomously scored items. Latent scales are models that imply an invariant item ordering, meaning that the order of the items is the same for each measurement value on the latent scale. This ordering prope

  8. A Model-Free Diagnostic for Single-Peakedness of Item Responses Using Ordered Conditional Means

    Science.gov (United States)

    Polak, Marike; De Rooij, Mark; Heiser, Willem J.

    2012-01-01

    In this article we propose a model-free diagnostic for single-peakedness (unimodality) of item responses. Presuming a unidimensional unfolding scale and a given item ordering, we approximate item response functions of all items based on ordered conditional means (OCM). The proposed OCM methodology is based on Thurstone & Chave's (1929) "criterion…

  9. Detection of person misfit in computerized adaptive tests with polytomous items

    NARCIS (Netherlands)

    Krimpen-Stoop, van Edith M.L.A.; Meijer, Rob R.

    2002-01-01

    Item scores that do not fit an assumed item response theory model may cause the latent trait value to be inaccurately estimated. For a computerized adaptive test (CAT) using dichotomous items, several person-fit statistics for detecting mis.tting item score patterns have been proposed. Both for pape

  10. The Determination of Hierarchies among TOEFL Vocabulary and Reading Comprehension Items.

    Science.gov (United States)

    Perkins, Kyle; And Others

    A study was undertaken to identify the prerequisite relations (or hierarchies among the items) existing in the item responses of a sample of 86 foreign students who took the Test of English as a Foreign Language (TOEFL) vocabulary and reading comprehension test, Form 3JTF1. The form contains 30 vocabulary items and 30 reading comprehension items.…

  11. Designing a Virtual Item Bank Based on the Techniques of Image Processing

    Science.gov (United States)

    Liao, Wen-Wei; Ho, Rong-Guey

    2011-01-01

    One of the major weaknesses of the item exposure rates of figural items in Intelligence Quotient (IQ) tests lies in its inaccuracies. In this study, a new approach is proposed and a useful test tool known as the Virtual Item Bank (VIB) is introduced. The VIB combine Automatic Item Generation theory and image processing theory with the concepts of…

  12. Representation of Item Position in Immediate Serial Recall: Evidence from Intrusion Errors

    Science.gov (United States)

    Fischer-Baum, Simon; McCloskey, Michael

    2015-01-01

    In immediate serial recall, participants are asked to recall novel sequences of items in the correct order. Theories of the representations and processes required for this task differ in how order information is maintained; some have argued that order is represented through item-to-item associations, while others have argued that each item is…

  13. IRT Item Parameters and the Reliability and Validity of Pretest, Posttest, and Gain Scores

    Science.gov (United States)

    May, Kim; Jackson, Tameika S.

    2005-01-01

    The effect of different combinations of item response theory (IRT) item parameters (item difficulty, item discrimination, and the guessing probability) on the reliability and construct validity (correlation with the latent trait being measured) of pretest, posttest, and gain scores was analytically examined using the 3-parameter logistic (3PL)…

  14. An Effect Size Measure for Raju's Differential Functioning for Items and Tests

    Science.gov (United States)

    Wright, Keith D.; Oshima, T. C.

    2015-01-01

    This study established an effect size measure for differential functioning for items and tests' noncompensatory differential item functioning (NCDIF). The Mantel-Haenszel parameter served as the benchmark for developing NCDIF's effect size measure for reporting moderate and large differential item functioning in test items. The effect size of…

  15. A Model-Based Method for Content Validation of Automatically Generated Test Items

    Science.gov (United States)

    Zhang, Xinxin; Gierl, Mark

    2016-01-01

    The purpose of this study is to describe a methodology to recover the item model used to generate multiple-choice test items with a novel graph theory approach. Beginning with the generated test items and working backward to recover the original item model provides a model-based method for validating the content used to automatically generate test…

  16. Testing Three-Item Versions for Seven of Young's Maladaptive Schema

    Science.gov (United States)

    Blau, Gary; DiMino, John; Sheridan, Natalie; Pred, Robert S.; Beverly, Clyde; Chessler, Marcy

    2015-01-01

    The Young Schema Questionnaire (YSQ) in either long-form (205- item) or short-form (75-item or 90-item) versions has demonstrated its clinical usefulness for assessing early maladaptive schemas. However, even a 75 or 90-item "short form", particularly when combined with other measures, can represent a lengthy…

  17. Weighted Maximum-a-Posteriori Estimation in Tests Composed of Dichotomous and Polytomous Items

    Science.gov (United States)

    Sun, Shan-Shan; Tao, Jian; Chang, Hua-Hua; Shi, Ning-Zhong

    2012-01-01

    For mixed-type tests composed of dichotomous and polytomous items, polytomous items often yield more information than dichotomous items. To reflect the difference between the two types of items and to improve the precision of ability estimation, an adaptive weighted maximum-a-posteriori (WMAP) estimation is proposed. To evaluate the performance of…

  18. NUNI (New User and New Item) Problem for SRSs Using Content Aware Multimedia-Based Approach

    DEFF Research Database (Denmark)

    Chaudhary, Pankaj; Deshmukh, Aaradhana A.; Mihovska, Albena Dimitrova

    2015-01-01

    Recommendation systems suggest items and users of interest based on preferences of items or users and item or user attributes. In social media-based services of dynamic content (such as news, blog, video, movies, books, etc.), recommender systems face the problem of discovering new items, new use...

  19. A Simulation Study of Methods for Assessing Differential Item Functioning in Computer-Adaptive Tests.

    Science.gov (United States)

    Zwick, Rebecca; And Others

    Simulated data were used to investigate the performance of modified versions of the Mantel-Haenszel and standardization methods of differential item functioning (DIF) analysis in computer-adaptive tests (CATs). Each "examinee" received 25 items out of a 75-item pool. A three-parameter logistic item response model was assumed, and…

  20. Method of data mining including determining multidimensional coordinates of each item using a predetermined scalar similarity value for each item pair

    Science.gov (United States)

    Meyers, Charles E.; Davidson, George S.; Johnson, David K.; Hendrickson, Bruce A.; Wylie, Brian N.

    1999-01-01

    A method of data mining represents related items in a multidimensional space. Distance between items in the multidimensional space corresponds to the extent of relationship between the items. The user can select portions of the space to perceive. The user also can interact with and control the communication of the space, focusing attention on aspects of the space of most interest. The multidimensional spatial representation allows more ready comprehension of the structure of the relationships among the items.

  1. Neural measures reveal a fixed item limit in subitizing.

    Science.gov (United States)

    Ester, Edward F; Drew, Trafton; Klee, Daniel; Vogel, Edward K; Awh, Edward

    2012-05-23

    For centuries, it has been known that humans can rapidly and accurately enumerate small sets of items, a process referred to as subitizing. However, there is still active debate regarding the mechanisms that mediate this ability. For example, some have argued that subitizing reflects the operation of a fixed-capacity individuation mechanism that enables concurrent access to a small number of items. However, others have argued that subitizing reflects the operation of a continuous numerical estimation mechanism whose precision varies with numerosity in a manner consistent with Weber's law. Critically, quantitative models based on either of these predictions can provide a reasonable description of subitizing performance, making it difficult to discriminate between these alternatives solely on the basis of subjects' behavioral performance. Here, we attempted to discriminate between fixed-capacity and continuous estimation models of subitizing using neural measures. In two experiments, we recorded EEGs while subjects performed a demanding subitizing task and examined set-size-dependent changes in a neurophysiological marker of visual selection (the N2pc event-related potential component) evoked by an array of to-be-enumerated items. In both experiments, N2pc amplitudes increased monotonically within the subitizing range before reaching an asymptotic limit at approximately three items. Moreover, inter-participant differences in the location of this asymptote were strongly predictive of behavioral estimates of subitizing span derived from a fixed-capacity model. Thus, neural activity linked with subitizing ability shows evidence of an early and discrete limit in the number of items that can be concurrently apprehended, supporting a fixed-capacity model of this process.

  2. Is Bloom's Taxonomy reflected in the response pattern to MCQ items?

    Science.gov (United States)

    Huxham, G J; Naeraa, N

    1980-01-01

    The purpose of this study was to find out whether taxonomic classification of MCQ items reflected differences in student behaviour. The data from one of this University's official open-book exams, in which students answer sixty MCQ items distributed over twelve content-areas of physiology were examined. The responses from all 153 candidates were then subjected to factor analysis. Analysis of individual item scores was unrewarding. Analysis of scores for item-groups based on taxonomy and content resulted in the identification of three factors, which carried predominant loadings from recall or look-up items, interpretation items, and problem-solving items, respectively.

  3. Validity and measurement precision of the PROMIS physical function item bank and a content validity-driven 20-item short form in rheumatoid arthritis compared with traditional measures

    NARCIS (Netherlands)

    Voshaar, M.A.; Klooster, P.M. ten; Glas, C.A.; Vonkeman, H.E.; Taal, E.; Krishnan, E.; Bernelot Moens, H.J.; Boers, M.; Terwee, C.B.; Riel, P.L.C.M. van; Laar, M.A. van der

    2015-01-01

    OBJECTIVE: To evaluate the content validity and measurement properties of the Patient-Reported Outcome Measurement Information System (PROMIS) physical function item bank and a 20-item short form in patients with RA in comparison with the HAQ disability index (HAQ-DI) and 36-item Short Form Health S

  4. Work ability as prognostic risk marker of disability pension : Single-item work ability score versus multi-item work ability index

    NARCIS (Netherlands)

    Roelen, C.A.M.; Rhenen, van W.; Groothoff, J.W.; Klink, van der J.J.L.; Twisk, W.R.; Heymans, M.W.

    2014-01-01

    Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP.

  5. The Psychological Effect of Errors in Standardized Language Test Items on EFL Students' Responses to the Following Item

    Science.gov (United States)

    Khaksefidi, Saman

    2017-01-01

    This study investigates the psychological effect of a wrong question with wrong items on answering to the next question in a test of structure. Forty students selected through stratified random sampling are given 15 questions of a standardized test namely a TOEFL structure test in which questions number 7 and number 11 are wrong and their answers…

  6. A Comparison of Item Parameter Standard Error Estimation Procedures for Unidimensional and Multidimensional Item Response Theory Modeling

    Science.gov (United States)

    Paek, Insu; Cai, Li

    2014-01-01

    The present study was motivated by the recognition that standard errors (SEs) of item response theory (IRT) model parameters are often of immediate interest to practitioners and that there is currently a lack of comparative research on different SE (or error variance-covariance matrix) estimation procedures. The present study investigated item…

  7. International Semiotics: Item Difficulty and the Complexity of Science Item Illustrations in the PISA-2009 International Test Comparison

    Science.gov (United States)

    Solano-Flores, Guillermo; Wang, Chao; Shade, Chelsey

    2016-01-01

    We examined multimodality (the representation of information in multiple semiotic modes) in the context of international test comparisons. Using Program of International Student Assessment (PISA)-2009 data, we examined the correlation of the difficulty of science items and the complexity of their illustrations. We observed statistically…

  8. Estimating Non-Normal Latent Trait Distributions within Item Response Theory Using True and Estimated Item Parameters

    Science.gov (United States)

    Sass, D. A.; Schmitt, T. A.; Walker, C. M.

    2008-01-01

    Item response theory (IRT) procedures have been used extensively to study normal latent trait distributions and have been shown to perform well; however, less is known concerning the performance of IRT with non-normal latent trait distributions. This study investigated the degree of latent trait estimation error under normal and non-normal…

  9. A Comparison of Item Parameter Standard Error Estimation Procedures for Unidimensional and Multidimensional Item Response Theory Modeling

    Science.gov (United States)

    Paek, Insu; Cai, Li

    2014-01-01

    The present study was motivated by the recognition that standard errors (SEs) of item response theory (IRT) model parameters are often of immediate interest to practitioners and that there is currently a lack of comparative research on different SE (or error variance-covariance matrix) estimation procedures. The present study investigated item…

  10. The Relationship between Item Context Characteristics and Student Performance: The Case of the 2006 and 2009 PISA Science Items

    Science.gov (United States)

    Ruiz-Primo, Maria Araceli; Li, Min

    2015-01-01

    Background: A long-standing premise in test design is that contextualizing test items makes them concrete, less demanding, and more conducive to determining whether students can apply or transfer their knowledge. Purpose: We assert that despite decades of study and experience, much remains to be learned about how to construct effective and fair…

  11. Estimating Non-Normal Latent Trait Distributions within Item Response Theory Using True and Estimated Item Parameters

    Science.gov (United States)

    Sass, D. A.; Schmitt, T. A.; Walker, C. M.

    2008-01-01

    Item response theory (IRT) procedures have been used extensively to study normal latent trait distributions and have been shown to perform well; however, less is known concerning the performance of IRT with non-normal latent trait distributions. This study investigated the degree of latent trait estimation error under normal and non-normal…

  12. Net and Global Differential Item Functioning in PISA Polytomously Scored Science Items: Application of the Differential Step Functioning Framework

    Science.gov (United States)

    Akour, Mutasem; Sabah, Saed; Hammouri, Hind

    2015-01-01

    The purpose of this study was to apply two types of Differential Item Functioning (DIF), net and global DIF, as well as the framework of Differential Step Functioning (DSF) to real testing data to investigate measurement invariance related to test language. Data from the Program for International Student Assessment (PISA)-2006 polytomously scored…

  13. Interim Report of Field Test of Expedient Pavement Repairs (Test Items 1-15).

    Science.gov (United States)

    1980-03-01

    OF FIELD TEST OFEXPEDIENT PAVEMENT REPAIRS ( TEST ITEMS 1-15). RAYMONDS.4OLLINGS lWGINE E :AH1 DIVISION .I MARŜ 6 // il) JNTEIM REPUT TiJUL1077...Pavement Repairs ( Test Items 1-15) 6 PERFORMING ORG. REPORT NUMBER 7. AUTHOR(&) S. CONTRACT OR GRANT NUMBER(&) Raymond S. Rollings 9. PERFORMING...21 Item 14 Test Results .................... 33 22 Density Results, Item 15....................34 23 Summary of Test Items ......................37 24

  14. Relevance of Item Analysis in Standardizing an Achievement Test in Teaching of Physical Science in B.Ed Syllabus

    Science.gov (United States)

    Marie, S. Maria Josephine Arokia; Edannur, Sreekala

    2015-01-01

    This paper focused on the analysis of test items constructed in the paper of teaching Physical Science for B.Ed. class. It involved the analysis of difficulty level and discrimination power of each test item. Item analysis allows selecting or omitting items from the test, but more importantly item analysis is a tool to help the item writer improve…

  15. The Effects of Goal Relevance and Perceptual Features on Emotional Items and Associative Memory

    Directory of Open Access Journals (Sweden)

    Wei B. Mao

    2017-07-01

    Full Text Available Showing an emotional item in a neutral background scene often leads to enhanced memory for the emotional item and impaired associative memory for background details. Meanwhile, both top–down goal relevance and bottom–up perceptual features played important roles in memory binding. We conducted two experiments and aimed to further examine the effects of goal relevance and perceptual features on emotional items and associative memory. By manipulating goal relevance (asking participants to categorize only each item image as living or non-living or to categorize each whole composite picture consisted of item image and background scene as natural scene or manufactured scene and perceptual features (controlling visual contrast and visual familiarity in two experiments, we found that both high goal relevance and salient perceptual features (high salience of items vs. high familiarity of items could promote emotional item memory, but they had different effects on associative memory for emotional items and neutral backgrounds. Specifically, high goal relevance and high perceptual-salience of items could jointly impair the associative memory for emotional items and neutral backgrounds, while the effect of item familiarity on associative memory for emotional items would be modulated by goal relevance. High familiarity of items could increase associative memory for negative items and neutral backgrounds only in the low goal relevance condition. These findings suggest the effect of emotion on associative memory is not only related to attentional capture elicited by emotion, but also can be affected by goal relevance and perceptual features of stimulus.

  16. Measuring somatic symptoms with the CES-D to assess depression in cancer patients after treatment : Comparison among patients with oral/oropharyngeal, gynecological, colorectal, and breast cancer

    NARCIS (Netherlands)

    van Wilgen, C.P.; Dijkstra, P.U.; Stewart, R.E.; Ranchor, A.V.; Roodenburg, J.L.N.

    2006-01-01

    There is a high prevalence of depression after cancer treatment. In the literature, several authors have raised questions about assessing somatic symptoms to explore depression after cancer treatment. These somatic sequelae are a consequence of cancer treatment and should cause higher depression

  17. Cross-language measurement equivalence of the Center for Epidemiologic Studies Depression (CES-D) scale in systemic sclerosis: a comparison of Canadian and Dutch patients

    NARCIS (Netherlands)

    Kwakkenbos, L.; Arthurs, E.; Hoogen, F.H.J. van den; Hudson, M.; Lankveld, W.G. van; Baron, M.; Ende, C.H.M. van den; Thombs, B.D.

    2013-01-01

    OBJECTIVES: Increasingly, medical research involves patients who complete outcomes in different languages. This occurs in countries with more than one common language, such as Canada (French/English) or the United States (Spanish/English), as well as in international multi-centre collaborations, whi

  18. Measuring somatic symptoms with the CES-D to assess depression in cancer patients after treatment : Comparison among patients with oral/oropharyngeal, gynecological, colorectal, and breast cancer

    NARCIS (Netherlands)

    van Wilgen, C.P.; Dijkstra, P.U.; Stewart, R.E.; Ranchor, A.V.; Roodenburg, J.L.N.

    2006-01-01

    There is a high prevalence of depression after cancer treatment. In the literature, several authors have raised questions about assessing somatic symptoms to explore depression after cancer treatment. These somatic sequelae are a consequence of cancer treatment and should cause higher depression rat

  19. Method effects: the problem with negatively versus positively keyed items.

    Science.gov (United States)

    Lindwall, Magnus; Barkoukis, Vassilis; Grano, Caterina; Lucidi, Fabio; Raudsepp, Lennart; Liukkonen, Jarmo; Thøgersen-Ntoumani, Cecilie

    2012-01-01

    Using confirmatory factor analyses, we examined method effects on Rosenberg's Self-Esteem Scale (RSES; Rosenberg, 1965) in a sample of older European adults. Nine hundred forty nine community-dwelling adults 60 years of age or older from 5 European countries completed the RSES as well as measures of depression and life satisfaction. The 2 models that had an acceptable fit with the data included method effects. The method effects were associated with both positively and negatively worded items. Method effects models were invariant across gender and age, but not across countries. Both depression and life satisfaction predicted method effects. Individuals with higher depression scores and lower life satisfaction scores were more likely to endorse negatively phrased items.

  20. An Efficient Algorithm for Mining Maximal Frequent Item Sets

    Directory of Open Access Journals (Sweden)

    A. M.J.M.Z. Rahman

    2008-01-01

    Full Text Available Problem Statement: In today's life, the mining of frequent patterns is a basic problem in data mining applications. The algorithms which are used to generate these frequent patterns must perform efficiently. The objective was to propose an effective algorithm which generates frequent patterns in less time. Approach: We proposed an algorithm which was based on hashing technique and combines a vertical tidset representation of the database with effective pruning mechanisms. It removes all the non-maximal frequent item-sets to get exact set of MFI directly. It worked efficiently when the number of item-sets and tid-sets is more. Results: The performance of our algorithm had been compared with recently developed MAFIA algorithm and the results show how our algorithm gives better performance. Conclusions: Hence, the proposed algorithm performs effectively and generates frequent patterns faster.

  1. Full Information Item Factor Analysis of the FCI

    Science.gov (United States)

    Hagedorn, Eric

    2010-02-01

    Traditional factor analytical methods, principal factors or principal components analysis, are inappropriate techniques for analyzing dichotomously scored responses to standardized tests or concept inventories because they lead to artifactual factors often referred to as ``difficulty factors.'' Full information item factor analysis (Bock, Gibbons and Muraki, 1988) based on Thurstone's multiple factor model and calculated using marginal maximum likelihood estimation, is an appropriate technique for such analyses. Force Concept Inventory (Hestenes, Wells and Swackhamer, 1992) data from 1582 university students completing an introductory physics course, was analyzed using the full information item factor analysis software TESTFACT v. 4. Analyzing the statistical significance of successive factors added to the model, using chi-squared statistics, led to a six factor model interpretable in terms of the conceptual dimensions of the FCI. )

  2. Functionally unidimensional item response models for multivariate binary data

    DEFF Research Database (Denmark)

    Ip, Edward; Molenberghs, Geert; Chen, Shyh-Huei;

    2013-01-01

    The problem of fitting unidimensional item response models to potentially multidimensional data has been extensively studied. The focus of this article is on response data that have a strong dimension but also contain minor nuisance dimensions. Fitting a unidimensional model to such multidimensio......The problem of fitting unidimensional item response models to potentially multidimensional data has been extensively studied. The focus of this article is on response data that have a strong dimension but also contain minor nuisance dimensions. Fitting a unidimensional model...... to such multidimensional data is believed to result in ability estimates that represent a combination of the major and minor dimensions. We conjecture that the underlying dimension for the fitted unidimensional model, which we call the functional dimension, represents a nonlinear projection. In this article we investigate...... tool. An example regarding a construct of desire for physical competency is used to illustrate the functional unidimensional approach....

  3. Optimal pricing and marketing planning for deteriorating items

    Science.gov (United States)

    Moosavi Tabatabaei, Seyed Reza; Sadjadi, Seyed Jafar; Makui, Ahmad

    2017-01-01

    Optimal pricing and marketing planning plays an essential role in production decisions on deteriorating items. This paper presents a mathematical model for a three-level supply chain, which includes one producer, one distributor and one retailer. The proposed study considers the production of a deteriorating item where demand is influenced by price, marketing expenditure, quality of product and after-sales service expenditures. The proposed model is formulated as a geometric programming with 5 degrees of difficulty and the problem is solved using the recent advances in optimization techniques. The study is supported by several numerical examples and sensitivity analysis is performed to analyze the effects of the changes in different parameters on the optimal solution. The preliminary results indicate that with the change in parameters influencing on demand, inventory holding, inventory deteriorating and set-up costs change and also significantly affect total revenue. PMID:28306750

  4. Marginal Maximum Likelihood Estimation of Item Response Models in R

    Directory of Open Access Journals (Sweden)

    Matthew S. Johnson

    2007-02-01

    Full Text Available Item response theory (IRT models are a class of statistical models used by researchers to describe the response behaviors of individuals to a set of categorically scored items. The most common IRT models can be classified as generalized linear fixed- and/or mixed-effect models. Although IRT models appear most often in the psychological testing literature, researchers in other fields have successfully utilized IRT-like models in a wide variety of applications. This paper discusses the three major methods of estimation in IRT and develops R functions utilizing the built-in capabilities of the R environment to find the marginal maximum likelihood estimates of the generalized partial credit model. The currently available R packages ltm is also discussed.

  5. Finding Recently Frequent Items over Online Data Streams

    Institute of Scientific and Technical Information of China (English)

    YIN Zhi-wu; HUANG Shang-teng

    2006-01-01

    In this paper, a new algorithm HCOUNT + is proposed to find frequent items over data stream based on the HCOUNT algorithm. The new algorithm adopts aided measures to improve the precision of HCOUNT greatly. In addition,HCOUNT + is introduced to time critical applications and a novel sliding windows-based algorithm SL-HCOUNT + is proposed to mine the most frequent items occurring recently.This algorithm uses limited memory (nB · (1 +α) · e/ε·In(-M/lnρ)(α<1) counters), requires constant processing time per packet (only (1+α) · ln(-M/lnρ(α<1)) counters are updated), makes only one pass over the streaming data,and is shown to work well in the experimental results.

  6. Item Nonresponse in Face-to-Face Interviews with Children

    Directory of Open Access Journals (Sweden)

    Haunberger Sigrid

    2014-09-01

    Full Text Available This study examined item nonresponse and its respondent and interviewer correlates by means of a population-based, panel survey of children aged 8 to 11 who were surveyed using standardised, face-to-face interviews. Using multilevel, logistic analyses with cross-level interactions, this article aims to examine which effects of item nonresponse are subject to children as respondents or to the interviewers and the interview setting. Depending on the type of question, we found different effects for respondent and interviewer variables, as well as interaction effects between child age/interviewer age as well as child gender/interviewer gender. However, interviewer variance is for the most part not significant.

  7. SERENITY in e-Business and Smart Item Scenarios

    Science.gov (United States)

    Benameur, Azzedine; Khoury, Paul El; Seguran, Magali; Sinha, Smriti Kumar

    SERENITY Artefacts, like Class, Patterns, Implementations and Executable Components for Security & Dependability (S&D) in addition to Serenity Runtime Framework (SRF) are discussed in previous chapters. How to integrate these artefacts with applications in Serenity approach is discussed here with two scenarios. The e-Business scenario is a standard loan origination process in a bank. The Smart Item scenario is an Ambient intelligence case study where we take advantage of Smart Items to provide an electronic healthcare infrastructure for remote healthcare assistance. In both cases, we detail how the prototype implementations of the scenarios select proper executable components through Serenity Runtime Framework and then demonstrate how these executable components of the S&D Patterns are deployed.

  8. Optimal pricing and marketing planning for deteriorating items.

    Science.gov (United States)

    Moosavi Tabatabaei, Seyed Reza; Sadjadi, Seyed Jafar; Makui, Ahmad

    2017-01-01

    Optimal pricing and marketing planning plays an essential role in production decisions on deteriorating items. This paper presents a mathematical model for a three-level supply chain, which includes one producer, one distributor and one retailer. The proposed study considers the production of a deteriorating item where demand is influenced by price, marketing expenditure, quality of product and after-sales service expenditures. The proposed model is formulated as a geometric programming with 5 degrees of difficulty and the problem is solved using the recent advances in optimization techniques. The study is supported by several numerical examples and sensitivity analysis is performed to analyze the effects of the changes in different parameters on the optimal solution. The preliminary results indicate that with the change in parameters influencing on demand, inventory holding, inventory deteriorating and set-up costs change and also significantly affect total revenue.

  9. On finding similar items in a stream of transactions

    DEFF Research Database (Denmark)

    Campagna, Andrea; Pagh, Rasmus

    2010-01-01

    While there has been a lot of work on finding frequent itemsets in transaction data streams, none of these solve the problem of finding similar pairs according to standard similarity measures. This paper is a first attempt at dealing with this, arguably more important, problem. We start out......(\\min\\{mb,n^k,(mb/\\varphi)^k\\})$ bits, where $mb$ is the number of items in the stream so far, $n$ is the number of distinct items and $\\varphi$ is a support threshold. To achieve any non-trivial space upper bound we must thus abandon a worst-case assumption on the data stream. We work under the model that the transactions come...... in random order, and show that surprisingly, not only is small-space similarity mining possible for the most common similarity measures, but the mining accuracy {\\em improves\\/} with the length of the stream for any fixed support threshold....

  10. Indirect associations between multiple items and a mining algorithm

    Institute of Scientific and Technical Information of China (English)

    Ni Min; Xu Xiaofei; Deng Shengchun

    2005-01-01

    Indirect association is a high level relationship between items and frequent itemsets in data. Current research approaches on indirect association mining are limited to indirect association between itempairs, which will discovertoo many rules from dataset. A formal definition of indirect association between multiple items is presented, along with an algorithm, SET-NIA,for mining this kind of indirect associations based on anti-monotonicity of indirect associations and frequent itempair support matrix. While the found rules contain same information as compared to the rules found by indirect association between itempairs mining algorithms, this notion brings space-saving in storage ofthe rules as well as superiority for human to understand and apply the rules. Experiments conducted on two real-word datasets show that SET-NIA can effectively find fewer rules than existing algorithms which mine indirect association between itempairs, the experimental results also prove that SET-NIA has better performance than existing algorithms.

  11. Pricing policy for declining demand using item preservation technology.

    Science.gov (United States)

    Khedlekar, Uttam Kumar; Shukla, Diwakar; Namdeo, Anubhav

    2016-01-01

    We have designed an inventory model for seasonal products in which deterioration can be controlled by item preservation technology investment. Demand for the product is considered price sensitive and decreases linearly. This study has shown that the profit is a concave function of optimal selling price, replenishment time and preservation cost parameter. We simultaneously determined the optimal selling price of the product, the replenishment cycle and the cost of item preservation technology. Additionally, this study has shown that there exists an optimal selling price and optimal preservation investment to maximize the profit for every business set-up. Finally, the model is illustrated by numerical examples and sensitive analysis of the optimal solution with respect to major parameters.

  12. EOQ Model for Time-Deteriorating Items Using Penalty cost

    Directory of Open Access Journals (Sweden)

    Meenakshi Srivastava

    2009-01-01

    Full Text Available In inventory, the utility of the deteriorating items decreases with time. The degree ofdeterioration of product utility can be treated as penalty cost in the inventory replenishmentsystem. In this paper, we present EOQ model for those perishable products, which do notdeteriorate for some period of time and after that time they continuously deteriorate with time andloose their importance. This loss can be incurred as penalty cost to the wholesaler / retailer. Theprime focus of our paper is to develop the EOQ model for time-deteriorating items using penaltycost with finite and infinite production rate. For simplicity, linear and exponential penalty costfunctions have been considered as a measurement of the utility of the product. The theoreticalexpressions are obtained for optimum inventory level and cycle time. All the theoreticaldevelopments are numerically justified.

  13. Adult Attachment Ratings (AAR): an item response theory analysis.

    Science.gov (United States)

    Pilkonis, Paul A; Kim, Yookyung; Yu, Lan; Morse, Jennifer Q

    2014-01-01

    The Adult Attachment Ratings (AAR) include 3 scales for anxious, ambivalent attachment (excessive dependency, interpersonal ambivalence, and compulsive care-giving), 3 for avoidant attachment (rigid self-control, defensive separation, and emotional detachment), and 1 for secure attachment. The scales include items (ranging from 6-16 in their original form) scored by raters using a 3-point format (0 = absent, 1 = present, and 2 = strongly present) and summed to produce a total score. Item response theory (IRT) analyses were conducted with data from 414 participants recruited from psychiatric outpatient, medical, and community settings to identify the most informative items from each scale. The IRT results allowed us to shorten the scales to 5-item versions that are more precise and easier to rate because of their brevity. In general, the effective range of measurement for the scales was 0 to +2 SDs for each of the attachment constructs; that is, from average to high levels of attachment problems. Evidence for convergent and discriminant validity of the scales was investigated by comparing them with the Experiences of Close Relationships-Revised (ECR-R) scale and the Kobak Attachment Q-sort. The best consensus among self-reports on the ECR-R, informant ratings on the ECR-R, and expert judgments on the Q-sort and the AAR emerged for anxious, ambivalent attachment. Given the good psychometric characteristics of the scale for secure attachment, however, this measure alone might provide a simple alternative to more elaborate procedures for some measurement purposes. Conversion tables are provided for the 7 scales to facilitate transformation from raw scores to IRT-calibrated (theta) scores.

  14. Small-Item Vapor Test Method, FY11 Release

    Science.gov (United States)

    2012-07-01

    exposed. sessile drop : A liquid droplet that is firmly attached to a surface. If the droplet significantly spreads across the surface, it is better...following information regarding the observations. o Written description of applied drops as they appeared for each sample (e.g., sessile , spread). o... techniques for this method. The airflow and air volume are key variables required to assess risk. Residual agent: Because full-item extraction

  15. Vending Machines: A Narrative Review of Factors Influencing Items Purchased.

    Science.gov (United States)

    Hua, Sophia V; Ickovics, Jeannette R

    2016-10-01

    Vending machines are a ubiquitous part of our food environments. Unfortunately, items found in vending machines tend to be processed foods and beverages high in salt, sugar, and/or fat. The purpose of this review is to describe intervention and case studies designed to promote healthier vending purchases by consumers and identify which manipulations are most effective. All studies analyzed were intervention or case studies that manipulated vending machines and analyzed sales or revenue data. This literature review is limited to studies conducted in the United States within the past 2 decades (ie, 1994 to 2015), regardless of study population or setting. Ten articles met these criteria based on a search conducted using PubMed. Study manipulations included price changes, increase in healthier items, changes to the advertisements wrapped around vending machines, and promotional signs such as a stoplight system to indicate healthfulness of items and to remind consumers to make healthy choices. Overall, seven studies had manipulations that resulted in statistically significant positive changes in purchasing behavior. Two studies used manipulations that did not influence consumer behavior, and one study was equivocal. Although there was no intervention pattern that ensured changes in purchasing, price reductions were most effective overall. Revenue from vending sales did not change substantially regardless of intervention, which will be important to foster initiation and sustainability of healthier vending. Future research should identify price changes that would balance healthier choices and revenue as well as better marketing to promote purchase of healthier items. Copyright © 2016 Academy of Nutrition and Dietetics. Published by Elsevier Inc. All rights reserved.

  16. Deferred Item and Vehicle Routing within Integrated Networks

    OpenAIRE

    2002-01-01

    This paper studies the possible integration of long-haul operations by transportation mode and service level (defined by guaranteed delivery time) for package delivery carriers. Specifically, we consider the allocation of deferred items to excess capacity on alternative modes in ways that allow all transportation modes to be utilized better. Model formulation and solution techniques are discussed. The solution techniques presented produce efficient solutions for large-scale problem instances....

  17. Development of multiple choice items for reading comprehension assessment

    OpenAIRE

    Lance, Burrows

    2010-01-01

    Creating reading comprehension assessment instruments remains a problem for many EFL educators and researchers, alike. This paper will first provide an overview of literature pertaining to the various facets that make up the concept of reading comprehension. In addition, the cognitive processing model framework proposed by Embretson and Wetzel, will be discussed in detail. From the basis of this model, an explanation of how to create reading comprehension multiple-choice items of varying diff...

  18. Designing multiple-choice test items at higher cognitive levels.

    Science.gov (United States)

    Su, Whei Ming; Osisek, Paul J; Montgomery, Cynthia; Pellar, Suzanne

    2009-01-01

    In the midst of a nursing faculty shortage, many academic institutions hire clinicians who are not formally prepared for an academic role. These novice faculty face an immediate need to develop teaching skills. One area in particular is test construction. To address this need, the authors describe how faculty from one course designed multiple-choice test items at higher cognitive levels and simultaneously achieved congruence with critical-thinking learning objectives defined by the course.

  19. Development and validation of 26-item dysfunctional attitude scale.

    Science.gov (United States)

    Ebrahimi, Amrollah; Samouei, Rahele; Mousavii, Sayyed Ghafour; Bornamanesh, Ali Reza

    2013-06-01

    Dysfunctional Attitude Scale is one of the most common instruments used to assess cognitive vulnerability. This study aimed to develop and validate a short form of Dysfunctional Attitude Scale appropriate for an Iranian clinical population. Participants were 160 psychiatric patients from medical centers affiliated with Isfahan Medical University, as well as 160 non-patients. Research instruments were clinical interviews based on the Diagnostic and Statistical Manual-IV-TR, Dysfunctional Attitude Scale and General Heath Questionnaire (GHQ-28). Data was analyzed using multicorrelation calculations and factor analysis. Based on the results of factor analysis and item-total correlation, 14 items were judged candidates for omission. Analysis of the 26-item Dysfunctional Attitude Scale (DAS-26) revealed a Cronbach's alpha of 0.92. Evidence for the concurrent criterion validity was obtained through calculating the correlation between the Dysfunctional Attitude Scale and psychiatric diagnosis (r = 0.55), GHQ -28 (r = 0.56) and somatization, anxiety, social dysfunction, and depression subscales (0.45,0.53,0.48, and 0.57, respectively). Factor analysis deemed a four-factor structure the best. The factors were labeled as success-perfectionism, need for approval, need for satisfying others, and vulnerability-performance evaluation. The results showed that the Iranian version of the Dysfunctional Attitude Scale (DAS-26) bears satisfactory psychometric properties suggesting that this cognitive instrument is appropriate for use in an Iranian cultural context. Copyright © 2012 Wiley Publishing Asia Pty Ltd.

  20. Introduction to bifactor polytomous item response theory analysis.

    Science.gov (United States)

    Toland, Michael D; Sulis, Isabella; Giambona, Francesca; Porcu, Mariano; Campbell, Jonathan M

    2017-02-01

    A bifactor item response theory model can be used to aid in the interpretation of the dimensionality of a multifaceted questionnaire that assumes continuous latent variables underlying the propensity to respond to items. This model can be used to describe the locations of people on a general continuous latent variable as well as on continuous orthogonal specific traits that characterize responses to groups of items. The bifactor graded response (bifac-GR) model is presented in contrast to a correlated traits (or multidimensional GR model) and unidimensional GR model. Bifac-GR model specification, assumptions, estimation, and interpretation are demonstrated with a reanalysis of data (Campbell, 2008) on the Shared Activities Questionnaire. We also show the importance of marginalizing the slopes for interpretation purposes and we extend the concept to the interpretation of the information function. To go along with the illustrative example analyses, we have made available supplementary files that include command file (syntax) examples and outputs from flexMIRT, IRTPRO, R, Mplus, and STATA. Supplementary data to this article can be found online at http://dx.doi.org/10.1016/j.jsp.2016.11.001. Data needed to reproduce analyses in this article are available as supplemental materials (online only) in the Appendix of this article. Copyright © 2016 Society for the Study of School Psychology. Published by Elsevier Ltd. All rights reserved.