WorldWideScience

Sample records for item bias studies

  1. Item bias detection in the Hospital Anxiety and Depression Scale using structural equation modeling: comparison with other item bias detection methods

    NARCIS (Netherlands)

    Verdam, M.G.E.; Oort, F.J.; Sprangers, M.A.G.

    Purpose Comparison of patient-reported outcomes may be invalidated by the occurrence of item bias, also known as differential item functioning. We show two ways of using structural equation modeling (SEM) to detect item bias: (1) multigroup SEM, which enables the detection of both uniform and

  2. Latent Trait Theory Applications to Test Item Bias Methodology. Research Memorandum No. 1.

    Science.gov (United States)

    Osterlind, Steven J.; Martois, John S.

    This study discusses latent trait theory applications to test item bias methodology. A real data set is used in describing the rationale and application of the Rasch probabilistic model item calibrations across various ethnic group populations. A high school graduation proficiency test covering reading comprehension, writing mechanics, and…

  3. Assessing cross-cultural item bias in questionnaires: Acculturation and the Measurement of Social Support and Family Cohesion for Adolescents

    OpenAIRE

    Hemert, Dianne A. van; Baerveldt, Chris; Vermande, Marjolijn

    2001-01-01

    Amethod is presented for evaluating the presence and size of cross-cultural item biases. The examined items concern parental support and family cohesion in a Likert-type questionnaire for adolescents in The Netherlands. Each evaluated item has two versions, a collectivist and an individualistic one, that measure the same theoretical construct. The standardized difference between the score means of the item versions, called the ?e score, gives an indication of the cultural bias of the item. As...

  4. Item bias in self-reported functional ability among 75-year-old men and women in three Nordic localities

    DEFF Research Database (Denmark)

    Avlund, K; Era, P; Davidsen, M

    1996-01-01

    to geographical locality and gender. Information about self-reported functional ability was gathered from surveys on 75-year-old men and women in Glostrup (Denmark), Göteborg (Sweden) and Jyväskylä (Finland). The data were collected by structured home interviews about mobility and Physical activities of daily......The purpose of this article is to analyse item bias in a measure of self-reported functional ability among 75-year-old people in three Nordic localities. The present item bias analysis examines whether the construction of a functional ability index from several variables results in bias in relation...... living (PADL) in relation to tiredness, reduced speed and dependency and combined into three tiredness-scales, three reduced speed-scales and two dependency-scales. The analysis revealed item bias regarding geographical locality in seven out of eight of the functional ability scales, but nearly no bias...

  5. Transcranial direct current stimulation over the parietal cortex alters bias in item and source memory tasks.

    Science.gov (United States)

    Pergolizzi, Denise; Chua, Elizabeth F

    2016-10-01

    Neuroimaging data have shown that activity in the lateral posterior parietal cortex (PPC) correlates with item recognition and source recollection, but there is considerable debate about its specific contributions. Performance on both item and source memory tasks were compared between participants who were given bilateral transcranial direct current stimulation (tDCS) over the parietal cortex to those given prefrontal or sham tDCS. The parietal tDCS group, but not the prefrontal group, showed decreased false recognition, and less bias in item and source discrimination tasks compared to sham stimulation. These results are consistent with a causal role of the PPC in item and source memory retrieval, likely based on attentional and decision-making biases. Copyright © 2016 Elsevier Inc. All rights reserved.

  6. Measurement and control of bias in patient reported outcomes using multidimensional item response theory.

    Science.gov (United States)

    Dowling, N Maritza; Bolt, Daniel M; Deng, Sien; Li, Chenxi

    2016-05-26

    Patient-reported outcome (PRO) measures play a key role in the advancement of patient-centered care research. The accuracy of inferences, relevance of predictions, and the true nature of the associations made with PRO data depend on the validity of these measures. Errors inherent to self-report measures can seriously bias the estimation of constructs assessed by the scale. A well-documented disadvantage of self-report measures is their sensitivity to response style (RS) effects such as the respondent's tendency to select the extremes of a rating scale. Although the biasing effect of extreme responding on constructs measured by self-reported tools has been widely acknowledged and studied across disciplines, little attention has been given to the development and systematic application of methodologies to assess and control for this effect in PRO measures. We review the methodological approaches that have been proposed to study extreme RS effects (ERS). We applied a multidimensional item response theory model to simultaneously estimate and correct for the impact of ERS on trait estimation in a PRO instrument. Model estimates were used to study the biasing effects of ERS on sum scores for individuals with the same amount of the targeted trait but different levels of ERS. We evaluated the effect of joint estimation of multiple scales and ERS on trait estimates and demonstrated the biasing effects of ERS on these trait estimates when used as explanatory variables. A four-dimensional model accounting for ERS bias provided a better fit to the response data. Increasing levels of ERS showed bias in total scores as a function of trait estimates. The effect of ERS was greater when the pattern of extreme responding was the same across multiple scales modeled jointly. The estimated item category intercepts provided evidence of content independent category selection. Uncorrected trait estimates used as explanatory variables in prediction models showed downward bias. A

  7. Analysis of Item-Level Bias in the Bayley-III Language Subscales: The Validity and Utility of Standardized Language Assessment in a Multilingual Setting.

    Science.gov (United States)

    Goh, Shaun K Y; Tham, Elaine K H; Magiati, Iliana; Sim, Litwee; Sanmugam, Shamini; Qiu, Anqi; Daniel, Mary L; Broekman, Birit F P; Rifkin-Graboi, Anne

    2017-09-18

    The purpose of this study was to improve standardized language assessments among bilingual toddlers by investigating and removing the effects of bias due to unfamiliarity with cultural norms or a distributed language system. The Expressive and Receptive Bayley-III language scales were adapted for use in a multilingual country (Singapore). Differential item functioning (DIF) was applied to data from 459 two-year-olds without atypical language development. This involved investigating if the probability of success on each item varied according to language exposure while holding latent language ability, gender, and socioeconomic status constant. Associations with language, behavioral, and emotional problems were also examined. Five of 16 items showed DIF, 1 of which may be attributed to cultural bias and another to a distributed language system. The remaining 3 items favored toddlers with higher bilingual exposure. Removal of DIF items reduced associations between language scales and emotional and language problems, but improved the validity of the expressive scale from poor to good. Our findings indicate the importance of considering cultural and distributed language bias in standardized language assessments. We discuss possible mechanisms influencing performance on items favoring bilingual exposure, including the potential role of inhibitory processing.

  8. Assessing cross-cultural item bias in questionnaires : Acculturation and the Measurement of Social Support and Family Cohesion for Adolescents

    NARCIS (Netherlands)

    Hemert, Dianne A. van; Baerveldt, Chris; Vermande, Marjolijn

    2001-01-01

    Amethod is presented for evaluating the presence and size of cross-cultural item biases. The examined items concern parental support and family cohesion in a Likert-type questionnaire for adolescents in The Netherlands. Each evaluated item has two versions, a collectivist and an individualistic one,

  9. A unified factor-analytic approach to the detection of item and test bias: Illustration with the effect of providing calculators to students with dyscalculia

    Directory of Open Access Journals (Sweden)

    Lee, M. K.

    2016-01-01

    Full Text Available An absence of measurement bias against distinct groups is a prerequisite for the use of a given psychological instrument in scientific research or high-stakes assessment. Factor analysis is the framework explicitly adopted for the identification of such bias when the instrument consists of a multi-test battery, whereas item response theory is employed when the focus narrows to a single test composed of discrete items. Item response theory can be treated as a mild nonlinearization of the standard factor model, and thus the essential unity of bias detection at the two levels merits greater recognition. Here we illustrate the benefits of a unified approach with a real-data example, which comes from a statewide test of mathematics achievement where examinees diagnosed with dyscalculia were accommodated with calculators. We found that items that can be solved by explicit arithmetical computation became easier for the accommodated examinees, but the quantitative magnitude of this differential item functioning (measurement bias was small.

  10. The construct equivalence and item bias of the pib/SpEEx conceptualisation-ability test for members of five language groups in South Africa

    Directory of Open Access Journals (Sweden)

    Pieter Schaap

    2008-11-01

    Full Text Available This study’s objective was to determine whether the Potential Index Batteries/Situation Specific Evaluation Expert (PIB/SpEEx conceptualisation (100 ability test displays construct equivalence and item bias for members of five selected language groups in South Africa. The sample consisted of a non-probability convenience sample (N = 6 261 of members of five language groups (speakers of Afrikaans, English, North Sotho, Setswana and isiZulu working in the medical and beverage industries or studying at higher-educational institutions. Exploratory factor analysis with target rotations confrmed the PIB/SpEEx 100’s construct equivalence for the respondents from these five language groups. No evidence of either uniform or non-uniform item bias of practical signifcance was found for the sample.

  11. Psychometric Consequences of Subpopulation Item Parameter Drift

    Science.gov (United States)

    Huggins-Manley, Anne Corinne

    2017-01-01

    This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…

  12. Verification of Differential Item Functioning (DIF) Status of West ...

    African Journals Online (AJOL)

    This study investigated test item bias and Differential Item Functioning (DIF) of West African ... items in chemistry function differentially with respect to gender and location. In Aba education zone of Abia, 50 secondary schools were purposively ...

  13. Final Sampling Bias in Haptic Judgments: How Final Touch Affects Decision-Making.

    Science.gov (United States)

    Mitsuda, Takashi; Yoshioka, Yuichi

    2018-01-01

    When people make a choice between multiple items, they usually evaluate each item one after the other repeatedly. The effect of the order and number of evaluating items on one's choices is essential to understanding the decision-making process. Previous studies have shown that when people choose a favorable item from two items, they tend to choose the item that they evaluated last. This tendency has been observed regardless of sensory modalities. This study investigated the origin of this bias by using three experiments involving two-alternative forced-choice tasks using handkerchiefs. First, the bias appeared in a smoothness discrimination task, which indicates that the bias was not based on judgments of preference. Second, the handkerchief that was touched more often tended to be chosen more frequently in the preference task, but not in the smoothness discrimination task, indicating that a mere exposure effect enhanced the bias. Third, in the condition where the number of touches did not differ between handkerchiefs, the bias appeared when people touched a handkerchief they wanted to touch last, but not when people touched the handkerchief that was predetermined. This finding suggests a direct coupling between final voluntary touching and judgment.

  14. Cross-National Prevalence of Traditional Bullying, Traditional Victimization, Cyberbullying and Cyber-Victimization: Comparing Single-Item and Multiple-Item Approaches of Measurement

    Science.gov (United States)

    Yanagida, Takuya; Gradinger, Petra; Strohmeier, Dagmar; Solomontos-Kountouri, Olga; Trip, Simona; Bora, Carmen

    2016-01-01

    Many large-scale cross-national studies rely on a single-item measurement when comparing prevalence rates of traditional bullying, traditional victimization, cyberbullying, and cyber-victimization between countries. However, the reliability and validity of single-item measurement approaches are highly problematic and might be biased. Data from…

  15. Estimating non-response bias in family studies: application to mental health and lifestyle.

    NARCIS (Netherlands)

    Vink, J.M.; Willemsen, A.H.M.; Stubbe, J.H.; Middeldorp, C.M.; Ligthart, L.; Baas, K.D.; Dirkzwager, H.; de Geus, E.J.C.; Boomsma, D.I.

    2004-01-01

    Non-response to mailed surveys reduces the effective sample size and may introduce bias. Non-response has been studied by (1) comparison to available data in population based registers, (2) directly contacting non-respondents by telephone or single-item reply cards, and (3) longitudinal repetition

  16. A randomised trial and economic evaluation of the effect of response mode on response rate, response bias, and item non-response in a survey of doctors

    Directory of Open Access Journals (Sweden)

    Witt Julia

    2011-09-01

    Full Text Available Abstract Background Surveys of doctors are an important data collection method in health services research. Ways to improve response rates, minimise survey response bias and item non-response, within a given budget, have not previously been addressed in the same study. The aim of this paper is to compare the effects and costs of three different modes of survey administration in a national survey of doctors. Methods A stratified random sample of 4.9% (2,702/54,160 of doctors undertaking clinical practice was drawn from a national directory of all doctors in Australia. Stratification was by four doctor types: general practitioners, specialists, specialists-in-training, and hospital non-specialists, and by six rural/remote categories. A three-arm parallel trial design with equal randomisation across arms was used. Doctors were randomly allocated to: online questionnaire (902; simultaneous mixed mode (a paper questionnaire and login details sent together (900; or, sequential mixed mode (online followed by a paper questionnaire with the reminder (900. Analysis was by intention to treat, as within each primary mode, doctors could choose either paper or online. Primary outcome measures were response rate, survey response bias, item non-response, and cost. Results The online mode had a response rate 12.95%, followed by the simultaneous mixed mode with 19.7%, and the sequential mixed mode with 20.7%. After adjusting for observed differences between the groups, the online mode had a 7 percentage point lower response rate compared to the simultaneous mixed mode, and a 7.7 percentage point lower response rate compared to sequential mixed mode. The difference in response rate between the sequential and simultaneous modes was not statistically significant. Both mixed modes showed evidence of response bias, whilst the characteristics of online respondents were similar to the population. However, the online mode had a higher rate of item non-response compared

  17. Memory bias for negative emotional words in recognition memory is driven by effects of category membership.

    Science.gov (United States)

    White, Corey N; Kapucu, Aycan; Bruno, Davide; Rotello, Caren M; Ratcliff, Roger

    2014-01-01

    Recognition memory studies often find that emotional items are more likely than neutral items to be labelled as studied. Previous work suggests this bias is driven by increased memory strength/familiarity for emotional items. We explored strength and bias interpretations of this effect with the conjecture that emotional stimuli might seem more familiar because they share features with studied items from the same category. Categorical effects were manipulated in a recognition task by presenting lists with a small, medium or large proportion of emotional words. The liberal memory bias for emotional words was only observed when a medium or large proportion of categorised words were presented in the lists. Similar, though weaker, effects were observed with categorised words that were not emotional (animal names). These results suggest that liberal memory bias for emotional items may be largely driven by effects of category membership.

  18. Modeling Temporal Bias of Uplift Events in Recommender Systems

    KAUST Repository

    Altaf, Basmah

    2013-05-08

    Today, commercial industry spends huge amount of resources in advertisement campaigns, new marketing strategies, and promotional deals to introduce their product to public and attract a large number of customers. These massive investments by a company are worthwhile because marketing tactics greatly influence the consumer behavior. Alternatively, these advertising campaigns have a discernible impact on recommendation systems which tend to promote popular items by ranking them at the top, resulting in biased and unfair decision making and loss of customers’ trust. The biasing impact of popularity of items on recommendations, however, is not fixed, and varies with time. Therefore, it is important to build a bias-aware recommendation system that can rank or predict items based on their true merit at given time frame. This thesis proposes a framework that can model the temporal bias of individual items defined by their characteristic contents, and provides a simple process for bias correction. Bias correction is done either by cleaning the bias from historical training data that is used for building predictive model, or by ignoring the estimated bias from the predictions of a standard predictor. Evaluated on two real world datasets, NetFlix and MovieLens, our framework is shown to be able to estimate and remove the bias as a result of adopted marketing techniques from the predicted popularity of items at a given time.

  19. Investigation of the Performance of Multidimensional Equating Procedures for Common-Item Nonequivalent Groups Design

    Directory of Open Access Journals (Sweden)

    Burcu ATAR

    2017-12-01

    Full Text Available In this study, the performance of the multidimensional extentions of Stocking-Lord, mean/mean, and mean/sigma equating procedures under common-item nonequivalent groups design was investigated. The performance of those three equating procedures was examined under the combination of various conditions including sample size, ability distribution, correlation between two dimensions, and percentage of anchor items in the test. Item parameter recovery was evaluated calculating RMSE (root man squared error and BIAS values. It was found that Stocking-Lord procedure provided the smaller RMSE and BIAS values for both item discrimination and item difficulty parameter estimates across most conditions.

  20. A Comparison of Multidimensional Item Selection Methods in Simple and Complex Test Designs

    Directory of Open Access Journals (Sweden)

    Eren Halil ÖZBERK

    2017-03-01

    Full Text Available In contrast with the previous studies, this study employed various test designs (simple and complex which allow the evaluation of the overall ability score estimations across multiple real test conditions. In this study, four factors were manipulated, namely the test design, number of items per dimension, correlation between dimensions and item selection methods. Using the generated item and ability parameters, dichotomous item responses were generated in by using M3PL compensatory multidimensional IRT model with specified correlations. MCAT composite ability score accuracy was evaluated using absolute bias (ABSBIAS, correlation and the root mean square error (RMSE between true and estimated ability scores. The results suggest that the multidimensional test structure, number of item per dimension and correlation between dimensions had significant effect on item selection methods for the overall score estimations. For simple structure test design it was found that V1 item selection has the lowest absolute bias estimations for both long and short tests while estimating overall scores. As the model gets complex KL item selection method performed better than other two item selection method.

  1. Behavioral decoding of working memory items inside and outside the focus of attention.

    Science.gov (United States)

    Mallett, Remington; Lewis-Peacock, Jarrod A

    2018-03-31

    How we attend to our thoughts affects how we attend to our environment. Holding information in working memory can automatically bias visual attention toward matching information. By observing attentional biases on reaction times to visual search during a memory delay, it is possible to reconstruct the source of that bias using machine learning techniques and thereby behaviorally decode the content of working memory. Can this be done when more than one item is held in working memory? There is some evidence that multiple items can simultaneously bias attention, but the effects have been inconsistent. One explanation may be that items are stored in different states depending on the current task demands. Recent models propose functionally distinct states of representation for items inside versus outside the focus of attention. Here, we use behavioral decoding to evaluate whether multiple memory items-including temporarily irrelevant items outside the focus of attention-exert biases on visual attention. Only the single item in the focus of attention was decodable. The other item showed a brief attentional bias that dissipated until it returned to the focus of attention. These results support the idea of dynamic, flexible states of working memory across time and priority. © 2018 New York Academy of Sciences.

  2. Heuristic Processes in Ratings of Leader Behavior: Assessing Item-Induced Availability Biases.

    Science.gov (United States)

    Binning, John F.; Fernandez, Guadalupe

    Since observers' memory-based ratings of organizational phenomena provide data in research and decision-making contexts, bias in observers' judgments must be examined. A study was conducted to explore the extent to which leader behavior ratings are more generally biased by the availability heuristic. The availability heuristic is operative when a…

  3. Patterns of source monitoring bias in incarcerated youths with and without conduct problems.

    Science.gov (United States)

    Morosan, Larisa; Badoud, Deborah; Salaminios, George; Eliez, Stephan; Van der Linden, Martial; Heller, Patrick; Debbané, Martin

    2018-01-01

    Antisocial individuals present behaviours that violate the social norms and the rights of others. In the present study, we examine whether biases in monitoring the self-generated cognitive material might be linked to antisocial manifestations during adolescence. We further examine the association with psychopathic traits and conduct problems (CPs). Sixty-five incarcerated adolescents (IAs; M age = 15.85, SD = 1.30) and 88 community adolescents (CAs; M age = 15.78, SD = 1.60) participated in our study. In the IA group, 28 adolescents presented CPs (M age = 16.06, SD = 1.41) and 19 did not meet the diagnostic criteria for CPs (M age = 15.97, SD = 1.20). Source monitoring was assessed through a speech-monitoring task, using items requiring different levels of cognitive effort; recognition and source-monitoring bias scores (internalising and externalising biases) were calculated. Between-group comparisons indicate greater overall biases and different patterns of biases in the source monitoring. IA participants manifest a greater externalising bias, whereas CA participants present a greater internalising bias. In addition, IA with CPs present different patterns of item recognition. These results indicate that the two groups of adolescents present different types of source-monitoring bias for self-generated speech. In addition, the IAs with CPs present impairments in item recognition. Future studies may examine the developmental implications of self-monitoring biases in the perseverance of antisocial behaviours from adolescence to adulthood.

  4. Readdressing gender bias in the Coopersmith Self-Esteem Inventory-short form.

    Science.gov (United States)

    Chapman, Paula L; Mullis, Ann K

    2002-12-01

    The short form of the Coopersmith Self-Esteem Inventory (SEI) was evaluated for gender bias. The authors replicated a study by L. Francis and D. James (1998) and administered the SEI to 361 middle and high school students (146 boys, 2l5 girls). They found that gender bias existed in 6 of the 25 items on the SEI, with 5 of those items favoring boys. Because recent literature indicates that male and female adolescents experience problems in different areas of their lives, the authors suggest that researchers consider such differences when selecting items for a standardized measure.

  5. Effect of standardized training on the reliability of the Cochrane risk of bias assessment tool: a prospective study.

    Science.gov (United States)

    da Costa, Bruno R; Beckett, Brooke; Diaz, Alison; Resta, Nina M; Johnston, Bradley C; Egger, Matthias; Jüni, Peter; Armijo-Olivo, Susan

    2017-03-03

    The Cochrane risk of bias tool is commonly criticized for having a low reliability. We aimed to investigate whether training of raters, with objective and standardized instructions on how to assess risk of bias, can improve the reliability of the Cochrane risk of bias tool. In this pilot study, four raters inexperienced in risk of bias assessment were randomly allocated to minimal or intensive standardized training for risk of bias assessment of randomized trials of physical therapy treatments for patients with knee osteoarthritis pain. Two raters were experienced risk of bias assessors who served as reference. The primary outcome of our study was between-group reliability, defined as the agreement of the risk of bias assessments of inexperienced raters with the reference assessments of experienced raters. Consensus-based assessments were used for this purpose. The secondary outcome was within-group reliability, defined as the agreement of assessments within pairs of inexperienced raters. We calculated the chance-corrected weighted Kappa to quantify agreement within and between groups of raters for each of the domains of the risk of bias tool. A total of 56 trials were included in our analysis. The Kappa for the agreement of inexperienced raters with reference across items of the risk of bias tool ranged from 0.10 to 0.81 for the minimal training group and from 0.41 to 0.90 for the standardized training group. The Kappa values for the agreement within pairs of inexperienced raters across the items of the risk of bias tool ranged from 0 to 0.38 for the minimal training group and from 0.93 to 1 for the standardized training group. Between-group differences in Kappa for the agreement of inexperienced raters with reference always favored the standardized training group and was most pronounced for incomplete outcome data (difference in Kappa 0.52, p training on risk of bias assessment may significantly improve the reliability of the Cochrane risk of bias tool.

  6. Do people with and without medical conditions respond similarly to the short health anxiety inventory? An assessment of differential item functioning using item response theory.

    Science.gov (United States)

    LeBouthillier, Daniel M; Thibodeau, Michel A; Alberts, Nicole M; Hadjistavropoulos, Heather D; Asmundson, Gordon J G

    2015-04-01

    Individuals with medical conditions are likely to have elevated health anxiety; however, research has not demonstrated how medical status impacts response patterns on health anxiety measures. Measurement bias can undermine the validity of a questionnaire by overestimating or underestimating scores in groups of individuals. We investigated whether the Short Health Anxiety Inventory (SHAI), a widely-used measure of health anxiety, exhibits medical condition-based bias on item and subscale levels, and whether the SHAI subscales adequately assess the health anxiety continuum. Data were from 963 individuals with diabetes, breast cancer, or multiple sclerosis, and 372 healthy individuals. Mantel-Haenszel tests and item characteristic curves were used to classify the severity of item-level differential item functioning in all three medical groups compared to the healthy group. Test characteristic curves were used to assess scale-level differential item functioning and whether the SHAI subscales adequately assess the health anxiety continuum. Nine out of 14 items exhibited differential item functioning. Two items exhibited differential item functioning in all medical groups compared to the healthy group. In both Thought Intrusion and Fear of Illness subscales, differential item functioning was associated with mildly deflated scores in medical groups with very high levels of the latent traits. Fear of Illness items poorly discriminated between individuals with low and very low levels of the latent trait. While individuals with medical conditions may respond differentially to some items, clinicians and researchers can confidently use the SHAI with a variety of medical populations without concern of significant bias. Copyright © 2015 Elsevier Inc. All rights reserved.

  7. Geriatric Anxiety Scale: item response theory analysis, differential item functioning, and creation of a ten-item short form (GAS-10).

    Science.gov (United States)

    Mueller, Anne E; Segal, Daniel L; Gavett, Brandon; Marty, Meghan A; Yochim, Brian; June, Andrea; Coolidge, Frederick L

    2015-07-01

    The Geriatric Anxiety Scale (GAS; Segal et al. (Segal, D. L., June, A., Payne, M., Coolidge, F. L. and Yochim, B. (2010). Journal of Anxiety Disorders, 24, 709-714. doi:10.1016/j.janxdis.2010.05.002) is a self-report measure of anxiety that was designed to address unique issues associated with anxiety assessment in older adults. This study is the first to use item response theory (IRT) to examine the psychometric properties of a measure of anxiety in older adults. A large sample of older adults (n = 581; mean age = 72.32 years, SD = 7.64 years, range = 60 to 96 years; 64% women; 88% European American) completed the GAS. IRT properties were examined. The presence of differential item functioning (DIF) or measurement bias by age and sex was assessed, and a ten-item short form of the GAS (called the GAS-10) was created. All GAS items had discrimination parameters of 1.07 or greater. Items from the somatic subscale tended to have lower discrimination parameters than items on the cognitive or affective subscales. Two items were flagged for DIF, but the impact of the DIF was negligible. Women scored significantly higher than men on the GAS and its subscales. Participants in the young-old group (60 to 79 years old) scored significantly higher on the cognitive subscale than participants in the old-old group (80 years old and older). Results from the IRT analyses indicated that the GAS and GAS-10 have strong psychometric properties among older adults. We conclude by discussing implications and future research directions.

  8. A signal detection-item response theory model for evaluating neuropsychological measures.

    Science.gov (United States)

    Thomas, Michael L; Brown, Gregory G; Gur, Ruben C; Moore, Tyler M; Patt, Virginie M; Risbrough, Victoria B; Baker, Dewleen G

    2018-02-05

    Models from signal detection theory are commonly used to score neuropsychological test data, especially tests of recognition memory. Here we show that certain item response theory models can be formulated as signal detection theory models, thus linking two complementary but distinct methodologies. We then use the approach to evaluate the validity (construct representation) of commonly used research measures, demonstrate the impact of conditional error on neuropsychological outcomes, and evaluate measurement bias. Signal detection-item response theory (SD-IRT) models were fitted to recognition memory data for words, faces, and objects. The sample consisted of U.S. Infantry Marines and Navy Corpsmen participating in the Marine Resiliency Study. Data comprised item responses to the Penn Face Memory Test (PFMT; N = 1,338), Penn Word Memory Test (PWMT; N = 1,331), and Visual Object Learning Test (VOLT; N = 1,249), and self-report of past head injury with loss of consciousness. SD-IRT models adequately fitted recognition memory item data across all modalities. Error varied systematically with ability estimates, and distributions of residuals from the regression of memory discrimination onto self-report of past head injury were positively skewed towards regions of larger measurement error. Analyses of differential item functioning revealed little evidence of systematic bias by level of education. SD-IRT models benefit from the measurement rigor of item response theory-which permits the modeling of item difficulty and examinee ability-and from signal detection theory-which provides an interpretive framework encompassing the experimentally validated constructs of memory discrimination and response bias. We used this approach to validate the construct representation of commonly used research measures and to demonstrate how nonoptimized item parameters can lead to erroneous conclusions when interpreting neuropsychological test data. Future work might include the

  9. Preference Bias of Head Orientation in Choosing between Two Non-durables

    Directory of Open Access Journals (Sweden)

    Hiroyuki eFunaya

    2015-06-01

    Full Text Available The goal of this study is to investigate how customers’ gaze, head and body orientations reflect their choices. Although the relationship between human choice and gaze behavior has been well studied, other behaviors such as head and body are unknown. We conducted a two-alternatives-forced-choice task to examine (1 whether preference bias, i.e. a positional bias in gaze, head and body toward the item that was later chosen, exists in choice, (2 when preference bias is observed and when prediction of the resulting choice becomes possible (3 whether human choice is affected when the body orientations are manipulated. We used real non-durable products (cheap snacks and clothing on a shopping shelf. The results showed that there was a significant preference bias in head orientation at the beginning one second when the subjects stood straight toward the shelf, and that the head orientation was more biased toward the selected item than the gaze and the center of pressure at the ending one second. Manipulating body orientation did not affect the result of choice. The preference bias detected by observing the head orientation would be useful in marketing science for predicting customers’ choice.

  10. Preference bias of head orientation in choosing between two non-durables.

    Science.gov (United States)

    Funaya, Hiroyuki; Shibata, Tomohiro

    2015-01-01

    The goal of this study is to investigate how customers' gaze, head and body orientations reflect their choices. Although the relationship between human choice and gaze behavior has been well-studied, other behaviors such as head and body are unknown. We conducted a two-alternatives-forced-choice task to examine (1) whether preference bias, i.e., a positional bias in gaze, head and body toward the item that was later chosen, exists in choice, (2) when preference bias is observed and when prediction of the resulting choice becomes possible (3) whether human choice is affected when the body orientations are manipulated. We used real non-durable products (cheap snacks and clothing) on a shopping shelf. The results showed that there was a significant preference bias in head orientation at the beginning 1 s when the subjects stood straight toward the shelf, and that the head orientation was more biased toward the selected item than the gaze and the center of pressure at the ending 1 s. Manipulating body orientation did not affect the result of choice. The preference bias detected by observing the head orientation would be useful in marketing science for predicting customers' choice.

  11. Detecting rater bias using a person-fit statistic: a Monte Carlo simulation study.

    Science.gov (United States)

    Aubin, André-Sébastien; St-Onge, Christina; Renaud, Jean-Sébastien

    2018-04-01

    With the Standards voicing concern for the appropriateness of response processes, we need to explore strategies that would allow us to identify inappropriate rater response processes. Although certain statistics can be used to help detect rater bias, their use is complicated by either a lack of data about their actual power to detect rater bias or the difficulty related to their application in the context of health professions education. This exploratory study aimed to establish the worthiness of pursuing the use of l z to detect rater bias. We conducted a Monte Carlo simulation study to investigate the power of a specific detection statistic, that is: the standardized likelihood l z person-fit statistics (PFS). Our primary outcome was the detection rate of biased raters, namely: raters whom we manipulated into being either stringent (giving lower scores) or lenient (giving higher scores), using the l z statistic while controlling for the number of biased raters in a sample (6 levels) and the rate of bias per rater (6 levels). Overall, stringent raters (M = 0.84, SD = 0.23) were easier to detect than lenient raters (M = 0.31, SD = 0.28). More biased raters were easier to detect then less biased raters (60% bias: 62, SD = 0.37; 10% bias: 43, SD = 0.36). The PFS l z seems to offer an interesting potential to identify biased raters. We observed detection rates as high as 90% for stringent raters, for whom we manipulated more than half their checklist. Although we observed very interesting results, we cannot generalize these results to the use of PFS with estimated item/station parameters or real data. Such studies should be conducted to assess the feasibility of using PFS to identify rater bias.

  12. Gender Differences in Figural Matrices: The Moderating Role of Item Design Features

    Science.gov (United States)

    Arendasy, Martin E.; Sommer, Markus

    2012-01-01

    There is a heated debate on whether observed gender differences in some figural matrices in adults can be attributed to gender differences in inductive reasoning/G[subscript f] or differential item functioning and/or test bias. Based on previous studies we hypothesized that three specific item design features moderate the effect size of the gender…

  13. Dwalingen in de methodologie. II. Bias door vragenlijsten

    DEFF Research Database (Denmark)

    Pouwer, F; Van Der Ploeg, Henk M; Bramsen, I

    1998-01-01

    Some characteristics of self-report questionnaires can result in bias in responding. When a test item or a questionnaire is biased, the observed scores form an imprecise measurement of reality as a consequence of systematic errors of measurement. Causes of such bias are: unclear instructions, vague...

  14. Are Teacher Course Evaluations Biased against Faculty That Teach Quantitative Methods Courses?

    Science.gov (United States)

    Royal, Kenneth D.; Stockdale, Myrah R.

    2015-01-01

    The present study investigated graduate students' responses to teacher/course evaluations (TCE) to determine if students' responses were inherently biased against faculty who teach quantitative methods courses. Item response theory (IRT) and Differential Item Functioning (DIF) techniques were utilized for data analysis. Results indicate students…

  15. Some Cochrane risk of bias items are not important in osteoarthritis trials

    DEFF Research Database (Denmark)

    Bolvig, Julie; Juhl, Carsten B; Boutron, Isabelle

    2018-01-01

    of the risk of bias tool (RoB), trial size, single vs multi-site, and source of funding. Effect sizes were calculated as standardized mean differences (SMDs). Meta-regression was performed to identify "relevant study-level covariates" that decreases the between-study variance (τˆ2). RESULTS: Twenty reviews...

  16. A method for additive bias correction in cross-cultural surveys

    DEFF Research Database (Denmark)

    Scholderer, Joachim; Grunert, Klaus G.; Brunsø, Karen

    2001-01-01

    additive bias from cross-cultural data. The procedure involves four steps: (1) embed a potentially biased item in a factor-analytic measurement model, (2) test for the existence of additive bias between populations, (3) use the factor-analytic model to estimate the magnitude of the bias, and (4) replace......Measurement bias in cross-cultural surveys can seriously threaten the validity of hypothesis tests. Direct comparisons of means depend on the assumption that differences in observed variables reflect differences in the underlying constructs, and not an additive bias that may be caused by cultural...... differences in the understanding of item wording or response category labels. However, experience suggests that additive bias can be found more often than not. Based on the concept of partial measurement invariance (Byrne, Shavelson and Muthén, 1989), the present paper develops a procedure for eliminating...

  17. A scale purification procedure for evaluation of differential item functioning

    NARCIS (Netherlands)

    Khalid, Muhammad Naveed; Glas, Cornelis A.W.

    2014-01-01

    Item bias or differential item functioning (DIF) has an important impact on the fairness of psychological and educational testing. In this paper, DIF is seen as a lack of fit to an item response (IRT) model. Inferences about the presence and importance of DIF require a process of so-called test

  18. Investigating Assessment Bias for Constructed Response Explanation Tasks: Implications for Evaluating Performance Expectations for Scientific Practice

    Science.gov (United States)

    Federer, Meghan Rector

    frequently incorporate multivalent concepts into explanations of change, resulting in explanatory practices that were scientifically non-normative. However, use of follow-up question approaches was found to resolve this source of bias and thereby increase the validity of inferences about student understanding. The second study focused on issues of item and instrument structure, specifically item feature effects and item position effects, which have been shown to influence measures of student performance across assessment tasks. Results indicated that, along the instrument item sequence, items with similar surface features produced greater sequencing effects than sequences of items with dissimilar surface features. This bias could be addressed by use of a counterbalanced design (i.e., Latin Square) at the population level of analysis. Explanation scores were also highly correlated with student verbosity, despite verbosity being an intrinsically trivial aspect of explanation quality. Attempting to standardize student response length was one proposed solution to the verbosity bias. The third study explored gender differences in students' performance on constructed-response explanation tasks using impact (i.e., mean raw scores) and differential item function (i.e., item difficulties) patterns. While prior research in science education has suggested that females tend to perform better on constructed-response items, the results of this study revealed no overall differences in gender achievement. However, evaluation of specific item features patterns suggested that female respondents have a slight advantage on unfamiliar explanation tasks. That is, male students tended to incorporate fewer scientifically normative concepts (i.e., key concepts) than females for unfamiliar taxa. Conversely, females tended to incorporate more scientifically non-normative ideas (i.e., naive ideas) than males for familiar taxa. Together these results indicate that gender achievement differences for this

  19. Sources of Response Bias in Older Ethnic Minorities: A Case of Korean American Elderly

    Science.gov (United States)

    Kim, Miyong T.; Ko, Jisook; Yoon, Hyunwoo; Kim, Kim B.; Jang, Yuri

    2015-01-01

    The present study was undertaken to investigate potential sources of response bias in empirical research involving older ethnic minorities and to identify prudent strategies to reduce those biases, using Korean American elderly (KAE) as an example. Data were obtained from three independent studies of KAE (N=1,297; age ≥60) in three states (Florida, New York, and Maryland) from 2000 to 2008. Two common measures, Pearlin’s Mastery Scale and the CES-D scale, were selected for a series of psychometric tests based on classical measurement theory. Survey items were analyzed in depth, using psychometric properties generated from both exploratory factor analysis and confirmatory factor analysis as well as correlational analysis. Two types of potential sources of bias were identified as the most significant contributors to increases in error variances for these psychological instruments. Error variances were most prominent when (1) items were not presented in a manner that was culturally or contextually congruent with respect to the target population and/or (2) the response anchors for items were mixed (e.g., positive vs. negative). The systemic patterns and magnitudes of the biases were also cross-validated for the three studies. The results demonstrate sources and impacts of measurement biases in studies of older ethnic minorities. The identified response biases highlight the need for re-evaluation of current measurement practices, which are based on traditional recommendations that response anchors should be mixed or that the original wording of instruments should be rigidly followed. Specifically, systematic guidelines for accommodating cultural and contextual backgrounds into instrument design are warranted. PMID:26049971

  20. Using a Linear Regression Method to Detect Outliers in IRT Common Item Equating

    Science.gov (United States)

    He, Yong; Cui, Zhongmin; Fang, Yu; Chen, Hanwei

    2013-01-01

    Common test items play an important role in equating alternate test forms under the common item nonequivalent groups design. When the item response theory (IRT) method is applied in equating, inconsistent item parameter estimates among common items can lead to large bias in equated scores. It is prudent to evaluate inconsistency in parameter…

  1. A Diffusion Model Analysis of Decision Biases Affecting Delayed Recognition of Emotional Stimuli

    Science.gov (United States)

    Bowen, Holly J.; Spaniol, Julia; Patel, Ronak; Voss, Andreas

    2016-01-01

    Previous empirical work suggests that emotion can influence accuracy and cognitive biases underlying recognition memory, depending on the experimental conditions. The current study examines the effects of arousal and valence on delayed recognition memory using the diffusion model, which allows the separation of two decision biases thought to underlie memory: response bias and memory bias. Memory bias has not been given much attention in the literature but can provide insight into the retrieval dynamics of emotion modulated memory. Participants viewed emotional pictorial stimuli; half were given a recognition test 1-day later and the other half 7-days later. Analyses revealed that emotional valence generally evokes liberal responding, whereas high arousal evokes liberal responding only at a short retention interval. The memory bias analyses indicated that participants experienced greater familiarity with high-arousal compared to low-arousal items and this pattern became more pronounced as study-test lag increased; positive items evoke greater familiarity compared to negative and this pattern remained stable across retention interval. The findings provide insight into the separate contributions of valence and arousal to the cognitive mechanisms underlying delayed emotion modulated memory. PMID:26784108

  2. A procedure for eliminating additive bias from cross-cultural survey data

    DEFF Research Database (Denmark)

    Scholderer, Joachim; Grunert, Klaus G.; Brunsø, Karen

    2005-01-01

    additive bias from cross-cultural data. The procedure involves four steps: (1) embed a potentially biased item in a factor-analytic measurement model, (2) test for the existence of additive bias between populations, (3) use the factor-analytic model to estimate the magnitude of the bias, and (4) replace......Measurement bias in cross-cultural surveys can seriously threaten the validity of hypothesis tests. Direct comparisons of means depend on the assumption that differences in observed variables reflect differences in the underlying constructs, and not an additive bias that may be caused by cultural...... differences in the understanding of item wording or response category labels. However, experience suggests that additive bias can be found more often than not. Based on the concept of partial measurement invariance (Byrne, Shavelson and Muthén 1989), the present paper develops a procedure for eliminating...

  3. Weight bias internalization across weight categories among school-aged children. Validation of the Weight Bias Internalization Scale for Children.

    Science.gov (United States)

    Zuba, Anna; Warschburger, Petra

    2018-06-01

    Anti-fat bias is widespread and is linked to the internalization of weight bias and psychosocial problems. The purpose of this study was to examine the internalization of weight bias among children across weight categories and to evaluate the psychometric properties of the Weight Bias Internalization Scale for Children (WBIS-C). Data were collected from 1484 primary school children and their parents. WBIS-C demonstrated good internal consistency (α = .86) after exclusion of Item 1. The unitary factor structure was supported using exploratory and confirmatory factor analyses (factorial validity). Girls and overweight children reported higher WBIS-C scores in comparison to boys and non-overweight peers (known-groups validity). Convergent validity was shown by significant correlations with psychosocial problems. Internalization of weight bias explained additional variance in different indicators of psychosocial well-being. The results suggest that the WBIS-C is a psychometrically sound and informative tool to assess weight bias internalization among children. Copyright © 2018 Elsevier Ltd. All rights reserved.

  4. Bias Correction for the Maximum Likelihood Estimate of Ability. Research Report. ETS RR-05-15

    Science.gov (United States)

    Zhang, Jinming

    2005-01-01

    Lord's bias function and the weighted likelihood estimation method are effective in reducing the bias of the maximum likelihood estimate of an examinee's ability under the assumption that the true item parameters are known. This paper presents simulation studies to determine the effectiveness of these two methods in reducing the bias when the item…

  5. Three controversies over item disclosure in medical licensure examinations

    Directory of Open Access Journals (Sweden)

    Yoon Soo Park

    2015-09-01

    Full Text Available In response to views on public's right to know, there is growing attention to item disclosure – release of items, answer keys, and performance data to the public – in medical licensure examinations and their potential impact on the test's ability to measure competence and select qualified candidates. Recent debates on this issue have sparked legislative action internationally, including South Korea, with prior discussions among North American countries dating over three decades. The purpose of this study is to identify and analyze three issues associated with item disclosure in medical licensure examinations – 1 fairness and validity, 2 impact on passing levels, and 3 utility of item disclosure – by synthesizing existing literature in relation to standards in testing. Historically, the controversy over item disclosure has centered on fairness and validity. Proponents of item disclosure stress test takers’ right to know, while opponents argue from a validity perspective. Item disclosure may bias item characteristics, such as difficulty and discrimination, and has consequences on setting passing levels. To date, there has been limited research on the utility of item disclosure for large scale testing. These issues requires ongoing and careful consideration.

  6. Intentional forgetting reduces color-naming interference: evidence from item-method directed forgetting.

    Science.gov (United States)

    Lee, Yuh-Shiow; Lee, Huang-Mou; Fawcett, Jonathan M

    2013-01-01

    In an item-method-directed forgetting task, Chinese words were presented individually, each followed by an instruction to remember or forget. Colored probe items were presented following each memory instruction requiring a speeded color-naming response. Half of the probe items were novel and unrelated to the preceding study item, whereas the remaining half of the probe items were a repetition of the preceding study item. Repeated probe items were either identical to the preceding study item (E1, E2), a phonetic reproduction of the preceding study item (E3), or perceptually matched to the preceding study item (E4). Color-naming interference was calculated by subtracting color-naming reaction times made in response to a string of meaningless symbols from that of the novel and repeated conditions. Across all experiments, participants recalled more to-be-remembered (TBR) than to-be-forgotten (TBF) study words. More importantly, Experiments 1 and 2 found that color-naming interference was reduced for repeated TBF words relative to repeated TBR words. Experiments 3 and 4 further found that this effect occurred at the perceptual rather than semantic level. These findings suggest that participants may bias processing resources away from the perceptual representation of to-be-forgotten information.

  7. Development and Validation of a Response Bias Scale (RBS) for the MMPI-2

    Science.gov (United States)

    Gervais, Roger O.; Ben-Porath, Yossef S.; Wygant, Dustin B.; Green, Paul

    2007-01-01

    This study describes the development of a Minnesota Multiphasic Personality Inventory (MMPI-2) scale designed to detect negative response bias in forensic neuropsychological or disability assessment settings. The Response Bias Scale (RBS) consists of 28 MMPI-2 items that discriminated between persons who passed or failed the Word Memory Test…

  8. Mood-congruent free recall bias in anxious individuals is not a consequence of response bias.

    Science.gov (United States)

    Russo, Riccardo; Whittuck, Dora; Roberson, Debi; Dutton, Kevin; Georgiou, George; Fox, Elaine

    2006-05-01

    The status of mood-congruent free recall bias in anxious individuals was evaluated following incidental encoding of target words. Individuals with high and low levels of trait anxiety completed a modified Stroop task, which revealed an attentional bias for threat-related stimuli in anxious individuals. This group was significantly slower in naming the colour in which threat-related words were displayed compared to neutral words. In a subsequent free recall test for the words used in the modified Stroop task, anxious individuals recalled more threat-related words compared to low-anxious people. This difference was significant even when controlling for the false recall of items that had not been presented during study. These results support the view put forward by Russo, Fox, Bellinger, and Nguyen-Van-Tam (2001) that mood-congruent free recall bias in anxious individuals can be observed if the target material is encoded at a relatively shallow level. Moreover, contrary to Dowens and Calvo (2003), the current results show that the memory advantage for threat-related information in anxious individuals is not a consequence of response bias.

  9. Gender-based Differential Item Functioning in the Application of the Theory of Planned Behavior for the Study of Entrepreneurial Intentions.

    Science.gov (United States)

    Zampetakis, Leonidas A; Bakatsaki, Maria; Litos, Charalambos; Kafetsios, Konstantinos G; Moustakis, Vassilis

    2017-01-01

    Over the past years the percentage of female entrepreneurs has increased, yet it is still far below of that for males. Although various attempts have been made to explain differences in mens' and women's entrepreneurial attitudes and intentions, the extent to which those differences are due to self-report biases has not been yet considered. The present study utilized Differential Item Functioning (DIF) to compare men and women's reporting on entrepreneurial intentions. DIF occurs in situations where members of different groups show differing probabilities of endorsing an item despite possessing the same level of the ability that the item is intended to measure. Drawing on the theory of planned behavior (TPB), the present study investigated whether constructs such as entrepreneurial attitudes, perceived behavioral control, subjective norms and intention would show gender differences and whether these gender differences could be explained by DIF. Using DIF methods on a dataset of 1800 Greek participants (50.4% female) indicated that differences at the item-level are almost non-existent. Moreover, the differential test functioning (DTF) analysis, which allows assessing the overall impact of DIF effects with all items being taken into account simultaneously, suggested that the effect of DIF across all the items for each scale was negligible. Future research should consider that measurement invariance can be assumed when using TPB constructs for the study of entrepreneurial motivation independent of gender.

  10. Use of differential item functioning (DIF analysis for bias analysis in test construction

    Directory of Open Access Journals (Sweden)

    Marié De Beer

    2004-10-01

    Opsomming Waar differensiële itemfunksioneringsprosedures (DIF-prosedures vir itemontleding gebaseer op itemresponsteorie (IRT tydens toetskonstruksie gebruik word, is dit moontlik om itemkarakteristiekekrommes vir dieselfde item vir verskillende subgroepe voor te stel. Hierdie krommes dui aan hoe elke item vir die verskillende subgroepe op verskillende vermoënsvlakke te funksioneer. DIF word aangetoon deur die area tussen die krommes. DIF is in die konstruksie van die 'Learning Potential Computerised Adaptive test (LPCAT' gebruik om die items te identifiseer wat sydigheid ten opsigte van geslag, kultuur, taal of opleidingspeil geopenbaar het. Items wat ’n voorafbepaalde vlak van DIF oorskry het, is uit die finale itembank weggelaat, ongeag die subgroep wat bevoordeel of benadeel is. Die proses en resultate van die DIF-ontleding word bespreek.

  11. Standard Errors for National Trends in International Large-Scale Assessments in the Case of Cross-National Differential Item Functioning

    Science.gov (United States)

    Sachse, Karoline A.; Haag, Nicole

    2017-01-01

    Standard errors computed according to the operational practices of international large-scale assessment studies such as the Programme for International Student Assessment's (PISA) or the Trends in International Mathematics and Science Study (TIMSS) may be biased when cross-national differential item functioning (DIF) and item parameter drift are…

  12. Statistical Bias in Maximum Likelihood Estimators of Item Parameters.

    Science.gov (United States)

    1982-04-01

    34 a> E r’r~e r ,C Ie I# ne,..,.rVi rnd Id.,flfv b1 - bindk numb.r) I; ,t-i i-cd I ’ tiie bias in the maximum likelihood ,st i- i;, ’ t iIeiIrs in...NTC, IL 60088 Psychometric Laboratory University of North Carolina I ERIC Facility-Acquisitions Davie Hall 013A 4833 Rugby Avenue Chapel Hill, NC

  13. Robust Scale Transformation Methods in IRT True Score Equating under Common-Item Nonequivalent Groups Design

    Science.gov (United States)

    He, Yong

    2013-01-01

    Common test items play an important role in equating multiple test forms under the common-item nonequivalent groups design. Inconsistent item parameter estimates among common items can lead to large bias in equated scores for IRT true score equating. Current methods extensively focus on detection and elimination of outlying common items, which…

  14. Identifying Sources of Bias in the WISC-R.

    Science.gov (United States)

    Vance, Booney; Sabatino, David

    1991-01-01

    The issues of construct validity, predictive validity, and item content bias on the Wechsler Intelligence Scale for Children-Revised (WISC-R) are examined. The review concludes that most objective data have not supported the issue of bias of the WISC-R when used with children of different ethnic backgrounds. (JDD)

  15. Cognitive Reflection, Decision Biases, and Response Times.

    Science.gov (United States)

    Alós-Ferrer, Carlos; Garagnani, Michele; Hügelschäfer, Sabine

    2016-01-01

    We present novel evidence on response times and personality traits in standard questions from the decision-making literature where responses are relatively slow (medians around half a minute or above). To this end, we measured response times in a number of incentivized, framed items (decisions from description) including the Cognitive Reflection Test, two additional questions following the same logic, and a number of classic questions used to study decision biases in probability judgments (base-rate neglect, the conjunction fallacy, and the ratio bias). All questions create a conflict between an intuitive process and more deliberative thinking. For each item, we then created a non-conflict version by either making the intuitive impulse correct (resulting in an alignment question), shutting it down (creating a neutral question), or making it dominant (creating a heuristic question). For CRT questions, the differences in response times are as predicted by dual-process theories, with alignment and heuristic variants leading to faster responses and neutral questions to slower responses than the original, conflict questions. For decision biases (where responses are slower), evidence is mixed. To explore the possible influence of personality factors on both choices and response times, we used standard personality scales including the Rational-Experiential Inventory and the Big Five, and used them as controls in regression analysis.

  16. Does item overlap render measured relationships between pain and challenging behaviour trivial? Results from a multicentre cross-sectional study in 13 German nursing homes.

    Science.gov (United States)

    Kutschar, Patrick; Bauer, Zsuzsa; Gnass, Irmela; Osterbrink, Jürgen

    2017-07-01

    Several studies suggest that pain is a trigger for challenging behaviour in older adults with cognitive impairment. However, such measured relationships might be confounded due to item overlap as instruments share similar or identical items. The purpose of this study was to examine whether the frequently observed association between pain and challenging behaviour might be traced back to item overlap. This multicentre cross-sectional study was conducted in 13 nursing homes and examined pain (measure: Pain Assessment in Advanced Dementia Scale) and challenging behaviour (measure: Cohen-Mansfield Agitation Inventory) in 150 residents with severe cognitive impairment. The extent of item overlap was determined by juxtaposition of both measures' original items. As expected, comparison between these instruments revealed an extensive item overlap. The statistical relationship between the two phenomena can be traced back mainly to the contribution of the overlapping items, which renders the frequently stated relationship between pain and challenging behaviour trivial. The status quo of measuring such associations must be contested: constructs' discrimination and instruments' discrimination have to be discussed critically as item overlap may lead to biased conclusions and assumptions in research as well as to inadequate care measures in nursing practice. © 2017 John Wiley & Sons Ltd.

  17. Relation between education and dementia: the role of test bias revisited

    NARCIS (Netherlands)

    Schmand, B.; Lindeboom, J.; Hooijer, C.; Jonker, C.

    1995-01-01

    Several authors have suggested that dementia screening tests may be biased against low levels of education, whereas others find that a low level of education is a genuine risk factor for dementia. The present paper attempts to reconcile these conflicting views by examining item bias and test bias

  18. Gender-based Differential Item Functioning in the Application of the Theory of Planned Behavior for the Study of Entrepreneurial Intentions

    Science.gov (United States)

    Zampetakis, Leonidas A.; Bakatsaki, Maria; Litos, Charalambos; Kafetsios, Konstantinos G.; Moustakis, Vassilis

    2017-01-01

    Over the past years the percentage of female entrepreneurs has increased, yet it is still far below of that for males. Although various attempts have been made to explain differences in mens’ and women’s entrepreneurial attitudes and intentions, the extent to which those differences are due to self-report biases has not been yet considered. The present study utilized Differential Item Functioning (DIF) to compare men and women’s reporting on entrepreneurial intentions. DIF occurs in situations where members of different groups show differing probabilities of endorsing an item despite possessing the same level of the ability that the item is intended to measure. Drawing on the theory of planned behavior (TPB), the present study investigated whether constructs such as entrepreneurial attitudes, perceived behavioral control, subjective norms and intention would show gender differences and whether these gender differences could be explained by DIF. Using DIF methods on a dataset of 1800 Greek participants (50.4% female) indicated that differences at the item-level are almost non-existent. Moreover, the differential test functioning (DTF) analysis, which allows assessing the overall impact of DIF effects with all items being taken into account simultaneously, suggested that the effect of DIF across all the items for each scale was negligible. Future research should consider that measurement invariance can be assumed when using TPB constructs for the study of entrepreneurial motivation independent of gender. PMID:28386244

  19. Cognitive Reflection, Decision Biases, and Response Times

    Directory of Open Access Journals (Sweden)

    Carlos Alos-Ferrer

    2016-09-01

    Full Text Available We present novel evidence on decision times and personality traits in standard questions from the decision-making literature where responses are relatively slow (medians around half a minute or above. To this end, we measured decision times in a number of incentivized, framed items (decisions from description including the Cognitive Reflection Test, two additional questions following the same logic, and a number of classic questions used to study decision biases in probability judgments (base-rate neglect, the conjunction fallacy, and the ratio bias. All questions create a conflict between an intuitive process and more deliberative thinking. For each item, we then created a non-conflict version by either making the intuitive impulse correct (resulting in an alignment question, shutting it down (creating a neutral question, or making it dominant (creating a heuristic question. For CRT questions, the differences in decision times are as predicted by dual-process theories, with alignment and heuristic variants leading to faster responses and neutral questions to slower responses than the original, conflict questions. For decision biases (where responses are slower, evidence is mixed. To explore the possible influence of personality factors on both choices and decision times, we used standard personality scales including the Rational-Experiential Inventory and the Big Five, and used the mas controls in regression analysis.

  20. Assessment of Differential Item Functioning in the Experiences of Discrimination Index

    Science.gov (United States)

    Cunningham, Timothy J.; Berkman, Lisa F.; Gortmaker, Steven L.; Kiefe, Catarina I.; Jacobs, David R.; Seeman, Teresa E.; Kawachi, Ichiro

    2011-01-01

    The psychometric properties of instruments used to measure self-reported experiences of discrimination in epidemiologic studies are rarely assessed, especially regarding construct validity. The authors used 2000–2001 data from the Coronary Artery Risk Development in Young Adults (CARDIA) Study to examine differential item functioning (DIF) in 2 versions of the Experiences of Discrimination (EOD) Index, an index measuring self-reported experiences of racial/ethnic and gender discrimination. DIF may confound interpretation of subgroup differences. Large DIF was observed for 2 of 7 racial/ethnic discrimination items: White participants reported more racial/ethnic discrimination for the “at school” item, and black participants reported more racial/ethnic discrimination for the “getting housing” item. The large DIF by race/ethnicity in the index for racial/ethnic discrimination probably reflects item impact and is the result of valid group differences between blacks and whites regarding their respective experiences of discrimination. The authors also observed large DIF by race/ethnicity for 3 of 7 gender discrimination items. This is more likely to have been due to item bias. Users of the EOD Index must consider the advantages and disadvantages of DIF adjustment (omitting items, constructing separate measures, and retaining items). The EOD Index has substantial usefulness as an instrument that can assess self-reported experiences of discrimination. PMID:22038104

  1. Developing and testing items for the South African Personality Inventory (SAPI

    Directory of Open Access Journals (Sweden)

    Carin Hill

    2013-11-01

    Research purpose: This article reports on the process of identifying items for, and provides a quantitative evaluation of, the South African Personality Inventory (SAPI items. Motivation for the study: The study intended to develop an indigenous and psychometrically sound personality instrument that adheres to the requirements of South African legislation and excludes cultural bias. Research design, approach and method: The authors used a cross-sectional design. They measured the nine SAPI clusters identified in the qualitative stage of the SAPI project in 11 separate quantitative studies. Convenience sampling yielded 6735 participants. Statistical analysis focused on the construct validity and reliability of items. The authors eliminated items that showed poor performance, based on common psychometric criteria, and selected the best performing items to form part of the final version of the SAPI. Main findings: The authors developed 2573 items from the nine SAPI clusters. Of these, 2268 items were valid and reliable representations of the SAPI facets. Practical/managerial implications: The authors developed a large item pool. It measures personality in South Africa. Researchers can refine it for the SAPI. Furthermore, the project illustrates an approach that researchers can use in projects that aim to develop culturally-informed psychological measures. Contribution/value-add: Personality assessment is important for recruiting, selecting and developing employees. This study contributes to the current knowledge about the early processes researchers follow when they develop a personality instrument that measures personality fairly in different cultural groups, as the SAPI does.

  2. A study on investors’ personality characteristics and behavioral biases: Conservatism bias and availability bias in the Tehran Stock Exchange

    Directory of Open Access Journals (Sweden)

    Mahmoud Moradi

    2013-04-01

    Full Text Available Most economic and finance theories are based on the assumption that during economic decision making, people would act totally rational and consider all available information. Nevertheless, behavioral finance focuses on studying of the role of psychological factors on economic participants’ behavior. The study shows that in real-world environment, people are influenced by emotional and cognitive errors and may make irrational financial decisions. In many cases, the participants of financial markets are not aware of their talents for error in decision making, so they are dissatisfied with their investments by considering some behavioral biases decisions. These decisions may often yield undesirable outcomes, which could influence economy, significantly. This paper presents a survey on the relationship between personality dimensions with behavioral biases and availability bias among investment managers in the Tehran Stock Exchange using SPSS software, descriptive and inferential statistics. The necessary data are collected through questionnaire and they are analyzed using some statistical tests. The preliminary results indicate that there is a relationship between personality dimensions and behavioral biases like conservatism bias and availability bias among the investors in the Tehran Stock Exchange.

  3. Schema bias in source monitoring varies with encoding conditions: support for a probability-matching account.

    Science.gov (United States)

    Kuhlmann, Beatrice G; Vaterrodt, Bianca; Bayen, Ute J

    2012-09-01

    Two experiments examined reliance on schematic knowledge in source monitoring. Based on a probability-matching account of source guessing, a schema bias will only emerge if participants do not have a representation of the source-item contingency in the study list, or if the perceived contingency is consistent with schematic expectations. Thus, the account predicts that encoding conditions that affect contingency detection also affect schema bias. In Experiment 1, the schema bias commonly found when schematic information about the sources is not provided before encoding was diminished by an intentional source-memory instruction. In Experiment 2, the depth of processing of schema-consistent and schema-inconsistent source-item pairings was manipulated. Participants consequently overestimated the occurrence of the pairing type they processed in a deep manner, and their source guessing reflected this biased contingency perception. Results support the probability-matching account of source guessing. PsycINFO Database Record (c) 2012 APA, all rights reserved.

  4. Bias and equivalence of the Strengths Use and Deficit Correction Questionnaire

    Directory of Open Access Journals (Sweden)

    Crizelle Els

    2016-11-01

    Full Text Available Orientation: For optimal outcomes, it is suggested that employees receive support from their organisation to use their strengths and improve their deficits. Employees also engage in proactive behaviour to use their strengths and improve their deficits. Following this conversation, the Strengths Use and Deficit Correction Questionnaire (SUDCO was developed. However, the cultural suitability of the SUDCO has not been confirmed. Research purpose: The purpose of this study was to examine the bias and structural equivalence of the SUDCO. Motivation for the study: In a diverse cultural context such as South Africa, it is important to establish that a similar score on a psychological test has the same psychological meaning across ethnic groups. Research design, approach and method: A cross-sectional survey design was followed to collect data among a convenience sample of 858 employees from various occupational sectors in South Africa. Main findings: Confirmatory multigroup analysis was used to test for item and construct bias. None of the items were biased, neither uniform nor non-uniform. The most restrictive model accounted for similarities in weights, intercepts and means; only residuals were different. Practical/managerial implications: The results suggest that the SUDCO is suitable for use among the major ethnic groups included in this study. These results increase the probability that future studies with the SUDCO among other ethnic groups will be unbiased and equivalent. Contribution: This study contributed to existing literature because no previous research has assessed the bias and equivalence of the SUDCO among ethnic groups in South Africa.

  5. A New Online Calibration Method Based on Lord's Bias-Correction.

    Science.gov (United States)

    He, Yinhong; Chen, Ping; Li, Yong; Zhang, Shumei

    2017-09-01

    Online calibration technique has been widely employed to calibrate new items due to its advantages. Method A is the simplest online calibration method and has attracted many attentions from researchers recently. However, a key assumption of Method A is that it treats person-parameter estimates θ ^ s (obtained by maximum likelihood estimation [MLE]) as their true values θ s , thus the deviation of the estimated θ ^ s from their true values might yield inaccurate item calibration when the deviation is nonignorable. To improve the performance of Method A, a new method, MLE-LBCI-Method A, is proposed. This new method combines a modified Lord's bias-correction method (named as maximum likelihood estimation-Lord's bias-correction with iteration [MLE-LBCI]) with the original Method A in an effort to correct the deviation of θ ^ s which may adversely affect the item calibration precision. Two simulation studies were carried out to explore the performance of both MLE-LBCI and MLE-LBCI-Method A under several scenarios. Simulation results showed that MLE-LBCI could make a significant improvement over the ML ability estimates, and MLE-LBCI-Method A did outperform Method A in almost all experimental conditions.

  6. Item Response Theory Applied to Factors Affecting the Patient Journey Towards Hearing Rehabilitation

    Science.gov (United States)

    Chenault, Michelene; Berger, Martijn; Kremer, Bernd; Anteunis, Lucien

    2016-01-01

    To develop a tool for use in hearing screening and to evaluate the patient journey towards hearing rehabilitation, responses to the hearing aid rehabilitation questionnaire scales aid stigma, pressure, and aid unwanted addressing respectively hearing aid stigma, experienced pressure from others; perceived hearing aid benefit were evaluated with item response theory. The sample was comprised of 212 persons aged 55 years or more; 63 were hearing aid users, 64 with and 85 persons without hearing impairment according to guidelines for hearing aid reimbursement in the Netherlands. Bias was investigated relative to hearing aid use and hearing impairment within the differential test functioning framework. Items compromising model fit or demonstrating differential item functioning were dropped. The aid stigma scale was reduced from 6 to 4, the pressure scale from 7 to 4, and the aid unwanted scale from 5 to 4 items. This procedure resulted in bias-free scales ready for screening purposes and application to further understand the help-seeking process of the hearing impaired. PMID:28028428

  7. The cultural fairness of the 12-item General Health Questionnaire among diverse adolescents.

    Science.gov (United States)

    Bowe, Anica

    2017-01-01

    The 12-item general health questionnaire (GHQ-12) was used in the Longitudinal Study of Young People in England (LSYPE; N = 15,770) to collect measures on adolescent mental health. Given the debate in current literature regarding the dimensionality of the GHQ-12, this study examined the cultural sensitivity of the instrument at the item level for each of the 7 major ethnic groups within the database. This study used a hybrid approach of ordinal logistic regression and item response theory (IRT) to examine the presence of differential item functioning (DIF) on the questionnaire. Results demonstrated that uniform, nonuniform, and overall DIF were present on items between White and Asian adolescents (7 items), White and Black Caribbean adolescents (1 item), and White and Black African adolescents (7 items), however all McFadden's pseudo R² effect size estimates indicated that the DIF was negligible. Overall, there were cumulative small scale level effects for the Mixed/Biracial, Asian, and Black African groups, but in each case the bias was only marginal. Findings demonstrate that the GHQ-12 can be considered culturally sensitive for adolescents from diverse ethnic groups in England, but follow-up studies are necessary. Implications for future education and health policies as well as the use of IR-based approaches for psychological instruments are discussed. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  8. Item-focussed Trees for the Identification of Items in Differential Item Functioning.

    Science.gov (United States)

    Tutz, Gerhard; Berger, Moritz

    2016-09-01

    A novel method for the identification of differential item functioning (DIF) by means of recursive partitioning techniques is proposed. We assume an extension of the Rasch model that allows for DIF being induced by an arbitrary number of covariates for each item. Recursive partitioning on the item level results in one tree for each item and leads to simultaneous selection of items and variables that induce DIF. For each item, it is possible to detect groups of subjects with different item difficulties, defined by combinations of characteristics that are not pre-specified. The way a DIF item is determined by covariates is visualized in a small tree and therefore easily accessible. An algorithm is proposed that is based on permutation tests. Various simulation studies, including the comparison with traditional approaches to identify items with DIF, show the applicability and the competitive performance of the method. Two applications illustrate the usefulness and the advantages of the new method.

  9. Weight Bias: A Systematic Review of Characteristics and Psychometric Properties of Self-Report Questionnaires.

    Science.gov (United States)

    Lacroix, Emilie; Alberga, Angela; Russell-Mathew, Shelly; McLaren, Lindsay; von Ranson, Kristin

    2017-01-01

    People living with overweight and obesity often experience weight-based stigmatization. Investigations of the prevalence and correlates of weight bias and evaluation of weight bias reduction interventions depend upon psychometrically-sound measurement. Our paper is the first to comprehensively evaluate the psychometric properties, use of people-first language within items, and suitability for use with various populations of available self-report measures of weight bias. We searched five electronic databases to identify English-language self-report questionnaires of weight bias. We rated each questionnaire's psychometric properties based on initial validation reports and subsequent use, and examined item language. Our systematic review identified 40 original self-report questionnaires. Most questionnaires were brief, demonstrated adequate internal consistency, and tapped key cognitive and affective dimensions of weight bias such as stereotypes and blaming. Current psychometric evidence is incomplete for many questionnaires, particularly with regard to the properties of test-retest reliability, sensitivity to change as well as discriminant and structural validity. Most questionnaires were developed prior to debate surrounding terminology preferences, and do not employ people-first language in the items administered to participants. We provide information and recommendations for clinicians and researchers in selecting psychometrically sound measures of weight bias for various purposes and populations, and discuss future directions to improve measurement of this construct. © 2017 The Author(s) Published by S. Karger GmbH, Freiburg.

  10. Differential Item Functioning (DIF) among Spanish-Speaking English Language Learners (ELLs) in State Science Tests

    Science.gov (United States)

    Ilich, Maria O.

    Psychometricians and test developers evaluate standardized tests for potential bias against groups of test-takers by using differential item functioning (DIF). English language learners (ELLs) are a diverse group of students whose native language is not English. While they are still learning the English language, they must take their standardized tests for their school subjects, including science, in English. In this study, linguistic complexity was examined as a possible source of DIF that may result in test scores that confound science knowledge with a lack of English proficiency among ELLs. Two years of fifth-grade state science tests were analyzed for evidence of DIF using two DIF methods, Simultaneous Item Bias Test (SIBTest) and logistic regression. The tests presented a unique challenge in that the test items were grouped together into testlets---groups of items referring to a scientific scenario to measure knowledge of different science content or skills. Very large samples of 10, 256 students in 2006 and 13,571 students in 2007 were examined. Half of each sample was composed of Spanish-speaking ELLs; the balance was comprised of native English speakers. The two DIF methods were in agreement about the items that favored non-ELLs and the items that favored ELLs. Logistic regression effect sizes were all negligible, while SIBTest flagged items with low to high DIF. A decrease in socioeconomic status and Spanish-speaking ELL diversity may have led to inconsistent SIBTest effect sizes for items used in both testing years. The DIF results for the testlets suggested that ELLs lacked sufficient opportunity to learn science content. The DIF results further suggest that those constructed response test items requiring the student to draw a conclusion about a scientific investigation or to plan a new investigation tended to favor ELLs.

  11. The Battle over Studies of Faculty Bias

    Science.gov (United States)

    Gravois, John

    2007-01-01

    The American Federation of Teachers (AFT) recently commissioned a study to review the research that finds liberal bias run amok in academe. Believing that the AFT is not a dispassionate observer of this debate, this article provides "The Chronicle of Higher Education's" survey of the genre. The studies reviewed include: (1) "Political Bias in the…

  12. Placebo effect studies are susceptible to response bias and to other types of biases

    DEFF Research Database (Denmark)

    Hróbjartsson, Asbjørn; Kaptchuk, Ted J; Miller, Franklin G

    2011-01-01

    Investigations of the effect of placebo are often challenging to conduct and interpret. The history of placebo shows that assessment of its clinical significance has a real potential to be biased. We analyze and discuss typical types of bias in studies on placebo....

  13. Psychometric properties of the Triarchic Psychopathy Measure: An item response theory approach.

    Science.gov (United States)

    Shou, Yiyun; Sellbom, Martin; Xu, Jing

    2018-05-01

    There is cumulative evidence for the cross-cultural validity of the Triarchic Psychopathy Measure (TriPM; Patrick, 2010) among non-Western populations. Recent studies using correlational and regression analyses show promising construct validity of the TriPM in Chinese samples. However, little is known about the efficiency of items in TriPM in assessing the proposed latent traits. The current study evaluated the psychometric properties of the Chinese TriPM at the item level using item response theory analyses. It also examined the measurement invariance of the TriPM between the Chinese and the U.S. student samples by applying differential item functioning analyses under the item response theory framework. The results supported the unidimensional nature of the Disinhibition and Meanness scales. Both scales had a greater level of precision in the respective underlying constructs at the positive ends. The two scales, however, had several items that were weakly associated with their respective latent traits in the Chinese student sample. Boldness, on the other hand, was found to be multidimensional, and reflected a more normally distributed range of variation. The examination of measurement bias via differential item functioning analyses revealed that a number of items of the TriPM were not equivalent across the Chinese and the U.S. Some modification and adaptation of items might be considered for improving the precision of the TriPM for Chinese participants. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  14. Spatial attention can bias search in visual short-term memory

    Directory of Open Access Journals (Sweden)

    Anna C Nobre

    2008-03-01

    Full Text Available Whereas top-down attentional control is known to bias perceptual functions at many levels of stimulus analysis, its possible influence over memory-related functions remains uncharted. Our experiment combined behavioral measures and event-related potentials (ERPs to test the ability of spatial orienting to bias functions associated with visual short-term memory (VSTM, and to shed light on the neural mechanisms involved. In particular, we investigated whether orienting attention to a spatial location within an array maintained in VSTM could facilitate the search for a specific remembered item. Participants viewed arrays of one, two or four differently colored items, followed by an informative spatial (100% valid or uninformative neutral retro-cue (1500–2500 ms after the array, and later by a probe stimulus (500–1000 ms after the retro-cue. The task was to decide whether the probe stimulus had been present in the array. Behavioral results showed that spatial retro-cues improved both accuracy and response times for making decisions about the presence of the probe item in VSTM, and significantly attenuated performance decrements caused by increasing VSTM load. We also identified a novel ERP component (N3RS specifically associated with searching for an item within VSTM. Paralleling the behavioral results, the amplitude and duration of the N3RS systematically increased with VSTM load in neutral retro-cue trials. When spatial retro-cues were provided, this “retro-search” component was absent. Our findings clearly show that the infl uence of top-down attentional biases extends to mnemonic functions, and, specifically, that searching for items within VSTM can be under flexible voluntary control.

  15. A Case Study on an Item Writing Process: Use of Test Specifications, Nature of Group Dynamics, and Individual Item Writers' Characteristics

    Science.gov (United States)

    Kim, Jiyoung; Chi, Youngshin; Huensch, Amanda; Jun, Heesung; Li, Hongli; Roullion, Vanessa

    2010-01-01

    This article discusses a case study on an item writing process that reflects on our practical experience in an item development project. The purpose of the article is to share our lessons from the experience aiming to demystify item writing process. The study investigated three issues that naturally emerged during the project: how item writers use…

  16. Item response theory applied to factors affecting the patient journey towards hearing rehabilitation

    Directory of Open Access Journals (Sweden)

    Michelene Chenault

    2016-11-01

    Full Text Available To develop a tool for use in hearing screening and to evaluate the patient journey towards hearing rehabilitation, responses to the hearing aid rehabilitation questionnaire scales aid stigma, pressure, and aid unwanted addressing respectively hearing aid stigma, experienced pressure from others; perceived hearing aid benefit were evaluated with item response theory. The sample was comprised of 212 persons aged 55 years or more; 63 were hearing aid users, 64 with and 85 persons without hearing impairment according to guidelines for hearing aid reimbursement in the Netherlands. Bias was investigated relative to hearing aid use and hearing impairment within the differential test functioning framework. Items compromising model fit or demonstrating differential item functioning were dropped. The aid stigma scale was reduced from 6 to 4, the pressure scale from 7 to 4, and the aid unwanted scale from 5 to 4 items. This procedure resulted in bias-free scales ready for screening purposes and application to further understand the help-seeking process of the hearing impaired.

  17. A note on exponential dispersion models which are invariant under length-biased sampling

    NARCIS (Netherlands)

    Bar-Lev, S.K.; van der Duyn Schouten, F.A.

    2003-01-01

    Length-biased sampling situations may occur in clinical trials, reliability, queueing models, survival analysis and population studies where a proper sampling frame is absent.In such situations items are sampled at rate proportional to their length so that larger values of the quantity being

  18. Abstract analysis method facilitates filtering low-methodological quality and high-bias risk systematic reviews on psoriasis interventions.

    Science.gov (United States)

    Gómez-García, Francisco; Ruano, Juan; Aguilar-Luque, Macarena; Alcalde-Mellado, Patricia; Gay-Mimbrera, Jesús; Hernández-Romero, José Luis; Sanz-Cabanillas, Juan Luis; Maestre-López, Beatriz; González-Padilla, Marcelino; Carmona-Fernández, Pedro J; García-Nieto, Antonio Vélez; Isla-Tejera, Beatriz

    2017-12-29

    Article summaries' information and structure may influence researchers/clinicians' decisions to conduct deeper full-text analyses. Specifically, abstracts of systematic reviews (SRs) and meta-analyses (MA) should provide structured summaries for quick assessment. This study explored a method for determining the methodological quality and bias risk of full-text reviews using abstract information alone. Systematic literature searches for SRs and/or MA about psoriasis were undertaken on MEDLINE, EMBASE, and Cochrane database. For each review, quality, abstract-reporting completeness, full-text methodological quality, and bias risk were evaluated using Preferred Reporting Items for Systematic Reviews and Meta-analyses for abstracts (PRISMA-A), Assessing the Methodological Quality of Systematic Reviews (AMSTAR), and ROBIS tools, respectively. Article-, author-, and journal-derived metadata were systematically extracted from eligible studies using a piloted template, and explanatory variables concerning abstract-reporting quality were assessed using univariate and multivariate-regression models. Two classification models concerning SRs' methodological quality and bias risk were developed based on per-item and total PRISMA-A scores and decision-tree algorithms. This work was supported, in part, by project ICI1400136 (JR). No funding was received from any pharmaceutical company. This study analysed 139 SRs on psoriasis interventions. On average, they featured 56.7% of PRISMA-A items. The mean total PRISMA-A score was significantly higher for high-methodological-quality SRs than for moderate- and low-methodological-quality reviews. SRs with low-bias risk showed higher total PRISMA-A values than reviews with high-bias risk. In the final model, only 'authors per review > 6' (OR: 1.098; 95%CI: 1.012-1.194), 'academic source of funding' (OR: 3.630; 95%CI: 1.788-7.542), and 'PRISMA-endorsed journal' (OR: 4.370; 95%CI: 1.785-10.98) predicted PRISMA-A variability. Reviews with a

  19. A working memory bias for alcohol-related stimuli depends on drinking score.

    Science.gov (United States)

    Kessler, Klaus; Pajak, Katarzyna Malgorzata; Harkin, Ben; Jones, Barry

    2013-03-01

    We tested 44 participants with respect to their working memory (WM) performance on alcohol-related versus neutral visual stimuli. Previously an alcohol attentional bias (AAB) had been reported using these stimuli, where the attention of frequent drinkers was automatically drawn toward alcohol-related items (e.g., beer bottle). The present study set out to provide evidence for an alcohol memory bias (AMB) that would persist over longer time-scales than the AAB. The WM task we used required memorizing 4 stimuli in their correct locations and a visual interference task was administered during a 4-sec delay interval. A subsequent probe required participants to indicate whether a stimulus was shown in the correct or incorrect location. For each participant we calculated a drinking score based on 3 items derived from the Alcohol Use Questionnaire, and we observed that higher scorers better remembered alcohol-related images compared with lower scorers, particularly when these were presented in their correct locations upon recall. This provides first evidence for an AMB. It is important to highlight that this effect persisted over a 4-sec delay period including a visual interference task that erased iconic memories and diverted attention away from the encoded items, thus the AMB cannot be reduced to the previously reported AAB. Our finding calls for further investigation of alcohol-related cognitive biases in WM, and we propose a preliminary model that may guide future research. (PsycINFO Database Record (c) 2013 APA, all rights reserved).

  20. Affective bias in visual working memory is associated with capacity.

    Science.gov (United States)

    Xie, Weizhen; Li, Huanhuan; Ying, Xiangyu; Zhu, Shiyou; Fu, Rong; Zou, Yingmin; Cui, Yanyan

    2017-11-01

    How does the affective nature of task stimuli modulate working memory (WM)? This study investigates whether WM maintains emotional information in a biased manner to meet the motivational principle of approaching positivity and avoiding negativity by retaining more approach-related positive content over avoidance-related negative content. This bias may exist regardless of individual differences in WM functionality, as indexed by WM capacity (overall bias hypothesis). Alternatively, this bias may be contingent on WM capacity (capacity-based hypothesis), in which a better WM system may be more likely to reveal an adaptive bias. In two experiments, participants performed change localisation tasks with emotional and non-emotional stimuli to estimate the number of items that they could retain for each of those stimuli. Although participants did not seem to remember one type of emotional content (e.g. happy faces) better than the other type of emotional content (e.g. sad faces), there was a significant correlation between WM capacity and affective bias. Specifically, participants with higher WM capacity for non-emotional stimuli (colours or line-drawing symbols) tended to maintain more happy faces over sad faces. These findings demonstrated the presence of a "built-in" affective bias in WM as a function of its systematic limitations, favouring the capacity-based hypothesis.

  1. Translation Fidelity of Psychological Scales: An Item Response Theory Analysis of an Individualism-Collectivism Scale.

    Science.gov (United States)

    Bontempo, Robert

    1993-01-01

    Describes a method for assessing the quality of translations based on item response theory (IRT). Results from the IRT technique with French and Chinese versions of a scale measuring individualism-collectivism for samples of 250 U.S., 357 French, and 290 Chinese undergraduates show how several biased items are detected. (SLD)

  2. Referral bias in ALS epidemiological studies.

    Science.gov (United States)

    Logroscino, Giancarlo; Marin, Benoit; Piccininni, Marco; Arcuti, Simona; Chiò, Adriano; Hardiman, Orla; Rooney, James; Zoccolella, Stefano; Couratier, Philippe; Preux, Pierre-Marie; Beghi, Ettore

    2018-01-01

    Despite concerns about the representativeness of patients from ALS tertiary centers as compared to the ALS general population, the extent of referral bias in clinical studies remains largely unknown. Using data from EURALS consortium we aimed to assess nature, extent and impact of referral bias. Four European ALS population-based registries located in Ireland, Piedmont, Puglia, Italy, and Limousin, France, covering 50 million person-years, participated. Demographic and clinic characteristics of ALS patients diagnosed in tertiary referral centers were contrasted with the whole ALS populations enrolled in registries in the same geographical areas. Patients referred to ALS centers were younger (with difference ranging from 1.1 years to 2.4 years), less likely to present a bulbar onset, with a higher proportion of familial antecedents and a longer survival (ranging from 11% to 15%) when compared to the entire ALS population in the same geographic area. A trend for referral bias is present in cohorts drawn from ALS referral centers. The magnitude of the possible referral bias in a particular tertiary center can be estimated through a comparison with ALS patients drawn from registry in the same geographic area. Studies based on clinical cohorts should be cautiously interpreted. The presence of a registry in the same area may improve the complete ascertainment in the referral center.

  3. Presence of bias in radiographer plain film reading performance studies

    International Nuclear Information System (INIS)

    Brealey, S.; Scally, A.J.; Thomas, N.B.

    2002-01-01

    Purpose To raise awareness of the frequency of bias that can affect the quality of radiographer plain film reading performance studies. Methods Studies that assessed radiographer(s) plain film reading performance were located by searching electronic databases and grey literature, hand-searching journals, personal communication and scanning reference lists. Thirty studies were judged eligible from all data sources. Results A one-way analysis of variance (ANOVA) demonstrates no statistically significant difference (P=0.25) in the mean proportion of biases present from diagnostic accuracy (0.37), performance (0.42) and outcome (0.44) study designs. Pearson's correlation coefficient showed no statistically significant linear association between the proportion of biases present for the three different study designs and the year that the study was performed. The frequency of biases in film and observer selection and application of the reference standard was quite low. In contrast, many biases were present concerning independence of film reporting and comparison of reports for concordance. Conclusions The findings indicate variation in the presence of bias in radiographer plain film reading performance studies. The careful consideration of bias is an essential component of study quality and hence the validity of the evidence-base used to underpin radiographic reporting policy

  4. Bias-Correction in Vector Autoregressive Models: A Simulation Study

    Directory of Open Access Journals (Sweden)

    Tom Engsted

    2014-03-01

    Full Text Available We analyze the properties of various methods for bias-correcting parameter estimates in both stationary and non-stationary vector autoregressive models. First, we show that two analytical bias formulas from the existing literature are in fact identical. Next, based on a detailed simulation study, we show that when the model is stationary this simple bias formula compares very favorably to bootstrap bias-correction, both in terms of bias and mean squared error. In non-stationary models, the analytical bias formula performs noticeably worse than bootstrapping. Both methods yield a notable improvement over ordinary least squares. We pay special attention to the risk of pushing an otherwise stationary model into the non-stationary region of the parameter space when correcting for bias. Finally, we consider a recently proposed reduced-bias weighted least squares estimator, and we find that it compares very favorably in non-stationary models.

  5. Weight bias internalization in treatment-seeking overweight adults: Psychometric validation and associations with self-esteem, body image, and mood symptoms.

    Science.gov (United States)

    Durso, Laura E; Latner, Janet D; Ciao, Anna C

    2016-04-01

    Internalized weight bias has been previously associated with impairments in eating behaviors, body image, and psychological functioning. The present study explored the psychological correlates and psychometric properties of the Weight Bias Internalization Scale (WBIS) among overweight adults enrolled in a behavioral weight loss program. Questionnaires assessing internalized weight bias, anti-fat attitudes, self-esteem, body image concern, and mood symptoms were administered to 90 obese or overweight men and women between the ages of 21 and 73. Reliability statistics suggested revisions to the WBIS. The resulting 9-item scale was shown to be positively associated with body image concern, depressive symptoms, and stress, and negatively associated with self-esteem. Multiple linear regression models demonstrated that WBIS scores were significant and independent predictors of body image concern, self-esteem, and depressive symptoms. These results support the use of the revised 9-item WBIS in treatment-seeking samples as a reliable and valid measure of internalized weight bias. Copyright © 2016. Published by Elsevier Ltd.

  6. Bias-correction in vector autoregressive models: A simulation study

    DEFF Research Database (Denmark)

    Engsted, Tom; Pedersen, Thomas Quistgaard

    We analyze and compare the properties of various methods for bias-correcting parameter estimates in vector autoregressions. First, we show that two analytical bias formulas from the existing literature are in fact identical. Next, based on a detailed simulation study, we show that this simple...... and easy-to-use analytical bias formula compares very favorably to the more standard but also more computer intensive bootstrap bias-correction method, both in terms of bias and mean squared error. Both methods yield a notable improvement over both OLS and a recently proposed WLS estimator. We also...... of pushing an otherwise stationary model into the non-stationary region of the parameter space during the process of correcting for bias....

  7. Reduction of bias in neutron multiplicity assay using a weighted point model

    Energy Technology Data Exchange (ETDEWEB)

    Geist, W. H. (William H.); Krick, M. S. (Merlyn S.); Mayo, D. R. (Douglas R.)

    2004-01-01

    Accurate assay of most common plutonium samples was the development goal for the nondestructive assay technique of neutron multiplicity counting. Over the past 20 years the technique has been proven for relatively pure oxides and small metal items. Unfortunately, the technique results in large biases when assaying large metal items. Limiting assumptions, such as unifoh multiplication, in the point model used to derive the multiplicity equations causes these biases for large dense items. A weighted point model has been developed to overcome some of the limitations in the standard point model. Weighting factors are detemiined from Monte Carlo calculations using the MCNPX code. Monte Carlo calculations give the dependence of the weighting factors on sample mass and geometry, and simulated assays using Monte Carlo give the theoretical accuracy of the weighted-point-model assay. Measured multiplicity data evaluated with both the standard and weighted point models are compared to reference values to give the experimental accuracy of the assay. Initial results show significant promise for the weighted point model in reducing or eliminating biases in the neutron multiplicity assay of metal items. The negative biases observed in the assay of plutonium metal samples are caused by variations in the neutron multiplication for neutrons originating in various locations in the sample. The bias depends on the mass and shape of the sample and depends on the amount and energy distribution of the ({alpha},n) neutrons in the sample. When the standard point model is used, this variable-multiplication bias overestimates the multiplication and alpha values of the sample, and underestimates the plutonium mass. The weighted point model potentially can provide assay accuracy of {approx}2% (1 {sigma}) for cylindrical plutonium metal samples < 4 kg with {alpha} < 1 without knowing the exact shape of the samples, provided that the ({alpha},n) source is uniformly distributed throughout the

  8. Effect of study context on item recollection.

    Science.gov (United States)

    Skinner, Erin I; Fernandes, Myra A

    2010-07-01

    We examined how visual context information provided during encoding, and unrelated to the target word, affected later recollection for words presented alone using a remember-know paradigm. Experiments 1A and 1B showed that participants had better overall memory-specifically, recollection-for words studied with pictures of intact faces than for words studied with pictures of scrambled or inverted faces. Experiment 2 replicated these results and showed that recollection was higher for words studied with pictures of faces than when no image accompanied the study word. In Experiment 3 participants showed equivalent memory for words studied with unique faces as for those studied with a repeatedly presented face. Results suggest that recollection benefits when visual context information high in meaningful content accompanies study words and that this benefit is not related to the uniqueness of the context. We suggest that participants use elaborative processes to integrate item and meaningful contexts into ensemble information, improving subsequent item recollection.

  9. Exchange bias studied with polarized neutron reflectivity

    International Nuclear Information System (INIS)

    Velthuis, S. G. E. te

    2000-01-01

    The role of Polarized Neutron Reflectivity (PNR) for studying natural and synthetic exchange biased systems is illustrated. For a partially oxidized thin film of Co, cycling of the magnetic field causes a considerable reduction of the bias, which the onset of diffuse neutron scattering shows to be due to the loosening of the ferromagnetic domains. On the other hand, PNR measurements of a model exchange bias junction consisting of an n-layered Fe/Cr antiferromagnetic (AF) superlattice coupled with an m-layered Fe/Cr ferromagnetic (F) superlattice confirm the predicted collinear magnetization in the two superlattices. The two magnetized states of the F (along or opposite to the bias field) differ only in the relative orientation of the F and adjacent AF layer. The possibility of reading clearly the magnetic state at the interface pinpoints the commanding role that PNR is having in solving this intriguing problem

  10. Differential Item Functioning in the SF-36 Physical Functioning and Mental Health Sub-Scales: A Population-Based Investigation in the Canadian Multicentre Osteoporosis Study.

    Science.gov (United States)

    Lix, Lisa M; Wu, Xiuyun; Hopman, Wilma; Mayo, Nancy; Sajobi, Tolulope T; Liu, Juxin; Prior, Jerilynn C; Papaioannou, Alexandra; Josse, Robert G; Towheed, Tanveer E; Davison, K Shawn; Sawatzky, Richard

    2016-01-01

    Self-reported health status measures, like the Short Form 36-item Health Survey (SF-36), can provide rich information about the overall health of a population and its components, such as physical, mental, and social health. However, differential item functioning (DIF), which arises when population sub-groups with the same underlying (i.e., latent) level of health have different measured item response probabilities, may compromise the comparability of these measures. The purpose of this study was to test for DIF on the SF-36 physical functioning (PF) and mental health (MH) sub-scale items in a Canadian population-based sample. Study data were from the prospective Canadian Multicentre Osteoporosis Study (CaMos), which collected baseline data in 1996-1997. DIF was tested using a multiple indicators multiple causes (MIMIC) method. Confirmatory factor analysis defined the latent variable measurement model for the item responses and latent variable regression with demographic and health status covariates (i.e., sex, age group, body weight, self-perceived general health) produced estimates of the magnitude of DIF effects. The CaMos cohort consisted of 9423 respondents; 69.4% were female and 51.7% were less than 65 years. Eight of 10 items on the PF sub-scale and four of five items on the MH sub-scale exhibited DIF. Large DIF effects were observed on PF sub-scale items about vigorous and moderate activities, lifting and carrying groceries, walking one block, and bathing or dressing. On the MH sub-scale items, all DIF effects were small or moderate in size. SF-36 PF and MH sub-scale scores were not comparable across population sub-groups defined by demographic and health status variables due to the effects of DIF, although the magnitude of this bias was not large for most items. We recommend testing and adjusting for DIF to ensure comparability of the SF-36 in population-based investigations.

  11. Differential Item Functioning in the SF-36 Physical Functioning and Mental Health Sub-Scales: A Population-Based Investigation in the Canadian Multicentre Osteoporosis Study.

    Directory of Open Access Journals (Sweden)

    Lisa M Lix

    Full Text Available Self-reported health status measures, like the Short Form 36-item Health Survey (SF-36, can provide rich information about the overall health of a population and its components, such as physical, mental, and social health. However, differential item functioning (DIF, which arises when population sub-groups with the same underlying (i.e., latent level of health have different measured item response probabilities, may compromise the comparability of these measures. The purpose of this study was to test for DIF on the SF-36 physical functioning (PF and mental health (MH sub-scale items in a Canadian population-based sample.Study data were from the prospective Canadian Multicentre Osteoporosis Study (CaMos, which collected baseline data in 1996-1997. DIF was tested using a multiple indicators multiple causes (MIMIC method. Confirmatory factor analysis defined the latent variable measurement model for the item responses and latent variable regression with demographic and health status covariates (i.e., sex, age group, body weight, self-perceived general health produced estimates of the magnitude of DIF effects.The CaMos cohort consisted of 9423 respondents; 69.4% were female and 51.7% were less than 65 years. Eight of 10 items on the PF sub-scale and four of five items on the MH sub-scale exhibited DIF. Large DIF effects were observed on PF sub-scale items about vigorous and moderate activities, lifting and carrying groceries, walking one block, and bathing or dressing. On the MH sub-scale items, all DIF effects were small or moderate in size.SF-36 PF and MH sub-scale scores were not comparable across population sub-groups defined by demographic and health status variables due to the effects of DIF, although the magnitude of this bias was not large for most items. We recommend testing and adjusting for DIF to ensure comparability of the SF-36 in population-based investigations.

  12. Bond and Equity Home Bias and Foreign Bias: an International Study

    OpenAIRE

    VanPée, Rosanne; De Moor, Lieven

    2012-01-01

    In this paper we explore tentatively and formally the differences between bond and equity home bias and foreign bias based on one large scale dataset including developed and emerging markets for the period 2001 to 2010. We set the stage by tentatively and formally linking the diversion of bond and equity home bias in OECD countries to the increasing public debt issues under the form of government bonds i.e. the supply-driven argument. Unlike Fidora et al. (2007) we do not find that exchange r...

  13. Reconsidering Cluster Bias in Multilevel Data: A Monte Carlo Comparison of Free and Constrained Baseline Approaches.

    Science.gov (United States)

    Guenole, Nigel

    2018-01-01

    The test for item level cluster bias examines the improvement in model fit that results from freeing an item's between level residual variance from a baseline model with equal within and between level factor loadings and between level residual variances fixed at zero. A potential problem is that this approach may include a misspecified unrestricted model if any non-invariance is present, but the log-likelihood difference test requires that the unrestricted model is correctly specified. A free baseline approach where the unrestricted model includes only the restrictions needed for model identification should lead to better decision accuracy, but no studies have examined this yet. We ran a Monte Carlo study to investigate this issue. When the referent item is unbiased, compared to the free baseline approach, the constrained baseline approach led to similar true positive (power) rates but much higher false positive (Type I error) rates. The free baseline approach should be preferred when the referent indicator is unbiased. When the referent assumption is violated, the false positive rate was unacceptably high for both free and constrained baseline approaches, and the true positive rate was poor regardless of whether the free or constrained baseline approach was used. Neither the free or constrained baseline approach can be recommended when the referent indicator is biased. We recommend paying close attention to ensuring the referent indicator is unbiased in tests of cluster bias. All Mplus input and output files, R, and short Python scripts used to execute this simulation study are uploaded to an open access repository.

  14. Survey Response-Related Biases in Contingent Valuation: Concepts, Remedies, and Empirical Application to Valuing Aquatic Plant Management

    Science.gov (United States)

    Mark L. Messonnier; John C. Bergstrom; Chrisopher M. Cornwell; R. Jeff Teasley; H. Ken Cordell

    2000-01-01

    Simple nonresponse and selection biases that may occur in survey research such as contingent valuation applications are discussed and tested. Correction mechanisms for these types of biases are demonstrated. Results indicate the importance of testing and correcting for unit and item nonresponse bias in contingent valuation survey data. When sample nonresponse and...

  15. Persistent User Bias in Case-Crossover Studies in Pharmacoepidemiology

    DEFF Research Database (Denmark)

    Hallas, Jesper; Pottegård, Anton; Wang, Shirley

    2016-01-01

    Studying the effect of chronic medication exposure by means of a case-crossover design may result in an upward-biased odds ratio. In this study, our aim was to assess the occurrence of this bias and to evaluate whether it is remedied by including a control group (the case-time-control design...... for the retinal detachment controls were similar, leading to near-null case-time-control estimates for all 3 medication classes. For wrist fracture and stroke, the odds ratios were higher for cases than for controls, and case-time-control odds ratios were consistently above unity, thus implying significant...... residual bias. In case-crossover studies of medications, contamination by persistent users confers a moderate bias upward, which is partly remedied by using a control group. The optimal strategy for dealing with this problem is currently unknown....

  16. The Protective Behavioral Strategies for Marijuana Scale: Further examination using item response theory.

    Science.gov (United States)

    Pedersen, Eric R; Huang, Wenjing; Dvorak, Robert D; Prince, Mark A; Hummer, Justin F

    2017-08-01

    Given recent state legislation legalizing marijuana for recreational purposes and majority popular opinion favoring these laws, we developed the Protective Behavioral Strategies for Marijuana scale (PBSM) to identify strategies that may mitigate the harms related to marijuana use among those young people who choose to use the drug. In the current study, we expand on the initial exploratory study of the PBSM to further validate the measure with a large and geographically diverse sample (N = 2,117; 60% women, 30% non-White) of college students from 11 different universities across the United States. We sought to develop a psychometrically sound item bank for the PBSM and to create a short assessment form that minimizes respondent burden and time. Quantitative item analyses, including exploratory and confirmatory factor analyses with item response theory (IRT) and evaluation of differential item functioning (DIF), revealed an item bank of 36 items that was examined for unidimensionality and good content coverage, as well as a short form of 17 items that is free of bias in terms of gender (men vs. women), race (White vs. non-White), ethnicity (Hispanic vs. non-Hispanic), and recreational marijuana use legal status (state recreational marijuana was legal for 25.5% of participants). We also provide a scoring table for easy transformation from sum scores to IRT scale scores. The PBSM item bank and short form associated strongly and negatively with past month marijuana use and consequences. The measure may be useful to researchers and clinicians conducting intervention and prevention programs with young adults. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  17. Study of the Dependency on Magnetic Field and Bias Voltage of an AC-Biased TES Microcalorimeter

    Science.gov (United States)

    Gottardi, L.; Bruijn, M.; denHartog, R.; Hoevers, H.; deKorte, P.; vanderKuur, J.; Linderman, M.; Adams, J.; Bailey, C.; Bandler, S.; hide

    2012-01-01

    At SRON we are studying the performance of a Goddard Space Flight Center single pixel TES microcalorimeter operated in an AC bias configuration. For x-ray photons at 6 keV the pixel shows an x-ray energy resolution Delta E(sub FWHM) = 3.7 eV, which is about a factor 2 worse than the energy resolution observed in an identical DC-biased pixel. In order to better understand the reasons for this discrepancy we characterized the detector as a function of temperature, bias working point and applied perpendicular magnetic field. A strong periodic dependency of the detector noise on the TES AC bias voltage is measured. We discuss the results in the framework of the recently observed weak-link behaviour of a TES microcalorimeter.

  18. Leveraging position bias to improve peer recommendation.

    Directory of Open Access Journals (Sweden)

    Kristina Lerman

    Full Text Available With the advent of social media and peer production, the amount of new online content has grown dramatically. To identify interesting items in the vast stream of new content, providers must rely on peer recommendation to aggregate opinions of their many users. Due to human cognitive biases, the presentation order strongly affects how people allocate attention to the available content. Moreover, we can manipulate attention through the presentation order of items to change the way peer recommendation works. We experimentally evaluate this effect using Amazon Mechanical Turk. We find that different policies for ordering content can steer user attention so as to improve the outcomes of peer recommendation.

  19. Assessing item fit for unidimensional item response theory models using residuals from estimated item response functions.

    Science.gov (United States)

    Haberman, Shelby J; Sinharay, Sandip; Chon, Kyong Hee

    2013-07-01

    Residual analysis (e.g. Hambleton & Swaminathan, Item response theory: principles and applications, Kluwer Academic, Boston, 1985; Hambleton, Swaminathan, & Rogers, Fundamentals of item response theory, Sage, Newbury Park, 1991) is a popular method to assess fit of item response theory (IRT) models. We suggest a form of residual analysis that may be applied to assess item fit for unidimensional IRT models. The residual analysis consists of a comparison of the maximum-likelihood estimate of the item characteristic curve with an alternative ratio estimate of the item characteristic curve. The large sample distribution of the residual is proved to be standardized normal when the IRT model fits the data. We compare the performance of our suggested residual to the standardized residual of Hambleton et al. (Fundamentals of item response theory, Sage, Newbury Park, 1991) in a detailed simulation study. We then calculate our suggested residuals using data from an operational test. The residuals appear to be useful in assessing the item fit for unidimensional IRT models.

  20. A Bifactor Multidimensional Item Response Theory Model for Differential Item Functioning Analysis on Testlet-Based Items

    Science.gov (United States)

    Fukuhara, Hirotaka; Kamata, Akihito

    2011-01-01

    A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into…

  1. Effect of individual thinking styles on item selection during study time allocation.

    Science.gov (United States)

    Jia, Xiaoyu; Li, Weijian; Cao, Liren; Li, Ping; Shi, Meiling; Wang, Jingjing; Cao, Wei; Li, Xinyu

    2018-04-01

    The influence of individual differences on learners' study time allocation has been emphasised in recent studies; however, little is known about the role of individual thinking styles (analytical versus intuitive). In the present study, we explored the influence of individual thinking styles on learners' application of agenda-based and habitual processes when selecting the first item during a study-time allocation task. A 3-item cognitive reflection test (CRT) was used to determine individuals' degree of cognitive reliance on intuitive versus analytical cognitive processing. Significant correlations between CRT scores and the choices of first item selection were observed in both Experiment 1a (study time was 5 seconds per triplet) and Experiment 1b (study time was 20 seconds per triplet). Furthermore, analytical decision makers constructed a value-based agenda (prioritised high-reward items), whereas intuitive decision makers relied more upon habitual responding (selected items from the leftmost of the array). The findings of Experiment 1a were replicated in Experiment 2 notwithstanding ruling out the possible effects from individual intelligence and working memory capacity. Overall, the individual thinking style plays an important role on learners' study time allocation and the predictive ability of CRT is reliable in learners' item selection strategy. © 2016 International Union of Psychological Science.

  2. Considerations about expected a posteriori estimation in adaptive testing: adaptive a priori, adaptive correction for bias, and adaptive integration interval.

    Science.gov (United States)

    Raiche, Gilles; Blais, Jean-Guy

    2009-01-01

    In a computerized adaptive test, we would like to obtain an acceptable precision of the proficiency level estimate using an optimal number of items. Unfortunately, decreasing the number of items is accompanied by a certain degree of bias when the true proficiency level differs significantly from the a priori estimate. The authors suggest that it is possible to reduced the bias, and even the standard error of the estimate, by applying to each provisional estimation one or a combination of the following strategies: adaptive correction for bias proposed by Bock and Mislevy (1982), adaptive a priori estimate, and adaptive integration interval.

  3. Pharmacogenomics Bias - Systematic distortion of study results by genetic heterogeneity

    Directory of Open Access Journals (Sweden)

    Zietemann, Vera

    2008-04-01

    Full Text Available Background: Decision analyses of drug treatments in chronic diseases require modeling the progression of disease and treatment response beyond the time horizon of clinical or epidemiological studies. In many such models, progression and drug effect have been applied uniformly to all patients; heterogeneity in progression, including pharmacogenomic effects, has been ignored. Objective: We sought to systematically evaluate the existence, direction and relative magnitude of a pharmacogenomics bias (PGX-Bias resulting from failure to adjust for genetic heterogeneity in both treatment response (HT and heterogeneity in progression of disease (HP in decision-analytic studies based on clinical study data. Methods: We performed a systematic literature search in electronic databases for studies regarding the effect of genetic heterogeneity on the validity of study results. Included studies have been summarized in evidence tables. In the case of lacking evidence from published studies we sought to perform our own simulation considering both HT and HP. We constructed two simple Markov models with three basic health states (early-stage disease, late-stage disease, dead, one adjusting and the other not adjusting for genetic heterogeneity. Adjustment was done by creating different disease states for presence (G+ and absence (G- of a dichotomous genetic factor. We compared the life expectancy gains attributable to treatment resulting from both models and defined pharmacogenomics bias as percent deviation of treatment-related life expectancy gains in the unadjusted model from those in the adjusted model. We calculated the bias as a function of underlying model parameters to create generic results. We then applied our model to lipid-lowering therapy with pravastatin in patients with coronary atherosclerosis, incorporating the influence of two TaqIB polymorphism variants (B1 and B2 on progression and drug efficacy as reported in the DNA substudy of the REGRESS

  4. Biases in cost measurement for economic evaluation studies in health care.

    Science.gov (United States)

    Jacobs, P; Baladi, J F

    1996-01-01

    This paper addresses the issue of biases in cost measures which used in economic evaluation studies. The basic measure of hospital costs which is used by most investigators is unit cost. Focusing on this measure, a set of criteria which the basic measures must fulfil in order to approximate the marginal cost (MC) of a service for the relevant product, in the representative site, was identified. Then four distinct biases--a scale bias, a case mix bias, a methods bias and a site selection bias--each of which reflects the divergence of the unit cost measure from the desired MC measure, were identified. Measures are proposed for several of these biases and it is suggested how they can be corrected.

  5. Investor’s Commitment Bias and Escalation of Firm’s Investment Decision

    Directory of Open Access Journals (Sweden)

    Anis JARBOUI

    2012-12-01

    Full Text Available This study examines the reasons of perseverance in firm’s investment decision. It shows the possible influence of three closely related features which are: firm’s financial indicators, investor’s risk profile, and investor’s commitment bias, on a firm’s investment decisions escalation. This study aims to provide evidence as to whether investor considers the financial and risk’s perception features (financial strength and risk profile to persevere his initial investment decision while he notes a high level of commitment bias. The proposed model of this paper uses GLM univariate data analyses to examine this relationship. Investor’s risk profile and his commitment bias have been measured by means of a questionnaire comprising several items. As for the selected sample, it has been composed of some 360 Tunisian individual investors. Our results have revealed that investors pay more attention to keep their psychology comfort than their financial comfort. It exposed the importance of the investor’s commitment bias and its risk perception in explaining investment decision escalation. Moreover results shows that there is strong and significant empirical relationship linking the escalatory behavior in investment decision and the interaction effects between the three independent variables. This means that, in practice, investors consider the three factors simultaneously.

  6. Investigation of bias in a study of nuclear shipyard workers

    International Nuclear Information System (INIS)

    Greenberg, E.R.; Rosner, B.; Hennekens, C.; Rinsky, R.; Colton, T.

    1985-01-01

    The authors examined discrepant findings between a 1978 proportional mortality study and a 1981 cohort study of workers at the Portsmouth, New Hampshire Naval Shipyard to determine whether the healthy worker effect, selection bias, or measurement bias could explain why only the proportional mortality study found excess cancer deaths among nuclear workers. Lower mortality from noncancer causes in nuclear workers (the healthy worker effect) partly accounted for the observed elevated cancer proportional mortality. More important, however, was measurement bias which occurred in the proportional mortality study when nuclear workers who had not died of cancer were misclassified as not being nuclear workers based on information from their next of kin, thereby, creating a spurious association. Although the proportional mortality study was based on a small sample of all deaths occuring in the cohort, selection bias did not contribute materially to the discrepant results for total cancer deaths. With regard to leukemia, misclassification of occupation in the proportional mortality study and disagreement about cause of death accounted for some of the reported excess deaths. 16 references, 4 tables

  7. Bias changing molecule–lead couple and inducing low bias negative differential resistance for electrons acceptor predicted by first-principles study

    International Nuclear Information System (INIS)

    Min, Y.; Fang, J.H.; Zhong, C.G.; Dong, Z.C.; Zhao, Z.Y.; Zhou, P.X.; Yao, K.L.

    2015-01-01

    A first-principles study of the transport properties of 3,13-dimercaptononacene–6,21-dione molecule sandwiched between two gold leads is reported. The strong effect of negative differential resistance with large peak-to-valley ratio of 710% is present under low bias. We found that bias can change molecule–lead couple and induce low bias negative differential resistance for electrons acceptor, which may promise the potential applications in molecular devices with low-power dissipation in the future. - Highlights: • Acceptor is constructed to negative differential resistor (NDR). • NDR effect is present under low bias. • Bias change molecule–lead couple and induce NDR effect

  8. Development and validation of an item response theory-based Social Responsiveness Scale short form.

    Science.gov (United States)

    Sturm, Alexandra; Kuhfeld, Megan; Kasari, Connie; McCracken, James T

    2017-09-01

    Research and practice in autism spectrum disorder (ASD) rely on quantitative measures, such as the Social Responsiveness Scale (SRS), for characterization and diagnosis. Like many ASD diagnostic measures, SRS scores are influenced by factors unrelated to ASD core features. This study further interrogates the psychometric properties of the SRS using item response theory (IRT), and demonstrates a strategy to create a psychometrically sound short form by applying IRT results. Social Responsiveness Scale analyses were conducted on a large sample (N = 21,426) of youth from four ASD databases. Items were subjected to item factor analyses and evaluation of item bias by gender, age, expressive language level, behavior problems, and nonverbal IQ. Item selection based on item psychometric properties, DIF analyses, and substantive validity produced a reduced item SRS short form that was unidimensional in structure, highly reliable (α = .96), and free of gender, age, expressive language, behavior problems, and nonverbal IQ influence. The short form also showed strong relationships with established measures of autism symptom severity (ADOS, ADI-R, Vineland). Degree of association between all measures varied as a function of expressive language. Results identified specific SRS items that are more vulnerable to non-ASD-related traits. The resultant 16-item SRS short form may possess superior psychometric properties compared to the original scale and emerge as a more precise measure of ASD core symptom severity, facilitating research and practice. Future research using IRT is needed to further refine existing measures of autism symptomatology. © 2017 Association for Child and Adolescent Mental Health.

  9. Examining Gender Bias in Studies of Innovation

    OpenAIRE

    Crowden, N.

    2003-01-01

    This paper examines the presence of a gender bias in studies of innovation. Using the Innovation Systems Research Network (ISRN) and its interview guide as a case study, this research project examines how accurately and completely such innovation studies present gender differences in the innovation process.

  10. Instemmingsgeneigdheid en verskillende item- en responsformate in 'n gesommeerde selfbeoordelingskaal

    Directory of Open Access Journals (Sweden)

    Nadene Hanekom

    1998-06-01

    Full Text Available This study examines the degree of acquiescence present when the item and response formats of a summated rating scale are varied. It is often recommended that acquiescence response bias in rating scales may be controlled by using both positively and negatively worded items. Such items are generally worded in the Likert-type format of statements. The purpose of the study was to establish whether items in question format would result in a smaller degree of acquiescence than items worded as statements. the response format was also varied (five- and seven-point options to determine whether this would influence the reliability and degree of acquiescence in the scales. A twenty-item Locus of Control (LC questionnaire was used, but each item was complemented by its opposite, resulting in 40 items. The subjects, divided randomly into two groups, were second year students who had to complete four versions of the questionnaire, plus a shortened version of Bass's scale for measuring acquiescence. The LC version were questions or statements each combined with a five- or seven-point respons format. Partial counterbalancing was introduced by testing on two separate occasions, presenting the tests to the two groups in the opposite order. The degree of acquiescence was assessed by correlating the items with their opposite, and by correlating scores on each version with scores on the acquiescence questionnaire. No major difference were found between the various item and response format in relation to acquiescence. Opsomming Hierdie ondersoek is uitgevoer om te bepaal of die mate van instemmingsgeneigdheid deur die item- en responsformaat van 'n gesommeerde selfbeoordelingskaal beinvloed word. Daar word dikwels aanbeveel dat die gebruik van positief- sowel as negatiefbewoorde items in 'n vraelys instemmingsgeneigdheid beperk. Suike items word gewoonlik in die tradisionele Likertformaat as stellings geformuleer. Die doel van die ondersoek was om te bepaal of items

  11. Gender and Socioeconomic Status DIF on The WISC-IV Turkish Form Items: A Comparison of DIF Detection Tecniques

    Directory of Open Access Journals (Sweden)

    Elif Bengi ÜNSAL ÖZBERK

    2017-03-01

    Full Text Available The purpose of this study is to investigate potential gender and socio-economic status bias in theWechler Intelligence Scale for Children: Fourth Edition (WISC-4 by using several differential item functioning detection techniques. In this study, WISC-4 Turkish standardization test pilot data including 817 children were used. In accordance with the purpose of the study, 315 items were used both in polytomously scored subtests such as Block Design, Similarities, Digit Span, Vocabulary, Letter-Number Sequencing, Comprehension, and dichotomously scored subtests such as Picture Concepts, Matrix Reasoning, Picture Completion, Information, Arithmetic, and Word Reasoning. While Rasch Model, Mantel-Haenszel, and SIBTEST DIF detection techniques were used for dichotomously scored items, Partila Credit Model, Mantel, and Poly-SIBTEST techniques were used for polytomously scored items. In terms of DIF techniques, Mantel-Haenszel, SIBTEST and Mantel Test, Poly-SIBTEST analyses provided similar results when DIF based on gender was investigated. In addition Mantel-Haenszel, Rasch estimations and Partial Credit Model, Mantel Test results were similar while investigating DIF according to socioeconomic status.

  12. Smaller food item sizes of snack foods influence reduced portions and caloric intake in young adults.

    Science.gov (United States)

    Marchiori, David; Waroquier, Laurent; Klein, Olivier

    2011-05-01

    Studies considering the impact of food-size variations on consumption have predominantly focused on portion size, whereas very little research has investigated variations in food-item size, especially at snacking occasions, and results have been contradictory. This study evaluated the effect of altering the size of food items (ie, small vs large candies) of equal-size food portions on short-term energy intake while snacking. The study used a between-subjects design (n=33) in a randomized experiment conducted in spring 2008. In a psychology laboratory (separate cubicles), participants (undergraduate psychology students, 29 of 33 female, mean age 20.3±2 years, mean body mass index 21.7±3.7) were offered unlimited consumption of candies while participating in an unrelated computerized experiment. For half of the subjects, items were cut in two to make the small food-item size. Food intake (weight in grams, kilocalories, and number of food items) was examined using analysis of variance. Results showed that decreasing the item size of candies led participants to decrease by half their gram weight intake, resulting in an energy intake decrease of 60 kcal compared to the other group. Appetite ratings and subject and food characteristics had no moderating effect. A cognitive bias could explain why people tend to consider that one unit of food (eg, 10 candies) is the appropriate amount to consume, regardless of the size of the food items in the unit. This study suggests a simple dietary strategy, decreasing food-item size without having to alter the portion size offered, may reduce energy intake at snacking occasions. Copyright © 2011 American Dietetic Association. Published by Elsevier Inc. All rights reserved.

  13. [Immortal time bias in pharmacoepidemiological studies: definition, solutions and examples].

    Science.gov (United States)

    Faillie, Jean-Luc; Suissa, Samy

    2015-01-01

    Among the observational studies of drug effects in chronic diseases, many of them have found effects that were exaggerated or wrong. Among bias responsible for these errors, the immortal time bias, concerning the definition of exposure and exposure periods, is relevantly important as it usually tends to wrongly attribute a significant benefit to the study drug (or exaggerate a real benefit). In this article, we define the mechanism of immortal time bias, we present possible solutions and illustrate its consequences through examples of pharmacoepidemiological studies of drug effects. © 2014 Société Française de Pharmacologie et de Thérapeutique.

  14. Quantifying lead-time bias in risk factor studies of cancer through simulation.

    Science.gov (United States)

    Jansen, Rick J; Alexander, Bruce H; Anderson, Kristin E; Church, Timothy R

    2013-11-01

    Lead-time is inherent in early detection and creates bias in observational studies of screening efficacy, but its potential to bias effect estimates in risk factor studies is not always recognized. We describe a form of this bias that conventional analyses cannot address and develop a model to quantify it. Surveillance Epidemiology and End Results (SEER) data form the basis for estimates of age-specific preclinical incidence, and log-normal distributions describe the preclinical duration distribution. Simulations assume a joint null hypothesis of no effect of either the risk factor or screening on the preclinical incidence of cancer, and then quantify the bias as the risk-factor odds ratio (OR) from this null study. This bias can be used as a factor to adjust observed OR in the actual study. For this particular study design, as average preclinical duration increased, the bias in the total-physical activity OR monotonically increased from 1% to 22% above the null, but the smoking OR monotonically decreased from 1% above the null to 5% below the null. The finding of nontrivial bias in fixed risk-factor effect estimates demonstrates the importance of quantitatively evaluating it in susceptible studies. Copyright © 2013 Elsevier Inc. All rights reserved.

  15. Test Score Equating Using Discrete Anchor Items versus Passage-Based Anchor Items: A Case Study Using "SAT"® Data. Research Report. ETS RR-14-14

    Science.gov (United States)

    Liu, Jinghua; Zu, Jiyun; Curley, Edward; Carey, Jill

    2014-01-01

    The purpose of this study is to investigate the impact of discrete anchor items versus passage-based anchor items on observed score equating using empirical data.This study compares an "SAT"® critical reading anchor that contains more discrete items proportionally, compared to the total tests to be equated, to another anchor that…

  16. Attentional bias in snus users: an experimental study.

    Directory of Open Access Journals (Sweden)

    Rune Aune Mentzoni

    Full Text Available The use of nicotine in the form of "snus" is substantial and increasing in some geographic areas, in particular among young people. It has previously been suggested that addictions may operate through a mechanism of attentional bias, in which stimuli representative of the dependent substance increase in salience, thus increasing the addictive behavior. However, this hypothesis has not been tested for the case of snus. The current experiment used a modified Stroop task and a dot-probe task to investigate whether 40 snus users show an attentional bias towards snus-relevant stimuli, compared to 40 non-snus users. There were no significant differences between the two groups on reaction times or accuracy on either Stroop or dot-probe task, thus failing to show an attentional bias towards snus-relevant stimuli for snus users. This could imply that other mechanisms may contribute to maintenance of snus use than for other addictions. However, this is the first experimental study investigating attentional bias in snus users, and more research is warranted.

  17. Quasi-experimental study designs series-paper 6: risk of bias assessment.

    Science.gov (United States)

    Waddington, Hugh; Aloe, Ariel M; Becker, Betsy Jane; Djimeu, Eric W; Hombrados, Jorge Garcia; Tugwell, Peter; Wells, George; Reeves, Barney

    2017-09-01

    Rigorous and transparent bias assessment is a core component of high-quality systematic reviews. We assess modifications to existing risk of bias approaches to incorporate rigorous quasi-experimental approaches with selection on unobservables. These are nonrandomized studies using design-based approaches to control for unobservable sources of confounding such as difference studies, instrumental variables, interrupted time series, natural experiments, and regression-discontinuity designs. We review existing risk of bias tools. Drawing on these tools, we present domains of bias and suggest directions for evaluation questions. The review suggests that existing risk of bias tools provide, to different degrees, incomplete transparent criteria to assess the validity of these designs. The paper then presents an approach to evaluating the internal validity of quasi-experiments with selection on unobservables. We conclude that tools for nonrandomized studies of interventions need to be further developed to incorporate evaluation questions for quasi-experiments with selection on unobservables. Copyright © 2017 Elsevier Inc. All rights reserved.

  18. Exploring differential item functioning in the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC

    Directory of Open Access Journals (Sweden)

    Pollard Beth

    2012-12-01

    Full Text Available Abstract Background The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC is a widely used patient reported outcome in osteoarthritis. An important, but frequently overlooked, aspect of validating health outcome measures is to establish if items exhibit differential item functioning (DIF. That is, if respondents have the same underlying level of an attribute, does the item give the same score in different subgroups or is it biased towards one subgroup or another. The aim of the study was to explore DIF in the Likert format WOMAC for the first time in a UK osteoarthritis population with respect to demographic, social, clinical and psychological factors. Methods The sample comprised a community sample of 763 people with osteoarthritis who participated in the Somerset and Avon Survey of Health. The WOMAC was explored for DIF by gender, age, social deprivation, social class, employment status, distress, body mass index and clinical factors. Ordinal regression models were used to identify DIF items. Results After adjusting for age, two items were identified for the physical functioning subscale as having DIF with age identified as the DIF factor for 2 items, gender for 1 item and body mass index for 1 item. For the WOMAC pain subscale, for people with hip osteoarthritis one item was identified with age-related DIF. The impact of the DIF items rarely had a significant effect on the conclusions of group comparisons. Conclusions Overall, the WOMAC performed well with only a small number of DIF items identified. However, as DIF items were identified in for the WOMAC physical functioning subscale it would be advisable to analyse data taking into account the possible impact of the DIF items when weight, gender or especially age effects, are the focus of interest in UK-based osteoarthritis studies. Similarly for the WOMAC pain subscale in people with hip osteoarthritis it would be worthwhile to analyse data taking into account the

  19. Cross-cultural differences in knee functional status outcomes in a polyglot society represented true disparities not biased by differential item functioning.

    Science.gov (United States)

    Deutscher, Daniel; Hart, Dennis L; Crane, Paul K; Dickstein, Ruth

    2010-12-01

    Comparative effectiveness research across cultures requires unbiased measures that accurately detect clinical differences between patient groups. The purpose of this study was to assess the presence and impact of differential item functioning (DIF) in knee functional status (FS) items administered using computerized adaptive testing (CAT) as a possible cause for observed differences in outcomes between 2 cultural patient groups in a polyglot society. This study was a secondary analysis of prospectively collected data. We evaluated data from 9,134 patients with knee impairments from outpatient physical therapy clinics in Israel. Items were analyzed for DIF related to sex, age, symptom acuity, surgical history, exercise history, and language used to complete the functional survey (Hebrew versus Russian). Several items exhibited DIF, but unadjusted FS estimates and FS estimates that accounted for DIF were essentially equal (intraclass correlation coefficient [2,1]>.999). No individual patient had a difference between unadjusted and adjusted FS estimates as large as the median standard error of the unadjusted estimates. Differences between groups defined by any of the covariates considered were essentially unchanged when using adjusted instead of unadjusted FS estimates. The greatest group-level impact was <0.3% of 1 standard deviation of the unadjusted FS estimates. Complete data where patients answered all items in the scale would have been preferred for DIF analysis, but only CAT data were available. Differences in FS outcomes between groups of patients with knee impairments who answered the knee CAT in Hebrew or Russian in Israel most likely reflected true differences that may reflect societal disparities in this health outcome.

  20. Examining the Effect of Reverse Worded Items on the Factor Structure of the Need for Cognition Scale.

    Directory of Open Access Journals (Sweden)

    Xijuan Zhang

    Full Text Available Reverse worded (RW items are often used to reduce or eliminate acquiescence bias, but there is a rising concern about their harmful effects on the covariance structure of the scale. Therefore, results obtained via traditional covariance analyses may be distorted. This study examined the effect of the RW items on the factor structure of the abbreviated 18-item Need for Cognition (NFC scale using confirmatory factor analysis. We modified the scale to create three revised versions, varying from no RW items to all RW items. We also manipulated the type of the RW items (polar opposite vs. negated. To each of the four scales, we fit four previously developed models. The four models included a 1-factor model, a 2-factor model distinguishing between positively worded (PW items and RW items, and two 2-factor models, each with one substantive factor and one method factor. Results showed that the number and type of the RW items affected the factor structure of the NFC scale. Consistent with previous research findings, for the original NFC scale, which contains both PW and RW items, the 1-factor model did not have good fit. In contrast, for the revised scales that had no RW items or all RW items, the 1-factor model had reasonably good fit. In addition, for the scale with polar opposite and negated RW items, the factor model with a method factor among the polar opposite items had considerably better fit than the 1-factor model.

  1. Potential Reporting Bias in Neuroimaging Studies of Sex Differences.

    Science.gov (United States)

    David, Sean P; Naudet, Florian; Laude, Jennifer; Radua, Joaquim; Fusar-Poli, Paolo; Chu, Isabella; Stefanick, Marcia L; Ioannidis, John P A

    2018-04-17

    Numerous functional magnetic resonance imaging (fMRI) studies have reported sex differences. To empirically evaluate for evidence of excessive significance bias in this literature, we searched for published fMRI studies of human brain to evaluate sex differences, regardless of the topic investigated, in Medline and Scopus over 10 years. We analyzed the prevalence of conclusions in favor of sex differences and the correlation between study sample sizes and number of significant foci identified. In the absence of bias, larger studies (better powered) should identify a larger number of significant foci. Across 179 papers, median sample size was n = 32 (interquartile range 23-47.5). A median of 5 foci related to sex differences were reported (interquartile range, 2-9.5). Few articles (n = 2) had titles focused on no differences or on similarities (n = 3) between sexes. Overall, 158 papers (88%) reached "positive" conclusions in their abstract and presented some foci related to sex differences. There was no statistically significant relationship between sample size and the number of foci (-0.048% increase for every 10 participants, p = 0.63). The extremely high prevalence of "positive" results and the lack of the expected relationship between sample size and the number of discovered foci reflect probable reporting bias and excess significance bias in this literature.

  2. CEO emotional bias and investment decision, Bayesian network method

    Directory of Open Access Journals (Sweden)

    Jarboui Anis

    2012-08-01

    Full Text Available This research examines the determinants of firms’ investment introducing a behavioral perspective that has received little attention in corporate finance literature. The following central hypothesis emerges from a set of recently developed theories: Investment decisions are influenced not only by their fundamentals but also depend on some other factors. One factor is the biasness of any CEO to their investment, biasness depends on the cognition and emotions, because some leaders use them as heuristic for the investment decision instead of fundamentals. This paper shows how CEO emotional bias (optimism, loss aversion and overconfidence affects the investment decisions. The proposed model of this paper uses Bayesian Network Method to examine this relationship. Emotional bias has been measured by means of a questionnaire comprising several items. As for the selected sample, it has been composed of some 100 Tunisian executives. Our results have revealed that the behavioral analysis of investment decision implies leader affected by behavioral biases (optimism, loss aversion, and overconfidence adjusts its investment choices based on their ability to assess alternatives (optimism and overconfidence and risk perception (loss aversion to create of shareholder value and ensure its place at the head of the management team.

  3. Research bias in judgement bias studies : a systematic review of valuation judgement literature

    NARCIS (Netherlands)

    Vincent Gruis; Pim Klamer; Cok Bakker

    2017-01-01

    Valuation judgement bias has been a research topic for several years due to its proclaimed effect on valuation accuracy. However, little is known on the emphasis of literature on judgement bias, with regard to, for instance, research methodologies, research context and robustness of research

  4. Research bias in judgement bias studies : A systematic review of valuation judgement literature

    NARCIS (Netherlands)

    Klamer, Pim; Bakker, C.; Gruis, Vincent

    2017-01-01

    Valuation judgement bias has been a research topic for several years due to its proclaimed effect on valuation accuracy. However, little is known on the emphasis of literature on judgement bias, with regard to, for instance, research methodologies, research context and robustness of research

  5. A Systematic Review of Attention Biases in Opioid, Cannabis, Stimulant Use Disorders.

    Science.gov (United States)

    Zhang, Melvyn; Ying, Jiangbo; Wing, Tracey; Song, Guo; Fung, Daniel S S; Smith, Helen

    2018-06-01

    Background : Opiates, cannabis, and amphetamines are highly abused, and use of these substances are prevalent disorders. Psychological interventions are crucial given that they help individuals maintain abstinence following a lapse or relapse into substance use. Advances in experimental psychology have suggested that automatic attention biases might be responsible for relapse. Prior reviews have provided evidence for the presence of these biases in addictive disorders and the effectiveness of bias modification. However, the prior studies are limited, as they failed to include trials involving participants with these prevalent addictive disorders or have failed to adopt a systematic approach in evidence synthesis. Objectives : The primary aim of this current systematic review is to synthesise the current evidence for attention biases amongst opioid use, cannabis use, and stimulant use disorders. The secondary aim is to determine the efficacy of attention bias modification interventions and other addictions related outcomes. Methods : A search was conducted from November 2017 to January 2018 on PubMed, MEDLINE, Embase, PsycINFO, Science Direct, Cochrane Central, and Scopus. The selection process of the articles was in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analysis guidelines. A qualitative synthesis was undertaken. Risk of bias was assessed using the Cochrane Risk of Bias tool. Results : Six randomised trials were identified. The evidence synthesized from these trials have provided strong evidence that attentional biases are present in opioid and stimulant use disorders. Evidence synthesis for other secondary outcome measures could not be performed given the heterogeneity in the measures reported and the limited number of trials. The risk of bias assessment for the included trials revealed a high risk of selection and attrition bias. Conclusions : This review demonstrates the potential need for interventions targeting attention

  6. Investigating vulnerability to eating disorders: biases in emotional processing.

    Science.gov (United States)

    Pringle, A; Harmer, C J; Cooper, M J

    2010-04-01

    Biases in emotional processing and cognitions about the self are thought to play a role in the maintenance of eating disorders (EDs). However, little is known about whether these difficulties exist pre-morbidly and how they might contribute to risk. Female dieters (n=82) completed a battery of tasks designed to assess the processing of social cues (facial emotion recognition), cognitions about the self [Self-Schema Processing Task (SSPT)] and ED-specific cognitions about eating, weight and shape (emotional Stroop). The 26-item Eating Attitudes Test (EAT-26; Garner et al. 1982) was used to assess subclinical ED symptoms; this was used as an index of vulnerability within this at-risk group. Regression analyses showed that biases in the processing of both neutral and angry faces were predictive of our measure of vulnerability (EAT-26). In the self-schema task, biases in the processing of negative self descriptors previously found to be common in EDs predicted vulnerability. Biases in the processing of shape-related words on the Stroop task were also predictive; however, these biases were more important in dieters who also displayed biases in the self-schema task. We were also able to demonstrate that these biases are specific and separable from more general negative biases that could be attributed to depressive symptoms. These results suggest that specific biases in the processing of social cues, cognitions about the self, and also about eating, weight and shape information, may be important in understanding risk and preventing relapse in EDs.

  7. Examining Event-Related Potential (ERP) correlates of decision bias in recognition memory judgments.

    Science.gov (United States)

    Hill, Holger; Windmann, Sabine

    2014-01-01

    Memory judgments can be based on accurate memory information or on decision bias (the tendency to report that an event is part of episodic memory when one is in fact unsure). Event related potentials (ERP) correlates are important research tools for elucidating the dynamics underlying memory judgments but so far have been established only for investigations of accurate old/new discrimination. To identify the ERP correlates of bias, and observe how these interact with ERP correlates of memory, we conducted three experiments that manipulated decision bias within participants via instructions during recognition memory tests while their ERPs were recorded. In Experiment 1, the bias manipulation was performed between blocks of trials (automatized bias) and compared to trial-by-trial shifts of bias in accord with an external cue (flexibly controlled bias). In Experiment 2, the bias manipulation was performed at two different levels of accurate old/new discrimination as the memory strength of old (studied) items was varied. In Experiment 3, the bias manipulation was added to another, bottom-up driven manipulation of bias induced via familiarity. In the first two Experiments, and in the low familiarity condition of Experiment 3, we found evidence of an early frontocentral ERP component at 320 ms poststimulus (the FN320) that was sensitive to the manipulation of bias via instruction, with more negative amplitudes indexing more liberal bias. By contrast, later during the trial (500-700 ms poststimulus), bias effects interacted with old/new effects across all three experiments. Results suggest that the decision criterion is typically activated early during recognition memory trials, and is integrated with retrieved memory signals and task-specific processing demands later during the trial. More generally, the findings demonstrate how ERPs can help to specify the dynamics of recognition memory processes under top-down and bottom-up controlled retrieval conditions.

  8. Examining Event-Related Potential (ERP) Correlates of Decision Bias in Recognition Memory Judgments

    Science.gov (United States)

    Hill, Holger; Windmann, Sabine

    2014-01-01

    Memory judgments can be based on accurate memory information or on decision bias (the tendency to report that an event is part of episodic memory when one is in fact unsure). Event related potentials (ERP) correlates are important research tools for elucidating the dynamics underlying memory judgments but so far have been established only for investigations of accurate old/new discrimination. To identify the ERP correlates of bias, and observe how these interact with ERP correlates of memory, we conducted three experiments that manipulated decision bias within participants via instructions during recognition memory tests while their ERPs were recorded. In Experiment 1, the bias manipulation was performed between blocks of trials (automatized bias) and compared to trial-by-trial shifts of bias in accord with an external cue (flexibly controlled bias). In Experiment 2, the bias manipulation was performed at two different levels of accurate old/new discrimination as the memory strength of old (studied) items was varied. In Experiment 3, the bias manipulation was added to another, bottom-up driven manipulation of bias induced via familiarity. In the first two Experiments, and in the low familiarity condition of Experiment 3, we found evidence of an early frontocentral ERP component at 320 ms poststimulus (the FN320) that was sensitive to the manipulation of bias via instruction, with more negative amplitudes indexing more liberal bias. By contrast, later during the trial (500–700 ms poststimulus), bias effects interacted with old/new effects across all three experiments. Results suggest that the decision criterion is typically activated early during recognition memory trials, and is integrated with retrieved memory signals and task-specific processing demands later during the trial. More generally, the findings demonstrate how ERPs can help to specify the dynamics of recognition memory processes under top-down and bottom-up controlled retrieval conditions. PMID

  9. Examining Event-Related Potential (ERP correlates of decision bias in recognition memory judgments.

    Directory of Open Access Journals (Sweden)

    Holger Hill

    Full Text Available Memory judgments can be based on accurate memory information or on decision bias (the tendency to report that an event is part of episodic memory when one is in fact unsure. Event related potentials (ERP correlates are important research tools for elucidating the dynamics underlying memory judgments but so far have been established only for investigations of accurate old/new discrimination. To identify the ERP correlates of bias, and observe how these interact with ERP correlates of memory, we conducted three experiments that manipulated decision bias within participants via instructions during recognition memory tests while their ERPs were recorded. In Experiment 1, the bias manipulation was performed between blocks of trials (automatized bias and compared to trial-by-trial shifts of bias in accord with an external cue (flexibly controlled bias. In Experiment 2, the bias manipulation was performed at two different levels of accurate old/new discrimination as the memory strength of old (studied items was varied. In Experiment 3, the bias manipulation was added to another, bottom-up driven manipulation of bias induced via familiarity. In the first two Experiments, and in the low familiarity condition of Experiment 3, we found evidence of an early frontocentral ERP component at 320 ms poststimulus (the FN320 that was sensitive to the manipulation of bias via instruction, with more negative amplitudes indexing more liberal bias. By contrast, later during the trial (500-700 ms poststimulus, bias effects interacted with old/new effects across all three experiments. Results suggest that the decision criterion is typically activated early during recognition memory trials, and is integrated with retrieved memory signals and task-specific processing demands later during the trial. More generally, the findings demonstrate how ERPs can help to specify the dynamics of recognition memory processes under top-down and bottom-up controlled retrieval conditions.

  10. Item validity vs. item discrimination index: a redundancy?

    Science.gov (United States)

    Panjaitan, R. L.; Irawati, R.; Sujana, A.; Hanifah, N.; Djuanda, D.

    2018-03-01

    In several literatures about evaluation and test analysis, it is common to find that there are calculations of item validity as well as item discrimination index (D) with different formula for each. Meanwhile, other resources said that item discrimination index could be obtained by calculating the correlation between the testee’s score in a particular item and the testee’s score on the overall test, which is actually the same concept as item validity. Some research reports, especially undergraduate theses tend to include both item validity and item discrimination index in the instrument analysis. It seems that these concepts might overlap for both reflect the test quality on measuring the examinees’ ability. In this paper, examples of some results of data processing on item validity and item discrimination index were compared. It would be discussed whether item validity and item discrimination index can be represented by one of them only or it should be better to present both calculations for simple test analysis, especially in undergraduate theses where test analyses were included.

  11. Avoiding and Correcting Bias in Score-Based Latent Variable Regression with Discrete Manifest Items

    Science.gov (United States)

    Lu, Irene R. R.; Thomas, D. Roland

    2008-01-01

    This article considers models involving a single structural equation with latent explanatory and/or latent dependent variables where discrete items are used to measure the latent variables. Our primary focus is the use of scores as proxies for the latent variables and carrying out ordinary least squares (OLS) regression on such scores to estimate…

  12. Diagnostic Reasoning and Cognitive Biases of Nurse Practitioners.

    Science.gov (United States)

    Lawson, Thomas N

    2018-04-01

    Diagnostic reasoning is often used colloquially to describe the process by which nurse practitioners and physicians come to the correct diagnosis, but a rich definition and description of this process has been lacking in the nursing literature. A literature review was conducted with theoretical sampling seeking conceptual insight into diagnostic reasoning. Four common themes emerged: Cognitive Biases and Debiasing Strategies, the Dual Process Theory, Diagnostic Error, and Patient Harm. Relevant cognitive biases are discussed, followed by debiasing strategies and application of the dual process theory to reduce diagnostic error and harm. The accuracy of diagnostic reasoning of nurse practitioners may be improved by incorporating these items into nurse practitioner education and practice. [J Nurs Educ. 2018;57(4):203-208.]. Copyright 2018, SLACK Incorporated.

  13. Item response theory analysis applied to the Spanish version of the Personal Outcomes Scale.

    Science.gov (United States)

    Guàrdia-Olmos, J; Carbó-Carreté, M; Peró-Cebollero, M; Giné, C

    2017-11-01

    The study of measurements of quality of life (QoL) is one of the great challenges of modern psychology and psychometric approaches. This issue has greater importance when examining QoL in populations that were historically treated on the basis of their deficiency, and recently, the focus has shifted to what each person values and desires in their life, as in cases of people with intellectual disability (ID). Many studies of QoL scales applied in this area have attempted to improve the validity and reliability of their components by incorporating various sources of information to achieve consistency in the data obtained. The adaptation of the Personal Outcomes Scale (POS) in Spanish has shown excellent psychometric attributes, and its administration has three sources of information: self-assessment, practitioner and family. The study of possible congruence or incongruence of observed distributions of each item between sources is therefore essential to ensure a correct interpretation of the measure. The aim of this paper was to analyse the observed distribution of items and dimensions from the three Spanish POS information sources cited earlier, using the item response theory. We studied a sample of 529 people with ID and their respective practitioners and family member, and in each case, we analysed items and factors using Samejima's model of polytomic ordinal scales. The results indicated an important number of items with differential effects regarding sources, and in some cases, they indicated significant differences in the distribution of items, factors and sources of information. As a result of this analysis, we must affirm that the administration of the POS, considering three sources of information, was adequate overall, but a correct interpretation of the results requires that it obtain much more information to consider, as well as some specific items in specific dimensions. The overall ratings, if these comments are considered, could result in bias. © 2017

  14. The strength of attentional biases reduces as visual short-term memory load increases.

    Science.gov (United States)

    Shimi, A; Astle, D E

    2013-07-01

    Despite our visual system receiving irrelevant input that competes with task-relevant signals, we are able to pursue our perceptual goals. Attention enhances our visual processing by biasing the processing of the input that is relevant to the task at hand. The top-down signals enabling these biases are therefore important for regulating lower level sensory mechanisms. In three experiments, we examined whether we apply similar biases to successfully maintain information in visual short-term memory (VSTM). We presented participants with targets alongside distracters and we graded their perceptual similarity to vary the extent to which they competed. Experiments 1 and 2 showed that the more items held in VSTM before the onset of the distracters, the more perceptually distinct the distracters needed to be for participants to retain the target accurately. Experiment 3 extended these behavioral findings by demonstrating that the perceptual similarity between target and distracters exerted a significantly greater effect on occipital alpha amplitudes, depending on the number of items already held in VSTM. The trade-off between VSTM load and target-distracter competition suggests that VSTM and perceptual competition share a partially overlapping mechanism, namely top-down inputs into sensory areas.

  15. Evaluation of psychometric properties and differential item functioning of 8-item Child Perceptions Questionnaires using item response theory.

    Science.gov (United States)

    Yau, David T W; Wong, May C M; Lam, K F; McGrath, Colman

    2015-08-19

    Four-factor structure of the two 8-item short forms of Child Perceptions Questionnaire CPQ11-14 (RSF:8 and ISF:8) has been confirmed. However, the sum scores are typically reported in practice as a proxy of Oral health-related Quality of Life (OHRQoL), which implied a unidimensional structure. This study first assessed the unidimensionality of 8-item short forms of CPQ11-14. Item response theory (IRT) was employed to offer an alternative and complementary approach of validation and to overcome the limitations of classical test theory assumptions. A random sample of 649 12-year-old school children in Hong Kong was analyzed. Unidimensionality of the scale was tested by confirmatory factor analysis (CFA), principle component analysis (PCA) and local dependency (LD) statistic. Graded response model was fitted to the data. Contribution of each item to the scale was assessed by item information function (IIF). Reliability of the scale was assessed by test information function (TIF). Differential item functioning (DIF) across gender was identified by Wald test and expected score functions. Both CPQ11-14 RSF:8 and ISF:8 did not deviate much from the unidimensionality assumption. Results from CFA indicated acceptable fit of the one-factor model. PCA indicated that the first principle component explained >30 % of the total variation with high factor loadings for both RSF:8 and ISF:8. Almost all LD statistic items suggesting little contribution of information to the scale and item removal caused little practical impact. Comparing the TIFs, RSF:8 showed slightly better information than ISF:8. In addition to oral symptoms items, the item "Concerned with what other people think" demonstrated a uniform DIF (p Items related to oral symptoms were not informative to OHRQoL and deletion of these items is suggested. The impact of DIF across gender on the overall score was minimal. CPQ11-14 RSF:8 performed slightly better than ISF:8 in measurement precision. The 6-item short forms

  16. Combination of biased forecasts: Bias correction or bias based weights?

    OpenAIRE

    Wenzel, Thomas

    1999-01-01

    Most of the literature on combination of forecasts deals with the assumption of unbiased individual forecasts. Here, we consider the case of biased forecasts and discuss two different combination techniques resulting in an unbiased forecast. On the one hand we correct the individual forecasts, and on the other we calculate bias based weights. A simulation study gives some insight in the situations where we should use the different methods.

  17. Bias against research on gender bias.

    Science.gov (United States)

    Cislak, Aleksandra; Formanowicz, Magdalena; Saguy, Tamar

    2018-01-01

    The bias against women in academia is a documented phenomenon that has had detrimental consequences, not only for women, but also for the quality of science. First, gender bias in academia affects female scientists, resulting in their underrepresentation in academic institutions, particularly in higher ranks. The second type of gender bias in science relates to some findings applying only to male participants, which produces biased knowledge. Here, we identify a third potentially powerful source of gender bias in academia: the bias against research on gender bias. In a bibliometric investigation covering a broad range of social sciences, we analyzed published articles on gender bias and race bias and established that articles on gender bias are funded less often and published in journals with a lower Impact Factor than articles on comparable instances of social discrimination. This result suggests the possibility of an underappreciation of the phenomenon of gender bias and related research within the academic community. Addressing this meta-bias is crucial for the further examination of gender inequality, which severely affects many women across the world.

  18. Bias During the Evaluation of Animal Studies?

    Directory of Open Access Journals (Sweden)

    Andrew Knight

    2012-02-01

    Full Text Available My recent book entitled The Costs and Benefits of Animal Experiments seeks to answer a key question within animal ethics, namely: is animal experimentation ethically justifiable? Or, more precisely, is it justifiable within the utilitarian cost:benefit framework that fundamentally underpins most regulations governing animal experimentation? To answer this question I reviewed more than 500 scientific publications describing animal studies, animal welfare impacts, and alternative research, toxicity testing and educational methodologies. To minimise bias I focused primarily on large-scale systematic reviews that had examined the human clinical and toxicological utility of animal studies. Despite this, Dr. Susanne Prankel recently reviewed my book in this journal, essentially accusing me of bias. However, she failed to provide any substantive evidence to refute my conclusions, let alone evidence of similar weight to that on which they are based. Those conclusions are, in fact, firmly based on utilitarian ethical reasoning, informed by scientific evidence of considerable strength, and I believe they are robust.

  19. Bias During the Evaluation of Animal Studies?

    Science.gov (United States)

    Knight, Andrew

    2012-02-23

    My recent book entitled The Costs and Benefits of Animal Experiments seeks to answer a key question within animal ethics, namely: is animal experimentation ethically justifiable? Or, more precisely, is it justifiable within the utilitarian cost:benefit framework that fundamentally underpins most regulations governing animal experimentation? To answer this question I reviewed more than 500 scientific publications describing animal studies, animal welfare impacts, and alternative research, toxicity testing and educational methodologies. To minimise bias I focused primarily on large-scale systematic reviews that had examined the human clinical and toxicological utility of animal studies. Despite this, Dr. Susanne Prankel recently reviewed my book in this journal, essentially accusing me of bias. However, she failed to provide any substantive evidence to refute my conclusions, let alone evidence of similar weight to that on which they are based. Those conclusions are, in fact, firmly based on utilitarian ethical reasoning, informed by scientific evidence of considerable strength, and I believe they are robust.

  20. Development of a subjective cognitive decline questionnaire using item response theory: a pilot study.

    Science.gov (United States)

    Gifford, Katherine A; Liu, Dandan; Romano, Raymond; Jones, Richard N; Jefferson, Angela L

    2015-12-01

    Subjective cognitive decline (SCD) may indicate unhealthy cognitive changes, but no standardized SCD measurement exists. This pilot study aims to identify reliable SCD questions. 112 cognitively normal (NC, 76±8 years, 63% female), 43 mild cognitive impairment (MCI; 77±7 years, 51% female), and 33 diagnostically ambiguous participants (79±9 years, 58% female) were recruited from a research registry and completed 57 self-report SCD questions. Psychometric methods were used for item-reduction. Factor analytic models assessed unidimensionality of the latent trait (SCD); 19 items were removed with extreme response distribution or trait-fit. Item response theory (IRT) provided information about question utility; 17 items with low information were dropped. Post-hoc simulation using computerized adaptive test (CAT) modeling selected the most commonly used items (n=9 of 21 items) that represented the latent trait well (r=0.94) and differentiated NC from MCI participants (F(1,146)=8.9, p=0.003). Item response theory and computerized adaptive test modeling identified nine reliable SCD items. This pilot study is a first step toward refining SCD assessment in older adults. Replication of these findings and validation with Alzheimer's disease biomarkers will be an important next step for the creation of a SCD screener.

  1. Investigating the Impact of Item Parameter Drift for Item Response Theory Models with Mixture Distributions

    Directory of Open Access Journals (Sweden)

    Yoon Soo ePark

    2016-02-01

    Full Text Available This study investigates the impact of item parameter drift (IPD on parameter and ability estimation when the underlying measurement model fits a mixture distribution, thereby violating the item invariance property of unidimensional item response theory (IRT models. An empirical study was conducted to demonstrate the occurrence of both IPD and an underlying mixture distribution using real-world data. Twenty-one trended anchor items from the 1999, 2003, and 2007 administrations of Trends in International Mathematics and Science Study (TIMSS were analyzed using unidimensional and mixture IRT models. TIMSS treats trended anchor items as invariant over testing administrations and uses pre-calibrated item parameters based on unidimensional IRT. However, empirical results showed evidence of two latent subgroups with IPD. Results showed changes in the distribution of examinee ability between latent classes over the three administrations. A simulation study was conducted to examine the impact of IPD on the estimation of ability and item parameters, when data have underlying mixture distributions. Simulations used data generated from a mixture IRT model and estimated using unidimensional IRT. Results showed that data reflecting IPD using mixture IRT model led to IPD in the unidimensional IRT model. Changes in the distribution of examinee ability also affected item parameters. Moreover, drift with respect to item discrimination and distribution of examinee ability affected estimates of examinee ability. These findings demonstrate the need to caution and evaluate IPD using a mixture IRT framework to understand its effect on item parameters and examinee ability.

  2. Investigating the Impact of Item Parameter Drift for Item Response Theory Models with Mixture Distributions.

    Science.gov (United States)

    Park, Yoon Soo; Lee, Young-Sun; Xing, Kuan

    2016-01-01

    This study investigates the impact of item parameter drift (IPD) on parameter and ability estimation when the underlying measurement model fits a mixture distribution, thereby violating the item invariance property of unidimensional item response theory (IRT) models. An empirical study was conducted to demonstrate the occurrence of both IPD and an underlying mixture distribution using real-world data. Twenty-one trended anchor items from the 1999, 2003, and 2007 administrations of Trends in International Mathematics and Science Study (TIMSS) were analyzed using unidimensional and mixture IRT models. TIMSS treats trended anchor items as invariant over testing administrations and uses pre-calibrated item parameters based on unidimensional IRT. However, empirical results showed evidence of two latent subgroups with IPD. Results also showed changes in the distribution of examinee ability between latent classes over the three administrations. A simulation study was conducted to examine the impact of IPD on the estimation of ability and item parameters, when data have underlying mixture distributions. Simulations used data generated from a mixture IRT model and estimated using unidimensional IRT. Results showed that data reflecting IPD using mixture IRT model led to IPD in the unidimensional IRT model. Changes in the distribution of examinee ability also affected item parameters. Moreover, drift with respect to item discrimination and distribution of examinee ability affected estimates of examinee ability. These findings demonstrate the need to caution and evaluate IPD using a mixture IRT framework to understand its effects on item parameters and examinee ability.

  3. A Study on the Systematization of Classification Process for NSG Trigger List Items

    Energy Technology Data Exchange (ETDEWEB)

    Yang, Seunghyo; Tae, Jaewoong; Shin, Donghoon [Korea Institute of Nuclear Nonproliferation and Control/Nuclear Export Control Div., Daejeon (Korea, Republic of)

    2013-05-15

    In 1978, Nuclear Suppliers Group (NSG) was established to prevent nuclear items from being used for nuclear weapons. NSG drew up the NSG Guidelines (INFCIRC/254) that regulates export control items(so-called NSG trigger list items) and procedures. NSG recommends its member countries to reflect these guidelines on their export control systems and fulfill their obligations. Korea has carried out export controls on nuclear items by reflecting NSG Guidelines on Notice on Trade of Strategic Item of Foreign Trade Act since joining NSG in 1995. Nuclear export control starts with Classification that determines whether export items can be used for strategic items (goods and technologies that can be exclusively used for the manufacture, development and use of WMD). The standard of Classification is based on the NSG Guidelines. However, due to the qualitative characteristics of the Guidelines, there take place lots of difficulties in the Classification. Thus this study aims to suggest the systematic Classification process. Recently, the number of Classification requests is rapidly increasing due to the UAE commercial nuclear power plants and the Jordan reactors export. It is required to provide a more systematic Classification standard and process in order to provide an efficient and consistent Classification. Thus, this study analyzed limitations of EDP which causes difficulties in the process of classification due to its qualitative characteristics. Besides, it established systematic Classification process by quantitatively analyzing EDP. Consequently, it is expected that the results of this study will be used for as actual Classification. It still remains to establish a criterion of detailed information, which is one of the most important in the Classification for technology. Therefore, a further study will be conducted to establish a criterion of detailed information by analyzing Classification cases through the text mining techniques.

  4. A Study on the Systematization of Classification Process for NSG Trigger List Items

    International Nuclear Information System (INIS)

    Yang, Seunghyo; Tae, Jaewoong; Shin, Donghoon

    2013-01-01

    In 1978, Nuclear Suppliers Group (NSG) was established to prevent nuclear items from being used for nuclear weapons. NSG drew up the NSG Guidelines (INFCIRC/254) that regulates export control items(so-called NSG trigger list items) and procedures. NSG recommends its member countries to reflect these guidelines on their export control systems and fulfill their obligations. Korea has carried out export controls on nuclear items by reflecting NSG Guidelines on Notice on Trade of Strategic Item of Foreign Trade Act since joining NSG in 1995. Nuclear export control starts with Classification that determines whether export items can be used for strategic items (goods and technologies that can be exclusively used for the manufacture, development and use of WMD). The standard of Classification is based on the NSG Guidelines. However, due to the qualitative characteristics of the Guidelines, there take place lots of difficulties in the Classification. Thus this study aims to suggest the systematic Classification process. Recently, the number of Classification requests is rapidly increasing due to the UAE commercial nuclear power plants and the Jordan reactors export. It is required to provide a more systematic Classification standard and process in order to provide an efficient and consistent Classification. Thus, this study analyzed limitations of EDP which causes difficulties in the process of classification due to its qualitative characteristics. Besides, it established systematic Classification process by quantitatively analyzing EDP. Consequently, it is expected that the results of this study will be used for as actual Classification. It still remains to establish a criterion of detailed information, which is one of the most important in the Classification for technology. Therefore, a further study will be conducted to establish a criterion of detailed information by analyzing Classification cases through the text mining techniques

  5. Answering Fixed Response Items in Chemistry: A Pilot Study.

    Science.gov (United States)

    Hateley, R. J.

    1979-01-01

    Presents a pilot study on student thinking in chemistry. Verbal comments of a group of six college students were recorded and analyzed to identify how each student arrives at the correct answer in fixed response items in chemisty. (HM)

  6. Nonresponse bias in randomized controlled experiments in criminology: Putting the Queensland Community Engagement Trial (QCET) under a microscope.

    Science.gov (United States)

    Antrobus, Emma; Elffers, Henk; White, Gentry; Mazerolle, Lorraine

    2013-01-01

    The goal of this article is to examine whether or not the results of the Queensland Community Engagement Trial (QCET)-a randomized controlled trial that tested the impact of procedural justice policing on citizen attitudes toward police-were affected by different types of nonresponse bias. We use two methods (Cochrane and Elffers methods) to explore nonresponse bias: First, we assess the impact of the low response rate by examining the effects of nonresponse group differences between the experimental and control conditions and pooled variance under different scenarios. Second, we assess the degree to which item response rates are influenced by the control and experimental conditions. Our analysis of the QCET data suggests that our substantive findings are not influenced by the low response rate in the trial. The results are robust even under extreme conditions, and statistical significance of the results would only be compromised in cases where the pooled variance was much larger for the nonresponse group and the difference between experimental and control conditions was greatly diminished. We also find that there were no biases in the item response rates across the experimental and control conditions. RCTs that involve field survey responses-like QCET-are potentially compromised by low response rates and how item response rates might be influenced by the control or experimental conditions. Our results show that the QCET results were not sensitive to the overall low response rate across the experimental and control conditions and the item response rates were not significantly different across the experimental and control groups. Overall, our analysis suggests that the results of QCET are robust and any biases in the survey responses do not significantly influence the main experimental findings.

  7. The specificity of attentional biases by type of gambling: An eye-tracking study

    OpenAIRE

    McGrath, Daniel S.; Meitner, Amadeus; Sears, Christopher R.

    2018-01-01

    A growing body of research indicates that gamblers develop an attentional bias for gambling-related stimuli. Compared to research on substance use, however, few studies have examined attentional biases in gamblers using eye-gaze tracking, which has many advantages over other measures of attention. In addition, previous studies of attentional biases in gamblers have not directly matched type of gambler with personally-relevant gambling cues. The present study investigated the specificity of at...

  8. Using automatic item generation to create multiple-choice test items.

    Science.gov (United States)

    Gierl, Mark J; Lai, Hollis; Turner, Simon R

    2012-08-01

    Many tests of medical knowledge, from the undergraduate level to the level of certification and licensure, contain multiple-choice items. Although these are efficient in measuring examinees' knowledge and skills across diverse content areas, multiple-choice items are time-consuming and expensive to create. Changes in student assessment brought about by new forms of computer-based testing have created the demand for large numbers of multiple-choice items. Our current approaches to item development cannot meet this demand. We present a methodology for developing multiple-choice items based on automatic item generation (AIG) concepts and procedures. We describe a three-stage approach to AIG and we illustrate this approach by generating multiple-choice items for a medical licensure test in the content area of surgery. To generate multiple-choice items, our method requires a three-stage process. Firstly, a cognitive model is created by content specialists. Secondly, item models are developed using the content from the cognitive model. Thirdly, items are generated from the item models using computer software. Using this methodology, we generated 1248 multiple-choice items from one item model. Automatic item generation is a process that involves using models to generate items using computer technology. With our method, content specialists identify and structure the content for the test items, and computer technology systematically combines the content to generate new test items. By combining these outcomes, items can be generated automatically. © Blackwell Publishing Ltd 2012.

  9. A Systematic Approach to Identify Promising New Items for Small to Medium Enterprises: A Case Study

    Directory of Open Access Journals (Sweden)

    Sukjae Jeong

    2016-11-01

    Full Text Available Despite the growing importance of identifying new business items for small and medium enterprises (SMEs, most previous studies focus on conglomerates. The paucity of empirical studies has also led to limited real-life applications. Hence, this study proposes a systematic approach to find new business items (NBIs that help the prospective SMEs develop, evaluate, and select viable business items to survive the competitive environment. The proposed approach comprises two stages: (1 the classification of diversification of SMEs; and (2 the searching and screening of business items. In the first stage, SMEs are allocated to five groups, based on their internal technological competency and external market conditions. In the second stage, based on the types of SMEs identified in the first stage, a set of alternative business items is derived by combining the results of portfolio analysis and benchmarking analysis. After deriving new business items, a market and technology-driven matrix analysis is utilized to screen suitable business items, and the Bruce Merrifield-Ohe (BMO method is used to categorize and identify prospective items based on market attractiveness and internal capability. To illustrate the applicability of the proposed approach, a case study is presented.

  10. Evaluation of Northwest University, Kano Post-UTME Test Items Using Item Response Theory

    Science.gov (United States)

    Bichi, Ado Abdu; Hafiz, Hadiza; Bello, Samira Abdullahi

    2016-01-01

    High-stakes testing is used for the purposes of providing results that have important consequences. Validity is the cornerstone upon which all measurement systems are built. This study applied the Item Response Theory principles to analyse Northwest University Kano Post-UTME Economics test items. The developed fifty (50) economics test items was…

  11. Publication bias in studies of an applied behavior-analytic intervention: an initial analysis.

    Science.gov (United States)

    Sham, Elyssa; Smith, Tristram

    2014-01-01

    Publication bias arises when studies with favorable results are more likely to be reported than are studies with null findings. If this bias occurs in studies with single-subject experimental designs(SSEDs) on applied behavior-analytic (ABA) interventions, it could lead to exaggerated estimates of intervention effects. Therefore, we conducted an initial test of bias by comparing effect sizes, measured by percentage of nonoverlapping data (PND), in published SSED studies (n=21) and unpublished dissertations (n=10) on 1 well-established intervention for children with autism, pivotal response treatment (PRT). Although published and unpublished studies had similar methodologies, the mean PND in published studies was 22% higher than in unpublished studies, 95% confidence interval (4%, 38%). Even when unpublished studies are included, PRT appeared to be effective (PNDM=62%). Nevertheless, the disparity between published and unpublished studies suggests a need for further assessment of publication bias in the ABA literature.

  12. The role of attention in item-item binding in visual working memory.

    Science.gov (United States)

    Peterson, Dwight J; Naveh-Benjamin, Moshe

    2017-09-01

    An important yet unresolved question regarding visual working memory (VWM) relates to whether or not binding processes within VWM require additional attentional resources compared with processing solely the individual components comprising these bindings. Previous findings indicate that binding of surface features (e.g., colored shapes) within VWM is not demanding of resources beyond what is required for single features. However, it is possible that other types of binding, such as the binding of complex, distinct items (e.g., faces and scenes), in VWM may require additional resources. In 3 experiments, we examined VWM item-item binding performance under no load, articulatory suppression, and backward counting using a modified change detection task. Binding performance declined to a greater extent than single-item performance under higher compared with lower levels of concurrent load. The findings from each of these experiments indicate that processing item-item bindings within VWM requires a greater amount of attentional resources compared with single items. These findings also highlight an important distinction between the role of attention in item-item binding within VWM and previous studies of long-term memory (LTM) where declines in single-item and binding test performance are similar under divided attention. The current findings provide novel evidence that the specific type of binding is an important determining factor regarding whether or not VWM binding processes require attention. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  13. Diet and the risk of breast cancer in a case-control study: does the threat of disease have an influence on recall bias?

    Science.gov (United States)

    Männistö, S; Pietinen, P; Virtanen, M; Kataja, V; Uusitupa, M

    1999-05-01

    It has been suggested that recall bias may explain the discrepant results between case-control and cohort studies on diet and the risk of breast cancer. Two control groups were used for this case-control study of 25 to 75-year-old breast cancer cases (n = 310). The first group consisted of population controls drawn from the Finnish National Population Register (n = 454). The second group consisted of women who were referred to the same examinations as were the cases because of clinical suspicion of breast disease but who were later diagnosed as healthy (referral controls; n = 506). Because the diagnosis was unknown at the time of interview, it was possible to assess by comparing the two control groups whether the self-reporting of diet changed under the threat of disease. Dietary habits were examined using a validated, self-administered food-frequency questionnaire. Premenopausal women misreported their consumption of liquid milk products, tea, and sugar. Reporting bias was also associated with the intake of fat and vitamins. Postmenopausal women misreported consumption of milk products. When recall bias was taken into consideration, milk was associated with increased risk of premenopausal breast cancer, whereas high consumption of poultry or high intake of monounsaturated fatty acids, n-3 fatty acids, n-6 fatty acids, and vitamin E were related to lower risk. The study suggested that oil, milk, cheese, coffee and beta-carotene may act as protective factors in postmenopausal women, whereas butter and cream may be risk factors for breast cancer. In summary, it is possible that some food items may be overreported or underreported under the threat of disease in health-conscious population. However, most of the results in this study were not modified by recall bias.

  14. Potential for parasite-induced biases in aquatic invertebrate population studies

    Science.gov (United States)

    Fisher, Justin D.L.; Mushet, David M.; Stockwell, Craig A.

    2014-01-01

    Recent studies highlight the need to include estimates of detection/capture probability in population studies. This need is particularly important in studies where detection and/or capture probability is influenced by parasite-induced behavioral alterations. We assessed potential biases associated with sampling a population of the amphipod Gammarus lacustris in the presence of Polymorphus spp. acanthocephalan parasites shown to increase positive phototaxis in their amphipod hosts. We trapped G. lacustris at two water depths (benthic and surface) and compared number of captures and number of parasitized individuals at each depth. While we captured the greatest number of G. lacustris individuals in benthic traps, parasitized individuals were captured most often in surface traps. These results reflect the phototaxic movement of infected individuals from benthic locations to sunlit surface waters. We then explored the influence of varying infection rates on a simulated population held at a constant level of abundance. Simulations resulted in increasingly biased abundance estimates as infection rates increased. Our results highlight the need to consider parasite-induced biases when quantifying detection and/or capture probability in studies of aquatic invertebrate populations.

  15. Data analysis strategies for reducing the influence of the bias in cross-cultural research.

    Science.gov (United States)

    Sindik, Josko

    2012-03-01

    In cross-cultural research, researchers have to adjust the constructs and associated measurement instruments that have been developed in one culture and then imported for use in another culture. Importing concepts from other cultures is often simply reduced to language adjustment of the content in the items of the measurement instruments that define a certain (psychological) construct. In the context of cross-cultural research, test bias can be defined as a generic term for all nuisance factors that threaten the validity of cross-cultural comparisons. Bias can be an indicator that instrument scores based on the same items measure different traits and characteristics across different cultural groups. To reduce construct, method and item bias,the researcher can consider these strategies: (1) simply comparing average results in certain measuring instruments; (2) comparing only the reliability of certain dimensions of the measurement instruments, applied to the "target" and "source" samples of participants, i.e. from different cultures; (3) comparing the "framed" factor structure (fixed number of factors) of the measurement instruments, applied to the samples from the "target" and "source" cultures, using explorative factor analysis strategy on separate samples; (4) comparing the complete constructs ("unframed" factor analysis, i.e. unlimited number of factors) in relation to their best psychometric properties and the possibility of interpreting (best suited to certain cultures, applying explorative strategy of factor analysis); or (5) checking the similarity of the constructs in the samples from different cultures (using structural equation modeling approach). Each approach has its advantages and disadvantages. The advantages and lacks of each approach are discussed.

  16. Statistical methods for elimination of guarantee-time bias in cohort studies: a simulation study

    Directory of Open Access Journals (Sweden)

    In Sung Cho

    2017-08-01

    Full Text Available Abstract Background Aspirin has been considered to be beneficial in preventing cardiovascular diseases and cancer. Several pharmaco-epidemiology cohort studies have shown protective effects of aspirin on diseases using various statistical methods, with the Cox regression model being the most commonly used approach. However, there are some inherent limitations to the conventional Cox regression approach such as guarantee-time bias, resulting in an overestimation of the drug effect. To overcome such limitations, alternative approaches, such as the time-dependent Cox model and landmark methods have been proposed. This study aimed to compare the performance of three methods: Cox regression, time-dependent Cox model and landmark method with different landmark times in order to address the problem of guarantee-time bias. Methods Through statistical modeling and simulation studies, the performance of the above three methods were assessed in terms of type I error, bias, power, and mean squared error (MSE. In addition, the three statistical approaches were applied to a real data example from the Korean National Health Insurance Database. Effect of cumulative rosiglitazone dose on the risk of hepatocellular carcinoma was used as an example for illustration. Results In the simulated data, time-dependent Cox regression outperformed the landmark method in terms of bias and mean squared error but the type I error rates were similar. The results from real-data example showed the same patterns as the simulation findings. Conclusions While both time-dependent Cox regression model and landmark analysis are useful in resolving the problem of guarantee-time bias, time-dependent Cox regression is the most appropriate method for analyzing cumulative dose effects in pharmaco-epidemiological studies.

  17. Bias modification training can alter approach bias and chocolate consumption.

    Science.gov (United States)

    Schumacher, Sophie E; Kemps, Eva; Tiggemann, Marika

    2016-01-01

    Recent evidence has demonstrated that bias modification training has potential to reduce cognitive biases for attractive targets and affect health behaviours. The present study investigated whether cognitive bias modification training could be applied to reduce approach bias for chocolate and affect subsequent chocolate consumption. A sample of 120 women (18-27 years) were randomly assigned to an approach-chocolate condition or avoid-chocolate condition, in which they were trained to approach or avoid pictorial chocolate stimuli, respectively. Training had the predicted effect on approach bias, such that participants trained to approach chocolate demonstrated an increased approach bias to chocolate stimuli whereas participants trained to avoid such stimuli showed a reduced bias. Further, participants trained to avoid chocolate ate significantly less of a chocolate muffin in a subsequent taste test than participants trained to approach chocolate. Theoretically, results provide support for the dual process model's conceptualisation of consumption as being driven by implicit processes such as approach bias. In practice, approach bias modification may be a useful component of interventions designed to curb the consumption of unhealthy foods. Copyright © 2015 Elsevier Ltd. All rights reserved.

  18. The Role of Item Models in Automatic Item Generation

    Science.gov (United States)

    Gierl, Mark J.; Lai, Hollis

    2012-01-01

    Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…

  19. Cognitive interference and a food-related memory bias in binge eating disorder.

    Science.gov (United States)

    Svaldi, Jennifer; Schmitz, Florian; Trentowska, Monika; Tuschen-Caffier, Brunna; Berking, Matthias; Naumann, Eva

    2014-01-01

    The present study was concerned with cognitive interference and a specific memory bias for eating-related stimuli in binge eating disorder (BED). Further objectives were to find out under which circumstances such effects would occur, and whether they are related with each other and with reported severity of BED symptoms. A group of women diagnosed with BED and a matched sample of overweight controls completed two paradigms, an n-back task with lures and a recent-probes task. The BED group generally experienced more interference in the n-back task. Additionally, they revealed selectively increased interference for food items in the recent-probes task. Findings can be reconciled with the view that control functions are generally impaired in BED, and that there is an additional bias for eating-related stimuli, both of which were related with reported severity of BED symptoms. Copyright © 2013 Elsevier Ltd. All rights reserved.

  20. Problems with the factor analysis of items: Solutions based on item response theory and item parcelling

    Directory of Open Access Journals (Sweden)

    Gideon P. De Bruin

    2004-10-01

    Full Text Available The factor analysis of items often produces spurious results in the sense that unidimensional scales appear multidimensional. This may be ascribed to failure in meeting the assumptions of linearity and normality on which factor analysis is based. Item response theory is explicitly designed for the modelling of the non-linear relations between ordinal variables and provides a strong alternative to the factor analysis of items. Items may also be combined in parcels that are more likely to satisfy the assumptions of factor analysis than do the items. The use of the Rasch rating scale model and the factor analysis of parcels is illustrated with data obtained with the Locus of Control Inventory. The results of these analyses are compared with the results obtained through the factor analysis of items. It is shown that the Rasch rating scale model and the factoring of parcels produce superior results to the factor analysis of items. Recommendations for the analysis of scales are made. Opsomming Die faktorontleding van items lewer dikwels misleidende resultate op, veral in die opsig dat eendimensionele skale as meerdimensioneel voorkom. Hierdie resultate kan dikwels daaraan toegeskryf word dat daar nie aan die aannames van lineariteit en normaliteit waarop faktorontleding berus, voldoen word nie. Itemresponsteorie, wat eksplisiet vir die modellering van die nie-liniêre verbande tussen ordinale items ontwerp is, bied ’n aantreklike alternatief vir die faktorontleding van items. Items kan ook in pakkies gegroepeer word wat meer waarskynlik aan die aannames van faktorontleding voldoen as individuele items. Die gebruik van die Rasch beoordelingskaalmodel en die faktorontleding van pakkies word aan die hand van data wat met die Lokus van Beheervraelys verkry is, gedemonstreer. Die resultate van hierdie ontledings word vergelyk met die resultate wat deur ‘n faktorontleding van die individuele items verkry is. Die resultate dui daarop dat die Rasch

  1. Transmission of Cognitive Bias and Fear From Parents to Children: An Experimental Study.

    Science.gov (United States)

    Remmerswaal, Danielle; Muris, Peter; Huijding, Jorg

    2016-01-01

    This study explored the role of parents in the development of a cognitive bias and subsequent fear levels in children. In Experiment 1, nonclinical children ages 8-13 (N = 122) underwent a training during which they worked together with their mothers on an information search task. Mothers received instructions to induce either a positive or negative information search bias in their children. Experiment 2 investigated to what extent mothers own cognitive bias predicted children's information search bias. Mothers of 49 nonclinical children ages 9-12 received no explicit training instructions before working together with their child on an information search task. Experiment 1 demonstrated that mothers had a significant impact on children's cognitive bias and fear. More precisely, children who had received a negative parental training displayed an increase in negative information search bias and fear, whereas children who had received a positive parental training showed an increase in positive information search bias and a decrease in fear. In Experiment 2, it was found that children's information search biases after working together with their mothers were predicted by their mothers' initial cognitive bias scores. These findings can be taken as support for the intergenerational transmission of cognitive biases from mothers to children.

  2. The CogBIAS longitudinal study protocol: cognitive and genetic factors influencing psychological functioning in adolescence.

    Science.gov (United States)

    Booth, Charlotte; Songco, Annabel; Parsons, Sam; Heathcote, Lauren; Vincent, John; Keers, Robert; Fox, Elaine

    2017-12-29

    Optimal psychological development is dependent upon a complex interplay between individual and situational factors. Investigating the development of these factors in adolescence will help to improve understanding of emotional vulnerability and resilience. The CogBIAS longitudinal study (CogBIAS-L-S) aims to combine cognitive and genetic approaches to investigate risk and protective factors associated with the development of mood and impulsivity-related outcomes in an adolescent sample. CogBIAS-L-S is a three-wave longitudinal study of typically developing adolescents conducted over 4 years, with data collection at age 12, 14 and 16. At each wave participants will undergo multiple assessments including a range of selective cognitive processing tasks (e.g. attention bias, interpretation bias, memory bias) and psychological self-report measures (e.g. anxiety, depression, resilience). Saliva samples will also be collected at the baseline assessment for genetic analyses. Multilevel statistical analyses will be performed to investigate the developmental trajectory of cognitive biases on psychological functioning, as well as the influence of genetic moderation on these relationships. CogBIAS-L-S represents the first longitudinal study to assess multiple cognitive biases across adolescent development and the largest study of its kind to collect genetic data. It therefore provides a unique opportunity to understand how genes and the environment influence the development and maintenance of cognitive biases and provide insight into risk and protective factors that may be key targets for intervention.

  3. Mean size estimation yields left-side bias: Role of attention on perceptual averaging.

    Science.gov (United States)

    Li, Kuei-An; Yeh, Su-Ling

    2017-11-01

    The human visual system can estimate mean size of a set of items effectively; however, little is known about whether information on each visual field contributes equally to the mean size estimation. In this study, we examined whether a left-side bias (LSB)-perceptual judgment tends to depend more heavily on left visual field's inputs-affects mean size estimation. Participants were instructed to estimate the mean size of 16 spots. In half of the trials, the mean size of the spots on the left side was larger than that on the right side (the left-larger condition) and vice versa (the right-larger condition). Our results illustrated an LSB: A larger estimated mean size was found in the left-larger condition than in the right-larger condition (Experiment 1), and the LSB vanished when participants' attention was effectively cued to the right side (Experiment 2b). Furthermore, the magnitude of LSB increased with stimulus-onset asynchrony (SOA), when spots on the left side were presented earlier than the right side. In contrast, the LSB vanished and then induced a reversed effect with SOA when spots on the right side were presented earlier (Experiment 3). This study offers the first piece of evidence suggesting that LSB does have a significant influence on mean size estimation of a group of items, which is induced by a leftward attentional bias that enhances the prior entry effect on the left side.

  4. Statistical and extra-statistical considerations in differential item functioning analyses

    Directory of Open Access Journals (Sweden)

    G. K. Huysamen

    2004-10-01

    Full Text Available This article briefly describes the main procedures for performing differential item functioning (DIF analyses and points out some of the statistical and extra-statistical implications of these methods. Research findings on the sources of DIF, including those associated with translated tests, are reviewed. As DIF analyses are oblivious of correlations between a test and relevant criteria, the elimination of differentially functioning items does not necessarily improve predictive validity or reduce any predictive bias. The implications of the results of past DIF research for test development in the multilingual and multi-cultural South African society are considered. Opsomming Hierdie artikel beskryf kortliks die hoofprosedures vir die ontleding van differensiële itemfunksionering (DIF en verwys na sommige van die statistiese en buite-statistiese implikasies van hierdie metodes. ’n Oorsig word verskaf van navorsingsbevindings oor die bronne van DIF, insluitend dié by vertaalde toetse. Omdat DIF-ontledings nie die korrelasies tussen ’n toets en relevante kriteria in ag neem nie, sal die verwydering van differensieel-funksionerende items nie noodwendig voorspellingsgeldigheid verbeter of voorspellingsydigheid verminder nie. Die implikasies van vorige DIF-navorsingsbevindings vir toetsontwikkeling in die veeltalige en multikulturele Suid-Afrikaanse gemeenskap word oorweeg.

  5. The Technical Quality of Test Items Generated Using a Systematic Approach to Item Writing.

    Science.gov (United States)

    Siskind, Theresa G.; Anderson, Lorin W.

    The study was designed to examine the similarity of response options generated by different item writers using a systematic approach to item writing. The similarity of response options to student responses for the same item stems presented in an open-ended format was also examined. A non-systematic (subject matter expertise) approach and a…

  6. Item information and discrimination functions for trinary PCM items

    NARCIS (Netherlands)

    Akkermans, Wies; Muraki, Eiji

    1997-01-01

    For trinary partial credit items the shape of the item information and the item discrimination function is examined in relation to the item parameters. In particular, it is shown that these functions are unimodal if δ2 – δ1 < 4 ln 2 and bimodal otherwise. The locations and values of the maxima are

  7. [Unfolding item response model using best-worst scaling].

    Science.gov (United States)

    Ikehara, Kazuya

    2015-02-01

    In attitude measurement and sensory tests, the unfolding model is typically used. In this model, response probability is formulated by the distance between the person and the stimulus. In this study, we proposed an unfolding item response model using best-worst scaling (BWU model), in which a person chooses the best and worst stimulus among repeatedly presented subsets of stimuli. We also formulated an unfolding model using best scaling (BU model), and compared the accuracy of estimates between the BU and BWU models. A simulation experiment showed that the BWU modell performed much better than the BU model in terms of bias and root mean square errors of estimates. With reference to Usami (2011), the proposed models were apllied to actual data to measure attitudes toward tardiness. Results indicated high similarity between stimuli estimates generated with the proposed models and those of Usami (2011).

  8. Empirical evidence of study design biases in randomized trials

    DEFF Research Database (Denmark)

    Page, Matthew J.; Higgins, Julian P. T.; Clayton, Gemma

    2016-01-01

    search September 2012), and searched Ovid MEDLINE and Ovid EMBASE for studies indexed from Jan 2012-May 2015. Data were extracted by one author and verified by another. We combined estimates of average bias (e.g. ratio of odds ratios (ROR) or difference in standardised mean differences (dSMD)) in meta......-analyses using the random-effects model. Analyses were stratified by type of outcome ("mortality" versus "other objective" versus "subjective"). Direction of effect was standardised so that ROR SMD ... studies). For these characteristics, the average bias appeared to be larger in trials of subjective outcomes compared with other objective outcomes. Also, intervention effects for subjective outcomes appear to be exaggerated in trials with lack of/unclear blinding of participants (versus blinding) (dSMD...

  9. Hemispheric biases and the control of visuospatial attention: an ERP study

    Directory of Open Access Journals (Sweden)

    Banich Marie T

    2005-08-01

    Full Text Available Abstract Background We examined whether individual differences in hemispheric utilization can interact with the intrinsic attentional biases of the cerebral hemispheres. Evidence suggests that the hemispheres have competing biases to direct attention contralaterally, with the left hemisphere (LH having a stronger bias than the right hemisphere. There is also evidence that individuals have characteristic biases to utilize one hemisphere more than the other for processing information, which can induce a bias to direct attention to contralateral space. We predicted that LH-biased individuals would display a strong rightward attentional bias, which would create difficulty in selectively attending to target stimuli in the left visual field (LVF as compared to right in the performance of a bilateral flanker task. Results Consistent with our hypothesis, flanker interference effects were found on the N2c event-related brain potential and error rate for LH-biased individuals in the Attend-LVF condition. The error rate effect was correlated with the degree of hemispheric utilization bias for the LH-Bias group. Conclusion We conclude that hemispheric utilization bias can enhance a hemisphere's contralateral attentional bias, at least for individuals with a LH utilization bias. Hemispheric utilization bias may play an important and largely unrecognized role in visuospatial attention.

  10. Investigating Separate and Concurrent Approaches for Item Parameter Drift in 3PL Item Response Theory Equating

    Science.gov (United States)

    Arce-Ferrer, Alvaro J.; Bulut, Okan

    2017-01-01

    This study examines separate and concurrent approaches to combine the detection of item parameter drift (IPD) and the estimation of scale transformation coefficients in the context of the common item nonequivalent groups design with the three-parameter item response theory equating. The study uses real and synthetic data sets to compare the two…

  11. Complement or Contamination: A Study of the Validity of Multiple-Choice Items when Assessing Reasoning Skills in Physics

    OpenAIRE

    Anders Jönsson; David Rosenlund; Fredrik Alvén

    2017-01-01

    The purpose of this study is to investigate the validity of using multiple-choice (MC) items as a complement to constructed-response (CR) items when making decisions about student performance on reasoning tasks. CR items from a national test in physics have been reformulated into MC items and students’ reasoning skills have been analyzed in two substudies. In the first study, 12 students answered the MC items and were asked to explain their answers orally. In the second study, 102 students fr...

  12. Practical Considerations about Expected A Posteriori Estimation in Adaptive Testing: Adaptive A Priori, Adaptive Correction for Bias, and Adaptive Integration Interval.

    Science.gov (United States)

    Raiche, Gilles; Blais, Jean-Guy

    In a computerized adaptive test (CAT), it would be desirable to obtain an acceptable precision of the proficiency level estimate using an optimal number of items. Decreasing the number of items is accompanied, however, by a certain degree of bias when the true proficiency level differs significantly from the a priori estimate. G. Raiche (2000) has…

  13. Item level diagnostics and model - data fit in item response theory ...

    African Journals Online (AJOL)

    Item response theory (IRT) is a framework for modeling and analyzing item response data. Item-level modeling gives IRT advantages over classical test theory. The fit of an item score pattern to an item response theory (IRT) models is a necessary condition that must be assessed for further use of item and models that best fit ...

  14. Beyond attentional bias: a perceptual bias in a dot-probe task.

    Science.gov (United States)

    Bocanegra, Bruno R; Huijding, Jorg; Zeelenberg, René

    2012-12-01

    Previous dot-probe studies indicate that threat-related face cues induce a bias in spatial attention. Independently of spatial attention, a recent psychophysical study suggests that a bilateral fearful face cue improves low spatial-frequency perception (LSF) and impairs high spatial-frequency perception (HSF). Here, we combine these separate lines of research within a single dot-probe paradigm. We found that a bilateral fearful face cue, compared with a bilateral neutral face cue, speeded up responses to LSF targets and slowed down responses to HSF targets. This finding is important, as it shows that emotional cues in dot-probe tasks not only bias where information is preferentially processed (i.e., an attentional bias in spatial location), but also bias what type of information is preferentially processed (i.e., a perceptual bias in spatial frequency). PsycINFO Database Record (c) 2012 APA, all rights reserved.

  15. Health numeracy in Japan: measures of basic numeracy account for framing bias in a highly numerate population

    Directory of Open Access Journals (Sweden)

    Okamoto Masako

    2012-09-01

    Full Text Available Abstract Background Health numeracy is an important factor in how well people make decisions based on medical risk information. However, in many countries, including Japan, numeracy studies have been limited. Methods To fill this gap, we evaluated health numeracy levels in a sample of Japanese adults by translating two well-known scales that objectively measure basic understanding of math and probability: the 3-item numeracy scale developed by Schwartz and colleagues (the Schwartz scale and its expanded version, the 11-item numeracy scale developed by Lipkus and colleagues (the Lipkus scale. Results Participants’ performances (n = 300 on the scales were much higher than in original studies conducted in the United States (80% average item-wise correct response rate for Schwartz-J, and 87% for Lipkus-J. This high performance resulted in a ceiling effect on the distributions of both scores, which made it difficult to apply parametric statistical analysis, and limited the interpretation of statistical results. Nevertheless, the data provided some evidence for the reliability and validity of these scales: The reliability of the Japanese versions (Schwartz-J and Lipkus-J was comparable to the original in terms of their internal consistency (Cronbach’s α = 0.53 for Schwartz-J and 0.72 for Lipkus-J. Convergent validity was suggested by positive correlations with an existing Japanese health literacy measure (the Test for Ability to Interpret Medical Information developed by Takahashi and colleagues that contains some items relevant to numeracy. Furthermore, as shown in the previous studies, health numeracy was still associated with framing bias with individuals whose Lipkus-J performance was below the median being significantly influenced by how probability was framed when they rated surgical risks. A significant association was also found using Schwartz-J, which consisted of only three items. Conclusions Despite relatively high levels of

  16. Reducing selection bias in case-control studies from rare disease registries.

    Science.gov (United States)

    Cole, J Alexander; Taylor, John S; Hangartner, Thomas N; Weinreb, Neal J; Mistry, Pramod K; Khan, Aneal

    2011-09-12

    In clinical research of rare diseases, where small patient numbers and disease heterogeneity limit study design options, registries are a valuable resource for demographic and outcome information. However, in contrast to prospective, randomized clinical trials, the observational design of registries is prone to introduce selection bias and negatively impact the validity of data analyses. The objective of the study was to demonstrate the utility of case-control matching and the risk-set method in order to control bias in data from a rare disease registry. Data from the International Collaborative Gaucher Group (ICGG) Gaucher Registry were used as an example. A case-control matching analysis using the risk-set method was conducted to identify two groups of patients with type 1 Gaucher disease in the ICGG Gaucher Registry: patients with avascular osteonecrosis (AVN) and those without AVN. The frequency distributions of gender, decade of birth, treatment status, and splenectomy status were presented for cases and controls before and after matching. Odds ratios (and 95% confidence intervals) were calculated for each variable before and after matching. The application of case-control matching methodology results in cohorts of cases (i.e., patients with AVN) and controls (i.e., patients without AVN) who have comparable distributions for four common parameters used in subject selection: gender, year of birth (age), treatment status, and splenectomy status. Matching resulted in odds ratios of approximately 1.00, indicating no bias. We demonstrated bias in case-control selection in subjects from a prototype rare disease registry and used case-control matching to minimize this bias. Therefore, this approach appears useful to study cohorts of heterogeneous patients in rare disease registries.

  17. Is bias in the eye of the beholder? A vignette study to assess recognition of cognitive biases in clinical case workups.

    Science.gov (United States)

    Zwaan, Laura; Monteiro, Sandra; Sherbino, Jonathan; Ilgen, Jonathan; Howey, Betty; Norman, Geoffrey

    2017-02-01

    Many authors have implicated cognitive biases as a primary cause of diagnostic error. If this is so, then physicians already familiar with common cognitive biases should consistently identify biases present in a clinical workup. The aim of this paper is to determine whether physicians agree on the presence or absence of particular biases in a clinical case workup and how case outcome knowledge affects bias identification. We conducted a web survey of 37 physicians. Each participant read eight cases and listed which biases were present from a list provided. In half the cases the outcome implied a correct diagnosis; in the other half, it implied an incorrect diagnosis. We compared the number of biases identified when the outcome implied a correct or incorrect primary diagnosis. Additionally, the agreement among participants about presence or absence of specific biases was assessed. When the case outcome implied a correct diagnosis, an average of 1.75 cognitive biases were reported; when incorrect, 3.45 biases (F=71.3, p<0.00001). Individual biases were reported from 73% to 125% more often when an incorrect diagnosis was implied. There was no agreement on presence or absence of individual biases, with κ ranging from 0.000 to 0.044. Individual physicians are unable to agree on the presence or absence of individual cognitive biases. Their judgements are heavily influenced by hindsight bias; when the outcome implies a diagnostic error, twice as many biases are identified. The results present challenges for current error reduction strategies based on identification of cognitive biases. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.

  18. Quantifying the Association of Self-Enhancement Bias With Self-Ratings of Personality and Life Satisfaction.

    Science.gov (United States)

    Leising, Daniel; Locke, Kenneth D; Kurzius, Elena; Zimmermann, Johannes

    2016-10-01

    Kwan, John, Kenny, Bond, and Robins conceptualize self-enhancement as a favorable comparison of self-judgments with judgments of and by others. Applying a modified version of Kwan et al.'s approach to behavior observation data, we show that the resulting measure of self-enhancement bias is highly reliable, predicts self-ratings of intelligence as well as does actual intelligence, interacts with item desirability in predicting responses to questionnaire items, and also predicts general life satisfaction. Consistent with previous research, however, self-ratings of intelligence did not become more valid when controlling for self-enhancement bias. We also show that common personality scales like the Rosenberg Self-Esteem Scale reflect self-enhancement at least as strongly as do scales that were designed particularly for that purpose (i.e., "social desirability scales"). The relevance of these findings in regard to the validity and utility of social desirability scales is discussed. © The Author(s) 2015.

  19. Dissociating the neural correlates of intra-item and inter-item working-memory binding.

    Directory of Open Access Journals (Sweden)

    Carinne Piekema

    Full Text Available BACKGROUND: Integration of information streams into a unitary representation is an important task of our cognitive system. Within working memory, the medial temporal lobe (MTL has been conceptually linked to the maintenance of bound representations. In a previous fMRI study, we have shown that the MTL is indeed more active during working-memory maintenance of spatial associations as compared to non-spatial associations or single items. There are two explanations for this result, the mere presence of the spatial component activates the MTL, or the MTL is recruited to bind associations between neurally non-overlapping representations. METHODOLOGY/PRINCIPAL FINDINGS: The current fMRI study investigates this issue further by directly comparing intrinsic intra-item binding (object/colour, extrinsic intra-item binding (object/location, and inter-item binding (object/object. The three binding conditions resulted in differential activation of brain regions. Specifically, we show that the MTL is important for establishing extrinsic intra-item associations and inter-item associations, in line with the notion that binding of information processed in different brain regions depends on the MTL. CONCLUSIONS/SIGNIFICANCE: Our findings indicate that different forms of working-memory binding rely on specific neural structures. In addition, these results extend previous reports indicating that the MTL is implicated in working-memory maintenance, challenging the classic distinction between short-term and long-term memory systems.

  20. Analysis of sensitive questions across cultures : An application of multigroup item randomized response theory to sexual attitudes and behavior

    NARCIS (Netherlands)

    de Jong, M.G.; Pieters, R.; Stremersch, S.

    2012-01-01

    Answers to sensitive questions are prone to social desirability bias. If not properly addressed, the validity of the research can be suspect. This article presents multigroup item randomized response theory (MIRRT) to measure self-reported sensitive topics across cultures. The method was

  1. Analysis of differential item functioning in the depression item bank from the Patient Reported Outcome Measurement Information System (PROMIS: An item response theory approach

    Directory of Open Access Journals (Sweden)

    JOSEPH P. EIMICKE

    2009-06-01

    Full Text Available The aims of this paper are to present findings related to differential item functioning (DIF in the Patient Reported Outcome Measurement Information System (PROMIS depression item bank, and to discuss potential threats to the validity of results from studies of DIF. The 32 depression items studied were modified from several widely used instruments. DIF analyses of gender, age and education were performed using a sample of 735 individuals recruited by a survey polling firm. DIF hypotheses were generated by asking content experts to indicate whether or not they expected DIF to be present, and the direction of the DIF with respect to the studied comparison groups. Primary analyses were conducted using the graded item response model (for polytomous, ordered response category data with likelihood ratio tests of DIF, accompanied by magnitude measures. Sensitivity analyses were performed using other item response models and approaches to DIF detection. Despite some caveats, the items that are recommended for exclusion or for separate calibration were "I felt like crying" and "I had trouble enjoying things that I used to enjoy." The item, "I felt I had no energy," was also flagged as evidencing DIF, and recommended for additional review. On the one hand, false DIF detection (Type 1 error was controlled to the extent possible by ensuring model fit and purification. On the other hand, power for DIF detection might have been compromised by several factors, including sparse data and small sample sizes. Nonetheless, practical and not just statistical significance should be considered. In this case the overall magnitude and impact of DIF was small for the groups studied, although impact was relatively large for some individuals.

  2. CPI Bias in Korea

    Directory of Open Access Journals (Sweden)

    Chul Chung

    2007-12-01

    Full Text Available We estimate the CPI bias in Korea by employing the approach of Engel’s Law as suggested by Hamilton (2001. This paper is the first attempt to estimate the bias using Korean panel data, Korean Labor and Income Panel Study(KLIPS. Following Hamilton’s model with non­linear specification correction, our estimation result shows that the cumulative CPI bias over the sample period (2000-2005 was 0.7 percent annually. This CPI bias implies that about 21 percent of the inflation rate during the period can be attributed to the bias. In light of purchasing power parity, we provide an interpretation of the estimated bias.

  3. The Effects of Test Length and Sample Size on Item Parameters in Item Response Theory

    Science.gov (United States)

    Sahin, Alper; Anil, Duygu

    2017-01-01

    This study investigates the effects of sample size and test length on item-parameter estimation in test development utilizing three unidimensional dichotomous models of item response theory (IRT). For this purpose, a real language test comprised of 50 items was administered to 6,288 students. Data from this test was used to obtain data sets of…

  4. Cascading corruption news: explaining the bias of media attention to Brazil’s political scandals

    Directory of Open Access Journals (Sweden)

    Mads Damgaard

    Full Text Available Abstract Through a content analysis of 8,800 news items and six months of front pages of three Brazilian newspapers, all dealing with corruption and political transgression, the present article documents the remarkable bias of media coverage toward corruption scandals. Said bias is examined as an informational phenomenon, arising from key systemic and commercial factors of Brazil’s news media: an information cascade of news on corruption formed, destabilizing the governing coalition and legitimizing the impeachment process of Dilma Rousseff. As this process gained momentum, questions of accountability were disregarded by the media, with harmful effects for democracy.

  5. Bias analysis applied to Agricultural Health Study publications to estimate non-random sources of uncertainty.

    Science.gov (United States)

    Lash, Timothy L

    2007-11-26

    The associations of pesticide exposure with disease outcomes are estimated without the benefit of a randomized design. For this reason and others, these studies are susceptible to systematic errors. I analyzed studies of the associations between alachlor and glyphosate exposure and cancer incidence, both derived from the Agricultural Health Study cohort, to quantify the bias and uncertainty potentially attributable to systematic error. For each study, I identified the prominent result and important sources of systematic error that might affect it. I assigned probability distributions to the bias parameters that allow quantification of the bias, drew a value at random from each assigned distribution, and calculated the estimate of effect adjusted for the biases. By repeating the draw and adjustment process over multiple iterations, I generated a frequency distribution of adjusted results, from which I obtained a point estimate and simulation interval. These methods were applied without access to the primary record-level dataset. The conventional estimates of effect associating alachlor and glyphosate exposure with cancer incidence were likely biased away from the null and understated the uncertainty by quantifying only random error. For example, the conventional p-value for a test of trend in the alachlor study equaled 0.02, whereas fewer than 20% of the bias analysis iterations yielded a p-value of 0.02 or lower. Similarly, the conventional fully-adjusted result associating glyphosate exposure with multiple myleoma equaled 2.6 with 95% confidence interval of 0.7 to 9.4. The frequency distribution generated by the bias analysis yielded a median hazard ratio equal to 1.5 with 95% simulation interval of 0.4 to 8.9, which was 66% wider than the conventional interval. Bias analysis provides a more complete picture of true uncertainty than conventional frequentist statistical analysis accompanied by a qualitative description of study limitations. The latter approach is

  6. Bias analysis applied to Agricultural Health Study publications to estimate non-random sources of uncertainty

    Directory of Open Access Journals (Sweden)

    Lash Timothy L

    2007-11-01

    Full Text Available Abstract Background The associations of pesticide exposure with disease outcomes are estimated without the benefit of a randomized design. For this reason and others, these studies are susceptible to systematic errors. I analyzed studies of the associations between alachlor and glyphosate exposure and cancer incidence, both derived from the Agricultural Health Study cohort, to quantify the bias and uncertainty potentially attributable to systematic error. Methods For each study, I identified the prominent result and important sources of systematic error that might affect it. I assigned probability distributions to the bias parameters that allow quantification of the bias, drew a value at random from each assigned distribution, and calculated the estimate of effect adjusted for the biases. By repeating the draw and adjustment process over multiple iterations, I generated a frequency distribution of adjusted results, from which I obtained a point estimate and simulation interval. These methods were applied without access to the primary record-level dataset. Results The conventional estimates of effect associating alachlor and glyphosate exposure with cancer incidence were likely biased away from the null and understated the uncertainty by quantifying only random error. For example, the conventional p-value for a test of trend in the alachlor study equaled 0.02, whereas fewer than 20% of the bias analysis iterations yielded a p-value of 0.02 or lower. Similarly, the conventional fully-adjusted result associating glyphosate exposure with multiple myleoma equaled 2.6 with 95% confidence interval of 0.7 to 9.4. The frequency distribution generated by the bias analysis yielded a median hazard ratio equal to 1.5 with 95% simulation interval of 0.4 to 8.9, which was 66% wider than the conventional interval. Conclusion Bias analysis provides a more complete picture of true uncertainty than conventional frequentist statistical analysis accompanied by a

  7. Multiple sensitive estimation and optimal sample size allocation in the item sum technique.

    Science.gov (United States)

    Perri, Pier Francesco; Rueda García, María Del Mar; Cobo Rodríguez, Beatriz

    2018-01-01

    For surveys of sensitive issues in life sciences, statistical procedures can be used to reduce nonresponse and social desirability response bias. Both of these phenomena provoke nonsampling errors that are difficult to deal with and can seriously flaw the validity of the analyses. The item sum technique (IST) is a very recent indirect questioning method derived from the item count technique that seeks to procure more reliable responses on quantitative items than direct questioning while preserving respondents' anonymity. This article addresses two important questions concerning the IST: (i) its implementation when two or more sensitive variables are investigated and efficient estimates of their unknown population means are required; (ii) the determination of the optimal sample size to achieve minimum variance estimates. These aspects are of great relevance for survey practitioners engaged in sensitive research and, to the best of our knowledge, were not studied so far. In this article, theoretical results for multiple estimation and optimal allocation are obtained under a generic sampling design and then particularized to simple random sampling and stratified sampling designs. Theoretical considerations are integrated with a number of simulation studies based on data from two real surveys and conducted to ascertain the efficiency gain derived from optimal allocation in different situations. One of the surveys concerns cannabis consumption among university students. Our findings highlight some methodological advances that can be obtained in life sciences IST surveys when optimal allocation is achieved. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  8. Length bias correction in one-day cross-sectional assessments - The nutritionDay study.

    Science.gov (United States)

    Frantal, Sophie; Pernicka, Elisabeth; Hiesmayr, Michael; Schindler, Karin; Bauer, Peter

    2016-04-01

    A major problem occurring in cross-sectional studies is sampling bias. Length of hospital stay (LOS) differs strongly between patients and causes a length bias as patients with longer LOS are more likely to be included and are therefore overrepresented in this type of study. To adjust for the length bias higher weights are allocated to patients with shorter LOS. We determined the effect of length-bias adjustment in two independent populations. Length-bias correction is applied to the data of the nutritionDay project, a one-day multinational cross-sectional audit capturing data on disease and nutrition of patients admitted to hospital wards with right-censoring after 30 days follow-up. We applied the weighting method for estimating the distribution function of patient baseline variables based on the method of non-parametric maximum likelihood. Results are validated using data from all patients admitted to the General Hospital of Vienna between 2005 and 2009, where the distribution of LOS can be assumed to be known. Additionally, a simplified calculation scheme for estimating the adjusted distribution function of LOS is demonstrated on a small patient example. The crude median (lower quartile; upper quartile) LOS in the cross-sectional sample was 14 (8; 24) and decreased to 7 (4; 12) when adjusted. Hence, adjustment for length bias in cross-sectional studies is essential to get appropriate estimates. Copyright © 2015 Elsevier Ltd and European Society for Clinical Nutrition and Metabolism. All rights reserved.

  9. Surveying for "artifacts": the susceptibility of the OCB-performance evaluation relationship to common rater, item, and measurement context effects.

    Science.gov (United States)

    Podsakoff, Nathan P; Whiting, Steven W; Welsh, David T; Mai, Ke Michael

    2013-09-01

    Despite the increased attention paid to biases attributable to common method variance (CMV) over the past 50 years, researchers have only recently begun to systematically examine the effect of specific sources of CMV in previously published empirical studies. Our study contributes to this research by examining the extent to which common rater, item, and measurement context characteristics bias the relationships between organizational citizenship behaviors and performance evaluations using a mixed-effects analytic technique. Results from 173 correlations reported in 81 empirical studies (N = 31,146) indicate that even after controlling for study-level factors, common rater and anchor point number similarity substantially biased the focal correlations. Indeed, these sources of CMV (a) led to estimates that were between 60% and 96% larger when comparing measures obtained from a common rater, versus different raters; (b) led to 39% larger estimates when a common source rated the scales using the same number, versus a different number, of anchor points; and (c) when taken together with other study-level predictors, accounted for over half of the between-study variance in the focal correlations. We discuss the implications for researchers and practitioners and provide recommendations for future research. PsycINFO Database Record (c) 2013 APA, all rights reserved

  10. Ethnic bias and clinical decision-making among New Zealand medical students: an observational study.

    Science.gov (United States)

    Harris, Ricci; Cormack, Donna; Stanley, James; Curtis, Elana; Jones, Rhys; Lacey, Cameron

    2018-01-23

    Health professional racial/ethnic bias may impact on clinical decision-making and contribute to subsequent ethnic health inequities. However, limited research has been undertaken among medical students. This paper presents findings from the Bias and Decision-Making in Medicine (BDMM) study, which sought to examine ethnic bias (Māori (indigenous peoples) compared with New Zealand European) among medical students and associations with clinical decision-making. All final year New Zealand (NZ) medical students in 2014 and 2015 (n = 888) were invited to participate in a cross-sectional online study. Key components included: two chronic disease vignettes (cardiovascular disease (CVD) and depression) with randomized patient ethnicity (Māori or NZ European) and questions on patient management; implicit bias measures (an ethnicity preference Implicit Association Test (IAT) and an ethnicity and compliant patient IAT); and, explicit ethnic bias questions. Associations between ethnic bias and clinical decision-making responses to vignettes were tested using linear regression. Three hundred and two students participated (34% response rate). Implicit and explicit ethnic bias favoring NZ Europeans was apparent among medical students. In the CVD vignette, no significant differences in clinical decision-making by patient ethnicity were observed. There were also no differential associations by patient ethnicity between any measures of ethnic bias (implicit or explicit) and patient management responses in the CVD vignette. In the depression vignette, some differences in the ranking of recommended treatment options were observed by patient ethnicity and explicit preference for NZ Europeans was associated with increased reporting that NZ European patients would benefit from treatment but not Māori (slope difference 0.34, 95% CI 0.08, 0.60; p = 0.011), although this was the only significant finding in these analyses. NZ medical students demonstrated ethnic bias, although

  11. Colour vision and response bias in a coral reef fish.

    Science.gov (United States)

    Cheney, Karen L; Newport, Cait; McClure, Eva C; Marshall, N Justin

    2013-08-01

    Animals use coloured signals for a variety of communication purposes, including to attract potential mates, recognize individuals, defend territories and warn predators of secondary defences (aposematism). To understand the mechanisms that drive the evolution and design of such visual signals, it is important to understand the visual systems and potential response biases of signal receivers. Here, we provide raw data on the spectral capabilities of a coral reef fish, the Picasso triggerfish Rhinecanthus aculeatus, which is potentially trichromatic with three cone sensitivities of 413 nm (single cone), 480 nm (double cone, medium sensitivity) and 528 nm (double cone, long sensitivity), and a rod sensitivity of 498 nm. The ocular media have a 50% transmission cut off at 405 nm. Behavioural experiments confirmed colour vision over their spectral range; triggerfish were significantly more likely to choose coloured stimuli over grey distractors, irrespective of luminance. We then examined whether response biases existed towards coloured and patterned stimuli to provide insight into how visual signals - in particular, aposematic colouration - may evolve. Triggerfish showed a preferential foraging response bias to red and green stimuli, in contrast to blue and yellow, irrespective of pattern. There was no response bias to patterned over monochromatic non-patterned stimuli. A foraging response bias towards red in fish differs from that of avian predators, who often avoid red food items. Red is frequently associated with warning colouration in terrestrial environments (ladybirds, snakes, frogs), whilst blue is used in aquatic environments (blue-ringed octopus, nudibranchs); whether the design of warning (aposematic) displays is a cause or consequence of response biases is unclear.

  12. Forced-Choice Assessment of Work-Related Maladaptive Personality Traits: Preliminary Evidence From an Application of Thurstonian Item Response Modeling.

    Science.gov (United States)

    Guenole, Nigel; Brown, Anna A; Cooper, Andrew J

    2018-06-01

    This article describes an investigation of whether Thurstonian item response modeling is a viable method for assessment of maladaptive traits. Forced-choice responses from 420 working adults to a broad-range personality inventory assessing six maladaptive traits were considered. The Thurstonian item response model's fit to the forced-choice data was adequate, while the fit of a counterpart item response model to responses to the same items but arranged in a single-stimulus design was poor. Monotrait heteromethod correlations indicated corresponding traits in the two formats overlapped substantially, although they did not measure equivalent constructs. A better goodness of fit and higher factor loadings for the Thurstonian item response model, coupled with a clearer conceptual alignment to the theoretical trait definitions, suggested that the single-stimulus item responses were influenced by biases that the independent clusters measurement model did not account for. Researchers may wish to consider forced-choice designs and appropriate item response modeling techniques such as Thurstonian item response modeling for personality questionnaire applications in industrial psychology, especially when assessing maladaptive traits. We recommend further investigation of this approach in actual selection situations and with different assessment instruments.

  13. Reducing selection bias in case-control studies from rare disease registries

    Directory of Open Access Journals (Sweden)

    Mistry Pramod K

    2011-09-01

    Full Text Available Abstract Background In clinical research of rare diseases, where small patient numbers and disease heterogeneity limit study design options, registries are a valuable resource for demographic and outcome information. However, in contrast to prospective, randomized clinical trials, the observational design of registries is prone to introduce selection bias and negatively impact the validity of data analyses. The objective of the study was to demonstrate the utility of case-control matching and the risk-set method in order to control bias in data from a rare disease registry. Data from the International Collaborative Gaucher Group (ICGG Gaucher Registry were used as an example. Methods A case-control matching analysis using the risk-set method was conducted to identify two groups of patients with type 1 Gaucher disease in the ICGG Gaucher Registry: patients with avascular osteonecrosis (AVN and those without AVN. The frequency distributions of gender, decade of birth, treatment status, and splenectomy status were presented for cases and controls before and after matching. Odds ratios (and 95% confidence intervals were calculated for each variable before and after matching. Results The application of case-control matching methodology results in cohorts of cases (i.e., patients with AVN and controls (i.e., patients without AVN who have comparable distributions for four common parameters used in subject selection: gender, year of birth (age, treatment status, and splenectomy status. Matching resulted in odds ratios of approximately 1.00, indicating no bias. Conclusions We demonstrated bias in case-control selection in subjects from a prototype rare disease registry and used case-control matching to minimize this bias. Therefore, this approach appears useful to study cohorts of heterogeneous patients in rare disease registries.

  14. Statistical methods to correct for verification bias in diagnostic studies are inadequate when there are few false negatives: a simulation study

    Directory of Open Access Journals (Sweden)

    Vickers Andrew J

    2008-11-01

    Full Text Available Abstract Background A common feature of diagnostic research is that results for a diagnostic gold standard are available primarily for patients who are positive for the test under investigation. Data from such studies are subject to what has been termed "verification bias". We evaluated statistical methods for verification bias correction when there are few false negatives. Methods A simulation study was conducted of a screening study subject to verification bias. We compared estimates of the area-under-the-curve (AUC corrected for verification bias varying both the rate and mechanism of verification. Results In a single simulated data set, varying false negatives from 0 to 4 led to verification bias corrected AUCs ranging from 0.550 to 0.852. Excess variation associated with low numbers of false negatives was confirmed in simulation studies and by analyses of published studies that incorporated verification bias correction. The 2.5th – 97.5th centile range constituted as much as 60% of the possible range of AUCs for some simulations. Conclusion Screening programs are designed such that there are few false negatives. Standard statistical methods for verification bias correction are inadequate in this circumstance.

  15. Memory bias for emotional and illness-related words in patients with depression, anxiety and somatization disorders: an investigation with the directed forgetting task.

    Science.gov (United States)

    Wingenfeld, Katja; Terfehr, Kirsten; Meyer, Björn; Löwe, Bernd; Spitzer, Carsten

    2013-01-01

    Memory bias to emotion- and illness-related information plays a prominent role in many mental disorders, particularly major depressive disorder, anxiety disorders and somatoform disorder. The current study aimed to investigate memory bias in different mental disorders by using neutral, emotionally valenced and illness-related word stimuli in a directed forgetting task. Seventy-eight inpatients from a university-based psychosomatic hospital participated in the study. The item method of the directed forgetting task was used, in which participants are instructed to either forget or remember each item immediately after it has been presented. Memory performance was tested with a free recall test. Overall, 36 words were presented - 6 from each of 6 categories: neutral, negative, positive, illness related ('somatoform'), depression related, and anxiety related. Three words of each category were to be remembered and 3 were to be forgotten. Independently of the patients' diagnoses, we found that most patients had relative difficulties remembering anxiety- and depression-related words, compared to neutral words, when they were instructed to remember them. By contrast, in the 'instructed forgetting' condition, patients showed deficits in the ability to forget illness-related stimuli relative to neutral material. These effects were unspecific with regard to diagnosis. The results in the 'instructed remembering' condition might be interpreted in the context of cognitive avoidance instead of a memory bias. In the 'instructed forgetting' condition, it appeared that illness-related words were more difficult to suppress compared to the other word types, which could explain the observed memory bias. Copyright © 2012 S. Karger AG, Basel.

  16. The specificity of attentional biases by type of gambling: An eye-tracking study.

    Directory of Open Access Journals (Sweden)

    Daniel S McGrath

    Full Text Available A growing body of research indicates that gamblers develop an attentional bias for gambling-related stimuli. Compared to research on substance use, however, few studies have examined attentional biases in gamblers using eye-gaze tracking, which has many advantages over other measures of attention. In addition, previous studies of attentional biases in gamblers have not directly matched type of gambler with personally-relevant gambling cues. The present study investigated the specificity of attentional biases for individual types of gambling using an eye-gaze tracking paradigm. Three groups of participants (poker players, video lottery terminal/slot machine players, and non-gambling controls took part in one test session in which they viewed 25 sets of four images (poker, VLTs/slot machines, bingo, and board games. Participants' eye fixations were recorded throughout each 8-second presentation of the four images. The results indicated that, as predicted, the two gambling groups preferentially attended to their primary form of gambling, whereas control participants attended to board games more than gambling images. The findings have clinical implications for the treatment of individuals with gambling disorder. Understanding the importance of personally-salient gambling cues will inform the development of effective attentional bias modification treatments for problem gamblers.

  17. The specificity of attentional biases by type of gambling: An eye-tracking study.

    Science.gov (United States)

    McGrath, Daniel S; Meitner, Amadeus; Sears, Christopher R

    2018-01-01

    A growing body of research indicates that gamblers develop an attentional bias for gambling-related stimuli. Compared to research on substance use, however, few studies have examined attentional biases in gamblers using eye-gaze tracking, which has many advantages over other measures of attention. In addition, previous studies of attentional biases in gamblers have not directly matched type of gambler with personally-relevant gambling cues. The present study investigated the specificity of attentional biases for individual types of gambling using an eye-gaze tracking paradigm. Three groups of participants (poker players, video lottery terminal/slot machine players, and non-gambling controls) took part in one test session in which they viewed 25 sets of four images (poker, VLTs/slot machines, bingo, and board games). Participants' eye fixations were recorded throughout each 8-second presentation of the four images. The results indicated that, as predicted, the two gambling groups preferentially attended to their primary form of gambling, whereas control participants attended to board games more than gambling images. The findings have clinical implications for the treatment of individuals with gambling disorder. Understanding the importance of personally-salient gambling cues will inform the development of effective attentional bias modification treatments for problem gamblers.

  18. The 12-item World Health Organization Disability Assessment Schedule II (WHO-DAS II: a nonparametric item response analysis

    Directory of Open Access Journals (Sweden)

    Fernandez Ana

    2010-05-01

    Full Text Available Abstract Background Previous studies have analyzed the psychometric properties of the World Health Organization Disability Assessment Schedule II (WHO-DAS II using classical omnibus measures of scale quality. These analyses are sample dependent and do not model item responses as a function of the underlying trait level. The main objective of this study was to examine the effectiveness of the WHO-DAS II items and their options in discriminating between changes in the underlying disability level by means of item response analyses. We also explored differential item functioning (DIF in men and women. Methods The participants were 3615 adult general practice patients from 17 regions of Spain, with a first diagnosed major depressive episode. The 12-item WHO-DAS II was administered by the general practitioners during the consultation. We used a non-parametric item response method (Kernel-Smoothing implemented with the TestGraf software to examine the effectiveness of each item (item characteristic curves and their options (option characteristic curves in discriminating between changes in the underliying disability level. We examined composite DIF to know whether women had a higher probability than men of endorsing each item. Results Item response analyses indicated that the twelve items forming the WHO-DAS II perform very well. All items were determined to provide good discrimination across varying standardized levels of the trait. The items also had option characteristic curves that showed good discrimination, given that each increasing option became more likely than the previous as a function of increasing trait level. No gender-related DIF was found on any of the items. Conclusions All WHO-DAS II items were very good at assessing overall disability. Our results supported the appropriateness of the weights assigned to response option categories and showed an absence of gender differences in item functioning.

  19. Gender-Based Differential Item Performance in Mathematics Achievement Items.

    Science.gov (United States)

    Doolittle, Allen E.; Cleary, T. Anne

    1987-01-01

    Eight randomly equivalent samples of high school seniors were each given a unique form of the ACT Assessment Mathematics Usage Test (ACTM). Signed measures of differential item performance (DIP) were obtained for each item in the eight ACTM forms. DIP estimates were analyzed and a significant item category effect was found. (Author/LMO)

  20. Priming in implicit memory tasks: prior study causes enhanced discriminability, not only bias.

    Science.gov (United States)

    Zeelenberg, René; Wagenmakers, Eric-Jan M; Raaijmakers, Jeroen G W

    2002-03-01

    R. Ratcliff and G. McKoon (1995, 1996, 1997; R. Ratcliff, D. Allbritton, & G. McKoon, 1997) have argued that repetition priming effects are solely due to bias. They showed that prior study of the target resulted in a benefit in a later implicit memory task. However, prior study of a stimulus similar to the target resulted in a cost. The present study, using a 2-alternative forced-choice procedure, investigated the effect of prior study in an unbiased condition: Both alternatives were studied prior to their presentation in an implicit memory task. Contrary to a pure bias interpretation of priming, consistent evidence was obtained in 3 implicit memory tasks (word fragment completion, auditory word identification, and picture identification) that performance was better when both alternatives were studied than when neither alternative was studied. These results show that prior study results in enhanced discriminability, not only bias.

  1. Identifying group-sensitive physical activities: a differential item functioning analysis of NHANES data.

    Science.gov (United States)

    Gao, Yong; Zhu, Weimo

    2011-05-01

    The purpose of this study was to identify subgroup-sensitive physical activities (PA) using differential item functioning (DIF) analysis. A sub-unweighted sample of 1857 (men=923 and women=934) from the 2003-2004 National Health and Nutrition Examination Survey PA questionnaire data was used for the analyses. Using the Mantel-Haenszel, the simultaneous item bias test, and the ANOVA DIF methods, 33 specific leisure-time moderate and/or vigorous PA (MVPA) items were analyzed for DIF across race/ethnicity, gender, education, income, and age groups. Many leisure-time MVPA items were identified as large DIF items. When participating in the same amount of leisure-time MVPA, non-Hispanic blacks were more likely to participate in basketball and dance activities than non-Hispanic whites (NHW); NHW were more likely to participated in golf and hiking than non-Hispanic blacks; Hispanics were more likely to participate in dancing, hiking, and soccer than NHW, whereas NHW were more likely to engage in bicycling, golf, swimming, and walking than Hispanics; women were more likely to participate in aerobics, dancing, stretching, and walking than men, whereas men were more likely to engage in basketball, fishing, golf, running, soccer, weightlifting, and hunting than women; educated persons were more likely to participate in jogging and treadmill exercise than less educated persons; persons with higher incomes were more likely to engage in golf than those with lower incomes; and adults (20-59 yr) were more likely to participate in basketball, dancing, jogging, running, and weightlifting than older adults (60+ yr), whereas older adults were more likely to participate in walking and golf than younger adults. DIF methods are able to identify subgroup-sensitive PA and thus provide useful information to help design group-sensitive, targeted interventions for disadvantaged PA subgroups. © 2011 by the American College of Sports Medicine

  2. Challenging stereotyping and bias: a voice simulation study.

    Science.gov (United States)

    Dearing, Karen S; Steadman, Sheryl

    2008-02-01

    Stigma is a barrier to mental health care access for patients with schizophrenia and can interfere with developing therapeutic relationships. This study demonstrates success of a voice simulation experience during orientation in changing the biases of nursing students and the effect on the development of the nurse-patient relationship. Ninety-four individuals participated; 52 received a voice simulation experience during orientation, and 42 received orientation with no voice simulation experience. The Medical Condition Regard Scale was administered before and after orientation. Posttest paired t test results show significant differences in attitudes toward patients with voice hearing experiences between the two groups. The themes of personal growth from the focus groups postorientation include Affective Experience, Physical Experience, and Empathy. Findings demonstrate that the orientation process should include methods to challenge stereotyping and bias to decrease stigma, improve service access, and enhance the ability to develop therapeutic relationships.

  3. ROBINS-I: a tool for assessing risk of bias in non-randomised studies of interventions

    OpenAIRE

    Sterne, Jonathan AC; Hern?n, Miguel A; Reeves, Barnaby C; Savovi?, Jelena; Berkman, Nancy D; Viswanathan, Meera; Henry, David; Altman, Douglas G; Ansari, Mohammed T; Boutron, Isabelle; Carpenter, James R; Chan, An-Wen; Churchill, Rachel; Deeks, Jonathan J; Hr?bjartsson, Asbj?rn

    2016-01-01

    Non-randomized studies of the effects of interventions are critical to many areas of health care evaluation, but their results may be biased. It is therefore important to understand and appraise their strengths and weaknesses. We developed ROBINS-I (“Risk Of Bias In Non-randomized Studies - of Interventions”), a new tool for evaluating risk of bias in estimates of the comparative effectiveness (harm or benefit) of interventions from studies that did not use randomization to allocate units (in...

  4. Lower Risk of Death With SGLT2 Inhibitors in Observational Studies: Real or Bias?

    Science.gov (United States)

    Suissa, Samy

    2018-01-01

    Two recent observational studies reported a remarkably lower rate of all-cause death associated with sodium-glucose cotransporter 2 inhibitor (-SGLT2i) use in all patients with type 2 diabetes and not only those at increased cardiovascular risk. The >50% lower mortality rates reported in these studies are much greater than those found in the BI 10773 (Empagliflozin) Cardiovascular Outcome Event Trial in Type 2 Diabetes Mellitus Patients (EMPA-REG OUTCOME) and CANagliflozin cardioVascular Assessment Study (CANVAS) randomized trials. We show that these observational studies are affected by time-related biases, including immortal time bias and time-lag bias, which tend to exaggerate the benefits observed with a drug. The Comparative Effectiveness of Cardiovascular Outcomes in New Users of SGLT-2 Inhibitors (CVD-REAL) study, based on 166,033 users of SGLT2i and 1,226,221 users of other glucose-lowering drugs (oGLD) identified from health care databases of six countries, was affected by immortal time bias. Indeed, the immortal time between the first oGLD prescription and the first SGLT2i prescription was omitted from the analysis, which resulted in increasing the rate of death in the oGLD group and thus producing the appearance of a lower risk of death with SGLT2i use. The Swedish study compared 10,879 SGLT2i/dipeptidyl peptidase 4 inhibitor (DPP-4i) users with 10,879 matched insulin users. Such comparisons involving second-line therapies with a third-line therapy can introduce time-lag bias, as the patients may not be at the same stage of diabetes. This bias is compounded by the fact that the users of insulin had already started their insulin before cohort entry, unlike the new users of SGLT2i. Finally, the study also introduces immortal time bias with respect to the effects of SGLT2i relative to DPP-4i. In conclusion, the >50% lower rate of death with SGLT2i in type 2 diabetes reported by two recent observational studies is likely exaggerated by immortal time and time

  5. Attention bias for chocolate increases chocolate consumption--an attention bias modification study.

    Science.gov (United States)

    Werthmann, Jessica; Field, Matt; Roefs, Anne; Nederkoorn, Chantal; Jansen, Anita

    2014-03-01

    The current study examined experimentally whether a manipulated attention bias for food cues increases craving, chocolate intake and motivation to search for hidden chocolates. To test the effect of attention for food on subsequent chocolate intake, attention for chocolate was experimentally modified by instructing participants to look at chocolate stimuli ("attend chocolate" group) or at non-food stimuli ("attend shoes" group) during a novel attention bias modification task (antisaccade task). Chocolate consumption, changes in craving and search time for hidden chocolates were assessed. Eye-movement recordings were used to monitor the accuracy during the experimental attention modification task as possible moderator of effects. Regression analyses were conducted to test the effect of attention modification and modification accuracy on chocolate intake, craving and motivation to search for hidden chocolates. Results showed that participants with higher accuracy (+1 SD), ate more chocolate when they had to attend to chocolate and ate less chocolate when they had to attend to non-food stimuli. In contrast, for participants with lower accuracy (-1 SD), the results were exactly reversed. No effects of the experimental attention modification on craving or search time for hidden chocolates were found. We used chocolate as food stimuli so it remains unclear how our findings generalize to other types of food. These findings demonstrate further evidence for a link between attention for food and food intake, and provide an indication about the direction of this relationship. Copyright © 2013 Elsevier Ltd. All rights reserved.

  6. P2-19: The Effect of item Repetition on Item-Context Association Depends on the Prior Exposure of Items

    Directory of Open Access Journals (Sweden)

    Hongmi Lee

    2012-10-01

    Full Text Available Previous studies have reported conflicting findings on whether item repetition has beneficial or detrimental effects on source memory. To reconcile such contradictions, we investigated whether the degree of pre-exposure of items can be a potential modulating factor. The experimental procedures spanned two consecutive days. On Day 1, participants were exposed to a set of unfamiliar faces. On Day 2, the same faces presented on the previous day were used again in half of the participants, whereas novel faces were used for the other half. Day 2 procedures consisted of three successive phases: item repetition, source association, and source memory test. In the item repetition phase, half of the face stimuli were repeatedly presented while participants were making male/female judgments. During the source association phase, both the repeated and the unrepeated faces appeared in one of the four locations on the screen. Finally, participants were tested on the location in which a given face was presented during the previous phase and reported the confidence of their memory. Source memory accuracy was measured as the percentage of correct non-guess trials. As results, we found a significant interaction between prior exposure and repetition. Repetition impaired source memory when the items had been pre-exposed on Day 1, while it led to greater accuracy in novel ones. These results show that pre-experimental exposure can modulate the effects of repetition on associative binding between an item and its contextual information, suggesting that pre-existing representation and novelty signal interact to form new episodic memory.

  7. The relationship between social desirability bias and self-reports of health, substance use, and social network factors among urban substance users in Baltimore, Maryland.

    Science.gov (United States)

    Latkin, Carl A; Edwards, Catie; Davey-Rothwell, Melissa A; Tobin, Karin E

    2017-10-01

    Social desirability response bias may lead to inaccurate self-reports and erroneous study conclusions. The present study examined the relationship between social desirability response bias and self-reports of mental health, substance use, and social network factors among a community sample of inner-city substance users. The study was conducted in a sample of 591 opiate and cocaine users in Baltimore, Maryland from 2009 to 2013. Modified items from the Marlowe-Crowne Social Desirability Scale were included in the survey, which was conducted face-to-face and using Audio Computer Self Administering Interview (ACASI) methods. There were highly statistically significant differences in levels of social desirability response bias by levels of depressive symptoms, drug use stigma, physical health status, recent opiate and cocaine use, Alcohol Use Disorders Identification Test (AUDIT) scores, and size of social networks. There were no associations between health service utilization measures and social desirability bias. In multiple logistic regression models, even after including the Center for Epidemiologic Studies Depression Scale (CES-D) as a measure of depressive symptomology, social desirability bias was associated with recent drug use and drug user stigma. Social desirability bias was not associated with enrollment in prior research studies. These findings suggest that social desirability bias is associated with key health measures and that the associations are not primarily due to depressive symptoms. Methods are needed to reduce social desirability bias. Such methods may include the wording and prefacing of questions, clearly defining the role of "study participant," and assessing and addressing motivations for socially desirable responses. Copyright © 2017 Elsevier Ltd. All rights reserved.

  8. Teoria da Resposta ao Item Teoria de la respuesta al item Item response theory

    Directory of Open Access Journals (Sweden)

    Eutalia Aparecida Candido de Araujo

    2009-12-01

    Full Text Available A preocupação com medidas de traços psicológicos é antiga, sendo que muitos estudos e propostas de métodos foram desenvolvidos no sentido de alcançar este objetivo. Entre os trabalhos propostos, destaca-se a Teoria da Resposta ao Item (TRI que, a princípio, veio completar limitações da Teoria Clássica de Medidas, empregada em larga escala até hoje na medida de traços psicológicos. O ponto principal da TRI é que ela leva em consideração o item particularmente, sem relevar os escores totais; portanto, as conclusões não dependem apenas do teste ou questionário, mas de cada item que o compõe. Este artigo propõe-se a apresentar esta Teoria que revolucionou a teoria de medidas.La preocupación con las medidas de los rasgos psicológicos es antigua y muchos estudios y propuestas de métodos fueron desarrollados para lograr este objetivo. Entre estas propuestas de trabajo se incluye la Teoría de la Respuesta al Ítem (TRI que, en principio, vino a completar las limitaciones de la Teoría Clásica de los Tests, ampliamente utilizada hasta hoy en la medida de los rasgos psicológicos. El punto principal de la TRI es que se tiene en cuenta el punto concreto, sin relevar las puntuaciones totales; por lo tanto, los resultados no sólo dependen de la prueba o cuestionario, sino que de cada ítem que lo compone. En este artículo se propone presentar la Teoría que revolucionó la teoría de medidas.The concern with measures of psychological traits is old and many studies and proposals of methods were developed to achieve this goal. Among these proposed methods highlights the Item Response Theory (IRT that, in principle, came to complete limitations of the Classical Test Theory, which is widely used until nowadays in the measurement of psychological traits. The main point of IRT is that it takes into account the item in particular, not relieving the total scores; therefore, the findings do not only depend on the test or questionnaire

  9. [Introduction to critical reading of articles: study design and biases].

    Science.gov (United States)

    García Villar, C

    2015-01-01

    The critical evaluation of an article enables professionals to make good use of the new information and therefore has direct repercussions for the benefit of our patients. Before undertaking a detailed critical reading of the chosen article, we need to consider whether the study used the most appropriate design for the question it aimed to answer (i.e., whether the level of evidence is adequate). To do this, we need to know how to classify studies in function of their design (descriptive or analytical; prospective or retrospective; cross-sectional or longitudinal) as well as their correlation with the levels of evidence. In critical reading it is also important to know the main systematic errors or biases that can affect a study. Biases can appear in any phase of a study; they can affect the sample, the development of the study, or the measurement of the results. Copyright © 2014 SERAM. Published by Elsevier España, S.L.U. All rights reserved.

  10. Examining Multiple Sources of Differential Item Functioning on the Clinician & Group CAHPS® Survey

    Science.gov (United States)

    Rodriguez, Hector P; Crane, Paul K

    2011-01-01

    Objective To evaluate psychometric properties of a widely used patient experience survey. Data Sources English-language responses to the Clinician & Group Consumer Assessment of Healthcare Providers and Systems (CG-CAHPS®) survey (n = 12,244) from a 2008 quality improvement initiative involving eight southern California medical groups. Methods We used an iterative hybrid ordinal logistic regression/item response theory differential item functioning (DIF) algorithm to identify items with DIF related to patient sociodemographic characteristics, duration of the physician–patient relationship, number of physician visits, and self-rated physical and mental health. We accounted for all sources of DIF and determined its cumulative impact. Principal Findings The upper end of the CG-CAHPS® performance range is measured with low precision. With sensitive settings, some items were found to have DIF. However, overall DIF impact was negligible, as 0.14 percent of participants had salient DIF impact. Latinos who spoke predominantly English at home had the highest prevalence of salient DIF impact at 0.26 percent. Conclusions The CG-CAHPS® functions similarly across commercially insured respondents from diverse backgrounds. Consequently, previously documented racial and ethnic group differences likely reflect true differences rather than measurement bias. The impact of low precision at the upper end of the scale should be clarified. PMID:22092021

  11. SAS and SPSS macros to calculate standardized Cronbach's alpha using the upper bound of the phi coefficient for dichotomous items.

    Science.gov (United States)

    Sun, Wei; Chou, Chih-Ping; Stacy, Alan W; Ma, Huiyan; Unger, Jennifer; Gallaher, Peggy

    2007-02-01

    Cronbach's a is widely used in social science research to estimate the internal consistency of reliability of a measurement scale. However, when items are not strictly parallel, the Cronbach's a coefficient provides a lower-bound estimate of true reliability, and this estimate may be further biased downward when items are dichotomous. The estimation of standardized Cronbach's a for a scale with dichotomous items can be improved by using the upper bound of coefficient phi. SAS and SPSS macros have been developed in this article to obtain standardized Cronbach's a via this method. The simulation analysis showed that Cronbach's a from upper-bound phi might be appropriate for estimating the real reliability when standardized Cronbach's a is problematic.

  12. BIAS IN THE MEASUREMENT OF QUALITY OF LIFE: RESPONSE SHIFT

    Directory of Open Access Journals (Sweden)

    Yesim SENOL

    2006-10-01

    Full Text Available Quality of Life (QoL is a descriptive term that refers to people’s emotional, social and physical wellbeing, and their ability to function in the ordinary task of living. The importance of QoL makes it critical to improve and refine measure to understand patients’ experience of health, illness and treatment. However individuals change with time and the basis on which they make a QoL judgment may also change, a phenomenon increasingly referred to as response shift. The definition of response shift is recalibration of internal standards of measurement and reconceptualization of the meaning of item. The purpose of study is to discuss the effects of response shift bias. [TAF Prev Med Bull 2006; 5(5.000: 382-389

  13. Theory and experimental study of biased charge collector for measuring HPIB

    International Nuclear Information System (INIS)

    He Xiaoping; Wang Haiyang; Sun Jianfeng; Yang Hailiang; Qiu Aici; Tang Junping; Li Jingya; Li Hongyu

    2004-01-01

    Structure of the biased charge collector for measuring HPIB (High-power ion beam) is introduced in this paper. The inner charge propagation process of HPIB in the biased charge collector was simulated with KARAT PIC code. The simulation results indicated that charge was neutralized but current was not neutralized in the biased charge collector. The influence of biased voltage and aperture diameter were also simulated. A -800V biased voltage can meet the requirement for measuring 500 keV HPIB, and this is consistent with the experimental results

  14. Combining item response theory with multiple imputation to equate health assessment questionnaires.

    Science.gov (United States)

    Gu, Chenyang; Gutman, Roee

    2017-09-01

    The assessment of patients' functional status across the continuum of care requires a common patient assessment tool. However, assessment tools that are used in various health care settings differ and cannot be easily contrasted. For example, the Functional Independence Measure (FIM) is used to evaluate the functional status of patients who stay in inpatient rehabilitation facilities, the Minimum Data Set (MDS) is collected for all patients who stay in skilled nursing facilities, and the Outcome and Assessment Information Set (OASIS) is collected if they choose home health care provided by home health agencies. All three instruments or questionnaires include functional status items, but the specific items, rating scales, and instructions for scoring different activities vary between the different settings. We consider equating different health assessment questionnaires as a missing data problem, and propose a variant of predictive mean matching method that relies on Item Response Theory (IRT) models to impute unmeasured item responses. Using real data sets, we simulated missing measurements and compared our proposed approach to existing methods for missing data imputation. We show that, for all of the estimands considered, and in most of the experimental conditions that were examined, the proposed approach provides valid inferences, and generally has better coverages, relatively smaller biases, and shorter interval estimates. The proposed method is further illustrated using a real data set. © 2016, The International Biometric Society.

  15. Memory for Items and Relationships among Items Embedded in Realistic Scenes: Disproportionate Relational Memory Impairments in Amnesia

    Science.gov (United States)

    Hannula, Deborah E.; Tranel, Daniel; Allen, John S.; Kirchhoff, Brenda A.; Nickel, Allison E.; Cohen, Neal J.

    2014-01-01

    Objective The objective of this study was to examine the dependence of item memory and relational memory on medial temporal lobe (MTL) structures. Patients with amnesia, who either had extensive MTL damage or damage that was relatively restricted to the hippocampus, were tested, as was a matched comparison group. Disproportionate relational memory impairments were predicted for both patient groups, and those with extensive MTL damage were also expected to have impaired item memory. Method Participants studied scenes, and were tested with interleaved two-alternative forced-choice probe trials. Probe trials were either presented immediately after the corresponding study trial (lag 1), five trials later (lag 5), or nine trials later (lag 9) and consisted of the studied scene along with a manipulated version of that scene in which one item was replaced with a different exemplar (item memory test) or was moved to a new location (relational memory test). Participants were to identify the exact match of the studied scene. Results As predicted, patients were disproportionately impaired on the test of relational memory. Item memory performance was marginally poorer among patients with extensive MTL damage, but both groups were impaired relative to matched comparison participants. Impaired performance was evident at all lags, including the shortest possible lag (lag 1). Conclusions The results are consistent with the proposed role of the hippocampus in relational memory binding and representation, even at short delays, and suggest that the hippocampus may also contribute to successful item memory when items are embedded in complex scenes. PMID:25068665

  16. Beyond assembly bias: exploring secondary halo biases for cluster-size haloes

    Science.gov (United States)

    Mao, Yao-Yuan; Zentner, Andrew R.; Wechsler, Risa H.

    2018-03-01

    Secondary halo bias, commonly known as `assembly bias', is the dependence of halo clustering on a halo property other than mass. This prediction of the Λ Cold Dark Matter cosmology is essential to modelling the galaxy distribution to high precision and interpreting clustering measurements. As the name suggests, different manifestations of secondary halo bias have been thought to originate from halo assembly histories. We show conclusively that this is incorrect for cluster-size haloes. We present an up-to-date summary of secondary halo biases of high-mass haloes due to various halo properties including concentration, spin, several proxies of assembly history, and subhalo properties. While concentration, spin, and the abundance and radial distribution of subhaloes exhibit significant secondary biases, properties that directly quantify halo assembly history do not. In fact, the entire assembly histories of haloes in pairs are nearly identical to those of isolated haloes. In general, a global correlation between two halo properties does not predict whether or not these two properties exhibit similar secondary biases. For example, assembly history and concentration (or subhalo abundance) are correlated for both paired and isolated haloes, but follow slightly different conditional distributions in these two cases. This results in a secondary halo bias due to concentration (or subhalo abundance), despite the lack of assembly bias in the strict sense for cluster-size haloes. Due to this complexity, caution must be exercised in using any one halo property as a proxy to study the secondary bias due to another property.

  17. Use of NON-PARAMETRIC Item Response Theory to develop a shortened version of the Positive and Negative Syndrome Scale (PANSS)

    Science.gov (United States)

    2011-01-01

    Background Nonparametric item response theory (IRT) was used to examine (a) the performance of the 30 Positive and Negative Syndrome Scale (PANSS) items and their options ((levels of severity), (b) the effectiveness of various subscales to discriminate among differences in symptom severity, and (c) the development of an abbreviated PANSS (Mini-PANSS) based on IRT and a method to link scores to the original PANSS. Methods Baseline PANSS scores from 7,187 patients with Schizophrenia or Schizoaffective disorder who were enrolled between 1995 and 2005 in psychopharmacology trials were obtained. Option characteristic curves (OCCs) and Item Characteristic Curves (ICCs) were constructed to examine the probability of rating each of seven options within each of 30 PANSS items as a function of subscale severity, and summed-score linking was applied to items selected for the Mini-PANSS. Results The majority of items forming the Positive and Negative subscales (i.e. 19 items) performed very well and discriminate better along symptom severity compared to the General Psychopathology subscale. Six of the seven Positive Symptom items, six of the seven Negative Symptom items, and seven out of the 16 General Psychopathology items were retained for inclusion in the Mini-PANSS. Summed score linking and linear interpolation was able to produce a translation table for comparing total subscale scores of the Mini-PANSS to total subscale scores on the original PANSS. Results show scores on the subscales of the Mini-PANSS can be linked to scores on the original PANSS subscales, with very little bias. Conclusions The study demonstrated the utility of non-parametric IRT in examining the item properties of the PANSS and to allow selection of items for an abbreviated PANSS scale. The comparisons between the 30-item PANSS and the Mini-PANSS revealed that the shorter version is comparable to the 30-item PANSS, but when applying IRT, the Mini-PANSS is also a good indicator of illness severity

  18. Use of non-parametric item response theory to develop a shortened version of the Positive and Negative Syndrome Scale (PANSS).

    Science.gov (United States)

    Khan, Anzalee; Lewis, Charles; Lindenmayer, Jean-Pierre

    2011-11-16

    Nonparametric item response theory (IRT) was used to examine (a) the performance of the 30 Positive and Negative Syndrome Scale (PANSS) items and their options ((levels of severity), (b) the effectiveness of various subscales to discriminate among differences in symptom severity, and (c) the development of an abbreviated PANSS (Mini-PANSS) based on IRT and a method to link scores to the original PANSS. Baseline PANSS scores from 7,187 patients with Schizophrenia or Schizoaffective disorder who were enrolled between 1995 and 2005 in psychopharmacology trials were obtained. Option characteristic curves (OCCs) and Item Characteristic Curves (ICCs) were constructed to examine the probability of rating each of seven options within each of 30 PANSS items as a function of subscale severity, and summed-score linking was applied to items selected for the Mini-PANSS. The majority of items forming the Positive and Negative subscales (i.e. 19 items) performed very well and discriminate better along symptom severity compared to the General Psychopathology subscale. Six of the seven Positive Symptom items, six of the seven Negative Symptom items, and seven out of the 16 General Psychopathology items were retained for inclusion in the Mini-PANSS. Summed score linking and linear interpolation was able to produce a translation table for comparing total subscale scores of the Mini-PANSS to total subscale scores on the original PANSS. Results show scores on the subscales of the Mini-PANSS can be linked to scores on the original PANSS subscales, with very little bias. The study demonstrated the utility of non-parametric IRT in examining the item properties of the PANSS and to allow selection of items for an abbreviated PANSS scale. The comparisons between the 30-item PANSS and the Mini-PANSS revealed that the shorter version is comparable to the 30-item PANSS, but when applying IRT, the Mini-PANSS is also a good indicator of illness severity.

  19. The effects of self-focus on attentional biases in social anxiety:An ERP study.

    Science.gov (United States)

    Judah, Matt R; Grant, DeMond M; Carlisle, Nancy B

    2016-06-01

    Cognitive theories of social anxiety disorder suggest that biased attention plays a key role in maintaining symptoms. These biases include self-focus and attention to socially threatening stimuli in the environment. The goal of this study was to utilize ERPs that are elicited by a change detection task to examine biases in selective attention (i.e., N2pc) and working memory maintenance (i.e., contralateral delay activity; CDA). Additionally, the effect of self-focus was examined using false heart rate feedback. In support of the manipulation, self-focus cues resulted in greater self-reported self-consciousness and task interference, enhanced anterior P2 amplitude and reduced SPN amplitude. Moreover, P2 amplitude for self-focus cues was correlated with reduced task performance for socially anxious subjects only. The difference in P2 amplitude between self-focus and standard cues was correlated with social anxiety independent of depression. As hypothesized, socially anxious participants (n = 20) showed early selection and maintenance of disgust faces relative to neutral faces as indicated by the N2pc and CDA components. Nonanxious controls (n = 22) did not show these biases. During self-focus cues, controls showed marginal evidence of biased selection for disgust faces, whereas socially anxious subjects showed no bias in this condition. Controls showed an ipsilateral delay activity after being cued to attend to one hemifield. Overall, this study supports early and persistent attentional bias for social threat in socially anxious individuals. Furthermore, self-focus may disrupt these biases. These findings and supplementary data are discussed in light of cognitive models of social anxiety disorder, recent empirical findings, and treatment.

  20. An emotional functioning item bank of 24 items for computerized adaptive testing (CAT) was established

    DEFF Research Database (Denmark)

    Petersen, Morten Aa.; Gamper, Eva-Maria; Costantini, Anna

    2016-01-01

    of the widely used EORTC Quality of Life questionnaire (QLQ-C30). STUDY DESIGN AND SETTING: On the basis of literature search and evaluations by international samples of experts and cancer patients, 38 candidate items were developed. The psychometric properties of the items were evaluated in a large...... international sample of cancer patients. This included evaluations of dimensionality, item response theory (IRT) model fit, differential item functioning (DIF), and of measurement precision/statistical power. RESULTS: Responses were obtained from 1,023 cancer patients from four countries. The evaluations showed...... that 24 items could be included in a unidimensional IRT model. DIF did not seem to have any significant impact on the estimation of EF. Evaluations indicated that the CAT measure may reduce sample size requirements by up to 50% compared to the QLQ-C30 EF scale without reducing power. CONCLUSION...

  1. Bias in clinical intervention research

    DEFF Research Database (Denmark)

    Gluud, Lise Lotte

    2006-01-01

    Research on bias in clinical trials may help identify some of the reasons why investigators sometimes reach the wrong conclusions about intervention effects. Several quality components for the assessment of bias control have been suggested, but although they seem intrinsically valid, empirical...... evidence is needed to evaluate their effects on the extent and direction of bias. This narrative review summarizes the findings of methodological studies on the influence of bias in clinical trials. A number of methodological studies suggest that lack of adequate randomization in published trial reports...

  2. Reliability and norms for the 10-item self-motivation inventory: The TIGER Study

    Science.gov (United States)

    The Self-Motivation Inventory (SMI) has been shown to be a predictor of exercise dropout. The original SMI of 40 items has been shortened to 10 items and the psychometric qualities of the 10-item SMI are not known. To estimate the reliability of a 10-item SMI and develop norms for an ethnically dive...

  3. Is the General Self-Efficacy Scale a Reliable Measure to be used in Cross-Cultural Studies? Results from Brazil, Germany and Colombia.

    Science.gov (United States)

    Damásio, Bruno F; Valentini, Felipe; Núñes-Rodriguez, Susana I; Kliem, Soeren; Koller, Sílvia H; Hinz, Andreas; Brähler, Elmar; Finck, Carolyn; Zenger, Markus

    2016-05-26

    This study evaluated cross-cultural measurement invariance for the General Self-efficacy Scale (GSES) in a large Brazilian (N = 2.394) and representative German (N = 2.046) and Colombian (N = 1.500) samples. Initially, multiple-indicators multiple-causes (MIMIC) analyses showed that sex and age were biasing items responses on the total sample (2 and 10 items, respectively). After controlling for these two covariates, a multigroup confirmatory factor analysis (MGCFA) was employed. Configural invariance was attested. However, metric invariance was not supported for five items, in a total of 10, and scalar invariance was not supported for all items. We also evaluated the differences between the latent scores estimated by two models: MIMIC and MGCFA unconstraining the non-equivalent parameters across countries. The average difference was equal to |.07| on the estimation of the latent scores, and 22.8% of the scores were biased in at least .10 standardized points. Bias effects were above the mean for the German group, which the average difference was equal to |.09|, and 33.7% of the scores were biased in at least .10. In synthesis, the GSES did not provide evidence of measurement invariance to be employed in this cross-cultural study. More than that, our results showed that even when controlling for sex and age effects, the absence of control on items parameters in the MGCFA analyses across countries would implicate in bias of the latent scores estimation, with a higher effect for the German population.

  4. Does nonparticipation in studies of advanced cancer lead to biased quality-of-life scores?

    DEFF Research Database (Denmark)

    Petersen, Morten A; Pedersen, Lise; Groenvold, Mogens

    2009-01-01

    Missing data are a common problem in palliative care research. Often the most impaired patients are unable to participate in studies. This may result in biased findings. We investigated whether observed patient reported outcomes should be adjusted for bias resulting from nonparticipation....

  5. Quality of reporting and of methodology of studies on interventions for trophic ulcers in leprosy: A systematic review

    Directory of Open Access Journals (Sweden)

    Forsetlund L

    2008-01-01

    Full Text Available Background: In the process of conducting a systematic review on interventions for skin lesions due to neuritis in leprosy, we assessed several primary papers with respect to the quality of reporting and methods used in the studies. Awareness of what constitutes weak points in previously conducted studies may be used to improve the planning, conducting and reporting of future clinical trials. Aims: To assess the quality of reporting and of methodology in studies of interventions for skin lesions due to neuritis in leprosy. Methods: Items of importance for preventing selection bias, detection bias, attrition bias and performance bias were among items assessed. The items for assessing methodological quality were used as a basis for making the checklist to assess the quality of reporting. Results: Out of the 854 references that we inspected eight studies were included on the basis of the inclusion criteria. The interventions tested were dressings, topical agents and footwear and in all studies healing of ulcers was the main outcome measure. Reporting of both, methods and results suffered from underreporting and disorganization. The most under-reported items were concealment of allocation, blinding of patients and outcome assessors, intention to treat and validation of outcomes. Conclusion: There is an apparent need to improve the methodological quality as well as the quality of reporting of trials in leprosy ulcer treatment. The most important threat in existing studies is the threat of selection bias. For the reporting of future studies, journals could promote and encourage the use of the CONSORT statement checklist by expecting and requiring that authors adhere to it in their reporting.

  6. Forced to remember: when memory is biased by salient information.

    Science.gov (United States)

    Santangelo, Valerio

    2015-04-15

    The last decades have seen a rapid growing in the attempt to understand the key factors involved in the internal memory representation of the external world. Visual salience have been found to provide a major contribution in predicting the probability for an item/object embedded in a complex setting (i.e., a natural scene) to be encoded and then remembered later on. Here I review the existing literature highlighting the impact of perceptual- (based on low-level sensory features) and semantics-related salience (based on high-level knowledge) on short-term memory representation, along with the neural mechanisms underpinning the interplay between these factors. The available evidence reveal that both perceptual- and semantics-related factors affect attention selection mechanisms during the encoding of natural scenes. Biasing internal memory representation, both perceptual and semantics factors increase the probability to remember high- to the detriment of low-saliency items. The available evidence also highlight an interplay between these factors, with a reduced impact of perceptual-related salience in biasing memory representation as a function of the increasing availability of semantics-related salient information. The neural mechanisms underpinning this interplay involve the activation of different portions of the frontoparietal attention control network. Ventral regions support the assignment of selection/encoding priorities based on high-level semantics, while the involvement of dorsal regions reflects priorities assignment based on low-level sensory features. Copyright © 2015 Elsevier B.V. All rights reserved.

  7. Bias in segmented gamma scans arising from size differences between calibration standards and assay samples

    International Nuclear Information System (INIS)

    Sampson, T.E.

    1991-01-01

    Recent advances in segmented gamma scanning have emphasized software corrections for gamma-ray self-adsorption in particulates or lumps of special nuclear material in the sample. another feature of this software is an attenuation correction factor formalism that explicitly accounts for differences in sample container size and composition between the calibration standards and the individual items being measured. Software without this container-size correction produces biases when the unknowns are not packaged in the same containers as the calibration standards. This new software allows the use of different size and composition containers for standards and unknowns, as enormous savings considering the expense of multiple calibration standard sets otherwise needed. This paper presents calculations of the bias resulting from not using this new formalism. These calculations may be used to estimate bias corrections for segmented gamma scanners that do not incorporate these advanced concepts

  8. Investigation of publication bias in meta-analyses of diagnostic test accuracy: a meta-epidemiological study

    NARCIS (Netherlands)

    van Enst, W. Annefloor; Ochodo, Eleanor; Scholten, Rob J. P. M.; Hooft, Lotty; Leeflang, Mariska M.

    2014-01-01

    The validity of a meta-analysis can be understood better in light of the possible impact of publication bias. The majority of the methods to investigate publication bias in terms of small study-effects are developed for meta-analyses of intervention studies, leaving authors of diagnostic test

  9. Evaluating HIV Knowledge Questionnaires Among Men Who Have Sex with Men: A Multi-Study Item Response Theory Analysis.

    Science.gov (United States)

    Janulis, Patrick; Newcomb, Michael E; Sullivan, Patrick; Mustanski, Brian

    2018-01-01

    Knowledge about the transmission, prevention, and treatment of HIV remains a critical element in psychosocial models of HIV risk behavior and is commonly used as an outcome in HIV prevention interventions. However, most HIV knowledge questions have not undergone rigorous psychometric testing such as using item response theory. The current study used data from six studies of men who have sex with men (MSM; n = 3565) to (1) examine the item properties of HIV knowledge questions, (2) test for differential item functioning on commonly studied characteristics (i.e., age, race/ethnicity, and HIV risk behavior), (3) select items with the optimal item characteristics, and (4) leverage this combined dataset to examine the potential moderating effect of age on the relationship between condomless anal sex (CAS) and HIV knowledge. Findings indicated that existing questions tend to poorly differentiate those with higher levels of HIV knowledge, but items were relatively robust across diverse individuals. Furthermore, age moderated the relationship between CAS and HIV knowledge with older MSM having the strongest association. These findings suggest that additional items are required in order to capture a more nuanced understanding of HIV knowledge and that the association between CAS and HIV knowledge may vary by age.

  10. The effect of framing on surrogate optimism bias: A simulation study.

    Science.gov (United States)

    Patel, Dev; Cohen, Elan D; Barnato, Amber E

    2016-04-01

    To explore the effect of emotion priming and physician communication behaviors on optimism bias. We conducted a 5 × 2 between-subject randomized factorial experiment using a Web-based interactive video designed to simulate a family meeting for a critically ill spouse/parent. Eligibility included age at least 35 years and self-identifying as the surrogate for a spouse/parent. The primary outcome was the surrogate's election of code status. We defined optimism bias as the surrogate's estimate of prognosis with cardiopulmonary resuscitation (CPR) > their recollection of the physician's estimate. Of 373 respondents, 256 (69%) logged in and were randomized and 220 (86%) had nonmissing data for prognosis. Sixty-seven (30%) of 220 overall and 56 of (32%) 173 with an accurate recollection of the physician's estimate had optimism bias. Optimism bias correlated with choosing CPR (P optimism bias. Framing the decision as the patient's vs the surrogate's (25% vs 36%, P = .066) and describing the alternative to CPR as "allow natural death" instead of "do not resuscitate" (25% vs 37%, P = .035) decreased optimism bias. Framing of CPR choice during code status conversations may influence surrogates' optimism bias. Copyright © 2015 Elsevier Inc. All rights reserved.

  11. Minimum bias and underlying event studies at CDF

    International Nuclear Information System (INIS)

    Moggi, Niccolo

    2010-01-01

    Soft, non-perturbative, interactions are poorly understood from the theoretical point of view even though they form a large part of the hadronic cross section at the energies now available. We review the CDF studies on minimum-bias ad underlying event in p(bar p) collisions at 2 TeV. After proposing an operative definition of 'underlying event', we present part of a systematic set of measurements carried out by the CDF Collaboration with the goal to provide data to test and improve the QCD models of hadron collisions. Different analysis strategies of the underlying event and possible event topologies are discussed. Part of the CDF minimum-bias results are also presented: in this sample, that represent the full inelastic cross-section, we can test simultaneously our knowledge of all the components that concur to form hadronic interactions. Comparisons with MonteCarlo simulations are always shown along with the data. These measurements will also contribute to more precise estimates of the soft QCD background of high-p T observables.

  12. Factoring handedness data: I. Item analysis.

    Science.gov (United States)

    Messinger, H B; Messinger, M I

    1995-12-01

    Recently in this journal Peters and Murphy challenged the validity of factor analyses done on bimodal handedness data, suggesting instead that right- and left-handers be studied separately. But bimodality may be avoidable if attention is paid to Oldfield's questionnaire format and instructions for the subjects. Two characteristics appear crucial: a two-column LEFT-RIGHT format for the body of the instrument and what we call Oldfield's Admonition: not to indicate strong preference for handedness item, such as write, unless "... the preference is so strong that you would never try to use the other hand unless absolutely forced to...". Attaining unimodality of an item distribution would seem to overcome the objections of Peters and Murphy. In a 1984 survey in Boston we used Oldfield's ten-item questionnaire exactly as published. This produced unimodal item distributions. With reflection of the five-point item scale and a logarithmic transformation, we achieved a degree of normalization for the items. Two surveys elsewhere based on Oldfield's 20-item list but with changes in the questionnaire format and the instructions, yielded markedly different item distributions with peaks at each extreme and sometimes in the middle as well.

  13. The lack of selection bias in a snowball sampled case-control study on drug abuse.

    Science.gov (United States)

    Lopes, C S; Rodrigues, L C; Sichieri, R

    1996-12-01

    Friend controls in matched case-control studies can be a potential source of bias based on the assumption that friends are more likely to share exposure factors. This study evaluates the role of selection bias in a case-control study that used the snowball sampling method based on friendship for the selection of cases and controls. The cases selected fro the study were drug abusers located in the community. Exposure was defined by the presence of at least one psychiatric diagnosis. Psychiatric and drug abuse/dependence diagnoses were made according to the Diagnostic and Statistical Manual of Mental Disorders (DSM-III-R) criteria. Cases and controls were matched on sex, age and friendship. The measurement of selection bias was made through the comparison of the proportion of exposed controls selected by exposed cases (p1) with the proportion of exposed controls selected by unexposed cases (p2). If p1 = p2 then, selection bias should not occur. The observed distribution of the 185 matched pairs having at least one psychiatric disorder showed a p1 value of 0.52 and a p2 value of 0.51, indicating no selection bias in this study. Our findings support the idea that the use of friend controls can produce a valid basis for a case-control study.

  14. Is racial bias malleable? Whites' lay theories of racial bias predict divergent strategies for interracial interactions.

    Science.gov (United States)

    Neel, Rebecca; Shapiro, Jenessa R

    2012-07-01

    How do Whites approach interracial interactions? We argue that a previously unexamined factor-beliefs about the malleability of racial bias-guides Whites' strategies for difficult interracial interactions. We predicted and found that those who believe racial bias is malleable favor learning-oriented strategies such as taking the other person's perspective and trying to learn why an interaction is challenging, whereas those who believe racial bias is fixed favor performance-oriented strategies such as overcompensating in the interaction and trying to end the interaction as quickly as possible. Four studies support these predictions. Whether measured (Studies 1, 3, and 4) or manipulated (Study 2), beliefs that racial bias is fixed versus malleable yielded these divergent strategies for difficult interracial interactions. Furthermore, beliefs about the malleability of racial bias are distinct from related constructs (e.g., prejudice and motivations to respond without prejudice; Studies 1, 3, and 4) and influence self-reported (Studies 1-3) and actual (Study 4) strategies in imagined (Studies 1-2) and real (Studies 3-4) interracial interactions. Together, these findings demonstrate that beliefs about the malleability of racial bias influence Whites' approaches to and strategies within interracial interactions. PsycINFO Database Record (c) 2012 APA, all rights reserved

  15. Structural Validation of a French Food Frequency Questionnaire of 94 Items.

    Science.gov (United States)

    Gazan, Rozenn; Vieux, Florent; Darmon, Nicole; Maillot, Matthieu

    2017-01-01

    Food frequency questionnaires (FFQs) are used to estimate the usual food and nutrient intakes over a period of time. Such estimates can suffer from measurement errors, either due to bias induced by respondent's answers or to errors induced by the structure of the questionnaire (e.g., using a limited number of food items and an aggregated food database with average portion sizes). The "structural validation" presented in this study aims to isolate and quantify the impact of the inherent structure of a FFQ on the estimation of food and nutrient intakes, independently of respondent's perception of the questionnaire. A semi-quantitative FFQ ( n  = 94 items, including 50 items with questions on portion sizes) and an associated aggregated food composition database (named the item-composition database) were developed, based on the self-reported weekly dietary records of 1918 adults (18-79 years-old) in the French Individual and National Dietary Survey 2 (INCA2), and the French CIQUAL 2013 food-composition database of all the foods ( n  = 1342 foods) declared as consumed in the population. Reference intakes of foods ("REF_FOOD") and nutrients ("REF_NUT") were calculated for each adult using the food-composition database and the amounts of foods self-reported in his/her dietary record. Then, answers to the FFQ were simulated for each adult based on his/her self-reported dietary record. "FFQ_FOOD" and "FFQ_NUT" intakes were estimated using the simulated answers and the item-composition database. Measurement errors (in %), spearman correlations and cross-classification were used to compare "REF_FOOD" with "FFQ_FOOD" and "REF_NUT" with "FFQ_NUT". Compared to "REF_NUT," "FFQ_NUT" total quantity and total energy intake were underestimated on average by 198 g/day and 666 kJ/day, respectively. "FFQ_FOOD" intakes were well estimated for starches, underestimated for most of the subgroups, and overestimated for some subgroups, in particular vegetables. Underestimation were

  16. Empirical evidence of design-related bias in studies of diagnostic tests

    NARCIS (Netherlands)

    Lijmer, J. G.; Mol, B. W.; Heisterkamp, S.; Bonsel, G. J.; Prins, M. H.; van der Meulen, J. H.; Bossuyt, P. M.

    1999-01-01

    CONTEXT: The literature contains a large number of potential biases in the evaluation of diagnostic tests. Strict application of appropriate methodological criteria would invalidate the clinical application of most study results. OBJECTIVE: To empirically determine the quantitative effect of study

  17. Ion beam studies of archaeological gold jewellery items

    International Nuclear Information System (INIS)

    Demortier, G.

    1996-01-01

    Analytical work on material of archaeological interest performed at LARN mainly concerns gold jewellery, with an emphasis to solders on the artefacts and to gold plating or copper depletion gilding. PIXE, RBS but also PIGE and NRA have been applied to a large variety of items. On the basis of elemental analysis, we have identified typical workmanship of ancient goldsmiths in various regions of the world: finely decorated Mesopotamian items, Hellenistic and Byzantine craftsmanship, cloisonne of the Merovingian period, depletion gilding on Pre-Colombian tumbaga. This paper is some shortening of the work performed at LARN during the last ten years. Criteria to properly use PIXE for quantitative analysis of non-homogeneous ancient artefacts presented at the 12th IBA conference in 1995 are also shortly discussed. (orig.)

  18. Ion beam studies of archaeological gold jewellery items

    Energy Technology Data Exchange (ETDEWEB)

    Demortier, G [Facultes Universitaires Notre-Dame de la Paix, Namur (Belgium). Lab. d` Analyses par Reactions Nucleaires

    1996-06-01

    Analytical work on material of archaeological interest performed at LARN mainly concerns gold jewellery, with an emphasis to solders on the artefacts and to gold plating or copper depletion gilding. PIXE, RBS but also PIGE and NRA have been applied to a large variety of items. On the basis of elemental analysis, we have identified typical workmanship of ancient goldsmiths in various regions of the world: finely decorated Mesopotamian items, Hellenistic and Byzantine craftsmanship, cloisonne of the Merovingian period, depletion gilding on Pre-Colombian tumbaga. This paper is some shortening of the work performed at LARN during the last ten years. Criteria to properly use PIXE for quantitative analysis of non-homogeneous ancient artefacts presented at the 12th IBA conference in 1995 are also shortly discussed. (orig.).

  19. Mood-congruent attention and memory bias in dysphoria: Exploring the coherence among information-processing biases.

    Science.gov (United States)

    Koster, Ernst H W; De Raedt, Rudi; Leyman, Lemke; De Lissnyder, Evi

    2010-03-01

    Recent studies indicate that depression is characterized by mood-congruent attention bias at later stages of information-processing. Moreover, depression has been associated with enhanced recall of negative information. The present study tested the coherence between attention and memory bias in dysphoria. Stable dysphoric (n = 41) and non-dysphoric (n = 41) undergraduates first performed a spatial cueing task that included negative, positive, and neutral words. Words were presented for 250 ms under conditions that allowed or prevented elaborate processing. Memory for the words presented in the cueing task was tested using incidental free recall. Dysphoric individuals exhibited an attention bias for negative words in the condition that allowed elaborate processing, with the attention bias for negative words predicting free recall of negative words. Results demonstrate the coherence of attention and memory bias in dysphoric individuals and provide suggestions on the influence of attention bias on further processing of negative material. 2009 Elsevier Ltd. All rights reserved.

  20. Mobile Phone Cognitive Bias Modification Research Platform for Substance Use Disorders: Protocol for a Feasibility Study.

    Science.gov (United States)

    Zhang, Melvyn; Ying, JiangBo; Song, Guo; Fung, Daniel Ss; Smith, Helen

    2018-06-12

    Cognitive biases refer to automatic attentional and interpretational tendencies, which could be retained by cognitive bias modification interventions. Cristea et al and Jones et al have published reviews (in 2016 and 2017 respectively) on the effectiveness of such interventions. The advancement of technologies such as electronic health (eHealth) and mobile health (mHealth) has led to them being harnessed for the delivery of cognitive bias modification. To date, at least eight studies have demonstrated the feasibility of mobile technologies for the delivery of cognitive bias modification. Most of the studies are limited to a description of the conventional cognitive bias modification methodology that has been adopted. None of the studies shared the developmental process for the methodology involved, such that future studies could adopt it in the cost-effective replication of such interventions. It is important to have a common platform that could facilitate the design and customization of cognitive bias modification interventions for a variety of psychiatric and addictive disorders. It is the aim of the current research protocol to describe the design of a research platform that allows for customization of cognitive bias modification interventions for addictive disorders. A multidisciplinary team of 2 addiction psychiatrists, a psychologist with expertise in cognitive bias modification, and a computer engineer, were involved in the development of the intervention. The proposed platform would comprise of a mobile phone version of the cognitive bias task which is controlled by a server that could customize the algorithm for the tasks and collate the reaction-time data in realtime. The server would also allow the researcher to program the specific set of images that will be present in the task. The mobile phone app would synchronize with the backend server in real-time. An open-sourced cross-platform gaming software from React Native was used in the current development

  1. Evaluation of item candidates for a diabetic retinopathy quality of life item bank.

    Science.gov (United States)

    Fenwick, Eva K; Pesudovs, Konrad; Khadka, Jyoti; Rees, Gwyn; Wong, Tien Y; Lamoureux, Ecosse L

    2013-09-01

    We are developing an item bank assessing the impact of diabetic retinopathy (DR) on quality of life (QoL) using a rigorous multi-staged process combining qualitative and quantitative methods. We describe here the first two qualitative phases: content development and item evaluation. After a comprehensive literature review, items were generated from four sources: (1) 34 previously validated patient-reported outcome measures; (2) five published qualitative articles; (3) eight focus groups and 18 semi-structured interviews with 57 DR patients; and (4) seven semi-structured interviews with diabetes or ophthalmic experts. Items were then evaluated during 3 stages, namely binning (grouping) and winnowing (reduction) based on key criteria and panel consensus; development of item stems and response options; and pre-testing of items via cognitive interviews with patients. The content development phase yielded 1,165 unique items across 7 QoL domains. After 3 sessions of binning and winnowing, items were reduced to a minimally representative set (n = 312) across 9 domains of QoL: visual symptoms; ocular surface symptoms; activity limitation; mobility; emotional; health concerns; social; convenience; and economic. After 8 cognitive interviews, 42 items were amended resulting in a final set of 314 items. We have employed a systematic approach to develop items for a DR-specific QoL item bank. The psychometric properties of the nine QoL subscales will be assessed using Rasch analysis. The resulting validated item bank will allow clinicians and researchers to better understand the QoL impact of DR and DR therapies from the patient's perspective.

  2. Analysis and Countermeasure Study on DC Bias of Main Transformer in a City

    Science.gov (United States)

    Wang, PengChao; Wang, Hongtao; Song, Xinpu; Gu, Jun; Liu, yong; Wu, weili

    2017-07-01

    According to the December 2015 Guohua Beijing thermal power transformer DC magnetic bias phenomenon, the monitoring data of 24 hours of direct current is analyzed. We find that the maximum DC current is up to 25 and is about 30s for the trend cycle, on this basis, then, of the geomagnetic storm HVDC and subway operation causes comparison of the mechanism, and make a comprehensive analysis of the thermal power plant’s geographical location, surrounding environment and electrical contact etc.. The results show that the main reason for the DC bias of Guohua thermal power transformer is the operation of the subway, and the change of the DC bias current is periodic. Finally, of Guohua thermal power transformer DC magnetic bias control method is studied, the simulation results show that the method of using neutral point with small resistance or capacitance can effectively inhibit the main transformer neutral point current.

  3. Non-ignorable missingness item response theory models for choice effects in examinee-selected items.

    Science.gov (United States)

    Liu, Chen-Wei; Wang, Wen-Chung

    2017-11-01

    Examinee-selected item (ESI) design, in which examinees are required to respond to a fixed number of items in a given set, always yields incomplete data (i.e., when only the selected items are answered, data are missing for the others) that are likely non-ignorable in likelihood inference. Standard item response theory (IRT) models become infeasible when ESI data are missing not at random (MNAR). To solve this problem, the authors propose a two-dimensional IRT model that posits one unidimensional IRT model for observed data and another for nominal selection patterns. The two latent variables are assumed to follow a bivariate normal distribution. In this study, the mirt freeware package was adopted to estimate parameters. The authors conduct an experiment to demonstrate that ESI data are often non-ignorable and to determine how to apply the new model to the data collected. Two follow-up simulation studies are conducted to assess the parameter recovery of the new model and the consequences for parameter estimation of ignoring MNAR data. The results of the two simulation studies indicate good parameter recovery of the new model and poor parameter recovery when non-ignorable missing data were mistakenly treated as ignorable. © 2017 The British Psychological Society.

  4. Checking Equity: Why Differential Item Functioning Analysis Should Be a Routine Part of Developing Conceptual Assessments

    Czech Academy of Sciences Publication Activity Database

    Martinková, Patrícia; Drabinová, Adéla; Liaw, Y.L.; Sanders, E.A.; McFarland, J.L.; Price, R.M.

    2017-01-01

    Roč. 16, č. 2 (2017), č. článku rm2. ISSN 1931-7913 R&D Projects: GA ČR GJ15-15856Y Grant - others:NSF(US) DUE-1043443 Institutional support: RVO:67985807 Keywords : differential item functioning * fairness * conceptual assessments * concept inventory * undergraduate education * bias Subject RIV: AM - Education OBOR OECD: Education , special (to gifted persons, those with learning disabilities) Impact factor: 3.930, year: 2016

  5. A review of the effects on IRT item parameter estimates with a focus on misbehaving common items in test equating

    Directory of Open Access Journals (Sweden)

    Michalis P Michaelides

    2010-10-01

    Full Text Available Many studies have investigated the topic of change or drift in item parameter estimates in the context of Item Response Theory. Content effects, such as instructional variation and curricular emphasis, as well as context effects, such as the wording, position, or exposure of an item have been found to impact item parameter estimates. The issue becomes more critical when items with estimates exhibiting differential behavior across test administrations are used as common for deriving equating transformations. This paper reviews the types of effects on IRT item parameter estimates and focuses on the impact of misbehaving or aberrant common items on equating transformations. Implications relating to test validity and the judgmental nature of the decision to keep or discard aberrant common items are discussed, with recommendations for future research into more informed and formal ways of dealing with misbehaving common items.

  6. A Review of the Effects on IRT Item Parameter Estimates with a Focus on Misbehaving Common Items in Test Equating.

    Science.gov (United States)

    Michaelides, Michalis P

    2010-01-01

    Many studies have investigated the topic of change or drift in item parameter estimates in the context of item response theory (IRT). Content effects, such as instructional variation and curricular emphasis, as well as context effects, such as the wording, position, or exposure of an item have been found to impact item parameter estimates. The issue becomes more critical when items with estimates exhibiting differential behavior across test administrations are used as common for deriving equating transformations. This paper reviews the types of effects on IRT item parameter estimates and focuses on the impact of misbehaving or aberrant common items on equating transformations. Implications relating to test validity and the judgmental nature of the decision to keep or discard aberrant common items are discussed, with recommendations for future research into more informed and formal ways of dealing with misbehaving common items.

  7. The medial temporal lobes distinguish between within-item and item-context relations during autobiographical memory retrieval.

    Science.gov (United States)

    Sheldon, Signy; Levine, Brian

    2015-12-01

    During autobiographical memory retrieval, the medial temporal lobes (MTL) relate together multiple event elements, including object (within-item relations) and context (item-context relations) information, to create a cohesive memory. There is consistent support for a functional specialization within the MTL according to these relational processes, much of which comes from recognition memory experiments. In this study, we compared brain activation patterns associated with retrieving within-item relations (i.e., associating conceptual and sensory-perceptual object features) and item-context relations (i.e., spatial relations among objects) with respect to naturalistic autobiographical retrieval. We developed a novel paradigm that cued participants to retrieve information about past autobiographical events, non-episodic within-item relations, and non-episodic item-context relations with the perceptuomotor aspects of retrieval equated across these conditions. We used multivariate analysis techniques to extract common and distinct patterns of activity among these conditions within the MTL and across the whole brain, both in terms of spatial and temporal patterns of activity. The anterior MTL (perirhinal cortex and anterior hippocampus) was preferentially recruited for generating within-item relations later in retrieval whereas the posterior MTL (posterior parahippocampal cortex and posterior hippocampus) was preferentially recruited for generating item-context relations across the retrieval phase. These findings provide novel evidence for functional specialization within the MTL with respect to naturalistic memory retrieval. © 2015 Wiley Periodicals, Inc.

  8. Individual Differences in Numeracy and Cognitive Reflection, with Implications for Biases and Fallacies in Probability Judgment.

    Science.gov (United States)

    Liberali, Jordana M; Reyna, Valerie F; Furlan, Sarah; Stein, Lilian M; Pardo, Seth T

    2012-10-01

    Despite evidence that individual differences in numeracy affect judgment and decision making, the precise mechanisms underlying how such differences produce biases and fallacies remain unclear. Numeracy scales have been developed without sufficient theoretical grounding, and their relation to other cognitive tasks that assess numerical reasoning, such as the Cognitive Reflection Test (CRT), has been debated. In studies conducted in Brazil and in the USA, we administered an objective Numeracy Scale (NS), Subjective Numeracy Scale (SNS), and the CRT to assess whether they measured similar constructs. The Rational-Experiential Inventory, inhibition (go/no-go task), and intelligence were also investigated. By examining factor solutions along with frequent errors for questions that loaded on each factor, we characterized different types of processing captured by different items on these scales. We also tested the predictive power of these factors to account for biases and fallacies in probability judgments. In the first study, 259 Brazilian undergraduates were tested on the conjunction and disjunction fallacies. In the second study, 190 American undergraduates responded to a ratio-bias task. Across the different samples, the results were remarkably similar. The results indicated that the CRT is not just another numeracy scale, that objective and subjective numeracy scales do not measure an identical construct, and that different aspects of numeracy predict different biases and fallacies. Dimensions of numeracy included computational skills such as multiplying, proportional reasoning, mindless or verbatim matching, metacognitive monitoring, and understanding the gist of relative magnitude, consistent with dual-process theories such as fuzzy-trace theory.

  9. Individual Differences in Numeracy and Cognitive Reflection, with Implications for Biases and Fallacies in Probability Judgment

    Science.gov (United States)

    LIBERALI, JORDANA M.; REYNA, VALERIE F.; FURLAN, SARAH; STEIN, LILIAN M.; PARDO, SETH T.

    2013-01-01

    Despite evidence that individual differences in numeracy affect judgment and decision making, the precise mechanisms underlying how such differences produce biases and fallacies remain unclear. Numeracy scales have been developed without sufficient theoretical grounding, and their relation to other cognitive tasks that assess numerical reasoning, such as the Cognitive Reflection Test (CRT), has been debated. In studies conducted in Brazil and in the USA, we administered an objective Numeracy Scale (NS), Subjective Numeracy Scale (SNS), and the CRT to assess whether they measured similar constructs. The Rational–Experiential Inventory, inhibition (go/no-go task), and intelligence were also investigated. By examining factor solutions along with frequent errors for questions that loaded on each factor, we characterized different types of processing captured by different items on these scales. We also tested the predictive power of these factors to account for biases and fallacies in probability judgments. In the first study, 259 Brazilian undergraduates were tested on the conjunction and disjunction fallacies. In the second study, 190 American undergraduates responded to a ratio-bias task. Across the different samples, the results were remarkably similar. The results indicated that the CRT is not just another numeracy scale, that objective and subjective numeracy scales do not measure an identical construct, and that different aspects of numeracy predict different biases and fallacies. Dimensions of numeracy included computational skills such as multiplying, proportional reasoning, mindless or verbatim matching, metacognitive monitoring, and understanding the gist of relative magnitude, consistent with dual-process theories such as fuzzy-trace theory. PMID:23878413

  10. Applying Hierarchical Model Calibration to Automatically Generated Items.

    Science.gov (United States)

    Williamson, David M.; Johnson, Matthew S.; Sinharay, Sandip; Bejar, Isaac I.

    This study explored the application of hierarchical model calibration as a means of reducing, if not eliminating, the need for pretesting of automatically generated items from a common item model prior to operational use. Ultimately the successful development of automatic item generation (AIG) systems capable of producing items with highly similar…

  11. Science Library of Test Items. Volume Eighteen. A Collection of Multiple Choice Test Items Relating Mainly to Chemistry.

    Science.gov (United States)

    New South Wales Dept. of Education, Sydney (Australia).

    As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…

  12. Science Library of Test Items. Volume Seventeen. A Collection of Multiple Choice Test Items Relating Mainly to Biology.

    Science.gov (United States)

    New South Wales Dept. of Education, Sydney (Australia).

    As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…

  13. Science Library of Test Items. Volume Nineteen. A Collection of Multiple Choice Test Items Relating Mainly to Geology.

    Science.gov (United States)

    New South Wales Dept. of Education, Sydney (Australia).

    As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…

  14. Sensitivity analysis for publication bias in meta-analysis of diagnostic studies for a continuous biomarker.

    Science.gov (United States)

    Hattori, Satoshi; Zhou, Xiao-Hua

    2018-02-10

    Publication bias is one of the most important issues in meta-analysis. For standard meta-analyses to examine intervention effects, the funnel plot and the trim-and-fill method are simple and widely used techniques for assessing and adjusting for the influence of publication bias, respectively. However, their use may be subjective and can then produce misleading insights. To make a more objective inference for publication bias, various sensitivity analysis methods have been proposed, including the Copas selection model. For meta-analysis of diagnostic studies evaluating a continuous biomarker, the summary receiver operating characteristic (sROC) curve is a very useful method in the presence of heterogeneous cutoff values. To our best knowledge, no methods are available for evaluation of influence of publication bias on estimation of the sROC curve. In this paper, we introduce a Copas-type selection model for meta-analysis of diagnostic studies and propose a sensitivity analysis method for publication bias. Our method enables us to assess the influence of publication bias on the estimation of the sROC curve and then judge whether the result of the meta-analysis is sufficiently confident or should be interpreted with much caution. We illustrate our proposed method with real data. Copyright © 2017 John Wiley & Sons, Ltd.

  15. Attribution bias and social anxiety in schizophrenia

    Directory of Open Access Journals (Sweden)

    Amelie M. Achim

    2016-06-01

    Full Text Available Studies on attribution biases in schizophrenia have produced mixed results, whereas such biases have been more consistently reported in people with anxiety disorders. Anxiety comorbidities are frequent in schizophrenia, in particular social anxiety disorder, which could influence their patterns of attribution biases. The objective of the present study was thus to determine if individuals with schizophrenia and a comorbid social anxiety disorder (SZ+ show distinct attribution biases as compared with individuals with schizophrenia without social anxiety (SZ− and healthy controls. Attribution biases were assessed with the Internal, Personal, and Situational Attributions Questionnaire in 41 individual with schizophrenia and 41 healthy controls. Results revealed the lack of the normal externalizing bias in SZ+, whereas SZ− did not significantly differ from healthy controls on this dimension. The personalizing bias was not influenced by social anxiety but was in contrast linked with delusions, with a greater personalizing bias in individuals with current delusions. Future studies on attribution biases in schizophrenia should carefully document symptom presentation, including social anxiety.

  16. Modeling Local Item Dependence in Cloze and Reading Comprehension Test Items Using Testlet Response Theory

    Science.gov (United States)

    Baghaei, Purya; Ravand, Hamdollah

    2016-01-01

    In this study the magnitudes of local dependence generated by cloze test items and reading comprehension items were compared and their impact on parameter estimates and test precision was investigated. An advanced English as a foreign language reading comprehension test containing three reading passages and a cloze test was analyzed with a…

  17. Suspected survivor bias in case-control studies: stratify on survival time and use a negative control.

    Science.gov (United States)

    van Rein, Nienke; Cannegieter, Suzanne C; Rosendaal, Frits R; Reitsma, Pieter H; Lijfering, Willem M

    2014-02-01

    Selection bias in case-control studies occurs when control selection is inappropriate. However, selection bias due to improper case sampling is less well recognized. We describe how to recognize survivor bias (i.e., selection on exposed cases) and illustrate this with an example study. A case-control study was used to analyze the effect of statins on major bleedings during treatment with vitamin K antagonists. A total of 110 patients who experienced such bleedings were included 18-1,018 days after the bleeding complication and matched to 220 controls. A protective association of major bleeding for exposure to statins (odds ratio [OR]: 0.56; 95% confidence interval: 0.29-1.08) was found, which did not become stronger after adjustment for confounding factors. These observations lead us to suspect survivor bias. To identify this bias, results were stratified on time between bleeding event and inclusion, and repeated for a negative control (an exposure not related to survival): blood group non-O. The ORs for exposure to statins increased gradually to 1.37 with shorter time between outcome and inclusion, whereas ORs for the negative control remained constant, confirming our hypothesis. We recommend the presented method to check for overoptimistic results, that is, survivor bias in case-control studies. Copyright © 2014 Elsevier Inc. All rights reserved.

  18. Attentional Bias for Reward and Punishment in Overweight and Obesity: The TRAILS Study.

    Science.gov (United States)

    Jonker, Nienke C; Glashouwer, Klaske A; Ostafin, Brian D; van Hemel-Ruiter, Madelon E; Smink, Frédérique R E; Hoek, Hans W; de Jong, Peter J

    2016-01-01

    More than 80% of obese adolescents will become obese adults, and it is therefore important to enhance insight into characteristics that underlie the development and maintenance of overweight and obesity at a young age. The current study is the first to focus on attentional biases towards rewarding and punishing cues as potentially important factors. Participants were young adolescents (N = 607) who were followed from the age of 13 until the age of 19, and completed a motivational game indexing the attentional bias to general cues of reward and punishment. Additionally, self-reported reward and punishment sensitivity was measured. This study showed that attentional biases to cues that signal reward or punishment and self-reported reward and punishment sensitivity were not related to body mass index or the change in body mass index over six years in adolescents. Thus, attentional bias to cues of reward and cues of punishment, and self-reported reward and punishment sensitivity, do not seem to be crucial factors in the development and maintenance of overweight and obesity in adolescents. Exploratory analyses of the current study suggest that the amount of effort to gain reward and to avoid punishment may play a role in the development and maintenance of overweight and obesity. However, since the effort measure was a construct based on face validity and has not been properly validated, more studies are necessary before firm conclusions can be drawn.

  19. Cognitive Bias in Systems Verification

    Science.gov (United States)

    Larson, Steve

    2012-01-01

    Working definition of cognitive bias: Patterns by which information is sought and interpreted that can lead to systematic errors in decisions. Cognitive bias is used in diverse fields: Economics, Politics, Intelligence, Marketing, to name a few. Attempts to ground cognitive science in physical characteristics of the cognitive apparatus exceed our knowledge. Studies based on correlations; strict cause and effect is difficult to pinpoint. Effects cited in the paper and discussed here have been replicated many times over, and appear sound. Many biases have been described, but it is still unclear whether they are all distinct. There may only be a handful of fundamental biases, which manifest in various ways. Bias can effect system verification in many ways . Overconfidence -> Questionable decisions to deploy. Availability -> Inability to conceive critical tests. Representativeness -> Overinterpretation of results. Positive Test Strategies -> Confirmation bias. Debiasing at individual level very difficult. The potential effect of bias on the verification process can be managed, but not eliminated. Worth considering at key points in the process.

  20. Within-item strategy switching in arithmetic: a comparative study in children

    Science.gov (United States)

    Ardiale, Eléonore; Lemaire, Patrick

    2013-01-01

    The present study aimed at determining whether (1) children were able to interrupt a strategy execution to switch and choose another better strategy, and (2) their ability to switch strategy within-item improved with age. Third, fifth, and seventh graders performed a computational estimation task in which they had to provide the better estimates to two-digit addition problems (e.g., 32 + 54) while using the rounding-down (e.g., 30 + 50) or the rounding-up strategy (e.g., 40 + 60). After having executing the cued strategy (e.g., 30 + 50) during 1,000 ms, participants were given the opportunity to switch to another better strategy (e.g., 40 + 60) or to repeat the same strategy (e.g., 30 + 50). The results showed that children switched strategies within items, and were able to switch more often when the addition problems were cued with the poorer strategy (e.g., 40 + 60 for 32 + 54) than when cued with the better strategy (e.g., 30 + 50). As they grew up, children based their decisions to switch strategies more often on whether the 1,000-ms strategy execution concerned the better strategy or strategy difficulty (i.e., the rounding-up strategy). These findings have important implications to further understand mechanisms underlying within-item strategy switching as well as strategic variations in children. PMID:24368906

  1. SLS Trade Study 0058: Day of Launch (DOL) Wind Biasing

    Science.gov (United States)

    Decker, Ryan K.; Duffin, Paul; Hill, Ashley; Beck, Roger; Dukeman, Greg

    2014-01-01

    SLS heritage hardware and legacy designs have shown load exceedances at several locations during Design Analysis Cycles (DAC): MPCV Z bending moments; ICPS Electro-Mechanical Actuator (EMA) loads; Core Stage loads just downstream of Booster forward interface. SLS Buffet Loads Mitigation Task Team (BLMTT) tasked to study issue. Identified low frequency buffet load responses are a function of the vehicle's total angle of attack (AlphaTotal). SLS DOL Wind Biasing Trade team to analyze DOL wind biasing methods to limit maximum AlphaTotal in the M0.8 - 2.0 altitude region for EM-1 and EM-2 missions through investigating: Trajectory design process; Wind wavelength filtering options; Launch availability; DOL process to achieve shorter processing/uplink timeline. Trade Team consisted of personnel supporting SLS, MPCV, GSDO programs.

  2. Selection bias in population-based cancer case-control studies due to incomplete sampling frame coverage.

    Science.gov (United States)

    Walsh, Matthew C; Trentham-Dietz, Amy; Gangnon, Ronald E; Nieto, F Javier; Newcomb, Polly A; Palta, Mari

    2012-06-01

    Increasing numbers of individuals are choosing to opt out of population-based sampling frames due to privacy concerns. This is especially a problem in the selection of controls for case-control studies, as the cases often arise from relatively complete population-based registries, whereas control selection requires a sampling frame. If opt out is also related to risk factors, bias can arise. We linked breast cancer cases who reported having a valid driver's license from the 2004-2008 Wisconsin women's health study (N = 2,988) with a master list of licensed drivers from the Wisconsin Department of Transportation (WDOT). This master list excludes Wisconsin drivers that requested their information not be sold by the state. Multivariate-adjusted selection probability ratios (SPR) were calculated to estimate potential bias when using this driver's license sampling frame to select controls. A total of 962 cases (32%) had opted out of the WDOT sampling frame. Cases age <40 (SPR = 0.90), income either unreported (SPR = 0.89) or greater than $50,000 (SPR = 0.94), lower parity (SPR = 0.96 per one-child decrease), and hormone use (SPR = 0.93) were significantly less likely to be covered by the WDOT sampling frame (α = 0.05 level). Our results indicate the potential for selection bias due to differential opt out between various demographic and behavioral subgroups of controls. As selection bias may differ by exposure and study base, the assessment of potential bias needs to be ongoing. SPRs can be used to predict the direction of bias when cases and controls stem from different sampling frames in population-based case-control studies.

  3. Effects of social approval bias on self-reported fruit and vegetable consumption: a randomized controlled trial

    Directory of Open Access Journals (Sweden)

    Marcus Al C

    2008-06-01

    Full Text Available Abstract Background Self-reports of dietary intake in the context of nutrition intervention research can be biased by the tendency of respondents to answer consistent with expected norms (social approval bias. The objective of this study was to assess the potential influence of social approval bias on self-reports of fruit and vegetable intake obtained using both food frequency questionnaire (FFQ and 24-hour recall methods. Methods A randomized blinded trial compared reported fruit and vegetable intake among subjects exposed to a potentially biasing prompt to that from control subjects. Subjects included 163 women residing in Colorado between 35 and 65 years of age who were randomly selected and recruited by telephone to complete what they were told would be a future telephone survey about health. Randomly half of the subjects then received a letter prior to the interview describing this as a study of fruit and vegetable intake. The letter included a brief statement of the benefits of fruits and vegetables, a 5-A-Day sticker, and a 5-a-Day refrigerator magnet. The remainder received the same letter, but describing the study purpose only as a more general nutrition survey, with neither the fruit and vegetable message nor the 5-A-Day materials. Subjects were then interviewed on the telephone within 10 days following the letters using an eight-item FFQ and a limited 24-hour recall to estimate fruit and vegetable intake. All interviewers were blinded to the treatment condition. Results By the FFQ method, subjects who viewed the potentially biasing prompts reported consuming more fruits and vegetables than did control subjects (5.2 vs. 3.7 servings per day, p Conclusion Self-reports of fruit and vegetable intake using either a food frequency questionnaire or a limited 24-hour recall are both susceptible to substantial social approval bias. Valid assessments of intervention effects in nutritional intervention trials may require objective measures of

  4. Does remembering emotional items impair recall of same-emotion items?

    Science.gov (United States)

    Sison, Jo Ann G; Mather, Mara

    2007-04-01

    In the part-set cuing effect, cuing a subset of previously studied items impairs recall of the remaining noncued items. This experiment reveals that cuing participants with previously-studied emotional pictures (e.g., fear-evoking pictures of people) can impair recall of pictures involving the same emotion but different content (e.g., fear-evoking pictures of animals). This indicates that new events can be organized in memory using emotion as a grouping function to create associations. However, whether new information is organized in memory along emotional or nonemotional lines appears to be a flexible process that depends on people's current focus. Mentioning in the instructions that the pictures were either amusement- or fear-related led to memory impairment for pictures with the same emotion as cued pictures, whereas mentioning that the pictures depicted either animals or people led to memory impairment for pictures with the same type of actor.

  5. Effectiveness of two web-based cognitive bias modification interventions targeting approach and attentional bias in gambling problems: study protocol for a pilot randomised controlled trial.

    Science.gov (United States)

    Boffo, Marilisa; Willemen, Ronny; Pronk, Thomas; Wiers, Reinout W; Dom, Geert

    2017-10-03

    Disordered gamblers have phenotypical and pathological similarities to those with substance use disorders (SUD), including exaggerated automatic cognitive processing of motivationally salient gambling cues in the environment (i.e., attentional and approach bias). Cognitive bias modification (CBM) is a family of computerised interventions that have proved effective in successfully re-training these automatic cognitive biases in SUD. CBM interventions can, in principle, be administered online, thus showing potential of being a low-cost, low-threshold addition to conventional treatments. This paper presents the design of a pilot randomised controlled trial exploring the effectiveness of two web-based CBM interventions targeting attentional and approach bias towards gambling cues in a sample of Dutch and Belgian problematic and pathological gamblers. Participants (N = 182) are community-recruited adults experiencing gambling problems, who have gambled at least twice in the past 6 months and are motivated to change their gambling behaviour. After a baseline assessment session, participants are randomly assigned to one of four experimental conditions (attentional or approach bias training, or the placebo version of the two trainings) and complete six sessions of training. At baseline and before each training session, participants receive automated personalised feedback on their gambling motives and reasons to quit or reduce gambling. The post-intervention, 1-month, and 3-month follow-up assessments will examine changes in gambling behaviour, with frequency and expenditure as primary outcomes, and depressive symptoms and gambling-related attentional and approach biases as secondary outcomes. Secondary analyses will explore possible moderators (interference control capacity and trait impulsivity) and mediators (change in cognitive bias) of training effects on the primary outcomes. This study is the first to explore the effectiveness of an online CBM intervention for

  6. More is not Always Better: The Relation between Item Response and Item Response Time in Raven’s Matrices

    Directory of Open Access Journals (Sweden)

    Frank Goldhammer

    2015-03-01

    Full Text Available The role of response time in completing an item can have very different interpretations. Responding more slowly could be positively related to success as the item is answered more carefully. However, the association may be negative if working faster indicates higher ability. The objective of this study was to clarify the validity of each assumption for reasoning items considering the mode of processing. A total of 230 persons completed a computerized version of Raven’s Advanced Progressive Matrices test. Results revealed that response time overall had a negative effect. However, this effect was moderated by items and persons. For easy items and able persons the effect was strongly negative, for difficult items and less able persons it was less negative or even positive. The number of rules involved in a matrix problem proved to explain item difficulty significantly. Most importantly, a positive interaction effect between the number of rules and item response time indicated that the response time effect became less negative with an increasing number of rules. Moreover, exploratory analyses suggested that the error type influenced the response time effect.

  7. Sympathetic bias.

    Science.gov (United States)

    Levy, David M; Peart, Sandra J

    2008-06-01

    We wish to deal with investigator bias in a statistical context. We sketch how a textbook solution to the problem of "outliers" which avoids one sort of investigator bias, creates the temptation for another sort. We write down a model of the approbation seeking statistician who is tempted by sympathy for client to violate the disciplinary standards. We give a simple account of one context in which we might expect investigator bias to flourish. Finally, we offer tentative suggestions to deal with the problem of investigator bias which follow from our account. As we have given a very sparse and stylized account of investigator bias, we ask what might be done to overcome this limitation.

  8. Cognitive interviewing methodology in the development of a pediatric item bank: a patient reported outcomes measurement information system (PROMIS study

    Directory of Open Access Journals (Sweden)

    DeWalt Darren A

    2009-01-01

    Full Text Available Abstract Background The evaluation of patient-reported outcomes (PROs in health care has seen greater use in recent years, and methods to improve the reliability and validity of PRO instruments are advancing. This paper discusses the cognitive interviewing procedures employed by the Patient Reported Outcomes Measurement Information System (PROMIS pediatrics group for the purpose of developing a dynamic, electronic item bank for field testing with children and adolescents using novel computer technology. The primary objective of this study was to conduct cognitive interviews with children and adolescents to gain feedback on items measuring physical functioning, emotional health, social health, fatigue, pain, and asthma-specific symptoms. Methods A total of 88 cognitive interviews were conducted with 77 children and adolescents across two sites on 318 items. From this initial item bank, 25 items were deleted and 35 were revised and underwent a second round of cognitive interviews. A total of 293 items were retained for field testing. Results Children as young as 8 years of age were able to comprehend the majority of items, response options, directions, recall period, and identify problems with language that was difficult for them to understand. Cognitive interviews indicated issues with item comprehension on several items which led to alternative wording for these items. Conclusion Children ages 8–17 years were able to comprehend most item stems and response options in the present study. Field testing with the resulting items and response options is presently being conducted as part of the PROMIS Pediatric Item Bank development process.

  9. Differential item functioning magnitude and impact measures from item response theory models.

    Science.gov (United States)

    Kleinman, Marjorie; Teresi, Jeanne A

    2016-01-01

    Measures of magnitude and impact of differential item functioning (DIF) at the item and scale level, respectively are presented and reviewed in this paper. Most measures are based on item response theory models. Magnitude refers to item level effect sizes, whereas impact refers to differences between groups at the scale score level. Reviewed are magnitude measures based on group differences in the expected item scores and impact measures based on differences in the expected scale scores. The similarities among these indices are demonstrated. Various software packages are described that provide magnitude and impact measures, and new software presented that computes all of the available statistics conveniently in one program with explanations of their relationships to one another.

  10. Exploring Selective Exposure and Confirmation Bias as Processes Underlying Employee Work Happiness: An Intervention Study.

    Science.gov (United States)

    Williams, Paige; Kern, Margaret L; Waters, Lea

    2016-01-01

    Employee psychological capital (PsyCap), perceptions of organizational virtue (OV), and work happiness have been shown to be associated within and over time. This study examines selective exposure and confirmation bias as potential processes underlying PsyCap, OV, and work happiness associations. As part of a quasi-experimental study design, school staff (N = 69) completed surveys at three time points. After the first assessment, some staff (n = 51) completed a positive psychology training intervention. Results of descriptive statistics, correlation, and regression analyses on the intervention group provide some support for selective exposure and confirmation bias as explanatory mechanisms. In focusing on the processes through which employee attitudes may influence work happiness this study advances theoretical understanding, specifically of selective exposure and confirmation bias in a field study context.

  11. AN INVESTIGATION OF ITEM BIAS.

    Science.gov (United States)

    CLEARY, T. ANNE; HILTON, THOMAS L.

    THE PURPOSE OF THIS INVESTIGATION WAS TO DETERMINE WHETHER THE PRELIMINARY SCHOLASTIC APTITUDE TEST PRESENTED A DIFFERENTIAL DIFFICULTY FOR RACIAL AND SOCIOECONOMIC GROUPS. THE SUBJECTS WERE TWO GROUPS TOTALING 1,410 NEGRO AND WHITE HIGH SCHOOL SENIORS IN AN INTEGRATED HIGH SCHOOL WHO HAD TAKEN THE TEST. THEY WERE DIVIDED INTO THREE SOCIOECONOMIC…

  12. ITEM LEVEL DIAGNOSTICS AND MODEL - DATA FIT IN ITEM ...

    African Journals Online (AJOL)

    Global Journal

    Item response theory (IRT) is a framework for modeling and analyzing item response ... data. Though, there is an argument that the evaluation of fit in IRT modeling has been ... National Council on Measurement in Education ... model data fit should be based on three types of ... prediction should be assessed through the.

  13. A Multilevel Multidimensional Item Response Theory Model to Address the Role of Response Style on Measurement of Attitudes in PISA 2006

    Science.gov (United States)

    Lu, Yi

    2012-01-01

    Cross-national comparisons of responses to survey items are often affected by response style, particularly extreme response style (ERS). ERS varies across cultures, and has the potential to bias inferences in cross-national comparisons. For example, in both PISA and TIMSS assessments, it has been documented that when examined within countries,…

  14. Work ability as prognostic risk marker of disability pension : Single-item work ability score versus multi-item work ability index

    NARCIS (Netherlands)

    Roelen, C.A.M.; Rhenen, van W.; Groothoff, J.W.; Klink, van der J.J.L.; Twisk, W.R.; Heymans, M.W.

    2014-01-01

    Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP.

  15. Losing Items in the Psychogeriatric Nursing Home

    Directory of Open Access Journals (Sweden)

    J. van Hoof PhD

    2016-09-01

    Full Text Available Introduction: Losing items is a time-consuming occurrence in nursing homes that is ill described. An explorative study was conducted to investigate which items got lost by nursing home residents, and how this affects the residents and family caregivers. Method: Semi-structured interviews and card sorting tasks were conducted with 12 residents with early-stage dementia and 12 family caregivers. Thematic analysis was applied to the outcomes of the sessions. Results: The participants stated that numerous personal items and assistive devices get lost in the nursing home environment, which had various emotional, practical, and financial implications. Significant amounts of time are spent on trying to find items, varying from 1 hr up to a couple of weeks. Numerous potential solutions were identified by the interviewees. Discussion: Losing items often goes together with limitations to the participation of residents. Many family caregivers are reluctant to replace lost items, as these items may get lost again.

  16. Examination of the PROMIS upper extremity item bank.

    Science.gov (United States)

    Hung, Man; Voss, Maren W; Bounsanga, Jerry; Crum, Anthony B; Tyser, Andrew R

    Clinical measurement. The psychometric properties of the PROMIS v1.2 UE item bank were tested on various samples prior to its release, but have not been fully evaluated among the orthopaedic population. This study assesses the performance of the UE item bank within the UE orthopaedic patient population. The UE item bank was administered to 1197 adult patients presenting to a tertiary orthopaedic clinic specializing in hand and UE conditions and was examined using traditional statistics and Rasch analysis. The UE item bank fits a unidimensional model (outfit MNSQ range from 0.64 to 1.70) and has adequate reliabilities (person = 0.84; item = 0.82) and local independence (item residual correlations range from -0.37 to 0.34). Only one item exhibits gender differential item functioning. Most items target low levels of function. The UE item bank is a useful clinical assessment tool. Additional items covering higher functions are needed to enhance validity. Supplemental testing is recommended for patients at higher levels of function until more high function UE items are developed. 2c. Copyright © 2016 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.

  17. Bias-correction in vector autoregressive models

    DEFF Research Database (Denmark)

    Engsted, Tom; Pedersen, Thomas Quistgaard

    2014-01-01

    We analyze the properties of various methods for bias-correcting parameter estimates in both stationary and non-stationary vector autoregressive models. First, we show that two analytical bias formulas from the existing literature are in fact identical. Next, based on a detailed simulation study......, we show that when the model is stationary this simple bias formula compares very favorably to bootstrap bias-correction, both in terms of bias and mean squared error. In non-stationary models, the analytical bias formula performs noticeably worse than bootstrapping. Both methods yield a notable...... improvement over ordinary least squares. We pay special attention to the risk of pushing an otherwise stationary model into the non-stationary region of the parameter space when correcting for bias. Finally, we consider a recently proposed reduced-bias weighted least squares estimator, and we find...

  18. Re-evaluating a vision-related quality of life questionnaire with item response theory (IRT and differential item functioning (DIF analyses

    Directory of Open Access Journals (Sweden)

    Knol Dirk L

    2011-09-01

    Full Text Available Abstract Background For the Low Vision Quality Of Life questionnaire (LVQOL it is unknown whether the psychometric properties are satisfactory when an item response theory (IRT perspective is considered. This study evaluates some essential psychometric properties of the LVQOL questionnaire in an IRT model, and investigates differential item functioning (DIF. Methods Cross-sectional data were used from an observational study among visually-impaired patients (n = 296. Calibration was performed for every dimension of the LVQOL in the graded response model. Item goodness-of-fit was assessed with the S-X2-test. DIF was assessed on relevant background variables (i.e. age, gender, visual acuity, eye condition, rehabilitation type and administration type with likelihood-ratio tests for DIF. The magnitude of DIF was interpreted by assessing the largest difference in expected scores between subgroups. Measurement precision was assessed by presenting test information curves; reliability with the index of subject separation. Results All items of the LVQOL dimensions fitted the model. There was significant DIF on several items. For two items the maximum difference between expected scores exceeded one point, and DIF was found on multiple relevant background variables. Item 1 'Vision in general' from the "Adjustment" dimension and item 24 'Using tools' from the "Reading and fine work" dimension were removed. Test information was highest for the "Reading and fine work" dimension. Indices for subject separation ranged from 0.83 to 0.94. Conclusions The items of the LVQOL showed satisfactory item fit to the graded response model; however, two items were removed because of DIF. The adapted LVQOL with 21 items is DIF-free and therefore seems highly appropriate for use in heterogeneous populations of visually impaired patients.

  19. Effect of standardized training on the reliability of the Cochrane risk of bias assessment tool: a study protocol.

    Science.gov (United States)

    da Costa, Bruno R; Resta, Nina M; Beckett, Brooke; Israel-Stahre, Nicholas; Diaz, Alison; Johnston, Bradley C; Egger, Matthias; Jüni, Peter; Armijo-Olivo, Susan

    2014-12-13

    The Cochrane risk of bias (RoB) tool has been widely embraced by the systematic review community, but several studies have reported that its reliability is low. We aim to investigate whether training of raters, including objective and standardized instructions on how to assess risk of bias, can improve the reliability of this tool. We describe the methods that will be used in this investigation and present an intensive standardized training package for risk of bias assessment that could be used by contributors to the Cochrane Collaboration and other reviewers. This is a pilot study. We will first perform a systematic literature review to identify randomized clinical trials (RCTs) that will be used for risk of bias assessment. Using the identified RCTs, we will then do a randomized experiment, where raters will be allocated to two different training schemes: minimal training and intensive standardized training. We will calculate the chance-corrected weighted Kappa with 95% confidence intervals to quantify within- and between-group Kappa agreement for each of the domains of the risk of bias tool. To calculate between-group Kappa agreement, we will use risk of bias assessments from pairs of raters after resolution of disagreements. Between-group Kappa agreement will quantify the agreement between the risk of bias assessment of raters in the training groups and the risk of bias assessment of experienced raters. To compare agreement of raters under different training conditions, we will calculate differences between Kappa values with 95% confidence intervals. This study will investigate whether the reliability of the risk of bias tool can be improved by training raters using standardized instructions for risk of bias assessment. One group of inexperienced raters will receive intensive training on risk of bias assessment and the other will receive minimal training. By including a control group with minimal training, we will attempt to mimic what many review authors

  20. Science Library of Test Items. Volume Twenty-Two. A Collection of Multiple Choice Test Items Relating Mainly to Skills.

    Science.gov (United States)

    New South Wales Dept. of Education, Sydney (Australia).

    As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…

  1. Science Library of Test Items. Volume Twenty. A Collection of Multiple Choice Test Items Relating Mainly to Physics, 1.

    Science.gov (United States)

    New South Wales Dept. of Education, Sydney (Australia).

    As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…

  2. Heart Failure Therapeutics on the Basis of a Biased Ligand of the Angiotensin-2 Type 1 Receptor Rationale and Design of the BLAST-AHF Study (Biased Ligand of the Angiotensin Receptor Study in Acute Heart Failure)

    NARCIS (Netherlands)

    Felker, G. Michael; Butler, Javed; Collins, Sean P.; Cotter, Gad; Davison, Beth A.; Ezekowitz, Justin A.; Filippatos, Gerasimos; Levy, Phillip D.; Metra, Marco; Ponikowski, Piotr; Soergel, David G.; Teerlink, John R.; Violin, Jonathan D.; Voors, Adriaan A.; Pang, Peter S.

    The BLAST-AHF (Biased Ligand of the Angiotensin Receptor Study in Acute Heart Failure) study is designed to test the efficacy and safety of TRV027, a novel biased ligand of the angiotensin-2 type 1 receptor, in patients with acute heart failure (AHF). AHF remains a major public health problem, and

  3. Effectiveness of Item Response Theory (IRT) Proficiency Estimation Methods under Adaptive Multistage Testing. Research Report. ETS RR-15-11

    Science.gov (United States)

    Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry

    2015-01-01

    The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…

  4. Negative effects of item repetition on source memory.

    Science.gov (United States)

    Kim, Kyungmi; Yi, Do-Joon; Raye, Carol L; Johnson, Marcia K

    2012-08-01

    In the present study, we explored how item repetition affects source memory for new item-feature associations (picture-location or picture-color). We presented line drawings varying numbers of times in Phase 1. In Phase 2, each drawing was presented once with a critical new feature. In Phase 3, we tested memory for the new source feature of each item from Phase 2. Experiments 1 and 2 demonstrated and replicated the negative effects of item repetition on incidental source memory. Prior item repetition also had a negative effect on source memory when different source dimensions were used in Phases 1 and 2 (Experiment 3) and when participants were explicitly instructed to learn source information in Phase 2 (Experiments 4 and 5). Importantly, when the order between Phases 1 and 2 was reversed, such that item repetition occurred after the encoding of critical item-source combinations, item repetition no longer affected source memory (Experiment 6). Overall, our findings did not support predictions based on item predifferentiation, within-dimension source interference, or general interference from multiple traces of an item. Rather, the findings were consistent with the idea that prior item repetition reduces attention to subsequent presentations of the item, decreasing the likelihood that critical item-source associations will be encoded.

  5. Work ability as prognostic risk marker of disability pension: single-item work ability score versus multi-item work ability index

    NARCIS (Netherlands)

    Roelen, C.A.M.; van Rhenen, W.; Groothoff, J.W.; van der Klink, J.J.L.; Twisk, J.W.R.; Heymans, M.W.

    2014-01-01

    Objectives Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP. Methods This

  6. Work ability as prognostic risk marker of disability pension : single-item work ability score versus multi-item work ability index

    NARCIS (Netherlands)

    Roelen, Corne A. M.; van Rhenen, Willem; Groothoff, Johan W.; van der Klink, Jac J. L.; Twisk, Jos W. R.; Heymans, Martijn W.

    Objectives Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP. Methods This

  7. Individuals with knee impairments identify items in need of clarification in the Patient Reported Outcomes Measurement Information System (PROMIS®) pain interference and physical function item banks - a qualitative study.

    Science.gov (United States)

    Lynch, Andrew D; Dodds, Nathan E; Yu, Lan; Pilkonis, Paul A; Irrgang, James J

    2016-05-11

    The content and wording of the Patient Reported Outcome Measurement Information System (PROMIS) Physical Function and Pain Interference item banks have not been qualitatively assessed by individuals with knee joint impairments. The purpose of this investigation was to identify items in the PROMIS Physical Function and Pain Interference Item Banks that are irrelevant, unclear, or otherwise difficult to respond to for individuals with impairment of the knee and to suggest modifications based on cognitive interviews. Twenty-nine individuals with knee joint impairments qualitatively assessed items in the Pain Interference and Physical Function Item Banks in a mixed-methods cognitive interview. Field notes were analyzed to identify themes and frequency counts were calculated to identify items not relevant to individuals with knee joint impairments. Issues with clarity were identified in 23 items in the Physical Function Item Bank, resulting in the creation of 43 new or modified items, typically changing words within the item to be clearer. Interpretation issues included whether or not the knee joint played a significant role in overall health and age/gender differences in items. One quarter of the original items (31 of 124) in the Physical Function Item Bank were identified as irrelevant to the knee joint. All 41 items in the Pain Interference Item Bank were identified as clear, although individuals without significant pain substituted other symptoms which interfered with their life. The Physical Function Item Bank would benefit from additional items that are relevant to individuals with knee joint impairments and, by extension, to other lower extremity impairments. Several issues in clarity were identified that are likely to be present in other patient cohorts as well.

  8. Estimation bias and bias correction in reduced rank autoregressions

    DEFF Research Database (Denmark)

    Nielsen, Heino Bohn

    2017-01-01

    This paper characterizes the finite-sample bias of the maximum likelihood estimator (MLE) in a reduced rank vector autoregression and suggests two simulation-based bias corrections. One is a simple bootstrap implementation that approximates the bias at the MLE. The other is an iterative root...

  9. An experimental study of spider-related covariation bias in 8-to 13-year-old children

    NARCIS (Netherlands)

    Muris, Peter; de Jong, Peter J.; Meesters, Cor; Waterreus, Bregje; van Lubeck, Janet

    2005-01-01

    Covariation bias can be defined as phobic subjects' tendency to overestimate the association between phobic stimuli and aversive outcomes. The current study presents two experiments that examined this type of cognitive bias in children aged 8-13 years (N = 147 in Experiment 1, N = 240 in Experiment

  10. Cognitive bias measurement and social anxiety disorder: Correlating self-report data and attentional bias

    Directory of Open Access Journals (Sweden)

    Alexander Miloff

    2015-09-01

    Full Text Available Social anxiety disorder (SAD and attentional bias are theoretically connected in cognitive behavioral therapeutic models. In fact, there is an emerging field focusing on modifying attentional bias as a stand-alone treatment. However, it is unclear to what degree these attentional biases are present before commencing treatment. The purpose of this study was to measure pre-treatment attentional bias in 153 participants diagnosed with SAD using a home-based Internet version of the dot-probe paradigm. Results showed no significant correlation for attentional bias (towards or away from negative words or faces and the self-rated version of the Liebowitz Social Anxiety Scale (LSAS-SR. However, two positive correlations were found for the secondary measures Generalized Anxiety Disorder 7 (GAD-7 and Patient Health Questionnaire 9 (PHQ-9. These indicated that those with elevated levels of anxiety and depression had a higher bias towards negative faces in neutral–negative and positive–negative valence combinations, respectively. The unreliability of the dot-probe paradigm and home-based Internet delivery are discussed to explain the lack of correlations between LSAS-SR and attentional bias. Changes to the dot-probe task are suggested that could improve reliability.

  11. The Accumulative Effect of Concentric‐Biased and Eccentric‐ Biased Exercise on Cardiorespiratory and Metabolic Responses to Subsequent Low‐Intensity Exercise: A Preliminary Study

    Directory of Open Access Journals (Sweden)

    Gavin James Peter

    2015-12-01

    Full Text Available The study investigated the accumulative effect of concentric-biased and eccentric-biased exercise on cardiorespiratory, metabolic and neuromuscular responses to low-intensity exercise performed hours later. Fourteen young men cycled at low-intensity (~60 rpm at 50% maximal oxygen uptake for 10 min before, and 12 h after: concentric-biased, single-leg cycling exercise (CON (performed ~19:30 h and eccentric-biased, double-leg knee extension exercise (ECC (~06:30 h the following morning. Respiratory measures were sampled breath-by-breath, with oxidation values derived from stoichiometry equations. Knee extensor neuromuscular function was assessed before and after CON and ECC. Cardiorespiratory responses during low-intensity cycling were unchanged by accumulative CON and ECC. The RER was lower during low-intensity exercise 12 h after CON and ECC (0.88 ± 0.08, when compared to baseline (0.92 ± 0.09; p = 0.02. Fat oxidation increased from baseline (0.24 ± 0.2 g·min1 to 12 h after CON and ECC (0.39 ± 0.2 g·min1; p = 0.01. Carbohydrate oxidation decreased from baseline (1.59 ± 0.4 g·min-1 to 12 h after CON and ECC (1.36 ± 0.4 g·min1; p = 0.03. These were accompanied by knee extensor force loss (right leg: -11.6%, p < 0.001; left leg: -10.6%, p = 0.02 and muscle soreness (right leg: 2.5 ± 0.9, p < 0.0001; left leg: 2.3 ± 1.2, p < 0.01. Subsequent concentric-biased and eccentric-biased exercise led to increased fat oxidation and decreased carbohydrate oxidation, without impairing cardiorespiration, during low-intensity cycling. An accumulation of fatiguing and damaging exercise increases fat utilisation during low intensity exercise performed as little as 12 h later.

  12. Bias versus bias: harnessing hindsight to reveal paranormal belief change beyond demand characteristics.

    Science.gov (United States)

    Kane, Michael J; Core, Tammy J; Hunt, R Reed

    2010-04-01

    Psychological change is difficult to assess, in part because self-reported beliefs and attitudes may be biased or distorted. The present study probed belief change, in an educational context, by using the hindsight bias to counter another bias that generally plagues assessment of subjective change. Although research has indicated that skepticism courses reduce paranormal beliefs, those findings may reflect demand characteristics (biases toward desired, skeptical responses). Our hindsight-bias procedure circumvented demand by asking students, following semester-long skepticism (and control) courses, to recall their precourse levels of paranormal belief. People typically remember themselves as previously thinking, believing, and acting as they do now, so current skepticism should provoke false recollections of previous skepticism. Given true belief change, therefore, skepticism students should have remembered themselves as having been more skeptical than they were. They did, at least about paranormal topics that were covered most extensively in the course. Our findings thus show hindsight to be useful in evaluating cognitive change beyond demand characteristics.

  13. Reducing neutron multiplicity counting bias for plutonium warhead authentication

    Energy Technology Data Exchange (ETDEWEB)

    Goettsche, Malte

    2015-06-05

    Confidence in future nuclear arms control agreements could be enhanced by direct verification of warheads. It would include warhead authentication. This is the assessment based on measurements whether a declaration that a specific item is a nuclear warhead is true. An information barrier can be used to protect sensitive information during measurements. It could for example show whether attributes such as a fissile mass exceeding a threshold are met without indicating detailed measurement results. Neutron multiplicity measurements would be able to assess a plutonium fissile mass attribute if it were possible to show that their bias is low. Plutonium measurements have been conducted with the He-3 based Passive Scrap Multiplicity Counter. The measurement data has been used as a reference to test the capacity of the Monte Carlo code MCNPX-PoliMi to simulate neutron multiplicity measurements. The simulation results with their uncertainties are in agreement with the experimental results. It is essential to use cross-sections which include neutron scattering with the detector's polyethylene molecular structure. Further MCNPX-PoliMi simulations have been conducted in order to study bias that occurs when measuring samples with large plutonium masses such as warheads. Simulation results of solid and hollow metal spheres up to 6000 g show that the masses are underpredicted by as much as 20%. The main source of this bias has been identified in the false assumption that the neutron multiplication does not depend on the position where a spontaneous fission event occurred. The multiplication refers to the total number of neutrons leaking a sample after a primary spontaneous fission event, taking induced fission into consideration. The correction of the analysis has been derived and implemented in a MATLAB code. It depends on four geometry-dependent correction coefficients. When the sample configuration is fully known, these can be exactly determined and remove this type of

  14. The risk of bias of animal experiments in implant dentistry: a methodological study.

    Science.gov (United States)

    Faggion, Clovis Mariano; Diaz, Karla Tatiana; Aranda, Luisiana; Gabel, Frank; Listl, Stefan; Alarcón, Marco Antonio

    2017-07-01

    To evaluate the risk of bias (ROB) in reports of randomised controlled trials (RCTs) of animal experiments published in implant dentistry, and to explore the association between animal experiment characteristics and ROB. We searched the MEDLINE (via PubMed), SCOPUS and SciELO databases from 2010 to March 2015 for reports of RCTs of animal experiments published in implant dentistry. We evaluated independently and in duplicate the ROB of these experiments by the use of a tool specifically developed to evaluate ROB in animal studies, the SYRCLE's tool. ROB was judged as low, high or unclear (when there was not enough information to judge ROB). We used univariate and multivariate logistic regression analyses to evaluate the association of specific study characteristics and extent of ROB. We initially selected 850 publications and 161 reports of animal experiments were included. For a total of 1449 entries (records), 486 (34%) were rated as low ROB. High ROB was attributed to 80 (6%) of entries, and 883 (60%) entries were rated as unclear ROB. The characteristics "impact factor" (IF), reporting of standard error (SE) and reporting of confidence interval (CI) were significantly associated with low ROB in some SYRCLE domains. A substantial number of items with unclear ROB were observed in this sample of animal experiments in implant dentistry. Furthermore, the present findings suggest that implant dentistry animal experiments published in journals with higher IF and better report of measures of precision; that is, CI and SE may have lower ROB than those not having these characteristics. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  15. Cross-Cultural Study of Information Processing Biases in Chronic Fatigue Syndrome: Comparison of Dutch and UK Chronic Fatigue Patients.

    Science.gov (United States)

    Hughes, Alicia M; Hirsch, Colette R; Nikolaus, Stephanie; Chalder, Trudie; Knoop, Hans; Moss-Morris, Rona

    2018-02-01

    This study aims to replicate a UK study, with a Dutch sample to explore whether attention and interpretation biases and general attentional control deficits in chronic fatigue syndrome (CFS) are similar across populations and cultures. Thirty eight Dutch CFS participants were compared to 52 CFS and 51 healthy participants recruited from the UK. Participants completed self-report measures of symptoms, functioning, and mood, as well as three experimental tasks (i) visual-probe task measuring attentional bias to illness (somatic symptoms and disability) versus neutral words, (ii) interpretive bias task measuring positive versus somatic interpretations of ambiguous information, and (iii) the Attention Network Test measuring general attentional control. Compared to controls, Dutch and UK participants with CFS showed a significant attentional bias for illness-related words and were significantly more likely to interpret ambiguous information in a somatic way. These effects were not moderated by attentional control. There were no significant differences between the Dutch and UK CFS groups on attentional bias, interpretation bias, or attentional control scores. This study replicated the main findings of the UK study, with a Dutch CFS population, indicating that across these two cultures, people with CFS demonstrate biases in how somatic information is attended to and interpreted. These illness-specific biases appear to be unrelated to general attentional control deficits.

  16. Confirmation bias in studies of nestmate recognition: a cautionary note for research into the behaviour of animals.

    Science.gov (United States)

    van Wilgenburg, Ellen; Elgar, Mark A

    2013-01-01

    Confirmation bias is a tendency of people to interpret information in a way that confirms their expectations. A long recognized phenomenon in human psychology, confirmation bias can distort the results of a study and thus reduce its reliability. While confirmation bias can be avoided by conducting studies blind to treatment groups, this practice is not always used. Surprisingly, this is true of research in animal behaviour, and the extent to which confirmation bias influences research outcomes in this field is rarely investigated. Here we conducted a meta-analysis, using studies on nestmate recognition in ants, to compare the outcomes of studies that were conducted blind with those that were not. Nestmate recognition studies typically perform intra- and inter colony aggression assays, with the a priori expectation that there should be little or no aggression among nestmates. Aggressive interactions between ants can include subtle behaviours such as mandible flaring and recoil, which can be hard to quantify, making these types of assays prone to confirmation bias. Our survey revealed that only 29% of our sample of 79 studies were conducted blind. These studies were more likely to report aggression among nestmates if they were conducted blind (73%) than if they were not (21%). Moreover, we found that the effect size between nestmate and non-nestmate treatment means is significantly lower in experiments conducted blind than those in which colony identity is known (1.38 versus 2.76). We discuss the implications of the impact of confirmation bias for research that attempts to obtain quantitative synthesises of data from different studies.

  17. Confirmation bias in studies of nestmate recognition: a cautionary note for research into the behaviour of animals.

    Directory of Open Access Journals (Sweden)

    Ellen van Wilgenburg

    Full Text Available Confirmation bias is a tendency of people to interpret information in a way that confirms their expectations. A long recognized phenomenon in human psychology, confirmation bias can distort the results of a study and thus reduce its reliability. While confirmation bias can be avoided by conducting studies blind to treatment groups, this practice is not always used. Surprisingly, this is true of research in animal behaviour, and the extent to which confirmation bias influences research outcomes in this field is rarely investigated. Here we conducted a meta-analysis, using studies on nestmate recognition in ants, to compare the outcomes of studies that were conducted blind with those that were not. Nestmate recognition studies typically perform intra- and inter colony aggression assays, with the a priori expectation that there should be little or no aggression among nestmates. Aggressive interactions between ants can include subtle behaviours such as mandible flaring and recoil, which can be hard to quantify, making these types of assays prone to confirmation bias. Our survey revealed that only 29% of our sample of 79 studies were conducted blind. These studies were more likely to report aggression among nestmates if they were conducted blind (73% than if they were not (21%. Moreover, we found that the effect size between nestmate and non-nestmate treatment means is significantly lower in experiments conducted blind than those in which colony identity is known (1.38 versus 2.76. We discuss the implications of the impact of confirmation bias for research that attempts to obtain quantitative synthesises of data from different studies.

  18. Spectroscopic and impedance studies of reverse biased degraded dye solar cells

    CSIR Research Space (South Africa)

    Le Roux, Lukas J

    2011-03-01

    Full Text Available The work that is presented here is focused on the results that were obtained during studies of the performance of Dye Solar Cells under certain reverse bias conditions. This reverse voltage could permanently modify or damage a cell...

  19. Evolution of a Test Item

    Science.gov (United States)

    Spaan, Mary

    2007-01-01

    This article follows the development of test items (see "Language Assessment Quarterly", Volume 3 Issue 1, pp. 71-79 for the article "Test and Item Specifications Development"), beginning with a review of test and item specifications, then proceeding to writing and editing of items, pretesting and analysis, and finally selection of an item for a…

  20. Do pig farmers preferences bias consumer choice for pork? Response to critique of the pork preference studies.

    Science.gov (United States)

    Ngapo, T M; Fortin, J; Martin, J-F

    2010-08-01

    Québec consumers and pig farmers selected their preferred chop from 16 images that had been modified to give 16 treatments: two levels each of fat cover, colour, marbling and drip. The selection process was repeated eight times from different groups of chops. Fat cover (47% preferred lean) and colour (44%, light red) were the most frequently chosen characteristics. No significant differences were observed between farmers and consumers preferences (chi(2) test, Ppreference-based clusters were found; 41% preferring dark red, lean meat and 59%, light red, lean meat, without marbling or drip. Choice-based clusters showed no significant links with either individual socio-demographic items, including pig farmer as occupation, or the three socio-demographic-based clusters observed (chi(2) test, Pconsumers and, therefore, inclusion of pig farmers in consumer panels would not bias consumer choice for pork. Crown Copyright (c) 2010. Published by Elsevier Ltd. All rights reserved.

  1. A Study on the Countermeasures to the Revision of Nuclear Controlled Items

    International Nuclear Information System (INIS)

    Choi, Sun Do; Lim, Dong Hyuk

    2011-01-01

    NSG(Nuclear Suppliers Group) was formed to prevent proliferation in 1977 with nuclear test in India in 1974. INFCIRC/254/Part1 (Trigger List) as guidelines for controlling the nuclear material, reactor and related equipment, reactor nuclear material, reprocessing, enrichment, conversion, molding, heavy water production plant/equipment, technical was released in 1978, and the Export Control guidelines (INFCIRC/ 254/Part2) about Dual-use item which can be used for nuclear development was established in 1992. The two Export Control guidelines are agreements between NSG Participating Governments (PGs), so all PGs have an obligation about implementation of the agreement. In addition, NSG guidelines can be the export base of control law of the member nations including our country which joined in 1995 or matched with it. Recently, NSG is in the progress of the fundamental review of NSG guidelines established in 1978 and 1992. The terms of agreement will be reflected to the domestic legislation through the fundamental review, and it will entail the changes of classification and export license standard of export items. Thus, it was studied about export controlled items review and revise plan for establishing the clear export control guidelines by review of NSG guidelines as follows

  2. Structural Validation of a French Food Frequency Questionnaire of 94 Items

    Directory of Open Access Journals (Sweden)

    Rozenn Gazan

    2017-12-01

    Full Text Available BackgroundFood frequency questionnaires (FFQs are used to estimate the usual food and nutrient intakes over a period of time. Such estimates can suffer from measurement errors, either due to bias induced by respondent’s answers or to errors induced by the structure of the questionnaire (e.g., using a limited number of food items and an aggregated food database with average portion sizes. The “structural validation” presented in this study aims to isolate and quantify the impact of the inherent structure of a FFQ on the estimation of food and nutrient intakes, independently of respondent’s perception of the questionnaire.MethodsA semi-quantitative FFQ (n = 94 items, including 50 items with questions on portion sizes and an associated aggregated food composition database (named the item-composition database were developed, based on the self-reported weekly dietary records of 1918 adults (18–79 years-old in the French Individual and National Dietary Survey 2 (INCA2, and the French CIQUAL 2013 food-composition database of all the foods (n = 1342 foods declared as consumed in the population. Reference intakes of foods (“REF_FOOD” and nutrients (“REF_NUT” were calculated for each adult using the food-composition database and the amounts of foods self-reported in his/her dietary record. Then, answers to the FFQ were simulated for each adult based on his/her self-reported dietary record. “FFQ_FOOD” and “FFQ_NUT” intakes were estimated using the simulated answers and the item-composition database. Measurement errors (in %, spearman correlations and cross-classification were used to compare “REF_FOOD” with “FFQ_FOOD” and “REF_NUT” with “FFQ_NUT”.ResultsCompared to “REF_NUT,” “FFQ_NUT” total quantity and total energy intake were underestimated on average by 198 g/day and 666 kJ/day, respectively. “FFQ_FOOD” intakes were well estimated for starches, underestimated for most of the subgroups, and

  3. Item response theory analysis of the life orientation test-revised: age and gender differential item functioning analyses.

    Science.gov (United States)

    Steca, Patrizia; Monzani, Dario; Greco, Andrea; Chiesi, Francesca; Primi, Caterina

    2015-06-01

    This study is aimed at testing the measurement properties of the Life Orientation Test-Revised (LOT-R) for the assessment of dispositional optimism by employing item response theory (IRT) analyses. The LOT-R was administered to a large sample of 2,862 Italian adults. First, confirmatory factor analyses demonstrated the theoretical conceptualization of the construct measured by the LOT-R as a single bipolar dimension. Subsequently, IRT analyses for polytomous, ordered response category data were applied to investigate the items' properties. The equivalence of the items across gender and age was assessed by analyzing differential item functioning. Discrimination and severity parameters indicated that all items were able to distinguish people with different levels of optimism and adequately covered the spectrum of the latent trait. Additionally, the LOT-R appears to be gender invariant and, with minor exceptions, age invariant. Results provided evidence that the LOT-R is a reliable and valid measure of dispositional optimism. © The Author(s) 2014.

  4. Causal role for inverse reasoning on obsessive-compulsive symptoms: Preliminary evidence from a cognitive bias modification for interpretation bias study.

    Science.gov (United States)

    Wong, Shiu F; Grisham, Jessica R

    2017-12-01

    The inference-based approach (IBA) is a cognitive account of the genesis and maintenance of obsessive-compulsive disorder (OCD). According to the IBA, individuals with OCD are prone to using inverse reasoning, in which hypothetical causes form the basis of conclusions about reality. Several studies have provided preliminary support for an association between features of the IBA and OCD symptoms. However, there are currently no studies that have investigated the proposed causal relationship of inverse reasoning in OCD. In a non-clinical sample (N = 187), we used an interpretive cognitive bias procedure to train a bias towards using inverse reasoning (n = 64), healthy sensory-based reasoning (n = 65), or a control condition (n = 58). Participants were randomly allocated to these training conditions. This manipulation allowed us to assess whether, consistent with the IBA, inverse reasoning training increased compulsive-like behaviours and self-reported OCD symptoms. Results indicated that compared to a control condition, participants trained in inverse reasoning reported more OCD symptoms and were more avoidant of potentially contaminated objects. Moreover, change in inverse reasoning bias was a small but significant mediator of the relationship between training condition and behavioural avoidance. Conversely, training in a healthy (non-inverse) reasoning style did not have any effect on symptoms or behaviour relative to the control condition. As this study was conducted in a non-clinical sample, we were unable to generalise our findings to a clinical population. Findings generally support the IBA model by providing preliminary evidence of a causal role for inverse reasoning in OCD. Copyright © 2017 Elsevier Ltd. All rights reserved.

  5. Effect of Differential Item Functioning on Test Equating

    Science.gov (United States)

    Kabasakal, Kübra Atalay; Kelecioglu, Hülya

    2015-01-01

    This study examines the effect of differential item functioning (DIF) items on test equating through multilevel item response models (MIRMs) and traditional IRMs. The performances of three different equating models were investigated under 24 different simulation conditions, and the variables whose effects were examined included sample size, test…

  6. Breast density quantification using magnetic resonance imaging (MRI) with bias field correction: a postmortem study.

    Science.gov (United States)

    Ding, Huanjun; Johnson, Travis; Lin, Muqing; Le, Huy Q; Ducote, Justin L; Su, Min-Ying; Molloi, Sabee

    2013-12-01

    Quantification of breast density based on three-dimensional breast MRI may provide useful information for the early detection of breast cancer. However, the field inhomogeneity can severely challenge the computerized image segmentation process. In this work, the effect of the bias field in breast density quantification has been investigated with a postmortem study. T1-weighted images of 20 pairs of postmortem breasts were acquired on a 1.5 T breast MRI scanner. Two computer-assisted algorithms were used to quantify the volumetric breast density. First, standard fuzzy c-means (FCM) clustering was used on raw images with the bias field present. Then, the coherent local intensity clustering (CLIC) method estimated and corrected the bias field during the iterative tissue segmentation process. Finally, FCM clustering was performed on the bias-field-corrected images produced by CLIC method. The left-right correlation for breasts in the same pair was studied for both segmentation algorithms to evaluate the precision of the tissue classification. Finally, the breast densities measured with the three methods were compared to the gold standard tissue compositions obtained from chemical analysis. The linear correlation coefficient, Pearson's r, was used to evaluate the two image segmentation algorithms and the effect of bias field. The CLIC method successfully corrected the intensity inhomogeneity induced by the bias field. In left-right comparisons, the CLIC method significantly improved the slope and the correlation coefficient of the linear fitting for the glandular volume estimation. The left-right breast density correlation was also increased from 0.93 to 0.98. When compared with the percent fibroglandular volume (%FGV) from chemical analysis, results after bias field correction from both the CLIC the FCM algorithms showed improved linear correlation. As a result, the Pearson's r increased from 0.86 to 0.92 with the bias field correction. The investigated CLIC method

  7. Breast density quantification using magnetic resonance imaging (MRI) with bias field correction: A postmortem study

    International Nuclear Information System (INIS)

    Ding, Huanjun; Johnson, Travis; Lin, Muqing; Le, Huy Q.; Ducote, Justin L.; Su, Min-Ying; Molloi, Sabee

    2013-01-01

    Purpose: Quantification of breast density based on three-dimensional breast MRI may provide useful information for the early detection of breast cancer. However, the field inhomogeneity can severely challenge the computerized image segmentation process. In this work, the effect of the bias field in breast density quantification has been investigated with a postmortem study. Methods: T1-weighted images of 20 pairs of postmortem breasts were acquired on a 1.5 T breast MRI scanner. Two computer-assisted algorithms were used to quantify the volumetric breast density. First, standard fuzzy c-means (FCM) clustering was used on raw images with the bias field present. Then, the coherent local intensity clustering (CLIC) method estimated and corrected the bias field during the iterative tissue segmentation process. Finally, FCM clustering was performed on the bias-field-corrected images produced by CLIC method. The left–right correlation for breasts in the same pair was studied for both segmentation algorithms to evaluate the precision of the tissue classification. Finally, the breast densities measured with the three methods were compared to the gold standard tissue compositions obtained from chemical analysis. The linear correlation coefficient, Pearson'sr, was used to evaluate the two image segmentation algorithms and the effect of bias field. Results: The CLIC method successfully corrected the intensity inhomogeneity induced by the bias field. In left–right comparisons, the CLIC method significantly improved the slope and the correlation coefficient of the linear fitting for the glandular volume estimation. The left–right breast density correlation was also increased from 0.93 to 0.98. When compared with the percent fibroglandular volume (%FGV) from chemical analysis, results after bias field correction from both the CLIC the FCM algorithms showed improved linear correlation. As a result, the Pearson'sr increased from 0.86 to 0.92 with the bias field correction

  8. Application of Probabilistic Multiple-Bias Analyses to a Cohort- and a Case-Control Study on the Association between Pandemrix™ and Narcolepsy.

    Directory of Open Access Journals (Sweden)

    Kaatje Bollaerts

    Full Text Available An increase in narcolepsy cases was observed in Finland and Sweden towards the end of the 2009 H1N1 influenza pandemic. Preliminary observational studies suggested a temporal link with the pandemic influenza vaccine Pandemrix™, leading to a number of additional studies across Europe. Given the public health urgency, these studies used readily available retrospective data from various sources. The potential for bias in such settings was generally acknowledged. Although generally advocated by key opinion leaders and international health authorities, no systematic quantitative assessment of the potential joint impact of biases was undertaken in any of these studies.We applied bias-level multiple-bias analyses to two of the published narcolepsy studies: a pediatric cohort study from Finland and a case-control study from France. In particular, we developed Monte Carlo simulation models to evaluate a potential cascade of biases, including confounding by age, by indication and by natural H1N1 infection, selection bias, disease- and exposure misclassification. All bias parameters were evidence-based to the extent possible.Given the assumptions used for confounding, selection bias and misclassification, the Finnish rate ratio of 13.78 (95% CI: 5.72-28.11 reduced to a median value of 6.06 (2.5th- 97.5th percentile: 2.49-15.1 and the French odds ratio of 5.43 (95% CI: 2.6-10.08 to 1.85 (2.5th-97.5th percentile: 0.85-4.08.We illustrate multiple-bias analyses using two studies on the Pandemrix™-narcolepsy association and advocate their use to better understand the robustness of study findings. Based on our multiple-bias models, the observed Pandemrix™-narcolepsy association consistently persists in the Finnish study. For the French study, the results of our multiple-bias models were inconclusive.

  9. Science Library of Test Items. Volume Twenty-One. A Collection of Multiple Choice Test Items Relating Mainly to Physics, 2.

    Science.gov (United States)

    New South Wales Dept. of Education, Sydney (Australia).

    As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…

  10. A study of the face validity of the 40 item version of the Defense Style Questionnaire (DSQ-40).

    Science.gov (United States)

    Chabrol, Henri; Rousseau, Amélie; Rodgers, Rachel; Callahan, Stacey; Pirlot, Gérard; Sztulman, Henri

    2005-11-01

    There are few studies examining the face validity of the 40-item version of the Defense Style Questionnaire (DSQ-40). Moreover, the existing studies have provided conflicting results. The present study provides an in-depth examination of the face validity of the DSQ-40. Eight clinicians independently attributed each item of the DSQ-40 to a defense mechanism. The defense mechanisms listed in the DSM-IV Defensive Functioning Scale and their definitions were provided as a guide, along with the definition of those defense mechanisms investigated by the DSQ that are not included. It was further specified that the raters could attribute the items to defense mechanisms other than those listed or coping mechanisms. Twelve items out of 40 (30%) were attributed to the defense mechanisms they were supposed to investigate by fewer than four out of the eight raters. This result suggests that a substantial part of the DSQ-40 is lacking in face validity.

  11. All in its proper time: monitoring the emergence of a memory bias for novel, arousing-negative words in individuals with high and low trait anxiety.

    Science.gov (United States)

    Eden, Annuschka Salima; Zwitserlood, Pienie; Keuper, Katharina; Junghöfer, Markus; Laeger, Inga; Zwanzger, Peter; Dobel, Christian

    2014-01-01

    The well-established memory bias for arousing-negative stimuli seems to be enhanced in high trait-anxious persons and persons suffering from anxiety disorders. We monitored the emergence and development of such a bias during and after learning, in high and low trait anxious participants. A word-learning paradigm was applied, consisting of spoken pseudowords paired either with arousing-negative or neutral pictures. Learning performance during training evidenced a short-lived advantage for arousing-negative associated words, which was not present at the end of training. Cued recall and valence ratings revealed a memory bias for pseudowords that had been paired with arousing-negative pictures, immediately after learning and two weeks later. This held even for items that were not explicitly remembered. High anxious individuals evidenced a stronger memory bias in the cued-recall test, and their ratings were also more negative overall compared to low anxious persons. Both effects were evident, even when explicit recall was controlled for. Regarding the memory bias in anxiety prone persons, explicit memory seems to play a more crucial role than implicit memory. The study stresses the need for several time points of bias measurement during the course of learning and retrieval, as well as the employment of different measures for learning success.

  12. All in its proper time: monitoring the emergence of a memory bias for novel, arousing-negative words in individuals with high and low trait anxiety.

    Directory of Open Access Journals (Sweden)

    Annuschka Salima Eden

    Full Text Available The well-established memory bias for arousing-negative stimuli seems to be enhanced in high trait-anxious persons and persons suffering from anxiety disorders. We monitored the emergence and development of such a bias during and after learning, in high and low trait anxious participants. A word-learning paradigm was applied, consisting of spoken pseudowords paired either with arousing-negative or neutral pictures. Learning performance during training evidenced a short-lived advantage for arousing-negative associated words, which was not present at the end of training. Cued recall and valence ratings revealed a memory bias for pseudowords that had been paired with arousing-negative pictures, immediately after learning and two weeks later. This held even for items that were not explicitly remembered. High anxious individuals evidenced a stronger memory bias in the cued-recall test, and their ratings were also more negative overall compared to low anxious persons. Both effects were evident, even when explicit recall was controlled for. Regarding the memory bias in anxiety prone persons, explicit memory seems to play a more crucial role than implicit memory. The study stresses the need for several time points of bias measurement during the course of learning and retrieval, as well as the employment of different measures for learning success.

  13. Preferred Reporting Items for a Systematic Review and Meta-analysis of Diagnostic Test Accuracy Studies: The PRISMA-DTA Statement.

    Science.gov (United States)

    McInnes, Matthew D F; Moher, David; Thombs, Brett D; McGrath, Trevor A; Bossuyt, Patrick M; Clifford, Tammy; Cohen, Jérémie F; Deeks, Jonathan J; Gatsonis, Constantine; Hooft, Lotty; Hunt, Harriet A; Hyde, Christopher J; Korevaar, Daniël A; Leeflang, Mariska M G; Macaskill, Petra; Reitsma, Johannes B; Rodin, Rachel; Rutjes, Anne W S; Salameh, Jean-Paul; Stevens, Adrienne; Takwoingi, Yemisi; Tonelli, Marcello; Weeks, Laura; Whiting, Penny; Willis, Brian H

    2018-01-23

    Systematic reviews of diagnostic test accuracy synthesize data from primary diagnostic studies that have evaluated the accuracy of 1 or more index tests against a reference standard, provide estimates of test performance, allow comparisons of the accuracy of different tests, and facilitate the identification of sources of variability in test accuracy. To develop the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) diagnostic test accuracy guideline as a stand-alone extension of the PRISMA statement. Modifications to the PRISMA statement reflect the specific requirements for reporting of systematic reviews and meta-analyses of diagnostic test accuracy studies and the abstracts for these reviews. Established standards from the Enhancing the Quality and Transparency of Health Research (EQUATOR) Network were followed for the development of the guideline. The original PRISMA statement was used as a framework on which to modify and add items. A group of 24 multidisciplinary experts used a systematic review of articles on existing reporting guidelines and methods, a 3-round Delphi process, a consensus meeting, pilot testing, and iterative refinement to develop the PRISMA diagnostic test accuracy guideline. The final version of the PRISMA diagnostic test accuracy guideline checklist was approved by the group. The systematic review (produced 64 items) and the Delphi process (provided feedback on 7 proposed items; 1 item was later split into 2 items) identified 71 potentially relevant items for consideration. The Delphi process reduced these to 60 items that were discussed at the consensus meeting. Following the meeting, pilot testing and iterative feedback were used to generate the 27-item PRISMA diagnostic test accuracy checklist. To reflect specific or optimal contemporary systematic review methods for diagnostic test accuracy, 8 of the 27 original PRISMA items were left unchanged, 17 were modified, 2 were added, and 2 were omitted. The 27-item

  14. Instructional Topics in Educational Measurement (ITEMS) Module: Using Automated Processes to Generate Test Items

    Science.gov (United States)

    Gierl, Mark J.; Lai, Hollis

    2013-01-01

    Changes to the design and development of our educational assessments are resulting in the unprecedented demand for a large and continuous supply of content-specific test items. One way to address this growing demand is with automatic item generation (AIG). AIG is the process of using item models to generate test items with the aid of computer…

  15. A 67-Item Stress Resilience item bank showing high content validity was developed in a psychosomatic sample.

    Science.gov (United States)

    Obbarius, Nina; Fischer, Felix; Obbarius, Alexander; Nolte, Sandra; Liegl, Gregor; Rose, Matthias

    2018-04-10

    To develop the first item bank to measure Stress Resilience (SR) in clinical populations. Qualitative item development resulted in an initial pool of 131 items covering a broad theoretical SR concept. These items were tested in n=521 patients at a psychosomatic outpatient clinic. Exploratory and Confirmatory Factor Analysis (CFA), as well as other state-of-the-art item analyses and IRT were used for item evaluation and calibration of the final item bank. Out of the initial item pool of 131 items, we excluded 64 items (54 factor loading .3, 2 non-discriminative Item Response Curves, 4 Differential Item Functioning). The final set of 67 items indicated sufficient model fit in CFA and IRT analyses. Additionally, a 10-item short form with high measurement precision (SE≤.32 in a theta range between -1.8 and +1.5) was derived. Both the SR item bank and the SR short form were highly correlated with an existing static legacy tool (Connor-Davidson Resilience Scale). The final SR item bank and 10-item short form showed good psychometric properties. When further validated, they will be ready to be used within a framework of Computer-Adaptive Tests for a comprehensive assessment of the Stress-Construct. Copyright © 2018. Published by Elsevier Inc.

  16. Examining sex differences in DSM-IV-TR narcissistic personality disorder symptom expression using Item Response Theory (IRT).

    Science.gov (United States)

    Hoertel, Nicolas; Peyre, Hugo; Lavaud, Pierre; Blanco, Carlos; Guerin-Langlois, Christophe; René, Margaux; Schuster, Jean-Pierre; Lemogne, Cédric; Delorme, Richard; Limosin, Frédéric

    2017-12-14

    The limited published literature on the subject suggests that there may be differences in how females and males experience narcissistic personality disorder (NPD) symptoms. The aim of this study was to use methods based on item response theory to examine whether, when equating for levels of NPD symptom severity, there are sex differences in the likelihood of reporting DSM-IV-TR NPD symptoms. We conducted these analyses using a large, nationally representative sample from the USA (n=34,653), the second wave of the National Epidemiologic Survey on Alcohol and Related Conditions (NESARC). There were statistically and clinically significant sex differences for 2 out of the 9 DSM-IV-TR NPD symptoms. We found that males were more likely to endorse the item 'lack of empathy' at lower levels of narcissistic personality disorder severity than females. The item 'being envious' was a better indicator of NPD severity in males than in females. There were no clinically significant sex differences on the remaining NPD symptoms. Overall, our findings indicate substantial sex differences in narcissistic personality disorder symptom expression. Although our results may reflect sex-bias in diagnostic criteria, they are consistent with recent views suggesting that narcissistic personality disorder may be underpinned by shared and sex-specific mechanisms. Copyright © 2017 Elsevier B.V. All rights reserved.

  17. Voluntary vs. compulsory student evaluation of clerkships: effect on validity and potential bias.

    Science.gov (United States)

    Aoun Bahous, Sola; Salameh, Pascale; Salloum, Angelique; Salameh, Wael; Park, Yoon Soo; Tekian, Ara

    2018-01-05

    Students evaluations of their learning experiences can provide a useful source of information about clerkship effectiveness in undergraduate medical education. However, low response rates in clerkship evaluation surveys remain an important limitation. This study examined the impact of increasing response rates using a compulsory approach on validity evidence. Data included 192 responses obtained voluntarily from 49 third-year students in 2014-2015, and 171 responses obtained compulsorily from 49 students in the first six months of the consecutive year at one medical school in Lebanon. Evidence supporting internal structure and response process validity was compared between the two administration modalities. The authors also tested for potential bias introduced by the use of the compulsory approach by examining students' responses to a sham item that was added to the last survey administration. Response rates increased from 56% in the voluntary group to 100% in the compulsory group (P two consecutive years. Testing for non-response bias in the voluntary group showed that females were more frequent responders in two clerkships. Testing for authority-induced bias revealed that students might complete the evaluation randomly without attention to content. While increasing response rates is often a policy requirement aimed to improve the credibility of ratings, using authority to enforce responses may not increase reliability and can raise concerns over the meaningfulness of the evaluation. Administrators are urged to consider not only response rates, but also representativeness and quality of responses in administering evaluation surveys.

  18. Affective Biases in Humans and Animals.

    Science.gov (United States)

    Robinson, E S J; Roiser, J P

    Depression is one of the most common but poorly understood psychiatric conditions. Although drug treatments and psychological therapies are effective in some patients, many do not achieve full remission and some patients receive no apparent benefit. Developing new improved treatments requires a better understanding of the aetiology of symptoms and evaluation of novel therapeutic targets in pre-clinical studies. Recent developments in our understanding of the basic cognitive processes that may contribute to the development of depression and its treatment offer new opportunities for both clinical and pre-clinical research. This chapter discusses the clinical evidence supporting a cognitive neuropsychological model of depression and antidepressant efficacy, and how this information may be usefully translated to pre-clinical investigation. Studies using neuropsychological tests in depressed patients and at risk populations have revealed basic negative emotional biases and disrupted reward and punishment processing, which may also impact on non-affective cognition. These affective biases are sensitive to antidepressant treatments with early onset effects observed, suggesting an important role in recovery. This clinical work into affective biases has also facilitated back-translation to animals and the development of assays to study affective biases in rodents. These animal studies suggest that, similar to humans, rodents in putative negative affective states exhibit negative affective biases on decision-making and memory tasks. Antidepressant treatments also induce positive biases in these rodent tasks, supporting the translational validity of this approach. Although still in the early stages of development and validation, affective biases in depression have the potential to offer new insights into the clinical condition, as well as facilitating the development of more translational approaches for pre-clinical studies.

  19. Cross-cultural and sex differences in the Emotional Skills and Competence Questionnaire scales: Challenges of differential item functioning analyses

    Directory of Open Access Journals (Sweden)

    Bo Molander

    2009-11-01

    Full Text Available University students in Croatia, Slovenia, and Sweden (N = 1129 were examined by means of the Emotional Skills and Competence Questionnaire (Takšić, 1998. Results showed a significant effect for the sex factor only on the total-score scale, women scoring higher than men, but significant effects were obtained for country, as well as for sex, on the Express and Label (EL and Perceive and Understand (PU subscales. Sweden showed higher scores than Croatia and Slovenia on the EL scale, and Slovenia showed higher scores than Croatia and Sweden on the PU scale. In subsequent analyses of differential item functioning (DIF, comparisons were carried out for pairs of countries. The analyses revealed that a large proportion of the items in the total-score scale were potentially biased, most so for the Croatian-Swedish comparison, less for the Slovenian-Swedish comparison, and least for the Croatian-Slovenian comparison. These findings give doubts about the validity of mean score differences in comparisons of countries. However, DIF analyses of sex differences within each country show very few DIF items, indicating that the ESCQ instrument works well within each cultural/linguistic setting. Possible explanations of the findings are discussed, and improvements for future studies are suggested.

  20. Galaxy bias and primordial non-Gaussianity

    Energy Technology Data Exchange (ETDEWEB)

    Assassi, Valentin; Baumann, Daniel [DAMTP, Cambridge University, Wilberforce Road, Cambridge CB3 0WA (United Kingdom); Schmidt, Fabian, E-mail: assassi@ias.edu, E-mail: D.D.Baumann@uva.nl, E-mail: fabians@MPA-Garching.MPG.DE [Max-Planck-Institut für Astrophysik, Karl-Schwarzschild-Str. 1, 85748 Garching (Germany)

    2015-12-01

    We present a systematic study of galaxy biasing in the presence of primordial non-Gaussianity. For a large class of non-Gaussian initial conditions, we define a general bias expansion and prove that it is closed under renormalization, thereby showing that the basis of operators in the expansion is complete. We then study the effects of primordial non-Gaussianity on the statistics of galaxies. We show that the equivalence principle enforces a relation between the scale-dependent bias in the galaxy power spectrum and that in the dipolar part of the bispectrum. This provides a powerful consistency check to confirm the primordial origin of any observed scale-dependent bias. Finally, we also discuss the imprints of anisotropic non-Gaussianity as motivated by recent studies of higher-spin fields during inflation.

  1. Galaxy bias and primordial non-Gaussianity

    International Nuclear Information System (INIS)

    Assassi, Valentin; Baumann, Daniel; Schmidt, Fabian

    2015-01-01

    We present a systematic study of galaxy biasing in the presence of primordial non-Gaussianity. For a large class of non-Gaussian initial conditions, we define a general bias expansion and prove that it is closed under renormalization, thereby showing that the basis of operators in the expansion is complete. We then study the effects of primordial non-Gaussianity on the statistics of galaxies. We show that the equivalence principle enforces a relation between the scale-dependent bias in the galaxy power spectrum and that in the dipolar part of the bispectrum. This provides a powerful consistency check to confirm the primordial origin of any observed scale-dependent bias. Finally, we also discuss the imprints of anisotropic non-Gaussianity as motivated by recent studies of higher-spin fields during inflation

  2. Relationship between Future Time Orientation and Item Nonresponse on Subjective Probability Questions: A Cross-Cultural Analysis.

    Science.gov (United States)

    Lee, Sunghee; Liu, Mingnan; Hu, Mengyao

    2017-06-01

    Time orientation is an unconscious yet fundamental cognitive process that provides a framework for organizing personal experiences in temporal categories of past, present and future, reflecting the relative emphasis given to these categories. Culture lies central to individuals' time orientation, leading to cultural variations in time orientation. For example, people from future-oriented cultures tend to emphasize the future and store information relevant for the future more than those from present- or past-oriented cultures. For survey questions that ask respondents to report expected probabilities of future events, this may translate into culture-specific question difficulties, manifested through systematically varying "I don't know" item nonresponse rates. This study drew on the time orientation theory and examined culture-specific nonresponse patterns on subjective probability questions using methodologically comparable population-based surveys from multiple countries. The results supported our hypothesis. Item nonresponse rates on these questions varied significantly in the way that future-orientation at the group as well as individual level was associated with lower nonresponse rates. This pattern did not apply to non-probability questions. Our study also suggested potential nonresponse bias. Examining culture-specific constructs, such as time orientation, as a framework for measurement mechanisms may contribute to improving cross-cultural research.

  3. Negative effects of item repetition on source memory

    OpenAIRE

    Kim, Kyungmi; Yi, Do-Joon; Raye, Carol L.; Johnson, Marcia K.

    2012-01-01

    In the present study, we explored how item repetition affects source memory for new item–feature associations (picture–location or picture–color). We presented line drawings varying numbers of times in Phase 1. In Phase 2, each drawing was presented once with a critical new feature. In Phase 3, we tested memory for the new source feature of each item from Phase 2. Experiments 1 and 2 demonstrated and replicated the negative effects of item repetition on incidental source memory. Prior item re...

  4. Computerized Adaptive Test (CAT) Applications and Item Response Theory Models for Polytomous Items

    Science.gov (United States)

    Aybek, Eren Can; Demirtasli, R. Nukhet

    2017-01-01

    This article aims to provide a theoretical framework for computerized adaptive tests (CAT) and item response theory models for polytomous items. Besides that, it aims to introduce the simulation and live CAT software to the related researchers. Computerized adaptive test algorithm, assumptions of item response theory models, nominal response…

  5. Language-related differential item functioning between English and German PROMIS Depression items is negligible.

    Science.gov (United States)

    Fischer, H Felix; Wahl, Inka; Nolte, Sandra; Liegl, Gregor; Brähler, Elmar; Löwe, Bernd; Rose, Matthias

    2017-12-01

    To investigate differential item functioning (DIF) of PROMIS Depression items between US and German samples we compared data from the US PROMIS calibration sample (n = 780), a German general population survey (n = 2,500) and a German clinical sample (n = 621). DIF was assessed in an ordinal logistic regression framework, with 0.02 as criterion for R 2 -change and 0.096 for Raju's non-compensatory DIF. Item parameters were initially fixed to the PROMIS Depression metric; we used plausible values to account for uncertainty in depression estimates. Only four items showed DIF. Accounting for DIF led to negligible effects for the full item bank as well as a post hoc simulated computer-adaptive test (German general population sample was considerably lower compared to the US reference value of 50. Overall, we found little evidence for language DIF between US and German samples, which could be addressed by either replacing the DIF items by items not showing DIF or by scoring the short form in German samples with the corrected item parameters reported. Copyright © 2016 John Wiley & Sons, Ltd.

  6. The relationship between study sponsorship, risks of bias, and research outcomes in atrazine exposure studies conducted in non-human animals: Systematic review and meta-analysis.

    Science.gov (United States)

    Bero, L; Anglemyer, A; Vesterinen, H; Krauth, D

    2016-01-01

    A critical component of systematic review methodology is the assessment of the risks of bias of studies that are included in the review. There is controversy about whether funding source should be included in a risk of bias assessment of animal toxicology studies. To determine whether industry research sponsorship is associated with methodological biases, the results, or conclusions of animal studies examining the effect of exposure to atrazine on reproductive or developmental outcomes. We searched multiple electronic databases and the reference lists of relevant articles to identify original research studies examining the effect of any dose of atrazine exposure at any life stage on reproduction or development in non-human animals. We compared methodological risks of bias, the conclusions of the studies, the statistical significance of the findings, and the magnitude of effect estimates between industry sponsored and non-industry sponsored studies. Fifty-one studies met the inclusion criteria. There were no differences in methodological risks of bias in industry versus non-industry sponsored studies. 39 studies tested environmentally relevant concentrations of atrazine (11 industry sponsored, 24 non-industry sponsored, 4 with no funding disclosures). Non-industry sponsored studies (12/24, 50.0%) were more likely to conclude that atrazine was harmful compared to industry sponsored studies (2/11, 18.1%) (p value=0.07). A higher proportion of non-industry sponsored studies reported statistically significant harmful effects (8/24, 33.3%) compared to industry-sponsored studies (1/11; 9.1%) (p value=0.13). The association of industry sponsorship with decreased effect sizes for harm outcomes was inconclusive. Our findings support the inclusion of research sponsorship as a risk of bias criterion in tools used to assess risks of bias in animal studies for systematic reviews. The reporting of other empirically based risk of bias criteria for animal studies, such as blinded

  7. Item-level factor analysis of the Self-Efficacy Scale.

    Science.gov (United States)

    Bunketorp Käll, Lina

    2014-03-01

    This study explores the internal structure of the Self-Efficacy Scale (SES) using item response analysis. The SES was previously translated into Swedish and modified to encompass all types of pain, not exclusively back pain. Data on perceived self-efficacy in 47 patients with subacute whiplash-associated disorders were derived from a previously conducted randomized-controlled trial. The item-level factor analysis was carried out using a six-step procedure. To further study the item inter-relationships and to determine the underlying structure empirically, the 20 items of the SES were also subjected to principal component analysis with varimax rotation. The analyses showed two underlying factors, named 'social activities' and 'physical activities', with seven items loading on each factor. The remaining six items of the SES appeared to measure somewhat different constructs and need to be analysed further.

  8. Selecting Items for Criterion-Referenced Tests.

    Science.gov (United States)

    Mellenbergh, Gideon J.; van der Linden, Wim J.

    1982-01-01

    Three item selection methods for criterion-referenced tests are examined: the classical theory of item difficulty and item-test correlation; the latent trait theory of item characteristic curves; and a decision-theoretic approach for optimal item selection. Item contribution to the standardized expected utility of mastery testing is discussed. (CM)

  9. Asymptotic Standard Errors for Item Response Theory True Score Equating of Polytomous Items

    Science.gov (United States)

    Cher Wong, Cheow

    2015-01-01

    Building on previous works by Lord and Ogasawara for dichotomous items, this article proposes an approach to derive the asymptotic standard errors of item response theory true score equating involving polytomous items, for equivalent and nonequivalent groups of examinees. This analytical approach could be used in place of empirical methods like…

  10. Toward a synthesis of cognitive biases: how noisy information processing can bias human decision making.

    Science.gov (United States)

    Hilbert, Martin

    2012-03-01

    A single coherent framework is proposed to synthesize long-standing research on 8 seemingly unrelated cognitive decision-making biases. During the past 6 decades, hundreds of empirical studies have resulted in a variety of rules of thumb that specify how humans systematically deviate from what is normatively expected from their decisions. Several complementary generative mechanisms have been proposed to explain those cognitive biases. Here it is suggested that (at least) 8 of these empirically detected decision-making biases can be produced by simply assuming noisy deviations in the memory-based information processes that convert objective evidence (observations) into subjective estimates (decisions). An integrative framework is presented to show how similar noise-based mechanisms can lead to conservatism, the Bayesian likelihood bias, illusory correlations, biased self-other placement, subadditivity, exaggerated expectation, the confidence bias, and the hard-easy effect. Analytical tools from information theory are used to explore the nature and limitations that characterize such information processes for binary and multiary decision-making exercises. The ensuing synthesis offers formal mathematical definitions of the biases and their underlying generative mechanism, which permits a consolidated analysis of how they are related. This synthesis contributes to the larger goal of creating a coherent picture that explains the relations among the myriad of seemingly unrelated biases and their potential psychological generative mechanisms. Limitations and research questions are discussed.

  11. MIMIC Methods for Assessing Differential Item Functioning in Polytomous Items

    Science.gov (United States)

    Wang, Wen-Chung; Shih, Ching-Lin

    2010-01-01

    Three multiple indicators-multiple causes (MIMIC) methods, namely, the standard MIMIC method (M-ST), the MIMIC method with scale purification (M-SP), and the MIMIC method with a pure anchor (M-PA), were developed to assess differential item functioning (DIF) in polytomous items. In a series of simulations, it appeared that all three methods…

  12. Can Item Keyword Feedback Help Remediate Knowledge Gaps?

    Science.gov (United States)

    Feinberg, Richard A; Clauser, Amanda L

    2016-10-01

    In graduate medical education, assessment results can effectively guide professional development when both assessment and feedback support a formative model. When individuals cannot directly access the test questions and responses, a way of using assessment results formatively is to provide item keyword feedback. The purpose of the following study was to investigate whether exposure to item keyword feedback aids in learner remediation. Participants included 319 trainees who completed a medical subspecialty in-training examination (ITE) in 2012 as first-year fellows, and then 1 year later in 2013 as second-year fellows. Performance on 2013 ITE items in which keywords were, or were not, exposed as part of the 2012 ITE score feedback was compared across groups based on the amount of time studying (preparation). For the same items common to both 2012 and 2013 ITEs, response patterns were analyzed to investigate changes in answer selection. Test takers who indicated greater amounts of preparation on the 2013 ITE did not perform better on the items in which keywords were exposed compared to those who were not exposed. The response pattern analysis substantiated overall growth in performance from the 2012 ITE. For items with incorrect responses on both attempts, examinees selected the same option 58% of the time. Results from the current study were unsuccessful in supporting the use of item keywords in aiding remediation. Unfortunately, the results did provide evidence of examinees retaining misinformation.

  13. Mixture Item Response Theory-MIMIC Model: Simultaneous Estimation of Differential Item Functioning for Manifest Groups and Latent Classes

    Science.gov (United States)

    Bilir, Mustafa Kuzey

    2009-01-01

    This study uses a new psychometric model (mixture item response theory-MIMIC model) that simultaneously estimates differential item functioning (DIF) across manifest groups and latent classes. Current DIF detection methods investigate DIF from only one side, either across manifest groups (e.g., gender, ethnicity, etc.), or across latent classes…

  14. Using Differential Item Functioning Procedures to Explore Sources of Item Difficulty and Group Performance Characteristics.

    Science.gov (United States)

    Scheuneman, Janice Dowd; Gerritz, Kalle

    1990-01-01

    Differential item functioning (DIF) methodology for revealing sources of item difficulty and performance characteristics of different groups was explored. A total of 150 Scholastic Aptitude Test items and 132 Graduate Record Examination general test items were analyzed. DIF was evaluated for males and females and Blacks and Whites. (SLD)

  15. Implicit and Explicit Weight Bias in a National Sample of 4732 Medical Students: The Medical Student CHANGES Study

    OpenAIRE

    Phelan, Sean M.; Dovidio, John F.; Puhl, Rebecca M.; Burgess, Diana J.; Nelson, David B.; Yeazel, Mark W.; Hardeman, Rachel; Perry, Sylvia; van Ryn, Michelle

    2014-01-01

    Objective To examine the magnitude of explicit and implicit weight biases compared to biases against other groups; and identify student factors predicting bias in a large national sample of medical students. Design and Methods A web-based survey was completed by 4732 1st year medical students from 49 medical schools as part of a longitudinal study of medical education. The survey included a validated measure of implicit weight bias, the implicit association test, and 2 measures of explicit bi...

  16. Assessing risk of bias in studies that evaluate health care interventions

    DEFF Research Database (Denmark)

    Page, Matthew J.; Boutron, Isabelle; Hansen, Camilla

    2018-01-01

    Methods to assess risk of bias in a way that is reliable, reproducible and transparent to readers, have evolved over time. Viswanathan et al. recently provided updated recommendations for assessing risk of bias in systematic reviews of health care interventions. We comment on their recommendations...

  17. Episodic memory for spatial context biases spatial attention.

    Science.gov (United States)

    Ciaramelli, Elisa; Lin, Olivia; Moscovitch, Morris

    2009-01-01

    The study explores the bottom-up attentional consequences of episodic memory retrieval. Individuals studied words (Experiment 1) or pictures (Experiment 2) presented on the left or on the right of the screen. They then viewed studied and new stimuli in the centre of the screen. One-second after the appearance of each stimulus, participants had to respond to a dot presented on the left or on the right of the screen. The dot could follow a stimulus that had been presented, during the study phase, on the same side as the dot (congruent condition), a stimulus that had been presented on the opposite side (incongruent condition), or a new stimulus (neutral condition). Subjects were faster to respond to the dot in the congruent compared to the incongruent condition, with an overall right visual field advantage in Experiment 1. The memory-driven facilitation effect correlated with subjects' re-experiencing of the encoding context (R responses; Experiment 1), but not with their explicit memory for the side of items' presentation (source memory; Experiment 2). The results indicate that memory contents are attended automatically and can bias the deployment of attention. The degree to which memory and attention interact appears related to subjective but not objective indicators of memory strength.

  18. Item Response Data Analysis Using Stata Item Response Theory Package

    Science.gov (United States)

    Yang, Ji Seung; Zheng, Xiaying

    2018-01-01

    The purpose of this article is to introduce and review the capability and performance of the Stata item response theory (IRT) package that is available from Stata v.14, 2015. Using a simulated data set and a publicly available item response data set extracted from Programme of International Student Assessment, we review the IRT package from…

  19. Partial verification bias and incorporation bias affected accuracy estimates of diagnostic studies for biomarkers that were part of an existing composite gold standard.

    Science.gov (United States)

    Karch, Annika; Koch, Armin; Zapf, Antonia; Zerr, Inga; Karch, André

    2016-10-01

    To investigate how choice of gold standard biases estimates of sensitivity and specificity in studies reassessing the diagnostic accuracy of biomarkers that are already part of a lifetime composite gold standard (CGS). We performed a simulation study based on the real-life example of the biomarker "protein 14-3-3" used for diagnosing Creutzfeldt-Jakob disease. Three different types of gold standard were compared: perfect gold standard "autopsy" (available in a small fraction only; prone to partial verification bias), lifetime CGS (including the biomarker under investigation; prone to incorporation bias), and "best available" gold standard (autopsy if available, otherwise CGS). Sensitivity was unbiased when comparing 14-3-3 with autopsy but overestimated when using CGS or "best available" gold standard. Specificity of 14-3-3 was underestimated in scenarios comparing 14-3-3 with autopsy (up to 24%). In contrast, overestimation (up to 20%) was observed for specificity compared with CGS; this could be reduced to 0-10% when using the "best available" gold standard. Choice of gold standard affects considerably estimates of diagnostic accuracy. Using the "best available" gold standard (autopsy where available, otherwise CGS) leads to valid estimates of specificity, whereas sensitivity is estimated best when tested against autopsy alone. Copyright © 2016 Elsevier Inc. All rights reserved.

  20. Assessing Risk of Bias in Randomized Controlled Trials for Autism Spectrum Disorder

    Directory of Open Access Journals (Sweden)

    Paola Matiko Martins Okuda

    2017-11-01

    Full Text Available AimTo determine construct validity and reliability indicators of the Cochrane risk of bias (RoB tool in the context of randomized clinical trials (RCTs for autism spectrum disorder (ASD.MethodsConfirmatory factor analysis was used to evaluate a unidimensional model consisting of 9 RoB categorical indicators evaluated across 94 RCTs addressing interventions for ASD.ResultsOnly five of the nine original RoB items returned good fit indices and so were retained in the analysis. Only one of this five had very high factor loadings. The remaining four indicators had more measurement error than common variance with the RoB latent factor. Together, the five indicators showed poor reliability (ω = 0.687; 95% CI: 0.613–0.761.ConclusionAlthough the Cochrane model of RoB for ASD exhibited good fit indices, the majorities of the items have more residual variance than common variance and, therefore, did not adequately capture the RoB in ASD intervention trials.

  1. Item Banking with Embedded Standards

    Science.gov (United States)

    MacCann, Robert G.; Stanley, Gordon

    2009-01-01

    An item banking method that does not use Item Response Theory (IRT) is described. This method provides a comparable grading system across schools that would be suitable for low-stakes testing. It uses the Angoff standard-setting method to obtain item ratings that are stored with each item. An example of such a grading system is given, showing how…

  2. A Case Study of Gender Bias at the Postdoctoral Level in Physics ...

    Indian Academy of Sciences (India)

    2008-04-17

    Apr 17, 2008 ... ... data for such studies unfortunately carries the dangers of survey bias. .... internet, we compiled a list of past postdoctoral researchers on the Run II ... from the internet included, amongst other things, vitae and/or current.

  3. Introduction to Unconscious Bias

    Science.gov (United States)

    Schmelz, Joan T.

    2010-05-01

    We all have biases, and we are (for the most part) unaware of them. In general, men and women BOTH unconsciously devalue the contributions of women. This can have a detrimental effect on grant proposals, job applications, and performance reviews. Sociology is way ahead of astronomy in these studies. When evaluating identical application packages, male and female University psychology professors preferred 2:1 to hire "Brian” over "Karen” as an assistant professor. When evaluating a more experienced record (at the point of promotion to tenure), reservations were expressed four times more often when the name was female. This unconscious bias has a repeated negative effect on Karen's career. This talk will introduce the concept of unconscious bias and also give recommendations on how to address it using an example for a faculty search committee. The process of eliminating unconscious bias begins with awareness, then moves to policy and practice, and ends with accountability.

  4. Negativity Bias in Dangerous Drivers.

    Directory of Open Access Journals (Sweden)

    Jing Chai

    Full Text Available The behavioral and cognitive characteristics of dangerous drivers differ significantly from those of safe drivers. However, differences in emotional information processing have seldom been investigated. Previous studies have revealed that drivers with higher anger/anxiety trait scores are more likely to be involved in crashes and that individuals with higher anger traits exhibit stronger negativity biases when processing emotions compared with control groups. However, researchers have not explored the relationship between emotional information processing and driving behavior. In this study, we examined the emotional information processing differences between dangerous drivers and safe drivers. Thirty-eight non-professional drivers were divided into two groups according to the penalty points that they had accrued for traffic violations: 15 drivers with 6 or more points were included in the dangerous driver group, and 23 drivers with 3 or fewer points were included in the safe driver group. The emotional Stroop task was used to measure negativity biases, and both behavioral and electroencephalograph data were recorded. The behavioral results revealed stronger negativity biases in the dangerous drivers than in the safe drivers. The bias score was correlated with self-reported dangerous driving behavior. Drivers with strong negativity biases reported having been involved in mores crashes compared with the less-biased drivers. The event-related potentials (ERPs revealed that the dangerous drivers exhibited reduced P3 components when responding to negative stimuli, suggesting decreased inhibitory control of information that is task-irrelevant but emotionally salient. The influence of negativity bias provides one possible explanation of the effects of individual differences on dangerous driving behavior and traffic crashes.

  5. Effect of Malmquist bias on correlation studies with IRAS data base

    Science.gov (United States)

    Verter, Frances

    1993-01-01

    The relationships between galaxy properties in the sample of Trinchieri et al. (1989) are reexamined with corrections for Malmquist bias. The linear correlations are tested and linear regressions are fit for log-log plots of L(FIR), L(H-alpha), and L(B) as well as ratios of these quantities. The linear correlations for Malmquist bias are corrected using the method of Verter (1988), in which each galaxy observation is weighted by the inverse of its sampling volume. The linear regressions are corrected for Malmquist bias by a new method invented here in which each galaxy observation is weighted by its sampling volume. The results of correlation and regressions among the sample are significantly changed in the anticipated sense that the corrected correlation confidences are lower and the corrected slopes of the linear regressions are lower. The elimination of Malmquist bias eliminates the nonlinear rise in luminosity that has caused some authors to hypothesize additional components of FIR emission.

  6. Three Modeling Applications to Promote Automatic Item Generation for Examinations in Dentistry.

    Science.gov (United States)

    Lai, Hollis; Gierl, Mark J; Byrne, B Ellen; Spielman, Andrew I; Waldschmidt, David M

    2016-03-01

    Test items created for dentistry examinations are often individually written by content experts. This approach to item development is expensive because it requires the time and effort of many content experts but yields relatively few items. The aim of this study was to describe and illustrate how items can be generated using a systematic approach. Automatic item generation (AIG) is an alternative method that allows a small number of content experts to produce large numbers of items by integrating their domain expertise with computer technology. This article describes and illustrates how three modeling approaches to item content-item cloning, cognitive modeling, and image-anchored modeling-can be used to generate large numbers of multiple-choice test items for examinations in dentistry. Test items can be generated by combining the expertise of two content specialists with technology supported by AIG. A total of 5,467 new items were created during this study. From substitution of item content, to modeling appropriate responses based upon a cognitive model of correct responses, to generating items linked to specific graphical findings, AIG has the potential for meeting increasing demands for test items. Further, the methods described in this study can be generalized and applied to many other item types. Future research applications for AIG in dental education are discussed.

  7. Attention bias to emotional information in children as a function of maternal emotional disorders and maternal attention biases.

    Science.gov (United States)

    Waters, Allison M; Forrest, Kylee; Peters, Rosie-Mae; Bradley, Brendan P; Mogg, Karin

    2015-03-01

    Children of parents with emotional disorders have an increased risk for developing anxiety and depressive disorders. Yet the mechanisms that contribute to this increased risk are poorly understood. The present study aimed to examine attention biases in children as a function of maternal lifetime emotional disorders and maternal attention biases. There were 134 participants, including 38 high-risk children, and their mothers who had lifetime emotional disorders; and 29 low-risk children, and their mothers without lifetime emotional disorders. Mothers and children completed a visual probe task with emotional face pairs presented for 500 ms. Attention bias in children did not significantly differ solely as a function of whether or not their mothers had lifetime emotional disorders. However, attention bias in high-risk children was significantly related to their mothers' attention bias. Specifically, children of mothers with lifetime emotional disorders showed a greater negative attention bias if their mothers had a greater tendency to direct attention away from positive information. This study was cross-sectional in nature, and therefore unable to assess long-term predictive effects. Also, just one exposure duration of 500 ms was utilised. Attention bias for negative information is greater in offspring of mothers who have lifetime emotional disorders and a reduced positive bias, which could be a risk marker for the development of emotional disorders in children.

  8. Women Faculty in Higher Education: A Case Study on Gender Bias

    Science.gov (United States)

    Bingham, Teri; Nix, Susan J.

    2010-01-01

    This study examines the perceptions of female faculty members in higher education to ascertain their views regarding gender bias in the workplace. A questionnaire was used to collect data from the participants regarding their beliefs of the value and productivity of their work, possible disparity in treatment based on gender, constraints put on…

  9. Journal bias or author bias?

    Science.gov (United States)

    Harris, Ian

    2016-01-01

    I read with interest the comment by Mark Wilson in the Indian Journal of Medical Ethics regarding bias and conflicts of interest in medical journals. Wilson targets one journal (the New England Journal of Medicine: NEJM) and one particular "scandal" to make his point that journals' decisions on publication are biased by commercial conflicts of interest (CoIs). It is interesting that he chooses the NEJM which, by his own admission, had one of the strictest CoI policies and had published widely on this topic. The feeling is that if the NEJM can be guilty, they can all be guilty.

  10. Negativity bias for sad faces in depression: An event-related potential study.

    Science.gov (United States)

    Dai, Qin; Wei, Juanjuan; Shu, Xiaorui; Feng, Zhengzhi

    2016-12-01

    Negativity bias in depression has been previously confirmed. However, mainly during a valence category task, it remains unclear how happy or unhappy individuals perceive emotional materials. Moreover, cerebral alteration measurements during a valence judgment task is lacking. The present study aimed to explore a valence judgment of a valence rating task, combined with event-related potential (ERP) recording. Healthy controls, individuals with sub-clinical depression, and patients diagnosed with major depressive disorder (MDD) were recruited. Twenty-four subjects in each group completed a valence rating task, during which the ERP amplitudes were recorded. The MDD group had lower valence scores, faster responses, and greater N1 amplitudes for sad faces, whereas individuals with sub-clinical depression had faster responses and greater P1 amplitudes for all faces but lower valence scores and greater P2 amplitudes for happy faces. The findings suggest the tendency toward a negativity bias in valence ratings in patients with depression supported by behavioral and cerebral evidence, which is a latent trait of depression, possibly associated with the vulnerability of depression. The current study offers the first experimental evidence of cognitive and cerebral biomarkers of negativity bias in valence ratings in depression, which confirms Beck's cognitive theory and gives important direction for clinical therapy. Copyright © 2016 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.

  11. Attention, interpretation, and memory biases in subclinical depression: a proof-of-principle test of the combined cognitive biases hypothesis.

    Science.gov (United States)

    Everaert, Jonas; Duyck, Wouter; Koster, Ernst H W

    2014-04-01

    Emotional biases in attention, interpretation, and memory are viewed as important cognitive processes underlying symptoms of depression. To date, there is a limited understanding of the interplay among these processing biases. This study tested the dependence of memory on depression-related biases in attention and interpretation. Subclinically depressed and nondepressed participants completed a computerized version of the scrambled sentences test (measuring interpretation bias) while their eye movements were recorded (measuring attention bias). This task was followed by an incidental free recall test of previously constructed interpretations (measuring memory bias). Path analysis revealed a good fit for the model in which selective orienting of attention was associated with interpretation bias, which in turn was associated with a congruent bias in memory. Also, a good fit was observed for a path model in which biases in the maintenance of attention and interpretation were associated with memory bias. Both path models attained a superior fit compared with path models without the theorized functional relations among processing biases. These findings enhance understanding of how mechanisms of attention and interpretation regulate what is remembered. As such, they offer support for the combined cognitive biases hypothesis or the notion that emotionally biased cognitive processes are not isolated mechanisms but instead influence each other. Implications for theoretical models and emotion regulation across the spectrum of depressive symptoms are discussed.

  12. Effect of cognitive biases on human-robot interaction: a case study of robot's misattribution

    OpenAIRE

    Biswas, Mriganka; Murray, John

    2014-01-01

    This paper presents a model for developing long-term human-robot interactions and social relationships based on the principle of 'human' cognitive biases applied to a robot. The aim of this work is to study how a robot influenced with human ‘misattribution’ helps to build better human-robot interactions than unbiased robots. The results presented in this paper suggest that it is important to know the effect of cognitive biases in human characteristics and interactions in order to better u...

  13. The valuation of environmental goods in Norway: A contingent valuation study with multiple bias testing

    Energy Technology Data Exchange (ETDEWEB)

    Strand, J.; Taraldset, A.

    1991-12-31

    We report on a study of contingent valuation of reduction in air pollution, and of a broader set of six environmental issues, among a population sample in Oslo. We derive an estimate of the extent of upward bias due to ``mental accouting`` in the expressed valuation of the air pollution issue, in two steps: (1) by comparing valuation of air pollution alone, with the same when the other issues at the same time are to be dealt with; and (2) by deriving the implicit valuation of the air pollution issue from the ranking of issues, and total valuation of all six issues. We find that expressed valuation of air pollution reductions are 3-4 times as high as the ``true`` values, and argue that this discrepancy is mainly due to mental account biases. We also test for strategic, starting point, information and interviewer biases, which are all present and, with the exception of the information bias, all in the expected directions. 9 refs., 4 tabs.

  14. Neural correlates of belief-bias reasoning under time pressure: a near-infrared spectroscopy study.

    Science.gov (United States)

    Tsujii, Takeo; Watanabe, Shigeru

    2010-04-15

    The dual-process theory of reasoning explained the belief-bias effect, the tendency for human reasoning to be erroneously biased when logical conclusions are incongruent with belief about the world, by proposing a belief-based fast heuristic system and a logic-based slow analytic system. Although the claims were supported by behavioral findings that the belief-bias effect was enhanced when subjects were not given sufficient time for reasoning, the neural correlates were still unknown. The present study therefore examined the relationship between the time-pressure effect and activity in the inferior frontal cortex (IFC) during belief-bias reasoning using near-infrared spectroscopy (NIRS). Forty-eight subjects performed congruent and incongruent reasoning tasks, involving long-span (20 s) and short-span trials (10 s). Behavioral analysis found that only incongruent reasoning performance was impaired by the time-pressure of short-span trials. NIRS analysis found that the time-pressure decreased right IFC activity during incongruent trials. Correlation analysis showed that subjects with enhanced right IFC activity could perform better in incongruent trials, while subjects for whom the right IFC activity was impaired by the time-pressure could not maintain better reasoning performance. These findings suggest that the right IFC may be responsible for the time-pressure effect in conflicting reasoning processes. When the right IFC activity was impaired in the short-span trials in which subjects were not given sufficient time for reasoning, the subjects may rely on the fast heuristic system, which result in belief-bias responses. We therefore offer the first demonstration of neural correlates of time-pressure effect on the IFC activity in belief-bias reasoning. Copyright 2009 Elsevier Inc. All rights reserved.

  15. Assessing the risk of bias in randomized controlled trials in the field of dentistry indexed in the Lilacs (Literatura Latino-Americana e do Caribe em Ciências da Saúde) database.

    Science.gov (United States)

    Ferreira, Christiane Alves; Loureiro, Carlos Alfredo Salles; Saconato, Humberto; Atallah, Alvaro Nagib

    2011-03-01

    Well-conducted randomized controlled trials (RCTs) represent the highest level of evidence when the research question relates to the effect of therapeutic or preventive interventions. However, the degree of control over bias between RCTs presents great variability between studies. For this reason, with the increasing interest in and production of systematic reviews and meta-analyses, it has been necessary to develop methodology supported by empirical evidence, so as to encourage and enhance the production of valid RCTs with low risk of bias. The aim here was to conduct a methodological analysis within the field of dentistry, regarding the risk of bias in open-access RCTs available in the Lilacs (Literatura Latino-Americana e do Caribe em Ciências da Saúde) database. This was a methodology study conducted at Universidade Federal de São Paulo (Unifesp) that assessed the risk of bias in RCTs, using the following dimensions: allocation sequence generation, allocation concealment, blinding, and data on incomplete outcomes. Out of the 4,503 articles classified, only 10 studies (0.22%) were considered to be true RCTs and, of these, only a single study was classified as presenting low risk of bias. The items that the authors of these RCTs most frequently controlled for were blinding and data on incomplete outcomes. The effective presence of bias seriously weakened the reliability of the results from the dental studies evaluated, such that they would be of little use for clinicians and administrators as support for decision-making processes.

  16. Development of Heuristic Bias Detection in Elementary School

    Science.gov (United States)

    De Neys, Wim; Feremans, Vicky

    2013-01-01

    Although human reasoning is often biased by intuitive heuristics, recent studies have shown that adults and adolescents detect the biased nature of their judgments. The present study focused on the development of this critical bias sensitivity by examining the detection skills of young children in elementary school. Third and 6th graders were…

  17. Neural correlates of dual-task effect on belief-bias syllogistic reasoning: a near-infrared spectroscopy study.

    Science.gov (United States)

    Tsujii, Takeo; Watanabe, Shigeru

    2009-09-01

    Recent dual-process reasoning theories have explained the belief-bias effect, the tendency for human reasoning to be erroneously biased when logical conclusions are incongruent with beliefs about the world, by proposing a belief-based automatic heuristic system and logic-based demanding analytic system. Although these claims are supported by the behavioral finding that high-load secondary tasks enhance the belief-bias effect, the neural correlates of dual-task reasoning remain unknown. The present study therefore examined the relationship between dual-task effect and activity in the inferior frontal cortex (IFC) during belief-bias reasoning by near-infrared spectroscopy (NIRS). Forty-eight subjects participated in this study (MA=23.46 years). They were required to perform congruent and incongruent reasoning trials while responding to high- and low-load secondary tasks. Behavioral analysis showed that the high-load secondary task impaired only incongruent reasoning performance. NIRS analysis found that the high-load secondary task decreased right IFC activity during incongruent trials. Correlation analysis showed that subjects with enhanced right IFC activity could perform better in the incongruent reasoning trials, though subjects for whom right IFC activity was impaired by the secondary task could not maintain better reasoning performance. These findings suggest that the right IFC may be responsible for the dual-task effect in conflicting reasoning processes. When secondary tasks impair right IFC activity, subjects may rely on the automatic heuristic system, which results in belief-bias responses. We therefore offer the first demonstration of neural correlates of dual-task effect on IFC activity in belief-bias reasoning.

  18. Bias due to differential participation in case-control studies and review of available approaches for adjustment.

    Science.gov (United States)

    Aigner, Annette; Grittner, Ulrike; Becher, Heiko

    2018-01-01

    Low response rates in epidemiologic research potentially lead to the recruitment of a non-representative sample of controls in case-control studies. Problems in the unbiased estimation of odds ratios arise when characteristics causing the probability of participation are associated with exposure and outcome. This is a specific setting of selection bias and a realistic hazard in many case-control studies. This paper formally describes the problem and shows its potential extent, reviews existing approaches for bias adjustment applicable under certain conditions, compares and applies them. We focus on two scenarios: a characteristic C causing differential participation of controls is linked to the outcome through its association with risk factor E (scenario I), and C is additionally a genuine risk factor itself (scenario II). We further assume external data sources are available which provide an unbiased estimate of C in the underlying population. Given these scenarios, we (i) review available approaches and their performance in the setting of bias due to differential participation; (ii) describe two existing approaches to correct for the bias in both scenarios in more detail; (iii) present the magnitude of the resulting bias by simulation if the selection of a non-representative sample is ignored; and (iv) demonstrate the approaches' application via data from a case-control study on stroke. The bias of the effect measure for variable E in scenario I and C in scenario II can be large and should therefore be adjusted for in any analysis. It is positively associated with the difference in response rates between groups of the characteristic causing differential participation, and inversely associated with the total response rate in the controls. Adjustment in a standard logistic regression framework is possible in both scenarios if the population distribution of the characteristic causing differential participation is known or can be approximated well.

  19. Applying modern psychometric techniques to melodic discrimination testing: Item response theory, computerised adaptive testing, and automatic item generation.

    Science.gov (United States)

    Harrison, Peter M C; Collins, Tom; Müllensiefen, Daniel

    2017-06-15

    Modern psychometric theory provides many useful tools for ability testing, such as item response theory, computerised adaptive testing, and automatic item generation. However, these techniques have yet to be integrated into mainstream psychological practice. This is unfortunate, because modern psychometric techniques can bring many benefits, including sophisticated reliability measures, improved construct validity, avoidance of exposure effects, and improved efficiency. In the present research we therefore use these techniques to develop a new test of a well-studied psychological capacity: melodic discrimination, the ability to detect differences between melodies. We calibrate and validate this test in a series of studies. Studies 1 and 2 respectively calibrate and validate an initial test version, while Studies 3 and 4 calibrate and validate an updated test version incorporating additional easy items. The results support the new test's viability, with evidence for strong reliability and construct validity. We discuss how these modern psychometric techniques may also be profitably applied to other areas of music psychology and psychological science in general.

  20. Multiple imputation using linked proxy outcome data resulted in important bias reduction and efficiency gains: a simulation study.

    Science.gov (United States)

    Cornish, R P; Macleod, J; Carpenter, J R; Tilling, K

    2017-01-01

    When an outcome variable is missing not at random (MNAR: probability of missingness depends on outcome values), estimates of the effect of an exposure on this outcome are often biased. We investigated the extent of this bias and examined whether the bias can be reduced through incorporating proxy outcomes obtained through linkage to administrative data as auxiliary variables in multiple imputation (MI). Using data from the Avon Longitudinal Study of Parents and Children (ALSPAC) we estimated the association between breastfeeding and IQ (continuous outcome), incorporating linked attainment data (proxies for IQ) as auxiliary variables in MI models. Simulation studies explored the impact of varying the proportion of missing data (from 20 to 80%), the correlation between the outcome and its proxy (0.1-0.9), the strength of the missing data mechanism, and having a proxy variable that was incomplete. Incorporating a linked proxy for the missing outcome as an auxiliary variable reduced bias and increased efficiency in all scenarios, even when 80% of the outcome was missing. Using an incomplete proxy was similarly beneficial. High correlations (> 0.5) between the outcome and its proxy substantially reduced the missing information. Consistent with this, ALSPAC analysis showed inclusion of a proxy reduced bias and improved efficiency. Gains with additional proxies were modest. In longitudinal studies with loss to follow-up, incorporating proxies for this study outcome obtained via linkage to external sources of data as auxiliary variables in MI models can give practically important bias reduction and efficiency gains when the study outcome is MNAR.

  1. Selection bias in studies of human reproduction-longevity trade-offs.

    Science.gov (United States)

    Helle, Samuli

    2017-12-13

    A shorter lifespan as a potential cost of high reproductive effort in humans has intrigued researchers for more than a century. However, the results have been inconclusive so far and despite strong theoretical expectations we do not currently have compelling evidence for the longevity costs of reproduction. Using Monte Carlo simulation, it is shown here that a common practice in human reproduction-longevity studies using historical data (the most relevant data sources for this question), the omission of women who died prior to menopausal age from the analysis, results in severe underestimation of the potential underlying trade-off between reproduction and lifespan. In other words, assuming that such a trade-off is expressed also during reproductive years, the strength of the trade-off between reproduction and lifespan is progressively weakened when women dying during reproductive ages are sequentially and non-randomly excluded from the analysis. In cases of small sample sizes (e.g. few hundreds of observations), this selection bias by reducing statistical power may even partly explain the null results commonly found in this field. Future studies in this field should thus apply statistical approaches that account for or avoid selection bias in order to recover reliable effect size estimates between reproduction and longevity. © 2017 The Author(s).

  2. Quantifying the impact of selection bias caused by nonparticipation in a case-control study of mobile phone use

    DEFF Research Database (Denmark)

    Vrijheid, Martine; Richardson, Lesley; Armstrong, Bruce K

    2009-01-01

    To quantitatively assess the impact of selection bias caused by nonparticipation in a multinational case-control study of mobile phone use and brain tumor.......To quantitatively assess the impact of selection bias caused by nonparticipation in a multinational case-control study of mobile phone use and brain tumor....

  3. Attentional bias to betel quid cues: An eye tracking study.

    Science.gov (United States)

    Shen, Bin; Chiu, Meng-Chun; Li, Shuo-Heng; Huang, Guo-Joe; Liu, Ling-Jun; Ho, Ming-Chou

    2016-09-01

    The World Health Organization regards betel quid as a human carcinogen, and DSM-IV and ICD-10 dependence symptoms may develop with heavy use. This study, conducted in central Taiwan, investigated whether betel quid chewers can exhibit overt orienting to selectively respond to the betel quid cues. Twenty-four male chewers' and 23 male nonchewers' eye movements to betel-quid-related pictures and matched pictures were assessed during a visual probe task. The eye movement index showed that betel quid chewers were more likely to initially direct their gaze to the betel quid cues, t(23) = 3.70, p betel quid chewers' attentional bias. The results demonstrated that the betel quid chewers (but not the nonchewers) were more likely to initially direct their gaze to the betel quid cues, and spent more time and were more fixated on them. These findings suggested that when attention is directly measured through the eye tracking technique, this methodology may be more sensitive to detecting attentional biases in betel quid chewers. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  4. Bias correction of risk estimates in vaccine safety studies with rare adverse events using a self-controlled case series design.

    Science.gov (United States)

    Zeng, Chan; Newcomer, Sophia R; Glanz, Jason M; Shoup, Jo Ann; Daley, Matthew F; Hambidge, Simon J; Xu, Stanley

    2013-12-15

    The self-controlled case series (SCCS) method is often used to examine the temporal association between vaccination and adverse events using only data from patients who experienced such events. Conditional Poisson regression models are used to estimate incidence rate ratios, and these models perform well with large or medium-sized case samples. However, in some vaccine safety studies, the adverse events studied are rare and the maximum likelihood estimates may be biased. Several bias correction methods have been examined in case-control studies using conditional logistic regression, but none of these methods have been evaluated in studies using the SCCS design. In this study, we used simulations to evaluate 2 bias correction approaches-the Firth penalized maximum likelihood method and Cordeiro and McCullagh's bias reduction after maximum likelihood estimation-with small sample sizes in studies using the SCCS design. The simulations showed that the bias under the SCCS design with a small number of cases can be large and is also sensitive to a short risk period. The Firth correction method provides finite and less biased estimates than the maximum likelihood method and Cordeiro and McCullagh's method. However, limitations still exist when the risk period in the SCCS design is short relative to the entire observation period.

  5. A systematic review of context bias in invasion biology.

    Directory of Open Access Journals (Sweden)

    Robert J Warren

    Full Text Available The language that scientists use to frame biological invasions may reveal inherent bias-including how data are interpreted. A frequent critique of invasion biology is the use of value-laden language that may indicate context bias. Here we use a systematic study of language and interpretation in papers drawn from invasion biology to evaluate whether there is a link between the framing of papers and the interpretation of results. We also examine any trends in context bias in biological invasion research. We examined 651 peer-reviewed invasive species competition studies and implemented a rigorous systematic review to examine bias in the presentation and interpretation of native and invasive competition in invasion biology. We predicted that bias in the presentation of invasive species is increasing, as suggested by several authors, and that bias against invasive species would result in misinterpreting their competitive dominance in correlational observational studies compared to causative experimental studies. We indeed found evidence of bias in the presentation and interpretation of invasive species research; authors often introduced research with invasive species in a negative context and study results were interpreted against invasive species more in correlational studies. However, we also found a distinct decrease in those biases since the mid-2000s. Given that there have been several waves of criticism from scientists both inside and outside invasion biology, our evidence suggests that the subdiscipline has somewhat self-corrected apparent biases.

  6. A systematic review of context bias in invasion biology.

    Science.gov (United States)

    Warren, Robert J; King, Joshua R; Tarsa, Charlene; Haas, Brian; Henderson, Jeremy

    2017-01-01

    The language that scientists use to frame biological invasions may reveal inherent bias-including how data are interpreted. A frequent critique of invasion biology is the use of value-laden language that may indicate context bias. Here we use a systematic study of language and interpretation in papers drawn from invasion biology to evaluate whether there is a link between the framing of papers and the interpretation of results. We also examine any trends in context bias in biological invasion research. We examined 651 peer-reviewed invasive species competition studies and implemented a rigorous systematic review to examine bias in the presentation and interpretation of native and invasive competition in invasion biology. We predicted that bias in the presentation of invasive species is increasing, as suggested by several authors, and that bias against invasive species would result in misinterpreting their competitive dominance in correlational observational studies compared to causative experimental studies. We indeed found evidence of bias in the presentation and interpretation of invasive species research; authors often introduced research with invasive species in a negative context and study results were interpreted against invasive species more in correlational studies. However, we also found a distinct decrease in those biases since the mid-2000s. Given that there have been several waves of criticism from scientists both inside and outside invasion biology, our evidence suggests that the subdiscipline has somewhat self-corrected apparent biases.

  7. Methodological quality of diagnostic accuracy studies on non-invasive coronary CT angiography: influence of QUADAS (Quality Assessment of Diagnostic Accuracy Studies included in systematic reviews) items on sensitivity and specificity

    International Nuclear Information System (INIS)

    Schueler, Sabine; Walther, Stefan; Schuetz, Georg M.; Schlattmann, Peter; Dewey, Marc

    2013-01-01

    To evaluate the methodological quality of diagnostic accuracy studies on coronary computed tomography (CT) angiography using the QUADAS (Quality Assessment of Diagnostic Accuracy Studies included in systematic reviews) tool. Each QUADAS item was individually defined to adapt it to the special requirements of studies on coronary CT angiography. Two independent investigators analysed 118 studies using 12 QUADAS items. Meta-regression and pooled analyses were performed to identify possible effects of methodological quality items on estimates of diagnostic accuracy. The overall methodological quality of coronary CT studies was merely moderate. They fulfilled a median of 7.5 out of 12 items. Only 9 of the 118 studies fulfilled more than 75 % of possible QUADAS items. One QUADAS item (''Uninterpretable Results'') showed a significant influence (P = 0.02) on estimates of diagnostic accuracy with ''no fulfilment'' increasing specificity from 86 to 90 %. Furthermore, pooled analysis revealed that each QUADAS item that is not fulfilled has the potential to change estimates of diagnostic accuracy. The methodological quality of studies investigating the diagnostic accuracy of non-invasive coronary CT is only moderate and was found to affect the sensitivity and specificity. An improvement is highly desirable because good methodology is crucial for adequately assessing imaging technologies. (orig.)

  8. Methodological quality of diagnostic accuracy studies on non-invasive coronary CT angiography: influence of QUADAS (Quality Assessment of Diagnostic Accuracy Studies included in systematic reviews) items on sensitivity and specificity

    Energy Technology Data Exchange (ETDEWEB)

    Schueler, Sabine; Walther, Stefan; Schuetz, Georg M. [Humboldt-Universitaet zu Berlin, Freie Universitaet Berlin, Charite Medical School, Department of Radiology, Berlin (Germany); Schlattmann, Peter [University Hospital of Friedrich Schiller University Jena, Department of Medical Statistics, Informatics, and Documentation, Jena (Germany); Dewey, Marc [Humboldt-Universitaet zu Berlin, Freie Universitaet Berlin, Charite Medical School, Department of Radiology, Berlin (Germany); Charite, Institut fuer Radiologie, Berlin (Germany)

    2013-06-15

    To evaluate the methodological quality of diagnostic accuracy studies on coronary computed tomography (CT) angiography using the QUADAS (Quality Assessment of Diagnostic Accuracy Studies included in systematic reviews) tool. Each QUADAS item was individually defined to adapt it to the special requirements of studies on coronary CT angiography. Two independent investigators analysed 118 studies using 12 QUADAS items. Meta-regression and pooled analyses were performed to identify possible effects of methodological quality items on estimates of diagnostic accuracy. The overall methodological quality of coronary CT studies was merely moderate. They fulfilled a median of 7.5 out of 12 items. Only 9 of the 118 studies fulfilled more than 75 % of possible QUADAS items. One QUADAS item (''Uninterpretable Results'') showed a significant influence (P = 0.02) on estimates of diagnostic accuracy with ''no fulfilment'' increasing specificity from 86 to 90 %. Furthermore, pooled analysis revealed that each QUADAS item that is not fulfilled has the potential to change estimates of diagnostic accuracy. The methodological quality of studies investigating the diagnostic accuracy of non-invasive coronary CT is only moderate and was found to affect the sensitivity and specificity. An improvement is highly desirable because good methodology is crucial for adequately assessing imaging technologies. (orig.)

  9. Will a Short Training Session Improve Multiple-Choice Item-Writing Quality by Dental School Faculty? A Pilot Study.

    Science.gov (United States)

    Dellinges, Mark A; Curtis, Donald A

    2017-08-01

    Faculty members are expected to write high-quality multiple-choice questions (MCQs) in order to accurately assess dental students' achievement. However, most dental school faculty members are not trained to write MCQs. Extensive faculty development programs have been used to help educators write better test items. The aim of this pilot study was to determine if a short workshop would result in improved MCQ item-writing by dental school faculty at one U.S. dental school. A total of 24 dental school faculty members who had previously written MCQs were randomized into a no-intervention group and an intervention group in 2015. Six previously written MCQs were randomly selected from each of the faculty members and given an item quality score. The intervention group participated in a training session of one-hour duration that focused on reviewing standard item-writing guidelines to improve in-house MCQs. The no-intervention group did not receive any training but did receive encouragement and an explanation of why good MCQ writing was important. The faculty members were then asked to revise their previously written questions, and these were given an item quality score. The item quality scores for each faculty member were averaged, and the difference from pre-training to post-training scores was evaluated. The results showed a significant difference between pre-training and post-training MCQ difference scores for the intervention group (p=0.04). This pilot study provides evidence that the training session of short duration was effective in improving the quality of in-house MCQs.

  10. An item response theory analysis of the Executive Interview and development of the EXIT8: A Project FRONTIER Study.

    Science.gov (United States)

    Jahn, Danielle R; Dressel, Jeffrey A; Gavett, Brandon E; O'Bryant, Sid E

    2015-01-01

    The Executive Interview (EXIT25) is an effective measure of executive dysfunction, but may be inefficient due to the time it takes to complete 25 interview-based items. The current study aimed to examine psychometric properties of the EXIT25, with a specific focus on determining whether a briefer version of the measure could comprehensively assess executive dysfunction. The current study applied a graded response model (a type of item response theory model for polytomous categorical data) to identify items that were most closely related to the underlying construct of executive functioning and best discriminated between varying levels of executive functioning. Participants were 660 adults ages 40 to 96 years living in West Texas, who were recruited through an ongoing epidemiological study of rural health and aging, called Project FRONTIER. The EXIT25 was the primary measure examined. Participants also completed the Trail Making Test and Controlled Oral Word Association Test, among other measures, to examine the convergent validity of a brief form of the EXIT25. Eight items were identified that provided the majority of the information about the underlying construct of executive functioning; total scores on these items were associated with total scores on other measures of executive functioning and were able to differentiate between cognitively healthy, mildly cognitively impaired, and demented participants. In addition, cutoff scores were recommended based on sensitivity and specificity of scores. A brief, eight-item version of the EXIT25 may be an effective and efficient screening for executive dysfunction among older adults.

  11. On the relative independence of thinking biases and cognitive ability.

    Science.gov (United States)

    Stanovich, Keith E; West, Richard F

    2008-04-01

    In 7 different studies, the authors observed that a large number of thinking biases are uncorrelated with cognitive ability. These thinking biases include some of the most classic and well-studied biases in the heuristics and biases literature, including the conjunction effect, framing effects, anchoring effects, outcome bias, base-rate neglect, "less is more" effects, affect biases, omission bias, myside bias, sunk-cost effect, and certainty effects that violate the axioms of expected utility theory. In a further experiment, the authors nonetheless showed that cognitive ability does correlate with the tendency to avoid some rational thinking biases, specifically the tendency to display denominator neglect, probability matching rather than maximizing, belief bias, and matching bias on the 4-card selection task. The authors present a framework for predicting when cognitive ability will and will not correlate with a rational thinking tendency. (c) 2008 APA, all rights reserved.

  12. Rasch Measurement Analysis of a 25-Item Version of the Mueller/McCloskey Nurse Job Satisfaction Scale in a Sample of Nurses in Lebanon and Qatar

    Directory of Open Access Journals (Sweden)

    Michael Clinton

    2015-06-01

    Full Text Available The Mueller/McCloskey Nurse Job Satisfaction Scale (MMSS is widely used, but its psychometric characteristics have not been sufficiently validated for use in Middle Eastern countries. The objective of our methodological study was to determine the psychometric suitability of a 25-item version of the MMSS (MMSS-25 for use in middle-income and high-income Middle Eastern countries. A total of 1,322 registered nurses, 859 in Lebanon and 463 in Qatar, completed the MMSS-25 as part of a cross-sectional multinational investigation of nursing shortages in the region. We used the Rasch rating scale model to investigate the psychometric performance of the MMSS-25. We identified possible item bias among MMSS-25 items. We conducted confirmatory factor analyses (CFA to compare the fit to our data of five factor structures reported in the literature. We concluded that irrespective of administration in English or Arabic, the MMSS-25 is not sufficiently productive of measurement for use in the region. A core set of 13 items (MMSS-13, Cronbach’s α = .82 loading on five dimensions eliminates redundant MMSS items and is suitable for initial screening of nurses’ satisfaction. Of the five factor structures we examined, the MMSS-13 was the only close fit to our data (comparative fit index = 0.951; Tucker–Lewis index = 0.931; root mean square error of approximation = 0.051; p value = .401. The MMSS-13 has psychometric characteristics superior to MMSS-25, but additional items are required to meet the research-specific objectives of future studies of nurses’ job satisfaction in Middle Eastern countries.

  13. An empirical comparison of Item Response Theory and Classical Test Theory

    Directory of Open Access Journals (Sweden)

    Špela Progar

    2008-11-01

    Full Text Available Based on nonlinear models between the measured latent variable and the item response, item response theory (IRT enables independent estimation of item and person parameters and local estimation of measurement error. These properties of IRT are also the main theoretical advantages of IRT over classical test theory (CTT. Empirical evidence, however, often failed to discover consistent differences between IRT and CTT parameters and between invariance measures of CTT and IRT parameter estimates. In this empirical study a real data set from the Third International Mathematics and Science Study (TIMSS 1995 was used to address the following questions: (1 How comparable are CTT and IRT based item and person parameters? (2 How invariant are CTT and IRT based item parameters across different participant groups? (3 How invariant are CTT and IRT based item and person parameters across different item sets? The findings indicate that the CTT and the IRT item/person parameters are very comparable, that the CTT and the IRT item parameters show similar invariance property when estimated across different groups of participants, that the IRT person parameters are more invariant across different item sets, and that the CTT item parameters are at least as much invariant in different item sets as the IRT item parameters. The results furthermore demonstrate that, with regards to the invariance property, IRT item/person parameters are in general empirically superior to CTT parameters, but only if the appropriate IRT model is used for modelling the data.

  14. Attention bias modification training under working memory load increases the magnitude of change in attentional bias.

    Science.gov (United States)

    Clarke, Patrick J F; Branson, Sonya; Chen, Nigel T M; Van Bockstaele, Bram; Salemink, Elske; MacLeod, Colin; Notebaert, Lies

    2017-12-01

    Attention bias modification (ABM) procedures have shown promise as a therapeutic intervention, however current ABM procedures have proven inconsistent in their ability to reliably achieve the requisite change in attentional bias needed to produce emotional benefits. This highlights the need to better understand the precise task conditions that facilitate the intended change in attention bias in order to realise the therapeutic potential of ABM procedures. Based on the observation that change in attentional bias occurs largely outside conscious awareness, the aim of the current study was to determine if an ABM procedure delivered under conditions likely to preclude explicit awareness of the experimental contingency, via the addition of a working memory load, would contribute to greater change in attentional bias. Bias change was assessed among 122 participants in response to one of four ABM tasks given by the two experimental factors of ABM training procedure delivered either with or without working memory load, and training direction of either attend-negative or avoid-negative. Findings revealed that avoid-negative ABM procedure under working memory load resulted in significantly greater reductions in attentional bias compared to the equivalent no-load condition. The current findings will require replication with clinical samples to determine the utility of the current task for achieving emotional benefits. These present findings are consistent with the position that the addition of a working memory load may facilitate change in attentional bias in response to an ABM training procedure. Copyright © 2017 Elsevier Ltd. All rights reserved.

  15. Updating schematic emotional facial expressions in working memory: Response bias and sensitivity.

    Science.gov (United States)

    Tamm, Gerly; Kreegipuu, Kairi; Harro, Jaanus; Cowan, Nelson

    2017-01-01

    It is unclear if positive, negative, or neutral emotional expressions have an advantage in short-term recognition. Moreover, it is unclear from previous studies of working memory for emotional faces whether effects of emotions comprise response bias or sensitivity. The aim of this study was to compare how schematic emotional expressions (sad, angry, scheming, happy, and neutral) are discriminated and recognized in an updating task (2-back recognition) in a representative sample of birth cohort of young adults. Schematic facial expressions allow control of identity processing, which is separate from expression processing, and have been used extensively in attention research but not much, until now, in working memory research. We found that expressions with a U-curved mouth (i.e., upwardly curved), namely happy and scheming expressions, favoured a bias towards recognition (i.e., towards indicating that the probe and the stimulus in working memory are the same). Other effects of emotional expression were considerably smaller (1-2% of the variance explained)) compared to a large proportion of variance that was explained by the physical similarity of items being compared. We suggest that the nature of the stimuli plays a role in this. The present application of signal detection methodology with emotional, schematic faces in a working memory procedure requiring fast comparisons helps to resolve important contradictions that have emerged in the emotional perception literature. Copyright © 2016 Elsevier B.V. All rights reserved.

  16. Selection bias in genetic-epidemiological studies of cleft lip and palate

    Energy Technology Data Exchange (ETDEWEB)

    Christensen, K.; Holm, N.V.; Kock, K. (Odense Univ. (Denmark)); Olsen, J. (Aarhus Univ. (Denmark)); Fogh-Anderson, P.

    1992-09-01

    The possible impact of selection bias in genetic and epidemiological studies of cleft lip and palate was studied, using three nationwide ascertainment sources and an autopsy study in a 10% sample of the Danish population. A total of 670 cases were identified. Two national record systems, when used together, were found suitable for ascertaining facial cleft in live births. More than 95% ascertainment was obtained by means of surgical files for cleft lip (with or without cleft palate) without associated malformations/syndromes. However, surgical files could be a poor source for studying isolated cleft palate (CP) (only a 60% and biased ascertainment), and they cannot be used to study the prevalence of associated malformations or syndromes in facial cleft cases. The male:female ratio was 0.88 in surgically treated cases of CP and was 1.5 in nonoperated CP cases, making the overall sex ratio for CP 1.1 (95% confidence limits 0.86-1.4) The sex ratio for CP without associated malformation was 1.1 (95% confidence limits 0.84-1.6). One of the major test criteria in CP multifactorial threshold models (higher CP liability among male CP relatives) must be reconsidered, if other investigations confirm that a CP sex-ratio reversal to male predominance occurs when high ascertainment is achieved. 24 refs., 1 fig., 4 tabs.

  17. Generalizability theory and item response theory

    NARCIS (Netherlands)

    Glas, Cornelis A.W.; Eggen, T.J.H.M.; Veldkamp, B.P.

    2012-01-01

    Item response theory is usually applied to items with a selected-response format, such as multiple choice items, whereas generalizability theory is usually applied to constructed-response tasks assessed by raters. However, in many situations, raters may use rating scales consisting of items with a

  18. Sharing the cost of redundant items

    DEFF Research Database (Denmark)

    Hougaard, Jens Leth; Moulin, Hervé

    2014-01-01

    We ask how to share the cost of finitely many public goods (items) among users with different needs: some smaller subsets of items are enough to serve the needs of each user, yet the cost of all items must be covered, even if this entails inefficiently paying for redundant items. Typical examples...... are network connectivity problems when an existing (possibly inefficient) network must be maintained. We axiomatize a family cost ratios based on simple liability indices, one for each agent and for each item, measuring the relative worth of this item across agents, and generating cost allocation rules...... additive in costs....

  19. Positivity effect in source attributions of arousal-matched emotional and non-emotional words during item-based directed forgetting.

    Science.gov (United States)

    Gallant, Sara N; Yang, Lixia

    2014-01-01

    Consistent with their emphasis on emotional goals, older adults often exhibit a positivity bias in attention and memory relative to their young counterparts (i.e., a positivity effect). The current study sought to determine how this age-related positivity effect would impact intentional forgetting of emotional words, a process critical to efficient operation of memory. Using an item-based directed forgetting task, 36 young and 36 older adults studied a series of arousal-equivalent words that varied in valence (i.e., positive, negative, and neutral). Each word was followed by a cue to either remember or forget the word. A subsequent "tagging" recognition task required classification of items as to-be-remembered (TBR), to-be-forgotten (TBF), or new as a measure of directed forgetting and source attribution in participants' memory. Neither young nor older adults' intentional forgetting was affected by the valence of words. A goal-consistent valence effect did, however, emerge in older adults' source attribution performance. Specifically, older adults assigned more TBR-cues to positive words and more TBF-cues to negative words. Results are discussed in light of existing literature on emotion and directed forgetting as well as the socioemotional selectivity theory underlying the age-related positivity effect.

  20. Positivity effect in source attributions of arousal-matched emotional and non-emotional words during item-based directed forgetting

    Directory of Open Access Journals (Sweden)

    Sara N. Gallant

    2014-11-01

    Full Text Available Consistent with their emphasis on emotional goals, older adults often exhibit a positivity bias in attention and memory relative to their young counterparts (i.e., a positivity effect. The current study sought to determine how this age-related positivity effect would impact intentional forgetting of emotional words, a process critical to efficient operation of memory. Using an item-based directed forgetting task, 36 young and 36 older adults studied a series of arousal-equivalent words that varied in valence (i.e., positive, negative, and neutral. Each word was followed by a cue to either remember or forget the word. A subsequent tagging recognition task required classification of items as to-be-remembered (TBR, to-be-forgotten (TBF, or new as a measure of directed forgetting and source attribution in participants’ memory. Valence did not affect intentional forgetting in both young and older age groups. A goal-consistent valence effect did, however, emerge in older adults’ source attribution performance. Specifically, older adults assigned more TBR-cues to positive words and more TBF-cues to negative words. Results are discussed in light of existing literature on emotion and directed forgetting as well as the socioemotional selectivity theory underlying the age-related positivity effect.

  1. Domain wall engineering through exchange bias

    International Nuclear Information System (INIS)

    Albisetti, E.; Petti, D.

    2016-01-01

    The control of the structure and position of magnetic domain walls is at the basis of the development of different magnetic devices and architectures. Several nanofabrication techniques have been proposed to geometrically confine and shape domain wall structures; however, a fine tuning of the position and micromagnetic configuration is hardly achieved, especially in continuous films. This work shows that, by controlling the unidirectional anisotropy of a continuous ferromagnetic film through exchange bias, domain walls whose spin arrangement is generally not favored by dipolar and exchange interactions can be created. Micromagnetic simulations reveal that the domain wall width, position and profile can be tuned by establishing an abrupt change in the direction and magnitude of the exchange bias field set in the system. - Highlights: • Micromagnetic simulations study domain walls in exchange biased thin films. • Novel domain wall configurations can be stabilized via exchange bias. • Domain walls nucleate at the boundary of regions with different exchange bias. • Domain wall width and spin profile are controlled by tuning the exchange bias.

  2. Exploring differential item functioning (DIF) with the Rasch model: a comparison of gender differences on eighth grade science items in the United States and Spain.

    Science.gov (United States)

    Babiar, Tasha Calvert

    2011-01-01

    Traditionally, women and minorities have not been fully represented in science and engineering. Numerous studies have attributed these differences to gaps in science achievement as measured by various standardized tests. Rather than describe mean group differences in science achievement across multiple cultures, this study focused on an in-depth item-level analysis across two countries: Spain and the United States. This study investigated eighth-grade gender differences on science items across the two countries. A secondary purpose of the study was to explore the nature of gender differences using the many-faceted Rasch Model as a way to estimate gender DIF. A secondary analysis of data from the Third International Mathematics and Science Study (TIMSS) was used to address three questions: 1) Does gender DIF in science achievement exist? 2) Is there a relationship between gender DIF and characteristics of the science items? 3) Do the relationships between item characteristics and gender DIF in science items replicate across countries. Participants included 7,087 eight grade students from the United States and 3,855 students from Spain who participated in TIMSS. The Facets program (Linacre and Wright, 1992) was used to estimate gender DIF. The results of the analysis indicate that the content of the item seemed to be related to gender DIF. The analysis also suggests that there is a relationship between gender DIF and item format. No pattern of gender DIF related to cognitive demand was found. The general pattern of gender DIF was similar across the two countries used in the analysis. The strength of item-level analysis as opposed to group mean difference analysis is that gender differences can be detected at the item level, even when no mean differences can be detected at the group level.

  3. The Number of Response Categories and the Reverse Directional Item Problem in Likert-Type Scales: A Study with the Rasch Model

    Directory of Open Access Journals (Sweden)

    Mustafa İLHAN

    2017-09-01

    Full Text Available This study addressed reverse directional item and the number of response categories problems in Likert-type scales. The Fear of Negative Evaluation Scale (FNES and the Oxford Happiness Questionnaire (OHQ were used as data collection tools. The data of the study were analyzed according to the Rasch model. The analysis found that the observed and expected test characteristic curves were largely overlapped, each of the three rating scales worked effectively, and the differences between response categories could be distinguished successfully by the participants in straightforward directional items. On the other hand, it was determined that there were significant differences between the observed and expected test characteristic curves in reverse directional items. It was also found that no matter which one of these three, five and seven-point rating scales was used, the participants could not distinguish the response categories of the reverse directional items on the FNES and the OHQ. Afterwards, the reverse directional items were removed from the data file, and the analysis was repeated. The analysis results revealed that item discrimination, reliability coefficients for person facet, separation ratios and Chi square values calculated for the facets of person and items were higher in five-pointed rating compared to three and seven pointed rating.

  4. Generalizability theory and item response theory

    OpenAIRE

    Glas, Cornelis A.W.; Eggen, T.J.H.M.; Veldkamp, B.P.

    2012-01-01

    Item response theory is usually applied to items with a selected-response format, such as multiple choice items, whereas generalizability theory is usually applied to constructed-response tasks assessed by raters. However, in many situations, raters may use rating scales consisting of items with a selected-response format. This chapter presents a short overview of how item response theory and generalizability theory were integrated to model such assessments. Further, the precision of the esti...

  5. The valuation of environmental goods in Norway: A contingent valuation study with multiple bias testing

    Energy Technology Data Exchange (ETDEWEB)

    Strand, J.; Taraldset, A.

    1991-01-01

    We report on a study of contingent valuation of reduction in air pollution, and of a broader set of six environmental issues, among a population sample in Oslo. We derive an estimate of the extent of upward bias due to mental accouting'' in the expressed valuation of the air pollution issue, in two steps: (1) by comparing valuation of air pollution alone, with the same when the other issues at the same time are to be dealt with; and (2) by deriving the implicit valuation of the air pollution issue from the ranking of issues, and total valuation of all six issues. We find that expressed valuation of air pollution reductions are 3-4 times as high as the true'' values, and argue that this discrepancy is mainly due to mental account biases. We also test for strategic, starting point, information and interviewer biases, which are all present and, with the exception of the information bias, all in the expected directions. 9 refs., 4 tabs.

  6. Extending item response theory to online homework

    Directory of Open Access Journals (Sweden)

    Gerd Kortemeyer

    2014-05-01

    Full Text Available Item response theory (IRT becomes an increasingly important tool when analyzing “big data” gathered from online educational venues. However, the mechanism was originally developed in traditional exam settings, and several of its assumptions are infringed upon when deployed in the online realm. For a large-enrollment physics course for scientists and engineers, the study compares outcomes from IRT analyses of exam and homework data, and then proceeds to investigate the effects of each confounding factor introduced in the online realm. It is found that IRT yields the correct trends for learner ability and meaningful item parameters, yet overall agreement with exam data is moderate. It is also found that learner ability and item discrimination is robust over a wide range with respect to model assumptions and introduced noise. Item difficulty is also robust, but over a narrower range.

  7. Effects of personal characteristics on susceptibility to decision bias : a literature study

    NARCIS (Netherlands)

    Toet, A.; Brouwer, A.M.; Bosch, K. van den; Korteling, J.E.

    2016-01-01

    Cognitive biases and heuristics are pervasive simplifications and distortions in judgement and reasoning that systematically affect human decision making. Knowledge in this area may enable us to foresee and reduce detrimental effects of biases or to influence others more effectively. We therefore

  8. Approximate Bias Correction in Econometrics

    OpenAIRE

    James G. MacKinnon; Anthony A. Smith Jr.

    1995-01-01

    This paper discusses ways to reduce the bias of consistent estimators that are biased in finite samples. It is necessary that the bias function, which relates parameter values to bias, should be estimable by computer simulation or by some other method. If so, bias can be reduced or, in some cases that may not be unrealistic, even eliminated. In general, several evaluations of the bias function will be required to do this. Unfortunately, reducing bias may increase the variance, or even the mea...

  9. Threats to Validity When Using Open-Ended Items in International Achievement Studies: Coding Responses to the PISA 2012 Problem-Solving Test in Finland

    Science.gov (United States)

    Arffman, Inga

    2016-01-01

    Open-ended (OE) items are widely used to gather data on student performance in international achievement studies. However, several factors may threaten validity when using such items. This study examined Finnish coders' opinions about threats to validity when coding responses to OE items in the PISA 2012 problem-solving test. A total of 6…

  10. Item Response Theory with Covariates (IRT-C): Assessing Item Recovery and Differential Item Functioning for the Three-Parameter Logistic Model

    Science.gov (United States)

    Tay, Louis; Huang, Qiming; Vermunt, Jeroen K.

    2016-01-01

    In large-scale testing, the use of multigroup approaches is limited for assessing differential item functioning (DIF) across multiple variables as DIF is examined for each variable separately. In contrast, the item response theory with covariate (IRT-C) procedure can be used to examine DIF across multiple variables (covariates) simultaneously. To…

  11. Risk of bias and confounding of observational studies of Zika virus infection: A scoping review of research protocols.

    Science.gov (United States)

    Reveiz, Ludovic; Haby, Michelle M; Martínez-Vega, Ruth; Pinzón-Flores, Carlos E; Elias, Vanessa; Smith, Emma; Pinart, Mariona; Broutet, Nathalie; Becerra-Posada, Francisco; Aldighieri, Sylvain; Van Kerkhove, Maria D

    2017-01-01

    Given the severity and impact of the current Zika virus (ZIKV) outbreak in the Americas, numerous countries have rushed to develop research studies to assess ZIKV and its potential health consequences. In an effort to ensure that studies are comprehensive, both internally and externally valid, and with reliable results, the World Health Organization, the Pan American Health Organization, Institut Pasteur, the networks of Fiocruz, the Consortia for the Standardization of Influenza Seroepidemiology (CONSISE) and the International Severe Acute Respiratory and Emerging Infection Consortium (ISARIC) have generated six standardized clinical and epidemiological research protocols and questionnaires to address key public health questions on ZIKV. We conducted a systematic search of ongoing study protocols related to ZIKV research. We analyzed the content of protocols of 32 cohort studies and 13 case control studies for systematic bias that could produce erroneous results. Additionally we aimed to characterize the risks of bias and confounding in observational studies related to ZIKV and to propose ways to minimize them, including the use of six newly standardized research protocols. Observational studies of ZIKV face an array of challenges, including measurement of exposure and outcomes (microcephaly and Guillain-Barré Syndrome). Potential confounders need to be measured where known and controlled for in the analysis. Selection bias due to non-random selection is a significant issue, particularly in the case-control design, and losses to follow-up is equally important for the cohort design. Observational research seeking to answer key questions on the ZIKV should consider these restrictions and take precautions to minimize bias in an effort to provide reliable and valid results. Utilization of the standardized research protocols developed by the WHO, PAHO, Institut Pasteur, and CONSISE will harmonize the key methodological aspects of each study design to minimize bias at

  12. Risk of bias and confounding of observational studies of Zika virus infection: A scoping review of research protocols.

    Directory of Open Access Journals (Sweden)

    Ludovic Reveiz

    Full Text Available Given the severity and impact of the current Zika virus (ZIKV outbreak in the Americas, numerous countries have rushed to develop research studies to assess ZIKV and its potential health consequences. In an effort to ensure that studies are comprehensive, both internally and externally valid, and with reliable results, the World Health Organization, the Pan American Health Organization, Institut Pasteur, the networks of Fiocruz, the Consortia for the Standardization of Influenza Seroepidemiology (CONSISE and the International Severe Acute Respiratory and Emerging Infection Consortium (ISARIC have generated six standardized clinical and epidemiological research protocols and questionnaires to address key public health questions on ZIKV.We conducted a systematic search of ongoing study protocols related to ZIKV research. We analyzed the content of protocols of 32 cohort studies and 13 case control studies for systematic bias that could produce erroneous results. Additionally we aimed to characterize the risks of bias and confounding in observational studies related to ZIKV and to propose ways to minimize them, including the use of six newly standardized research protocols.Observational studies of ZIKV face an array of challenges, including measurement of exposure and outcomes (microcephaly and Guillain-Barré Syndrome. Potential confounders need to be measured where known and controlled for in the analysis. Selection bias due to non-random selection is a significant issue, particularly in the case-control design, and losses to follow-up is equally important for the cohort design.Observational research seeking to answer key questions on the ZIKV should consider these restrictions and take precautions to minimize bias in an effort to provide reliable and valid results. Utilization of the standardized research protocols developed by the WHO, PAHO, Institut Pasteur, and CONSISE will harmonize the key methodological aspects of each study design to

  13. A Diagnostic Study of Pre-Service Teachers' Competency in Multiple-Choice Item Development

    Science.gov (United States)

    Asim, Alice E.; Ekuri, Emmanuel E.; Eni, Eni I.

    2013-01-01

    Large class size is an issue in testing at all levels of Education. As a panacea to this, multiple choice test formats has become very popular. This case study was designed to diagnose pre-service teachers' competency in constructing questions (IQT); direct questions (DQT); and best answer (BAT) varieties of multiple choice items. Subjects were 88…

  14. The randomly renewed general item and the randomly inspected item with exponential life distribution

    International Nuclear Information System (INIS)

    Schneeweiss, W.G.

    1979-01-01

    For a randomly renewed item the probability distributions of the time to failure and of the duration of down time and the expectations of these random variables are determined. Moreover, it is shown that the same theory applies to randomly checked items with exponential probability distribution of life such as electronic items. The case of periodic renewals is treated as an example. (orig.) [de

  15. Sources of interference in item and associative recognition memory.

    Science.gov (United States)

    Osth, Adam F; Dennis, Simon

    2015-04-01

    A powerful theoretical framework for exploring recognition memory is the global matching framework, in which a cue's memory strength reflects the similarity of the retrieval cues being matched against the contents of memory simultaneously. Contributions at retrieval can be categorized as matches and mismatches to the item and context cues, including the self match (match on item and context), item noise (match on context, mismatch on item), context noise (match on item, mismatch on context), and background noise (mismatch on item and context). We present a model that directly parameterizes the matches and mismatches to the item and context cues, which enables estimation of the magnitude of each interference contribution (item noise, context noise, and background noise). The model was fit within a hierarchical Bayesian framework to 10 recognition memory datasets that use manipulations of strength, list length, list strength, word frequency, study-test delay, and stimulus class in item and associative recognition. Estimates of the model parameters revealed at most a small contribution of item noise that varies by stimulus class, with virtually no item noise for single words and scenes. Despite the unpopularity of background noise in recognition memory models, background noise estimates dominated at retrieval across nearly all stimulus classes with the exception of high frequency words, which exhibited equivalent levels of context noise and background noise. These parameter estimates suggest that the majority of interference in recognition memory stems from experiences acquired before the learning episode. (c) 2015 APA, all rights reserved).

  16. A Monte Carlo Study of the Effect of Item Characteristic Curve Estimation on the Accuracy of Three Person-Fit Statistics

    Science.gov (United States)

    St-Onge, Christina; Valois, Pierre; Abdous, Belkacem; Germain, Stephane

    2009-01-01

    To date, there have been no studies comparing parametric and nonparametric Item Characteristic Curve (ICC) estimation methods on the effectiveness of Person-Fit Statistics (PFS). The primary aim of this study was to determine if the use of ICCs estimated by nonparametric methods would increase the accuracy of item response theory-based PFS for…

  17. A New Navigation Satellite Clock Bias Prediction Method Based on Modified Clock-bias Quadratic Polynomial Model

    Science.gov (United States)

    Wang, Y. P.; Lu, Z. P.; Sun, D. S.; Wang, N.

    2016-01-01

    In order to better express the characteristics of satellite clock bias (SCB) and improve SCB prediction precision, this paper proposed a new SCB prediction model which can take physical characteristics of space-borne atomic clock, the cyclic variation, and random part of SCB into consideration. First, the new model employs a quadratic polynomial model with periodic items to fit and extract the trend term and cyclic term of SCB; then based on the characteristics of fitting residuals, a time series ARIMA ~(Auto-Regressive Integrated Moving Average) model is used to model the residuals; eventually, the results from the two models are combined to obtain final SCB prediction values. At last, this paper uses precise SCB data from IGS (International GNSS Service) to conduct prediction tests, and the results show that the proposed model is effective and has better prediction performance compared with the quadratic polynomial model, grey model, and ARIMA model. In addition, the new method can also overcome the insufficiency of the ARIMA model in model recognition and order determination.

  18. An Investigation of Item Type in a Standards-Based Assessment.

    Directory of Open Access Journals (Sweden)

    Liz Hollingworth

    2007-12-01

    Full Text Available Large-scale state assessment programs use both multiple-choice and open-ended items on tests for accountability purposes. Certainly, there is an intuitive belief among some educators and policy makers that open-ended items measure something different than multiple-choice items. This study examined two item formats in custom-built, standards-based tests of achievement in Reading and Mathematics at grades 3-8. In this paper, we raise questions about the value of including open-ended items, given scoring costs, time constraints, and the higher probability of missing data from test-takers.

  19. Procedures for Selecting Items for Computerized Adaptive Tests.

    Science.gov (United States)

    Kingsbury, G. Gage; Zara, Anthony R.

    1989-01-01

    Several classical approaches and alternative approaches to item selection for computerized adaptive testing (CAT) are reviewed and compared. The study also describes procedures for constrained CAT that may be added to classical item selection approaches to allow them to be used for applied testing. (TJH)

  20. Effect and reporting bias of RhoA/ROCK-blockade intervention on locomotor recovery after spinal cord injury: a systematic review and meta-analysis.

    Science.gov (United States)

    Watzlawick, Ralf; Sena, Emily S; Dirnagl, Ulrich; Brommer, Benedikt; Kopp, Marcel A; Macleod, Malcolm R; Howells, David W; Schwab, Jan M

    2014-01-01

    Blockade of small GTPase-RhoA signaling pathway is considered a candidate translational strategy to improve functional outcome after spinal cord injury (SCI) in humans. Pooling preclinical evidence by orthodox meta-analysis is confounded by missing data (publication bias). To conduct a systematic review and meta-analysis of RhoA/Rho-associated coiled-coil containing protein kinase (ROCK) blocking approaches to (1) analyze the impact of bias that may lead to inflated effect sizes and (2) determine the normalized effect size of functional locomotor recovery after experimental thoracic SCI. We conducted a systematic search of PubMed, EMBASE, and Web of Science and hand searched related references. Studies were selected if they reported the effect of RhoA/ROCK inhibitors (C3-exoenzmye, fasudil, Y-27632, ibuprofen, siRhoA, and p21) in experimental spinal cord hemisection, contusion, or transection on locomotor recovery measured by the Basso, Beattie, and Bresnahan score or the Basso Mouse Scale for Locomotion. Two investigators independently assessed the identified studies. Details of individual study characteristics from each publication were extracted and effect sizes pooled using a random effects model. We assessed risk for bias using a 9-point-item quality checklist and calculated publication bias with Egger regression and the trim and fill method. A stratified meta-analysis was used to assess the impact of study characteristics on locomotor recovery. Thirty studies (725 animals) were identified. RhoA/ROCK inhibition was found to improve locomotor outcome by 21% (95% CI, 16.0-26.6). Assessment of publication bias by the trim and fill method suggested that 30% of experiments remain unpublished. Inclusion of these theoretical missing studies suggested a 27% overestimation of efficacy, reducing the overall efficacy to a 15% improvement in locomotor recovery. Low study quality was associated with larger estimates of neurobehavioral outcome. Taking into account

  1. Separating relational from item load effects in paired recognition: temporoparietal and middle frontal gyral activity with increased associates, but not items during encoding and retention.

    Science.gov (United States)

    Phillips, Steven; Niki, Kazuhisa

    2002-10-01

    Working memory is affected by items stored and the relations between them. However, separating these factors has been difficult, because increased items usually accompany increased associations/relations. Hence, some have argued, relational effects are reducible to item effects. We overcome this problem by manipulating index length: the fewest number of item positions at which there is a unique item, or tuple of items (if length >1), for every instance in the relational (memory) set. Longer indexes imply greater similarity (number of shared items) between instances and higher load on encoding processes. Subjects were given lists of study pairs and asked to make a recognition judgement. The number of unique items and index length in the three list conditions were: (1) AB, CD: four/one; (2) AB, CD, EF: six/one; and (3) AB, AD, CB: four/two, respectively. Japanese letters were used in Experiments 1 (kanji-ideograms) and 2 (hiragana-phonograms); numbers in Experiment 3; and shapes generated from Fourier descriptors in Experiment 4. Across all materials, right dominant temporoparietal and middle frontal gyral activity was found with increased index length, but not items during study. In Experiment 5, a longer delay was used to isolate retention effects in the absence of visual stimuli. Increased left hemispheric activity was observed in the precuneus, middle frontal gyrus, and superior temporal gyrus with increased index length for the delay period. These results show that relational load is not reducible to item load.

  2. Specificity and overlap of attention and memory biases in depression.

    Science.gov (United States)

    Marchetti, Igor; Everaert, Jonas; Dainer-Best, Justin; Loeys, Tom; Beevers, Christopher G; Koster, Ernst H W

    2018-01-01

    Attentional and memory biases are viewed as crucial cognitive processes underlying symptoms of depression. However, it is still unclear whether these two biases are uniquely related to depression or whether they show substantial overlap. We investigated the degree of specificity and overlap of attentional and memory biases for depressotypic stimuli in relation to depression and anxiety by means of meta-analytic commonality analysis. By including four published studies, we considered a pool of 463 healthy and subclinically depressed individuals, different experimental paradigms, and different psychological measures. Memory bias is reliably and strongly related to depression and, specifically, to symptoms of negative mood, worthlessness, feelings of failure, and pessimism. Memory bias for negative information was minimally related to anxiety. Moreover, neither attentional bias nor the overlap between attentional and memory biases were significantly related to depression. Limitations include cross-sectional nature of the study. Our study showed that, across different paradigms and psychological measures, memory bias (and not attentional bias) represents a primary mechanism in depression. Copyright © 2017 Elsevier B.V. All rights reserved.

  3. Analysis test of understanding of vectors with the three-parameter logistic model of item response theory and item response curves technique

    Directory of Open Access Journals (Sweden)

    Suttida Rakkapao

    2016-10-01

    Full Text Available This study investigated the multiple-choice test of understanding of vectors (TUV, by applying item response theory (IRT. The difficulty, discriminatory, and guessing parameters of the TUV items were fit with the three-parameter logistic model of IRT, using the parscale program. The TUV ability is an ability parameter, here estimated assuming unidimensionality and local independence. Moreover, all distractors of the TUV were analyzed from item response curves (IRC that represent simplified IRT. Data were gathered on 2392 science and engineering freshmen, from three universities in Thailand. The results revealed IRT analysis to be useful in assessing the test since its item parameters are independent of the ability parameters. The IRT framework reveals item-level information, and indicates appropriate ability ranges for the test. Moreover, the IRC analysis can be used to assess the effectiveness of the test’s distractors. Both IRT and IRC approaches reveal test characteristics beyond those revealed by the classical analysis methods of tests. Test developers can apply these methods to diagnose and evaluate the features of items at various ability levels of test takers.

  4. Analysis test of understanding of vectors with the three-parameter logistic model of item response theory and item response curves technique

    Science.gov (United States)

    Rakkapao, Suttida; Prasitpong, Singha; Arayathanitkul, Kwan

    2016-12-01

    This study investigated the multiple-choice test of understanding of vectors (TUV), by applying item response theory (IRT). The difficulty, discriminatory, and guessing parameters of the TUV items were fit with the three-parameter logistic model of IRT, using the parscale program. The TUV ability is an ability parameter, here estimated assuming unidimensionality and local independence. Moreover, all distractors of the TUV were analyzed from item response curves (IRC) that represent simplified IRT. Data were gathered on 2392 science and engineering freshmen, from three universities in Thailand. The results revealed IRT analysis to be useful in assessing the test since its item parameters are independent of the ability parameters. The IRT framework reveals item-level information, and indicates appropriate ability ranges for the test. Moreover, the IRC analysis can be used to assess the effectiveness of the test's distractors. Both IRT and IRC approaches reveal test characteristics beyond those revealed by the classical analysis methods of tests. Test developers can apply these methods to diagnose and evaluate the features of items at various ability levels of test takers.

  5. Brief Sensation Seeking Scale: Latent structure of 8-item and 4-item versions in Peruvian adolescents.

    Science.gov (United States)

    Merino-Soto, Cesar; Salas Blas, Edwin

    2018-01-01

    This research intended to validate two brief scales of sensations seeking with Peruvian adolescents: the eight item scale (BSSS8; Hoyle, Stephenson, Palmgreen, Lorch, y Donohew, 2002) and the four item scale (BSSS4; Stephenson, Hoyle, Slater, y Palmgreen, 2003). Questionnaires were administered to 618 voluntary participants, with an average age of 13.6 years, from different levels of high school, state and private school in a district in the south of Lima. It analyzed the internal structure of both short versions using three models: a) unidimensional (M1), b) oblique or related dimensions (M2), and c) the bifactor model (M3). Results show that both instruments have a single dimension which best represents the variability of the items; a fact that can be explained both by the complexity of the concept and by the small number of items representing each factor, which is more noticeable in the BSSS4. Reliability is within levels found by previous studies: alpha: .745 = BSSS8 and BSSS4 =. 643; omega coefficient: .747 in BSSS8 and .651 in BSSS4. These are considered suitable for the type of instruments studied. Based on the correlation between the two instruments, it was found that there are satisfactory levels of equivalence between the BSSS8 and BSSS4. However, it is recommended that the BSSS4 is mainly used for research and for the purpose of describing populations.

  6. Effects of statistical models and items difficulties on making trait-level inferences: A simulation study

    Directory of Open Access Journals (Sweden)

    Nelson Hauck Filho

    2014-12-01

    Full Text Available Researchers dealing with the task of estimating locations of individuals on continuous latent variables may rely on several statistical models described in the literature. However, weighting costs and benefits of using one specific model over alternative models depends on empirical information that is not always clearly available. Therefore, the aim of this simulation study was to compare the performance of seven popular statistical models in providing adequate latent trait estimates in conditions of items difficulties targeted at the sample mean or at the tails of the latent trait distribution. Results suggested an overall tendency of models to provide more accurate estimates of true latent scores when using items targeted at the sample mean of the latent trait distribution. Rating Scale Model, Graded Response Model, and Weighted Least Squares Mean- and Variance-adjusted Confirmatory Factor Analysis yielded the most reliable latent trait estimates, even when applied to inadequate items for the sample distribution of the latent variable. These findings have important implications concerning some popular methodological practices in Psychology and related areas.

  7. Publication bias in dermatology systematic reviews and meta-analyses.

    Science.gov (United States)

    Atakpo, Paul; Vassar, Matt

    2016-05-01

    Systematic reviews and meta-analyses in dermatology provide high-level evidence for clinicians and policy makers that influence clinical decision making and treatment guidelines. One methodological problem with systematic reviews is the under representation of unpublished studies. This problem is due in part to publication bias. Omission of statistically non-significant data from meta-analyses may result in overestimation of treatment effect sizes which may lead to clinical consequences. Our goal was to assess whether systematic reviewers in dermatology evaluate and report publication bias. Further, we wanted to conduct our own evaluation of publication bias on meta-analyses that failed to do so. Our study considered systematic reviews and meta-analyses from ten dermatology journals from 2006 to 2016. A PubMed search was conducted, and all full-text articles that met our inclusion criteria were retrieved and coded by the primary author. 293 articles were included in our analysis. Additionally, we formally evaluated publication bias in meta-analyses that failed to do so using trim and fill and cumulative meta-analysis by precision methods. Publication bias was mentioned in 107 articles (36.5%) and was formally evaluated in 64 articles (21.8%). Visual inspection of a funnel plot was the most common method of evaluating publication bias. Publication bias was present in 45 articles (15.3%), not present in 57 articles (19.5%) and not determined in 191 articles (65.2%). Using the trim and fill method, 7 meta-analyses (33.33%) showed evidence of publication bias. Although the trim and fill method only found evidence of publication bias in 7 meta-analyses, the cumulative meta-analysis by precision method found evidence of publication bias in 15 meta-analyses (71.4%). Many of the reviews in our study did not mention or evaluate publication bias. Further, of the 42 articles that stated following PRISMA reporting guidelines, 19 (45.2%) evaluated for publication bias. In

  8. Simulating publication bias

    DEFF Research Database (Denmark)

    Paldam, Martin

    is censoring: selection by the size of estimate; SR3 selects the optimal combination of fit and size; and SR4 selects the first satisficing result. The last four SRs are steered by priors and result in bias. The MST and the FAT-PET have been developed for detection and correction of such bias. The simulations......Economic research typically runs J regressions for each selected for publication – it is often selected as the ‘best’ of the regressions. The paper examines five possible meanings of the word ‘best’: SR0 is ideal selection with no bias; SR1 is polishing: selection by statistical fit; SR2...... are made by data variation, while the model is the same. It appears that SR0 generates narrow funnels much at odds with observed funnels, while the other four funnels look more realistic. SR1 to SR4 give the mean a substantial bias that confirms the prior causing the bias. The FAT-PET MRA works well...

  9. The Dif Identification in Constructed Response Items Using Partial Credit Model

    Directory of Open Access Journals (Sweden)

    Heri Retnawati

    2017-10-01

    Full Text Available The study was to identify the load, the type and the significance of differential item functioning (DIF in constructed response item using the partial credit model (PCM. The data in the study were the students’ instruments and the students’ responses toward the PISA-like test items that had been completed by 386 ninth grade students and 460 tenth grade students who had been about 15 years old in the Province of Yogyakarta Special Region in Indonesia. The analysis toward the item characteristics through the student categorization based on their class was conducted toward the PCM using CONQUEST software. Furthermore, by applying these items characteristics, the researcher draw the category response function (CRF graphic in order to identify whether the type of DIF content had been in uniform or non-uniform. The significance of DIF was identified by comparing the discrepancy between the difficulty level parameter and the error in the CONQUEST output results. The results of the analysis showed that from 18 items that had been analyzed there were 4 items which had not been identified load DIF, there were 5 items that had been identified containing DIF but not statistically significant and there were 9 items that had been identified containing DIF significantly. The causes of items containing DIF were discussed.

  10. 17 CFR 260.7a-16 - Inclusion of items, differentiation between items and answers, omission of instructions.

    Science.gov (United States)

    2010-04-01

    ... 17 Commodity and Securities Exchanges 3 2010-04-01 2010-04-01 false Inclusion of items, differentiation between items and answers, omission of instructions. 260.7a-16 Section 260.7a-16 Commodity and... INDENTURE ACT OF 1939 Formal Requirements § 260.7a-16 Inclusion of items, differentiation between items and...

  11. Comparison of Alternate and Original Items on the Montreal Cognitive Assessment.

    Science.gov (United States)

    Lebedeva, Elena; Huang, Mei; Koski, Lisa

    2016-03-01

    The Montreal Cognitive Assessment (MoCA) is a screening tool for mild cognitive impairment (MCI) in elderly individuals. We hypothesized that measurement error when using the new alternate MoCA versions to monitor change over time could be related to the use of items that are not of comparable difficulty to their corresponding originals of similar content. The objective of this study was to compare the difficulty of the alternate MoCA items to the original ones. Five selected items from alternate versions of the MoCA were included with items from the original MoCA administered adaptively to geriatric outpatients (N = 78). Rasch analysis was used to estimate the difficulty level of the items. None of the five items from the alternate versions matched the difficulty level of their corresponding original items. This study demonstrates the potential benefits of a Rasch analysis-based approach for selecting items during the process of development of parallel forms. The results suggest that better match of the items from different MoCA forms by their difficulty would result in higher sensitivity to changes in cognitive function over time.

  12. Accuracy and bias of ICT self-efficacy: an empirical study into students' over- and underestimation of their ICT competences

    NARCIS (Netherlands)

    Aesaert, K.; Voogt, J.; Kuiper, E.; van Braak, J.

    2017-01-01

    Most studies on the assessment of ICT competences use measures of ICT self-efficacy. These studies are often accused that they suffer from self-reported bias, i.e. students can over- and/or underestimate their ICT competences. As such, taking bias and accuracy of ICT self-efficacy into account,

  13. Minimum Bias Trigger in ATLAS

    International Nuclear Information System (INIS)

    Kwee, Regina

    2010-01-01

    Since the restart of the LHC in November 2009, ATLAS has collected inelastic pp collisions to perform first measurements on charged particle densities. These measurements will help to constrain various models describing phenomenologically soft parton interactions. Understanding the trigger efficiencies for different event types are therefore crucial to minimize any possible bias in the event selection. ATLAS uses two main minimum bias triggers, featuring complementary detector components and trigger levels. While a hardware based first trigger level situated in the forward regions with 2.2 < |η| < 3.8 has been proven to select pp-collisions very efficiently, the Inner Detector based minimum bias trigger uses a random seed on filled bunches and central tracking detectors for the event selection. Both triggers were essential for the analysis of kinematic spectra of charged particles. Their performance and trigger efficiency measurements as well as studies on possible bias sources will be presented. We also highlight the advantage of these triggers for particle correlation analyses. (author)

  14. PENGEMBANGAN TES BERPIKIR KRITIS DENGAN PENDEKATAN ITEM RESPONSE THEORY

    Directory of Open Access Journals (Sweden)

    Fajrianthi Fajrianthi

    2016-06-01

    Full Text Available Penelitian ini bertujuan untuk menghasilkan sebuah alat ukur (tes berpikir kritis yang valid dan reliabel untuk digunakan, baik dalam lingkup pendidikan maupun kerja di Indonesia. Tahapan penelitian dilakukan berdasarkan tahap pengembangan tes menurut Hambleton dan Jones (1993. Kisi-kisi dan pembuatan butir didasarkan pada konsep dalam tes Watson-Glaser Critical Thinking Appraisal (WGCTA. Pada WGCTA, berpikir kritis terdiri dari lima dimensi yaitu Inference, Recognition Assumption, Deduction, Interpretation dan Evaluation of arguments. Uji coba tes dilakukan pada 1.453 peserta tes seleksi karyawan di Surabaya, Gresik, Tuban, Bojonegoro, Rembang. Data dikotomi dianalisis dengan menggunakan model IRT dengan dua parameter yaitu daya beda dan tingkat kesulitan butir. Analisis dilakukan dengan menggunakan program statistik Mplus versi 6.11 Sebelum melakukan analisis dengan IRT, dilakukan pengujian asumsi yaitu uji unidimensionalitas, independensi lokal dan Item Characteristic Curve (ICC. Hasil analisis terhadap 68 butir menghasilkan 15 butir dengan daya beda yang cukup baik dan tingkat kesulitan butir yang berkisar antara –4 sampai dengan 2.448. Sedikitnya jumlah butir yang berkualitas baik disebabkan oleh kelemahan dalam menentukan subject matter experts di bidang berpikir kritis dan pemilihan metode skoring. Kata kunci: Pengembangan tes, berpikir kritis, item response theory   DEVELOPING CRITICAL THINKING TEST UTILISING ITEM RESPONSE THEORY Abstract The present study was aimed to develop a valid and reliable instrument in assesing critical thinking which can be implemented both in educational and work settings in Indonesia. Following the Hambleton and Jones’s (1993 procedures on test development, the study developed the instrument by employing the concept of critical thinking from Watson-Glaser Critical Thinking Appraisal (WGCTA. The study included five dimensions of critical thinking as adopted from the WGCTA: Inference, Recognition

  15. The development of a single-item Food Choice Questionnaire

    NARCIS (Netherlands)

    Onwezen, M.C.; Reinders, M.J.; Verain, M.C.D.; Snoek, H.M.

    2019-01-01

    Based on the multi-item Food Choice Questionnaire (FCQ) originally developed by Steptoe and colleagues (1995), the current study developed a single-item FCQ that provides an acceptable balance between practical needs and psychometric concerns. Studies 1 (N = 1851) and 2 (2a (N = 3290), 2b (N =

  16. The Dif Identification in Constructed Response Items Using Partial Credit Model

    OpenAIRE

    Heri Retnawati

    2017-01-01

    The study was to identify the load, the type and the significance of differential item functioning (DIF) in constructed response item using the partial credit model (PCM). The data in the study were the students’ instruments and the students’ responses toward the PISA-like test items that had been completed by 386 ninth grade students and 460 tenth grade students who had been about 15 years old in the Province of Yogyakarta Special Region in Indonesia. The analysis toward the item characteris...

  17. Biases in Visual, Auditory, and Audiovisual Perception of Space

    Science.gov (United States)

    Odegaard, Brian; Wozny, David R.; Shams, Ladan

    2015-01-01

    Localization of objects and events in the environment is critical for survival, as many perceptual and motor tasks rely on estimation of spatial location. Therefore, it seems reasonable to assume that spatial localizations should generally be accurate. Curiously, some previous studies have reported biases in visual and auditory localizations, but these studies have used small sample sizes and the results have been mixed. Therefore, it is not clear (1) if the reported biases in localization responses are real (or due to outliers, sampling bias, or other factors), and (2) whether these putative biases reflect a bias in sensory representations of space or a priori expectations (which may be due to the experimental setup, instructions, or distribution of stimuli). Here, to address these questions, a dataset of unprecedented size (obtained from 384 observers) was analyzed to examine presence, direction, and magnitude of sensory biases, and quantitative computational modeling was used to probe the underlying mechanism(s) driving these effects. Data revealed that, on average, observers were biased towards the center when localizing visual stimuli, and biased towards the periphery when localizing auditory stimuli. Moreover, quantitative analysis using a Bayesian Causal Inference framework suggests that while pre-existing spatial biases for central locations exert some influence, biases in the sensory representations of both visual and auditory space are necessary to fully explain the behavioral data. How are these opposing visual and auditory biases reconciled in conditions in which both auditory and visual stimuli are produced by a single event? Potentially, the bias in one modality could dominate, or the biases could interact/cancel out. The data revealed that when integration occurred in these conditions, the visual bias dominated, but the magnitude of this bias was reduced compared to unisensory conditions. Therefore, multisensory integration not only improves the

  18. Biases in Visual, Auditory, and Audiovisual Perception of Space.

    Directory of Open Access Journals (Sweden)

    Brian Odegaard

    2015-12-01

    Full Text Available Localization of objects and events in the environment is critical for survival, as many perceptual and motor tasks rely on estimation of spatial location. Therefore, it seems reasonable to assume that spatial localizations should generally be accurate. Curiously, some previous studies have reported biases in visual and auditory localizations, but these studies have used small sample sizes and the results have been mixed. Therefore, it is not clear (1 if the reported biases in localization responses are real (or due to outliers, sampling bias, or other factors, and (2 whether these putative biases reflect a bias in sensory representations of space or a priori expectations (which may be due to the experimental setup, instructions, or distribution of stimuli. Here, to address these questions, a dataset of unprecedented size (obtained from 384 observers was analyzed to examine presence, direction, and magnitude of sensory biases, and quantitative computational modeling was used to probe the underlying mechanism(s driving these effects. Data revealed that, on average, observers were biased towards the center when localizing visual stimuli, and biased towards the periphery when localizing auditory stimuli. Moreover, quantitative analysis using a Bayesian Causal Inference framework suggests that while pre-existing spatial biases for central locations exert some influence, biases in the sensory representations of both visual and auditory space are necessary to fully explain the behavioral data. How are these opposing visual and auditory biases reconciled in conditions in which both auditory and visual stimuli are produced by a single event? Potentially, the bias in one modality could dominate, or the biases could interact/cancel out. The data revealed that when integration occurred in these conditions, the visual bias dominated, but the magnitude of this bias was reduced compared to unisensory conditions. Therefore, multisensory integration not only

  19. Does neurocognitive function affect cognitive bias toward an emotional stimulus? Association between general attentional ability and attentional bias toward threat

    Directory of Open Access Journals (Sweden)

    Yuko eHakamata

    2014-08-01

    Full Text Available Background: Although poorer cognitive performance has been found to be associated with anxiety, it remains unclear whether neurocognitive function affects biased cognitive processing toward emotional information. We investigated whether general cognitive function evaluated with a standard neuropsychological test predicts biased cognition, focusing on attentional bias toward threat.Methods: One hundred and five healthy young adults completed a dot-probe task measuring attentional bias and the Repeatable Battery for the Assessment of Neuropsychological Status (RBANS measuring general cognitive function, which consists of five domains: immediate memory, visuospatial/constructional, language, attention, and delayed memory. Stepwise multiple regression analysis was performed to examine the relationships between attentional bias and cognitive function. Results: The attentional domain was the best predictor of attentional bias toward threat (β = -0.26, p = 0.006. Within the attentional domain, digit symbol coding was negatively correlated with attentional bias (r = -0.28, p = 0.005.Conclusions: The present study provides the first evidence that general attentional ability, which was assessed with a standard neuropsychological test, affects attentional bias toward threatening information. Individual cognitive profiles might be important for the measurement and modification of cognitive biases.

  20. Understanding and quantifying cognitive complexity level in mathematical problem solving items

    Directory of Open Access Journals (Sweden)

    SUSAN E. EMBRETSON

    2008-09-01

    Full Text Available The linear logistic test model (LLTM; Fischer, 1973 has been applied to a wide variety of new tests. When the LLTM application involves item complexity variables that are both theoretically interesting and empirically supported, several advantages can result. These advantages include elaborating construct validity at the item level, defining variables for test design, predicting parameters of new items, item banking by sources of complexity and providing a basis for item design and item generation. However, despite the many advantages of applying LLTM to test items, it has been applied less often to understand the sources of complexity for large-scale operational test items. Instead, previously calibrated item parameters are modeled using regression techniques because raw item response data often cannot be made available. In the current study, both LLTM and regression modeling are applied to mathematical problem solving items from a widely used test. The findings from the two methods are compared and contrasted for their implications for continued development of ability and achievement tests based on mathematical problem solving items.

  1. Media bias under direct and indirect government control: when is the bias smaller?

    OpenAIRE

    Abhra Roy

    2015-01-01

    We present an analytical framework to compare media bias under direct and indirect government control. In this context, we show that direct control can lead to a smaller bias and higher welfare than indirect control. We further show that the size of the advertising market affects media bias only under direct control. Media bias, under indirect control, is not affected by the size of the advertising market.

  2. Editorial Changes and Item Performance: Implications for Calibration and Pretesting

    Directory of Open Access Journals (Sweden)

    Heather Stoffel

    2014-11-01

    Full Text Available Previous research on the impact of text and formatting changes on test-item performance has produced mixed results. This matter is important because it is generally acknowledged that any change to an item requires that it be recalibrated. The present study investigated the effects of seven classes of stylistic changes on item difficulty, discrimination, and response time for a subset of 65 items that make up a standardized test for physician licensure completed by 31,918 examinees in 2012. One of two versions of each item (original or revised was randomly assigned to examinees such that each examinee saw only two experimental items, with each item being administered to approximately 480 examinees. The stylistic changes had little or no effect on item difficulty or discrimination; however, one class of edits -' changing an item from an open lead-in (incomplete statement to a closed lead-in (direct question -' did result in slightly longer response times. Data for nonnative speakers of English were analyzed separately with nearly identical results. These findings have implications for the conventional practice of repretesting (or recalibrating items that have been subjected to minor editorial changes.

  3. Assessment of cognitive bias in decision-making and leadership styles among critical care nurses: a mixed methods study.

    Science.gov (United States)

    Lean Keng, Soon; AlQudah, Hani Nawaf Ibrahim

    2017-02-01

    To raise awareness of critical care nurses' cognitive bias in decision-making, its relationship with leadership styles and its impact on care delivery. The relationship between critical care nurses' decision-making and leadership styles in hospitals has been widely studied, but the influence of cognitive bias on decision-making and leadership styles in critical care environments remains poorly understood, particularly in Jordan. Two-phase mixed methods sequential explanatory design and grounded theory. critical care unit, Prince Hamza Hospital, Jordan. Participant sampling: convenience sampling Phase 1 (quantitative, n = 96), purposive sampling Phase 2 (qualitative, n = 20). Pilot tested quantitative survey of 96 critical care nurses in 2012. Qualitative in-depth interviews, informed by quantitative results, with 20 critical care nurses in 2013. Descriptive and simple linear regression quantitative data analyses. Thematic (constant comparative) qualitative data analysis. Quantitative - correlations found between rationality and cognitive bias, rationality and task-oriented leadership styles, cognitive bias and democratic communication styles and cognitive bias and task-oriented leadership styles. Qualitative - 'being competent', 'organizational structures', 'feeling self-confident' and 'being supported' in the work environment identified as key factors influencing critical care nurses' cognitive bias in decision-making and leadership styles. Two-way impact (strengthening and weakening) of cognitive bias in decision-making and leadership styles on critical care nurses' practice performance. There is a need to heighten critical care nurses' consciousness of cognitive bias in decision-making and leadership styles and its impact and to develop organization-level strategies to increase non-biased decision-making. © 2016 John Wiley & Sons Ltd.

  4. Maintenance of item and order information in verbal working memory.

    Science.gov (United States)

    Camos, Valérie; Lagner, Prune; Loaiza, Vanessa M

    2017-09-01

    Although verbal recall of item and order information is well-researched in short-term memory paradigms, there is relatively little research concerning item and order recall from working memory. The following study examined whether manipulating the opportunity for attentional refreshing and articulatory rehearsal in a complex span task differently affected the recall of item- and order-specific information of the memoranda. Five experiments varied the opportunity for articulatory rehearsal and attentional refreshing in a complex span task, but the type of recall was manipulated between experiments (item and order, order only, and item only recall). The results showed that impairing attentional refreshing and articulatory rehearsal similarly affected recall regardless of whether the scoring procedure (Experiments 1 and 4) or recall requirements (Experiments 2, 3, and 5) reflected item- or order-specific recall. This implies that both mechanisms sustain the maintenance of item and order information, and suggests that the common cumulative functioning of these two mechanisms to maintain items could be at the root of order maintenance.

  5. Large-scale galaxy bias

    Science.gov (United States)

    Desjacques, Vincent; Jeong, Donghui; Schmidt, Fabian

    2018-02-01

    This review presents a comprehensive overview of galaxy bias, that is, the statistical relation between the distribution of galaxies and matter. We focus on large scales where cosmic density fields are quasi-linear. On these scales, the clustering of galaxies can be described by a perturbative bias expansion, and the complicated physics of galaxy formation is absorbed by a finite set of coefficients of the expansion, called bias parameters. The review begins with a detailed derivation of this very important result, which forms the basis of the rigorous perturbative description of galaxy clustering, under the assumptions of General Relativity and Gaussian, adiabatic initial conditions. Key components of the bias expansion are all leading local gravitational observables, which include the matter density but also tidal fields and their time derivatives. We hence expand the definition of local bias to encompass all these contributions. This derivation is followed by a presentation of the peak-background split in its general form, which elucidates the physical meaning of the bias parameters, and a detailed description of the connection between bias parameters and galaxy statistics. We then review the excursion-set formalism and peak theory which provide predictions for the values of the bias parameters. In the remainder of the review, we consider the generalizations of galaxy bias required in the presence of various types of cosmological physics that go beyond pressureless matter with adiabatic, Gaussian initial conditions: primordial non-Gaussianity, massive neutrinos, baryon-CDM isocurvature perturbations, dark energy, and modified gravity. Finally, we discuss how the description of galaxy bias in the galaxies' rest frame is related to clustering statistics measured from the observed angular positions and redshifts in actual galaxy catalogs.

  6. An Efficient Way to Detect Poststroke Depression by Subsequent Administration of a 9-Item and a 2-Item Patient Health Questionnaire

    NARCIS (Netherlands)

    de Man-van Ginkel, Janneke M.; Hafsteinsdottir, Thora; Lindeman, Eline; Burger, Huibert; Grobbee, Diederick; Schuurmans, Marieke

    Background and Purpose-The early detection of poststroke depression is essential for optimizing recovery after stroke. A prospective study was conducted to investigate the diagnostic value of the 9-item and the 2-item Patient Health Questionnaire (PHQ-9, PHQ-2). Methods-One hundred seventy-one

  7. Evaluating an Automated Number Series Item Generator Using Linear Logistic Test Models

    Directory of Open Access Journals (Sweden)

    Bao Sheng Loe

    2018-04-01

    Full Text Available This study investigates the item properties of a newly developed Automatic Number Series Item Generator (ANSIG. The foundation of the ANSIG is based on five hypothesised cognitive operators. Thirteen item models were developed using the numGen R package and eleven were evaluated in this study. The 16-item ICAR (International Cognitive Ability Resource1 short form ability test was used to evaluate construct validity. The Rasch Model and two Linear Logistic Test Model(s (LLTM were employed to estimate and predict the item parameters. Results indicate that a single factor determines the performance on tests composed of items generated by the ANSIG. Under the LLTM approach, all the cognitive operators were significant predictors of item difficulty. Moderate to high correlations were evident between the number series items and the ICAR test scores, with high correlation found for the ICAR Letter-Numeric-Series type items, suggesting adequate nomothetic span. Extended cognitive research is, nevertheless, essential for the automatic generation of an item pool with predictable psychometric properties.

  8. Item Response Theory Models for Wording Effects in Mixed-Format Scales

    Science.gov (United States)

    Wang, Wen-Chung; Chen, Hui-Fang; Jin, Kuan-Yu

    2015-01-01

    Many scales contain both positively and negatively worded items. Reverse recoding of negatively worded items might not be enough for them to function as positively worded items do. In this study, we commented on the drawbacks of existing approaches to wording effect in mixed-format scales and used bi-factor item response theory (IRT) models to…

  9. Approximation Preserving Reductions among Item Pricing Problems

    Science.gov (United States)

    Hamane, Ryoso; Itoh, Toshiya; Tomita, Kouhei

    When a store sells items to customers, the store wishes to determine the prices of the items to maximize its profit. Intuitively, if the store sells the items with low (resp. high) prices, the customers buy more (resp. less) items, which provides less profit to the store. So it would be hard for the store to decide the prices of items. Assume that the store has a set V of n items and there is a set E of m customers who wish to buy those items, and also assume that each item i ∈ V has the production cost di and each customer ej ∈ E has the valuation vj on the bundle ej ⊆ V of items. When the store sells an item i ∈ V at the price ri, the profit for the item i is pi = ri - di. The goal of the store is to decide the price of each item to maximize its total profit. We refer to this maximization problem as the item pricing problem. In most of the previous works, the item pricing problem was considered under the assumption that pi ≥ 0 for each i ∈ V, however, Balcan, et al. [In Proc. of WINE, LNCS 4858, 2007] introduced the notion of “loss-leader, ” and showed that the seller can get more total profit in the case that pi < 0 is allowed than in the case that pi < 0 is not allowed. In this paper, we derive approximation preserving reductions among several item pricing problems and show that all of them have algorithms with good approximation ratio.

  10. Which Statistic Should Be Used to Detect Item Preknowledge When the Set of Compromised Items Is Known?

    Science.gov (United States)

    Sinharay, Sandip

    2017-09-01

    Benefiting from item preknowledge is a major type of fraudulent behavior during educational assessments. Belov suggested the posterior shift statistic for detection of item preknowledge and showed its performance to be better on average than that of seven other statistics for detection of item preknowledge for a known set of compromised items. Sinharay suggested a statistic based on the likelihood ratio test for detection of item preknowledge; the advantage of the statistic is that its null distribution is known. Results from simulated and real data and adaptive and nonadaptive tests are used to demonstrate that the Type I error rate and power of the statistic based on the likelihood ratio test are very similar to those of the posterior shift statistic. Thus, the statistic based on the likelihood ratio test appears promising in detecting item preknowledge when the set of compromised items is known.

  11. Are most samples of animals systematically biased? Consistent individual trait differences bias samples despite random sampling.

    Science.gov (United States)

    Biro, Peter A

    2013-02-01

    Sampling animals from the wild for study is something nearly every biologist has done, but despite our best efforts to obtain random samples of animals, 'hidden' trait biases may still exist. For example, consistent behavioral traits can affect trappability/catchability, independent of obvious factors such as size and gender, and these traits are often correlated with other repeatable physiological and/or life history traits. If so, systematic sampling bias may exist for any of these traits. The extent to which this is a problem, of course, depends on the magnitude of bias, which is presently unknown because the underlying trait distributions in populations are usually unknown, or unknowable. Indeed, our present knowledge about sampling bias comes from samples (not complete population censuses), which can possess bias to begin with. I had the unique opportunity to create naturalized populations of fish by seeding each of four small fishless lakes with equal densities of slow-, intermediate-, and fast-growing fish. Using sampling methods that are not size-selective, I observed that fast-growing fish were up to two-times more likely to be sampled than slower-growing fish. This indicates substantial and systematic bias with respect to an important life history trait (growth rate). If correlations between behavioral, physiological and life-history traits are as widespread as the literature suggests, then many animal samples may be systematically biased with respect to these traits (e.g., when collecting animals for laboratory use), and affect our inferences about population structure and abundance. I conclude with a discussion on ways to minimize sampling bias for particular physiological/behavioral/life-history types within animal populations.

  12. Virtual Reality-Based Attention Bias Modification Training for Social Anxiety: A Feasibility and Proof of Concept Study.

    Science.gov (United States)

    Urech, Antoine; Krieger, Tobias; Chesham, Alvin; Mast, Fred W; Berger, Thomas

    2015-01-01

    Attention bias modification (ABM) programs have been considered as a promising new approach for the treatment of various disorders, including social anxiety disorder (SAD). However, previous studies yielded ambiguous results regarding the efficacy of ABM in SAD. The present proof-of-concept study investigates the feasibility of a newly developed virtual reality (VR)-based dot-probe training paradigm. It was designed to facilitate attentional disengagement from threatening stimuli in socially anxious individuals (N = 15). The following outcomes were examined: (a) self-reports of enjoyment, motivation, flow, and presence; (b) attentional bias for social stimuli; and (c) social anxiety symptoms. Results showed that ABM training is associated with high scores in enjoyment, motivation, flow, and presence. Furthermore, significant improvements in terms of attention bias and social anxiety symptoms were observed from pre- to follow-up assessment. The study suggests that VR is a feasible and presumably a promising new medium for ABM trainings. Controlled studies will need to be carried out.

  13. Virtual Reality-Based Attention Bias Modification Training for Social Anxiety: A Feasibility and Proof of Concept Study

    Directory of Open Access Journals (Sweden)

    Antoine eUrech

    2015-10-01

    Full Text Available Attention bias modification (ABM programs have been considered as a promising new approach for the treatment of various disorders, including social anxiety disorder (SAD. However, previous studies yielded ambiguous results regarding the efficacy of ABM in SAD. The present proof-of-concept study investigates the feasibility of a newly developed virtual reality (VR-based dot-probe training paradigm. It was designed to facilitate attentional disengagement from threatening stimuli in socially anxious individuals (N=15. The following outcomes were examined: (a self-reports of enjoyment, motivation, flow and presence, (b attentional bias for social stimuli, and (c social anxiety symptoms. Results showed that ABM training is associated with high scores in enjoyment, motivation, flow and presence. Furthermore, significant improvements in terms of attention bias and social anxiety symptoms were observed from pre- to follow-up assessment. The study suggests that VR is a feasible and presumably a promising new medium for ABM trainings. Controlled studies will need to be carried out.

  14. Quality assessment of observational studies in a drug-safety systematic review, comparison of two tools: the Newcastle–Ottawa Scale and the RTI item bank

    Directory of Open Access Journals (Sweden)

    Margulis AV

    2014-10-01

    Full Text Available Andrea V Margulis,1 Manel Pladevall,1 Nuria Riera-Guardia,1 Cristina Varas-Lorenzo,1 Lorna Hazell,2,3 Nancy D Berkman,4 Meera Viswanathan,4 Susana Perez-Gutthann,1 1RTI Health Solutions, Barcelona, Spain; 2Drug Safety Research Unit, Southampton, UK; 3Associate Department of the School of Pharmacy and Biomedical Sciences, University of Portsmouth, Portsmouth, UK; 4RTI International, Research Triangle Park, NC, USA Background: The study objective was to compare the Newcastle–Ottawa Scale (NOS and the RTI item bank (RTI-IB and estimate interrater agreement using the RTI-IB within a systematic review on the cardiovascular safety of glucose-lowering drugs. Methods: We tailored both tools and added four questions to the RTI-IB. Two reviewers assessed the quality of the 44 included studies with both tools, (independently for the RTI-IB and agreed on which responses conveyed low, unclear, or high risk of bias. For each question in the RTI-IB (n=31, the observed interrater agreement was calculated as the percentage of studies given the same bias assessment by both reviewers; chance-adjusted interrater agreement was estimated with the first-order agreement coefficient (AC1 statistic. Results: The NOS required less tailoring and was easier to use than the RTI-IB, but the RTI-IB produced a more thorough assessment. The RTI-IB includes most of the domains measured in the NOS. Median observed interrater agreement for the RTI-IB was 75% (25th percentile [p25] =61%; p75 =89%; median AC1 statistic was 0.64 (p25 =0.51; p75 =0.86. Conclusion: The RTI-IB facilitates a more complete quality assessment than the NOS but is more burdensome. The observed agreement and AC1 statistic in this study were higher than those reported by the RTI-IB's developers. Keywords: systematic review, meta-analysis, quality assessment, AC1

  15. Evaluation of biases present in the cohort multiple randomised controlled trial design: a simulation study

    Directory of Open Access Journals (Sweden)

    Jane Candlish

    2017-01-01

    Full Text Available Abstract Background The cohort multiple randomised controlled trial (cmRCT design provides an opportunity to incorporate the benefits of randomisation within clinical practice; thus reducing costs, integrating electronic healthcare records, and improving external validity. This study aims to address a key concern of the cmRCT design: refusal to treatment is only present in the intervention arm, and this may lead to bias and reduce statistical power. Methods We used simulation studies to assess the effect of this refusal, both random and related to event risk, on bias of the effect estimator and statistical power. A series of simulations were undertaken that represent a cmRCT trial with time-to-event endpoint. Intention-to-treat (ITT, per protocol (PP, and instrumental variable (IV analysis methods, two stage predictor substitution and two stage residual inclusion, were compared for various refusal scenarios. Results We found the IV methods provide a less biased estimator for the causal effect when refusal is present in the intervention arm, with the two stage residual inclusion method performing best with regards to minimum bias and sufficient power. We demonstrate that sample sizes should be adapted based on expected and actual refusal rates in order to be sufficiently powered for IV analysis. Conclusion We recommend running both an IV and ITT analyses in an individually randomised cmRCT as it is expected that the effect size of interest, or the effect we would observe in clinical practice, would lie somewhere between that estimated with ITT and IV analyses. The optimum (in terms of bias and power instrumental variable method was the two stage residual inclusion method. We recommend using adaptive power calculations, updating them as refusal rates are collected in the trial recruitment phase in order to be sufficiently powered for IV analysis.

  16. Validation and psychometric properties of the Somatic and Psychological HEalth REport (SPHERE) in a young Australian-based population sample using non-parametric item response theory.

    Science.gov (United States)

    Couvy-Duchesne, Baptiste; Davenport, Tracey A; Martin, Nicholas G; Wright, Margaret J; Hickie, Ian B

    2017-08-01

    The Somatic and Psychological HEalth REport (SPHERE) is a 34-item self-report questionnaire that assesses symptoms of mental distress and persistent fatigue. As it was developed as a screening instrument for use mainly in primary care-based clinical settings, its validity and psychometric properties have not been studied extensively in population-based samples. We used non-parametric Item Response Theory to assess scale validity and item properties of the SPHERE-34 scales, collected through four waves of the Brisbane Longitudinal Twin Study (N = 1707, mean age = 12, 51% females; N = 1273, mean age = 14, 50% females; N = 1513, mean age = 16, 54% females, N = 1263, mean age = 18, 56% females). We estimated the heritability of the new scores, their genetic correlation, and their predictive ability in a sub-sample (N = 1993) who completed the Composite International Diagnostic Interview. After excluding items most responsible for noise, sex or wave bias, the SPHERE-34 questionnaire was reduced to 21 items (SPHERE-21), comprising a 14-item scale for anxiety-depression and a 10-item scale for chronic fatigue (3 items overlapping). These new scores showed high internal consistency (alpha > 0.78), moderate three months reliability (ICC = 0.47-0.58) and item scalability (Hi > 0.23), and were positively correlated (phenotypic correlations r = 0.57-0.70; rG = 0.77-1.00). Heritability estimates ranged from 0.27 to 0.51. In addition, both scores were associated with later DSM-IV diagnoses of MDD, social anxiety and alcohol dependence (OR in 1.23-1.47). Finally, a post-hoc comparison showed that several psychometric properties of the SPHERE-21 were similar to those of the Beck Depression Inventory. The scales of SPHERE-21 measure valid and comparable constructs across sex and age groups (from 9 to 28 years). SPHERE-21 scores are heritable, genetically correlated and show good predictive ability of mental health in an Australian-based population

  17. Investigation of study items for the patterns of care study in the radiotherapy of laryngeal cancer: preliminary results

    International Nuclear Information System (INIS)

    Chung, Woong Ki; Ahn, Sung Ja; Kim, Il Han

    2003-01-01

    In order to develop the national guide-lines for the standardization of radiotherapy we are planning to establish a web-based, on-line data-base system for laryngeal cancer. As a first step this study was performed to accumulate the basic clinical information of laryngeal cancer and to determine the items needed for the data-base system. We analyzed the clinical data of patients who were treated under the diagnosis of laryngeal cancer from January 1998 through December 1999 in the South-west area of Korea. Eligibility criteria of the patients are as follows: 18 years or older, currently diagnosed with primary epithelial carcinoma of larynx, and no history of previous treatments for another cancers and the other laryngeal diseases. The items were developed and filled out by radiation oncologist who are members of Korean Southwest Radiation Oncology Group. SPSS v10.0 software was used for statistical analysis, Data of forty-five patients were collected. Age distribution of patients ranged from 28 to 88 years (median, 61). Laryngeal cancer occurred predominantly in males (10: t sex ratio). Twenty-eight patients (62%) had primary cancers in the glottis and 17 (38%) in the supraglottis. Most of them were diagnosed pathologically as squamous cell carcinoma (44/45, 98%). Twenty-four of 28 glottic cancer patients (86%) had AJCC (American Joint Committee on Cancer) stage l/ll, but 50% (8/16) had in supraglottic cancer patients (p=0.02). Most patients (89%) had the symptom of hoarseness. Indirect laryngoscopy was done in all patients and direct laryngoscopy was performed in 43 (98%) patients. Twenty-one of 28 (75%) glottic cancer cases and 6 of 17 (35%) supraglottic cancer cases were treated with radiation alone, respectively. The combined treatment of surgery and radiation was used in 5 (18%) glottic and 8 (47%) supraglottic patients. Chemotherapy and radiation was used in 2 (7%) glottic and 3 (18%) supraglottic patients. There was no statistically significant difference in

  18. Item Modeling Concept Based on Multimedia Authoring

    Directory of Open Access Journals (Sweden)

    Janez Stergar

    2008-09-01

    Full Text Available In this paper a modern item design framework for computer based assessment based on Flash authoring environment will be introduced. Question design will be discussed as well as the multimedia authoring environment used for item modeling emphasized. Item type templates are a structured means of collecting and storing item information that can be used to improve the efficiency and security of the innovative item design process. Templates can modernize the item design, enhance and speed up the development process. Along with content creation, multimedia has vast potential for use in innovative testing. The introduced item design template is based on taxonomy of innovative items which have great potential for expanding the content areas and construct coverage of an assessment. The presented item design approach is based on GUI's – one for question design based on implemented item design templates and one for user interaction tracking/retrieval. The concept of user interfaces based on Flash technology will be discussed as well as implementation of the innovative approach of the item design forms with multimedia authoring. Also an innovative method for user interaction storage/retrieval based on PHP extending Flash capabilities in the proposed framework will be introduced.

  19. Revealing the Cosmic Web-dependent Halo Bias

    Science.gov (United States)

    Yang, Xiaohu; Zhang, Youcai; Lu, Tianhuan; Wang, Huiyuan; Shi, Feng; Tweed, Dylan; Li, Shijie; Luo, Wentao; Lu, Yi; Yang, Lei

    2017-10-01

    Halo bias is the one of the key ingredients of the halo models. It was shown at a given redshift to be only dependent, to the first order, on the halo mass. In this study, four types of cosmic web environments—clusters, filaments, sheets, and voids—are defined within a state-of-the-art high-resolution N-body simulation. Within these environments, we use both halo-dark matter cross correlation and halo-halo autocorrelation functions to probe the clustering properties of halos. The nature of the halo bias differs strongly between the four different cosmic web environments described here. With respect to the overall population, halos in clusters have significantly lower biases in the {10}11.0˜ {10}13.5 {h}-1 {M}⊙ mass range. In other environments, however, halos show extremely enhanced biases up to a factor 10 in voids for halos of mass ˜ {10}12.0 {h}-1 {M}⊙ . Such a strong cosmic web environment dependence in the halo bias may play an important role in future cosmological and galaxy formation studies. Within this cosmic web framework, the age dependency of halo bias is found to be only significant in clusters and filaments for relatively small halos ≲ {10}12.5 {h}-1 {M}⊙ .

  20. The Influence of Item Properties on Association-Memory

    Science.gov (United States)

    Madan, Christopher R.; Glaholt, Mackenzie G.; Caplan, Jeremy B.

    2010-01-01

    Word properties like imageability and word frequency improve cued recall of verbal paired-associates. We asked whether these enhancements follow simply from prior effects on item-memory, or also strengthen associations between items. Participants studied word pairs varying in imageability or frequency: pairs were "pure" (high-high, low-low) or…

  1. Differential item functioning of the patient-reported outcomes information system (PROMIS®) pain interference item bank by language (Spanish versus English).

    Science.gov (United States)

    Paz, Sylvia H; Spritzer, Karen L; Reise, Steven P; Hays, Ron D

    2017-06-01

    About 70% of Latinos, 5 years old or older, in the United States speak Spanish at home. Measurement equivalence of the PROMIS ® pain interference (PI) item bank by language of administration (English versus Spanish) has not been evaluated. A sample of 527 adult Spanish-speaking Latinos completed the Spanish version of the 41-item PROMIS ® pain interference item bank. We evaluate dimensionality, monotonicity and local independence of the Spanish-language items. Then we evaluate differential item functioning (DIF) using ordinal logistic regression with item response theory scores estimated from DIF-free "anchor" items. One of the 41 items in the Spanish version of the PROMIS ® PI item bank was identified as having significant uniform DIF. English- and Spanish-speaking subjects with the same level of pain interference responded differently to 1 of the 41 items in the PROMIS ® PI item bank. This item was not retained due to proprietary issues. The original English language item parameters can be used when estimating PROMIS ® PI scores.

  2. Effects of cortisol on the memory bias for emotional words? A study in patients with depression and healthy participants using the Directed Forgetting task.

    Science.gov (United States)

    Kuehl, Linn K; Wolf, Oliver T; Driessen, Martin; Schlosser, Nicole; Fernando, Silvia Carvalho; Wingenfeld, Katja

    2017-09-01

    Mood congruent alterations in information processing such as an impaired memory bias for emotional information and impaired inhibitory functions are prominent features of a major depressive disorder (MDD). Furthermore, in MDD patients hypothalamic-pituitary-adrenal axis dysfunctions are frequently found. Impairing effects of stress or cortisol administration on memory retrieval as well as impairing stress effects on cognitive inhibition are well documented in healthy participants. In MDD patients, no effect of acute cortisol administration on memory retrieval was found. The current study investigated the effect of acute cortisol administration on memory bias in MDD patients (N = 55) and healthy controls (N = 63) using the Directed Forgetting (DF) task with positive, negative and neutral words in a placebo controlled, double blind design. After oral administration of 10 mg hydrocortisone/placebo, the item method of the DF task was conducted. Memory performance was tested with a free recall test. Cortisol was not found to have an effect on the results of the DF task. Interestingly, there was significant impact of valence: both groups showed the highest DF score for positive words and remembered significantly more positive words that were supposed to be remembered and significantly more negative words that were supposed to be forgotten. In general, healthy participants remembered more words than the depressed patients. Still, the depressed patients were able to inhibit intentionally irrelevant information at a comparable level as the healthy controls. These results demonstrate the importance to distinguish in experimental designs between different cognitive domains such as inhibition and memory in our study. Copyright © 2017 Elsevier Ltd. All rights reserved.

  3. Compliance of systematic reviews in veterinary journals with Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) literature search reporting guidelines.

    Science.gov (United States)

    Toews, Lorraine C

    2017-07-01

    Complete, accurate reporting of systematic reviews facilitates assessment of how well reviews have been conducted. The primary objective of this study was to examine compliance of systematic reviews in veterinary journals with Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guidelines for literature search reporting and to examine the completeness, bias, and reproducibility of the searches in these reviews from what was reported. The second objective was to examine reporting of the credentials and contributions of those involved in the search process. A sample of systematic reviews or meta-analyses published in veterinary journals between 2011 and 2015 was obtained by searching PubMed. Reporting in the full text of each review was checked against certain PRISMA checklist items. Over one-third of reviews (37%) did not search the CAB Abstracts database, and 9% of reviews searched only 1 database. Over two-thirds of reviews (65%) did not report any search for grey literature or stated that they excluded grey literature. The majority of reviews (95%) did not report a reproducible search strategy. Most reviews had significant deficiencies in reporting the search process that raise questions about how these searches were conducted and ultimately cast serious doubts on the validity and reliability of reviews based on a potentially biased and incomplete body of literature. These deficiencies also highlight the need for veterinary journal editors and publishers to be more rigorous in requiring adherence to PRISMA guidelines and to encourage veterinary researchers to include librarians or information specialists on systematic review teams to improve the quality and reporting of searches.

  4. Process-conditioned bias correction for seasonal forecasting: a case-study with ENSO in Peru

    Science.gov (United States)

    Manzanas, R.; Gutiérrez, J. M.

    2018-05-01

    This work assesses the suitability of a first simple attempt for process-conditioned bias correction in the context of seasonal forecasting. To do this, we focus on the northwestern part of Peru and bias correct 1- and 4-month lead seasonal predictions of boreal winter (DJF) precipitation from the ECMWF System4 forecasting system for the period 1981-2010. In order to include information about the underlying large-scale circulation which may help to discriminate between precipitation affected by different processes, we introduce here an empirical quantile-quantile mapping method which runs conditioned on the state of the Southern Oscillation Index (SOI), which is accurately predicted by System4 and is known to affect the local climate. Beyond the reduction of model biases, our results show that the SOI-conditioned method yields better ROC skill scores and reliability than the raw model output over the entire region of study, whereas the standard unconditioned implementation provides no added value for any of these metrics. This suggests that conditioning the bias correction on simple but well-simulated large-scale processes relevant to the local climate may be a suitable approach for seasonal forecasting. Yet, further research on the suitability of the application of similar approaches to the one considered here for other regions, seasons and/or variables is needed.

  5. Attention restores discrete items to visual short-term memory.

    Science.gov (United States)

    Murray, Alexandra M; Nobre, Anna C; Clark, Ian A; Cravo, André M; Stokes, Mark G

    2013-04-01

    When a memory is forgotten, is it lost forever? Our study shows that selective attention can restore forgotten items to visual short-term memory (VSTM). In our two experiments, all stimuli presented in a memory array were designed to be equally task relevant during encoding. During the retention interval, however, participants were sometimes given a cue predicting which of the memory items would be probed at the end of the delay. This shift in task relevance improved recall for that item. We found that this type of cuing improved recall for items that otherwise would have been irretrievable, providing critical evidence that attention can restore forgotten information to VSTM. Psychophysical modeling of memory performance has confirmed that restoration of information in VSTM increases the probability that the cued item is available for recall but does not improve the representational quality of the memory. We further suggest that attention can restore discrete items to VSTM.

  6. Assessing the extent of non-stationary biases in GCMs

    Science.gov (United States)

    Nahar, Jannatun; Johnson, Fiona; Sharma, Ashish

    2017-06-01

    General circulation models (GCMs) are the main tools for estimating changes in the climate for the future. The imperfect representation of climate models introduces biases in the simulations that need to be corrected prior to their use for impact assessments. Bias correction methods generally assume that the bias calculated over the historical period does not change and can be applied to the future. This study investigates this assumption by considering the extent and nature of bias non-stationarity using 20th century precipitation and temperature simulations from six CMIP5 GCMs across Australia. Four statistics (mean, standard deviation, 10th and 90th quantiles) in monthly and seasonal biases are obtained for three different time window lengths (10, 25 and 33 years) to examine the properties of bias over time. This approach is repeated for two different phases of the Interdecadal Pacific Oscillation (IPO), which is known to have strong influences on the Australian climate. It is found that bias non-stationarity at decadal timescales is indeed an issue over some of Australia for some GCMs. When considering interdecadal variability there are significant difference in the bias between positive and negative phases of the IPO. Regional analyses confirmed these findings with the largest differences seen on the east coast of Australia, where IPO impacts tend to be the strongest. The nature of the bias non-stationarity found in this study suggests that it will be difficult to modify existing bias correction approaches to account for non-stationary biases. A more practical approach for impact assessments that use bias correction maybe to use a selection of GCMs where the assumption of bias non-stationarity holds.

  7. Assessing Projection Bias in Consumers' Food Preferences.

    Directory of Open Access Journals (Sweden)

    Tiziana de-Magistris

    Full Text Available The aim of this study is to test whether projection bias exists in consumers' purchasing decisions for food products. To achieve our aim, we used a non-hypothetical experiment (i.e., experimental auction, where hungry and non-hungry participants were incentivized to reveal their willingness to pay (WTP. The results confirm the existence of projection bias when consumers made their decisions on food products. In particular, projection bias existed because currently hungry participants were willing to pay a higher price premium for cheeses than satiated ones, both in hungry and satiated future states. Moreover, participants overvalued the food product more when they were delivered in the future hungry condition than in the satiated one. Our study provides clear, quantitative and meaningful evidence of projection bias because our findings are based on economic valuation of food preferences. Indeed, the strength of this study is that findings are expressed in terms of willingness to pay which is an interpretable amount of money.

  8. Reliability and validity of the Spanish version of the 10-item Connor-Davidson Resilience Scale (10-item CD-RISC in young adults

    Directory of Open Access Journals (Sweden)

    García-Campayo Javier

    2011-08-01

    Full Text Available Abstract Background The 10-item Connor-Davidson Resilience Scale (10-item CD-RISC is an instrument for measuring resilience that has shown good psychometric properties in its original version in English. The aim of this study was to evaluate the validity and reliability of the Spanish version of the 10-item CD-RISC in young adults and to verify whether it is structured in a single dimension as in the original English version. Findings Cross-sectional observational study including 681 university students ranging in age from 18 to 30 years. The number of latent factors in the 10 items of the scale was analyzed by exploratory factor analysis. Confirmatory factor analysis was used to verify whether a single factor underlies the 10 items of the scale as in the original version in English. The convergent validity was analyzed by testing whether the mean of the scores of the mental component of SF-12 (MCS and the quality of sleep as measured with the Pittsburgh Sleep Index (PSQI were higher in subjects with better levels of resilience. The internal consistency of the 10-item CD-RISC was estimated using the Cronbach α test and test-retest reliability was estimated with the intraclass correlation coefficient. The Cronbach α coefficient was 0.85 and the test-retest intraclass correlation coefficient was 0.71. The mean MCS score and the level of quality of sleep in both men and women were significantly worse in subjects with lower resilience scores. Conclusions The Spanish version of the 10-item CD-RISC showed good psychometric properties in young adults and thus can be used as a reliable and valid instrument for measuring resilience. Our study confirmed that a single factor underlies the resilience construct, as was the case of the original scale in English.

  9. Culture modulates implicit ownership-induced self-bias in memory.

    Science.gov (United States)

    Sparks, Samuel; Cunningham, Sheila J; Kritikos, Ada

    2016-08-01

    The relation of incoming stimuli to the self implicitly determines the allocation of cognitive resources. Cultural variations in the self-concept shape cognition, but the extent is unclear because the majority of studies sample only Western participants. We report cultural differences (Asian versus Western) in ownership-induced self-bias in recognition memory for objects. In two experiments, participants allocated a series of images depicting household objects to self-owned or other-owned virtual baskets based on colour cues before completing a surprise recognition memory test for the objects. The 'other' was either a stranger or a close other. In both experiments, Western participants showed greater recognition memory accuracy for self-owned compared with other-owned objects, consistent with an independent self-construal. In Experiment 1, which required minimal attention to the owned objects, Asian participants showed no such ownership-related bias in recognition accuracy. In Experiment 2, which required attention to owned objects to move them along the screen, Asian participants again showed no overall memory advantage for self-owned items and actually exhibited higher recognition accuracy for mother-owned than self-owned objects, reversing the pattern observed for Westerners. This is consistent with an interdependent self-construal which is sensitive to the particular relationship between the self and other. Overall, our results suggest that the self acts as an organising principle for allocating cognitive resources, but that the way it is constructed depends upon cultural experience. Additionally, the manifestation of these cultural differences in self-representation depends on the allocation of attentional resources to self- and other-associated stimuli. Crown Copyright © 2016. Published by Elsevier B.V. All rights reserved.

  10. A strategy for optimizing item-pool management

    NARCIS (Netherlands)

    Ariel, A.; van der Linden, Willem J.; Veldkamp, Bernard P.

    2006-01-01

    Item-pool management requires a balancing act between the input of new items into the pool and the output of tests assembled from it. A strategy for optimizing item-pool management is presented that is based on the idea of a periodic update of an optimal blueprint for the item pool to tune item

  11. Food items contributing most to variation in antioxidant intake; a cross-sectional study among Norwegian women.

    Science.gov (United States)

    Qureshi, Samera Azeem; Lund, Annette Christin; Veierød, Marit Bragelien; Carlsen, Monica Hauger; Blomhoff, Rune; Andersen, Lene Frost; Ursin, Giske

    2014-01-16

    Fruit and vegetable intake has been found to reduce the risk of cardiovascular disease, certain types of cancer and diabetes mellitus. It is possible that antioxidants play a large part in this protective effect. However, which foods account for the variation in antioxidant intake in a population is not very clear. We used food frequency data from a population-based sample of women to identify the food items that contributed most to the variation in antioxidant intake in Norwegian diet. We used data from a study conducted among participants in the Norwegian Breast Cancer Screening Program (NBCSP), the national program which invites women aged 50-69 years to mammographic screening every 2 years. A subset of 6514 women who attended the screening in 2006/2007 completed a food frequency questionnaire (FFQ). Daily intake of energy, nutrients and antioxidant intake were estimated. We used multiple linear regression analysis to capture the variation in antioxidant intake. The mean (SD) antioxidant intake was 23.0 (8.5) mmol/day. Coffee consumption explained 54% of the variation in antioxidant intake, while fruits and vegetables explained 22%. The twenty food items that contributed most to the total variation in antioxidant intake explained 98% of the variation in intake. These included different types of coffee, tea, red wine, blueberries, walnuts, oranges, cinnamon and broccoli. In this study we identified a list of food items which capture the variation in antioxidant intake among these women. The major contributors to dietary total antioxidant intake were coffee, tea, red wine, blueberries, walnuts, oranges, cinnamon and broccoli. These items should be assessed in as much detail as possible in studies that wish to capture the variation in antioxidant intake.

  12. Large-scale galaxy bias

    Science.gov (United States)

    Jeong, Donghui; Desjacques, Vincent; Schmidt, Fabian

    2018-01-01

    Here, we briefly introduce the key results of the recent review (arXiv:1611.09787), whose abstract is as following. This review presents a comprehensive overview of galaxy bias, that is, the statistical relation between the distribution of galaxies and matter. We focus on large scales where cosmic density fields are quasi-linear. On these scales, the clustering of galaxies can be described by a perturbative bias expansion, and the complicated physics of galaxy formation is absorbed by a finite set of coefficients of the expansion, called bias parameters. The review begins with a detailed derivation of this very important result, which forms the basis of the rigorous perturbative description of galaxy clustering, under the assumptions of General Relativity and Gaussian, adiabatic initial conditions. Key components of the bias expansion are all leading local gravitational observables, which include the matter density but also tidal fields and their time derivatives. We hence expand the definition of local bias to encompass all these contributions. This derivation is followed by a presentation of the peak-background split in its general form, which elucidates the physical meaning of the bias parameters, and a detailed description of the connection between bias parameters and galaxy (or halo) statistics. We then review the excursion set formalism and peak theory which provide predictions for the values of the bias parameters. In the remainder of the review, we consider the generalizations of galaxy bias required in the presence of various types of cosmological physics that go beyond pressureless matter with adiabatic, Gaussian initial conditions: primordial non-Gaussianity, massive neutrinos, baryon-CDM isocurvature perturbations, dark energy, and modified gravity. Finally, we discuss how the description of galaxy bias in the galaxies' rest frame is related to clustering statistics measured from the observed angular positions and redshifts in actual galaxy catalogs.

  13. On the Borders of Harmful and Helpful Beauty Biases

    Directory of Open Access Journals (Sweden)

    Maria Agthe

    2016-06-01

    Full Text Available Research with European Caucasian samples demonstrates that attractiveness-based biases in social evaluation depend on the constellation of the sex of the evaluator and the sex of the target: Whereas people generally show positive biases toward attractive opposite-sex persons, they show less positive or even negative biases toward attractive same-sex persons. By examining these biases both within and between different ethnicities, the current studies provide new evidence for both the generalizability and the specificity of these attractiveness-based social perception biases. Examining within-ethnicity effects, Study 1 is the first to demonstrate that samples from diverse ethnic backgrounds parallel the finding of European Caucasian samples: The advantageous or adverse effects of attractiveness depend on the gender constellation of the evaluator and the evaluated person. Examining between-ethnicity effects, Study 2 found that these attractiveness-based biases emerge almost exclusively toward targets of the evaluator’s own ethnic background; these biases were reduced or eliminated for cross-ethnicity evaluations and interaction intentions. We discuss these findings in light of evolutionary principles and reflect on potential interactions between culture and evolved cognitive mechanisms.

  14. Information environment, behavioral biases, and home bias in analysts’ recommendations

    DEFF Research Database (Denmark)

    Farooq, Omar; Taouss, Mohammed

    2012-01-01

    Can information environment of a firm explain home bias in analysts’ recommendations? Can the extent of agency problems explain optimism difference between foreign and local analysts? This paper answers these questions by documenting the effect of information environment on home bias in analysts’...

  15. Are great apes able to reason from multi-item samples to populations of food items?

    Science.gov (United States)

    Eckert, Johanna; Rakoczy, Hannes; Call, Josep

    2017-10-01

    Inductive learning from limited observations is a cognitive capacity of fundamental importance. In humans, it is underwritten by our intuitive statistics, the ability to draw systematic inferences from populations to randomly drawn samples and vice versa. According to recent research in cognitive development, human intuitive statistics develops early in infancy. Recent work in comparative psychology has produced first evidence for analogous cognitive capacities in great apes who flexibly drew inferences from populations to samples. In the present study, we investigated whether great apes (Pongo abelii, Pan troglodytes, Pan paniscus, Gorilla gorilla) also draw inductive inferences in the opposite direction, from samples to populations. In two experiments, apes saw an experimenter randomly drawing one multi-item sample from each of two populations of food items. The populations differed in their proportion of preferred to neutral items (24:6 vs. 6:24) but apes saw only the distribution of food items in the samples that reflected the distribution of the respective populations (e.g., 4:1 vs. 1:4). Based on this observation they were then allowed to choose between the two populations. Results show that apes seemed to make inferences from samples to populations and thus chose the population from which the more favorable (4:1) sample was drawn in Experiment 1. In this experiment, the more attractive sample not only contained proportionally but also absolutely more preferred food items than the less attractive sample. Experiment 2, however, revealed that when absolute and relative frequencies were disentangled, apes performed at chance level. Whether these limitations in apes' performance reflect true limits of cognitive competence or merely performance limitations due to accessory task demands is still an open question. © 2017 Wiley Periodicals, Inc.

  16. Attentional Bias towards Positive Emotion Predicts Stress Resilience.

    Science.gov (United States)

    Thoern, Hanna A; Grueschow, Marcus; Ehlert, Ulrike; Ruff, Christian C; Kleim, Birgit

    2016-01-01

    There is extensive evidence for an association between an attentional bias towards emotionally negative stimuli and vulnerability to stress-related psychopathology. Less is known about whether selective attention towards emotionally positive stimuli relates to mental health and stress resilience. The current study used a modified Dot Probe task to investigate if individual differences in attentional biases towards either happy or angry emotional stimuli, or an interaction between these biases, are related to self-reported trait stress resilience. In a nonclinical sample (N = 43), we indexed attentional biases as individual differences in reaction time for stimuli preceded by either happy or angry (compared to neutral) face stimuli. Participants with greater attentional bias towards happy faces (but not angry faces) reported higher trait resilience. However, an attentional bias towards angry stimuli moderated this effect: The attentional bias towards happy faces was only predictive for resilience in those individuals who also endorsed an attentional bias towards angry stimuli. An attentional bias towards positive emotional stimuli may thus be a protective factor contributing to stress resilience, specifically in those individuals who also endorse an attentional bias towards negative emotional stimuli. Our findings therefore suggest a novel target for prevention and treatment interventions addressing stress-related psychopathology.

  17. Item response theory - A first approach

    Science.gov (United States)

    Nunes, Sandra; Oliveira, Teresa; Oliveira, Amílcar

    2017-07-01

    The Item Response Theory (IRT) has become one of the most popular scoring frameworks for measurement data, frequently used in computerized adaptive testing, cognitively diagnostic assessment and test equating. According to Andrade et al. (2000), IRT can be defined as a set of mathematical models (Item Response Models - IRM) constructed to represent the probability of an individual giving the right answer to an item of a particular test. The number of Item Responsible Models available to measurement analysis has increased considerably in the last fifteen years due to increasing computer power and due to a demand for accuracy and more meaningful inferences grounded in complex data. The developments in modeling with Item Response Theory were related with developments in estimation theory, most remarkably Bayesian estimation with Markov chain Monte Carlo algorithms (Patz & Junker, 1999). The popularity of Item Response Theory has also implied numerous overviews in books and journals, and many connections between IRT and other statistical estimation procedures, such as factor analysis and structural equation modeling, have been made repeatedly (Van der Lindem & Hambleton, 1997). As stated before the Item Response Theory covers a variety of measurement models, ranging from basic one-dimensional models for dichotomously and polytomously scored items and their multidimensional analogues to models that incorporate information about cognitive sub-processes which influence the overall item response process. The aim of this work is to introduce the main concepts associated with one-dimensional models of Item Response Theory, to specify the logistic models with one, two and three parameters, to discuss some properties of these models and to present the main estimation procedures.

  18. Identifying predictors of physics item difficulty: A linear regression approach

    Science.gov (United States)

    Mesic, Vanes; Muratovic, Hasnija

    2011-06-01

    Large-scale assessments of student achievement in physics are often approached with an intention to discriminate students based on the attained level of their physics competencies. Therefore, for purposes of test design, it is important that items display an acceptable discriminatory behavior. To that end, it is recommended to avoid extraordinary difficult and very easy items. Knowing the factors that influence physics item difficulty makes it possible to model the item difficulty even before the first pilot study is conducted. Thus, by identifying predictors of physics item difficulty, we can improve the test-design process. Furthermore, we get additional qualitative feedback regarding the basic aspects of student cognitive achievement in physics that are directly responsible for the obtained, quantitative test results. In this study, we conducted a secondary analysis of data that came from two large-scale assessments of student physics achievement at the end of compulsory education in Bosnia and Herzegovina. Foremost, we explored the concept of “physics competence” and performed a content analysis of 123 physics items that were included within the above-mentioned assessments. Thereafter, an item database was created. Items were described by variables which reflect some basic cognitive aspects of physics competence. For each of the assessments, Rasch item difficulties were calculated in separate analyses. In order to make the item difficulties from different assessments comparable, a virtual test equating procedure had to be implemented. Finally, a regression model of physics item difficulty was created. It has been shown that 61.2% of item difficulty variance can be explained by factors which reflect the automaticity, complexity, and modality of the knowledge structure that is relevant for generating the most probable correct solution, as well as by the divergence of required thinking and interference effects between intuitive and formal physics knowledge

  19. Identifying predictors of physics item difficulty: A linear regression approach

    Directory of Open Access Journals (Sweden)

    Hasnija Muratovic

    2011-06-01

    Full Text Available Large-scale assessments of student achievement in physics are often approached with an intention to discriminate students based on the attained level of their physics competencies. Therefore, for purposes of test design, it is important that items display an acceptable discriminatory behavior. To that end, it is recommended to avoid extraordinary difficult and very easy items. Knowing the factors that influence physics item difficulty makes it possible to model the item difficulty even before the first pilot study is conducted. Thus, by identifying predictors of physics item difficulty, we can improve the test-design process. Furthermore, we get additional qualitative feedback regarding the basic aspects of student cognitive achievement in physics that are directly responsible for the obtained, quantitative test results. In this study, we conducted a secondary analysis of data that came from two large-scale assessments of student physics achievement at the end of compulsory education in Bosnia and Herzegovina. Foremost, we explored the concept of “physics competence” and performed a content analysis of 123 physics items that were included within the above-mentioned assessments. Thereafter, an item database was created. Items were described by variables which reflect some basic cognitive aspects of physics competence. For each of the assessments, Rasch item difficulties were calculated in separate analyses. In order to make the item difficulties from different assessments comparable, a virtual test equating procedure had to be implemented. Finally, a regression model of physics item difficulty was created. It has been shown that 61.2% of item difficulty variance can be explained by factors which reflect the automaticity, complexity, and modality of the knowledge structure that is relevant for generating the most probable correct solution, as well as by the divergence of required thinking and interference effects between intuitive and formal

  20. Item response theory analysis of the Pain Self-Efficacy Questionnaire.

    Science.gov (United States)

    Costa, Daniel S J; Asghari, Ali; Nicholas, Michael K

    2017-01-01

    The Pain Self-Efficacy Questionnaire (PSEQ) is a 10-item instrument designed to assess the extent to which a person in pain believes s/he is able to accomplish various activities despite their pain. There is strong evidence for the validity and reliability of both the full-length PSEQ and a 2-item version. The purpose of this study is to further examine the properties of the PSEQ using an item response theory (IRT) approach. We used the two-parameter graded response model to examine the category probability curves, and location and discrimination parameters of the 10 PSEQ items. In item response theory, responses to a set of items are assumed to be probabilistically determined by a latent (unobserved) variable. In the graded-response model specifically, item response threshold (the value of the latent variable for which adjacent response categories are equally likely) and discrimination parameters are estimated for each item. Participants were 1511 mixed, chronic pain patients attending for initial assessment at a tertiary pain management centre. All items except item 7 ('I can cope with my pain without medication') performed well in IRT analysis, and the category probability curves suggested that participants used the 7-point response scale consistently. Items 6 ('I can still do many of the things I enjoy doing, such as hobbies or leisure activity, despite pain'), 8 ('I can still accomplish most of my goals in life, despite the pain') and 9 ('I can live a normal lifestyle, despite the pain') captured higher levels of the latent variable with greater precision. The results from this IRT analysis add to the body of evidence based on classical test theory illustrating the strong psychometric properties of the PSEQ. Despite the relatively poor performance of Item 7, its clinical utility warrants its retention in the questionnaire. The strong psychometric properties of the PSEQ support its use as an effective tool for assessing self-efficacy in people with pain

  1. Using Item Response Theory to Develop a 60-Item Representation of the NEO PI-R Using the International Personality Item Pool: Development of the IPIP-NEO-60.

    Science.gov (United States)

    Maples-Keller, Jessica L; Williamson, Rachel L; Sleep, Chelsea E; Carter, Nathan T; Campbell, W Keith; Miller, Joshua D

    2017-10-31

    Given advantages of freely available and modifiable measures, an increase in the use of measures developed from the International Personality Item Pool (IPIP), including the 300-item representation of the Revised NEO Personality Inventory (NEO PI-R; Costa & McCrae, 1992a ) has occurred. The focus of this study was to use item response theory to develop a 60-item, IPIP-based measure of the Five-Factor Model (FFM) that provides equal representation of the FFM facets and to test the reliability and convergent and criterion validity of this measure compared to the NEO Five Factor Inventory (NEO-FFI). In an undergraduate sample (n = 359), scores from the NEO-FFI and IPIP-NEO-60 demonstrated good reliability and convergent validity with the NEO PI-R and IPIP-NEO-300. Additionally, across criterion variables in the undergraduate sample as well as a community-based sample (n = 757), the NEO-FFI and IPIP-NEO-60 demonstrated similar nomological networks across a wide range of external variables (r ICC = .96). Finally, as expected, in an MTurk sample the IPIP-NEO-60 demonstrated advantages over the Big Five Inventory-2 (Soto & John, 2017 ; n = 342) with regard to the Agreeableness domain content. The results suggest strong reliability and validity of the IPIP-NEO-60 scores.

  2. Distinctive Characteristics of Sexual Orientation Bias Crimes

    Science.gov (United States)

    Stacey, Michele

    2011-01-01

    Despite increased attention in the area of hate crime research in the past 20 years, sexual orientation bias crimes have rarely been singled out for study. When these types of crimes are looked at, the studies are typically descriptive in nature. This article seeks to increase our knowledge of sexual orientation bias by answering the question:…

  3. Self-reported Cognitive Biases Moderate the Associations Between Social Stress and Paranoid Ideation in a Virtual Reality Experimental Study

    NARCIS (Netherlands)

    Pot-Kolder, Roos; Veling, Wim; Counotte, Jacqueline; van der Gaag, Mark

    2017-01-01

    Introduction: Cognitive biases are associated with psychosis liability and paranoid ideation. This study investigated the moderating relationship between pre-existing self-reported cognitive biases and the occurrence of paranoid ideation in response to different levels of social stress in a virtual

  4. Analysis Test of Understanding of Vectors with the Three-Parameter Logistic Model of Item Response Theory and Item Response Curves Technique

    Science.gov (United States)

    Rakkapao, Suttida; Prasitpong, Singha; Arayathanitkul, Kwan

    2016-01-01

    This study investigated the multiple-choice test of understanding of vectors (TUV), by applying item response theory (IRT). The difficulty, discriminatory, and guessing parameters of the TUV items were fit with the three-parameter logistic model of IRT, using the parscale program. The TUV ability is an ability parameter, here estimated assuming…

  5. Harmonizing Measures of Cognitive Performance Across International Surveys of Aging Using Item Response Theory.

    Science.gov (United States)

    Chan, Kitty S; Gross, Alden L; Pezzin, Liliana E; Brandt, Jason; Kasper, Judith D

    2015-12-01

    To harmonize measures of cognitive performance using item response theory (IRT) across two international aging studies. Data for persons ≥65 years from the Health and Retirement Study (HRS, N = 9,471) and the English Longitudinal Study of Aging (ELSA, N = 5,444). Cognitive performance measures varied (HRS fielded 25, ELSA 13); 9 were in common. Measurement precision was examined for IRT scores based on (a) common items, (b) common items adjusted for differential item functioning (DIF), and (c) DIF-adjusted all items. Three common items (day of date, immediate word recall, and delayed word recall) demonstrated DIF by survey. Adding survey-specific items improved precision but mainly for HRS respondents at lower cognitive levels. IRT offers a feasible strategy for harmonizing cognitive performance measures across other surveys and for other multi-item constructs of interest in studies of aging. Practical implications depend on sample distribution and the difficulty mix of in-common and survey-specific items. © The Author(s) 2015.

  6. Interpretation bias and social anxiety: does interpretation bias mediate the relationship between trait social anxiety and state anxiety responses?

    Science.gov (United States)

    Chen, Junwen; Milne, Kirby; Dayman, Janet; Kemps, Eva

    2018-05-23

    Two studies aimed to examine whether high socially anxious individuals are more likely to negatively interpret ambiguous social scenarios and facial expressions compared to low socially anxious individuals. We also examined whether interpretation bias serves as a mediator of the relationship between trait social anxiety and state anxiety responses, in particular current state anxiety, bodily sensations, and perceived probability and cost of negative evaluation pertaining to a speech task. Study 1 used ambiguous social scenarios and Study 2 used ambiguous facial expressions as stimuli to objectively assess interpretation bias. Undergraduate students with high and low social anxiety completed measures of state anxiety responses at three time points: baseline, after the interpretation bias task, and after the preparation for an impromptu speech. Results showed that high socially anxious individuals were more likely to endorse threat interpretations for ambiguous social scenarios and to interpret ambiguous faces as negative than low socially anxious individuals. Furthermore, negative interpretations mediated the relationship between trait social anxiety and perceived probability of negative evaluation pertaining to the speech task in Study 1 but not Study 2. The present studies provide new insight into the role of interpretation bias in social anxiety.

  7. Emotional sensitization highlights the attentional bias in blood-injection-injury phobics: an ERP study.

    Science.gov (United States)

    Sarlo, Michela; Buodo, Giulia; Devigili, Andrea; Munafò, Marianna; Palomba, Daniela

    2011-02-18

    The presence of an attentional bias towards disorder-related stimuli has not been consistently demonstrated in blood phobics. The present study was aimed at investigating whether or not an attentional bias, as measured by event-related potentials (ERPs), could be highlighted in blood phobics by inducing cognitive-emotional sensitization through the repetitive presentation of different disorder-related pictures. The mean amplitudes of the N100, P200, P300 and late positive potentials to picture onset were assessed along with subjective ratings of valence and arousal in 13 blood phobics and 12 healthy controls. Blood phobics, but not controls, showed a linear increase of subjective arousal over time, suggesting that cognitive-emotional sensitization did occur. The analysis of cortical responses showed larger N100 and smaller late positive potentials in phobics than in controls in response to mutilations. These findings suggest that cognitive-emotional sensitization induced an attentional bias in blood phobics during picture viewing, involving early selective encoding and late cognitive avoidance of disorder-related stimuli depicting mutilations. © 2010 Elsevier Ireland Ltd. All rights reserved.

  8. An Effect Size Measure for Raju's Differential Functioning for Items and Tests

    Science.gov (United States)

    Wright, Keith D.; Oshima, T. C.

    2015-01-01

    This study established an effect size measure for differential functioning for items and tests' noncompensatory differential item functioning (NCDIF). The Mantel-Haenszel parameter served as the benchmark for developing NCDIF's effect size measure for reporting moderate and large differential item functioning in test items. The effect size of…

  9. Improving measurement of injection drug risk behavior using item response theory.

    Science.gov (United States)

    Janulis, Patrick

    2014-03-01

    Recent research highlights the multiple steps to preparing and injecting drugs and the resultant viral threats faced by drug users. This research suggests that more sensitive measurement of injection drug HIV risk behavior is required. In addition, growing evidence suggests there are gender differences in injection risk behavior. However, the potential for differential item functioning between genders has not been explored. To explore item response theory as an improved measurement modeling technique that provides empirically justified scaling of injection risk behavior and to examine for potential gender-based differential item functioning. Data is used from three studies in the National Institute on Drug Abuse's Criminal Justice Drug Abuse Treatment Studies. A two-parameter item response theory model was used to scale injection risk behavior and logistic regression was used to examine for differential item functioning. Item fit statistics suggest that item response theory can be used to scale injection risk behavior and these models can provide more sensitive estimates of risk behavior. Additionally, gender-based differential item functioning is present in the current data. Improved measurement of injection risk behavior using item response theory should be encouraged as these models provide increased congruence between construct measurement and the complexity of injection-related HIV risk. Suggestions are made to further improve injection risk behavior measurement. Furthermore, results suggest direct comparisons of composite scores between males and females may be misleading and future work should account for differential item functioning before comparing levels of injection risk behavior.

  10. Biased language use in stereotype maintenance : The role of encoding and goals

    NARCIS (Netherlands)

    Wenneker, CPJ; Wigboldus, DHJ; Spears, R

    2005-01-01

    In 4 studies, the authors investigated the relative impact of biased encoding of information and communication goals on biased language use. A category label (linguistic expectancy bias, Study 1) or a group label (linguistic intergroup bias, Study 2) was presented either before or after a story that

  11. Apparatus bias and place conditioning with ethanol in mice.

    Science.gov (United States)

    Cunningham, Christopher L; Ferree, Nikole K; Howard, MacKenzie A

    2003-12-01

    Although the distinction between "biased" and "unbiased" is generally recognized as an important methodological issue in place conditioning, previous studies have not adequately addressed the distinction between a biased/unbiased apparatus and a biased/unbiased stimulus assignment procedure. Moreover, a review of the recent literature indicates that many reports (70% of 76 papers published in 2001) fail to provide adequate information about apparatus bias. This issue is important because the mechanisms underlying a drug's effect in the place-conditioning procedure may differ depending on whether the apparatus is biased or unbiased. The present studies were designed to assess the impact of apparatus bias and stimulus assignment procedure on ethanol-induced place conditioning in mice (DBA/2 J). A secondary goal was to compare various dependent variables commonly used to index conditioned place preference. Apparatus bias was manipulated by varying the combination of tactile (floor) cues available during preference tests. Experiment 1 used an unbiased apparatus in which the stimulus alternatives were equally preferred during a pre-test as indicated by the group average. Experiment 2 used a biased apparatus in which one of the stimuli was strongly preferred by most mice (mean % time on cue = 67%) during the pre-test. In both studies, the stimulus paired with drug (CS+) was assigned randomly (i.e., an "unbiased" stimulus assignment procedure). Experimental mice received four pairings of CS+ with ethanol (2 g/kg, i.p.) and four pairings of the alternative stimulus (CS-) with saline; control mice received saline on both types of trial. Each experiment concluded with a 60-min choice test. With the unbiased apparatus (experiment 1), significant place conditioning was obtained regardless of whether drug was paired with the subject's initially preferred or non-preferred stimulus. However, with the biased apparatus (experiment 2), place conditioning was apparent only when

  12. Understanding and Overcoming Implicit Gender Bias in Plastic Surgery.

    Science.gov (United States)

    Phillips, Nicole A; Tannan, Shruti C; Kalliainen, Loree K

    2016-11-01

    Although explicit sex-based discrimination has largely been deemed unacceptable in professional settings, implicit gender bias persists and results in a significant lack of parity in plastic surgery and beyond. Implicit gender bias is the result of a complex interplay of cultural and societal expectations, learned behaviors, and standardized associations. As such, both male and female surgeons are subject to its influence. A review of the literature was conducted, examining theories of gender bias, current manifestations of gender bias in plastic surgery and other fields, and interventions designed to address gender bias. Multiple studies demonstrate persistent gender bias that impacts female physicians at all levels of training. Several institutions have enacted successful interventions to identify and address gender bias. Explicit gender bias has largely disappeared, yet unconscious or implicit gender bias persists. A wide-scale commitment to addressing implicit gender bias in plastic surgery is necessary and overdue. Recommendations include immediate actions that can be undertaken on an individual basis, and changes that should be implemented at a national and international level by leaders in the field.

  13. The self-attribution bias and paranormal beliefs.

    Science.gov (United States)

    van Elk, Michiel

    2017-03-01

    The present study investigated the relation between paranormal beliefs, illusory control and the self-attribution bias, i.e., the motivated tendency to attribute positive outcomes to oneself while negative outcomes are externalized. Visitors of a psychic fair played a card guessing game and indicated their perceived control over randomly selected cards as a function of the congruency and valence of the card. A stronger self-attribution bias was observed for paranormal believers compared to skeptics and this bias was specifically related to traditional religious beliefs and belief in superstition. No relation between paranormal beliefs and illusory control was found. Self-report measures indicated that paranormal beliefs were associated to being raised in a spiritual family and to anomalous experiences during childhood. Thereby this study suggests that paranormal beliefs are related to specific cognitive biases that in turn are shaped by socio-cultural factors. Copyright © 2017 Elsevier Inc. All rights reserved.

  14. Shilling Attacks Detection in Recommender Systems Based on Target Item Analysis.

    Science.gov (United States)

    Zhou, Wei; Wen, Junhao; Koh, Yun Sing; Xiong, Qingyu; Gao, Min; Dobbie, Gillian; Alam, Shafiq

    2015-01-01

    Recommender systems are highly vulnerable to shilling attacks, both by individuals and groups. Attackers who introduce biased ratings in order to affect recommendations, have been shown to negatively affect collaborative filtering (CF) algorithms. Previous research focuses only on the differences between genuine profiles and attack profiles, ignoring the group characteristics in attack profiles. In this paper, we study the use of statistical metrics to detect rating patterns of attackers and group characteristics in attack profiles. Another question is that most existing detecting methods are model specific. Two metrics, Rating Deviation from Mean Agreement (RDMA) and Degree of Similarity with Top Neighbors (DegSim), are used for analyzing rating patterns between malicious profiles and genuine profiles in attack models. Building upon this, we also propose and evaluate a detection structure called RD-TIA for detecting shilling attacks in recommender systems using a statistical approach. In order to detect more complicated attack models, we propose a novel metric called DegSim' based on DegSim. The experimental results show that our detection model based on target item analysis is an effective approach for detecting shilling attacks.

  15. Shilling Attacks Detection in Recommender Systems Based on Target Item Analysis

    Science.gov (United States)

    Zhou, Wei; Wen, Junhao; Koh, Yun Sing; Xiong, Qingyu; Gao, Min; Dobbie, Gillian; Alam, Shafiq

    2015-01-01

    Recommender systems are highly vulnerable to shilling attacks, both by individuals and groups. Attackers who introduce biased ratings in order to affect recommendations, have been shown to negatively affect collaborative filtering (CF) algorithms. Previous research focuses only on the differences between genuine profiles and attack profiles, ignoring the group characteristics in attack profiles. In this paper, we study the use of statistical metrics to detect rating patterns of attackers and group characteristics in attack profiles. Another question is that most existing detecting methods are model specific. Two metrics, Rating Deviation from Mean Agreement (RDMA) and Degree of Similarity with Top Neighbors (DegSim), are used for analyzing rating patterns between malicious profiles and genuine profiles in attack models. Building upon this, we also propose and evaluate a detection structure called RD-TIA for detecting shilling attacks in recommender systems using a statistical approach. In order to detect more complicated attack models, we propose a novel metric called DegSim’ based on DegSim. The experimental results show that our detection model based on target item analysis is an effective approach for detecting shilling attacks. PMID:26222882

  16. 76 FR 60474 - Commercial Item Handbook

    Science.gov (United States)

    2011-09-29

    ... DEPARTMENT OF DEFENSE Defense Acquisition Regulations System Commercial Item Handbook AGENCY.... SUMMARY: DoD has updated its Commercial Item Handbook. The purpose of the Handbook is to help acquisition personnel develop sound business strategies for procuring commercial items. DoD is seeking industry input on...

  17. Corpus-based Transitivity Biases in Individuals with Aphasia

    Directory of Open Access Journals (Sweden)

    Gayle DeDe

    2015-04-01

    Full Text Available Introduction Spontaneous speech samples in individuals with aphasia (IWA have been analyzed to examine many different psycholinguistic features. The present study focused on how IWA use verbs in spontaneous speech. Some verbs can occur in more than one argument structure, but are biased to occur more frequently in one frame than another. For example, "watch" appears in transitive and intransitive structures, but is usually used transitively. This is known as a transitivity bias. It is unknown whether IWA show the same transitivity biases in production as those reported in previous corpus studies with unimpaired individuals. Studies of sentence comprehension have shown that IWA are sensitive to verb biases (e.g., DeDe, 2013. In addition, IWA have shown an overall preference for transitive structures, which are the most frequent structures in English (Roland, Dick, & Elman, 2007. The present study investigated whether IWA show the same pattern of transitive and intransitive biases in spontaneous speech as unimpaired individuals. Method Participants: 278 interviews with IWA were taken from AphasiaBank. The IWA represented a range of aphasia types. Participants were omitted if they spoke English as a second language. Materials: 54 verbs were coded. We chose verbs with the goal of representing different bias types (e.g., transitive, intransitive, sentential complement. Of these, data from 11 transitively biased and 11 intransitively biased verbs (matched for frequency of use and number of syllables are presented here. Coding: All productions of the 54 verbs were coded. The coding protocol was based on Gahl, Jurafsky, and Roland (2004. We implemented an additional level of coding to indicate erroneous verb productions, such as ungrammatical structures and verb agreement errors. Results The (intransitivity biases for IWA were compared to biases from a previously published corpus study (Gahl et al., 2004. The IWA used transitively biased verbs in

  18. Dissociation between source and item memory in Parkinson's disease

    Institute of Scientific and Technical Information of China (English)

    Hu Panpan; Li Youhai; Ma Huijuan; Xi Chunhua; Chen Xianwen; Wang Kai

    2014-01-01

    Background Episodic memory includes information about item memory and source memory.Many researches support the hypothesis that these two memory systems are implemented by different brain structures.The aim of this study was to investigate the characteristics of item memory and source memory processing in patients with Parkinson's disease (PD),and to further verify the hypothesis of dual-process model of source and item memory.Methods We established a neuropsychological battery to measure the performance of item memory and source memory.Totally 35 PD individuals and 35 matched healthy controls (HC) were administrated with the battery.Item memory task consists of the learning and recognition of high-frequency national Chinese characters; source memory task consists of the learning and recognition of three modes (character,picture,and image) of objects.Results Compared with the controls,the idiopathic PD patients have been impaired source memory (PD vs.HC:0.65±0.06 vs.0.72±0.09,P=0.001),but not impaired in item memory (PD vs.HC:0.65±0.07 vs.0.67±0.08,P=0.240).Conclusions The present experiment provides evidence for dissociation between item and source memory in PD patients,thereby strengthening the claim that the item or source memory rely on different brain structures.PD patients show poor source memory,in which dopamine plays a critical role.

  19. Criteria for eliminating items of a Test of Figural Analogies

    Directory of Open Access Journals (Sweden)

    Diego Blum

    2013-12-01

    Full Text Available This paper describes the steps taken to eliminate two of the items in a Test of Figural Analogies (TFA. The main guidelines of psychometric analysis concerning Classical Test Theory (CTT and Item Response Theory (IRT are explained. The item elimination process was based on both the study of the CTT difficulty and discrimination index, and the unidimensionality analysis. The a, b, and c parameters of the Three Parameter Logistic Model of IRT were also considered for this purpose, as well as the assessment of each item fitting this model. The unfavourable characteristics of a group of TFA items are detailed, and decisions leading to their possible elimination are discussed.

  20. Exploring differential item functioning (DIF) with the Rasch model: A comparison of gender differences on eighth-grade science items in the United States and Spain

    Science.gov (United States)

    Calvert, Tasha

    Despite the attention that has been given to gender and science, boys continue to outperform girls in science achievement, particularly by the end of secondary school. Because it is unclear whether gender differences have narrowed over time (Leder, 1992; Willingham & Cole, 1997), it is important to continue a line of inquiry into the nature of gender differences, specifically at the international level. The purpose of this study was to investigate gender differences in science achievement across two countries: United States and Spain. A secondary purpose was to demonstrate an alternative method for exploring gender differences based on the many-faceted Rasch model (1980). A secondary analysis of the data from the Third International Mathematics and Science Study (TIMSS) was used to examine the relationship between gender DIF (differential item functioning) and item characteristics (item type, content, and performance expectation) across both countries. Nationally representative samples of eighth grade students in the United States and Spain who participated in TIMSS were analyzed to answer the research questions in this study. In both countries, girls showed an advantage over boys on life science items and most extended response items, whereas boys, by and large, had an advantage on earth science, physics, and chemistry items. However, even within areas that favored boys, such as physics, there were items that were differentially easier for girls. In general, patterns in gender differences were similar across both countries although there were a few differences between the countries on individual items. It was concluded that simply looking at mean differences does not provide an adequate understanding of the nature of gender differences in science achievement.

  1. Forms of Attrition in a Longitudinal Study of Religion and Health in Older Adults and Implications for Sample Bias.

    Science.gov (United States)

    Hayward, R David; Krause, Neal

    2016-02-01

    The use of longitudinal designs in the field of religion and health makes it important to understand how attrition bias may affect findings in this area. This study examines attrition in a 4-wave, 8-year study of older adults. Attrition resulted in a sample biased toward more educated and more religiously involved individuals. Conditional linear growth curve models found that trajectories of change for some variables differed among attrition categories. Ineligibles had worsening depression, declining control, and declining attendance. Mortality was associated with worsening religious coping styles. Refusers experienced worsening depression. Nevertheless, there was no evidence of bias in the key religion and health results.

  2. Giant exchange bias in MnPd/Co bilayers

    International Nuclear Information System (INIS)

    Nguyen Thanh Nam; Nguyen Phu Thuy; Nguyen Anh Tuan; Nguyen Nguyen Phuoc; Suzuki, Takao

    2007-01-01

    A systematic study of exchange bias in MnPd/Co bilayers has been carried out, where the dependences of exchange bias, unidirectional anisotropy constant and coercivity on the thicknesses of MnPd and Co layers were investigated. A huge unidirectional anisotropy constant, J K =2.5erg/cm 2 was observed, which is in reasonable agreement with the theoretical prediction based on the model by Meiklejohn and Bean. The angular dependences of exchange bias field and coercivity have also been examined showing that both exchange bias and coercivity follow 1/cosα rule

  3. Spare Items validation

    International Nuclear Information System (INIS)

    Fernandez Carratala, L.

    1998-01-01

    There is an increasing difficulty for purchasing safety related spare items, with certifications by manufacturers for maintaining the original qualifications of the equipment of destination. The main reasons are, on the top of the logical evolution of technology, applied to the new manufactured components, the quitting of nuclear specific production lines and the evolution of manufacturers quality systems, originally based on nuclear codes and standards, to conventional industry standards. To face this problem, for many years different Dedication processes have been implemented to verify whether a commercial grade element is acceptable to be used in safety related applications. In the same way, due to our particular position regarding the spare part supplies, mainly from markets others than the american, C.N. Trillo has developed a methodology called Spare Items Validation. This methodology, which is originally based on dedication processes, is not a single process but a group of coordinated processes involving engineering, quality and management activities. These are to be performed on the spare item itself, its design control, its fabrication and its supply for allowing its use in destinations with specific requirements. The scope of application is not only focussed on safety related items, but also to complex design, high cost or plant reliability related components. The implementation in C.N. Trillo has been mainly curried out by merging, modifying and making the most of processes and activities which were already being performed in the company. (Author)

  4. Optimal classifier selection and negative bias in error rate estimation: an empirical study on high-dimensional prediction

    Directory of Open Access Journals (Sweden)

    Boulesteix Anne-Laure

    2009-12-01

    Full Text Available Abstract Background In biometric practice, researchers often apply a large number of different methods in a "trial-and-error" strategy to get as much as possible out of their data and, due to publication pressure or pressure from the consulting customer, present only the most favorable results. This strategy may induce a substantial optimistic bias in prediction error estimation, which is quantitatively assessed in the present manuscript. The focus of our work is on class prediction based on high-dimensional data (e.g. microarray data, since such analyses are particularly exposed to this kind of bias. Methods In our study we consider a total of 124 variants of classifiers (possibly including variable selection or tuning steps within a cross-validation evaluation scheme. The classifiers are applied to original and modified real microarray data sets, some of which are obtained by randomly permuting the class labels to mimic non-informative predictors while preserving their correlation structure. Results We assess the minimal misclassification rate over the different variants of classifiers in order to quantify the bias arising when the optimal classifier is selected a posteriori in a data-driven manner. The bias resulting from the parameter tuning (including gene selection parameters as a special case and the bias resulting from the choice of the classification method are examined both separately and jointly. Conclusions The median minimal error rate over the investigated classifiers was as low as 31% and 41% based on permuted uninformative predictors from studies on colon cancer and prostate cancer, respectively. We conclude that the strategy to present only the optimal result is not acceptable because it yields a substantial bias in error rate estimation, and suggest alternative approaches for properly reporting classification accuracy.

  5. Correction of bias in belt transect studies of immotile objects

    Science.gov (United States)

    Anderson, D.R.; Pospahala, R.S.

    1970-01-01

    Unless a correction is made, population estimates derived from a sample of belt transects will be biased if a fraction of, the individuals on the sample transects are not counted. An approach, useful for correcting this bias when sampling immotile populations using transects of a fixed width, is presented. The method assumes that a searcher's ability to find objects near the center of the transect is nearly perfect. The method utilizes a mathematical equation, estimated from the data, to represent the searcher's inability to find all objects at increasing distances from the center of the transect. An example of the analysis of data, formation of the equation, and application is presented using waterfowl nesting data collected in Colorado.

  6. Immediacy bias in social-emotional comparisons.

    Science.gov (United States)

    White, Katherine; Van Boven, Leaf

    2012-08-01

    In seven studies of naturally occurring, "real-world" emotional events, people demonstrated an immediacy bias in social-emotional comparisons, perceiving their own current or recent emotional reactions as more intense compared with others' emotional reactions to the same events. The events examined include crossing a scary bridge (study 1a), a national tragedy (study 1b), terrorist attacks (studies 2a and 3b), a natural disaster (study 2b), and a presidential election (study 3b). These perceived differences between one's own and others' emotions declined over time, as relatively immediate and recent emotions subsided, a pattern that people were not intuitively aware of (study 2c). This immediacy bias in social-emotional comparisons emerged for both explicit comparisons (studies 1a, 1b, and 3b), and for absolute judgments of emotional intensity (studies 2a, 2b, and 3a). Finally, the immediacy bias in social-emotional comparisons was reduced when people were reminded that emotional display norms might lead others' appearances to understate emotional intensity (studies 3a and 3b). Implications of these findings for social-emotional phenomena are discussed.

  7. Intentional Forgetting Reduces Color-Naming Interference: Evidence from Item-Method Directed Forgetting

    Science.gov (United States)

    Lee, Yuh-shiow; Lee, Huang-mou; Fawcett, Jonathan M.

    2013-01-01

    In an item-method-directed forgetting task, Chinese words were presented individually, each followed by an instruction to remember or forget. Colored probe items were presented following each memory instruction requiring a speeded color-naming response. Half of the probe items were novel and unrelated to the preceding study item, whereas the…

  8. Using Localized Survey Items to Augment Standardized Benchmarking Measures: A LibQUAL+[TM] Study

    Science.gov (United States)

    Thompson, Bruce; Cook, Colleen; Kyrillidou, Martha

    2006-01-01

    The LibQUAL+[TM] protocol solicits open-ended comments from users with regard to library service quality, gathers data on 22 core items, and, at the option of individual libraries, also garners ratings on five items drawn from a pool of more than 100 choices selected by libraries. In this article, the relationship of scores on these locally…

  9. Difference in method of administration did not significantly impact item response

    DEFF Research Database (Denmark)

    Bjorner, Jakob B; Rose, Matthias; Gandek, Barbara

    2014-01-01

    assistant (PDA), or personal computer (PC) on the Internet, and a second form by PC, in the same administration. Structural invariance, equivalence of item responses, and measurement precision were evaluated using confirmatory factor analysis and item response theory methods. RESULTS: Multigroup...... levels in IVR, PQ, or PDA administration as compared to PC. Availability of large item response theory-calibrated PROMIS item banks allowed for innovations in study design and analysis.......PURPOSE: To test the impact of method of administration (MOA) on the measurement characteristics of items developed in the Patient-Reported Outcomes Measurement Information System (PROMIS). METHODS: Two non-overlapping parallel 8-item forms from each of three PROMIS domains (physical function...

  10. Item Analysis in Introductory Economics Testing.

    Science.gov (United States)

    Tinari, Frank D.

    1979-01-01

    Computerized analysis of multiple choice test items is explained. Examples of item analysis applications in the introductory economics course are discussed with respect to three objectives: to evaluate learning; to improve test items; and to help improve classroom instruction. Problems, costs and benefits of the procedures are identified. (JMD)

  11. Internet-based cognitive bias modification for obsessive compulsive disorder: study protocol for a randomized controlled trial.

    Science.gov (United States)

    Williams, Alishia D; Pajak, Rosanna; O'Moore, Kathleen; Andrews, Gavin; Grisham, Jessica R

    2014-05-29

    Cognitive bias modification (CBM) interventions have demonstrated efficacy in augmenting core biases implicated in psychopathology. The current randomized controlled trial (RCT) will evaluate the efficacy of an internet-delivered positive imagery cognitive bias modification intervention for obsessive compulsive disorder (OCD) when compared to a control condition. Patients meeting diagnostic criteria for a current or lifetime diagnosis of OCD will be recruited via the research arm of a not-for-profit clinical and research unit in Australia. The minimum sample size for each group (alpha set at 0.05, power at .80) was identified as 29, but increased to 35 to allow for 20% attrition. We will measure the impact of CBM on interpretations bias using the OC Bias Measure (The Ambiguous Scenarios Test for OCD ;AST-OCD) and OC-beliefs (The Obsessive Beliefs Questionnaire-TRIP; OBQ-TRIP). Secondary outcome measures include the Dimensional Obsessive-Compulsive Scale (DOCS), the Patient Health Questionnaire (PHQ-9), The Kessler Psychological Distress Scale (K10), and the Word Sentence Association Test for OCD (WSAO). Change in diagnostic status will be indexed using the OCD Mini International Neuropsychiatric Interview (M.I.N.I) Module at baseline and follow-up. Intent-to-treat (ITT) marginal and mixed-effect models using restricted maximum likelihood (REML) estimation will be used to evaluate the primary hypotheses. Stability of bias change will be assessed at 1-month follow-up. A limitation of the online nature of the study is the inability to include a behavioral outcome measure. The trial was registered on 10 October 2013 with the Australian New Zealand Clinical Trials Registry (ACTRN12613001130752).

  12. Imputation across genotyping arrays for genome-wide association studies: assessment of bias and a correction strategy.

    Science.gov (United States)

    Johnson, Eric O; Hancock, Dana B; Levy, Joshua L; Gaddis, Nathan C; Saccone, Nancy L; Bierut, Laura J; Page, Grier P

    2013-05-01

    A great promise of publicly sharing genome-wide association data is the potential to create composite sets of controls. However, studies often use different genotyping arrays, and imputation to a common set of SNPs has shown substantial bias: a problem which has no broadly applicable solution. Based on the idea that using differing genotyped SNP sets as inputs creates differential imputation errors and thus bias in the composite set of controls, we examined the degree to which each of the following occurs: (1) imputation based on the union of genotyped SNPs (i.e., SNPs available on one or more arrays) results in bias, as evidenced by spurious associations (type 1 error) between imputed genotypes and arbitrarily assigned case/control status; (2) imputation based on the intersection of genotyped SNPs (i.e., SNPs available on all arrays) does not evidence such bias; and (3) imputation quality varies by the size of the intersection of genotyped SNP sets. Imputations were conducted in European Americans and African Americans with reference to HapMap phase II and III data. Imputation based on the union of genotyped SNPs across the Illumina 1M and 550v3 arrays showed spurious associations for 0.2 % of SNPs: ~2,000 false positives per million SNPs imputed. Biases remained problematic for very similar arrays (550v1 vs. 550v3) and were substantial for dissimilar arrays (Illumina 1M vs. Affymetrix 6.0). In all instances, imputing based on the intersection of genotyped SNPs (as few as 30 % of the total SNPs genotyped) eliminated such bias while still achieving good imputation quality.

  13. Sources of CAM3 vorticity bias during northern winter from diagnostic study of the vorticity equation

    Energy Technology Data Exchange (ETDEWEB)

    Grotjahn, Richard [University of California, Department of Land, Air and Water Resources, Davis, CA (United States); Pan, Lin-Lin; Tribbia, Joseph [National Center for Atmospheric Research, Boulder, CO (United States)

    2011-06-15

    CAM3 (Community Atmosphere Model version 3) simulation bias is diagnosed using the vorticity equation. The study compares CAM3 output with ECMWF (European Centre for Medium-Range Weather Forecasts) 40 year reanalysis (ERA-40) data. A time mean vorticity bias equation is also formulated and the terms are grouped into categories: linear terms, nonlinear terms, transient contributions, and friction (calculated as a residual). Frontal cyclone storms have much weaker band passed kinetic energy and enstrophy in CAM3. The downstream end of the North Atlantic storm track (NAST) has large location error. While the vorticity equation terms have similar amplitude ranking in CAM3 and ERA-40 at upper levels, the ranking differs notably in the lower troposphere. The linear and friction terms dominate the vorticity bias equation. The transient terms contribute along the storm track, but the nonlinear terms are generally much smaller, with the primary exception being over the Iberian peninsula. Friction is much stronger in CAM3. As evidence, nearly all wavelengths (including the longest planetary waves) have smaller amplitude in CAM3 than in ERA-40 vorticity data. Negative near surface vorticity tendency bias on the European side of the Arctic is linked to the NAST track error (evident in the divergence term). CAM3 misses the Beaufort high in sea level pressure (SLP) due to low level warm temperature bias, too little vortex compression, and to too little horizontal advection of negative vorticity compared with ERA-40. Generally lower SLP values in CAM3 over the entire Arctic follow from lower level warm bias in CAM3. (orig.)

  14. Applying Item Response Theory methods to design a learning progression-based science assessment

    Science.gov (United States)

    Chen, Jing

    Learning progressions are used to describe how students' understanding of a topic progresses over time and to classify the progress of students into steps or levels. This study applies Item Response Theory (IRT) based methods to investigate how to design learning progression-based science assessments. The research questions of this study are: (1) how to use items in different formats to classify students into levels on the learning progression, (2) how to design a test to give good information about students' progress through the learning progression of a particular construct and (3) what characteristics of test items support their use for assessing students' levels. Data used for this study were collected from 1500 elementary and secondary school students during 2009--2010. The written assessment was developed in several formats such as the Constructed Response (CR) items, Ordered Multiple Choice (OMC) and Multiple True or False (MTF) items. The followings are the main findings from this study. The OMC, MTF and CR items might measure different components of the construct. A single construct explained most of the variance in students' performances. However, additional dimensions in terms of item format can explain certain amount of the variance in student performance. So additional dimensions need to be considered when we want to capture the differences in students' performances on different types of items targeting the understanding of the same underlying progression. Items in each item format need to be improved in certain ways to classify students more accurately into the learning progression levels. This study establishes some general steps that can be followed to design other learning progression-based tests as well. For example, first, the boundaries between levels on the IRT scale can be defined by using the means of the item thresholds across a set of good items. Second, items in multiple formats can be selected to achieve the information criterion at all

  15. Item Response Theory Analysis of the Psychopathic Personality Inventory-Revised.

    Science.gov (United States)

    Eichenbaum, Alexander E; Marcus, David K; French, Brian F

    2017-06-01

    This study examined item and scale functioning in the Psychopathic Personality Inventory-Revised (PPI-R) using an item response theory analysis. PPI-R protocols from 1,052 college student participants (348 male, 704 female) were analyzed. Analyses were conducted on the 131 self-report items comprising the PPI-R's eight content scales, using a graded response model. Scales collected a majority of their information about respondents possessing higher than average levels of the traits being measured. Each scale contained at least some items that evidenced limited ability to differentiate between respondents with differing levels of the trait being measured. Moreover, 80 items (61.1%) yielded significantly different responses between men and women presumably possessing similar levels of the trait being measured. Item performance was also influenced by the scoring format (directly scored vs. reverse-scored) of the items. Overall, the results suggest that the PPI-R, despite identifying psychopathic personality traits in individuals possessing high levels of those traits, may not identify these traits equally well for men and women, and scores are likely influenced by the scoring format of the individual item and scale.

  16. Randomized clinical trials in dentistry: Risks of bias, risks of random errors, reporting quality, and methodologic quality over the years 1955-2013.

    Directory of Open Access Journals (Sweden)

    Humam Saltaji

    Full Text Available To examine the risks of bias, risks of random errors, reporting quality, and methodological quality of randomized clinical trials of oral health interventions and the development of these aspects over time.We included 540 randomized clinical trials from 64 selected systematic reviews. We extracted, in duplicate, details from each of the selected randomized clinical trials with respect to publication and trial characteristics, reporting and methodologic characteristics, and Cochrane risk of bias domains. We analyzed data using logistic regression and Chi-square statistics.Sequence generation was assessed to be inadequate (at unclear or high risk of bias in 68% (n = 367 of the trials, while allocation concealment was inadequate in the majority of trials (n = 464; 85.9%. Blinding of participants and blinding of the outcome assessment were judged to be inadequate in 28.5% (n = 154 and 40.5% (n = 219 of the trials, respectively. A sample size calculation before the initiation of the study was not performed/reported in 79.1% (n = 427 of the trials, while the sample size was assessed as adequate in only 17.6% (n = 95 of the trials. Two thirds of the trials were not described as double blinded (n = 358; 66.3%, while the method of blinding was appropriate in 53% (n = 286 of the trials. We identified a significant decrease over time (1955-2013 in the proportion of trials assessed as having inadequately addressed methodological quality items (P < 0.05 in 30 out of the 40 quality criteria, or as being inadequate (at high or unclear risk of bias in five domains of the Cochrane risk of bias tool: sequence generation, allocation concealment, incomplete outcome data, other sources of bias, and overall risk of bias.The risks of bias, risks of random errors, reporting quality, and methodological quality of randomized clinical trials of oral health interventions have improved over time; however, further efforts that contribute to the development of more stringent

  17. Transmission of Cognitive Bias and Fear From Parents to Children : An Experimental Study

    NARCIS (Netherlands)

    Remmerswaal, Danielle; Muris, Peter; Huijding, Jorg|info:eu-repo/dai/nl/292646976

    2016-01-01

    This study explored the role of parents in the development of a cognitive bias and subsequent fear levels in children. In Experiment 1, nonclinical children ages 8–13 (N = 122) underwent a training during which they worked together with their mothers on an information search task. Mothers received

  18. Students' gender bias in teaching evaluations

    Directory of Open Access Journals (Sweden)

    Narissra Punyanunt-Carter

    2015-09-01

    Full Text Available The goal of this study was to investigate if there is gender bias in student evaluations. Researchers administered a modified version of the teacher evaluation forms to 58 students (male=30; female=28 in a basic introductory communications class. Half the class was instructed to fill out the survey about a male professor, and the other half a female professor. Researchers broke down the evaluation results question by question in order to give a detailed account of the findings. Results revealed that there is certainly some gender bias at work when students evaluate their instructors. It was also found that gender bias does not significantly affect the evaluations. The results align with other findings in the available literature, which point to some sort of pattern regarding gender bias in evaluations, but it still seems to be inconsequential.  DOI: 10.18870/hlrc.v5i3.234

  19. A Comparison of the 27-Item and 12-Item Intolerance of Uncertainty Scales

    Science.gov (United States)

    Khawaja, Nigar G.; Yu, Lai Ngo Heidi

    2010-01-01

    The 27-item Intolerance of Uncertainty Scale (IUS) has become one of the most frequently used measures of Intolerance of Uncertainty. More recently, an abridged, 12-item version of the IUS has been developed. The current research used clinical (n = 50) and non-clinical (n = 56) samples to examine and compare the psychometric properties of both…

  20. Use of bias correction techniques to improve seasonal forecasts for reservoirs - A case-study in northwestern Mediterranean.

    Science.gov (United States)

    Marcos, Raül; Llasat, Ma Carmen; Quintana-Seguí, Pere; Turco, Marco

    2018-01-01

    In this paper, we have compared different bias correction methodologies to assess whether they could be advantageous for improving the performance of a seasonal prediction model for volume anomalies in the Boadella reservoir (northwestern Mediterranean). The bias correction adjustments have been applied on precipitation and temperature from the European Centre for Middle-range Weather Forecasting System 4 (S4). We have used three bias correction strategies: two linear (mean bias correction, BC, and linear regression, LR) and one non-linear (Model Output Statistics analogs, MOS-analog). The results have been compared with climatology and persistence. The volume-anomaly model is a previously computed Multiple Linear Regression that ingests precipitation, temperature and in-flow anomaly data to simulate monthly volume anomalies. The potential utility for end-users has been assessed using economic value curve areas. We have studied the S4 hindcast period 1981-2010 for each month of the year and up to seven months ahead considering an ensemble of 15 members. We have shown that the MOS-analog and LR bias corrections can improve the original S4. The application to volume anomalies points towards the possibility to introduce bias correction methods as a tool to improve water resource seasonal forecasts in an end-user context of climate services. Particularly, the MOS-analog approach gives generally better results than the other approaches in late autumn and early winter. Copyright © 2017 Elsevier B.V. All rights reserved.