WorldWideScience

Sample records for internal consistency item

  1. Internal consistency of a five-item form of the Francis Scale of Attitude Toward Christianity among adolescent students.

    Science.gov (United States)

    Campo-Arias, Adalberto; Oviedo, Heidi Celina; Cogollo, Zuleima

    2009-04-01

    The short form of the Francis Scale of Attitude Toward Christianity (L. J. Francis, 1992) is a 7-item Likert-type scale that shows high homogeneity among adolescents. The psychometric performance of a shorter version of this scale has not been explored. The authors aimed to determine the internal consistency of a 5-item form of the Francis Scale of Attitude Toward Christianity among 405 students from a school in Cartagena, Colombia. The authors computed the Cronbach's alpha coefficient for the 5 items with a greater corrected item-total punctuation correlation. The version without Items 2 and 7 showed internal consistency of .87. The 5-item version of the Francis Scale of Attitude Toward Christianity exhibited higher internal consistency than did the 7-item version. Future researchers should corroborate this finding.

  2. 26 CFR 301.6222(a)-1 - Consistent treatment of partnership items.

    Science.gov (United States)

    2010-04-01

    ... 26 Internal Revenue 18 2010-04-01 2010-04-01 false Consistent treatment of partnership items. 301... Consistent treatment of partnership items. (a) In general. The treatment of a partnership item on the partner's return must be consistent with the treatment of that item by the partnership on the partnership...

  3. Five-Item Francis Scale of Attitude toward Christianity: Construct and Nomological Validity and Internal Consistency among Colombian College Students

    Science.gov (United States)

    Ceballos, Guillermo A.; Suescun, Jesus D.; Oviedo, Heidi C.; Herazo, Edwin; Campo-Arias, Adalberto

    2015-01-01

    The Spanish version of the five-item Francis scale of attitude toward Christianity is a refinement of the short version of the Francis scale of attitude toward Christianity. The scale is a good measurement for intrinsic religiosity. It has been applied previously among Colombian adolescent students. The internal consistency and construct and…

  4. Sixteen-item Anxiety Sensitivity Index: Confirmatory factor analytic evidence, internal consistency, and construct validity in a young adult sample from the Netherlands

    NARCIS (Netherlands)

    Vujanovic, Anka A.; Arrindell, Willem A.; Bernstein, Amit; Norton, Peter J.; Zvolensky, Michael J.

    The present investigation examined the factor structure, internal consistency, and construct validity of the 16-item Anxiety Sensitivity Index (ASI; Reiss Peterson, Gursky, & McNally 1986) in a young adult sample (n = 420)from the Netherlands. Confirmatory factor analysis was used to comparatively

  5. Cross- cultural validation of the Brazilian Portuguese version of the Social Phobia Inventory (SPIN): study of the items and internal consistency.

    Science.gov (United States)

    Osório, Flávia de Lima; Crippa, José Alexandre S; Loureiro, Sonia Regina

    2009-03-01

    The objective of the present study was to carry out the cross- cultural validation for Brazilian Portuguese of the Social Phobia Inventory, an instrument for the evaluation of fear, avoidance and physiological symptoms associated with social anxiety disorder. The process of translation and adaptation involved four bilingual professionals, appreciation and approval of the back- translation by the authors of the original scale, a pilot study with 30 Brazilian university students, and appreciation by raters who confirmed the face validity of the Portuguese version, which was named ' Inventário de Fobia Social' . As part of the psychometric study of the Social Phobia Inventory, analysis of the items and evaluation of the internal consistency of the instrument were performed in a study conducted on 2314 university students. The results demonstrated that item 11, related to the fear of public speaking, was the most frequently scored item. The correlation of the items with the total score was quite adequate, ranging from 0.44 to 0.71, as was the internal consistency, which ranged from 0.71 to 0.90. The authors conclude that the Brazilian Portuguese version of the Social Phobia Inventory proved to be adequate regarding the psychometric properties initially studied, with qualities quite close to those of the original study. Studies that will evaluate the remaining indicators of validity of the Social Phobia Inventory in clinical and non-clinical samples are considered to be opportune and necessary.

  6. Item wording and internal consistency of a measure of cohesion: the group environment questionnaire.

    Science.gov (United States)

    Eys, Mark A; Carron, Albert V; Bray, Steven R; Brawley, Lawrence R

    2007-06-01

    A common practice for counteracting response acquiescence in psychological measures has been to employ both negatively and positively worded items. However, previous research has highlighted that the reliability of measures can be affected by this practice (Spector, 1992). The purpose of the present study was to examine the effect that the presence of negatively worded items has on the internal reliability of the Group Environment Questionnaire (GEQ). Two samples (N = 276) were utilized, and participants were asked to complete the GEQ (original and revised) on separate occasions. Results demonstrated that the revised questionnaire (containing all positively worded items) had significantly higher Cronbach alpha values for three of the four dimensions of the GEQ. Implications, alternatives, and future directions are discussed.

  7. Validity and internal consistency of a whiplash-specific disability measure.

    Science.gov (United States)

    Pinfold, Melanie; Niere, Ken R; O'Leary, Elizabeth F; Hoving, Jan Lucas; Green, Sally; Buchbinder, Rachelle

    2004-02-01

    Cross-sectional study of patients with whiplash-associated disorders investigating the internal consistency, factor structure, response rates, and presence of floor and ceiling effects of the Whiplash Disability Questionnaire (WDQ). The aim of this study was to confirm the appropriateness of the proposed WDQ items. Whiplash injuries are a common cause of pain and disability after motor vehicle accidents. Neck disability questionnaires are often used in whiplash studies to assess neck pain but lack content validity for patients with whiplash-associated disorders. The newly developed WDQ measures functional limitations associated with whiplash injury and was designed after interviews with 83 patients with whiplash in a previous study. Researchers sought expert opinion on items of the WDQ, and items were then tested on a clinical whiplash population. Data were inspected to determine floor and ceiling effects, response rates, factor structure, and internal consistency. Packages of questionnaires were distributed to 55 clinicians, whose patients with whiplash completed and returned 101 questionnaires to researchers. No substantial floor or ceiling effects were identified on inspection of data. The overall floor effect was 12%, and the overall ceiling effect was 4%. Principal component analysis identified one broad factor that accounted for 65% of the variance in responses. Internal consistency was high; Cronbach's alpha = 0.96. Results of the study supported the retention of the 13 proposed items in a whiplash-specific disability questionnaire. Dependent on the results of further psychometric testing, the WDQ is likely to be an appropriate outcome measure for patients with whiplash.

  8. Factor structure and internal consistency of the 12-item General Health Questionnaire (GHQ-12 and the Subjective Vitality Scale (VS, and the relationship between them: a study from France

    Directory of Open Access Journals (Sweden)

    Ismaïl Amany

    2009-03-01

    Full Text Available Abstract Background The objectives of this study were to test the factor structure and internal consistency of the 12-item General Health Questionnaire (GHQ-12 and the Subjective Vitality Scale (VS in elderly French people, and to test the relationship between these two questionnaires. Methods Using a standard 'forward-backward' translation procedure, the English language versions of the two instruments (i.e. the 12-item General Health Questionnaire and the Subjective Vitality Scale were translated into French. A sample of adults aged 58–72 years then completed both questionnaires. Internal consistency was assessed by Cronbach's alpha coefficient. The factor structures of the two instruments were extracted by confirmatory factor analysis (CFA. Finally, the relationship between the two instruments was assessed by correlation analysis. Results In all, 217 elderly adults participated in the study. The mean age of the respondents was 61.7 (SD = 6.2 years. The mean GHQ-12 score was 17.4 (SD = 8.0, and analysis showed satisfactory internal consistency (Cronbach's alpha coefficient = 0.78. The mean VS score was 22.4 (SD = 7.4 and its internal consistency was found to be good (Cronbach's alpha coefficient = 0.83. While CFA showed that the VS was uni-dimensional, analysis for the GHQ-12 demonstrated a good fit not only to the two-factor model (positive vs. negative items but also to a three-factor model. As expected, there was a strong and significant negative correlation between the GHQ-12 and the VS (r = -0.71, P Conclusion The results showed that the French versions of the 12-item General Health Questionnaire (GHQ-12 and the Subjective Vitality Scale (VS are reliable measures of psychological distress and vitality. They also confirm a significant negative correlation between these two instruments, lending support to their convergent validity in an elderly French population. The findings indicate that both measures have good structural

  9. Psychometrics and the neuroscience of individual differences: Internal consistency limits between-subjects effects.

    Science.gov (United States)

    Hajcak, Greg; Meyer, Alexandria; Kotov, Roman

    2017-08-01

    In the clinical neuroscience literature, between-subjects differences in neural activity are presumed to reflect reliable measures-even though the psychometric properties of neural measures are almost never reported. The current article focuses on the critical importance of assessing and reporting internal consistency reliability-the homogeneity of "items" that comprise a neural "score." We demonstrate how variability in the internal consistency of neural measures limits between-subjects (i.e., individual differences) effects. To this end, we utilize error-related brain activity (i.e., the error-related negativity or ERN) in both healthy and generalized anxiety disorder (GAD) participants to demonstrate options for psychometric analyses of neural measures; we examine between-groups differences in internal consistency, between-groups effect sizes, and between-groups discriminability (i.e., ROC analyses)-all as a function of increasing items (i.e., number of trials). Overall, internal consistency should be used to inform experimental design and the choice of neural measures in individual differences research. The internal consistency of neural measures is necessary for interpreting results and guiding progress in clinical neuroscience-and should be routinely reported in all individual differences studies. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  10. The Iranian version of 12-item Short Form Health Survey (SF-12): factor structure, internal consistency and construct validity.

    Science.gov (United States)

    Montazeri, Ali; Vahdaninia, Mariam; Mousavi, Sayed Javad; Omidvari, Speideh

    2009-09-16

    The 12-item Short Form Health Survey (SF-12) as a shorter alternative of the SF-36 is largely used in health outcomes surveys. The aim of this study was to validate the SF-12 in Iran. A random sample of the general population aged 15 years and over living in Tehran, Iran completed the SF-12. Reliability was estimated using internal consistency and validity was assessed using known groups comparison and convergent validity. In addition, the factor structure of the questionnaire was extracted by performing both exploratory factor analysis (EFA) and confirmatory factor analysis (CFA). In all, 5587 individuals were studied (2721 male and 2866 female). The mean age and formal education of the respondents were 35.1 (SD = 15.4) and 10.2 (SD = 4.4) years respectively. The results showed satisfactory internal consistency for both summary measures, that are the Physical Component Summary (PCS) and the Mental Component Summary (MCS); Cronbach's alpha for PCS-12 and MCS-12 was 0.73 and 0.72, respectively. Known-groups comparison showed that the SF-12 discriminated well between men and women and those who differed in age and educational status (P < 0.001). In addition, correlations between the SF-12 scales and single items showed that the physical functioning, role physical, bodily pain and general health subscales correlated higher with the PCS-12 score, while the vitality, social functioning, role emotional and mental health subscales more correlated with the MCS-12 score lending support to its good convergent validity. Finally the principal component analysis indicated a two-factor structure (physical and mental health) that jointly accounted for 57.8% of the variance. The confirmatory factory analysis also indicated a good fit to the data for the two-latent structure (physical and mental health). In general the findings suggest that the SF-12 is a reliable and valid measure of health related quality of life among Iranian population. However, further studies are needed to

  11. A dynamic Thurstonian item response theory of motive expression in the picture story exercise: solving the internal consistency paradox of the PSE.

    Science.gov (United States)

    Lang, Jonas W B

    2014-07-01

    The measurement of implicit or unconscious motives using the picture story exercise (PSE) has long been a target of debate in the psychological literature. Most debates have centered on the apparent paradox that PSE measures of implicit motives typically show low internal consistency reliability on common indices like Cronbach's alpha but nevertheless predict behavioral outcomes. I describe a dynamic Thurstonian item response theory (IRT) model that builds on dynamic system theories of motivation, theorizing on the PSE response process, and recent advancements in Thurstonian IRT modeling of choice data. To assess the models' capability to explain the internal consistency paradox, I first fitted the model to archival data (Gurin, Veroff, & Feld, 1957) and then simulated data based on bias-corrected model estimates from the real data. Simulation results revealed that the average squared correlation reliability for the motives in the Thurstonian IRT model was .74 and that Cronbach's alpha values were similar to the real data (value of extant evidence from motivational research using PSE motive measures. (c) 2014 APA, all rights reserved.

  12. The Iranian version of 12-item Short Form Health Survey (SF-12: factor structure, internal consistency and construct validity

    Directory of Open Access Journals (Sweden)

    Mousavi Sayed

    2009-09-01

    Full Text Available Abstract Background The 12-item Short Form Health Survey (SF-12 as a shorter alternative of the SF-36 is largely used in health outcomes surveys. The aim of this study was to validate the SF-12 in Iran. Methods A random sample of the general population aged 15 years and over living in Tehran, Iran completed the SF-12. Reliability was estimated using internal consistency and validity was assessed using known groups comparison and convergent validity. In addition, the factor structure of the questionnaire was extracted by performing both exploratory factor analysis (EFA and confirmatory factor analysis (CFA. Results: In all, 5587 individuals were studied (2721 male and 2866 female. The mean age and formal education of the respondents were 35.1 (SD = 15.4 and 10.2 (SD = 4.4 years respectively. The results showed satisfactory internal consistency for both summary measures, that are the Physical Component Summary (PCS and the Mental Component Summary (MCS; Cronbach's α for PCS-12 and MCS-12 was 0.73 and 0.72, respectively. Known-groups comparison showed that the SF-12 discriminated well between men and women and those who differed in age and educational status (P Conclusion In general the findings suggest that the SF-12 is a reliable and valid measure of health related quality of life among Iranian population. However, further studies are needed to establish stronger psychometric properties for this alternative form of the SF-36 Health Survey in Iran.

  13. WOrk-Related Questionnaire for UPper extremity disorders (WORQ-UP): Factor Analysis and Internal Consistency.

    Science.gov (United States)

    Aerts, Bas R; Kuijer, P Paul; Beumer, Annechien; Eygendaal, Denise; Frings-Dresen, Monique H

    2018-04-17

    To test a 17-item questionnaire, the WOrk-Related Questionnaire for UPper extremity disorders (WORQ-UP), for dimensionality of the items (factor analysis) and internal consistency. Cross-sectional study. Outpatient clinic. A consecutive sample of patients (N=150) consisting of all new referral patients (either from a general physician or other hospital) who visited the orthopedic outpatient clinic because of an upper extremity musculoskeletal disorder. Not applicable. Number and dimensionality of the factors in the WORQ-UP. Four factors with eigenvalues (EVs) >1.0 were found. The factors were named exertion, dexterity, tools & equipment, and mobility. The EVs of the factors were, respectively, 5.78, 2.38, 1.81, and 1.24. The factors together explained 65.9% of the variance. The Cronbach alpha values for these factors were, respectively, .88, .74, .87, and .66. The 17 items of the WORQ-UP resemble 4 factors-exertion, dexterity, tools & equipment, and mobility-with a good internal consistency. Copyright © 2018 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.

  14. A diagnostic test for apraxia in stroke patients: internal consistency and diagnostic value.

    NARCIS (Netherlands)

    Heugten, C.M. van; Dekker, J.; Deelman, B.G.; Stehmann-Saris, F.C.; Kinebanian, A.

    1999-01-01

    The internal consistency and the diagnostic value of a test for apraxia in patients having had a stroke are presented. Results indicate that the items of the test form a strong and consistent scale: Cronbach's alpha as well as the results of a Mokken scale analysis present good reliability and good

  15. International Semiotics: Item Difficulty and the Complexity of Science Item Illustrations in the PISA-2009 International Test Comparison

    Science.gov (United States)

    Solano-Flores, Guillermo; Wang, Chao; Shade, Chelsey

    2016-01-01

    We examined multimodality (the representation of information in multiple semiotic modes) in the context of international test comparisons. Using Program of International Student Assessment (PISA)-2009 data, we examined the correlation of the difficulty of science items and the complexity of their illustrations. We observed statistically…

  16. Internal consistency & validity of Indian Disability Evaluation and Assessment Scale (IDEAS in patients with schizophrenia

    Directory of Open Access Journals (Sweden)

    Sandeep Grover

    2014-01-01

    Full Text Available Background & objectives: The Indian Disability Evaluation and Assessment Scale (IDEAS has been recommended for assessment and certification of disability by the Government of India (GOI. However, the psychometric properties of IDEAS as adopted by GOI remain understudied. Our aim, thus, was to study the internal consistency and validity of IDEAS in patients with schizophrenia. Methods: A total of 103 consenting patients with residual schizophrenia were assessed for disability, quality of life (QOL and psychopathology using the IDEAS, WHO QOL-100 and Positive and Negative symptom scale (PANSS respectively. Internal consistency was calculated using Cronbach′s alpha. For construct validity, relations between IDEAS, and psychopathology and QOL were studied. Results: The inter-item correlations for IDEAS were significant with a Cronbach′s alpha of 0.721. All item scores other than score on communication and understanding; total and global IDEAS scores correlated significantly with the positive, negative and general sub-scales, and total PANSS scores. Communication and understanding was significantly related to negative sub-scale score only. Total and global disability scores correlated negatively with all the domains of WHOQOL-100 (ρ<0.01. The individual IDEAS item scores correlated negatively with various WHOQOL-100 domains (ρ0< 0.01. Interpretation & conclusions: This study findings showed that the GOI-modified IDEAS had good internal consistency and construct validity as tested in patients with residual schizophrenia. Similar studies need to be done with other groups of patients.

  17. Test of Gross Motor Development : Expert Validity, confirmatory validity and internal consistence

    Directory of Open Access Journals (Sweden)

    Nadia Cristina Valentini

    2008-12-01

    Full Text Available The Test of Gross Motor Development (TGMD-2 is an instrument used to evaluate children’s level of motordevelopment. The objective of this study was to translate and verify the clarity and pertinence of the TGMD-2 items by expertsand the confirmatory factorial validity and the internal consistence by means of test-retest of the Portuguese TGMD-2. Across-cultural translation was used to construct the Portuguese version. The participants of this study were 7 professionalsand 587 children, from 27 schools (kindergarten and elementary from 3 to 10 years old (51.1% boys and 48.9% girls.Each child was videotaped performing the test twice. The videotaped tests were then scored. The results indicated thatthe Portuguese version of the TGMD-2 contains clear and pertinent motor items; demonstrated satisfactory indices ofconfirmatory factorial validity (χ2/gl = 3.38; Goodness-of-fit Index = 0.95; Adjusted Goodness-of-fit index = 0.92 and Tuckerand Lewis’s Index of Fit = 0.83 and test-retest internal consistency (locomotion r = 0.82; control of object: r = 0.88. ThePortuguese TGMD-2 demonstrated validity and reliability for the sample investigated.

  18. Test of Gross Motor Development: expert validity, confirmatory validity and internal consistence

    Directory of Open Access Journals (Sweden)

    Nadia Cristina Valentini

    2008-01-01

    The Test of Gross Motor Development (TGMD-2 is an instrument used to evaluate children’s level of motor development. The objective of this study was to translate and verify the clarity and pertinence of the TGMD-2 items by experts and the confirmatory factorial validity and the internal consistence by means of test-retest of the Portuguese TGMD-2. A cross-cultural translation was used to construct the Portuguese version. The participants of this study were 7 professionals and 587 children, from 27 schools (kindergarten and elementary from 3 to 10 years old (51.1% boys and 48.9% girls. Each child was videotaped performing the test twice. The videotaped tests were then scored. The results indicated that the Portuguese version of the TGMD-2 contains clear and pertinent motor items; demonstrated satisfactory indices of confirmatory factorial validity (÷2/gl = 3.38; Goodness-of-fit Index = 0.95; Adjusted Goodness-of-fit index = 0.92 and Tucker and Lewis’s Index of Fit = 0.83 and test-retest internal consistency (locomotion r = 0.82; control of object: r = 0.88. The Portuguese TGMD-2 demonstrated validity and reliability for the sample investigated.

  19. Internal consistency, reliability, and temporal stability of the Oxford Happiness Questionnaire short-form: Test-retest data over two weeks

    OpenAIRE

    MCGUCKIN, CONOR

    2006-01-01

    PUBLISHED The Oxford Happiness Questionnaire short-form is a recently developed eight-item measure of happiness. This study evaluated the internal consistency reliability and test-retest reliability of the Oxford Happiness Questionnaire short-form among 55 Northern Irish undergraduate university students who completed the measure on two occasions separated by two weeks. Internal consistency of the measure on both occasions was satisfactory at both Time 1 (alpha = .62) and Time 2 (alpha = ....

  20. Online self-report questionnaire on computer work-related exposure (OSCWE): validity and internal consistency.

    Science.gov (United States)

    Mekhora, Keerin; Jalayondeja, Wattana; Jalayondeja, Chutima; Bhuanantanondh, Petcharatana; Dusadiisariyavong, Asadang; Upiriyasakul, Rujiret; Anuraktam, Khajornyod

    2014-07-01

    To develop an online, self-report questionnaire on computer work-related exposure (OSCWE) and to determine the internal consistency, face and content validity of the questionnaire. The online, self-report questionnaire was developed to determine the risk factors related to musculoskeletal disorders in computer users. It comprised five domains: personal, work-related, work environment, physical health and psychosocial factors. The questionnaire's content was validated by an occupational medical doctor and three physical therapy lecturers involved in ergonomic teaching. Twenty-five lay people examined the feasibility of computer-administered and the user-friendly language. The item correlation in each domain was analyzed by the internal consistency (Cronbach's alpha; alpha). The content of the questionnaire was considered congruent with the testing purposes. Eight hundred and thirty-five computer users at the PTT Exploration and Production Public Company Limited registered to the online self-report questionnaire. The internal consistency of the five domains was: personal (alpha = 0.58), work-related (alpha = 0.348), work environment (alpha = 0.72), physical health (alpha = 0.68) and psychosocial factor (alpha = 0.93). The findings suggested that the OSCWE had acceptable internal consistency for work environment and psychosocial factors. The OSCWE is available to use in population-based survey research among computer office workers.

  1. 17 CFR 229.308T - (Item 308T) Internal control over financial reporting.

    Science.gov (United States)

    2010-04-01

    ... over financial reporting. 229.308T Section 229.308T Commodity and Securities Exchanges SECURITIES AND... § 229.308T (Item 308T) Internal control over financial reporting. Note to Item 308T: This is a special... internal control over financial reporting. Provide a report of management on the registrant's internal...

  2. Choice, internal consistency, and rationality

    OpenAIRE

    Aditi Bhattacharyya; Prasanta K. Pattanaik; Yongsheng Xu

    2010-01-01

    The classical theory of rational choice is built on several important internal consistency conditions. In recent years, the reasonableness of those internal consistency conditions has been questioned and criticized, and several responses to accommodate such criticisms have been proposed in the literature. This paper develops a general framework to accommodate the issues raised by the criticisms of classical rational choice theory, and examines the broad impact of these criticisms from both no...

  3. The eye-complaint questionnaire in a visual display unit work environment: Internal consistency and test-retest reliability

    NARCIS (Netherlands)

    Steenstra, Ivan A.; Sluiter, Judith K.; Frings-Dresen, Monique H. W.

    2009-01-01

    The internal consistency and test-retest reliability of a 10-item eye-complaint questionnaire (ECQ) were examined within a sample of office workers. Repeated within-subjects measures were performed within a single day and over intervals of 1 and 7 d. Questionnaires were completed by 96 workers (70%

  4. Evaluating the factor structure, item analyses, and internal consistency of hospital anxiety and depression scale in Iranian infertile patients

    Directory of Open Access Journals (Sweden)

    Payam Amini

    2017-09-01

    Full Text Available Background: The hospital anxiety and depression scale (HADS is a common screening tool designed to measure the level of anxiety and depression in different factor structures and has been extensively used in non-psychiatric populations and individuals experiencing fertility problems. Objective: The aims of this study were to evaluate the factor structure, item analyses, and internal consistency of HADS in Iranian infertile patients. Materials and Methods: This cross-sectional study included 651 infertile patients (248 men and 403 women referred to a referral infertility Center in Tehran, Iran between January 2014 and January 2015. Confirmatory factor analysis was used to determine the underlying factor structure of the HADS among one, two, and threefactor models. Several goodness of fit indices were utilized such as comparative, normed and goodness of fit indices, Akaike information criterion, and the root mean squared error of approximation. In addition to HADS, the Satisfaction with Life Scale questionnaires as well as demographic and clinical information were administered to all patients. Results: The goodness of fit indices through CFAs exposed that three and onefactor model provided the best and worst fit to the total, male and female datasets compared to the other factor structure models for the infertile patients. The Cronbach’s alpha for anxiety and depression subscales were 0.866 and 0.753 respectively. The HADS subscales significantly correlated with SWLS, indicating an acceptable convergent validity. Conclusion: The HADS was found to be a three-factor structure screening instrument in the field of infertility.

  5. Construct validity and internal consistency reliability of the Malay version of the 21-item depression anxiety stress scale (Malay-DASS-21) among male outpatient clinic attendees in Johor.

    Science.gov (United States)

    Rusli, B N; Amrina, K; Trived, S; Loh, K P; Shashi, M

    2017-10-01

    The 21-item English version of the Depression Anxiety Stress Scale (DASS-21) has been proposed as a method for assessing self-perceived depression, anxiety and stress over the past week in various clinical and nonclinical populations. Several Malay versions of the DASS-21 have been validated in various populations with varying success. One particular Malay version has been validated in various occupational groups (such as nurses and automotive workers) but not among male clinic outpatient attendees in Malaysia. To validate the Malay version of the DASS-21 (Malay-DASS-21) among male outpatient clinic attendees in Johor. A validation study with a random sample of 402 male respondents attending the outpatient clinic of a major public outpatient clinic in Johor Bahru and Segamat was carried out from January to March 2016. Construct validity of the Malay-DASS-21 was examined using Exploratory Factor Analysis (KMO = 0.947; Bartlett's test of sphericity is significant, pDASS- 21 and the internal consistency reliability using Cronbach's alpha. Construct validity of the Malay-DASS-21 based on eigenvalues and factor loadings to confirm the three factor structure (depression, anxiety, and stress) was acceptable. The internal consistency reliability of the factor construct was very impressive with Cronbach's alpha values in the range of 0.837 to 0.863. The present study showed that the Malay- DASS-21 has acceptable psychometric construct and high internal consistency reliability to measure self-perceived depression, anxiety and stress over the past week in male outpatient clinic attendees in Johor. Further studies are necessary to revalidate the Malay-DASS-21 across different populations and cultures, and using confirmatory factor analyses.

  6. Psychometric analyses and internal consistency of the PHEEM questionnaire to measure the clinical learning environment in the clerkship of a Medical School in Chile.

    Science.gov (United States)

    Riquelme, Arnoldo; Herrera, Cristian; Aranis, Carolina; Oporto, Jorge; Padilla, Oslando

    2009-06-01

    The Spanish version of the Postgraduate Hospital Educational Environment Measure (PHEEM) was evaluated in this study to determine its psychometric properties, validity and internal consistency to measure the clinical learning environment in the hospital setting of Pontificia Universidad Católica de Chile Medical School's Internship. The 40-item PHEEM questionnaire was translated from English to Spanish and retranslated to English. Content validity was tested by a focus group and minor differences in meaning were adjusted. The PHEEM was administered to clerks in years 6 and 7. Construct validity was carried out using exploratory factor analysis followed by a Varimax rotation. Internal consistency was measured using Cronbach's alpha. A total of 125 out of 220 students responded to the PHEEM. The overall response rate was 56.8% and compliances with each item ranged from 99.2% to 100%. Analyses indicate that five factors instrument accounting for 58% of the variance and internal consistency of the 40-item questionnaire is 0.955 (Cronbach's alpha). The 40-item questionnaire had a mean score of 98.21 +/- 21.2 (maximum score of 160). The Spanish version of PHEEM is a multidimensional, valid and highly reliable instrument measuring the educational environment among undergraduate medical students working in hospital-based clerkships.

  7. Psychometric Properties of the International Personality Item Pool Big-Five Personality Questionnaire for the Greek population.

    Science.gov (United States)

    Ypofanti, Maria; Zisi, Vasiliki; Zourbanos, Nikolaos; Mouchtouri, Barbara; Tzanne, Pothiti; Theodorakis, Yannis; Lyrakos, Georgios

    2015-09-30

    Goldberg's International Personality Item Pool (IPIP) big-five personality factor markers currently lack validating evidence. The structure of the 50-item IPIP was examined in two different adult samples (total N=811), in each case justifying a 5-factor solution, with only minor discrepancies. Age differences were comparable to previous findings using other inventories. One sample (N=193) also completed additionally another personality measure (the TIPI Short Form). Conscientiousness, extraversion and emotional stability/neuroticism scales of the IPIP were highly correlated with those of the TIPI (r=0.62 to 0.65, P=0.01). Agreeableness and Intellect/Openness scales correlated less strongly (r=0.54 and 0.58 respectively, P=0.01). The IPIP scales have good internal consistency (a=0.88) and relate strongly to major dimensions of personality assessed by the two questionnaires.

  8. 17 CFR 229.308 - (Item 308) Internal control over financial reporting.

    Science.gov (United States)

    2010-04-01

    ... over financial reporting. 229.308 Section 229.308 Commodity and Securities Exchanges SECURITIES AND... § 229.308 (Item 308) Internal control over financial reporting. (a) Management's annual report on internal control over financial reporting. Provide a report of management on the registrant's internal...

  9. Temporal and Geographic variation in the validity and internal consistency of the Nursing Home Resident Assessment Minimum Data Set 2.0.

    Science.gov (United States)

    Mor, Vincent; Intrator, Orna; Unruh, Mark Aaron; Cai, Shubing

    2011-04-15

    The Minimum Data Set (MDS) for nursing home resident assessment has been required in all U.S. nursing homes since 1990 and has been universally computerized since 1998. Initially intended to structure clinical care planning, uses of the MDS expanded to include policy applications such as case-mix reimbursement, quality monitoring and research. The purpose of this paper is to summarize a series of analyses examining the internal consistency and predictive validity of the MDS data as used in the "real world" in all U.S. nursing homes between 1999 and 2007. We used person level linked MDS and Medicare denominator and all institutional claim files including inpatient (hospital and skilled nursing facilities) for all Medicare fee-for-service beneficiaries entering U.S. nursing homes during the period 1999 to 2007. We calculated the sensitivity and positive predictive value (PPV) of diagnoses taken from Medicare hospital claims and from the MDS among all new admissions from hospitals to nursing homes and the internal consistency (alpha reliability) of pairs of items within the MDS that logically should be related. We also tested the internal consistency of commonly used MDS based multi-item scales and examined the predictive validity of an MDS based severity measure viz. one year survival. Finally, we examined the correspondence of the MDS discharge record to hospitalizations and deaths seen in Medicare claims, and the completeness of MDS assessments upon skilled nursing facility (SNF) admission. Each year there were some 800,000 new admissions directly from hospital to US nursing homes and some 900,000 uninterrupted SNF stays. Comparing Medicare enrollment records and claims with MDS records revealed reasonably good correspondence that improved over time (by 2006 only 3% of deaths had no MDS discharge record, only 5% of SNF stays had no MDS, but over 20% of MDS discharges indicating hospitalization had no associated Medicare claim). The PPV and sensitivity levels of

  10. 38 CFR 21.219 - Supplies consisting of clothing, magazines and periodicals, and items which may be personally...

    Science.gov (United States)

    2010-07-01

    ... clothing, magazines and periodicals, and items which may be personally used by the veteran. 21.219 Section....219 Supplies consisting of clothing, magazines and periodicals, and items which may be personally used... will be supplied. (b) Furnishing magazines and periodicals. Appropriate past issues of magazines...

  11. Construct validity of the items on the Stroke Specific Quality of Life (SS-QOL) questionnaire that evaluate the participation component of the International Classification of Functioning, Disability and Health.

    Science.gov (United States)

    Silva, Soraia Micaela; Corrêa, Fernanda Ishida; Pereira, Gabriela Santos; Faria, Christina Danielli Coelho de Morais; Corrêa, João Carlos Ferrari

    2018-01-01

    Analyze the construct validity and internal consistency of the Stroke Specific Quality of Life (SS-QOL) items that address the participation component of the ICF as well as analyze the ceiling and floor effects. One hundred subjects were analyzed: 85 community-dwelling and 15 institutionalized individuals. The analysis of construct validity was performed using classic psychometrics: (1) the comparison of known groups (individuals without restriction to participation vs. those with restriction to participation) using the Mann-Whitney test and (2) convergent validity - correlation between the scores on the SS-QOL items that address participation and the subscale scores of measures used to evaluate the similar constructs and concepts [the Short-Form Health Survey (SF-36), Functional Independence Measure (FIM) and grip strength test]. Spearman's correlation coefficients were calculated for this analysis. Cronbach's α was used for the analysis of internal consistency and both the ceiling and floor effects were analyzed. The level of significance for all analyses was α = 0.05. The a priori hypotheses regarding construct validity were partially demonstrated, as only five of the eight domains exhibited positive moderate to strong correlations (r > 0.40) with measures that address constructs similar to those addressed on the SS-QOL questionnaire. The items demonstrated adequate internal consistency and are capable of differentiating individuals with and without restriction to participation. The ceiling and floor effects were considered adequate for the total SS-QOL score, but beyond acceptable standards for some domains. The 26 items of the SS-QOL questionnaire measure a multidimensional construct and therefore do not only address participation. However, the items demonstrated adequate internal consistency and are capable of differentiating individuals with and without restriction to participation. Implications for rehabilitation The 26 items of the SS

  12. Factorial Validity and Internal Consistency of Malaysian Adapted Depression Anxiety Stress Scale - 21 in an Adolescent Sample

    OpenAIRE

    Hairul Anuar Hashim; Freddy Golok; Rosmatunisah Ali

    2011-01-01

    Background: Psychometrically sound measurement instrument is a fundamental requirement across broad range of research areas. In negative affect research, Depression Anxiety Stress Scale (DASS) has been identified as a psychometrically sound instrument to measure depression, anxiety and stress, especially the 21-item version. However, its psychometric properties in adolescents have been less consistent. Objectives: Thus, the present study sought to examine the factorial validity and internal c...

  13. Memory Retention after Reading Alould and its Effects on the Internalization of New Items

    OpenAIRE

    佐藤, あずさ; Azusa, SATO; 安田女子大学大学院

    2014-01-01

    This paper reports the results of two studies focusing on internalization of newly learned items. In study 1, internalization was not confirmed, but reading and memory retention abilities of the reading-aloud subgroup (i.e., students with lower reading proficiency) improved significantly more than the reading-silently subgroup. In study 2 the same effects were confirmed in the reading-aloud subgroup, and internalization of newly learned items was finally confirmed in the reading-aloud group.

  14. The memory failures of everyday questionnaire (MFE): internal consistency and reliability.

    Science.gov (United States)

    Montejo Carrasco, Pedro; Montenegro, Peña Mercedes; Sueiro, Manuel J

    2012-07-01

    The Memory Failures of Everyday Questionnaire (MFE) is one of the most widely-used instruments to assess memory failures in daily life. The original scale has nine response options, making it difficult to apply; we created a three-point scale (0-1-2) with response choices that make it easier to administer. We examined the two versions' equivalence in a sample of 193 participants between 19 and 64 years of age. The test-retest reliability and internal consistency of the version we propose were also computed in a sample of 113 people. Several indicators attest to the two forms' equivalence: the correlation between the items' means (r = .94; p MFE 1-9. The MFE 0-2 provides a brief, simple evaluation, so we recommend it for use in clinical practice as well as research.

  15. Factorial Validity and Internal Consistency of the Motivational Climate in Physical Education Scale

    Directory of Open Access Journals (Sweden)

    Markus Soini

    2014-03-01

    Full Text Available The aim of the study was to examine the construct validity and internal consistency of the Motivational Climate in Physical Education Scale (MCPES. A key element of the development process of the scale was establishing a theoretical framework that integrated the dimensions of task- and ego involving climates in conjunction with autonomy, and social relatedness supporting climates. These constructs were adopted from the self-determination and achievement goal theories. A sample of Finnish Grade 9 students, comprising 2,594 girls and 1,803 boys, completed the 18-item MCPES during one physical education class. The results of the study demonstrated that participants had highest mean in task-involving climate and the lowest in autonomy climate and ego-involving climate. Additionally, autonomy, social relatedness, and task- involving climates were significantly and strongly correlated with each other, whereas the ego- involving climate had low or negligible correlations with the other climate dimensions.The construct validity of the MCPES was analyzed using confirmatory factor analysis. The statistical fit of the four-factor model consisting of motivational climate factors supporting perceived autonomy, social relatedness, task-involvement, and ego-involvement was satisfactory. The results of the reliability analysis showed acceptable internal consistencies for all four dimensions. The Motivational Climate in Physical Education Scale can be considered as psychometrically valid tool to measure motivational climate in Finnish Grade 9 students.

  16. Factor Structure, Internal Consistency, and Screening Sensitivity of the GARS-2 in a Developmental Disabilities Sample

    Directory of Open Access Journals (Sweden)

    Martin A. Volker

    2016-01-01

    Full Text Available The Gilliam Autism Rating Scale-Second Edition (GARS-2 is a widely used screening instrument that assists in the identification and diagnosis of autism. The purpose of this study was to examine the factor structure, internal consistency, and screening sensitivity of the GARS-2 using ratings from special education teaching staff for a sample of 240 individuals with autism or other significant developmental disabilities. Exploratory factor analysis yielded a correlated three-factor solution similar to that found in 2005 by Lecavalier for the original GARS. Though the three factors appeared to be reasonably consistent with the intended constructs of the three GARS-2 subscales, the analysis indicated that more than a third of the GARS-2 items were assigned to the wrong subscale. Internal consistency estimates met or exceeded standards for screening and were generally higher than those in previous studies. Screening sensitivity was .65 and specificity was .81 for the Autism Index using a cut score of 85. Based on these findings, recommendations are made for instrument revision.

  17. Evidence for the Psychometric Validity, Internal Consistency and Measurement Invariance of Warwick Edinburgh Mental Well-being Scale Scores in Scottish and Irish Adolescents.

    Science.gov (United States)

    McKay, Michael T; Andretta, James R

    2017-09-01

    Mental well-being is an important indicator of current, but also the future health of adolescents. The 14-item Warwick Edinburgh Mental Well-being Scale (WEMWBS) has been well validated in adults world-wide, but less work has been undertaken to examine the psychometric validity and internal consistency of WEMWBS scores in adolescents. In particular, little research has examined scores on the short 7-item version of the WEMWBS. The present study used two large samples of school children in Scotland and Northern Ireland and found that for both forms of the WEMWBS, scores were psychometrically valid, internally consistent, factor saturated, and measurement invariant by country. Using the WEMWBS full form, males reported significantly higher scores than females, and Northern Irish adolescents reported significantly higher scores than their Scottish counterparts. Last, the lowest overall levels of well-being were observed among Scottish females. Copyright © 2017. Published by Elsevier B.V.

  18. Delimiting Coefficient a from Internal Consistency and Unidimensionality

    Science.gov (United States)

    Sijtsma, Klaas

    2015-01-01

    I discuss the contribution by Davenport, Davison, Liou, & Love (2015) in which they relate reliability represented by coefficient a to formal definitions of internal consistency and unidimensionality, both proposed by Cronbach (1951). I argue that coefficient a is a lower bound to reliability and that concepts of internal consistency and…

  19. International Assessment: A Rasch Model and Teachers' Evaluation of TIMSS Science Achievement Items

    Science.gov (United States)

    Glynn, Shawn M.

    2012-01-01

    The Trends in International Mathematics and Science Study (TIMSS) is a comparative assessment of the achievement of students in many countries. In the present study, a rigorous independent evaluation was conducted of a representative sample of TIMSS science test items because item quality influences the validity of the scores used to inform…

  20. [Discomfort associated with dental extraction surgery and development of a questionnaire (QCirDental). Part I: Impacts and internal consistency].

    Science.gov (United States)

    Bortoluzzi, Marcelo Carlos; Martins, Luciana Dorochenko; Takahashi, André; Ribeiro, Bianca; Martins, Ligiane; Pinto, Marcia Helena Baldani

    2018-01-01

    The scope of this study was to develop and validate a questionnaire (QCirDental) to measure the impacts associated with dental extraction surgery. The QCirDental questionnaire was developed in two steps; (1) question and item generation and selection, and (2) pretest of the questionnaire with evaluation of the its measurement properties (internal consistency and responsiveness). The sample was composed of 123 patients. None of the patients had any difficulty in understanding the QCirDental. The instrument was found to have excellent internal consistency with Cronbach's alpha reliability coefficient of 0.83. The principal component analysis (Kaiser-Meyer-Olkin Measure of Sampling Adequacy 0,72 and Bartlett's Test of Sphericity with p < 0.001) showed six (6) dimensions explaining 67.5% of the variance. The QCirDental presented excellent internal consistency, being a questionnaire that is easy to read and understand with adequate semantic and content validity. More than 80% of the patients who underwent dental extraction reported some degree of discomfort within the perioperative period which highlights the necessity to assess the quality of care and impacts of dental extraction surgery.

  1. Cross- cultural validation of the Brazilian Portuguese version of the Social Phobia Inventory (SPIN: study of the items and internal consistency Validação transcultural da versão para o português do Brasil do Social Phobia Inventory (SPIN: estudo dos itens e da consistência interna

    Directory of Open Access Journals (Sweden)

    Flávia de Lima Osório

    2009-03-01

    Full Text Available OBJECTIVE: The objective of the present study was to carry out the cross- cultural validation for Brazilian Portuguese of the Social Phobia Inventory, an instrument for the evaluation of fear, avoidance and physiological symptoms associated with social anxiety disorder. METHOD: The process of translation and adaptation involved four bilingual professionals, appreciation and approval of the back- translation by the authors of the original scale, a pilot study with 30 Brazilian university students, and appreciation by raters who confirmed the face validity of the Portuguese version, which was named " Inventário de Fobia Social" . As part of the psychometric study of the Social Phobia Inventory, analysis of the items and evaluation of the internal consistency of the instrument were performed in a study conducted on 2314 university students. RESULTS: The results demonstrated that item 11, related to the fear of public speaking, was the most frequently scored item. The correlation of the items with the total score was quite adequate, ranging from 0.44 to 0.71, as was the internal consistency, which ranged from 0.71 to 0.90. DISCUSSION/CONCLUSION: The authors conclude that the Brazilian Portuguese version of the Social Phobia Inventory proved to be adequate regarding the psychometric properties initially studied, with qualities quite close to those of the original study. Studies that will evaluate the remaining indicators of validity of the Social Phobia Inventory in clinical and non-clinical samples are considered to be opportune and necessary.OBJETIVO: O objetivo deste estudo foi realizar a validação transcultural para o português do Brasil do Social Phobia Inventory, um instrumento para avaliação e mensuração dos sintomas de medo, evitação e sintomas fisiológicos associados ao transtorno de ansiedade social. MÉTODO: O processo de tradução e adaptação envolveu quatro profissionais bilingües, apreciação e aprovação da back

  2. The Portuguese language version of social phobia and Anxiety Inventory: analysis of items and internal consistency in a Brazilian sample of 1,014 undergraduate students Versão para o português do Inventário de Fobia Social e Ansiedade: análise de itens e consistência interna numa amostra de 1.014 estudantes universitários brasileiros

    Directory of Open Access Journals (Sweden)

    Patrícia Picon

    2006-01-01

    Full Text Available OBJECTIVE: Theoretical and empirical analysis of items and internal consistency of the Portuguese-language version of Social Phobia and Anxiety Inventory (SPAI-Portuguese. METHODS: Social phobia experts conducted a 45-item content analysis of the SPAI-Portuguese administered to a sample of 1,014 university students. Item discrimination was evaluated by Student's t test; interitem, mean and item-to-total correlations, by Pearson coefficient; reliability was estimated by Cronbach's alpha. RESULTS: There was 100% agreement among experts concerning the 45 items. On the SPAI-Portuguese 43 items were discriminative (p OBJETIVO: Análise teórica e empírica dos itens e da consistência interna da versão em português do Social Phobia and Anxiety Inventory (SPAI-Português e subescalas. MÉTODOS: Peritos em fobia social conduziram análise de conteúdo dos 45 itens do SPAI-Português, administrado a 1.014 estudantes universitários. A discriminação dos itens foi avaliada por teste t de Student; correlações interitens, médias e item/total por coeficientes de Pearson; fidedignidade pelo alfa de Cronbach. RESULTADOS: Concordância plena entre os peritos para os 45 itens. SPAI-Português com 43 itens discriminativos (p < 0,05. Alguns itens, entre as subescalas, apresentaram coeficientes abaixo de 0,2. As médias das correlações interitens foram: 0,41 na subescala fobia social; 0,32 na subescala agorafobia; e 0,32 no SPAI-Português. As correlações item/total foram maiores do que 0,3 (p < 0,001. Alfas de Cronbach foram: 0,95 no SPAI-Português; 0,96 na subescala de fobia social; 0,85 na subescala de agorafobia. CONCLUSÃO: O conteúdo dos itens foi relacionado aos constructos subjacentes (agorafobia e fobia social, com discriminabilidade de 43 itens do SPAI-Português. As correlações médias interitens e alfas revelaram consistência interna de SPAI-Português e subescalas, além de multidimensionalidade das mesmas. Nenhum item foi suprimido

  3. Harmonizing Measures of Cognitive Performance Across International Surveys of Aging Using Item Response Theory.

    Science.gov (United States)

    Chan, Kitty S; Gross, Alden L; Pezzin, Liliana E; Brandt, Jason; Kasper, Judith D

    2015-12-01

    To harmonize measures of cognitive performance using item response theory (IRT) across two international aging studies. Data for persons ≥65 years from the Health and Retirement Study (HRS, N = 9,471) and the English Longitudinal Study of Aging (ELSA, N = 5,444). Cognitive performance measures varied (HRS fielded 25, ELSA 13); 9 were in common. Measurement precision was examined for IRT scores based on (a) common items, (b) common items adjusted for differential item functioning (DIF), and (c) DIF-adjusted all items. Three common items (day of date, immediate word recall, and delayed word recall) demonstrated DIF by survey. Adding survey-specific items improved precision but mainly for HRS respondents at lower cognitive levels. IRT offers a feasible strategy for harmonizing cognitive performance measures across other surveys and for other multi-item constructs of interest in studies of aging. Practical implications depend on sample distribution and the difficulty mix of in-common and survey-specific items. © The Author(s) 2015.

  4. Standard Errors for National Trends in International Large-Scale Assessments in the Case of Cross-National Differential Item Functioning

    Science.gov (United States)

    Sachse, Karoline A.; Haag, Nicole

    2017-01-01

    Standard errors computed according to the operational practices of international large-scale assessment studies such as the Programme for International Student Assessment's (PISA) or the Trends in International Mathematics and Science Study (TIMSS) may be biased when cross-national differential item functioning (DIF) and item parameter drift are…

  5. Factorial validity and internal consistency of the motivational climate in physical education scale.

    Science.gov (United States)

    Soini, Markus; Liukkonen, Jarmo; Watt, Anthony; Yli-Piipari, Sami; Jaakkola, Timo

    2014-01-01

    The aim of the study was to examine the construct validity and internal consistency of the Motivational Climate in Physical Education Scale (MCPES). A key element of the development process of the scale was establishing a theoretical framework that integrated the dimensions of task- and ego involving climates in conjunction with autonomy, and social relatedness supporting climates. These constructs were adopted from the self-determination and achievement goal theories. A sample of Finnish Grade 9 students, comprising 2,594 girls and 1,803 boys, completed the 18-item MCPES during one physical education class. The results of the study demonstrated that participants had highest mean in task-involving climate and the lowest in autonomy climate and ego-involving climate. Additionally, autonomy, social relatedness, and task- involving climates were significantly and strongly correlated with each other, whereas the ego- involving climate had low or negligible correlations with the other climate dimensions.The construct validity of the MCPES was analyzed using confirmatory factor analysis. The statistical fit of the four-factor model consisting of motivational climate factors supporting perceived autonomy, social relatedness, task-involvement, and ego-involvement was satisfactory. The results of the reliability analysis showed acceptable internal consistencies for all four dimensions. The Motivational Climate in Physical Education Scale can be considered as psychometrically valid tool to measure motivational climate in Finnish Grade 9 students. Key PointsThis study developed Motivational Climate in School Physical Education Scale (MCPES). During the development process of the scale, the theoretical framework using dimensions of task- and ego involving as well as autonomy, and social relatedness supporting climates was constructed. These constructs were adopted from the self-determination and achievement goal theories.The statistical fit of the four-factor model of the

  6. Internal Branding and Employee Brand Consistent Behaviours

    DEFF Research Database (Denmark)

    Mazzei, Alessandra; Ravazzani, Silvia

    2017-01-01

    constitutive processes. In particular, the paper places emphasis on the role and kinds of communication practices as a central part of the nonnormative and constitutive internal branding process. The paper also discusses an empirical study based on interviews with 32 Italian and American communication managers...... and 2 focus groups with Italian communication managers. Findings show that, in order to enhance employee brand consistent behaviours, the most effective communication practices are those characterised as enablement-oriented. Such a communication creates the organizational conditions adequate to sustain......Employee behaviours conveying brand values, named brand consistent behaviours, affect the overall brand evaluation. Internal branding literature highlights a knowledge gap in terms of communication practices intended to sustain such behaviours. This study contributes to the development of a non...

  7. Item analysis of the Spanish version of the Boston Naming Test with a Spanish speaking adult population from Colombia.

    Science.gov (United States)

    Kim, Stella H; Strutt, Adriana M; Olabarrieta-Landa, Laiene; Lequerica, Anthony H; Rivera, Diego; De Los Reyes Aragon, Carlos Jose; Utria, Oscar; Arango-Lasprilla, Juan Carlos

    2018-02-23

    The Boston Naming Test (BNT) is a widely used measure of confrontation naming ability that has been criticized for its questionable construct validity for non-English speakers. This study investigated item difficulty and construct validity of the Spanish version of the BNT to assess cultural and linguistic impact on performance. Subjects were 1298 healthy Spanish speaking adults from Colombia. They were administered the 60- and 15-item Spanish version of the BNT. A Rasch analysis was computed to assess dimensionality, item hierarchy, targeting, reliability, and item fit. Both versions of the BNT satisfied requirements for unidimensionality. Although internal consistency was excellent for the 60-item BNT, order of difficulty did not increase consistently with item number and there were a number of items that did not fit the Rasch model. For the 15-item BNT, a total of 5 items changed position on the item hierarchy with 7 poor fitting items. Internal consistency was acceptable. Construct validity of the BNT remains a concern when it is administered to non-English speaking populations. Similar to previous findings, the order of item presentation did not correspond with increasing item difficulty, and both versions were inadequate at assessing high naming ability.

  8. Psychometric properties of the neck disability index amongst patients with chronic neck pain using item response theory.

    Science.gov (United States)

    Saltychev, Mikhail; Mattie, Ryan; McCormick, Zachary; Laimi, Katri

    2017-05-13

    The Neck Disability Index (NDI) is commonly used for clinical and research assessment for chronic neck pain, yet the original version of this tool has not undergone significant validity testing, and in particular, there has been minimal assessment using Item Response Theory. The goal of the present study was to investigate the psychometric properties of the original version of the NDI in a large sample of individuals with chronic neck pain by defining its internal consistency, construct structure and validity, and its ability to discriminate between different degrees of functional limitation. This is a cross-sectional cohort study of 585 consecutive patients with chronic neck pain seen in a university hospital rehabilitation clinic. Internal consistency was evaluated using Cronbach's alpha, construct structure was evaluated by exploratory factor analysis, and discrimination ability was determined by Item Response Theory. The NDI demonstrated good internal consistency assessed by Cronbach's alpha (0.87). The exploratory factor analysis identified only one factor with eigenvalue considered significant (cutoff 1.0). When analyzed by Item Response Theory, eight out of 10 items demonstrated almost ideal difficulty parameter estimates. In addition, eight out of 10 items showed high to perfect estimates of discrimination ability (overall range 0.8 to 2.9). Amongst patients with chronic neck pain, the NDI was found to have good internal consistency, have unidimensional properties, and an excellent ability to distinguish patients with different levels of perceived disability. Implications for Rehabilitation The Neck Disability Index has good internal consistency, unidimensional properties, and an excellent ability to distinguish patients with different levels of perceived disability. The Neck Disability Index is recommended for use when selecting patients for rehabilitation, setting rehabilitation goals, and measuring the outcome of intervention.

  9. 25 CFR 542.17 - What are the minimum internal control standards for complimentary services or items?

    Science.gov (United States)

    2010-04-01

    ... 25 Indians 2 2010-04-01 2010-04-01 false What are the minimum internal control standards for... THE INTERIOR HUMAN SERVICES MINIMUM INTERNAL CONTROL STANDARDS § 542.17 What are the minimum internal control standards for complimentary services or items? (a) Each Tribal gaming regulatory authority or...

  10. Using Item Response Theory to Develop a 60-Item Representation of the NEO PI-R Using the International Personality Item Pool: Development of the IPIP-NEO-60.

    Science.gov (United States)

    Maples-Keller, Jessica L; Williamson, Rachel L; Sleep, Chelsea E; Carter, Nathan T; Campbell, W Keith; Miller, Joshua D

    2017-10-31

    Given advantages of freely available and modifiable measures, an increase in the use of measures developed from the International Personality Item Pool (IPIP), including the 300-item representation of the Revised NEO Personality Inventory (NEO PI-R; Costa & McCrae, 1992a ) has occurred. The focus of this study was to use item response theory to develop a 60-item, IPIP-based measure of the Five-Factor Model (FFM) that provides equal representation of the FFM facets and to test the reliability and convergent and criterion validity of this measure compared to the NEO Five Factor Inventory (NEO-FFI). In an undergraduate sample (n = 359), scores from the NEO-FFI and IPIP-NEO-60 demonstrated good reliability and convergent validity with the NEO PI-R and IPIP-NEO-300. Additionally, across criterion variables in the undergraduate sample as well as a community-based sample (n = 757), the NEO-FFI and IPIP-NEO-60 demonstrated similar nomological networks across a wide range of external variables (r ICC = .96). Finally, as expected, in an MTurk sample the IPIP-NEO-60 demonstrated advantages over the Big Five Inventory-2 (Soto & John, 2017 ; n = 342) with regard to the Agreeableness domain content. The results suggest strong reliability and validity of the IPIP-NEO-60 scores.

  11. Determining the Feasibility, Content Validity, and Internal Consistency of a Newly Developed Care Coordination Scale for People with Brain Injury

    Directory of Open Access Journals (Sweden)

    Brian P. Johnson

    2017-07-01

    Full Text Available Background: With the increasing complexity of care, people with disabilities and supportive significant others (SSO must often coordinate key aspects of their own care, but no validated scale currently exists to comprehensively characterize the activities done to manage and coordinate their care. Method: This study aimed to improve the feasibility, acceptability, and content validity of the Care and Service Coordination and Management (CASCAM scale and to test its internal consistency. Questionnaire items were administered to 23 individuals with acquired brain injury and 17 SSO. Results: Respondents confirmed content validity and that the instrument addresses important care coordination and management issues. The internal consistency of care coordination domains for medical/ rehabilitative and independent living needs for people with brain injury and their SSO ranged from α = .774 to .945. Conclusion: Care coordination activities by persons with disabilities, including brain injury, and their SSO are multifaceted but feasibly measurable and should be assessed to improve care.

  12. Adaptação transcultural e consistência interna do Early Trauma Inventory (ETI Early Trauma Inventory (ETI: cross-cultural adaptation and internal consistency

    Directory of Open Access Journals (Sweden)

    Marcelo Feijó de Mello

    2010-04-01

    Full Text Available As experiências traumáticas precoces são um fator de risco preditivo de problemas psicopatológicos futuros. O Early Trauma Inventory (ETI é um instrumento que avalia em indivíduos adultos experiências traumáticas ocorridas antes dos 18 anos de idade. Tal instrumento foi traduzido, transculturalmente adaptado e sua consistência interna foi avaliada. Vítimas de violência que preencheram os critérios de inclusão e exclusão foram submetidas a uma entrevista diagnóstica (SCID-I e ao ETI. Foram incluídos 91 pacientes com o transtorno do estresse pós-traumático (TEPT. O alfa de Cronbach nos diferentes domínios variou de 0,595-0,793, e o escore total foi de 0,878. A maior parte dos itens nos vários domínios, com exceção do abuso emocional, apresentou índices de correlação interitem entre 0,51-0,99. A versão adaptada foi útil tanto na clínica quanto na pesquisa. Apresentou boa consistência interna e na correlação interitem. O ETI é um instrumento válido, com boa consistência para se avaliar a presença de história de traumas precoces em indivíduos adultos.Early life stress is a strong predictor of future psychopathology during adulthood. The Early Trauma Inventory (ETI was developed to detect the presence and impact of traumatic experiences that occurred up to 18 years of age. The ETI was translated and cross-culturally adapted and had its consistency evaluated. Victims of violence that met the inclusion and exclusion criteria were submitted to SCID-I and ETI. Ninety-one patients with post-traumatic stress disorder (PTSD were included. Cronbach's alpha in the different domains varied from 0.595 to 0.793, and the total score was 0.878. Except for emotional abuse, most of the various domains displayed inter-item correlation rates of 0.51 to 0.99. The adapted version was useful for clinical and research purposes and showed good internal consistency and inter-item correlation. The ETI is a valid instrument with good

  13. Content validation: clarity/relevance, reliability and internal consistency of enunciative signs of language acquisition.

    Science.gov (United States)

    Crestani, Anelise Henrich; Moraes, Anaelena Bragança de; Souza, Ana Paula Ramos de

    2017-08-10

    To analyze the results of the validation of building enunciative signs of language acquisition for children aged 3 to 12 months. The signs were built based on mechanisms of language acquisition in an enunciative perspective and on clinical experience with language disorders. The signs were submitted to judgment of clarity and relevance by a sample of six experts, doctors in linguistic in with knowledge of psycholinguistics and language clinic. In the validation of reliability, two judges/evaluators helped to implement the instruments in videos of 20% of the total sample of mother-infant dyads using the inter-evaluator method. The method known as internal consistency was applied to the total sample, which consisted of 94 mother-infant dyads to the contents of the Phase 1 (3-6 months) and 61 mother-infant dyads to the contents of Phase 2 (7 to 12 months). The data were collected through the analysis of mother-infant interaction based on filming of dyads and application of the parameters to be validated according to the child's age. Data were organized in a spreadsheet and then converted to computer applications for statistical analysis. The judgments of clarity/relevance indicated no modifications to be made in the instruments. The reliability test showed an almost perfect agreement between judges (0.8 ≤ Kappa ≥ 1.0); only the item 2 of Phase 1 showed substantial agreement (0.6 ≤ Kappa ≥ 0.79). The internal consistency for Phase 1 had alpha = 0.84, and Phase 2, alpha = 0.74. This demonstrates the reliability of the instruments. The results suggest adequacy as to content validity of the instruments created for both age groups, demonstrating the relevance of the content of enunciative signs of language acquisition.

  14. Comment on the internal consistency of thermodynamic databases supporting repository safety assessments

    International Nuclear Information System (INIS)

    Arthur, R.C.

    2001-11-01

    This report addresses the concept of internal consistency and its relevance to the reliability of thermodynamic databases used in repository safety assessments. In addition to being internally consistent, a reliable database should be accurate over a range of relevant temperatures and pressures, complete in the sense that all important aqueous species, gases and solid phases are represented, and traceable to original experimental results. No single definition of internal consistency need to be universally accepted as the most appropriate under all conditions, however. As a result, two databases that are each internally consistent may be inconsistent with respect to each other, and a database derived from two or more such databases must itself be internally inconsistent. The consequences of alternative definitions that are reasonably attributable to the concept of internal consistency can be illustrated with reference to the thermodynamic database supporting SKB's recent SR 97 safety assessment. This database is internally inconsistent because it includes equilibrium constants calculated over a range of temperatures: using conflicting reference values for some solids, gases and aqueous species that are common to two internally consistent databases (the OECD/NEA database for radioelements and SUPCRT databases for non-radioactive elements) that serve as source databases for the SR 97 TDB, using different definitions in these source databases of standard states for condensed phases and aqueous species, based on different mathematical expressions used in these source databases representing the temperature dependence of the heat capacity, and based on different chemical models adopted in these source databases for the aqueous phase. The importance of such inconsistencies must be considered in relation to the other database reliability criteria noted above, however. Thus, accepting a certain level of internal inconsistency in a database it is probably preferable to use a

  15. Comment on the internal consistency of thermodynamic databases supporting repository safety assessments

    Energy Technology Data Exchange (ETDEWEB)

    Arthur, R.C. [Monitor Scientific, LLC, Denver, CO (United States)

    2001-11-01

    This report addresses the concept of internal consistency and its relevance to the reliability of thermodynamic databases used in repository safety assessments. In addition to being internally consistent, a reliable database should be accurate over a range of relevant temperatures and pressures, complete in the sense that all important aqueous species, gases and solid phases are represented, and traceable to original experimental results. No single definition of internal consistency need to be universally accepted as the most appropriate under all conditions, however. As a result, two databases that are each internally consistent may be inconsistent with respect to each other, and a database derived from two or more such databases must itself be internally inconsistent. The consequences of alternative definitions that are reasonably attributable to the concept of internal consistency can be illustrated with reference to the thermodynamic database supporting SKB's recent SR 97 safety assessment. This database is internally inconsistent because it includes equilibrium constants calculated over a range of temperatures: using conflicting reference values for some solids, gases and aqueous species that are common to two internally consistent databases (the OECD/NEA database for radioelements and SUPCRT databases for non-radioactive elements) that serve as source databases for the SR 97 TDB, using different definitions in these source databases of standard states for condensed phases and aqueous species, based on different mathematical expressions used in these source databases representing the temperature dependence of the heat capacity, and based on different chemical models adopted in these source databases for the aqueous phase. The importance of such inconsistencies must be considered in relation to the other database reliability criteria noted above, however. Thus, accepting a certain level of internal inconsistency in a database it is probably preferable to

  16. An Adapted Measure of Sibling Attachment: Factor Structure and Internal Consistency of the Sibling Attachment Inventory in Youth.

    Science.gov (United States)

    Noel, Valerie A; Francis, Sarah E; Tilley, Micah A

    2018-04-01

    Parent-youth and peer relationship inventories based on attachment theory measure communication, trust, and alienation, yet sibling relationships have been overlooked. We developed the Sibling Attachment Inventory and evaluated its psychometric properties in a sample of 172 youth ages 10-14 years. We adapted the 25-item Sibling Attachment Inventory from the Inventory of Parent and Peer Attachment-Revised peer measure. Items loaded onto three factors, identified as communication, trust, and alienation, α = 0.93, 0.90, and 0.76, respectively. Sibling trust and alienation correlated with depression (r s  = -0.33, r s  = 0.48) and self-worth (r s  = 0.23; r s  = -0.32); sibling trust and alienation correlated with depression after controlling for parent trust and parent alienation (r s  = -0.23, r s  = 0.22). Preliminary analyses showed good internal consistency, construct validity, and incremental predictive validity. Following replication of these properties, this measure can facilitate large cohort assessments of sibling attachment.

  17. Internal consistency and construct validity of the Quality of Life in Alzheimer's Disease (QoL-AD) proxy – a secondary data analysis

    Science.gov (United States)

    Hylla, Jonas; Schwab, Christian G G; Isfort, Michael; Halek, Margareta; Dichter, Martin N

    2016-07-01

    Background: The maintenance and promotion of Quality of Life (QoL) of people with dementia is a major outcome in intervention studies and health care. The Quality of Life Alzheimer's Disease (QoL-AD) is an internationally recommended QoL measurement also available in German language. Until now, only a few results on the psychometric properties of the German QoL-AD were available. Objective: Evaluation of internal consistency and construct validity of the QoL-AD proxy. Method: A principal component analysis (secondary data analysis) of the 13 QoL-AD items was carried out based on the total sample of 234 people with dementia from nine nursing homes in Germany. Subsequently, the internal consistency of the identified factors was examined using Cronbach's alpha. Results: Two factors physical and mental health and social network were determined. Both factors explain 53 % of the total variance. The stability of both factors was validated in two sensitivity analyses. The internal consistency is good for both factors with a Cronbach's alpha of 0.88 (physical and mental health) and 0.75 (social network). Conclusion: The QoL-AD proxy allows the assessment of two relevant health-related QoL domains of people with dementia. However, in future studies especially the inter-rater reliability of the QoL-AD proxy has to be examined.

  18. Internationally Standardized Cost Item Definitions for Decommissioning of Nuclear Installations

    International Nuclear Information System (INIS)

    Lucien Teunckens; Kurt Pflugrad; Candace Chan-Sands; Ted Lazo

    2000-01-01

    The European Commission (EC), the International Atomic Energy Agency (IAEA), and the Organization for Economic Cooperation and Development/Nuclear Energy Agency (OECD/NEA) have agreed to jointly prepare and publish a standardized list of cost items and related definitions for decommissioning projects. Such a standardized list would facilitate communication, promote uniformity, and avoid inconsistency or contradiction of results or conclusions of cost evaluations for decommissioning projects carried out for specific purposes by different groups. Additionally, a standardized structure would also be a useful tool for more effective cost management. This paper describes actual work and result thus far

  19. Delimiting coefficient alpha from internal consistency and unidimensionality

    NARCIS (Netherlands)

    Sijtsma, K.

    2015-01-01

    I discuss the contribution by Davenport, Davison, Liou, & Love (2015) in which they relate reliability represented by coefficient α to formal definitions of internal consistency and unidimensionality, both proposed by Cronbach (1951). I argue that coefficient α is a lower bound to reliability and

  20. The Body Appreciation Scale-2: item refinement and psychometric evaluation.

    Science.gov (United States)

    Tylka, Tracy L; Wood-Barcalow, Nichole L

    2015-01-01

    Considered a positive body image measure, the 13-item Body Appreciation Scale (BAS; Avalos, Tylka, & Wood-Barcalow, 2005) assesses individuals' acceptance of, favorable opinions toward, and respect for their bodies. While the BAS has accrued psychometric support, we improved it by rewording certain BAS items (to eliminate sex-specific versions and body dissatisfaction-based language) and developing additional items based on positive body image research. In three studies, we examined the reworded, newly developed, and retained items to determine their psychometric properties among college and online community (Amazon Mechanical Turk) samples of 820 women and 767 men. After exploratory factor analysis, we retained 10 items (five original BAS items). Confirmatory factor analysis upheld the BAS-2's unidimensionality and invariance across sex and sample type. Its internal consistency, test-retest reliability, and construct (convergent, incremental, and discriminant) validity were supported. The BAS-2 is a psychometrically sound positive body image measure applicable for research and clinical settings. Copyright © 2014 Elsevier Ltd. All rights reserved.

  1. Assessing item fit for unidimensional item response theory models using residuals from estimated item response functions.

    Science.gov (United States)

    Haberman, Shelby J; Sinharay, Sandip; Chon, Kyong Hee

    2013-07-01

    Residual analysis (e.g. Hambleton & Swaminathan, Item response theory: principles and applications, Kluwer Academic, Boston, 1985; Hambleton, Swaminathan, & Rogers, Fundamentals of item response theory, Sage, Newbury Park, 1991) is a popular method to assess fit of item response theory (IRT) models. We suggest a form of residual analysis that may be applied to assess item fit for unidimensional IRT models. The residual analysis consists of a comparison of the maximum-likelihood estimate of the item characteristic curve with an alternative ratio estimate of the item characteristic curve. The large sample distribution of the residual is proved to be standardized normal when the IRT model fits the data. We compare the performance of our suggested residual to the standardized residual of Hambleton et al. (Fundamentals of item response theory, Sage, Newbury Park, 1991) in a detailed simulation study. We then calculate our suggested residuals using data from an operational test. The residuals appear to be useful in assessing the item fit for unidimensional IRT models.

  2. Item Response Data Analysis Using Stata Item Response Theory Package

    Science.gov (United States)

    Yang, Ji Seung; Zheng, Xiaying

    2018-01-01

    The purpose of this article is to introduce and review the capability and performance of the Stata item response theory (IRT) package that is available from Stata v.14, 2015. Using a simulated data set and a publicly available item response data set extracted from Programme of International Student Assessment, we review the IRT package from…

  3. Cross-Cultural Adaptation of the Profile Fitness Mapping Neck Questionnaire to Brazilian Portuguese: Internal Consistency, Reliability, and Construct and Structural Validity.

    Science.gov (United States)

    Ferreira, Mariana Cândido; Björklund, Martin; Dach, Fabiola; Chaves, Thais Cristina

    The purpose of this study was to adapt and evaluate the psychometric properties of the ProFitMap-neck to Brazilian Portuguese. The cross-cultural adaptation consisted of 5 stages, and 180 female patients with chronic neck pain participated in the study. A subsample (n = 30) answered the pretest, and another subsample (n = 100) answered the questionnaire a second time. Internal consistency, test-retest reliability, and construct validity (hypothesis testing and structural validity) were estimated. For construct validity, the scores of the questionnaire were correlated with the Neck Disability Index (NDI), and the Hospital Anxiety and Depression Scale (HADS), the Tampa Scale of Kinesiophobia (TSK), and the 36-item Short-Form Health Survey (SF-36). Internal consistency was determined by adequate Cronbach's α values (α > 0.70). Strong reliability was identified by high intraclass correlation coefficients (ICC > 0.75). Construct validity was identified by moderate and strong correlations of the Br-ProFitMap-neck with total NDI score (-0.56 50%, Kaiser-Meyer-Olkin index > 0.50, eigenvalue > 1, and factor loadings > 0.2. Br-ProFitMap-neck had adequate psychometric properties and can be used in clinical settings, as well as research, in patients with chronic neck pain. Copyright © 2017. Published by Elsevier Inc.

  4. An emotional functioning item bank of 24 items for computerized adaptive testing (CAT) was established

    DEFF Research Database (Denmark)

    Petersen, Morten Aa.; Gamper, Eva-Maria; Costantini, Anna

    2016-01-01

    of the widely used EORTC Quality of Life questionnaire (QLQ-C30). STUDY DESIGN AND SETTING: On the basis of literature search and evaluations by international samples of experts and cancer patients, 38 candidate items were developed. The psychometric properties of the items were evaluated in a large...... international sample of cancer patients. This included evaluations of dimensionality, item response theory (IRT) model fit, differential item functioning (DIF), and of measurement precision/statistical power. RESULTS: Responses were obtained from 1,023 cancer patients from four countries. The evaluations showed...... that 24 items could be included in a unidimensional IRT model. DIF did not seem to have any significant impact on the estimation of EF. Evaluations indicated that the CAT measure may reduce sample size requirements by up to 50% compared to the QLQ-C30 EF scale without reducing power. CONCLUSION...

  5. A psychometric comparison of three scales and a single-item measure to assess sexual satisfaction.

    Science.gov (United States)

    Mark, Kristen P; Herbenick, Debby; Fortenberry, J Dennis; Sanders, Stephanie; Reece, Michael

    2014-01-01

    This study was designed to systematically compare and contrast the psychometric properties of three scales developed to measure sexual satisfaction and a single-item measure of sexual satisfaction. The Index of Sexual Satisfaction (ISS), Global Measure of Sexual Satisfaction (GMSEX), and the New Sexual Satisfaction Scale-Short (NSSS-S) were compared to one another and to a single-item measure of sexual satisfaction. Conceptualization of the constructs, distribution of scores, internal consistency, convergent validity, test-retest reliability, and factor structure were compared between the measures. A total of 211 men and 214 women completed the scales and a measure of relationship satisfaction, with 33% (n = 139) of the sample reassessed two months later. All scales demonstrated appropriate distribution of scores and adequate internal consistency. The GMSEX, NSSS-S, and the single-item measure demonstrated convergent validity. Test-retest reliability was demonstrated by the ISS, GMSEX, and NSSS-S, but not the single-item measure. Taken together, the GMSEX received the strongest psychometric support in this sample for a unidimensional measure of sexual satisfaction and the NSSS-S received the strongest psychometric support in this sample for a bidimensional measure of sexual satisfaction.

  6. Internal consistency and content validity of a questionnaire aimed to assess the stages of behavioral lifestyle changes in Colombian schoolchildren: The Fuprecol study

    Directory of Open Access Journals (Sweden)

    Yasmira CARRILLO-BERNATE

    Full Text Available ABSTRACT Objective To assess internal consistency and content validity of a questionnaire aimed to assess the stages of Behavioural Lifestyle Changes in a sample of school-aged children and adolescents aged 9 to 17 years-old. Methods This validation study involved 675 schoolchildren from three official school in the city of Bogota, Colombia. A self-administered questionnaire called Behavioural Lifestyle Changes has been designed to explore stages of change regarding to physical activity/exercise, fruit and vegetable consumption, alcohol abuse, tobacco use, and drug abuse. Cronbach-α, Kappa index and exploratory factor analysis were used for evaluating the internal consistency and validity of content, respectively. Results The study population consisted of 51.1% males and the participants’ average age was 12.7±2.4 years-old. Behavioural Lifestyle Changes scored 0.720 (range 0.691 to 0.730 on the Cronbach α and intra-observer reproducibility was good (Kappa=0.71. Exploratory factor analysis determined two factors (factor 1: physical activity/exercise, fruit and vegetable consumption, and factor 2: alcohol abuse tobacco use and drug abuse, explaining 67.78% of variance by the items and six interactions χ2/gL=11649.833; p<0.001. Conclusion Behavioural Lifestyle Changes Questionnaire was seen to have suitable internal consistency and validity. This instrument can be recommended, mainly within the context of primary attention for studying the stages involved in the lifestyle behavioural changes model on a school-based population.

  7. Refinement of the Brazilian Household Food Insecurity Measurement Scale: Recommendation for a 14-item EBIA

    Directory of Open Access Journals (Sweden)

    Ana Maria Segall-Corrêa

    2014-04-01

    Full Text Available OBJECTIVE: To review and refine Brazilian Household Food Insecurity Measurement Scale structure. METHODS: The study analyzed the impact of removing the item "adult lost weight" and one of two possibly redundant items on Brazilian Household Food Insecurity Measurement Scale psychometric behavior using the one-parameter logistic (Rasch model. Brazilian Household Food Insecurity Measurement Scale psychometric behavior was analyzed with respect to acceptable adjustment values ranging from 0.7 to 1.3, and to severity scores of the items with theoretically expected gradients. The socioeconomic and food security indicators came from the 2004 National Household Sample Survey, which obtained complete answers to Brazilian Household Food Insecurity Measurement Scale items from 112,665 households. RESULTS: Removing the items "adult reduced amount..." followed by "adult ate less..." did not change the infit of the remaining items, except for "adult lost weight", whose infit increased from 1.21 to 1.56. The internal consistency and item severity scores did not change when "adult ate less" and one of the two redundant items were removed. CONCLUSION: Brazilian Household Food Insecurity Measurement Scale reanalysis reduced the number of scale items from 16 to 14 without changing its internal validity. Its use as a nationwide household food security measure is strongly recommended.

  8. Validation of the MOS Social Support Survey 6-item (MOS-SSS-6) measure with two large population-based samples of Australian women.

    Science.gov (United States)

    Holden, Libby; Lee, Christina; Hockey, Richard; Ware, Robert S; Dobson, Annette J

    2014-12-01

    This study aimed to validate a 6-item 1-factor global measure of social support developed from the Medical Outcomes Study Social Support Survey (MOS-SSS) for use in large epidemiological studies. Data were obtained from two large population-based samples of participants in the Australian Longitudinal Study on Women's Health. The two cohorts were aged 53-58 and 28-33 years at data collection (N = 10,616 and 8,977, respectively). Items selected for the 6-item 1-factor measure were derived from the factor structure obtained from unpublished work using an earlier wave of data from one of these cohorts. Descriptive statistics, including polychoric correlations, were used to describe the abbreviated scale. Cronbach's alpha was used to assess internal consistency and confirmatory factor analysis to assess scale validity. Concurrent validity was assessed using correlations between the new 6-item version and established 19-item version, and other concurrent variables. In both cohorts, the new 6-item 1-factor measure showed strong internal consistency and scale reliability. It had excellent goodness-of-fit indices, similar to those of the established 19-item measure. Both versions correlated similarly with concurrent measures. The 6-item 1-factor MOS-SSS measures global functional social support with fewer items than the established 19-item measure.

  9. IDENTIFICATION OF MEASUREMENT ITEMS OF DESIGN REQUIREMENTS FOR LEAN AND AGILE SUPPLY CHAIN-CONFIRMATORY FACTOR ANALYSIS

    Directory of Open Access Journals (Sweden)

    D.Venkata Ramana

    2013-06-01

    Full Text Available This study examines the consistency approaches by confirmatory factor analysis that determines the construct validity, convergent validity, construct reliability and internal consistency of the items of strategic design requirements. The design requirements includes use of information technology, sourcing procedures, new product development, flexible manufacturing functions and demand management supply chain net work design, management, commitment and inventory management policies among manufacturers of volatile and unforeseeable products in Andhraadesh, India. This study suggested that the seven factor model with 20 items of the leagile supply chain design requirements had a good fit. Further, the study showed a val id and reliable measurement to identify critical items among the design requirements of leagile supply chains.

  10. On the internal consistency of the term structure of forecasts of housing starts

    DEFF Research Database (Denmark)

    Pierdzioch, C.; Rulke, J. C.; Stadtmann, G.

    2013-01-01

    We use the term structure of forecasts of housing starts to test for rationality of forecasts. Our test is based on the idea that short-term and long-term forecasts should be internally consistent. We test the internal consistency of forecasts using data for Australia, Canada, Japan and the United...

  11. Nurses' knowledge and attitudes towards aged sexuality: validity and internal consistency of the Dutch version of the Aging Sexual Knowledge and Attitudes Scale.

    Science.gov (United States)

    Mahieu, Lieslot; de Casterlé, Bernadette Dierckx; Van Elssen, Kim; Gastmans, Chris

    2013-11-01

    This paper reports a study testing the content and face validity and internal consistency of the Dutch version of the Aging Sexual Knowledge and Attitudes Scale. The ability of older residents to sexually express themselves is known to be influenced by the knowledge and attitudes of nursing home staff towards later-life sexuality. Although the Aging Sexual Knowledge and Attitudes Scale is a widely used instrument to measure this, there is no validated, Dutch translation available. Instrument development. Following a standard forward/backward translation into Dutch, the scale was further adapted for use in Flemish nursing home settings. Content and face validity and user-friendliness were assessed. The psychometric properties were determined by means of an exploratory study. Data were collected from March-April 2011 at eight Flemish nursing homes. Reliability was assessed using internal consistency and item-total correlations. Both subscales of the Flemish adaptation showed acceptable content validity. The face validity and user-friendliness were deemed favourable with hardly any remarks given by the expert panel. The Cronbach's α was 0.80 and 0.88 for the knowledge and attitude subscales, respectively. The item-total correlations ranged from 0.21-0.48 for the knowledge section and from 0.09-0.68 for the attitude subscale. We conclude from our study that the Dutch version of the scale has acceptable to good psychometric properties. The Flemish adaptation therefore seems to be a valuable instrument for studying nursing staff's knowledge and attitudes towards aged sexuality in Flanders. © 2013 Blackwell Publishing Ltd.

  12. Secondary Psychometric Examination of the Dimensional Obsessive-Compulsive Scale: Classical Testing, Item Response Theory, and Differential Item Functioning.

    Science.gov (United States)

    Thibodeau, Michel A; Leonard, Rachel C; Abramowitz, Jonathan S; Riemann, Bradley C

    2015-12-01

    The Dimensional Obsessive-Compulsive Scale (DOCS) is a promising measure of obsessive-compulsive disorder (OCD) symptoms but has received minimal psychometric attention. We evaluated the utility and reliability of DOCS scores. The study included 832 students and 300 patients with OCD. Confirmatory factor analysis supported the originally proposed four-factor structure. DOCS total and subscale scores exhibited good to excellent internal consistency in both samples (α = .82 to α = .96). Patient DOCS total scores reduced substantially during treatment (t = 16.01, d = 1.02). DOCS total scores discriminated between students and patients (sensitivity = 0.76, 1 - specificity = 0.23). The measure did not exhibit gender-based differential item functioning as tested by Mantel-Haenszel chi-square tests. Expected response options for each item were plotted as a function of item response theory and demonstrated that DOCS scores incrementally discriminate OCD symptoms ranging from low to extremely high severity. Incremental differences in DOCS scores appear to represent unbiased and reliable differences in true OCD symptom severity. © The Author(s) 2014.

  13. Generalizability theory and item response theory

    NARCIS (Netherlands)

    Glas, Cornelis A.W.; Eggen, T.J.H.M.; Veldkamp, B.P.

    2012-01-01

    Item response theory is usually applied to items with a selected-response format, such as multiple choice items, whereas generalizability theory is usually applied to constructed-response tasks assessed by raters. However, in many situations, raters may use rating scales consisting of items with a

  14. Threats to Validity When Using Open-Ended Items in International Achievement Studies: Coding Responses to the PISA 2012 Problem-Solving Test in Finland

    Science.gov (United States)

    Arffman, Inga

    2016-01-01

    Open-ended (OE) items are widely used to gather data on student performance in international achievement studies. However, several factors may threaten validity when using such items. This study examined Finnish coders' opinions about threats to validity when coding responses to OE items in the PISA 2012 problem-solving test. A total of 6…

  15. Cross-cultural differences for adapting translated five-item version of International Index of Erectile Function: results of a Korean study.

    Science.gov (United States)

    Ku, Ja Hyeon; Park, Dal Woo; Kim, Soo Woong; Paick, Jae-Seung

    2005-06-01

    To assess whether the translated Korean version of the International Index of Erectile Function (IIEF-5) developed by Rosen et al. (RIIEF-5) may be adapted for a Korean population to have cross-cultural equivalency to the original version. A total of 151 patients with erectile dysfunction (ED) and 156 controls were prospectively studied. All the patients and controls had had sexual activity or attempted sexual intercourse within the 4-week period before completing the questionnaire. The Classification and Regression Trees program was used to select an optimal set of five items from the IIEF-15 (KIIEF-5) to discriminate between men with and without ED. Then, the optimal cutoff score for the diagnosis of ED was determined using the receiver operating characteristic curve. The optimal cutoff score, sensitivity, and specificity were also calculated using the RIIEF-5. The KIIEF-5 consisted, in order of importance, of items 15, 5, 13, 4, and 2 from the IIEF-15. Item 7 in the original RIIEF-5 was replaced with item 13 in the new KIIEF-5. The optimal cutoff score proved to be 21, with a corresponding sensitivity and specificity of 0.97 and 0.91, respectively. For the original RIIEF-5, the optimal cutoff score was 21 and the corresponding sensitivity and specificity was 0.94 and 0.90, respectively. Although the RIIEF-5 may be adapted for a Korean population, the KIIEF-5 can aid in decreasing the incidence of an incorrect diagnosis of ED and decreasing the number of undiagnosed cases of ED in this population. In addition, our findings suggest that the equivalence of psychometric properties does not imply cross-cultural equivalence.

  16. Reliability and validity of the International Spinal Cord Injury Basic Pain Data Set items as self-report measures

    DEFF Research Database (Denmark)

    Jensen, M P; Widerström-Noga, E; Richards, J S

    2010-01-01

    To evaluate the psychometric properties of a subset of International Spinal Cord Injury Basic Pain Data Set (ISCIBPDS) items that could be used as self-report measures in surveys, longitudinal studies and clinical trials....

  17. An empirical comparison of Item Response Theory and Classical Test Theory

    Directory of Open Access Journals (Sweden)

    Špela Progar

    2008-11-01

    Full Text Available Based on nonlinear models between the measured latent variable and the item response, item response theory (IRT enables independent estimation of item and person parameters and local estimation of measurement error. These properties of IRT are also the main theoretical advantages of IRT over classical test theory (CTT. Empirical evidence, however, often failed to discover consistent differences between IRT and CTT parameters and between invariance measures of CTT and IRT parameter estimates. In this empirical study a real data set from the Third International Mathematics and Science Study (TIMSS 1995 was used to address the following questions: (1 How comparable are CTT and IRT based item and person parameters? (2 How invariant are CTT and IRT based item parameters across different participant groups? (3 How invariant are CTT and IRT based item and person parameters across different item sets? The findings indicate that the CTT and the IRT item/person parameters are very comparable, that the CTT and the IRT item parameters show similar invariance property when estimated across different groups of participants, that the IRT person parameters are more invariant across different item sets, and that the CTT item parameters are at least as much invariant in different item sets as the IRT item parameters. The results furthermore demonstrate that, with regards to the invariance property, IRT item/person parameters are in general empirically superior to CTT parameters, but only if the appropriate IRT model is used for modelling the data.

  18. Validity and internal consistency of a whiplash-specific disability measure

    NARCIS (Netherlands)

    Pinfold, Melanie; Niere, Ken R.; O'Leary, Elizabeth F.; Hoving, Jan Lucas; Green, Sally; Buchbinder, Rachelle

    2004-01-01

    STUDY DESIGN: Cross-sectional study of patients with whiplash-associated disorders investigating the internal consistency, factor structure, response rates, and presence of floor and ceiling effects of the Whiplash Disability Questionnaire (WDQ). OBJECTIVES: The aim of this study was to confirm the

  19. Reliability and Concurrent Validity of the International Personality ...

    African Journals Online (AJOL)

    Reliability and Concurrent Validity of the International Personality item Pool (IPIP) Big-five Factor Markers in Nigeria. ... Nigerian Journal of Psychiatry ... Aims: The aim of this study was to assess the internal consistency and concurrent validity ...

  20. 26 CFR 301.6231(a)(3)-1 - Partnership items.

    Science.gov (United States)

    2010-04-01

    ... 26 Internal Revenue 18 2010-04-01 2010-04-01 false Partnership items. 301.6231(a)(3)-1 Section 301... Partnership items. (a) In general. For purposes of subtitle F of the Internal Revenue Code of 1954, the following items which are required to be taken into account for the taxable year of a partnership under...

  1. The Karen instruments for measuring quality of nursing care: construct validity and internal consistency.

    Science.gov (United States)

    Lindgren, Margareta; Andersson, Inger S

    2011-06-01

    Valid and reliable instruments for measuring the quality of care are needed for evaluation and improvement of nursing care. Previously developed and evaluated instruments, the Karen-patient and the Karen-personnel based on Donabedian's Structure-Process-Outcome triad (S-P-O triad) had promising content validity, discriminative power and internal consistency. The objective of this study was to further develop the instruments with regard to construct validity and internal consistency. This prospective study was carried out in medical and surgical wards at a hospital in Sweden. A total of 95 patients and 120 personnel were included. The instruments were tested for construct validity by performing factor analyses in two steps and for internal consistency using Cronbach's alpha coefficient. The first confirmatory factor analyses, with a pre-determined three-factor solution did not load well according to the S-P-O triad, but the second exploratory factor analysis with a six-factor solution appeared to be more coherent and the distribution of variables seemed to be logical. The reliability, i.e. internal consistency, was good in both factor analyses. The Karen-patient and the Karen-personnel instruments have achieved acceptable levels of construct validity. The internal consistency of the instruments is good. This indicates that the instruments may be suitable to use in clinical practice for measuring the quality of nursing care.

  2. Internal consistency of a Spanish translation of the Francis Scale of Attitude Toward Christianity Short Form.

    Science.gov (United States)

    Campo-Arias, Adalberto; Oviedo, Heidi Celina; Díaz, Carmen Elena; Cogollo, Zuleima

    2006-12-01

    This study evaluated the internal consistency of a Spanish version of the short form of the Francis Scale of Attitude Toward Christianity based on responses of 405 Colombian adolescent students ages 13 to 17 years. This translated short-form version of the scale had an internal consistency of .80. This estimate indicates suitable internal consistency reliability for research use in this population.

  3. [The Computer Book of the Internal Medicine resident: validity and reliability of a questionnaire for self-assessment of competences in internal medicine residents].

    Science.gov (United States)

    Oristrell, J; Casanovas, A; Jordana, R; Comet, R; Gil, M; Oliva, J C

    2012-12-01

    There are no simple and validated instruments for evaluating the training of specialists. To analyze the reliability and validity of a computerized self-assessment method to quantify the acquisition of medical competences during the Internal Medicine residency program. All residents of our department participated in the study during a period of 28 months. Twenty-two questionnaires specific for each rotation (the Computer-Book of the Internal Medicine Resident) were constructed with items (questions) corresponding to three competence domains: clinical skills competence, communication skills and teamwork. Reliability was analyzed by measuring the internal consistency of items in each competence domain using Cronbach's alpha index. Validation was performed by comparing mean scores in each competence domain between senior and junior residents. Cut-off levels of competence scores were established in order to identify the strengths and weaknesses of our training program. Finally, self-assessment values were correlated with the evaluations of the medical staff. There was a high internal consistency of the items of clinical skills competences, communication skills and teamwork. Higher scores of clinical skills competence and communication skills, but not in those of teamwork were observed in senior residents than in junior residents. The Computer-Book of the Internal Medicine Resident identified the strengths and weaknesses of our training program. We did not observe any correlation between the results of the self- evaluations and the evaluations made by staff physicians. The items of Computer-Book of the Internal Medicine Resident showed high internal consistency and made it possible to measure the acquisition of medical competences in a team of Internal Medicine residents. This self-assessment method should be complemented with other evaluation methods in order to assess the acquisition of medical competences by an individual resident. Copyright © 2012 Elsevier Espa

  4. [Factor analysis and internal consistency of pedagogical practices questionnaire among health care teachers].

    Science.gov (United States)

    Pérez V, Cristhian; Vaccarezza G, Giulietta; Aguilar A, César; Coloma N, Katherine; Salgado F, Horacio; Baquedano R, Marjorie; Chavarría R, Carla; Bastías V, Nancy

    2016-06-01

    Teaching practice is one of the most complex topics of the training process in medicine and other health care careers. The Teaching Practices Questionnaire (TPQ) evaluates teaching skills. To assess the factor structure and internal consistency of the Spanish version of the TPP among health care teachers. The TPQ was answered by 315 university teachers from 13 of the 15 administrative Chilean regions, who were selected through a non-probabilistic volunteer sampling. The internal consistency of TPP factors was calculated and the correlation between them was analyzed. Six factors were identified: Student-centered teaching, Teaching planning, Assessment process, Dialogue relationship, Teacher-centered teaching and Use of technological resources. They had Cronbach alphas ranging from 0.60 to 0.85. The factorial structure of TPQ differentiates the most important functions of teaching. It also shows a theoretical consistency and a practical relevance to perform a diagnosis and continuous evaluation of teaching practices. Additionally, it has an adequate internal consistency. Thus, TPQ is valid and reliable to evaluate pedagogical practices in health care careers.

  5. Generalizability theory and item response theory

    OpenAIRE

    Glas, Cornelis A.W.; Eggen, T.J.H.M.; Veldkamp, B.P.

    2012-01-01

    Item response theory is usually applied to items with a selected-response format, such as multiple choice items, whereas generalizability theory is usually applied to constructed-response tasks assessed by raters. However, in many situations, raters may use rating scales consisting of items with a selected-response format. This chapter presents a short overview of how item response theory and generalizability theory were integrated to model such assessments. Further, the precision of the esti...

  6. Item-level psychometrics of the ADL instrument of the Korean National Survey on persons with physical disabilities.

    Science.gov (United States)

    Hong, Ickpyo; Lee, Mi Jung; Kim, Moon Young; Park, Hae Yean

    2017-10-01

    The aim of this study is to investigate the psychometrics of the 12 items of an instrument assessing activities of daily living (ADL) using an item response theory model. A total of 648 adults with physical disabilities and having difficulties in ADLs were retrieved from the 2014 Korean National Survey on People with Disabilities. The psychometric testing included factor analysis, internal consistency, precision, and differential item functioning (DIF) across categories including sex, older age, marital status, and physical impairment area. The sample had a mean age of 69.7 years old (SD = 13.7). The majority of the sample had lower extremity impairments (62.0%) and had at least 2.1 chronic conditions. The instrument demonstrated unidimensional construct and good internal consistency (Cronbach's alpha = 0.95). The instrument precisely estimated person measures within a wide range of theta values (-2.22 logits  5.0%). Our findings indicate that the dressing item would need to be modified to improve its psychometrics. Overall, the ADL instrument demonstrates good psychometrics, and thus, it may be used as a standardized instrument for measuring disability in rehabilitation contexts. However, the findings are limited to adults with physical disabilities. Future studies should replicate psychometric testing for survey respondents with other disorders and for children.

  7. Differential item functioning analysis of the Vanderbilt Expertise Test for cars.

    Science.gov (United States)

    Lee, Woo-Yeol; Cho, Sun-Joo; McGugin, Rankin W; Van Gulick, Ana Beth; Gauthier, Isabel

    2015-01-01

    The Vanderbilt Expertise Test for cars (VETcar) is a test of visual learning for contemporary car models. We used item response theory to assess the VETcar and in particular used differential item functioning (DIF) analysis to ask if the test functions the same way in laboratory versus online settings and for different groups based on age and gender. An exploratory factor analysis found evidence of multidimensionality in the VETcar, although a single dimension was deemed sufficient to capture the recognition ability measured by the test. We selected a unidimensional three-parameter logistic item response model to examine item characteristics and subject abilities. The VETcar had satisfactory internal consistency. A substantial number of items showed DIF at a medium effect size for test setting and for age group, whereas gender DIF was negligible. Because online subjects were on average older than those tested in the lab, we focused on the age groups to conduct a multigroup item response theory analysis. This revealed that most items on the test favored the younger group. DIF could be more the rule than the exception when measuring performance with familiar object categories, therefore posing a challenge for the measurement of either domain-general visual abilities or category-specific knowledge.

  8. [Validity and internal consistency of the Maslach Burnout Inventory in Dental Students from Cartagena, Colombia].

    Science.gov (United States)

    Simancas-Pallares, Miguel Angel; Fortich Mesa, Natalia; González Martínez, Farith Damián

    To determine the internal consistency and content validity of the Maslach Burnout Inventory-Student Survey (MBI-SS) in dental students from Cartagena, Colombia. Scale validation study in 886 dental students from Cartagena, Colombia. Factor structure was determined through exploratory factor analysis (EFA) and confirmatory factor analysis (CFA). Internal consistency was measured using the Cronbach's alpha coefficient. Analyses were performed using the Stata v.13.2 for Windows (Statacorp., USA) and Mplus v.7.31 for Windows (Muthén & Muthén, USA) software. Internal consistency was α=.806. The factor structure showed three that accounted for the 56.6% of the variance. CFA revealed: χ 2 =926.036; df=85; RMSEA=.106 (90%CI, .100-.112); CFI=.947; TLI=.934. The MBI showed an adequate internal consistency and a factor structure being consistent with the original proposed structure with a poor fit, which does not reflect adequate content validity in this sample. Copyright © 2016 Asociación Colombiana de Psiquiatría. Publicado por Elsevier España. All rights reserved.

  9. Internal consistency and factor structure of the Portuguese version of the Liebowitz Social Anxiety Scale among alcoholic patients Consistência interna e estrutura fatorial da versão em português da Escala de Ansiedade Social de Liebowitz entre pacientes alcoolistas

    Directory of Open Access Journals (Sweden)

    Mauro B Terra

    2006-12-01

    Full Text Available OBJECTIVE: Liebowitz Social Anxiety Scale is an instrument used to evaluate the severity of social phobia. It has been widely used in different contexts and cultures, presenting variable psychometric properties. The objective of this article is to investigate the internal consistency and the factor structure of this scale. METHOD: In a sample of 300 alcoholic patients hospitalized in 3 mental clinics in Southern Brazil, 74 of them were social phobics (24.6%. The Structured Clinical Interview for DSM-IV-Axis I Disorders - Patient Edition, a semi-structured clinical interview based on DSM-IV, was used to check for the diagnosis of social phobia. The internal consistency was measured by Cronbach's alpha. Data were subjected to a factor analysis with the principal component method of parameter estimation. Questionnaire items loading at 0.35 or above were considered in the final factor solution. RESULTS: The coefficient of internal consistency was 0.95. All items showed corrected item-total correlation coefficient above 0.15, considered the minimum requested index. The factor analysis resulted in 5 dimensions which corresponded to 52.9% of the total variance. The five factors extracted were: factor I - speaking in a group, factor II - activity in public, factor III - social interaction with unknown person, factor IV - attitude of disagreement or disapproval and factor V - social interaction in leisure activity. CONCLUSIONS: The scale proved to be reliable and structurally valid instrument for use in a population of alcoholic patients. The possibility of screening for social phobia through the use of the instrument may be helpful in identifying probable cases of the disorder among alcoholics.OBJETIVO: A Escala de Ansiedade Social de Liebowitz é um instrumento utilizado na avaliação da gravidade da fobia social. Tem sido amplamente usada em diferentes contextos e culturas, apresentando propriedades psicométricas variadas. O objetivo do artigo

  10. Construct validity and internal consistency in the Leisure Practices Scale (EPL) for adults.

    Science.gov (United States)

    Andrade, Rubian Diego; Schwartz, Gisele Maria; Tavares, Giselle Helena; Pelegrini, Andreia; Teixeira, Clarissa Stefani; Felden, Érico Pereira Gomes

    2018-02-01

    This study proposes and analyzes the construct validity and internal consistency of the Leisure Practices Scale (EPL). This survey seeks to identify the preferences and involvement in in different leisure practices in adults. The instrument was formed based on the cultural leisure content (artistic, manual, physical, sports, intellectual, social, tourist, virtual and contemplation/leisure). The validation process was conducted with: a) content analysis by leisure experts, who evaluated the instrument for clarity of language and practical relevance, which allowed the calculation of the content validity coefficient (CVC); b) reproducibility test-retest with 51 subjects to calculate the temporal variation coefficient; c) internal consistency analysis with 885 participants. The evaluation presented appropriate coefficients, both with respect to language clarity (CVCt = 0.883) and practical relevance (CVCt = 0.879). The reproducibility coefficients were moderate to excellent. The scale showed adequate internal consistency (0.72). The EPL has psychometric quality and acceptable values in its structure, and can be used to investigate adult involvement in leisure activities.

  11. Validation of a mobility item bank for older patients in primary care.

    Science.gov (United States)

    Cabrero-García, Julio; Ramos-Pichardo, Juan Diego; Muñoz-Mendoza, Carmen Luz; Cabañero-Martínez, María José; González-Llopis, Lorena; Reig-Ferrer, Abilio

    2012-12-05

    To develop and validate an item bank to measure mobility in older people in primary care and to analyse differential item functioning (DIF) and differential bundle functioning (DBF) by sex. A pool of 48 mobility items was administered by interview to 593 older people attending primary health care practices. The pool contained four domains based on the International Classification of Functioning: changing and maintaining body position, carrying, lifting and pushing, walking and going up and down stairs. The Late Life Mobility item bank consisted of 35 items, and measured with a reliability of 0.90 or more across the full spectrum of mobility, except at the higher end of better functioning. No evidence was found of non-uniform DIF but uniform DIF was observed, mainly for items in the changing and maintaining body position and carrying, lifting and pushing domains. The walking domain did not display DBF, but the other three domains did, principally the carrying, lifting and pushing items. During the design and validation of an item bank to measure mobility in older people, we found that strength (carrying, lifting and pushing) items formed a secondary dimension that produced DBF. More research is needed to determine how best to include strength items in a mobility measure, or whether it would be more appropriate to design separate measures for each construct.

  12. Clinical Validation of the Nursing Outcome "Swallowing Status" in People with Stroke: Analysis According to the Classical and Item Response Theories.

    Science.gov (United States)

    Oliveira-Kumakura, Ana Railka de Souza; de Araujo, Thelma Leite; Costa, Alice Gabrielle de Sousa; Cavalcante, Tahissa Frota; Lopes, Marcos Venícios de Oliveira; Carvalho, Emilia Campos

    2017-09-19

    To validate clinically the nursing outcome "Swallowing status". The adjustment of the nursing outcome was investigated according to the Classical and Item Response Theories. The models were compared regarding information loss, goodness-of-fit, and differential item functioning. Stability and internal consistency were examined. The nursing outcome has the best fit in the generalized partial credit model with different discrimination parameters. Strong correlations among the scores of each indicator were observed. There was no differential item functioning of the outcome indicators. The scale presented high internal consistency (Cronbach's α = .954) and stability (and > .800). This study presents a valid nursing outcome. Most accurate monitoring of sensitivity to an intervention. Validar clinicamente o resultado de enefermagem "Estado da Deglutição". MÉTODOS: O ajustamento do resultado foi investigado de acordo com as teorias Clássica e de Resposta ao Item. Os modelos foram comparados assumindo parâmetros de itens cruzados de igual discriminação. Investigaram-se as propriedades de bondade do ajuste, funcionamento diferencial dos itens, estabilidade e consistência interna. O resultado se ajustou melhor a partir do Modelo de crédito parcial generalizado, o qual demonstrou unidimensionalidade do resultado e forte correlação entre os escores de cada indicador. Não houve funcionamento diferencial dos indicadores. A consistência interna para a escala global (Cronbach's α = .954) e a estabilidade (>.800) mantiveram-se elevadas. CONCLUSÃO: O estudo apresenta um resultado de enfermagem válido. RELEVÂNCIA PARA A PRÁTICA CLÍNICA: Maior acurácia para monitorar a sensibilidade da intervenção. © 2017 NANDA International, Inc.

  13. 26 CFR 301.6501(o)-3 - Partnership items.

    Science.gov (United States)

    2010-04-01

    ... 26 Internal Revenue 18 2010-04-01 2010-04-01 false Partnership items. 301.6501(o)-3 Section 301... § 301.6501(o)-3 Partnership items. (a) Partnership item defined. For purposes of section 6501(o) (as it..., and § 301.6511(g)-1, the term “partnership item” means— (1) Any item required to be taken into account...

  14. Internal Consistency and Convergent Validity of the Klontz Money Behavior Inventory (KMBI

    Directory of Open Access Journals (Sweden)

    Colby D. Taylor

    2015-12-01

    Full Text Available The Klontz Money Behavior Inventory (KMBI is a standalone, multi-scale measure than can screen for the presence of eight distinct money disorders. Given the well-established relationship between mental health and financial behaviors, results from the KMBI can be used to inform both mental health care professionals and financial planners. The present study examined the internal consistency and convergent validity of the KMBI, through comparison with similar measures, among a sample of college students (n = 232. Results indicate that the KMBI demonstrates acceptable internal consistency reliability and some convergence for most subscales when compared to other analogous measures. These findings highlight a need for literature and assessments to identify and describe disordered money behaviors.

  15. The Multitheoretical List of Therapeutic Interventions - 30 items (MULTI-30).

    Science.gov (United States)

    Solomonov, Nili; McCarthy, Kevin S; Gorman, Bernard S; Barber, Jacques P

    2018-01-16

    To develop a brief version of the Multitheoretical List of Therapeutic Interventions (MULTI-60) in order to decrease completion time burden by approximately half, while maintaining content coverage. Study 1 aimed to select 30 items. Study 2 aimed to examine the reliability and internal consistency of the MULTI-30. Study 3 aimed to validate the MULTI-30 and ensure content coverage. In Study 1, the sample included 186 therapist and 255 patient MULTI ratings, and 164 ratings of sessions coded by trained observers. Internal consistency (Chronbach's alpha and McDonald's omega) was calculated and confirmatory factor analysis was conducted. Psychotherapy experts rated content relevance. Study 2 included a sample of 644 patient and 522 therapist ratings, and 793 codings of psychotherapy sessions. In Study 3, the sample included 33 codings of sessions. A series of regression analyses was conducted to examine replication of previously published findings using the MULTI-30. The MULTI-30 was found valid, reliable, and internally consistent across 2564 ratings examined across the three studies presented. The MULTI-30 a brief and reliable process measure. Future studies are required for further validation.

  16. Linking Existing Instruments to Develop an Activity of Daily Living Item Bank.

    Science.gov (United States)

    Li, Chih-Ying; Romero, Sergio; Bonilha, Heather S; Simpson, Kit N; Simpson, Annie N; Hong, Ickpyo; Velozo, Craig A

    2018-03-01

    This study examined dimensionality and item-level psychometric properties of an item bank measuring activities of daily living (ADL) across inpatient rehabilitation facilities and community living centers. Common person equating method was used in the retrospective veterans data set. This study examined dimensionality, model fit, local independence, and monotonicity using factor analyses and fit statistics, principal component analysis (PCA), and differential item functioning (DIF) using Rasch analysis. Following the elimination of invalid data, 371 veterans who completed both the Functional Independence Measure (FIM) and minimum data set (MDS) within 6 days were retained. The FIM-MDS item bank demonstrated good internal consistency (Cronbach's α = .98) and met three rating scale diagnostic criteria and three of the four model fit statistics (comparative fit index/Tucker-Lewis index = 0.98, root mean square error of approximation = 0.14, and standardized root mean residual = 0.07). PCA of Rasch residuals showed the item bank explained 94.2% variance. The item bank covered the range of θ from -1.50 to 1.26 (item), -3.57 to 4.21 (person) with person strata of 6.3. The findings indicated the ADL physical function item bank constructed from FIM and MDS measured a single latent trait with overall acceptable item-level psychometric properties, suggesting that it is an appropriate source for developing efficient test forms such as short forms and computerized adaptive tests.

  17. Reliability, Dimensionality, and Internal Consistency as Defined by Cronbach: Distinct Albeit Related Concepts

    Science.gov (United States)

    Davenport, Ernest C.; Davison, Mark L.; Liou, Pey-Yan; Love, Quintin U.

    2015-01-01

    This article uses definitions provided by Cronbach in his seminal paper for coefficient a to show the concepts of reliability, dimensionality, and internal consistency are distinct but interrelated. The article begins with a critique of the definition of reliability and then explores mathematical properties of Cronbach's a. Internal consistency…

  18. Internal Consistency, Retest Reliability, and their Implications For Personality Scale Validity

    Science.gov (United States)

    McCrae, Robert R.; Kurtz, John E.; Yamagata, Shinji; Terracciano, Antonio

    2010-01-01

    We examined data (N = 34,108) on the differential reliability and validity of facet scales from the NEO Inventories. We evaluated the extent to which (a) psychometric properties of facet scales are generalizable across ages, cultures, and methods of measurement; and (b) validity criteria are associated with different forms of reliability. Composite estimates of facet scale stability, heritability, and cross-observer validity were broadly generalizable. Two estimates of retest reliability were independent predictors of the three validity criteria; none of three estimates of internal consistency was. Available evidence suggests the same pattern of results for other personality inventories. Internal consistency of scales can be useful as a check on data quality, but appears to be of limited utility for evaluating the potential validity of developed scales, and it should not be used as a substitute for retest reliability. Further research on the nature and determinants of retest reliability is needed. PMID:20435807

  19. Impact of Alzheimer's Disease on Caregiver Questionnaire: internal consistency, convergent validity, and test-retest reliability of a new measure for assessing caregiver burden.

    Science.gov (United States)

    Cole, Jason C; Ito, Diane; Chen, Yaozhu J; Cheng, Rebecca; Bolognese, Jennifer; Li-McLeod, Josephine

    2014-09-04

    unidimensional, Web-based measure of AD caregiver burden and is supported by strong model fit statistics from CFA, high degree of item-level reliability, good internal consistency, moderate test-retest reliability, and moderate convergent validity. Additional validation of the IADCQ is warranted to ensure invariance between the paper-based and Web-based administration and to determine an appropriate responder definition.

  20. Validation of the International Index of Erectile Function (IIFE) for Use in Brazil

    International Nuclear Information System (INIS)

    Gonzáles, Ana Inês; Sties, Sabrina Weiss; Wittkopf, Priscilla Geraldine; Mara, Lourenço Sampaio de; Ulbrich, Anderson Zampier; Cardoso, Fernando Luiz; Carvalho, Tales de

    2013-01-01

    The International Index of Erectile Function has been proposed as a method for assessing sexual function assisting the diagnosis and classification of erectile dysfunction. However, IIEF was not validated for the Portuguese language. Validate the International Index of Erectile Function in patients with cardiopulmonary and metabolic diseases. The sample consisted of 108 participants of to Cardiopulmonary and Metabolic program Rehabilitation (CPMR) in southern Brazil. The clarity assessment of the instrument was performed using a scale ranging from zero to 10. The construct validity was carried out by confirmatory factor analysis (KMO = 0.85; Barllet p < 0.001), internal consistency by Cronbach's alpha and reproducibility and interrater reliability via the test retest method. The items were considered very clear with averages superior to 9. The internal consistency resulted in 0.89. The majority of items related correctly with their domains, with exception of three questions from sexual satisfaction domain, and one from erectile function. All items showed excellent stability of measure and substantial to almost perfect agreement. The present study showed that the IIEF is valid and reliable for use in participants of a cardiopulmonary and metabolic rehabilitation program

  1. Validation of the International Index of Erectile Function (IIFE) for Use in Brazil

    Energy Technology Data Exchange (ETDEWEB)

    Gonzáles, Ana Inês; Sties, Sabrina Weiss; Wittkopf, Priscilla Geraldine, E-mail: sabrinasties@yahoo.com.br; Mara, Lourenço Sampaio de; Ulbrich, Anderson Zampier; Cardoso, Fernando Luiz; Carvalho, Tales de [Universidade do Estado de Santa Catarina, Florianópolis, SC (Brazil)

    2013-08-15

    The International Index of Erectile Function has been proposed as a method for assessing sexual function assisting the diagnosis and classification of erectile dysfunction. However, IIEF was not validated for the Portuguese language. Validate the International Index of Erectile Function in patients with cardiopulmonary and metabolic diseases. The sample consisted of 108 participants of to Cardiopulmonary and Metabolic program Rehabilitation (CPMR) in southern Brazil. The clarity assessment of the instrument was performed using a scale ranging from zero to 10. The construct validity was carried out by confirmatory factor analysis (KMO = 0.85; Barllet p < 0.001), internal consistency by Cronbach's alpha and reproducibility and interrater reliability via the test retest method. The items were considered very clear with averages superior to 9. The internal consistency resulted in 0.89. The majority of items related correctly with their domains, with exception of three questions from sexual satisfaction domain, and one from erectile function. All items showed excellent stability of measure and substantial to almost perfect agreement. The present study showed that the IIEF is valid and reliable for use in participants of a cardiopulmonary and metabolic rehabilitation program.

  2. Psychometric Validation of the World Health Organization Disability Assessment Schedule 2.0-Twelve-Item Version in Persons with Spinal Cord Injuries

    Science.gov (United States)

    Smedema, Susan Miller; Ruiz, Derek; Mohr, Michael J.

    2017-01-01

    Purpose: To evaluate the factorial and concurrent validity and internal consistency reliability of the World Health Organization Disability Assessment Schedule 2.0 (WHODAS 2.0) 12-item version in persons with spinal cord injuries. Method: Two hundred forty-seven adults with spinal cord injuries completed an online survey consisting of the WHODAS…

  3. The internal consistency of the standard gamble: tests after adjusting for prospect theory.

    Science.gov (United States)

    Oliver, Adam

    2003-07-01

    This article reports a study that tests whether the internal consistency of the standard gamble can be improved upon by incorporating loss weighting and probability transformation parameters in the standard gamble valuation procedure. Five alternatives to the standard EU formulation are considered: (1) probability transformation within an EU framework; and, within a prospect theory framework, (2) loss weighting and full probability transformation, (3) no loss weighting and full probability transformation, (4) loss weighting and no probability transformation, and (5) loss weighting and partial probability transformation. Of the five alternatives, only the prospect theory formulation with loss weighting and no probability transformation offers an improvement in internal consistency over the standard EU valuation procedure.

  4. Reliability and validity of the Spanish version of the 10-item Connor-Davidson Resilience Scale (10-item CD-RISC in young adults

    Directory of Open Access Journals (Sweden)

    García-Campayo Javier

    2011-08-01

    Full Text Available Abstract Background The 10-item Connor-Davidson Resilience Scale (10-item CD-RISC is an instrument for measuring resilience that has shown good psychometric properties in its original version in English. The aim of this study was to evaluate the validity and reliability of the Spanish version of the 10-item CD-RISC in young adults and to verify whether it is structured in a single dimension as in the original English version. Findings Cross-sectional observational study including 681 university students ranging in age from 18 to 30 years. The number of latent factors in the 10 items of the scale was analyzed by exploratory factor analysis. Confirmatory factor analysis was used to verify whether a single factor underlies the 10 items of the scale as in the original version in English. The convergent validity was analyzed by testing whether the mean of the scores of the mental component of SF-12 (MCS and the quality of sleep as measured with the Pittsburgh Sleep Index (PSQI were higher in subjects with better levels of resilience. The internal consistency of the 10-item CD-RISC was estimated using the Cronbach α test and test-retest reliability was estimated with the intraclass correlation coefficient. The Cronbach α coefficient was 0.85 and the test-retest intraclass correlation coefficient was 0.71. The mean MCS score and the level of quality of sleep in both men and women were significantly worse in subjects with lower resilience scores. Conclusions The Spanish version of the 10-item CD-RISC showed good psychometric properties in young adults and thus can be used as a reliable and valid instrument for measuring resilience. Our study confirmed that a single factor underlies the resilience construct, as was the case of the original scale in English.

  5. Internal validity of a household food security scale is consistent among diverse populations participating in a food supplement program in Colombia.

    Science.gov (United States)

    Hackett, Michelle; Melgar-Quinonez, Hugo; Uribe, Martha C Alvarez

    2008-05-23

    We assessed the validity of a locally adapted Colombian Household Food Security Scale (CHFSS) used as a part of the 2006 evaluation of the food supplement component of the Plan for Improving Food and Nutrition in Antioquia, Colombia (MANA - Plan Departamental de Seguridad Alimentaria y Nutricional de Antioquia). Subjects included low-income families with pre-school age children in MANA that responded affirmatively to at least one CHFSS item (n = 1,319). Rasch Modeling was used to evaluate the psychometric characteristics of the items through measure and INFIT values. Differences in CHFSS performance were assessed by area of residency, socioeconomic status and number of children enrolled in MANA. Unidimensionality of a scale by group was further assessed using Differential Item Functioning (DIF). Most CHFSS items presented good fitness with most INFIT values within the adequate range of 0.8 to 1.2. Consistency in item measure values between groups was found for all but two items in the comparison by area of residency. Only two adult items exhibited DIF between urban and rural households. The results indicate that the adapted CHFSS is a valid tool to assess the household food security of participants in food assistance programs like MANA.

  6. Assessment of the psychometrics of a PROMIS item bank: self-efficacy for managing daily activities.

    Science.gov (United States)

    Hong, Ickpyo; Velozo, Craig A; Li, Chih-Ying; Romero, Sergio; Gruber-Baldini, Ann L; Shulman, Lisa M

    2016-09-01

    The aim of this study is to investigate the psychometrics of the Patient-Reported Outcomes Measurement Information System self-efficacy for managing daily activities item bank. The item pool was field tested on a sample of 1087 participants via internet (n = 250) and in-clinic (n = 837) surveys. All participants reported having at least one chronic health condition. The 35 item pool was investigated for dimensionality (confirmatory factor analyses, CFA and exploratory factor analysis, EFA), item-total correlations, local independence, precision, and differential item functioning (DIF) across gender, race, ethnicity, age groups, data collection modes, and neurological chronic conditions (McFadden Pseudo R (2) less than 10 %). The item pool met two of the four CFA fit criteria (CFI = 0.952 and SRMR = 0.07). EFA analysis found a dominant first factor (eigenvalue = 24.34) and the ratio of first to second eigenvalue was 12.4. The item pool demonstrated good item-total correlations (0.59-0.85) and acceptable internal consistency (Cronbach's alpha = 0.97). The item pool maintained its precision (reliability over 0.90) across a wide range of theta (3.70), and there was no significant DIF. The findings indicated the item pool has sound psychometric properties and the test items are eligible for development of computerized adaptive testing and short forms.

  7. Quality of life in infants and children with atopic dermatitis: Addressing issues of differential item functioning across countries in multinational clinical trials

    Directory of Open Access Journals (Sweden)

    Tennant Alan

    2007-07-01

    Full Text Available Abstract Background A previous study had identified 45 items assessing the impact of atopic dermatitis (AD on the whole family. From these it was intended to develop two separate scales, one assessing impact on carers and the other determining the effect on the child. Methods The 45 items were included in three clinical trials designed to test the efficacy of a new topical treatment (pimecrolimus, Elidel cream 1% in the treatment of AD in infants and children and in validation studies in the UK, US, Germany, France and the Netherlands. Rasch analyses were undertaken to determine whether an internationally valid, unidimensional scale could be developed that would inform on the direct impact of AD on the child. Results Rasch analyses applied to the data from the trials indicated that the draft measure consisted of two scales, one assessing the QoL of the carer and the other (consisting of 12 items measuring the impact of AD on the child. Three of the 12 potential items failed to fit the measurement model in Europe and five in the US. In addition, four items exhibiting differential item functioning (DIF by country were identified. After removing the misfitting items and controlling for DIF it was possible to derive a scale; The Childhood Impact of Atopic Dermatitis (CIAD with good item fit for each trial analysis. Analysis of the validation data from each of the different countries confirmed that the CIAD had adequate internal consistency, reproducibility and construct validity. The CIAD demonstrated the benefits of treatment with Elidel over placebo in the European trial. A similar (non-significant trend was found for the US trials. Conclusion The study represents a novel method of dealing with the problem of DIF associated with different cultures. Such problems are likely to arise in any multinational study involving patient-reported outcome measures, as items in the scales are likely to be valued differently in different cultures. However, where

  8. Assessing motivation for work environment improvements: internal consistency, reliability and factorial structure.

    Science.gov (United States)

    Hedlund, Ann; Ateg, Mattias; Andersson, Ing-Marie; Rosén, Gunnar

    2010-04-01

    Workers' motivation to actively take part in improvements to the work environment is assumed to be important for the efficiency of investments for that purpose. That gives rise to the need for a tool to measure this motivation. A questionnaire to measure motivation for improvements to the work environment has been designed. Internal consistency and test-retest reliability of the domains of the questionnaire have been measured, and the factorial structure has been explored, from the answers of 113 employees. The internal consistency is high (0.94), as well as the correlation for the total score (0.84). Three factors are identified accounting for 61.6% of the total variance. The questionnaire can be a useful tool in improving intervention methods. The expectation is that the tool can be useful, particularly with the aim of improving efficiency of companies' investments for work environment improvements. Copyright 2010 Elsevier Ltd. All rights reserved.

  9. Studies on the consistency of internally taken contrast medium for pancreas CT

    Energy Technology Data Exchange (ETDEWEB)

    Matsushima, Kishio; Mimura, Seiichi; Tahara, Seiji; Kitayama, Takuichi; Inamura, Keiji; Mikami, Yasutaka; Hashimoto, Keiji; Hiraki, Yoshio; Aono, Kaname

    1985-02-01

    A problem of Pancreatic CT scanning is the discrimination between the pancreas and the adjacent gastrointestinal tract. Generally we administer a dilution of gastrografin internally to make the discrimination. The degree of dilution has been decided by experience at each hospital. When the consistency of the contrast medium is low in density, an enhancement effect cannot be expected, but when the consistency is high, artifacts appear. We have experimented on the degree of the dilution and CT-No to decide the optimum consistency of gastrografin for the diagnosis of pancreatic disease. Statistical analysis of the results show the optimum dilution of gastrografin to be 1.5%.

  10. Equivalência semântica e avaliação da consistência interna da versão em português do Sociocultural Attitudes Towards Appearance Questionnaire-3 (SATAQ-3 Semantic equivalence and internal consistency of the Brazilian Portuguese version of the Sociocultural Attitudes Towards Appearance Questionnaire-3 (SATAQ-3

    Directory of Open Access Journals (Sweden)

    Ana Carolina Soares Amaral

    2011-08-01

    Full Text Available O objetivo do estudo foi descrever o processo de adaptação transcultural do Sociocultural Attitudes Towards Appearance Questionnaire-3 (SATAQ-3 para a língua portuguesa. A metodologia foi baseada nas etapas de (1 tradução do questionário para o português; (2 retrotradução para o inglês; (3 comitê de peritos para construção da primeira versão; (4 avaliação da compreensão verbal por especialistas e por uma amostra da população-alvo; (5 análise da consistência interna do instrumento a partir do alfa de Cronbach. O instrumento foi traduzido para o português e a versão final contou com os 30 itens do instrumento original. Todos os itens foram interpretados como de fácil compreensão, tanto por especialistas quanto pela população-alvo. Os valores de consistência interna foram satisfatórios, sendo de 0,91 para toda a escala. O instrumento encontra-se traduzido e adaptado para o português, com evidências de boa compreensão e consistência interna, sendo ainda necessária a avaliação de sua equivalência de mensuração, validade externa e reprodutibilidade.This study aimed to describe the cross-cultural adaptation of the Sociocultural Attitudes Towards Appearance Questionnaire-3 (SATAQ-3 into Brazilian Portuguese. The methodology involved the following stages: (1 translation of the questionnaire into Portuguese; (2 back-translation into English; (3 meeting with experts to prepare a draft version; (4 assessment of verbal understanding of the draft by experts and by a sample of the target population; and (5 analysis of the tool's internal consistency, using Cronbach's alpha. The questionnaire was translated into Portuguese, and the scale's final version included 30 items, as in the original. Both the experts and target population members assessed all the items as easy to understand. Internal consistency was satisfactory, reaching 0.91 for the scale as a whole. The questionnaire has now been translated and adapted into

  11. Internal consistency of the CHAMPS physical activity questionnaire for Spanish speaking older adults.

    Science.gov (United States)

    Rosario, Martín G; Vázquez, Jenniffer M; Cruz, Wanda I; Ortiz, Alexis

    2008-09-01

    The Community Healthy Activities Model Program for Seniors (CHAMPS) is a physical activity monitoring questionnaire for people between 65 to 90 years old. This questionnaire has been previously translated to Spanish to be used in the Latin American population. To adapt the Spanish version of the CHAMPS questionnaire to Puerto Rico and assess its internal consistency. An external review committee adapted the existent Spanish version of the CHAMPS to be used in the Puerto Rican population. Three older adults participated in a second phase with the purpose of training the research team. After the second phase, 35 older adults participated in a third content adaptation phase. During the third phase, the preliminary Spanish version for Puerto Rico of the CHAMPS was given to the 35 participants to assess for clarity, vocabulary and understandability. Interviews to each participant in the third phase were carried out to obtain feedback and create a final Spanish version of the CHAMPS for Puerto Rico. After analyses of this phase, the external review committee prepared a final Spanish version of the CHAMPS for Puerto Rico. The final version was administered to 15 older adults (76 +/- 6.5 years) to assess the internal consistency by using Cronbach's Alpha analysis. The questionnaire showed a strong internal consistency of 0.76. The total time to answer the questionnaire was 17.4 minutes. The Spanish version of the CHAMPS questionnaire for Puerto Rico suggested being an easy to administer and consistent measurement tool to assess physical activity in older adults.

  12. Investigating the Impact of Item Parameter Drift for Item Response Theory Models with Mixture Distributions

    Directory of Open Access Journals (Sweden)

    Yoon Soo ePark

    2016-02-01

    Full Text Available This study investigates the impact of item parameter drift (IPD on parameter and ability estimation when the underlying measurement model fits a mixture distribution, thereby violating the item invariance property of unidimensional item response theory (IRT models. An empirical study was conducted to demonstrate the occurrence of both IPD and an underlying mixture distribution using real-world data. Twenty-one trended anchor items from the 1999, 2003, and 2007 administrations of Trends in International Mathematics and Science Study (TIMSS were analyzed using unidimensional and mixture IRT models. TIMSS treats trended anchor items as invariant over testing administrations and uses pre-calibrated item parameters based on unidimensional IRT. However, empirical results showed evidence of two latent subgroups with IPD. Results showed changes in the distribution of examinee ability between latent classes over the three administrations. A simulation study was conducted to examine the impact of IPD on the estimation of ability and item parameters, when data have underlying mixture distributions. Simulations used data generated from a mixture IRT model and estimated using unidimensional IRT. Results showed that data reflecting IPD using mixture IRT model led to IPD in the unidimensional IRT model. Changes in the distribution of examinee ability also affected item parameters. Moreover, drift with respect to item discrimination and distribution of examinee ability affected estimates of examinee ability. These findings demonstrate the need to caution and evaluate IPD using a mixture IRT framework to understand its effect on item parameters and examinee ability.

  13. Investigating the Impact of Item Parameter Drift for Item Response Theory Models with Mixture Distributions.

    Science.gov (United States)

    Park, Yoon Soo; Lee, Young-Sun; Xing, Kuan

    2016-01-01

    This study investigates the impact of item parameter drift (IPD) on parameter and ability estimation when the underlying measurement model fits a mixture distribution, thereby violating the item invariance property of unidimensional item response theory (IRT) models. An empirical study was conducted to demonstrate the occurrence of both IPD and an underlying mixture distribution using real-world data. Twenty-one trended anchor items from the 1999, 2003, and 2007 administrations of Trends in International Mathematics and Science Study (TIMSS) were analyzed using unidimensional and mixture IRT models. TIMSS treats trended anchor items as invariant over testing administrations and uses pre-calibrated item parameters based on unidimensional IRT. However, empirical results showed evidence of two latent subgroups with IPD. Results also showed changes in the distribution of examinee ability between latent classes over the three administrations. A simulation study was conducted to examine the impact of IPD on the estimation of ability and item parameters, when data have underlying mixture distributions. Simulations used data generated from a mixture IRT model and estimated using unidimensional IRT. Results showed that data reflecting IPD using mixture IRT model led to IPD in the unidimensional IRT model. Changes in the distribution of examinee ability also affected item parameters. Moreover, drift with respect to item discrimination and distribution of examinee ability affected estimates of examinee ability. These findings demonstrate the need to caution and evaluate IPD using a mixture IRT framework to understand its effects on item parameters and examinee ability.

  14. Recommended core items to assess e-cigarette use in population-based surveys.

    Science.gov (United States)

    Pearson, Jennifer L; Hitchman, Sara C; Brose, Leonie S; Bauld, Linda; Glasser, Allison M; Villanti, Andrea C; McNeill, Ann; Abrams, David B; Cohen, Joanna E

    2018-05-01

    A consistent approach using standardised items to assess e-cigarette use in both youth and adult populations will aid cross-survey and cross-national comparisons of the effect of e-cigarette (and tobacco) policies and improve our understanding of the population health impact of e-cigarette use. Focusing on adult behaviour, we propose a set of e-cigarette use items, discuss their utility and potential adaptation, and highlight e-cigarette constructs that researchers should avoid without further item development. Reliable and valid items will strengthen the emerging science and inform knowledge synthesis for policy-making. Building on informal discussions at a series of international meetings of 65 experts from 15 countries, the authors provide recommendations for assessing e-cigarette use behaviour, relative perceived harm, device type, presence of nicotine, flavours and reasons for use. We recommend items assessing eight core constructs: e-cigarette ever use, frequency of use and former daily use; relative perceived harm; device type; primary flavour preference; presence of nicotine; and primary reason for use. These items should be standardised or minimally adapted for the policy context and target population. Researchers should be prepared to update items as e-cigarette device characteristics change. A minimum set of e-cigarette items is proposed to encourage consensus around items to allow for cross-survey and cross-jurisdictional comparisons of e-cigarette use behaviour. These proposed items are a starting point. We recognise room for continued improvement, and welcome input from e-cigarette users and scientific colleagues. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  15. Internal validity of a household food security scale is consistent among diverse populations participating in a food supplement program in Colombia

    Directory of Open Access Journals (Sweden)

    Melgar-Quinonez Hugo

    2008-05-01

    Full Text Available Abstract Objective We assessed the validity of a locally adapted Colombian Household Food Security Scale (CHFSS used as a part of the 2006 evaluation of the food supplement component of the Plan for Improving Food and Nutrition in Antioquia, Colombia (MANA – Plan Departamental de Seguridad Alimentaria y Nutricional de Antioquia. Methods Subjects included low-income families with pre-school age children in MANA that responded affirmatively to at least one CHFSS item (n = 1,319. Rasch Modeling was used to evaluate the psychometric characteristics of the items through measure and INFIT values. Differences in CHFSS performance were assessed by area of residency, socioeconomic status and number of children enrolled in MANA. Unidimensionality of a scale by group was further assessed using Differential Item Functioning (DIF. Results Most CHFSS items presented good fitness with most INFIT values within the adequate range of 0.8 to 1.2. Consistency in item measure values between groups was found for all but two items in the comparison by area of residency. Only two adult items exhibited DIF between urban and rural households. Conclusion The results indicate that the adapted CHFSS is a valid tool to assess the household food security of participants in food assistance programs like MANA.

  16. Differential Item Functioning in While-Listening Performance Tests: The Case of the International English Language Testing System (IELTS) Listening Module

    Science.gov (United States)

    Aryadoust, Vahid

    2012-01-01

    This article investigates a version of the International English Language Testing System (IELTS) listening test for evidence of differential item functioning (DIF) based on gender, nationality, age, and degree of previous exposure to the test. Overall, the listening construct was found to be underrepresented, which is probably an important cause…

  17. Psychometric evaluation of an item bank for computerized adaptive testing of the EORTC QLQ-C30 cognitive functioning dimension in cancer patients

    DEFF Research Database (Denmark)

    Dirven, Linda; Groenvold, Mogens; Taphoorn, Martin J. B.

    2017-01-01

    on the field-testing and psychometric evaluation of the item bank for cognitive functioning (CF). METHODS: In previous phases (I-III), 44 candidate items were developed measuring CF in cancer patients. In phase IV, these items were psychometrically evaluated in a large sample of international cancer patients...... model, showing an acceptable fit. Although several items showed DIF, these had a negligible impact on CF estimation. Measurement precision of the item bank was much higher than the two original QLQ-C30 CF items alone, across the whole continuum. Moreover, CAT measurement may on average reduce study...... sample sizes with about 35-40% compared to the original QLQ-C30 CF scale, without loss of power. CONCLUSION: A CF item bank for CAT measurement consisting of 34 items was established, applicable to various cancer patients across countries. This CAT measurement system will facilitate precise and efficient...

  18. Memory for Items and Relationships among Items Embedded in Realistic Scenes: Disproportionate Relational Memory Impairments in Amnesia

    Science.gov (United States)

    Hannula, Deborah E.; Tranel, Daniel; Allen, John S.; Kirchhoff, Brenda A.; Nickel, Allison E.; Cohen, Neal J.

    2014-01-01

    Objective The objective of this study was to examine the dependence of item memory and relational memory on medial temporal lobe (MTL) structures. Patients with amnesia, who either had extensive MTL damage or damage that was relatively restricted to the hippocampus, were tested, as was a matched comparison group. Disproportionate relational memory impairments were predicted for both patient groups, and those with extensive MTL damage were also expected to have impaired item memory. Method Participants studied scenes, and were tested with interleaved two-alternative forced-choice probe trials. Probe trials were either presented immediately after the corresponding study trial (lag 1), five trials later (lag 5), or nine trials later (lag 9) and consisted of the studied scene along with a manipulated version of that scene in which one item was replaced with a different exemplar (item memory test) or was moved to a new location (relational memory test). Participants were to identify the exact match of the studied scene. Results As predicted, patients were disproportionately impaired on the test of relational memory. Item memory performance was marginally poorer among patients with extensive MTL damage, but both groups were impaired relative to matched comparison participants. Impaired performance was evident at all lags, including the shortest possible lag (lag 1). Conclusions The results are consistent with the proposed role of the hippocampus in relational memory binding and representation, even at short delays, and suggest that the hippocampus may also contribute to successful item memory when items are embedded in complex scenes. PMID:25068665

  19. Development and evaluation of CAHPS survey items assessing how well healthcare providers address health literacy.

    Science.gov (United States)

    Weidmer, Beverly A; Brach, Cindy; Hays, Ron D

    2012-09-01

    The complexity of health information often exceeds patients' skills to understand and use it. To develop survey items assessing how well healthcare providers communicate health information. Domains and items for the Consumer Assessment of Healthcare Providers and Systems (CAHPS) Item Set for Addressing Health Literacy were identified through an environmental scan and input from stakeholders. The draft item set was translated into Spanish and pretested in both English and Spanish. The revised item set was field tested with a randomly selected sample of adult patients from 2 sites using mail and telephonic data collection. Item-scale correlations, confirmatory factor analysis, and internal consistency reliability estimates were estimated to assess how well the survey items performed and identify composite measures. Finally, we regressed the CAHPS global rating of the provider item on the CAHPS core communication composite and the new health literacy composites. A total of 601 completed surveys were obtained (52% response rate). Two composite measures were identified: (1) Communication to Improve Health Literacy (16 items); and (2) How Well Providers Communicate About Medicines (6 items). These 2 composites were significantly uniquely associated with the global rating of the provider (communication to improve health literacy: PLiteracy composite accounted for 90% of the variance of the original 16-item composite. This study provides support for reliability and validity of the CAHPS Item Set for Addressing Health Literacy. These items can serve to assess whether healthcare providers have communicated effectively with their patients and as a tool for quality improvement.

  20. International Comparisons of Behavioral and Emotional Problems in Preschool Children: Parents’ Reports From 24 Societies

    Science.gov (United States)

    Rescorla, Leslie A.; Achenbach, Thomas M.; Ivanova, Masha Y.; Harder, Valerie S.; Otten, Laura; Bilenberg, Niels; Bjarnadottir, Gudrun; Capron, Christiane; De Pauw, Sarah S. W.; Dias, Pedro; Dobrean, Anca; Döpfner, Manfred; Duyme, Michel; Eapen, Valsamma; Erol, Nese; Esmaeili, Elaheh Mohammad; Ezpeleta, Lourdes; Frigerio, Alessandra; Fung, Daniel S. S.; Gonçalves, Miguel; Guđmundsson, Halldór; Jeng, Suh-Fang; Jusiené, Roma; Kim, Young Ah; Kristensen, Solvejg; Liu, Jianghong; Lecannelier, Felipe; Leung, Patrick W. L.; Machado, Bárbara César; Montirosso, Rosario; Oh, Kyung Ja; Ooi, Yoon Phaik; Plück, Julia; Pomalima, Rolando; Pranvera, Jetishi; Schmeck, Klaus; Shahini, Mimoza; Silva, Jaime R.; Simsek, Zeynep; Sourander, Andre; Valverde, José; van der Ende, Jan; Van Leeuwen, Karla G.; Wu, Yen-Tzu; Yurdusen, Sema; Zubrick, Stephen R.; Verhulst, Frank C.

    2014-01-01

    International comparisons were conducted of preschool children’s behavioral and emotional problems as reported on the Child Behavior Checklist for Ages 1½–5 by parents in 24 societies (N =19,850). Item ratings were aggregated into scores on syndromes; Diagnostic and Statistical Manual of Mental Disorders–oriented scales; a Stress Problems scale; and Internalizing, Externalizing, and Total Problems scales. Effect sizes for scale score differences among the 24 societies ranged from small to medium (3–12%). Although societies differed greatly in language, culture, and other characteristics, Total Problems scores for 18 of the 24 societies were within 7.1 points of the omnicultural mean of 33.3 (on a scale of 0–198). Gender and age differences, as well as gender and age interactions with society, were all very small (effect sizes societies, correlations between mean item ratings averaged .78, and correlations between internal consistency alphas for the scales averaged .92, indicating that the rank orders of mean item ratings and internal consistencies of scales were very similar across diverse societies. PMID:21534056

  1. Construction of a memory battery for computerized administration, using item response theory.

    Science.gov (United States)

    Ferreira, Aristides I; Almeida, Leandro S; Prieto, Gerardo

    2012-10-01

    In accordance with Item Response Theory, a computer memory battery with six tests was constructed for use in the Portuguese adult population. A factor analysis was conducted to assess the internal structure of the tests (N = 547 undergraduate students). According to the literature, several confirmatory factor models were evaluated. Results showed better fit of a model with two independent latent variables corresponding to verbal and non-verbal factors, reproducing the initial battery organization. Internal consistency reliability for the six tests were alpha = .72 to .89. IRT analyses (Rasch and partial credit models) yielded good Infit and Outfit measures and high precision for parameter estimation. The potential utility of these memory tasks for psychological research and practice willbe discussed.

  2. Using item response theory to address vulnerabilities in FFQ.

    Science.gov (United States)

    Kazman, Josh B; Scott, Jonathan M; Deuster, Patricia A

    2017-09-01

    The limitations for self-reporting of dietary patterns are widely recognised as a major vulnerability of FFQ and the dietary screeners/scales derived from FFQ. Such instruments can yield inconsistent results to produce questionable interpretations. The present article discusses the value of psychometric approaches and standards in addressing these drawbacks for instruments used to estimate dietary habits and nutrient intake. We argue that a FFQ or screener that treats diet as a 'latent construct' can be optimised for both internal consistency and the value of the research results. Latent constructs, a foundation for item response theory (IRT)-based scales (e.g. Patient Reported Outcomes Measurement Information System) are typically introduced in the design stage of an instrument to elicit critical factors that cannot be observed or measured directly. We propose an iterative approach that uses such modelling to refine FFQ and similar instruments. To that end, we illustrate the benefits of psychometric modelling by using items and data from a sample of 12 370 Soldiers who completed the 2012 US Army Global Assessment Tool (GAT). We used factor analysis to build the scale incorporating five out of eleven survey items. An IRT-driven assessment of response category properties indicates likely problems in the ordering or wording of several response categories. Group comparisons, examined with differential item functioning (DIF), provided evidence of scale validity across each Army sub-population (sex, service component and officer status). Such an approach holds promise for future FFQ.

  3. Development of an assessment tool to measure students′ perceptions of respiratory care education programs: Item generation, item reduction, and preliminary validation

    Directory of Open Access Journals (Sweden)

    Ghazi Alotaibi

    2013-01-01

    Full Text Available Objectives: Students who perceived their learning environment positively are more likely to develop effective learning strategies, and adopt a deep learning approach. Currently, there is no validated instrument for measuring the educational environment of educational programs on respiratory care (RC. The aim of this study was to develop an instrument to measure students′ perception of the RC educational environment. Materials and Methods: Based on the literature review and an assessment of content validity by multiple focus groups of RC educationalists, potential items of the instrument relevant to RC educational environment construct were generated by the research group. The initial 71 item questionnaire was then field-tested on all students from the 3 RC programs in Saudi Arabia and was subjected to multi-trait scaling analysis. Cronbach′s alpha was used to assess internal consistency reliabilities. Results: Two hundred and twelve students (100% completed the survey. The initial instrument of 71 items was reduced to 65 across 5 scales. Convergent and discriminant validity assessment demonstrated that the majority of items correlated more highly with their intended scale than a competing one. Cronbach′s alpha exceeded the standard criterion of >0.70 in all scales except one. There was no floor or ceiling effect for scale or overall score. Conclusions: This instrument is the first assessment tool developed to measure the RC educational environment. There was evidence of its good feasibility, validity, and reliability. This first validation of the instrument supports its use by RC students to evaluate educational environment.

  4. Expanding the Reach of Participatory Risk Management: Testing an Online Decision-Aiding Framework for Informing Internally Consistent Choices.

    Science.gov (United States)

    Bessette, Douglas L; Campbell-Arvai, Victoria; Arvai, Joseph

    2016-05-01

    This article presents research aimed at developing and testing an online, multistakeholder decision-aiding framework for informing multiattribute risk management choices associated with energy development and climate change. The framework was designed to provide necessary background information and facilitate internally consistent choices, or choices that are in line with users' prioritized objectives. In order to test different components of the decision-aiding framework, a six-part, 2 × 2 × 2 factorial experiment was conducted, yielding eight treatment scenarios. The three factors included: (1) whether or not users could construct their own alternatives; (2) the level of detail regarding the composition of alternatives users would evaluate; and (3) the way in which a final choice between users' own constructed (or highest-ranked) portfolio and an internally consistent portfolio was presented. Participants' self-reports revealed the framework was easy to use and providing an opportunity to develop one's own risk-management alternatives (Factor 1) led to the highest knowledge gains. Empirical measures showed the internal consistency of users' decisions across all treatments to be lower than expected and confirmed that providing information about alternatives' composition (Factor 2) resulted in the least internally consistent choices. At the same time, those users who did not develop their own alternatives and were not shown detailed information about the composition of alternatives believed their choices to be the most internally consistent. These results raise concerns about how the amount of information provided and the ability to construct alternatives may inversely affect users' real and perceived internal consistency. © 2015 Society for Risk Analysis.

  5. P2-19: The Effect of item Repetition on Item-Context Association Depends on the Prior Exposure of Items

    Directory of Open Access Journals (Sweden)

    Hongmi Lee

    2012-10-01

    Full Text Available Previous studies have reported conflicting findings on whether item repetition has beneficial or detrimental effects on source memory. To reconcile such contradictions, we investigated whether the degree of pre-exposure of items can be a potential modulating factor. The experimental procedures spanned two consecutive days. On Day 1, participants were exposed to a set of unfamiliar faces. On Day 2, the same faces presented on the previous day were used again in half of the participants, whereas novel faces were used for the other half. Day 2 procedures consisted of three successive phases: item repetition, source association, and source memory test. In the item repetition phase, half of the face stimuli were repeatedly presented while participants were making male/female judgments. During the source association phase, both the repeated and the unrepeated faces appeared in one of the four locations on the screen. Finally, participants were tested on the location in which a given face was presented during the previous phase and reported the confidence of their memory. Source memory accuracy was measured as the percentage of correct non-guess trials. As results, we found a significant interaction between prior exposure and repetition. Repetition impaired source memory when the items had been pre-exposed on Day 1, while it led to greater accuracy in novel ones. These results show that pre-experimental exposure can modulate the effects of repetition on associative binding between an item and its contextual information, suggesting that pre-existing representation and novelty signal interact to form new episodic memory.

  6. Validation of Portuguese version of Quality of Erection Questionnaire (QEQ) and comparison to International Index of Erectile Function (IIEF) and RAND 36-Item Health Survey.

    Science.gov (United States)

    Reis, Ana Luiza; Reis, Leonardo Oliveira; Saade, Ricardo Destro; Santos, Carlos Alberto; Lima, Marcelo Lopes de; Fregonesi, Adriano

    2015-01-01

    To validate the Quality of Erection Questionnaire (QEQ) considering Brazilian social-cultural aspects. To determine equivalence between the Portuguese and the English QEQ versions, the Portuguese version was back-translated by two professors who are native English speakers. After language equivalence had been determined, urologists considered the QEQ Portuguese version suitable. Men with self-reported erectile dysfunction (ED) and infertile men who had a stable sexual relationship for at least 6 months were invited to answer the QEQ, the International Index of Erectile Function (IIEF) and the RAND 36-Item Health Survey (RAND-36). The questionnaires were presented together and answered without help in a private room. Internal consistency (Cronbach's α), test-retest reliability (Spearman), convergent validity (Spearman correlation) coefficients and known-groups validity (the ability of the QEQ Portuguese version to differentiate erectile dysfunction severity groups) were assessed. We recruited 197 men (167 ED patients and 30 non-ED patients), mean age of 53.3 and median of 55.5 years (23-82 years). The Portuguese version of the QEQ had high internal consistency (Cronbach α=0.93), high stability between test and retest (ICC 0.83, with IC 95%: 0.76-0.88, pPortuguese version presented good psychometric properties and high convergent validity in relation to IIEF. The low correlations between the QEQ and the RAND-36, as well as between the IIEF and the RAND-36 indicated IIEF and QEQ specificity, which may have resulted from the patients' psychological adaptations that minimized the impact of ED on Quality of Life (QoL) and reestablished the well-being feeling.

  7. Item-level and subscale-level factoring of Biggs' Learning Process Questionnaire (LPQ) in a mainland Chinese sample.

    Science.gov (United States)

    Sachs, J; Gao, L

    2000-09-01

    The learning process questionnaire (LPQ) has been the source of intensive cross-cultural study. However, an item-level factor analysis of all the LPQ items simultaneously has never been reported. Rather, items within each subscale have been factor analysed to establish subscale unidimensionality and justify the use of composite subscale scores. It was of major interest to see if the six logically constructed items groups of the LPQ would be supported by empirical evidence. Additionally, it was of interest to compare the consistency of the reliability and correlational structure of the LPQ subscales in our study with those of previous cross-cultural studies. Confirmatory factor analysis was used to fit the six-factor item level model and to fit five representative subscale level factor models. A total of 1070 students between the ages of 15 to 18 years was drawn from a representative selection of 29 classes from within 15 secondary schools in Guangzhou, China. Males and females were almost equally represented. The six-factor item level model of the LPQ seemed to fit reasonably well, thus supporting the six dimensional structure of the LPQ and justifying the use of composite subscale scores for each LPQ dimension. However, the reliability of many of these subscales was low. Furthermore, only two subscale-level factor models showed marginally acceptable fit. Substantive considerations supported an oblique three-factor model. Because the LPQ subscales often show low internal consistency reliability, experimental and correlational studies that have used these subscales as dependent measures have been disappointing. It is suggested that some LPQ items should be revised and other items added to improve the inventory's overall psychometric properties.

  8. Evaluation properties of the French version of the OUT-PATSAT35 satisfaction with care questionnaire according to classical and item response theory analyses.

    Science.gov (United States)

    Panouillères, M; Anota, A; Nguyen, T V; Brédart, A; Bosset, J F; Monnier, A; Mercier, M; Hardouin, J B

    2014-09-01

    The present study investigates the properties of the French version of the OUT-PATSAT35 questionnaire, which evaluates the outpatients' satisfaction with care in oncology using classical analysis (CTT) and item response theory (IRT). This cross-sectional multicenter study includes 692 patients who completed the questionnaire at the end of their ambulatory treatment. CTT analyses tested the main psychometric properties (convergent and divergent validity, and internal consistency). IRT analyses were conducted separately for each OUT-PATSAT35 domain (the doctors, the nurses or the radiation therapists and the services/organization) by models from the Rasch family. We examined the fit of the data to the model expectations and tested whether the model assumptions of unidimensionality, monotonicity and local independence were respected. A total of 605 (87.4%) respondents were analyzed with a mean age of 64 years (range 29-88). Internal consistency for all scales separately and for the three main domains was good (Cronbach's α 0.74-0.98). IRT analyses were performed with the partial credit model. No disordered thresholds of polytomous items were found. Each domain showed high reliability but fitted poorly to the Rasch models. Three items in particular, the item about "promptness" in the doctors' domain and the items about "accessibility" and "environment" in the services/organization domain, presented the highest default of fit. A correct fit of the Rasch model can be obtained by dropping these items. Most of the local dependence concerned items about "information provided" in each domain. A major deviation of unidimensionality was found in the nurses' domain. CTT showed good psychometric properties of the OUT-PATSAT35. However, the Rasch analysis revealed some misfitting and redundant items. Taking the above problems into consideration, it could be interesting to refine the questionnaire in a future study.

  9. Australian Chemistry Test Item Bank: Years 11 & 12. Volume 1.

    Science.gov (United States)

    Commons, C., Ed.; Martin, P., Ed.

    Volume 1 of the Australian Chemistry Test Item Bank, consisting of two volumes, contains nearly 2000 multiple-choice items related to the chemistry taught in Year 11 and Year 12 courses in Australia. Items which were written during 1979 and 1980 were initially published in the "ACER Chemistry Test Item Collection" and in the "ACER…

  10. Visual distraction during word-list retrieval does not consistently disrupt memory: no evidence for a finite cognitive resource theory

    Directory of Open Access Journals (Sweden)

    Pamela Jayne Louise Rae

    2014-04-01

    Full Text Available Glenberg, Schroeder and Robertson (1998 reported that episodic memory is impaired by visual distraction and argued that this effect is consistent with a trade-off between internal and external attentional focus. However, their demonstration that visual distraction impairs memory for lists used 15 consecutive word lists, with analysis only of mid-list items, and has never been replicated. Experiment 1 (N=37 replicated their study, and found no overall effect of distraction on recall for the entire lists. However it did replicate the impairment for mid-list recall. Experiment 2 (N=64 explored whether this pattern arises because the mid-list items are poorly encoded (by manipulating presentation rate or because of interference. Experiment 3 (N=36 also looked at the role of interference whilst controlling for potential item effects. Neither study replicated the pattern seen in Experiment 1, despite reliable effects of presentation rate (Experiment 2 and interference (Experiments 2 and 3. Experiment 2 found no effect of distraction for mid-list items, but distraction did increase both correct and incorrect recall of all items suggestive of a shift in willingness to report. Experiment 3 found no effects of distraction whatsoever. Thus, there is no clear evidence that distraction consistently impairs retrieval of items from lists, contrary to the embodied cognition account used to explain the original finding.

  11. Results of assembly test of HTTR reactor internals

    International Nuclear Information System (INIS)

    Maruyama, S.; Saikusa, A.; Shiozawa, S.; Tsuji, N.; Miki, T.

    1996-01-01

    The assembly test of the HTTR actual reactor internals had been carried out at the works, prior to their installation in the actual reactor pressure vessel(RPV) at the construction site. The assembly test consists of several items such as examining fabricating precision of each component and alignment of piled-up structures, measuring circumferential coolant velocity profile in the passage between the simulated RPV and the reactor internals as well as under the support plates, measuring by-pass flow rate through gaps between the reactor internals, and measuring the binding force of the core restraint mechanism. Results of the test showed good performance of the HTTR reactor internals. Installation of the reactor internals in the actual RPV was started at the construction site of HTTR in April, 1995. In the installation process, main items of the assembly test at the works were repeated to investigate the reproducibility of installation. (author). 5 refs, 11 figs

  12. Identifying Country-Specific Cultures of Physics Education: A differential item functioning approach

    Science.gov (United States)

    Mesic, Vanes

    2012-11-01

    In international large-scale assessments of educational outcomes, student achievement is often represented by unidimensional constructs. This approach allows for drawing general conclusions about country rankings with respect to the given achievement measure, but it typically does not provide specific diagnostic information which is necessary for systematic comparisons and improvements of educational systems. Useful information could be obtained by exploring the differences in national profiles of student achievement between low-achieving and high-achieving countries. In this study, we aimed to identify the relative weaknesses and strengths of eighth graders' physics achievement in Bosnia and Herzegovina in comparison to the achievement of their peers from Slovenia. For this purpose, we ran a secondary analysis of Trends in International Mathematics and Science Study (TIMSS) 2007 data. The student sample consisted of 4,220 students from Bosnia and Herzegovina and 4,043 students from Slovenia. After analysing the cognitive demands of TIMSS 2007 physics items, the correspondent differential item functioning (DIF)/differential group functioning contrasts were estimated. Approximately 40% of items exhibited large DIF contrasts, indicating significant differences between cultures of physics education in Bosnia and Herzegovina and Slovenia. The relative strength of students from Bosnia and Herzegovina showed to be mainly associated with the topic area 'Electricity and magnetism'. Classes of items which required the knowledge of experimental method, counterintuitive thinking, proportional reasoning and/or the use of complex knowledge structures proved to be differentially easier for students from Slovenia. In the light of the presented results, the common practice of ranking countries with respect to universally established cognitive categories seems to be potentially misleading.

  13. SAS and SPSS macros to calculate standardized Cronbach's alpha using the upper bound of the phi coefficient for dichotomous items.

    Science.gov (United States)

    Sun, Wei; Chou, Chih-Ping; Stacy, Alan W; Ma, Huiyan; Unger, Jennifer; Gallaher, Peggy

    2007-02-01

    Cronbach's a is widely used in social science research to estimate the internal consistency of reliability of a measurement scale. However, when items are not strictly parallel, the Cronbach's a coefficient provides a lower-bound estimate of true reliability, and this estimate may be further biased downward when items are dichotomous. The estimation of standardized Cronbach's a for a scale with dichotomous items can be improved by using the upper bound of coefficient phi. SAS and SPSS macros have been developed in this article to obtain standardized Cronbach's a via this method. The simulation analysis showed that Cronbach's a from upper-bound phi might be appropriate for estimating the real reliability when standardized Cronbach's a is problematic.

  14. Consistent adoption of the International System of Units (SI) in nuclear science and technology

    Energy Technology Data Exchange (ETDEWEB)

    Klumpar, J; Kovar, Z [Ceskoslovenska Akademie Ved, Prague. Laborator Radiologicke Dozimetrie; Sacha, J [Slovenska Akademia Vied, Bratislava (Czechoslovakia). Fyzikalny Ustav

    1975-11-01

    The principles are stressed behind a consistent introduction of the International System of Units (SI) in Czechoslovakia complying with the latest edition of the Czechoslovak Standard CSN 01 1300 on the prescribed system of national and international units. The use of special and auxiliary units in nuclear physics and technology is discussed, particular attention being devoted to the units of activity and to the time units applied in radiology. Conversion graph and tables are annexed.

  15. Structural validity of a 16-item abridged version of the Cervantes Health-Related Quality of Life scale for menopause: the Cervantes Short-Form Scale.

    Science.gov (United States)

    Coronado, Pluvio J; Borrego, Rafael Sánchez; Palacios, Santiago; Ruiz, Miguel A; Rejas, Javier

    2015-03-01

    The Cervantes Scale is a specific health-related quality of life questionnaire that was originally developed in Spanish to be used in Spain for women through and beyond menopause. It contains 31 items and is time-consuming. The aim of this study was to produce an abridged version with the same dimensional structure and with similar psychometric properties. A representative sample of 516 postmenopausal women (mean [SD] age, 57 [4.31] y) seen in outpatient gynecology clinics and extracted from an observational cross-sectional study was used. Item analysis, internal consistency reliability, item-total and item-dimension correlations, and item correlation with the 12-item Medical Outcomes Study Short Form Health Survey Version 2.0 were studied. Dimensional and full-model confirmatory factor analyses were used to check structure stability. A threefold cross-validation method was used to obtain stable estimates by means of multigroup analysis. The scale was reduced to a 16-item version, the Cervantes Short-Form Scale, containing four main dimensions (Menopause and Health, Psychological, Sexuality, and Couple Relations), with the first dimension composed of three subdimensions (Vasomotor Symptoms, Health, and Aging). Goodness-of-fit statistics were better than those of the extended version (χ(2)/df = 2.493; adjusted goodness-of-fit index, 0.802; parsimony comparative fit index, 0.749; root mean standard error of approximation, 0.054). Internal consistency was good (Cronbach's α = 0.880). Correlations between the extended and the reduced dimensions were high and significant in all cases (P < 0.001; r values ranged from 0.90 for Sexuality to 0.969 for Vasomotor Symptoms). The Cervantes Scale can be reduced to a 16-item abridged version (Cervantes Short-Form Scale) that maintains the original dimensional structure and psychometric properties. At 51% of the original length, this version can be administered faster, making it especially suitable for routine medical practice.

  16. Using personality item characteristics to predict single-item reliability, retest reliability, and self-other agreement

    NARCIS (Netherlands)

    de Vries, Reinout Everhard; Realo, Anu; Allik, Jüri

    2016-01-01

    The use of reliability estimates is increasingly scrutinized as scholars become more aware that test–retest stability and self–other agreement provide a better approximation of the theoretical and practical usefulness of an instrument than its internal reliability. In this study, we investigate item

  17. Mathematical-programming approaches to test item pool design

    NARCIS (Netherlands)

    Veldkamp, Bernard P.; van der Linden, Willem J.; Ariel, A.

    2002-01-01

    This paper presents an approach to item pool design that has the potential to improve on the quality of current item pools in educational and psychological testing andhence to increase both measurement precision and validity. The approach consists of the application of mathematical programming

  18. Internal Consistency and Concurrent Validity of the Questionnaire for Limitations and Restrictions Assessment in Children with ADHD

    Directory of Open Access Journals (Sweden)

    Luisa Matilde Salamanca-Duque

    2014-09-01

    Full Text Available Introduction: ADHD is one of the most common diagnoses in child psychiatry, its early diagnosis is of great importance for intervention at family, school and social environment. Based on the International Classification of Functioning, Disability and Health (ICF, a questionnaire was designed to assess activity limitations and participation restrictions in children with ADHD. The questionnaire was called “CLARP-ADHD Parent and Teacher Version”. Objective: To determine the degree of internal consistency of the CLARP-ADHD questionnaire, and its concurrent validity with the “Strengths and Difficulties Questionnaire SDQ parent and teacher version”. Material and Methods: A sample of 203 children aged 6 to 12 with ADHD, currently attending school in five Colombian cities. The questionnaires were applied to parents and teachers. The internal consistency analysis was performed through Cronbach coefficient and concurrent validity using the Spearman correlation coefficient utilizing multiple and unique predictors through multiple linear regression as well as simple regression models. Results: A high internal consistency was found for global questionnaires for each of its domains. The CLARP-ADHD for parents gave as result an internal consistency of 0.83, and the CLARP-ADHD for teachers one of 0.93. Concurrent validity was found between the CLARP-ADHD and the SDQ Parent and Teacher version; also, concurrence between the CLARPADHD for Teachers and the SDQ Teachers was found, as well as between CLARP ADHD for Parents and CLARP ADHD Teachers, given by p values of p < 0.001.

  19. Item-focussed Trees for the Identification of Items in Differential Item Functioning.

    Science.gov (United States)

    Tutz, Gerhard; Berger, Moritz

    2016-09-01

    A novel method for the identification of differential item functioning (DIF) by means of recursive partitioning techniques is proposed. We assume an extension of the Rasch model that allows for DIF being induced by an arbitrary number of covariates for each item. Recursive partitioning on the item level results in one tree for each item and leads to simultaneous selection of items and variables that induce DIF. For each item, it is possible to detect groups of subjects with different item difficulties, defined by combinations of characteristics that are not pre-specified. The way a DIF item is determined by covariates is visualized in a small tree and therefore easily accessible. An algorithm is proposed that is based on permutation tests. Various simulation studies, including the comparison with traditional approaches to identify items with DIF, show the applicability and the competitive performance of the method. Two applications illustrate the usefulness and the advantages of the new method.

  20. Assessing nicotine dependence in adolescent E-cigarette users: The 4-item Patient-Reported Outcomes Measurement Information System (PROMIS) Nicotine Dependence Item Bank for electronic cigarettes.

    Science.gov (United States)

    Morean, Meghan E; Krishnan-Sarin, Suchitra; S O'Malley, Stephanie

    2018-04-26

    Adolescent e-cigarette use (i.e., "vaping") likely confers risk for developing nicotine dependence. However, there have been no studies assessing e-cigarette nicotine dependence in youth. We evaluated the psychometric properties of the 4-item Patient-Reported Outcomes Measurement Information System Nicotine Dependence Item Bank for E-cigarettes (PROMIS-E) for assessing youth e-cigarette nicotine dependence and examined risk factors for experiencing stronger dependence symptoms. In 2017, 520 adolescent past-month e-cigarette users completed the PROMIS-E during a school-based survey (50.5% female, 84.8% White, 16.22[1.19] years old). Adolescents also reported on sex, grade, race, age at e-cigarette use onset, vaping frequency, nicotine e-liquid use, and past-month cigarette smoking. Analyses included conducting confirmatory factor analysis and examining the internal consistency of the PROMIS-E. Bivariate correlations and independent-samples t-tests were used to examine unadjusted relationships between e-cigarette nicotine dependence and the proposed risk factors. Regression models were run in which all potential risk factors were entered as simultaneous predictors of PROMIS-E scores. The single-factor structure of the PROMIS-E was confirmed and evidenced good internal consistency. Across models, larger PROMIS-E scores were associated with being in a higher grade, initiating e-cigarette use at an earlier age, vaping more frequently, using nicotine e-liquid (and higher nicotine concentrations), and smoking cigarettes. Adolescent e-cigarette users reported experiencing nicotine dependence, which was assessed using the psychometrically sound PROMIS-E. Experiencing stronger nicotine dependence symptoms was associated with characteristics that previously have been shown to confer risk for frequent vaping and tobacco cigarette dependence. Copyright © 2018 Elsevier B.V. All rights reserved.

  1. Comparing response options for the International Outcome Inventory for Hearing Aids (IOI-HA) and for Alternative Interventions (IOI-AI) daily-use items.

    Science.gov (United States)

    Laplante-Lévesque, Ariane; Hickson, Louise; Worrall, Linda

    2012-10-01

    This study investigated how clients quantify use of hearing rehabilitation. Comparisons focused on the daily-use item of the International Outcome Inventory for Hearing Aids (IOI-HA), and for Alternative Interventions (IOI-AI). Adults with hearing impairment completed the original versions of the IOI-HA and the IOI-AI daily-use item which has five numerical response options (e.g. 1-4 hours/day) and a modified version with five word response options (e.g. 'Sometimes'). Respondents completed both IOI versions immediately after intervention completion and three months later. In total, 64 people who had obtained hearing aids completed both IOI-HA versions and 27 people who had participated in communication programs completed both IOI-AI versions. Participants reported higher scores on the modified (word) daily-use item than on the original (number) daily-use item. Participants who completed the IOI-AI did so significantly more than participants who completed the IOI-HA. This was true both after intervention completion and three months later. This study showed that comparisons between IOI-HA and IOI-AI daily-use item scores should be made with caution. Word daily-use response options are recommended for the IOI-AI (i.e. Never; Rarely; Sometimes; Often; and Almost always).

  2. The medial temporal lobes distinguish between within-item and item-context relations during autobiographical memory retrieval.

    Science.gov (United States)

    Sheldon, Signy; Levine, Brian

    2015-12-01

    During autobiographical memory retrieval, the medial temporal lobes (MTL) relate together multiple event elements, including object (within-item relations) and context (item-context relations) information, to create a cohesive memory. There is consistent support for a functional specialization within the MTL according to these relational processes, much of which comes from recognition memory experiments. In this study, we compared brain activation patterns associated with retrieving within-item relations (i.e., associating conceptual and sensory-perceptual object features) and item-context relations (i.e., spatial relations among objects) with respect to naturalistic autobiographical retrieval. We developed a novel paradigm that cued participants to retrieve information about past autobiographical events, non-episodic within-item relations, and non-episodic item-context relations with the perceptuomotor aspects of retrieval equated across these conditions. We used multivariate analysis techniques to extract common and distinct patterns of activity among these conditions within the MTL and across the whole brain, both in terms of spatial and temporal patterns of activity. The anterior MTL (perirhinal cortex and anterior hippocampus) was preferentially recruited for generating within-item relations later in retrieval whereas the posterior MTL (posterior parahippocampal cortex and posterior hippocampus) was preferentially recruited for generating item-context relations across the retrieval phase. These findings provide novel evidence for functional specialization within the MTL with respect to naturalistic memory retrieval. © 2015 Wiley Periodicals, Inc.

  3. Consistency Check for the Bin Packing Constraint Revisited

    Science.gov (United States)

    Dupuis, Julien; Schaus, Pierre; Deville, Yves

    The bin packing problem (BP) consists in finding the minimum number of bins necessary to pack a set of items so that the total size of the items in each bin does not exceed the bin capacity C. The bin capacity is common for all the bins.

  4. The Internal Consistency and Validity of the Vaccination Attitudes Examination Scale: A Replication Study.

    Science.gov (United States)

    Wood, Louise; Smith, Michael; Miller, Christopher B; O'Carroll, Ronan E

    2018-06-19

    Vaccinations are important preventative health behaviors. The recently developed Vaccination Attitudes Examination (VAX) Scale aims to measure the reasons behind refusal/hesitancy regarding vaccinations. The aim of this replication study is to conduct an independent test of the newly developed VAX Scale in the UK. We tested (a) internal consistency (Cronbach's α); (b) convergent validity by assessing its relationships with beliefs about medication, medical mistrust, and perceived sensitivity to medicines; and (c) construct validity by testing how well the VAX Scale discriminated between vaccinators and nonvaccinators. A sample of 243 UK adults completed the VAX Scale, the Beliefs About Medicines Questionnaire, the Perceived Sensitivity to Medicines Scale, and the Medical Mistrust Index, in addition to demographics of age, gender, education levels, and social deprivation. Participants were asked (a) whether they received an influenza vaccination in the past year and (b) if they had a young child, whether they had vaccinated the young child against influenza in the past year. The VAX (a) demonstrated high internal consistency (α = .92); (b) was positively correlated with medical mistrust and beliefs about medicines, and less strongly correlated with perceived sensitivity to medicines; and (c) successfully differentiated parental influenza vaccinators from nonvaccinators. The VAX demonstrated good internal consistency, convergent validity, and construct validity in an independent UK sample. It appears to be a useful measure to help us understand the health beliefs that promote or deter vaccination behavior.

  5. Dissociation between source and item memory in Parkinson's disease

    Institute of Scientific and Technical Information of China (English)

    Hu Panpan; Li Youhai; Ma Huijuan; Xi Chunhua; Chen Xianwen; Wang Kai

    2014-01-01

    Background Episodic memory includes information about item memory and source memory.Many researches support the hypothesis that these two memory systems are implemented by different brain structures.The aim of this study was to investigate the characteristics of item memory and source memory processing in patients with Parkinson's disease (PD),and to further verify the hypothesis of dual-process model of source and item memory.Methods We established a neuropsychological battery to measure the performance of item memory and source memory.Totally 35 PD individuals and 35 matched healthy controls (HC) were administrated with the battery.Item memory task consists of the learning and recognition of high-frequency national Chinese characters; source memory task consists of the learning and recognition of three modes (character,picture,and image) of objects.Results Compared with the controls,the idiopathic PD patients have been impaired source memory (PD vs.HC:0.65±0.06 vs.0.72±0.09,P=0.001),but not impaired in item memory (PD vs.HC:0.65±0.07 vs.0.67±0.08,P=0.240).Conclusions The present experiment provides evidence for dissociation between item and source memory in PD patients,thereby strengthening the claim that the item or source memory rely on different brain structures.PD patients show poor source memory,in which dopamine plays a critical role.

  6. Characterization of internal dosimetry practices

    International Nuclear Information System (INIS)

    Traub, R.J.; Heid, K.R.; Mann, J.C.

    1983-01-01

    Current practices in internal dosimetry at DOE facilities were evaluated with respect to consistency among DOE Contractors. All aspects of an internal dosimetry program were addressed. Items considered include, but are not necessarily limited to, record systems and ease of information retrieval; ease of integrating internal dose and external dose; modeling systems employed, including ability to modify models depending on excretion data, and verification of computer codes utilized; bioassay procedures, including quality control; and ability to relate air concentration data to individual workers and bioassay data. Feasibility of uranium analysis in solution by laser fluorescence excitation at uranium concentrations of one part per billion was demonstrated

  7. Validation of the Spanish versions of the long (26 items) and short (12 items) forms of the Self-Compassion Scale (SCS).

    Science.gov (United States)

    Garcia-Campayo, Javier; Navarro-Gil, Mayte; Andrés, Eva; Montero-Marin, Jesús; López-Artal, Lorena; Demarzo, Marcelo Marcos Piva

    2014-01-10

    Self-compassion is a key psychological construct for assessing clinical outcomes in mindfulness-based interventions. The aim of this study was to validate the Spanish versions of the long (26 item) and short (12 item) forms of the Self-Compassion Scale (SCS). The translated Spanish versions of both subscales were administered to two independent samples: Sample 1 was comprised of university students (n = 268) who were recruited to validate the long form, and Sample 2 was comprised of Aragon Health Service workers (n = 271) who were recruited to validate the short form. In addition to SCS, the Mindful Attention Awareness Scale (MAAS), the State-Trait Anxiety Inventory-Trait (STAI-T), the Beck Depression Inventory (BDI) and the Perceived Stress Questionnaire (PSQ) were administered. Construct validity, internal consistency, test-retest reliability and convergent validity were tested. The Confirmatory Factor Analysis (CFA) of the long and short forms of the SCS confirmed the original six-factor model in both scales, showing goodness of fit. Cronbach's α for the 26 item SCS was 0.87 (95% CI = 0.85-0.90) and ranged between 0.72 and 0.79 for the 6 subscales. Cronbach's α for the 12-item SCS was 0.85 (95% CI = 0.81-0.88) and ranged between 0.71 and 0.77 for the 6 subscales. The long (26-item) form of the SCS showed a test-retest coefficient of 0.92 (95% CI = 0.89-0.94). The Intraclass Correlation (ICC) for the 6 subscales ranged from 0.84 to 0.93. The short (12-item) form of the SCS showed a test-retest coefficient of 0.89 (95% CI: 0.87-0.93). The ICC for the 6 subscales ranged from 0.79 to 0.91. The long and short forms of the SCS exhibited a significant negative correlation with the BDI, the STAI and the PSQ, and a significant positive correlation with the MAAS. The correlation between the total score of the long and short SCS form was r = 0.92. The Spanish versions of the long (26-item) and short (12-item) forms of the SCS are valid and

  8. Australian Chemistry Test Item Bank: Years 11 and 12. Volume 2.

    Science.gov (United States)

    Commons, C., Ed.; Martin, P., Ed.

    The second volume of the Australian Chemistry Test Item Bank, consisting of two volumes, contains nearly 2000 multiple-choice items related to the chemistry taught in Year 11 and Year 12 courses in Australia. Items which were written during 1979 and 1980 were initially published in the "ACER Chemistry Test Item Collection" and in the…

  9. Negative effects of item repetition on source memory.

    Science.gov (United States)

    Kim, Kyungmi; Yi, Do-Joon; Raye, Carol L; Johnson, Marcia K

    2012-08-01

    In the present study, we explored how item repetition affects source memory for new item-feature associations (picture-location or picture-color). We presented line drawings varying numbers of times in Phase 1. In Phase 2, each drawing was presented once with a critical new feature. In Phase 3, we tested memory for the new source feature of each item from Phase 2. Experiments 1 and 2 demonstrated and replicated the negative effects of item repetition on incidental source memory. Prior item repetition also had a negative effect on source memory when different source dimensions were used in Phases 1 and 2 (Experiment 3) and when participants were explicitly instructed to learn source information in Phase 2 (Experiments 4 and 5). Importantly, when the order between Phases 1 and 2 was reversed, such that item repetition occurred after the encoding of critical item-source combinations, item repetition no longer affected source memory (Experiment 6). Overall, our findings did not support predictions based on item predifferentiation, within-dimension source interference, or general interference from multiple traces of an item. Rather, the findings were consistent with the idea that prior item repetition reduces attention to subsequent presentations of the item, decreasing the likelihood that critical item-source associations will be encoded.

  10. 77 FR 50932 - Electronic Transmission of Customs Data-Outbound International Letter-Post Items

    Science.gov (United States)

    2012-08-23

    ...[supreg] items bearing a permit imprint at a business mail entry unit (BMEU) since the information... Canada [Revise the intro and items a and b of 292.47 to read as follows (note that we have used bold text...

  11. Summarizing activity limitations in children with chronic illnesses living in the community: a measurement study of scales using supplemented interRAI items

    Directory of Open Access Journals (Sweden)

    Phillips Charles D

    2012-01-01

    Full Text Available Abstract Background To test the validity and reliability of scales intended to measure activity limitations faced by children with chronic illnesses living in the community. The scales were based on information provided by caregivers to service program personnel almost exclusively trained as social workers. The items used to measure activity limitations were interRAI items supplemented so that they were more applicable to activity limitations in children with chronic illnesses. In addition, these analyses may shed light on the possibility of gathering functional information that can span the life course as well as spanning different care settings. Methods Analyses included testing the internal consistency, predictive, concurrent, discriminant and construct validity of two activity limitation scales. The scales were developed using assessment data gathered in the United States of America (USA from over 2,700 assessments of children aged 4 to 20 receiving Medicaid Early and Periodic Screening, Diagnostic and Treatment (EPSDT services, specifically Personal Care Services to assist children in overcoming activity limitations. The Medicaid program in the USA pays for health care services provided to children in low-income households. Data were collected in a single, large state in the southwestern USA in late 2008 and early 2009. A similar sample of children was assessed in 2010, and the analyses were replicated using this sample. Results The two scales exhibited excellent internal consistency. Evidence on the concurrent, predictive, discriminant, and construct validity of the proposed scales was strong. Quite importantly, scale scores were not correlated with (confounded with a child's developmental stage or age. The results for these scales and items were consistent across the two independent samples. Conclusions Unpaid caregivers, usually parents, can provide assessors lacking either medical or nursing training with reliable and valid information

  12. Summarizing activity limitations in children with chronic illnesses living in the community: a measurement study of scales using supplemented interRAI items.

    Science.gov (United States)

    Phillips, Charles D; Patnaik, Ashweeta; Moudouni, Darcy K; Naiser, Emily; Dyer, James A; Hawes, Catherine; Fournier, Constance J; Miller, Thomas R; Elliott, Timothy R

    2012-01-23

    To test the validity and reliability of scales intended to measure activity limitations faced by children with chronic illnesses living in the community. The scales were based on information provided by caregivers to service program personnel almost exclusively trained as social workers. The items used to measure activity limitations were interRAI items supplemented so that they were more applicable to activity limitations in children with chronic illnesses. In addition, these analyses may shed light on the possibility of gathering functional information that can span the life course as well as spanning different care settings. Analyses included testing the internal consistency, predictive, concurrent, discriminant and construct validity of two activity limitation scales. The scales were developed using assessment data gathered in the United States of America (USA) from over 2,700 assessments of children aged 4 to 20 receiving Medicaid Early and Periodic Screening, Diagnostic and Treatment (EPSDT) services, specifically Personal Care Services to assist children in overcoming activity limitations. The Medicaid program in the USA pays for health care services provided to children in low-income households. Data were collected in a single, large state in the southwestern USA in late 2008 and early 2009. A similar sample of children was assessed in 2010, and the analyses were replicated using this sample. The two scales exhibited excellent internal consistency. Evidence on the concurrent, predictive, discriminant, and construct validity of the proposed scales was strong. Quite importantly, scale scores were not correlated with (confounded with) a child's developmental stage or age. The results for these scales and items were consistent across the two independent samples. Unpaid caregivers, usually parents, can provide assessors lacking either medical or nursing training with reliable and valid information on the activity limitations of children. One can summarize these

  13. Item-level factor analysis of the Self-Efficacy Scale.

    Science.gov (United States)

    Bunketorp Käll, Lina

    2014-03-01

    This study explores the internal structure of the Self-Efficacy Scale (SES) using item response analysis. The SES was previously translated into Swedish and modified to encompass all types of pain, not exclusively back pain. Data on perceived self-efficacy in 47 patients with subacute whiplash-associated disorders were derived from a previously conducted randomized-controlled trial. The item-level factor analysis was carried out using a six-step procedure. To further study the item inter-relationships and to determine the underlying structure empirically, the 20 items of the SES were also subjected to principal component analysis with varimax rotation. The analyses showed two underlying factors, named 'social activities' and 'physical activities', with seven items loading on each factor. The remaining six items of the SES appeared to measure somewhat different constructs and need to be analysed further.

  14. Assessment of disabilities in stroke patients with apraxia : Internal consistency and inter-observer reliability

    NARCIS (Netherlands)

    van Heugten, CM; Dekker, J; Deelman, BG; Stehmann-Saris, JC; Kinebanian, A

    1999-01-01

    In this paper the internal consistency and inter-observer reliability of the assessment of disabilities in stroke patients with apraxia is presented. Disabilities were assessed by means of observation of activities of daily living (ADL). The study was conducted at occupational therapy departments in

  15. Assessment of disabilities in stroke patients with apraxia: internal consistency and inter-observer reliability.

    NARCIS (Netherlands)

    Heugten, C.M. van; Dekker, J.; Deelman, B.G.; Stehmann-Saris, J.C.; Kinebanian, A.

    1999-01-01

    In this paper the internal consistency and inter-observer reliability of the assessment of disabilities in stroke patients with apraxia is presented. Disabilities were assessed by means of observation of activities of daily living (ADL). The study was conducted at occupational therapy departments in

  16. Validating a shortened depression scale (10 item CES-D among HIV-positive people in British Columbia, Canada.

    Directory of Open Access Journals (Sweden)

    Wendy Zhang

    Full Text Available OBJECTIVE: To establish the reliability and validity of a shortened (10-item depression scale used among HIV-positive patients enrolled in the Drug Treatment Program in British Columbia, Canada. METHODS: The 10-item CES-D (Center for Epidemiologic Studies Depression Scale was examined among 563 participants who initiated antiretroviral therapy (ART between August 1, 1996 and June 30, 2002. Internal consistency of the scale was measured by Cronbach's alpha. Using the original CES-D 20 as primary criteria, comparisons were made using the Kappa statistic. Predictive accuracy of CES-D 10 was assessed by calculating sensitivity, specificity, positive predictive values and negative predictive values. Factor analysis was also performed to determine if the CES-D 10 contained the same factors of positive and negative affect found in the original development of the CES-D. RESULTS: The correlation between the original and the shortened scale is very high (Spearman correlation coefficient  =0.97 (P<0.001. Internal consistency reliability coefficients of the CES-D 10 were satisfactory (Cronbach α=0.88. The CES-D 10 showed comparable accuracy to the original CES-D 20 in classifying participants with depressive symptoms (Kappa=0.82, P<0.001. Sensitivity of CES-D 10 was 91%; specificity was 92%; and positive predictive value was 92%. Factor analysis demonstrates that CES-D 10 contains the same underlying factors of positive and negative affect found in the original development of the CES-D 20. CONCLUSION: The 10-item CES-D is a comparable tool to measure depressive symptoms among HIV-positive research participants.

  17. Cultural competence in mental health nursing: validity and internal consistency of the Portuguese version of the multicultural mental health awareness scale-MMHAS.

    Science.gov (United States)

    de Almeida Vieira Monteiro, Ana Paula Teixeira; Fernandes, Alexandre Bastos

    2016-05-17

    Cultural competence is an essential component in rendering effective and culturally responsive services to culturally and ethnically diverse clients. Still, great difficulty exists in assessing the cultural competence of mental health nurses. There are no Portuguese validated measurement instruments to assess cultural competence in mental health nurses. This paper reports a study testing the reliability and validity of the Portuguese version of the Multicultural Mental Health Awareness Scale-MMHAS in a sample of Portuguese nurses. Following a standard forward/backward translation into Portuguese, the adapted version of MMHAS, along with a sociodemographic questionnaire, were applied to a sample of 306 Portuguese nurses (299 males, 77 females; ages 21-68 years, M = 35.43, SD = 9.85 years). A psychometric research design was used with content and construct validity and reliability. Reliability was assessed using internal consistency and item-total correlations. Construct validity was determined using factor analysis. The factor analysis confirmed that the Portuguese version of MMHAS has a three-factor structure of multicultural competencies (Awareness, Knowledge, and Skills) explaining 59.51% of the total variance. Strong content validity and reliability correlations were demonstrated. The Portuguese version of MMHAS has a strong internal consistency, with a Cronbach's alpha of 0.958 for the total scale. The results supported the construct validity and reliability of the Portuguese version of MMHAS, proving that is a reliable and valid measure of multicultural counselling competencies in mental health nursing. The MMHAS Portuguese version can be used to evaluate the effectiveness of multicultural competency training programs in Portuguese-speaking mental health nurses. The scale can also be a useful in future studies of multicultural competencies in Portuguese-speaking nurses.

  18. Sociodemographic and lifestyle factors affecting the self-perception period of lower urinary tract symptoms of international prostate symptom score items.

    Science.gov (United States)

    Kim, J H; Shim, S R; Lee, W J; Kim, H J; Kwon, S-S; Bae, J H

    2012-12-01

    This study investigated the influence of sociodemographic and lifestyle factors on the lower urinary tract symptom (LUTS) self-perception period and International Prostate Symptom Score. This cross-sectional study examined 209 men aged ≥ 40 years with non-treated LUTS who participated in a prostate examination survey. Questions included International Prostate Symptom Score (IPSS) items with self-perception periods for each item. Sociodemographic and lifestyle factors were also assessed. Participants were divided by mild LUTS (IPSS less than 8) and moderate-to-severe LUTS (IPSS 8 or higher). Self-perception period of the moderate-to-severe LUTS (n = 110) was affected by BMI; the self-perception period of the mild LUTS (n = 90) was affected by age, income, occupation and concomitant disease. Moderate-to-severe LUTS were affected by self-perception period (p = 0.03). Self-perception period was affected by concern for health (p = 0.005) by multivariate analysis, and self-perception period of mild LUTS was affected by BMI (p = 0.012). Moderate-to-severe LUTS were affected by age, number of family members, concern for health and drinking (p self-perception period. In moderate-to-severe LUTS, age, concern for health and drinking were affecting factors of self-perception period. © 2012 Blackwell Publishing Ltd.

  19. Item validity vs. item discrimination index: a redundancy?

    Science.gov (United States)

    Panjaitan, R. L.; Irawati, R.; Sujana, A.; Hanifah, N.; Djuanda, D.

    2018-03-01

    In several literatures about evaluation and test analysis, it is common to find that there are calculations of item validity as well as item discrimination index (D) with different formula for each. Meanwhile, other resources said that item discrimination index could be obtained by calculating the correlation between the testee’s score in a particular item and the testee’s score on the overall test, which is actually the same concept as item validity. Some research reports, especially undergraduate theses tend to include both item validity and item discrimination index in the instrument analysis. It seems that these concepts might overlap for both reflect the test quality on measuring the examinees’ ability. In this paper, examples of some results of data processing on item validity and item discrimination index were compared. It would be discussed whether item validity and item discrimination index can be represented by one of them only or it should be better to present both calculations for simple test analysis, especially in undergraduate theses where test analyses were included.

  20. 26 CFR 301.6682-1 - False information with respect to withholding allowances based on itemized deductions.

    Science.gov (United States)

    2010-04-01

    ... 26 Internal Revenue 18 2010-04-01 2010-04-01 false False information with respect to withholding allowances based on itemized deductions. 301.6682-1 Section 301.6682-1 Internal Revenue INTERNAL REVENUE... Amounts § 301.6682-1 False information with respect to withholding allowances based on itemized deductions...

  1. International cooperation for the development of consistent and stable transportation regulations to promote and enhance safety and security

    International Nuclear Information System (INIS)

    Strosnider, J.

    2004-01-01

    International commerce of radioactive materials crosses national boundaries, linking separate regulatory institutions with a common purpose and making it necessary for these institutions to work together in order to achieve common safety goals in a manner that does not place an undue burden on industry and commerce. Widespread and increasing use of radioactive materials across the world has led to increases in the transport of radioactive materials. The demand for consistency in the oversight of international transport has also increased to prevent unnecessary delays and costs associated with incongruent or redundant regulatory requirements by the various countries through which radioactive material is transported. The International Atomic Energy Agency (IAEA) is the authority for international regulation of transportation of radioactive materials responsible for promulgation of regulations and guidance for the establishment of acceptable methods of transportation for the international community. As such, the IAEA is seen as the focal point for consensus building between its Member States to develop consistency in transportation regulations and reviews and to ensure the safe and secure transport of radioactive material. International cooperation is also needed to ensure stability in our regulatory processes. Changes to transportation regulations should be based on an anticipated safety benefit supported by risk information and insights gained from continuing experience, evaluation, and research studies. If we keep safety as the principle basis for regulatory changes, regulatory stability will be enhanced. Finally, as we endeavour to maintain consistency and stability in our international regulations, we must be mindful of the new security challenges that lay before the international community as a result of a changing terrorist environment. Terrorism is a problem of global concern that also requires international cooperation and support, as we look for ways to

  2. Three controversies over item disclosure in medical licensure examinations

    Directory of Open Access Journals (Sweden)

    Yoon Soo Park

    2015-09-01

    Full Text Available In response to views on public's right to know, there is growing attention to item disclosure – release of items, answer keys, and performance data to the public – in medical licensure examinations and their potential impact on the test's ability to measure competence and select qualified candidates. Recent debates on this issue have sparked legislative action internationally, including South Korea, with prior discussions among North American countries dating over three decades. The purpose of this study is to identify and analyze three issues associated with item disclosure in medical licensure examinations – 1 fairness and validity, 2 impact on passing levels, and 3 utility of item disclosure – by synthesizing existing literature in relation to standards in testing. Historically, the controversy over item disclosure has centered on fairness and validity. Proponents of item disclosure stress test takers’ right to know, while opponents argue from a validity perspective. Item disclosure may bias item characteristics, such as difficulty and discrimination, and has consequences on setting passing levels. To date, there has been limited research on the utility of item disclosure for large scale testing. These issues requires ongoing and careful consideration.

  3. Internal Consistency of Performance Evaluations as a Function of Music Expertise and Excerpt Familiarity

    Science.gov (United States)

    Kinney, Daryl W.

    2009-01-01

    The purpose of this study was to examine the effects of music experience and excerpt familiarity on the internal consistency of performance evaluations. Participants included nonmusic majors who had not participated in high school music ensembles, nonmusic majors who had participated in high school music ensembles, music majors, and experts…

  4. Consistência interna da versão em português do Mini-Inventário de Fobia Social (Mini-SPIN Internal consistency of the Portuguese version of the Mini-Social Phobia Inventory (Mini-SPIN

    Directory of Open Access Journals (Sweden)

    Gustavo J. Fonseca D'El Rey

    2007-01-01

    Full Text Available CONTEXTO: A fobia social é um grave transtorno de ansiedade que traz incapacitação e sofrimento. OBJETIVOS: Investigar a consistência interna da versão em português do Mini-Inventário de Fobia Social (Mini-SPIN. MÉTODOS: Foi realizado um estudo da consistência interna do Mini-SPIN em uma amostra de 206 estudantes universitários da cidade de São Paulo, SP. RESULTADOS: A consistência interna do instrumento, analisada pelo coeficiente alfa de Cronbach, foi de 0,81. CONCLUSÕES: Esses achados permitiram concluir que a versão em português do Mini-SPIN exibiu resultados de boa consistência interna, semelhantes aos da versão original em inglês.BACKGROUND: Social phobia is a severe anxiety disorder that brings disability and distress. OBJECTIVES: To investigate the internal consistency of the Portuguese version of the Mini-Social Phobia Inventory (Mini-SPIN. METHODS: We conducted a study of internal consistency of the Mini-SPIN in a sample of 206 college students of the city of São Paulo, SP. RESULTS: The internal consistency of the instrument, analyzed by Cronbach's alpha coefficient, was 0.81. CONCLUSIONS: These findings suggest that the Portuguese version of the Mini-SPIN has a good internal consistency, similar to those obtained with the original English version.

  5. Missouri Assessment Program (MAP), Spring 2000: Secondary Science, Released Items, Grade 10.

    Science.gov (United States)

    Missouri State Dept. of Elementary and Secondary Education, Jefferson City.

    This assessment sample provides information on the Missouri Assessment Program (MAP) for grade 10 science. The sample consists of six items taken from the test booklet and scoring guides for the six items. The items assess ecosystems, mechanics, and data analysis. (MM)

  6. Development of a brief version of the Social Phobia Inventory using item response theory: the Mini-SPIN-R.

    Science.gov (United States)

    Aderka, Idan M; Pollack, Mark H; Simon, Naomi M; Smits, Jasper A J; Van Ameringen, Michael; Stein, Murray B; Hofmann, Stefan G

    2013-12-01

    The Social Phobia Inventory (SPIN) is a widely used measure in mental health settings and a 3-item version (mini-SPIN) has been developed as a screening instrument for social anxiety disorder. In the present study, we examined the psychometric properties of the SPIN and developed a brief version (mini-SPIN-R) designed to assess social anxiety severity using item response theory. Our sample included 569 individuals with social anxiety disorder who participated in 2 clinical trials and filled out a battery of self-report measures. Using a nonparametric kernel smoothing method we identified the most sensitive items of the SPIN. These 3 items comprised the mini-SPIN-R, which was found to have greater internal consistency, and to capture a greater range of symptoms compared to the mini-SPIN. The mini-SPIN-R evidenced superior convergent validity compared to the mini-SPIN and both measures had similar divergent validity. Thus, the mini-SPIN-R is a promising brief measure of social anxiety severity. Copyright © 2013. Published by Elsevier Ltd.

  7. Exploring differential item functioning (DIF) with the Rasch model: A comparison of gender differences on eighth-grade science items in the United States and Spain

    Science.gov (United States)

    Calvert, Tasha

    Despite the attention that has been given to gender and science, boys continue to outperform girls in science achievement, particularly by the end of secondary school. Because it is unclear whether gender differences have narrowed over time (Leder, 1992; Willingham & Cole, 1997), it is important to continue a line of inquiry into the nature of gender differences, specifically at the international level. The purpose of this study was to investigate gender differences in science achievement across two countries: United States and Spain. A secondary purpose was to demonstrate an alternative method for exploring gender differences based on the many-faceted Rasch model (1980). A secondary analysis of the data from the Third International Mathematics and Science Study (TIMSS) was used to examine the relationship between gender DIF (differential item functioning) and item characteristics (item type, content, and performance expectation) across both countries. Nationally representative samples of eighth grade students in the United States and Spain who participated in TIMSS were analyzed to answer the research questions in this study. In both countries, girls showed an advantage over boys on life science items and most extended response items, whereas boys, by and large, had an advantage on earth science, physics, and chemistry items. However, even within areas that favored boys, such as physics, there were items that were differentially easier for girls. In general, patterns in gender differences were similar across both countries although there were a few differences between the countries on individual items. It was concluded that simply looking at mean differences does not provide an adequate understanding of the nature of gender differences in science achievement.

  8. A Comprehensive List of Items to be Included on a Pediatric Drug Monograph.

    Science.gov (United States)

    Kelly, Lauren E; Ito, Shinya; Woods, David; Nunn, Anthony J; Taketomo, Carol; de Hoog, Matthijs; Offringa, Martin

    2017-01-01

    Children require special considerations for drug prescribing. Drug information summarized in a formulary containing drug monographs is essential for safe and effective prescribing. Currently, little is known about the information needs of those who prescribe and administer medicines to children. Our primary objective was to identify a list of important and relevant items to be included in a pediatric drug monograph. Following the establishment of an expert steering committee and an environmental scan of adult and pediatric formulary monograph items, 46 participants from 25 countries were invited to complete a 2-round Delphi survey. Questions regarding source of prescribing information and importance of items were recorded. An international consensus meeting to vote on and finalize the items list with the steering committee followed. Pediatric formularies are most commonly the first resource consulted for information on medication used in children by 31 Delphi participants. After the Delphi rounds, 116 items were identified to be included in a comprehensive pediatric drug monograph, including general information, adverse drug reactions, dosages, precautions, drug-drug interactions, formulation, and drug properties. Health care providers identified 116 monograph items as important for prescribing medicines for children by an international consensus-based process. This information will assist in setting standards for the creation of new pediatric drug monographs for international application and for those involved in pediatric formulary development.

  9. ‘Forget me (not?’ – Remembering forget-items versus un-cued items in directed forgetting

    Directory of Open Access Journals (Sweden)

    Bastian eZwissler

    2015-11-01

    Full Text Available Humans need to be able to selectively control their memories. Here, we investigate the underlying processes in item-method directed forgetting and compare the classic active memory cues in this paradigm with a passive instruction. Typically, individual items are presented and each is followed by either a forget- or remember-instruction. On a surprise test of all items, memory is then worse for to-be-forgotten items (TBF compared to to-be-remembered items (TBR. This is thought to result from selective rehearsal of TBR, or from active inhibition of TBF, or from both. However, evidence suggests that if a forget instruction initiates active processing, paradoxical effects may also arise. To investigate the underlying mechanisms, four experiments were conducted where un-cued items (UI were introduced and recognition performance was compared between TBR, TBF and UI stimuli. Accuracy was encouraged via a performance-dependent monetary bonus. Across all experiments, including perceptually fully matched variants, memory accuracy for TBF was reduced compared to TBR, but better than for UI. Moreover, participants used a more conservative response criterion when responding to TBF stimuli. Thus, ironically, the F cue results in active processing, but this does not have inhibitory effects that would impair recognition memory beyond a un-cued baseline condition. This casts doubts on inhibitory accounts of item-method directed forgetting and is also difficult to reconcile with pure selective rehearsal of TBR. While the F-cue does induce active processing, this does not result in particularly successful forgetting. The pattern seems most consistent with the notion of ironic processing.

  10. Using Procedure Based on Item Response Theory to Evaluate Classification Consistency Indices in the Practice of Large-Scale Assessment

    Directory of Open Access Journals (Sweden)

    Shanshan Zhang

    2017-09-01

    Full Text Available In spite of the growing interest in the methods of evaluating the classification consistency (CC indices, only few researches are available in the field of applying these methods in the practice of large-scale educational assessment. In addition, only few studies considered the influence of practical factors, for example, the examinee ability distribution, the cut score location and the score scale, on the performance of CC indices. Using the newly developed Lee's procedure based on the item response theory (IRT, the main purpose of this study is to investigate the performance of CC indices when practical factors are taken into consideration. A simulation study and an empirical study were conducted under comprehensive conditions. Results suggested that with negatively skewed distribution, the CC indices were larger than with other distributions. Interactions occurred among ability distribution, cut score location, and score scale. Consequently, Lee's IRT procedure is reliable to be used in the field of large-scale educational assessment, and when reporting the indices, it should be treated with caution as testing conditions may vary a lot.

  11. Development of a simple 12-item theory-based instrument to assess the impact of continuing professional development on clinical behavioral intentions.

    Directory of Open Access Journals (Sweden)

    France Légaré

    Full Text Available Decision-makers in organizations providing continuing professional development (CPD have identified the need for routine assessment of its impact on practice. We sought to develop a theory-based instrument for evaluating the impact of CPD activities on health professionals' clinical behavioral intentions.Our multipronged study had four phases. 1 We systematically reviewed the literature for instruments that used socio-cognitive theories to assess healthcare professionals' clinically-oriented behavioral intentions and/or behaviors; we extracted items relating to the theoretical constructs of an integrated model of healthcare professionals' behaviors and removed duplicates. 2 A committee of researchers and CPD decision-makers selected a pool of items relevant to CPD. 3 An international group of experts (n = 70 reached consensus on the most relevant items using electronic Delphi surveys. 4 We created a preliminary instrument with the items found most relevant and assessed its factorial validity, internal consistency and reliability (weighted kappa over a two-week period among 138 physicians attending a CPD activity. Out of 72 potentially relevant instruments, 47 were analyzed. Of the 1218 items extracted from these, 16% were discarded as improperly phrased and 70% discarded as duplicates. Mapping the remaining items onto the constructs of the integrated model of healthcare professionals' behaviors yielded a minimum of 18 and a maximum of 275 items per construct. The partnership committee retained 61 items covering all seven constructs. Two iterations of the Delphi process produced consensus on a provisional 40-item questionnaire. Exploratory factorial analysis following test-retest resulted in a 12-item questionnaire. Cronbach's coefficients for the constructs varied from 0.77 to 0.85.A 12-item theory-based instrument for assessing the impact of CPD activities on health professionals' clinical behavioral intentions showed adequate validity and

  12. Repair systems with exchangeable items and the longest queue mechanism

    NARCIS (Netherlands)

    Ravid, R.; Boxma, O.J.; Perry, D.

    2013-01-01

    We consider a repair facility consisting of one repairman and two arrival streams of failed items, from bases 1 and 2. The arrival processes are independent Poisson processes, and the repair times are independent and identically exponentially distributed. The item types are exchangeable, and a

  13. Repair systems with exchangeable items and the longest queue mechanism

    NARCIS (Netherlands)

    Ravid, R.; Boxma, O.J.; Perry, D.

    2011-01-01

    We consider a repair facility consisting of one repairman and two arrival streams of failed items, from bases 1 and 2. The arrival processes are independent Poisson processes, and the repair times are independent and identically exponentially distributed. The item types are exchangeable, and a

  14. Using automatic item generation to create multiple-choice test items.

    Science.gov (United States)

    Gierl, Mark J; Lai, Hollis; Turner, Simon R

    2012-08-01

    Many tests of medical knowledge, from the undergraduate level to the level of certification and licensure, contain multiple-choice items. Although these are efficient in measuring examinees' knowledge and skills across diverse content areas, multiple-choice items are time-consuming and expensive to create. Changes in student assessment brought about by new forms of computer-based testing have created the demand for large numbers of multiple-choice items. Our current approaches to item development cannot meet this demand. We present a methodology for developing multiple-choice items based on automatic item generation (AIG) concepts and procedures. We describe a three-stage approach to AIG and we illustrate this approach by generating multiple-choice items for a medical licensure test in the content area of surgery. To generate multiple-choice items, our method requires a three-stage process. Firstly, a cognitive model is created by content specialists. Secondly, item models are developed using the content from the cognitive model. Thirdly, items are generated from the item models using computer software. Using this methodology, we generated 1248 multiple-choice items from one item model. Automatic item generation is a process that involves using models to generate items using computer technology. With our method, content specialists identify and structure the content for the test items, and computer technology systematically combines the content to generate new test items. By combining these outcomes, items can be generated automatically. © Blackwell Publishing Ltd 2012.

  15. Using Likert-type and ipsative/forced choice items in sequence to generate a preference.

    Science.gov (United States)

    Ried, L Douglas

    2014-01-01

    Collaboration and implementation of a minimum, standardized set of core global educational and professional competencies seems appropriate given the expanding international evolution of pharmacy practice. However, winnowing down hundreds of competencies from a plethora of local, national and international competency frameworks to select the most highly preferred to be included in the core set is a daunting task. The objective of this paper is to describe a combination of strategies used to ascertain the most highly preferred items among a large number of disparate items. In this case, the items were >100 educational and professional competencies that might be incorporated as the core components of new and existing competency frameworks. Panelists (n = 30) from the European Union (EU) and United States (USA) were chosen to reflect a variety of practice settings. Each panelist completed two electronic surveys. The first survey presented competencies in a Likert-type format and the second survey presented many of the same competencies in an ipsative/forced choice format. Item mean scores were calculated for each competency, the competencies were ranked, and non-parametric statistical tests were used to ascertain the consistency in the rankings achieved by the two strategies. This exploratory study presented over 100 competencies to the panelists in the beginning. The two methods provided similar results, as indicated by the significant correlation between the rankings (Spearman's rho = 0.30, P < 0.09). A two-step strategy using Likert-type and ipsative/forced choice formats in sequence, appears to be useful in a situation where a clear preference is required from among a large number of choices. The ipsative/forced choice format resulted in some differences in the competency preferences because the panelists could not rate them equally by design. While this strategy was used for the selection of professional educational competencies in this exploratory study, it is

  16. Screening for depression in advanced disease: psychometric properties, sensitivity, and specificity of two items of the Palliative Care Outcome Scale (POS).

    Science.gov (United States)

    Antunes, Bárbara; Murtagh, Fliss; Bausewein, Claudia; Harding, Richard; Higginson, Irene J

    2015-02-01

    Depression is common among patients with advanced disease but often difficult to detect. To assess the Palliative care Outcome Scale (POS) (10 items) against the Geriatric Depression Scale (GDS)-10 total score and the Hospital Anxiety and Depression Scale (HADS)-Depression subscale total score and determine if the POS has appropriate items to screen for depression among people with advanced disease. This was a secondary analysis performed on five studies. Four psychometric properties were assessed: data quality, scaling assumptions, acceptability, and internal consistency (reliability). Receiver operating characteristic (ROC) curves were used to determine the area under the curve. Sensitivity, specificity, positive and negative predictive values, false positive and negative rates, and positive and negative likelihood ratios were computed. The overall sample had 416 patients from Germany and England: 144 had cancer and 267 had nonmalignant conditions. Prevalence of depression across the sample was 17.5%. Floor and ceiling effects were rare. Cronbach's alpha coefficients for POS items 7 and 8 summed, GDS-10 and HADS-Depression items varied: 0.61 (heart failure) and 0.80 (cancer). Two items combined (Item 7-feeling depressed and Item 8-feeling good about yourself) consistently presented the highest area under the ROC curve, ranging from 0.76 (95% CI 0.60, 0.93) (Germany, lung cancer) to 0.97 (95% CI 0.91, 1.0) (heart failure), highest negative predictive value, and lowest false negative rate. For the overall sample, the cutoff 2/3 presented a negative predictive value of 89.4% (95% CI 84.7, 92.8) and false negative rate of 10.6 (95% CI 7.2, 15.3). POS items 7 and 8 summed are potentially useful to screen for depression in advanced disease populations. Copyright © 2015 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.

  17. Discussion on monitoring items of radionuclides in influents from nuclear power plants

    International Nuclear Information System (INIS)

    Zhang Yanxia; Li Jin; Liu Jiacheng; Han Shanbiao; Yu Zhengwei

    2014-01-01

    For the radionuclide monitoring items of effluents from nuclear power plant, this paper makes some comparisons and analysis from three aspects of the international atomic energy general requirements, the routine radionuclide measurement items of China's nuclear power plant and effluents low level radionuclide experimental research results. Finally, it summarizes the necessary items and recommended items of the radionuclide monitoring of effluents from nuclear power plant, which can provide references for the radioactivity monitoring activities of nuclear power plant effluent and the supervisions of regulatory departments. (authors)

  18. Psychometric evaluation of an item bank for computerized adaptive testing of the EORTC QLQ-C30 cognitive functioning dimension in cancer patients.

    Science.gov (United States)

    Dirven, Linda; Groenvold, Mogens; Taphoorn, Martin J B; Conroy, Thierry; Tomaszewski, Krzysztof A; Young, Teresa; Petersen, Morten Aa

    2017-11-01

    The European Organisation of Research and Treatment of Cancer (EORTC) Quality of Life Group is developing computerized adaptive testing (CAT) versions of all EORTC Quality of Life Questionnaire (QLQ-C30) scales with the aim to enhance measurement precision. Here we present the results on the field-testing and psychometric evaluation of the item bank for cognitive functioning (CF). In previous phases (I-III), 44 candidate items were developed measuring CF in cancer patients. In phase IV, these items were psychometrically evaluated in a large sample of international cancer patients. This evaluation included an assessment of dimensionality, fit to the item response theory (IRT) model, differential item functioning (DIF), and measurement properties. A total of 1030 cancer patients completed the 44 candidate items on CF. Of these, 34 items could be included in a unidimensional IRT model, showing an acceptable fit. Although several items showed DIF, these had a negligible impact on CF estimation. Measurement precision of the item bank was much higher than the two original QLQ-C30 CF items alone, across the whole continuum. Moreover, CAT measurement may on average reduce study sample sizes with about 35-40% compared to the original QLQ-C30 CF scale, without loss of power. A CF item bank for CAT measurement consisting of 34 items was established, applicable to various cancer patients across countries. This CAT measurement system will facilitate precise and efficient assessment of HRQOL of cancer patients, without loss of comparability of results.

  19. An abbreviated Faecal Incontinence Quality of Life Scale for Chinese-speaking population with colorectal cancer after surgery: cultural adaptation and item reduction.

    Science.gov (United States)

    Hsu, L-F; Hung, C-L; Kuo, L-J; Tsai, P-S

    2017-09-01

    No instrument is available to assess the impact of faecal incontinence (FI) of quality of life for Chinese-speaking population. The purpose of the study was to adapt the Faecal Incontinence Quality of Life Scale (FIQL) for patients with colorectal cancer, assess the factor structure and reduce the items for brevity. A sample of 120 participants were enrolled. Internal consistency, test-retest reliability, and convergent and contrasted-groups validity were assessed. Construct validity was analysed using an exploratory and confirmatory factor analyses (CFA). The internal consistency (Cronbach's α of the total scale and four subscales = 0.98 and 0.97, 0.96, 0.92, 0.82 respectively), test-retest reliability (intraclass correlation coefficients ≥.98 for all scales with p < .001) and significant correlations of all scales with selected subscales of the Medical Outcomes Study 36-Item Short-Form Health Survey and the Wexner scale suggested satisfactory reliability and validity. The severe FI group (with a Wexner score ≥9) scored significantly lower on the scale than the less severe FI group (with a Wexner score <9) did (p < .001). The CFA supported a two-factor structure and demonstrated an excellent model fit of the 15-item abbreviated version of the FIQL-Chinese. The FIQL-Chinese has satisfactory validity and reliability and the abbreviated version may be more practical and applicable. © 2016 John Wiley & Sons Ltd.

  20. Item response modeling: a psychometric assessment of the children's fruit, vegetable, water, and physical activity self-efficacy scales among Chinese children.

    Science.gov (United States)

    Wang, Jing-Jing; Chen, Tzu-An; Baranowski, Tom; Lau, Patrick W C

    2017-09-16

    This study aimed to evaluate the psychometric properties of four self-efficacy scales (i.e., self-efficacy for fruit (FSE), vegetable (VSE), and water (WSE) intakes, and physical activity (PASE)) and to investigate their differences in item functioning across sex, age, and body weight status groups using item response modeling (IRM) and differential item functioning (DIF). Four self-efficacy scales were administrated to 763 Hong Kong Chinese children (55.2% boys) aged 8-13 years. Classical test theory (CTT) was used to examine the reliability and factorial validity of scales. IRM was conducted and DIF analyses were performed to assess the characteristics of item parameter estimates on the basis of children's sex, age and body weight status. All self-efficacy scales demonstrated adequate to excellent internal consistency reliability (Cronbach's α: 0.79-0.91). One FSE misfit item and one PASE misfit item were detected. Small DIF were found for all the scale items across children's age groups. Items with medium to large DIF were detected in different sex and body weight status groups, which will require modification. A Wright map revealed that items covered the range of the distribution of participants' self-efficacy for each scale except VSE. Several self-efficacy scales' items functioned differently by children's sex and body weight status. Additional research is required to modify the four self-efficacy scales to minimize these moderating influences for application.

  1. The Role of Item Models in Automatic Item Generation

    Science.gov (United States)

    Gierl, Mark J.; Lai, Hollis

    2012-01-01

    Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…

  2. Análisis de consistencia interna mediante Alfa de Cronbach: un programa basado en gráficos dinámicos Internal consistency analysis by means of Cronbach’s Alpha: a computer program based on dynamic graphics

    Directory of Open Access Journals (Sweden)

    Rubén Ledesma

    2002-12-01

    Full Text Available En este trabajo se presenta una herramienta informática original que permite realizar análisis de consistencia interna (modelo Alfa de Cronbach utilizando métodos gráficos dinámicos. Se trata de un módulo basado en la filosofía del Análisis Exploratorio de Datos y en métodos de visualización estadística, diseñado para asistir al analista en el proceso de construcción de pruebas psicológicas. La herramienta permite analizar la consistencia interna de la prueba, las propiedades de los ítems que la componen, los patrones de respuesta de los sujetos a los ítems, y el efecto de la eliminación de los ítems y del incremento en la longitud de la prueba sobre su fiabilidad. En comparación con otros programas existentes, el beneficio del módulo es la incorporación de gráficos estadísticos dinámicos como complemento a la presentación de resultados convencionales en formato texto.This paper describes a computer software that provides dynamic graphics to perform internal consistence analysis by means of Cronbach’s Alpha. This software, based on Exploratory Data Analysis philosophy and statistical visualization methods, is designed to assist the process of psychological test and scale construction. It allows carry out internal consistency analysis, as well as exploring statistical properties of items, subject responses patterns, and the effect of item deletion and test length increase on reliability coefficient. Comparing with other statistical software, the benefit of this program is to use dynamic graphics complementing statistical output.

  3. Statistical power as a function of Cronbach alpha of instrument questionnaire items.

    Science.gov (United States)

    Heo, Moonseong; Kim, Namhee; Faith, Myles S

    2015-10-14

    In countless number of clinical trials, measurements of outcomes rely on instrument questionnaire items which however often suffer measurement error problems which in turn affect statistical power of study designs. The Cronbach alpha or coefficient alpha, here denoted by C(α), can be used as a measure of internal consistency of parallel instrument items that are developed to measure a target unidimensional outcome construct. Scale score for the target construct is often represented by the sum of the item scores. However, power functions based on C(α) have been lacking for various study designs. We formulate a statistical model for parallel items to derive power functions as a function of C(α) under several study designs. To this end, we assume fixed true score variance assumption as opposed to usual fixed total variance assumption. That assumption is critical and practically relevant to show that smaller measurement errors are inversely associated with higher inter-item correlations, and thus that greater C(α) is associated with greater statistical power. We compare the derived theoretical statistical power with empirical power obtained through Monte Carlo simulations for the following comparisons: one-sample comparison of pre- and post-treatment mean differences, two-sample comparison of pre-post mean differences between groups, and two-sample comparison of mean differences between groups. It is shown that C(α) is the same as a test-retest correlation of the scale scores of parallel items, which enables testing significance of C(α). Closed-form power functions and samples size determination formulas are derived in terms of C(α), for all of the aforementioned comparisons. Power functions are shown to be an increasing function of C(α), regardless of comparison of interest. The derived power functions are well validated by simulation studies that show that the magnitudes of theoretical power are virtually identical to those of the empirical power. Regardless

  4. Problems with the factor analysis of items: Solutions based on item response theory and item parcelling

    Directory of Open Access Journals (Sweden)

    Gideon P. De Bruin

    2004-10-01

    Full Text Available The factor analysis of items often produces spurious results in the sense that unidimensional scales appear multidimensional. This may be ascribed to failure in meeting the assumptions of linearity and normality on which factor analysis is based. Item response theory is explicitly designed for the modelling of the non-linear relations between ordinal variables and provides a strong alternative to the factor analysis of items. Items may also be combined in parcels that are more likely to satisfy the assumptions of factor analysis than do the items. The use of the Rasch rating scale model and the factor analysis of parcels is illustrated with data obtained with the Locus of Control Inventory. The results of these analyses are compared with the results obtained through the factor analysis of items. It is shown that the Rasch rating scale model and the factoring of parcels produce superior results to the factor analysis of items. Recommendations for the analysis of scales are made. Opsomming Die faktorontleding van items lewer dikwels misleidende resultate op, veral in die opsig dat eendimensionele skale as meerdimensioneel voorkom. Hierdie resultate kan dikwels daaraan toegeskryf word dat daar nie aan die aannames van lineariteit en normaliteit waarop faktorontleding berus, voldoen word nie. Itemresponsteorie, wat eksplisiet vir die modellering van die nie-liniêre verbande tussen ordinale items ontwerp is, bied ’n aantreklike alternatief vir die faktorontleding van items. Items kan ook in pakkies gegroepeer word wat meer waarskynlik aan die aannames van faktorontleding voldoen as individuele items. Die gebruik van die Rasch beoordelingskaalmodel en die faktorontleding van pakkies word aan die hand van data wat met die Lokus van Beheervraelys verkry is, gedemonstreer. Die resultate van hierdie ontledings word vergelyk met die resultate wat deur ‘n faktorontleding van die individuele items verkry is. Die resultate dui daarop dat die Rasch

  5. Identifying the most efficient items from the Mini-Mental State Examination for cognitive function assessment in older Taiwanese patients.

    Science.gov (United States)

    Lou, Meei-Fang; Dai, Yu-Tzu; Huang, Guey-Shiun; Yu, Po-Jui

    2007-03-01

    The purpose of the study was to identify the most efficient items from the Mini-Mental State Examination for assessment of cognitive function. The Mini-Mental State Examination is the most frequently used cognitive screening instrument. However, the Mini-Mental State Examination has been criticized for insensitivity to mild cognitive dysfunction, limited memory assessment and variability in level of difficulty of the individual items. This study used secondary data analysis. Item response theory two-parameter model was used to analyse the data from the admission assessment of mental status by the Mini-Mental State Examination for 801 patients. By using item response analysis, 16 items were selected from the original 30-item Mini-Mental State Examination. The 16 items included mainly the measures of orientation, recall and attention and calculation. The internal consistency of the 16-item Mini-Mental State Examination was 0.84. The proposed new cut-off point for the 16-item Mini-Mental State Examination was 11. The correct classification rate was 0.94, the sensitivity was 100% and the specificity was 97.4%, when compared with the original 30-item Mini-Mental State Examination from the cut-off point of 24. This new cut-off point was determined for the purpose of over-identifying patients at risk so as to ensure early detection of and prevention from the onset of cognitive disturbance. Only a few items are needed to describe the subject's cognitive status. Using item response theory analysis, the study found that the Mini-Mental State Examination could be simplified. Deleting the items with less variation makes this assessment tool not only shorter, easier to administer and less strenuous for respondents, but also enables one to maintain validity as a cognitive function test for clinical setting.

  6. 47 CFR 36.224 - Extraordinary items-Account 7600.

    Science.gov (United States)

    2010-10-01

    ..., REVENUES, EXPENSES, TAXES AND RESERVES FOR TELECOMMUNICATIONS COMPANIES 1 Operating Revenues and Certain... account of an operating nature are apportioned on a basis consistent with the nature of these items. ...

  7. An overview of coefficient alpha and a reliability matrix for estimating adequacy of internal consistency coefficients with psychological research measures.

    Science.gov (United States)

    Ponterotto, Joseph G; Ruckdeschel, Daniel E

    2007-12-01

    The present article addresses issues in reliability assessment that are often neglected in psychological research such as acceptable levels of internal consistency for research purposes, factors affecting the magnitude of coefficient alpha (alpha), and considerations for interpreting alpha within the research context. A new reliability matrix anchored in classical test theory is introduced to help researchers judge adequacy of internal consistency coefficients with research measures. Guidelines and cautions in applying the matrix are provided.

  8. Item information and discrimination functions for trinary PCM items

    NARCIS (Netherlands)

    Akkermans, Wies; Muraki, Eiji

    1997-01-01

    For trinary partial credit items the shape of the item information and the item discrimination function is examined in relation to the item parameters. In particular, it is shown that these functions are unimodal if δ2 – δ1 < 4 ln 2 and bimodal otherwise. The locations and values of the maxima are

  9. International Classification of Functioning, Disability and Health categories explored for self-rated participation in Swedish adolescents and adults with a mild intellectual disability.

    Science.gov (United States)

    Arvidsson, Patrik; Granlund, Mats; Thyberg, Ingrid; Thyberg, Mikael

    2012-06-01

    To explore internal consistency and correlations between perceived ability, performance and perceived importance in a preliminary selection of self-reported items representing the activity/participation component of the International Classification of Functioning, Disability and Health (ICF). Structured interview study. Fifty-five Swedish adolescents and adults with a mild intellectual disability. Questions about perceived ability, performance and perceived importance were asked on the basis of a 3-grade Likert-scale regarding each of 68 items representing the 9 ICF domains of activity/participation. Internal consistency for perceived ability (Cronbach's alpha for all 68 items): 0.95 (values for each domain varied between 0.57 and 0.85), for performance: 0.86 (between 0.27 and 0.66), for perceived importance: 0.84 (between 0.27 and 0.68). Seventy-two percent of the items showed correlations >0.5 (mean=0.59) for performance vs perceived importance, 41% >0.5 (mean=0.47) for perceived ability vs performance and 12% >0.5 (mean=0.28) for perceived ability vs perceived importance. Measures of performance and perceived importance may have to be based primarily on their estimated clinical relevance for describing aspects of the ICF participation concept. With a clinimetric approach, parts of the studied items and domains may be used to investigate factors related to different patterns and levels of participation, and outcomes of rehabilitation.

  10. Evaluating an Automated Number Series Item Generator Using Linear Logistic Test Models

    Directory of Open Access Journals (Sweden)

    Bao Sheng Loe

    2018-04-01

    Full Text Available This study investigates the item properties of a newly developed Automatic Number Series Item Generator (ANSIG. The foundation of the ANSIG is based on five hypothesised cognitive operators. Thirteen item models were developed using the numGen R package and eleven were evaluated in this study. The 16-item ICAR (International Cognitive Ability Resource1 short form ability test was used to evaluate construct validity. The Rasch Model and two Linear Logistic Test Model(s (LLTM were employed to estimate and predict the item parameters. Results indicate that a single factor determines the performance on tests composed of items generated by the ANSIG. Under the LLTM approach, all the cognitive operators were significant predictors of item difficulty. Moderate to high correlations were evident between the number series items and the ICAR test scores, with high correlation found for the ICAR Letter-Numeric-Series type items, suggesting adequate nomothetic span. Extended cognitive research is, nevertheless, essential for the automatic generation of an item pool with predictable psychometric properties.

  11. Internal Consistency of the easyCBM© CCSS Reading Measures: Grades 3-8. Technical Report #1407

    Science.gov (United States)

    Guerreiro, Meg; Alonzo, Julie; Tindal, Gerald

    2014-01-01

    This technical report documents findings from a study of the internal consistency and split-half reliability of the easyCBM© CCSS Reading measures, grades 3-8. Data, drawn from an extant data set gathered in school year 2013-2014, include scores from over 150,000 students' fall and winter benchmark assessments. Findings suggest that the easyCBM©…

  12. How Well Does the Sum Score Summarize the Test? Summability as a Measure of Internal Consistency

    NARCIS (Netherlands)

    Goeman, J.J.; De, Jong N.H.

    2018-01-01

    Many researchers use Cronbach's alpha to demonstrate internal consistency, even though it has been shown numerous times that Cronbach's alpha is not suitable for this. Because the intention of questionnaire and test constructers is to summarize the test by its overall sum score, we advocate

  13. Scenes for Social Information Processing in Adolescence: Item and factor analytic procedures for psychometric appraisal.

    Science.gov (United States)

    Vagos, Paula; Rijo, Daniel; Santos, Isabel M

    2016-04-01

    Relatively little is known about measures used to investigate the validity and applications of social information processing theory. The Scenes for Social Information Processing in Adolescence includes items built using a participatory approach to evaluate the attribution of intent, emotion intensity, response evaluation, and response decision steps of social information processing. We evaluated a sample of 802 Portuguese adolescents (61.5% female; mean age = 16.44 years old) using this instrument. Item analysis and exploratory and confirmatory factor analytic procedures were used for psychometric examination. Two measures for attribution of intent were produced, including hostile and neutral; along with 3 emotion measures, focused on negative emotional states; 8 response evaluation measures; and 4 response decision measures, including prosocial and impaired social behavior. All of these measures achieved good internal consistency values and fit indicators. Boys seemed to favor and choose overt and relational aggression behaviors more often; girls conveyed higher levels of neutral attribution, sadness, and assertiveness and passiveness. The Scenes for Social Information Processing in Adolescence achieved adequate psychometric results and seems a valuable alternative for evaluating social information processing, even if it is essential to continue investigation into its internal and external validity. (c) 2016 APA, all rights reserved.

  14. Link between self-consistent pressure profiles and electron internal transport barriers in tokamaks

    Energy Technology Data Exchange (ETDEWEB)

    Razumova, K A [Nuclear Fusion Institute, RRC ' Kurchatov Institute' , 123182 Moscow (Russian Federation); Andreev, V F [Nuclear Fusion Institute, RRC ' Kurchatov Institute' , 123182 Moscow (Russian Federation); Donne, A J H [FOM-Institute for Plasma Physics Rijnhuizen, Association EURATOM-FOM, partner in the Trilateral Euregio Cluster, PO Box 1207, 3430 BE Nieuwegein (Netherlands); Hogeweij, G M D [FOM-Institute for Plasma Physics Rijnhuizen, Association EURATOM-FOM, partner in the Trilateral Euregio Cluster, PO Box 1207, 3430 BE Nieuwegein (Netherlands); Lysenko, S E [Nuclear Fusion Institute, RRC ' Kurchatov Institute' , 123182 Moscow (Russian Federation); Shelukhin, D A [Nuclear Fusion Institute, RRC ' Kurchatov Institute' , 123182 Moscow (Russian Federation); Spakman, G W [FOM-Institute for Plasma Physics Rijnhuizen, Association EURATOM-FOM, partner in the Trilateral Euregio Cluster, PO Box 1207, 3430 BE Nieuwegein (Netherlands); Vershkov, V A [Nuclear Fusion Institute, RRC ' Kurchatov Institute' , 123182 Moscow (Russian Federation); Zhuravlev, V A [Nuclear Fusion Institute, RRC ' Kurchatov Institute' , 123182 Moscow (Russian Federation)

    2006-09-15

    Tokamak plasmas have a tendency to self-organization: the plasma pressure profiles obtained in different operational regimes and even in various tokamaks may be represented by a single typical curve, called the self-consistent pressure profile. About a decade ago local zones with enhanced confinement were discovered in tokamak plasmas. These zones are referred to as internal transport barriers (ITBs) and they can act on the electron and/or ion fluid. Here the pressure gradients can largely exceed the gradients dictated by profile consistency. So the existence of ITBs seems to be in contradiction with the self-consistent pressure profiles (this is also often referred to as profile resilience or profile stiffness). In this paper we will discuss the interplay between profile consistency and ITBs. A summary of the cumulative information obtained from T-10, RTP and TEXTOR is given, and a coherent explanation of the main features of the observed phenomena is suggested. Both phenomena, the self-consistent profile and ITB, are connected with the density of rational magnetic surfaces, where the turbulent cells are situated. The distance between these cells determines the level of their interaction, and therefore the level of the turbulent transport. This process regulates the plasma pressure profile. If the distance is wide, the turbulent flux may be diminished and the ITB may be formed. In regions with rarefied surfaces the steeper pressure gradients are possible without instantaneously inducing pressure driven instabilities, which force the profiles back to their self-consistent shapes. Also it can be expected that the ITB region is wider for lower dq/d{rho} (more rarefied surfaces)

  15. Item level diagnostics and model - data fit in item response theory ...

    African Journals Online (AJOL)

    Item response theory (IRT) is a framework for modeling and analyzing item response data. Item-level modeling gives IRT advantages over classical test theory. The fit of an item score pattern to an item response theory (IRT) models is a necessary condition that must be assessed for further use of item and models that best fit ...

  16. The Effects of Item Format and Cognitive Domain on Students' Science Performance in TIMSS 2011

    Science.gov (United States)

    Liou, Pey-Yan; Bulut, Okan

    2017-12-01

    The purpose of this study was to examine eighth-grade students' science performance in terms of two test design components, item format, and cognitive domain. The portion of Taiwanese data came from the 2011 administration of the Trends in International Mathematics and Science Study (TIMSS), one of the major international large-scale assessments in science. The item difficulty analysis was initially applied to show the proportion of correct items. A regression-based cumulative link mixed modeling (CLMM) approach was further utilized to estimate the impact of item format, cognitive domain, and their interaction on the students' science scores. The results of the proportion-correct statistics showed that constructed-response items were more difficult than multiple-choice items, and that the reasoning cognitive domain items were more difficult compared to the items in the applying and knowing domains. In terms of the CLMM results, students tended to obtain higher scores when answering constructed-response items as well as items in the applying cognitive domain. When the two predictors and the interaction term were included together, the directions and magnitudes of the predictors on student science performance changed substantially. Plausible explanations for the complex nature of the effects of the two test-design predictors on student science performance are discussed. The results provide practical, empirical-based evidence for test developers, teachers, and stakeholders to be aware of the differential function of item format, cognitive domain, and their interaction in students' science performance.

  17. Redefining diagnostic symptoms of depression using Rasch analysis: testing an item bank suitable for DSM-V and computer adaptive testing.

    Science.gov (United States)

    Mitchell, Alex J; Smith, Adam B; Al-salihy, Zerak; Rahim, Twana A; Mahmud, Mahmud Q; Muhyaldin, Asma S

    2011-10-01

    We aimed to redefine the optimal self-report symptoms of depression suitable for creation of an item bank that could be used in computer adaptive testing or to develop a simplified screening tool for DSM-V. Four hundred subjects (200 patients with primary depression and 200 non-depressed subjects), living in Iraqi Kurdistan were interviewed. The Mini International Neuropsychiatric Interview (MINI) was used to define the presence of major depression (DSM-IV criteria). We examined symptoms of depression using four well-known scales delivered in Kurdish. The Partial Credit Model was applied to each instrument. Common-item equating was subsequently used to create an item bank and differential item functioning (DIF) explored for known subgroups. A symptom level Rasch analysis reduced the original 45 items to 24 items of the original after the exclusion of 21 misfitting items. A further six items (CESD13 and CESD17, HADS-D4, HADS-D5 and HADS-D7, and CDSS3 and CDSS4) were removed due to misfit as the items were added together to form the item bank, and two items were subsequently removed following the DIF analysis by diagnosis (CESD20 and CDSS9, both of which were harder to endorse for women). Therefore the remaining optimal item bank consisted of 17 items and produced an area under the curve (AUC) of 0.987. Using a bank restricted to the optimal nine items revealed only minor loss of accuracy (AUC = 0.989, sensitivity 96%, specificity 95%). Finally, when restricted to only four items accuracy was still high (AUC was still 0.976; sensitivity 93%, specificity 96%). An item bank of 17 items may be useful in computer adaptive testing and nine or even four items may be used to develop a simplified screening tool for DSM-V major depressive disorder (MDD). Further examination of this item bank should be conducted in different cultural settings.

  18. A Systematic Approach to Identify Promising New Items for Small to Medium Enterprises: A Case Study

    Directory of Open Access Journals (Sweden)

    Sukjae Jeong

    2016-11-01

    Full Text Available Despite the growing importance of identifying new business items for small and medium enterprises (SMEs, most previous studies focus on conglomerates. The paucity of empirical studies has also led to limited real-life applications. Hence, this study proposes a systematic approach to find new business items (NBIs that help the prospective SMEs develop, evaluate, and select viable business items to survive the competitive environment. The proposed approach comprises two stages: (1 the classification of diversification of SMEs; and (2 the searching and screening of business items. In the first stage, SMEs are allocated to five groups, based on their internal technological competency and external market conditions. In the second stage, based on the types of SMEs identified in the first stage, a set of alternative business items is derived by combining the results of portfolio analysis and benchmarking analysis. After deriving new business items, a market and technology-driven matrix analysis is utilized to screen suitable business items, and the Bruce Merrifield-Ohe (BMO method is used to categorize and identify prospective items based on market attractiveness and internal capability. To illustrate the applicability of the proposed approach, a case study is presented.

  19. Effects of Misbehaving Common Items on Aggregate Scores and an Application of the Mantel-Haenszel Statistic in Test Equating. CSE Report 688

    Science.gov (United States)

    Michaelides, Michalis P.

    2006-01-01

    Consistent behavior is a desirable characteristic that common items are expected to have when administered to different groups. Findings from the literature have established that items do not always behave in consistent ways; item indices and IRT item parameter estimates of the same items differ when obtained from different administrations.…

  20. The influence of item order on intentional response distortion in the assessment of high potentials: assessing pilot applicants.

    Science.gov (United States)

    Khorramdel, Lale; Kubinger, Klaus D; Uitz, Alexander

    2014-04-01

    An experiment was conducted to investigate the effects of item order and questionnaire content on faking good or intentional response distortion. It was hypothesized that intentional response distortion would either increase towards the end of a long questionnaire, as learning effects might make it easier to adjust responses to a faking good schema, or decrease because applicants' will to distort responses is reduced if the questionnaire lasts long enough. Furthermore, it was hypothesized that certain types of questionnaire content are especially vulnerable to response distortion. Eighty-four pre-selected pilot applicants filled out a questionnaire consisting of 516 items including items from the NEO five factor inventory (NEO FFI), NEO personality inventory revised (NEO PI-R) and business-focused inventory of personality (BIP). The positions of the items were varied within the applicant sample to test if responses are affected by item order, and applicants' response behaviour was additionally compared to that of volunteers. Applicants reported significantly higher mean scores than volunteers, and results provide some evidence of decreased faking tendencies towards the end of the questionnaire. Furthermore, it could be demonstrated that lower variances or standard deviations in combination with appropriate (often higher) mean scores can serve as an indicator for faking tendencies in group comparisons, even if effects are not significant. © 2013 International Union of Psychological Science.

  1. Communicating Quantitative Literacy: An Examination of Open-Ended Assessment Items in TIMSS, NALS, IALS, and PISA

    Directory of Open Access Journals (Sweden)

    Karl W. Kosko

    2011-07-01

    Full Text Available Quantitative Literacy (QL has been described as the skill set an individual uses when interacting with the world in a quantitative manner. A necessary component of this interaction is communication. To this end, assessments of QL have included open-ended items as a means of including communicative aspects of QL. The present study sought to examine whether such open-ended items typically measured aspects of quantitative communication, as compared to mathematical communication, or mathematical skills. We focused on public-released items and rubrics from four of the most widely referenced assessments: the Third International Mathematics and Science Study (TIMSS-95: the National Adult Literacy Survey (NALS; now the National Assessment of Adult Literacy, NAAL in 1985 and 1992, the International Adult Literacy Skills (IALS beginning in 1994; and the Program for International Student Assessment (PISA beginning in 2000. We found that open-ended item rubrics in these QL assessments showed a strong tendency to assess answer-only responses. Therefore, while some open-ended items may have required certain levels of quantitative reasoning to find a solution, it is the solution rather than the reasoning that was often assessed.

  2. An analysis on the export license criteria for NSG control items in the US and Japan

    International Nuclear Information System (INIS)

    Choi, Young Rok

    1995-06-01

    Korea has taken steps to join the Nuclear Suppliers Group (NSG) which is a major part of the international nuclear export control regime. In this connection, it is an urgent task to build a new Korean nuclear export control system that includes NSG guidelines and control items. In addition, it is necessary to review the developed supplier countries' experience in the field of export control. The main purpose of this study is to analyze how the US and Japan have controlled the items listed in NSG part 1 and 2 guidelines. To this end, various relevant regulations of the US and Japan were studied. Among those regulations, the US Export Administration Regulation, US 10 CFR 110 and 810, and the Japan's Export Administration Order are included. Through the review process, this study identified NSG items which are controlled in the export control systems of the US and Japan. Furthermore, this study summarized and compared the export license criteria that must be satisfied before exporting each NSG item in the two countries. The export license criteria consist of permitted destinations, document requirements, and types of license. The results of this study are expected to contribute to establishing an appropriate Korean nuclear export control system and could be used as references to the practical export licensing policies of the US and Japan. 6 tabs., 13 refs., (Author) .new

  3. An analysis on the export license criteria for NSG control items in the US and Japan

    Energy Technology Data Exchange (ETDEWEB)

    Choi, Young Rok [Korea Atomic Energy Research Institute, Taejon (Korea, Republic of)

    1995-06-01

    Korea has taken steps to join the Nuclear Suppliers Group (NSG) which is a major part of the international nuclear export control regime. In this connection, it is an urgent task to build a new Korean nuclear export control system that includes NSG guidelines and control items. In addition, it is necessary to review the developed supplier countries` experience in the field of export control. The main purpose of this study is to analyze how the US and Japan have controlled the items listed in NSG part 1 and 2 guidelines. To this end, various relevant regulations of the US and Japan were studied. Among those regulations, the US Export Administration Regulation, US 10 CFR 110 and 810, and the Japan`s Export Administration Order are included. Through the review process, this study identified NSG items which are controlled in the export control systems of the US and Japan. Furthermore, this study summarized and compared the export license criteria that must be satisfied before exporting each NSG item in the two countries. The export license criteria consist of permitted destinations, document requirements, and types of license. The results of this study are expected to contribute to establishing an appropriate Korean nuclear export control system and could be used as references to the practical export licensing policies of the US and Japan. 6 tabs., 13 refs., (Author) .new.

  4. The Internal Consistency Reliability of the Katz-Francis Scale of Attitude toward Judaism among Australian Jews

    Directory of Open Access Journals (Sweden)

    Patrick Lumbroso

    2016-09-01

    Full Text Available The Katz-Francis Scale of Attitude toward Judaism was developed initially to extend among the Hebrew-speaking Jewish community in Israel a growing body of international research concerned to map the correlates, antecedents and consequences of individual differences in attitude toward religion as assessed by the Francis Scale of Attitude toward Christianity. The present paper explored the internal consistency reliability and construct validity of the English translation of the Katz-Francis Scale of Attitude toward Judaism among 101 Australian Jews. On the basis of these data, this instrument is commended for application in further research.

  5. Brief Sensation Seeking Scale: Latent structure of 8-item and 4-item versions in Peruvian adolescents.

    Science.gov (United States)

    Merino-Soto, Cesar; Salas Blas, Edwin

    2018-01-01

    This research intended to validate two brief scales of sensations seeking with Peruvian adolescents: the eight item scale (BSSS8; Hoyle, Stephenson, Palmgreen, Lorch, y Donohew, 2002) and the four item scale (BSSS4; Stephenson, Hoyle, Slater, y Palmgreen, 2003). Questionnaires were administered to 618 voluntary participants, with an average age of 13.6 years, from different levels of high school, state and private school in a district in the south of Lima. It analyzed the internal structure of both short versions using three models: a) unidimensional (M1), b) oblique or related dimensions (M2), and c) the bifactor model (M3). Results show that both instruments have a single dimension which best represents the variability of the items; a fact that can be explained both by the complexity of the concept and by the small number of items representing each factor, which is more noticeable in the BSSS4. Reliability is within levels found by previous studies: alpha: .745 = BSSS8 and BSSS4 =. 643; omega coefficient: .747 in BSSS8 and .651 in BSSS4. These are considered suitable for the type of instruments studied. Based on the correlation between the two instruments, it was found that there are satisfactory levels of equivalence between the BSSS8 and BSSS4. However, it is recommended that the BSSS4 is mainly used for research and for the purpose of describing populations.

  6. A Polytomous Item Response Theory Analysis of Social Physique Anxiety Scale

    Science.gov (United States)

    Fletcher, Richard B.; Crocker, Peter

    2014-01-01

    The present study investigated the social physique anxiety scale's factor structure and item properties using confirmatory factor analysis and item response theory. An additional aim was to identify differences in response patterns between groups (gender). A large sample of high school students aged 11-15 years (N = 1,529) consisting of n =…

  7. Exploring the consistency, transparency and portability of dental technology education: benchmarking across Norway, Ireland and Australia.

    Science.gov (United States)

    Myhrer, T; Evans, J L; Haugen, H K; Gorman, C; Kavanagh, Y; Cameron, A B

    2016-08-01

    Dental technology programmes of study must prepare students to practice in a broad range of contemporary workplaces. Currently, there is limited evidence to benchmark dental technology education - locally, nationally or internationally. This research aims to improve consistency, transparency and portability of dental technology qualifications across three countries. Data were accessed from open-source curriculum documents and five calibrated assessment items. Three institutions collaborated with Oslo and Akershus University College, Norway; Trinity College Dublin, Ireland; and Griffith University, Australia. From these, 29-44 students completed 174 assessments. The curricula reflect the community needs of each country and display common themes that underpin professional dental technology practice. Assessment results differed between institutions but no more than a normal distribution. Face-to-face assessment moderation was critical to achieve consistency. This collaborative research has led to the development of a set of guidelines for other dental technology education providers interested in developing or aligning courses internationally to enhance the portability of qualifications. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  8. The Nurses Self-Concept Instrument (NSCI): assessment of psychometric properties for Australian domestic and international student nurses.

    Science.gov (United States)

    Angel, Elizabeth; Craven, Rhonda; Denson, Nida

    2012-07-01

    Professional self-concept is a critical driver of job satisfaction. In Australia, as international nursing enrolments rise, nursing is increasingly characterised by a professional body of international nurses who may differ from domestic Australian nurses in their nursing self-concept. At present, no psychometrically sound instrument for assessing nursing self-concept for Australian domestic and international nursing students is available. The purpose of this study was to: (1) develop an instrument (the Nurses' Self-Concept Instrument (NSCI)) to measure the professional self-concept of domestic and international nursing students in Australia, and (2) test the psychometric properties of this newly developed instrument. A literature review was conducted to generate the initial dimension and item pools to measure nurses' professional self-concept (NSCI). Two stakeholders examined the content and face validity of dimensions and items. Analysis was performed on data collected from 253 undergraduate nursing students in a large public university in Sydney, Australia, and consisted of domestic (n=218) and international (n=35) nursing students. Internal reliability was assessed using Cronbach's Alpha. Confirmatory factor analysis (CFA) was used to assess the construct validity of the NSCI. The resulting NSCI consisted of 14 items across four self-concept domains: care, leadership, staff relations, and knowledge. The CFA supported the hypothesised factor structure of the self-concept model. All reliabilities were acceptable for both domestic and international students (ranging from r=.78 to .93). The NSCI was shown to be a valid and reliable tool for assessing Australian domestic and international student nurses' professional self-concept. This instrument may also enable those responsible for recruitment of students into nursing courses to assess students' professional self-concept and implement appropriate strategies to foster the growth of lifelong career development

  9. Item response theory analysis applied to the Spanish version of the Personal Outcomes Scale.

    Science.gov (United States)

    Guàrdia-Olmos, J; Carbó-Carreté, M; Peró-Cebollero, M; Giné, C

    2017-11-01

    The study of measurements of quality of life (QoL) is one of the great challenges of modern psychology and psychometric approaches. This issue has greater importance when examining QoL in populations that were historically treated on the basis of their deficiency, and recently, the focus has shifted to what each person values and desires in their life, as in cases of people with intellectual disability (ID). Many studies of QoL scales applied in this area have attempted to improve the validity and reliability of their components by incorporating various sources of information to achieve consistency in the data obtained. The adaptation of the Personal Outcomes Scale (POS) in Spanish has shown excellent psychometric attributes, and its administration has three sources of information: self-assessment, practitioner and family. The study of possible congruence or incongruence of observed distributions of each item between sources is therefore essential to ensure a correct interpretation of the measure. The aim of this paper was to analyse the observed distribution of items and dimensions from the three Spanish POS information sources cited earlier, using the item response theory. We studied a sample of 529 people with ID and their respective practitioners and family member, and in each case, we analysed items and factors using Samejima's model of polytomic ordinal scales. The results indicated an important number of items with differential effects regarding sources, and in some cases, they indicated significant differences in the distribution of items, factors and sources of information. As a result of this analysis, we must affirm that the administration of the POS, considering three sources of information, was adequate overall, but a correct interpretation of the results requires that it obtain much more information to consider, as well as some specific items in specific dimensions. The overall ratings, if these comments are considered, could result in bias. © 2017

  10. Item Response Theory Modeling and Categorical Regression Analyses of the Five-Factor Model Rating Form: A Study on Italian Community-Dwelling Adolescent Participants and Adult Participants.

    Science.gov (United States)

    Fossati, Andrea; Widiger, Thomas A; Borroni, Serena; Maffei, Cesare; Somma, Antonella

    2017-06-01

    To extend the evidence on the reliability and construct validity of the Five-Factor Model Rating Form (FFMRF) in its self-report version, two independent samples of Italian participants, which were composed of 510 adolescent high school students and 457 community-dwelling adults, respectively, were administered the FFMRF in its Italian translation. Adolescent participants were also administered the Italian translation of the Borderline Personality Features Scale for Children-11 (BPFSC-11), whereas adult participants were administered the Italian translation of the Triarchic Psychopathy Measure (TriPM). Cronbach α values were consistent with previous findings; in both samples, average interitem r values indicated acceptable internal consistency for all FFMRF scales. A multidimensional graded item response theory model indicated that the majority of FFMRF items had adequate discrimination parameters; information indices supported the reliability of the FFMRF scales. Both categorical (i.e., item-level) and scale-level regression analyses suggested that the FFMRF scores may predict a nonnegligible amount of variance in the BPFSC-11 total score in adolescent participants, and in the TriPM scale scores in adult participants.

  11. Development and evaluation of the Internalized Racism in Asian Americans Scale (IRAAS).

    Science.gov (United States)

    Choi, Andrew Young; Israel, Tania; Maeda, Hotaka

    2017-01-01

    This article presents the development and psychometric evaluation of the Internalized Racism in Asian Americans Scale (IRAAS), which was designed to measure the degree to which Asian Americans internalized hostile attitudes and negative messages targeted toward their racial identity. Items were developed on basis of prior literature, vetted through expert feedback and cognitive interviews, and administered to 655 Asian American participants through Amazon Mechanical Turk. Exploratory factor analysis with a random subsample (n = 324) yielded a psychometrically robust preliminary measurement model consisting of 3 factors: Self-Negativity, Weakness Stereotypes, and Appearance Bias. Confirmatory factor analysis with a separate subsample (n = 331) indicated that the proposed correlated factors model was strongly consistent with the observed data. Factor determinacies were high and demonstrated that the specified items adequately measured their intended factors. Bifactor modeling further indicated that this multidimensionality could be univocally represented for the purpose of measurement, including the use of a mean total score representing a single continuum of internalized racism on which individuals vary. The IRAAS statistically predicted depressive symptoms, and demonstrated statistically significant correlations in theoretically expected directions with four dimensions of collective self-esteem. These results provide initial validity evidence supporting the use of the IRAAS to measure aspects of internalized racism in this population. Limitations and research implications are discussed. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  12. Inconsistency effects in source memory and compensatory schema-consistent guessing.

    Science.gov (United States)

    Küppers, Viviane; Bayen, Ute J

    2014-10-01

    The attention-elaboration hypothesis of memory for schematically unexpected information predicts better source memory for unexpected than expected sources. In three source-monitoring experiments, the authors tested the occurrence of an inconsistency effect in source memory. Participants were presented with items that were schematically either very expected or very unexpected for their source. Multinomial processing tree models were used to separate source memory, item memory, and guessing bias. Results show an inconsistency effect in source memory accompanied by a compensatory schema-consistent guessing bias when expectancy strength is high, that is, when items are very expected or very unexpected for their source.

  13. Development of a Quality and Safety Competency Curriculum for Radiation Oncology Residency: An International Delphi Study

    International Nuclear Information System (INIS)

    Adleman, Jenna; Gillan, Caitlin; Caissie, Amanda; Davis, Carol-Anne; Liszewski, Brian; McNiven, Andrea; Giuliani, Meredith

    2017-01-01

    Purpose: To develop an entry-to-practice quality and safety competency profile for radiation oncology residency. Methods and Materials: A comprehensive list of potential quality and safety competency items was generated from public and professional resources and interprofessional focus groups. Redundant or out-of-scope items were eliminated through investigator consensus. Remaining items were subjected to an international 2-round modified Delphi process involving experts in radiation oncology, radiation therapy, and medical physics. During Round 1, each item was scored independently on a 9-point Likert scale indicating appropriateness for inclusion in the competency profile. Items indistinctly ranked for inclusion or exclusion were re-evaluated through web conference discussion and reranked in Round 2. Results: An initial 1211 items were compiled from 32 international sources and distilled to 105 unique potential quality and safety competency items. Fifteen of the 50 invited experts participated in round 1: 10 radiation oncologists, 4 radiation therapists, and 1 medical physicist from 13 centers in 5 countries. Round 1 rankings resulted in 80 items included, 1 item excluded, and 24 items indeterminate. Two areas emerged more prominently within the latter group: change management and human factors. Web conference with 5 participants resulted in 9 of these 24 items edited for content or clarity. In Round 2, 12 participants rescored all indeterminate items resulting in 10 items ranked for inclusion. The final 90 enabling competency items were organized into thematic groups consisting of 18 key competencies under headings adapted from Deming's System of Profound Knowledge. Conclusions: This quality and safety competency profile may inform minimum training standards for radiation oncology residency programs.

  14. Development of a Quality and Safety Competency Curriculum for Radiation Oncology Residency: An International Delphi Study

    Energy Technology Data Exchange (ETDEWEB)

    Adleman, Jenna [Department of Radiation Oncology, University of Toronto, Toronto, Ontario (Canada); Gillan, Caitlin [Department of Radiation Oncology, University of Toronto, Toronto, Ontario (Canada); Radiation Medicine Program, Princess Margaret Cancer Centre, Toronto, Ontario (Canada); Caissie, Amanda [Department of Radiation Oncology, Dalhousie University, Halifax, Nova Scotia (Canada); Saint John Regional Hospital, Saint John, New Brunswick (Canada); Davis, Carol-Anne [Department of Radiation Oncology, Dalhousie University, Halifax, Nova Scotia (Canada); Nova Scotia Cancer Centre, Halifax, Nova Scotia (Canada); Liszewski, Brian [Odette Cancer Centre, Sunnybrook Health Sciences Centre, Toronto, Ontario (Canada); McNiven, Andrea [Department of Radiation Oncology, University of Toronto, Toronto, Ontario (Canada); Radiation Medicine Program, Princess Margaret Cancer Centre, Toronto, Ontario (Canada); Giuliani, Meredith, E-mail: Meredith.Giuliani@rmp.uhn.ca [Department of Radiation Oncology, University of Toronto, Toronto, Ontario (Canada); Radiation Medicine Program, Princess Margaret Cancer Centre, Toronto, Ontario (Canada)

    2017-06-01

    Purpose: To develop an entry-to-practice quality and safety competency profile for radiation oncology residency. Methods and Materials: A comprehensive list of potential quality and safety competency items was generated from public and professional resources and interprofessional focus groups. Redundant or out-of-scope items were eliminated through investigator consensus. Remaining items were subjected to an international 2-round modified Delphi process involving experts in radiation oncology, radiation therapy, and medical physics. During Round 1, each item was scored independently on a 9-point Likert scale indicating appropriateness for inclusion in the competency profile. Items indistinctly ranked for inclusion or exclusion were re-evaluated through web conference discussion and reranked in Round 2. Results: An initial 1211 items were compiled from 32 international sources and distilled to 105 unique potential quality and safety competency items. Fifteen of the 50 invited experts participated in round 1: 10 radiation oncologists, 4 radiation therapists, and 1 medical physicist from 13 centers in 5 countries. Round 1 rankings resulted in 80 items included, 1 item excluded, and 24 items indeterminate. Two areas emerged more prominently within the latter group: change management and human factors. Web conference with 5 participants resulted in 9 of these 24 items edited for content or clarity. In Round 2, 12 participants rescored all indeterminate items resulting in 10 items ranked for inclusion. The final 90 enabling competency items were organized into thematic groups consisting of 18 key competencies under headings adapted from Deming's System of Profound Knowledge. Conclusions: This quality and safety competency profile may inform minimum training standards for radiation oncology residency programs.

  15. A Bifactor Multidimensional Item Response Theory Model for Differential Item Functioning Analysis on Testlet-Based Items

    Science.gov (United States)

    Fukuhara, Hirotaka; Kamata, Akihito

    2011-01-01

    A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into…

  16. Gender-Based Differential Item Performance in Mathematics Achievement Items.

    Science.gov (United States)

    Doolittle, Allen E.; Cleary, T. Anne

    1987-01-01

    Eight randomly equivalent samples of high school seniors were each given a unique form of the ACT Assessment Mathematics Usage Test (ACTM). Signed measures of differential item performance (DIP) were obtained for each item in the eight ACTM forms. DIP estimates were analyzed and a significant item category effect was found. (Author/LMO)

  17. Evaluation of psychometric properties and differential item functioning of 8-item Child Perceptions Questionnaires using item response theory.

    Science.gov (United States)

    Yau, David T W; Wong, May C M; Lam, K F; McGrath, Colman

    2015-08-19

    Four-factor structure of the two 8-item short forms of Child Perceptions Questionnaire CPQ11-14 (RSF:8 and ISF:8) has been confirmed. However, the sum scores are typically reported in practice as a proxy of Oral health-related Quality of Life (OHRQoL), which implied a unidimensional structure. This study first assessed the unidimensionality of 8-item short forms of CPQ11-14. Item response theory (IRT) was employed to offer an alternative and complementary approach of validation and to overcome the limitations of classical test theory assumptions. A random sample of 649 12-year-old school children in Hong Kong was analyzed. Unidimensionality of the scale was tested by confirmatory factor analysis (CFA), principle component analysis (PCA) and local dependency (LD) statistic. Graded response model was fitted to the data. Contribution of each item to the scale was assessed by item information function (IIF). Reliability of the scale was assessed by test information function (TIF). Differential item functioning (DIF) across gender was identified by Wald test and expected score functions. Both CPQ11-14 RSF:8 and ISF:8 did not deviate much from the unidimensionality assumption. Results from CFA indicated acceptable fit of the one-factor model. PCA indicated that the first principle component explained >30 % of the total variation with high factor loadings for both RSF:8 and ISF:8. Almost all LD statistic items suggesting little contribution of information to the scale and item removal caused little practical impact. Comparing the TIFs, RSF:8 showed slightly better information than ISF:8. In addition to oral symptoms items, the item "Concerned with what other people think" demonstrated a uniform DIF (p Items related to oral symptoms were not informative to OHRQoL and deletion of these items is suggested. The impact of DIF across gender on the overall score was minimal. CPQ11-14 RSF:8 performed slightly better than ISF:8 in measurement precision. The 6-item short forms

  18. Weight bias internalization across weight categories among school-aged children. Validation of the Weight Bias Internalization Scale for Children.

    Science.gov (United States)

    Zuba, Anna; Warschburger, Petra

    2018-06-01

    Anti-fat bias is widespread and is linked to the internalization of weight bias and psychosocial problems. The purpose of this study was to examine the internalization of weight bias among children across weight categories and to evaluate the psychometric properties of the Weight Bias Internalization Scale for Children (WBIS-C). Data were collected from 1484 primary school children and their parents. WBIS-C demonstrated good internal consistency (α = .86) after exclusion of Item 1. The unitary factor structure was supported using exploratory and confirmatory factor analyses (factorial validity). Girls and overweight children reported higher WBIS-C scores in comparison to boys and non-overweight peers (known-groups validity). Convergent validity was shown by significant correlations with psychosocial problems. Internalization of weight bias explained additional variance in different indicators of psychosocial well-being. The results suggest that the WBIS-C is a psychometrically sound and informative tool to assess weight bias internalization among children. Copyright © 2018 Elsevier Ltd. All rights reserved.

  19. Internal consistency and validity of an observational method for assessing disability in mobility in patients with osteoarthritis.

    NARCIS (Netherlands)

    Steultjens, M.P.M.; Dekker, J.; Baar, M.E. van; Oostendorp, R.A.B.; Bijlsma, J.W.J.

    1999-01-01

    Objective: To establish the internal consistency of validity of an observational method for assessing diasbility in mobility in patients with osteoarthritis (OA), Methods: Data were obtained from 198 patients with OA of the hip or knee. Results of the observational method were compared with results

  20. National soft science research task item-organization and implementation

    International Nuclear Information System (INIS)

    Zhang Yiming

    2014-01-01

    International Thermonuclear Experimental Reactor (ITER) project, as the most large-scale science project and research cooperation plan in the human history, has brought together major world-wide scientific and technological achievements in current controlled magnetic confinement fusion research. The project is aiming at validating the scientific and technological feasibility of the peaceful use of fusion energy, laying a science and technology foundation for the realization of the fusion energy commercialization. Promoted by the ITER project, the nuclear fusion frontier science researches and experiments in China have made a deep development, and have made remarkable achievements. Based on this situation, the Fusion Information Division of the Southwestern Institute of Physics (SWIP) has undertaken the soft science research task item -Prediction of Nuclear Fusion Energy Research and Development Technology in China,issued by the Ministry of Science and Technology of China. The research team has gone through these processes such as documentation collection and investigation, documentation reading and refining, outline determination, the first draft writing, content analysis and optimization for the draft, and the internal trial within the research team, review and revise from the experts at SWIP and out of SWIP, evaluation from China International Nuclear Fusion Energy Program Execution Center (ITER China DA), as well as evaluation from the famous experts in domestic fusion community by means of letters and mail. Finally, the research team has completed the research report successfully. In this report, the fusion development strategies of the world's leading fusion research countries and organizations participating in ITER project have been described. Moreover, some comparisons and analysis in this report have been made in order to provide scientific and technological research, analysis base, as well as strategic decision references for exploring medium and long term

  1. Force Concept Inventory-based multiple-choice test for investigating students’ representational consistency

    Directory of Open Access Journals (Sweden)

    Pasi Nieminen

    2010-08-01

    Full Text Available This study investigates students’ ability to interpret multiple representations consistently (i.e., representational consistency in the context of the force concept. For this purpose we developed the Representational Variant of the Force Concept Inventory (R-FCI, which makes use of nine items from the 1995 version of the Force Concept Inventory (FCI. These original FCI items were redesigned using various representations (such as motion map, vectorial and graphical, yielding 27 multiple-choice items concerning four central concepts underpinning the force concept: Newton’s first, second, and third laws, and gravitation. We provide some evidence for the validity and reliability of the R-FCI; this analysis is limited to the student population of one Finnish high school. The students took the R-FCI at the beginning and at the end of their first high school physics course. We found that students’ (n=168 representational consistency (whether scientifically correct or not varied considerably depending on the concept. On average, representational consistency and scientifically correct understanding increased during the instruction, although in the post-test only a few students performed consistently both in terms of representations and scientifically correct understanding. We also compared students’ (n=87 results of the R-FCI and the FCI, and found that they correlated quite well.

  2. Development and analysis of the factor structure of parents' internalized stigma of neurodevelopmental disorder in child scale

    Directory of Open Access Journals (Sweden)

    Ananya Mahapatra

    2017-01-01

    Full Text Available Background: Parents of children suffering from neurodevelopmental disorders, frequently face public stigma which is often internalized and leads to psychological burden. However, there is a lack of data on the perceptions of internalized stigma among parents of children with neurodevelopmental disorders, especially from lower-middle-income countries like India. Aims: This study aims to develop an adapted version of the Internalized Stigma of Mental Illness (ISMI scale for use in parents of children suffering from neurodevelopmental disorders and to explore the factor structure of this instrument through exploratory factor analysis (EFA. Settings and Design: A cross-sectional study was conducted in an outpatient setting in a tertiary care hospital in India. Materials and Methods: A total of 105 parents of children suffering from neurodevelopmental disorders (according to the Diagnostic and Statistical Manual of Mental Disorders Fifth Edition were recruited for the study after screening for psychiatric disorder using Mini International Neuropsychiatric Interview version 6.0. A modified 16-item scale was constructed Parents' Internalized Stigma of Neurodevelopmental Disorder in Child (PISNC scale and applied on 105 parents of children suffering from neurodevelopmental disorders, after translation to Hindi and back-translation, in keeping with the World Health Organization's translation-back-translation methodology. Statistical Analysis: EFA was carried out using principal component analysis with orthogonal (varimax rotation. Internal consistency of the Hindi version of the scale was estimated in the form of Cronbach's alpha. Spearman–Brown coefficient and Guttman split-half coefficient were calculated to evaluate the split-half reliability. Results: The initial factor analysis yielded three-factor models with an eigenvalue of >1 and the total variance explained by these factors was 62.017%. The internal consistency of the 16-item scale was 0

  3. 77 FR 75187 - Certain Food Containers, Cups, Plates, Cutlery, and Related Items and Packaging Thereof...

    Science.gov (United States)

    2012-12-19

    ... INTERNATIONAL TRADE COMMISSION [Investigation No. 337-TA-835] Certain Food Containers, Cups, Plates, Cutlery, and Related Items and Packaging Thereof; Commission Determination Not To Review an... containers, cups, plates, cutlery, and related items and packaging thereof by reason of infringement of U.S...

  4. The emotion regulation questionnaire in women with cancer: A psychometric evaluation and an item response theory analysis.

    Science.gov (United States)

    Brandão, Tânia; Schulz, Marc S; Gross, James J; Matos, Paula Mena

    2017-10-01

    Emotion regulation is thought to play an important role in adaptation to cancer. However, the emotion regulation questionnaire (ERQ), a widely used instrument to assess emotion regulation, has not yet been validated in this context. This study addresses this gap by examining the psychometric properties of the ERQ in a sample of Portuguese women with cancer. The ERQ was administered to 204 women with cancer (mean age = 48.89 years, SD = 7.55). Confirmatory factor analysis and item response theory analysis were used to examine psychometric properties of the ERQ. Confirmatory factor analysis confirmed the 2-factor solution proposed by the original authors (expressive suppression and cognitive reappraisal). This solution was invariant across age and type of cancer. Item response theory analyses showed that all items were moderately to highly discriminant and that items are better suited for identifying moderate levels of expressive suppression and cognitive reappraisal. Support was found for the internal consistency and test-retest reliability of the ERQ. The pattern of relationships with emotional control, alexithymia, emotional self-efficacy, attachment, and quality of life provided evidence of the convergent and concurrent validity for both dimensions of the ERQ. Overall, the ERQ is a psychometrically sound approach for assessing emotion regulation strategies in the oncological context. Clinical implications are discussed. Copyright © 2016 John Wiley & Sons, Ltd.

  5. Reproducibility of the items on the Stroke Specific Quality of Life questionnaire that evaluate the participation component of the International Classification of Functioning, Disability and Health.

    Science.gov (United States)

    Silva, Soraia Micaela; Corrêa, Fernanda Ishida; Faria, Christina Danielli Coelho de Morais; Pereira, Gabriela Santos; Attié, Edna Alves Dos Anjos; Corrêa, João Carlos Ferrari

    2016-12-01

    To evaluate the reproducibility of the Stroke Specific Quality of Life (SS-QOL) items that address the participation component of the International Classification of Functioning, Disability and Health (ICF) and analyse the correlation between the subscore of these 26 items and the total SS-QOL score. Seventy-five stroke survivors participated in this study. Reproducibility was evaluated using the intraclass correlation coefficient (ICC2,1), standard error of measurement (SEM), minimum detectable change (MDC) and the Bland-Altman plot. The correlation between the subscore of the 26 items and the total SS-QOL score was analysed using Spearman's correlation coefficients (rho) and simple linear regression. An alpha risk ≤ 0.05 was considered for all analyses. The SS-QOL items that address the participation component of the ICF demonstrated excellent reliability (intra-rater ICC2,1 = 0.96; inter-rater ICC2,1 = 0.95). The SEM and MDC were adequate. The Bland-Altman plot demonstrated satisfactory agreement. A significant and strong correlation (rho = 0.83) was found between the 26 SS-QOL items that address participation and the total SS-QOL score. Moreover, the evaluation of participation was found to explain 73% of the evaluation of health-related quality of life. The 26 SS-QOL items that address the participation component of the ICF demonstrated adequate reproducibility. Thus, participation, which represents the social aspects of functionality, can be adequately evaluated with these items. Implications for Rehabilitation The 26 Stroke Specific Quality of Life items that address participation proved to be reproducible for the analysis of social participation following a stroke. The findings can lead to a better understanding of the social participation of individuals with chronic hemiparesis and assist in the establishment of adequate treatment for such individuals. The rehabilitation process can be directed towards more specific goals focused on the

  6. Internal consistency and validity of an observational method for assessing disability in mobility in patients with osteoarthritis

    NARCIS (Netherlands)

    Steultjens, M. P.; Dekker, J.; van Baar, M. E.; Oostendorp, R. A.; Bijlsma, J. W.

    1999-01-01

    To establish the internal consistency and validity of an observational method for assessing disability in mobility in patients with osteoarthritis (OA). Data were obtained from 198 patients with OA of the hip or knee. Results of the observational method were compared with results of self-report

  7. Inter-rater and test-retest reliability, internal consistency, and factorial structure of the instrument for forensic treatment evaluation

    NARCIS (Netherlands)

    Schuringa, E.; Spreen, M.; Bogaerts, S.

    2014-01-01

    In this study, the Instrument for Forensic Treatment Evaluation (IFTE) is introduced. The IFTE includes 14 dynamic items of the risk assessment scheme HKT-R and eight items specifically related to the treatment of forensic psychiatric patients. The items are divided over three factors: protective

  8. Gene-Environment Interplay in Internalizing Disorders: Consistent Findings across Six Environmental Risk Factors

    Science.gov (United States)

    Hicks, Brian M.; DiRago, Ana C.; Iacono, William G.; McGue, Matt

    2009-01-01

    Background Newer behavior genetic methods can better elucidate gene-environment (G-E) interplay in the development of internalizing (INT) disorders (i.e., major depression and anxiety disorders). However, no study to date has conducted a comprehensive analysis examining multiple environmental risks with the purpose of delineating how general G-E mechanisms influence the development of INT disorders. Methods The sample consisted of 1315 male and female twin pairs participating in the age 17 assessment of the Minnesota Twin Family Study. Quantitative G-E interplay models were used to examine how genetic and environmental risk for INT disorders changes as a function of environmental context. Multiple measures and informants were employed to construct composite measures of INT disorders and 6 environmental risk factors including: stressful life events, mother-child and father-child relationship problems, antisocial and prosocial peer affiliation, and academic achievement and engagement. Results Significant moderation effects were detected between each environmental risk factor and INT such that in the context of greater environmental adversity, nonshared environmental factors became more important in the etiology of INT symptoms. Conclusion Our results are consistent with the interpretation that environmental stressors have a causative effect on the emergence of INT disorders. The consistency of our results suggests a general mechanism of environmental influence on INT disorders regardless of the specific form of environmental risk. PMID:19594836

  9. The Effect of the Existence of Defective Items in Assembly Operations

    OpenAIRE

    Eben-Chaime , Moshe

    2014-01-01

    Part 1: Knowledge-Based Performance Improvement; International audience; Quality is a principle issue in production management (PM). No process is perfect and the production of defective items is unavoidable. Very few studies regard the effect of the existence of defective items (EEDI) in production processes. Further, quality has been studied in isolation to high extent, of other PM domains. In this study, defect rates together with the assembly ratios of the bill of material are embedded in...

  10. An Application of Cognitive Diagnostic Assessment on TIMMS-2007 8th Grade Mathematics Items

    Science.gov (United States)

    Toker, Turker; Green, Kathy

    2012-01-01

    The least squares distance method (LSDM) was used in a cognitive diagnostic analysis of TIMSS (Trends in International Mathematics and Science Study) items administered to 4,498 8th-grade students from seven geographical regions of Turkey, extending analysis of attributes from content to process and skill attributes. Logit item positions were…

  11. Dissociative effects of orthographic distinctiveness in pure and mixed lists: an item-order account.

    Science.gov (United States)

    McDaniel, Mark A; Cahill, Michael; Bugg, Julie M; Meadow, Nathaniel G

    2011-10-01

    We apply the item-order theory of list composition effects in free recall to the orthographic distinctiveness effect. The item-order account assumes that orthographically distinct items advantage item-specific encoding in both mixed and pure lists, but at the expense of exploiting relational information present in the list. Experiment 1 replicated the typical free recall advantage of orthographically distinct items in mixed lists and the elimination of that advantage in pure lists. Supporting the item-order account, recognition performances indicated that orthographically distinct items received greater item-specific encoding than did orthographically common items in mixed and pure lists (Experiments 1 and 2). Furthermore, order memory (input-output correspondence and sequential contiguity effects) was evident in recall of pure unstructured common lists, but not in recall of unstructured distinct lists (Experiment 1). These combined patterns, although not anticipated by prevailing views, are consistent with an item-order account.

  12. Students' approaches to learning in a clinical practicum: A psychometric evaluation based on item response theory.

    Science.gov (United States)

    Zhao, Yue; Kuan, Hoi Kei; Chung, Joyce O K; Chan, Cecilia K Y; Li, William H C

    2018-07-01

    The investigation of learning approaches in the clinical workplace context has remained an under-researched area. Despite the validation of learning approach instruments and their applications in various clinical contexts, little is known about the extent to which an individual item, that reflects a specific learning strategy and motive, effectively contributes to characterizing students' learning approaches. This study aimed to measure nursing students' approaches to learning in a clinical practicum using the Approaches to Learning at Work Questionnaire (ALWQ). Survey research design was used in the study. A sample of year 3 nursing students (n = 208) who undertook a 6-week clinical practicum course participated in the study. Factor analyses were conducted, followed by an item response theory analysis, including model assumption evaluation (unidimensionality and local independence), item calibration and goodness-of-fit assessment. Two subscales, deep and surface, were derived. Findings suggested that: (a) items measuring the deep motive from intrinsic interest and deep strategies of relating new ideas to similar situations, and that of concept mapping served as the strongest discriminating indicators; (b) the surface strategy of memorizing facts and details without an overall picture exhibited the highest discriminating power among all surface items; and, (c) both subscales appeared to be informative in assessing a broad range of the corresponding latent trait. The 21-item ALWQ derived from this study presented an efficient, internally consistent and precise measure. Findings provided a useful psychometric evaluation of the ALWQ in the clinical practicum context, added evidence to the utility of the ALWQ for nursing education practice and research, and echoed the discussions from previous studies on the role of the contextual factors in influencing student choices of different learning strategies. They provided insights for clinical educators to measure

  13. Hunger enhances consistent economic choices in non-human primates.

    Science.gov (United States)

    Yamada, Hiroshi

    2017-05-24

    Hunger and thirst are fundamental biological processes that drive consumption behavior in humans and non-human animals. While the existing literature in neuroscience suggests that these satiety states change how consumable rewards are represented in the brain, it remains unclear as to how they change animal choice behavior and the underlying economic preferences. Here, I used combined techniques from experimental economics, psychology, and neuroscience to measure food preferences of marmoset monkeys (Callithrix jacchus), a recently developed primate model for neuroscience. Hunger states of animals were manipulated by scheduling feeding intervals, resulting in three different conditions: sated, non-sated, and hungry. During these hunger states, animals performed pairwise choices of food items, which included all possible pairwise combinations of five different food items except for same-food pairs. Results showed that hunger enhanced economic rationality, evident as a decrease of transitivity violations (item A was preferred to item B, and B to C, but C was preferred to A). Further analysis demonstrated that hungry monkeys chose more-preferred items over less-preferred items in a more deterministic manner, while the individual food preferences appeared to remain stable across hunger states. These results suggest that hunger enhances consistent choice behavior and shifts animals towards efficient outcome maximization.

  14. Are reflective models appropriate for very short scales? Proofs of concept of formative models using the Ten-Item Personality Inventory.

    Science.gov (United States)

    Myszkowski, Nils; Storme, Martin; Tavani, Jean-Louis

    2018-04-27

    Because of their length and objective of broad content coverage, very short scales can show limited internal consistency and structural validity. We argue that it is because their objectives may be better aligned with formative investigations than with reflective measurement methods that capitalize on content overlap. As proofs of concept of formative investigations of short scales, we investigate the Ten Item Personality Inventory (TIPI). In Study 1, we administered the TIPI and the Big Five Inventory (BFI) to 938 adults, and fitted a formative Multiple Indicator Multiple Causes model, which consisted of the TIPI items forming 5 latent variables, which in turn predicted the 5 BFI scores. These results were replicated in Study 2, on a sample of 759 adults, with, this time, the Revised NEO Personality Inventory (NEO-PI-R) as the external criterion. The models fit the data adequately, and moderate to strong significant effects (.37<|β|<.69, all p<.001) of all 5 latent formative variables on their corresponding BFI and NEOPI-R scores were observed. This study presents a formative approach that we propose to be more consistent with the aims of scales with broad content and short length like the TIPI. This article is protected by copyright. All rights reserved. © 2018 Wiley Periodicals, Inc.

  15. Soviet Cybernetics: Recent News Items, Number Thirteen.

    Science.gov (United States)

    Holland, Wade B.

    An issue of "Soviet Cybernetics: Recent News Items" consists of English translations of the leading recent Soviet contributions to the study of cybernetics. Articles deal with cybernetics in the 21st Century; the Soviet State Committee on Science and Technology; economic reforms in Rudnev's ministry; an interview with Rudnev; Dnepr-2; Dnepr-2…

  16. Examination of the Brief Fear of Negative Evaluation Scale-Version 2 and the Brief Fear of Negative Evaluation Scale-Straightforward Items Factor Structure in a Sample of U.S. College Students

    Science.gov (United States)

    Liu, Liu; Lowe, Patricia A.

    2016-01-01

    The current study examined the factor structure of the Brief Fear of Negative Evaluation-Straightforward Items (BFNE-S) and the Brief Fear of Negative Evaluation-Version 2 (BFNE-II) among 151 college students from the United States. Results indicated that the BFNE-S and the BFNE-II scores demonstrated excellent internal consistency reliability.…

  17. Working memory for sequences of temporal durations reveals a volatile single-item store

    Directory of Open Access Journals (Sweden)

    Sanjay G Manohar

    2016-10-01

    Full Text Available When a sequence is held in working memory, different items are retained with differing fidelity. Here we ask whether a sequence of brief time intervals that must be remembered show recency effects, similar to those observed in verbal and visuospatial working memory. It has been suggested that prioritising some items over others can be accounted for by a focus of attention, maintaining some items in a privileged state. We therefore also investigated whether such benefits are vulnerable to disruption by attention or expectation. Participants listened to sequences of one to five tones, of varying durations (200ms to 2s. Subsequently, the length of one of the tones in the sequence had to be reproduced by holding a key. The discrepancy between the reproduced and actual durations quantified the fidelity of memory for auditory durations. Recall precision decreased with the number of items that had to be remembered, and was better for the first and last items of sequences, in line with set-size and serial position effects seen in other modalities. To test whether attentional filtering demands might impair performance, an irrelevant variation in pitch was introduced in some blocks of trials. In those blocks, memory precision was worse for sequences that consisted of only one item, i.e. the smallest memory set size. Thus, when irrelevant information was present, the benefit of having only one item in memory is attenuated. Finally we examined whether expectation could interfere with memory. On half the trials, the number of items in the upcoming sequence was cued. When the number of items was known in advance, performance was paradoxically worse when the sequence consisted of only one item. Thus the benefit of having only one item to remember is stronger when it is unexpectedly the only item. Our results suggest that similar mechanisms are used to hold auditory time durations in working memory, as for visual or verbal stimuli. Further, solitary items were

  18. Suspect/Counterfeit Items Information Guide for Subcontractors/Suppliers

    Energy Technology Data Exchange (ETDEWEB)

    Tessmar, Nancy D. [Los Alamos National Laboratory; Salazar, Michael J. [Los Alamos National Laboratory

    2012-09-18

    Counterfeiting of industrial and commercial grade items is an international problem that places worker safety, program objectives, expensive equipment, and security at risk. In order to prevent the introduction of Suspect/Counterfeit Items (S/CI), this information sheet is being made available as a guide to assist in the implementation of S/CI awareness and controls, in conjunction with subcontractor's/supplier's quality assurance programs. When it comes to counterfeit goods, including industrial materials, items, and equipment, no market is immune. Some manufactures have been known to misrepresent their products and intentionally use inferior materials and processes to manufacture substandard items, whose properties can significantly cart from established standards and specifications. These substandard items termed by the Department of Energy (DOE) as S/CI, pose immediate and potential threats to the safety of DOE and contractor workers, the public, and the environment. Failure of certain systems and processes caused by an S/CI could also have national security implications at Los Alamos National Laboratory (LANL). Nuclear Safety Rules (federal Laws), DOE Orders, and other regulations set forth requirements for DOE contractors to implement effective controls to assure that items and services meet specified requirements. This includes techniques to implement and thereby minimizing the potential threat of entry of S/CI to LANL. As a qualified supplier of goods or services to the LANL, your company will be required to establish and maintain effective controls to prevent the introduction of S/CI to LANL. This will require that your company warrant that all items (including their subassemblies, components, and parts) sold to LANL are genuine (i.e. not counterfeit), new, and unused, and conform to the requirements of the LANL purchase orders/contracts unless otherwise approved in writing to the Los Alamos National Security (LANS) contract administrator

  19. Universal Authenticated Item Monitoring System (AIMS) second generation equipment

    International Nuclear Information System (INIS)

    Schoeneman, J.L.; Baumann, M.J.; Fox, L.J.; Jenkins, C.D.; Perlinsk, A.W.

    1992-01-01

    Sandia National Laboratories (SNL) is in the final stages of developing a Universal Authenticated Item Monitoring System (AIMS). When completed, AIMS will provide applicable agencies in the US government, and those in the International arena, with a secure and convenient method of monitoring the physical status of selected items. The benefit derived from this development activity will be the commercial availability of an item monitoring system with the capability for ''quick set-up'' monitoring, as well as long-term unattended monitoring. The AIMS includes a variety of sensors, a robust and authenticated radio frequency (RF) communication link, a Receiver Processing Unit (RPU), and an inspector-friendly personal computer (PC) interface for collecting, sorting, viewing and archiving pertinent event histories. The system will provide the capability to monitor selected items in a real-time mode, a remotely interrogated mode, and a stand-alone, unattended data collection mode. The sensor suite under development includes advanced motion sensors, interior volumetric intrusion sensors, Re-usable, In-situ Verifiable Authenticated (RIVA) fiber-optic seal sensors, generic utility sensors (to accommodate contact closure inputs), and radiation and environmental sensors. A new generation authentication algorithm recently has been developed that provides a high degree of system security 121. The AIMS has potential safeguards applications in the areas of arms control and treaty verification military asset control, International Atomic Energy Agency (IAEA) and Euratom safeguards verification activities, as well as domestic nuclear safeguard activities. Commercial applications could include high-value inventory control and security systems. This paper describes the second-generation AIMS along with its recently expanded sensor suite and enhanced data collection capabilities

  20. 77 FR 14423 - Certain Food Containers, Cups, Plates, Cutlery, and Related Items, and Packaging Thereof; Notice...

    Science.gov (United States)

    2012-03-09

    ... INTERNATIONAL TRADE COMMISSION [DN 2883] Certain Food Containers, Cups, Plates, Cutlery, and... Containers, Cups, Plates, Cutlery, and Related Items, and Packaging Thereof, DN 2883; the Commission is... importation of certain food containers, cups, plates, cutlery, and related items, and packaging thereof. The...

  1. Internal consistency, concurrent validity, and discriminant validity of a measure of public support for policies for active living in transportation (PAL-T) in a population-based sample of adults.

    Science.gov (United States)

    Fuller, Daniel; Gauvin, Lise; Fournier, Michel; Kestens, Yan; Daniel, Mark; Morency, Patrick; Drouin, Louis

    2012-04-01

    Active living is a broad conceptualization of physical activity that incorporates domains of exercise; recreational, household, and occupational activities; and active transportation. Policy makers develop and implement a variety of transportation policies that can influence choices about how to travel from one location to another. In making such decisions, policy makers act in part in response to public opinion or support for proposed policies. Measures of the public's support for policies aimed at promoting active transportation can inform researchers and policy makers. This study examined the internal consistency, and concurrent and discriminant validity of a newly developed measure of the public's support for policies for active living in transportation (PAL-T). A series of 17 items representing potential policies for promoting active transportation was generated. Two samples of participants (n = 2,001 and n = 2,502) from Montreal, Canada, were recruited via random digit dialling. Analyses were conducted on the combined data set (n = 4,503). Participants were aged 18 through 94 years (58% female). The concurrent and discriminant validity of the PAL-T was assessed by examining relationships with physical activity and smoking. To explore the usability of the PAL-T, predicted scale scores were compared to the summed values of responses. Results showed that the internal consistency of the PAL-T was 0.70. Multilevel regression demonstrated no relationship between the PAL-T and smoking status (p > 0.05) but significant relationships with utilitarian walking (p public opinion can inform policy makers and support advocacy efforts aimed at making built environments more suitable for active transportation while allowing researchers to examine the antecedents and consequences of public support for policies.

  2. A Feedback Control Strategy for Enhancing Item Selection Efficiency in Computerized Adaptive Testing

    Science.gov (United States)

    Weissman, Alexander

    2006-01-01

    A computerized adaptive test (CAT) may be modeled as a closed-loop system, where item selection is influenced by trait level ([theta]) estimation and vice versa. When discrepancies exist between an examinee's estimated and true [theta] levels, nonoptimal item selection is a likely result. Nevertheless, examinee response behavior consistent with…

  3. Australian Biology Test Item Bank, Years 11 and 12. Volume II: Year 12.

    Science.gov (United States)

    Brown, David W., Ed.; Sewell, Jeffrey J., Ed.

    This document consists of test items which are applicable to biology courses throughout Australia (irrespective of course materials used); assess key concepts within course statement (for both core and optional studies); assess a wide range of cognitive processes; and are relevant to current biological concepts. These items are arranged under…

  4. Australian Biology Test Item Bank, Years 11 and 12. Volume I: Year 11.

    Science.gov (United States)

    Brown, David W., Ed.; Sewell, Jeffrey J., Ed.

    This document consists of test items which are applicable to biology courses throughout Australia (irrespective of course materials used); assess key concepts within course statement (for both core and optional studies); assess a wide range of cognitive processes; and are relevant to current biological concepts. These items are arranged under…

  5. Fashionista: A Fashion-aware Graphical System for Exploring Visually Similar Items

    OpenAIRE

    He, Ruining; Lin, Chunbin; McAuley, Julian

    2016-01-01

    To build a fashion recommendation system, we need to help users retrieve fashionable items that are visually similar to a particular query, for reasons ranging from searching alternatives (i.e., substitutes), to generating stylish outfits that are visually consistent, among other applications. In domains like clothing and accessories, such considerations are particularly paramount as the visual appearance of items is a critical feature that guides users' decisions. However, existing systems l...

  6. Developing an African youth psychosocial assessment: an application of item response theory.

    Science.gov (United States)

    Betancourt, Theresa S; Yang, Frances; Bolton, Paul; Normand, Sharon-Lise

    2014-06-01

    This study aimed to refine a dimensional scale for measuring psychosocial adjustment in African youth using item response theory (IRT). A 60-item scale derived from qualitative data was administered to 667 war-affected adolescents (55% female). Exploratory factor analysis (EFA) determined the dimensionality of items based on goodness-of-fit indices. Items with loadings less than 0.4 were dropped. Confirmatory factor analysis (CFA) was used to confirm the scale's dimensionality found under the EFA. Item discrimination and difficulty were estimated using a graded response model for each subscale using weighted least squares means and variances. Predictive validity was examined through correlations between IRT scores (θ) for each subscale and ratings of functional impairment. All models were assessed using goodness-of-fit and comparative fit indices. Fisher's Information curves examined item precision at different underlying ranges of each trait. Original scale items were optimized and reconfigured into an empirically-robust 41-item scale, the African Youth Psychosocial Assessment (AYPA). Refined subscales assess internalizing and externalizing problems, prosocial attitudes/behaviors and somatic complaints without medical cause. The AYPA is a refined dimensional assessment of emotional and behavioral problems in African youth with good psychometric properties. Validation studies in other cultures are recommended. Copyright © 2014 John Wiley & Sons, Ltd.

  7. Development history of crimp shear tools. Item D11 Decommissioning Group

    International Nuclear Information System (INIS)

    Farmer, A.K.

    1988-02-01

    This report gives information on the continuing development of a range of crimp/shear tools aimed at separating items of radioactive plant, eg, gloveboxes, from a variety of service pipelines which could also be internally contaminated. (author)

  8. 26 CFR 48.4216(a)-3 - Other items relating to tax on sale price.

    Science.gov (United States)

    2010-04-01

    ... 26 Internal Revenue 16 2010-04-01 2010-04-01 true Other items relating to tax on sale price. 48.4216(a)-3 Section 48.4216(a)-3 Internal Revenue INTERNAL REVENUE SERVICE, DEPARTMENT OF THE TREASURY... reason of the failure of the article under a warranty as to its quality or service, and a new article is...

  9. A preliminary psychometric evaluation of the eight-item cognitive load scale.

    Science.gov (United States)

    Pignatiello, Grant A; Tsivitse, Emily; Hickman, Ronald L

    2018-04-01

    The aim of this article is to report the psychometric properties of the eight-item cognitive load scale. According to cognitive load theory, the formatting and delivery of healthcare education influences the degree to which patients and/or family members can engage their working memory systems for learning. However, despite its relevance, cognitive load has not yet been evaluated among surrogate decision makers exposed to electronic decision support for healthcare decisions. To date, no psychometric analyses of instruments evaluating cognitive load have been reported within healthcare settings. A convenience sample of 62 surrogate decision makers for critically ill patients were exposed to one of two healthcare decision support interventions were recruited from four intensive care units at a tertiary medical center in Northeast Ohio. Participants were administered a battery of psychosocial instruments and the eight-item cognitive load scale (CLS). The CLS demonstrated a bidimensional factor structure with acceptable discriminant validity and internal consistency reliability (Cronbach's α = 0.75 and 0.89). The CLS is a psychometrically sound instrument that may be used in the evaluation of decision support among surrogate decision makers of the critically ill. The authors recommend application of the cognitive load scale in the evaluation and development of healthcare education and interventions. Copyright © 2018 Elsevier Inc. All rights reserved.

  10. Subjective caregiver burden: validity of the 10-item short version of the Burden Scale for Family Caregivers BSFC-s.

    Science.gov (United States)

    Graessel, Elmar; Berth, Hendrik; Lichte, Thomas; Grau, Hannes

    2014-02-20

    Subjective burden is a central variable describing the situation encountered by family caregivers. The 10-item short version of the Burden Scale for Family Caregivers (BSFC-short/BSFC-s) was developed to provide an economical measure of this variable. The present study examined the reliability and validity of the BSFC-s. Comprehensive data from "the IDA project" were the basis of the calculations, which included 351 dyads and examined medical data on people with dementia, interview data from their family caregivers, and health insurance data. A factor analysis was performed to explore the structure of the BSFC-s; Cronbach's alpha was used to evaluate the internal consistency of the scale. The items were analyzed to determine the item difficulty and the discriminatory power. Construct validity was tested with five hypotheses. To establish the predictive validity of the BSFC-s, predictors of institutionalization at a follow-up time of 2.5 years were analyzed (binary logistic regression). The BSFC-s score adhered to a one-factor structure. Cronbach's alpha for the complete scale was .92. A significant increase in the BSFC-s score was observed when dementia progressed, disturbing behavior occurred more frequently, care requirements increased, and when caregivers were diagnosed with depression. Caregiver burden was the second strongest predictor of institutionalization out of a total of four significant predictors. All hypotheses that referred to the construct validity were supported. The BSFC-short with its ten items is a very economical instrument for assessing the caregiver's total subjective burden in a short time frame. The BSFC-s score has predictive validity for the institutionalization of people with dementia. Therefore it is an appropriate outcome measure to evaluate caregiver interventions. The scale is available for free in 20 languages (http://www.caregiver-burden.eu). This availability facilitates the comparison of international research findings.

  11. Preferred Reporting Items for Systematic Review and Meta-Analyses of individual participant data: the PRISMA-IPD Statement.

    Science.gov (United States)

    Stewart, Lesley A; Clarke, Mike; Rovers, Maroeska; Riley, Richard D; Simmonds, Mark; Stewart, Gavin; Tierney, Jayne F

    2015-04-28

    Systematic reviews and meta-analyses of individual participant data (IPD) aim to collect, check, and reanalyze individual-level data from all studies addressing a particular research question and are therefore considered a gold standard approach to evidence synthesis. They are likely to be used with increasing frequency as current initiatives to share clinical trial data gain momentum and may be particularly important in reviewing controversial therapeutic areas. To develop PRISMA-IPD as a stand-alone extension to the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) Statement, tailored to the specific requirements of reporting systematic reviews and meta-analyses of IPD. Although developed primarily for reviews of randomized trials, many items will apply in other contexts, including reviews of diagnosis and prognosis. Development of PRISMA-IPD followed the EQUATOR Network framework guidance and used the existing standard PRISMA Statement as a starting point to draft additional relevant material. A web-based survey informed discussion at an international workshop that included researchers, clinicians, methodologists experienced in conducting systematic reviews and meta-analyses of IPD, and journal editors. The statement was drafted and iterative refinements were made by the project, advisory, and development groups. The PRISMA-IPD Development Group reached agreement on the PRISMA-IPD checklist and flow diagram by consensus. Compared with standard PRISMA, the PRISMA-IPD checklist includes 3 new items that address (1) methods of checking the integrity of the IPD (such as pattern of randomization, data consistency, baseline imbalance, and missing data), (2) reporting any important issues that emerge, and (3) exploring variation (such as whether certain types of individual benefit more from the intervention than others). A further additional item was created by reorganization of standard PRISMA items relating to interpreting results. Wording

  12. Evaluation of item candidates for a diabetic retinopathy quality of life item bank.

    Science.gov (United States)

    Fenwick, Eva K; Pesudovs, Konrad; Khadka, Jyoti; Rees, Gwyn; Wong, Tien Y; Lamoureux, Ecosse L

    2013-09-01

    We are developing an item bank assessing the impact of diabetic retinopathy (DR) on quality of life (QoL) using a rigorous multi-staged process combining qualitative and quantitative methods. We describe here the first two qualitative phases: content development and item evaluation. After a comprehensive literature review, items were generated from four sources: (1) 34 previously validated patient-reported outcome measures; (2) five published qualitative articles; (3) eight focus groups and 18 semi-structured interviews with 57 DR patients; and (4) seven semi-structured interviews with diabetes or ophthalmic experts. Items were then evaluated during 3 stages, namely binning (grouping) and winnowing (reduction) based on key criteria and panel consensus; development of item stems and response options; and pre-testing of items via cognitive interviews with patients. The content development phase yielded 1,165 unique items across 7 QoL domains. After 3 sessions of binning and winnowing, items were reduced to a minimally representative set (n = 312) across 9 domains of QoL: visual symptoms; ocular surface symptoms; activity limitation; mobility; emotional; health concerns; social; convenience; and economic. After 8 cognitive interviews, 42 items were amended resulting in a final set of 314 items. We have employed a systematic approach to develop items for a DR-specific QoL item bank. The psychometric properties of the nine QoL subscales will be assessed using Rasch analysis. The resulting validated item bank will allow clinicians and researchers to better understand the QoL impact of DR and DR therapies from the patient's perspective.

  13. International standards for monoclonal antibodies to support pre- and post-marketing product consistency: Evaluation of a candidate international standard for the bioactivities of rituximab.

    Science.gov (United States)

    Prior, Sandra; Hufton, Simon E; Fox, Bernard; Dougall, Thomas; Rigsby, Peter; Bristow, Adrian

    2018-01-01

    The intrinsic complexity and heterogeneity of therapeutic monoclonal antibodies is built into the biosimilarity paradigm where critical quality attributes are controlled in exhaustive comparability studies with the reference medicinal product. The long-term success of biosimilars will depend on reassuring healthcare professionals and patients of consistent product quality, safety and efficacy. With this aim, the World Health Organization has endorsed the need for public bioactivity standards for therapeutic monoclonal antibodies in support of current controls. We have developed a candidate international potency standard for rituximab that was evaluated in a multi-center collaborative study using participants' own qualified Fc-effector function and cell-based binding bioassays. Dose-response curve model parameters were shown to reflect similar behavior amongst rituximab preparations, albeit with some differences in potency. In the absence of a common reference standard, potency estimates were in poor agreement amongst laboratories, but the use of the candidate preparation significantly reduced this variability. Our results suggest that the candidate rituximab standard can support bioassay performance and improve data harmonization, which when implemented will promote consistency of rituximab products over their life-cycles. This data provides the first scientific evidence that a classical standardization exercise allowing traceability of bioassay data to an international standard is also applicable to rituximab. However, we submit that this new type of international standard needs to be used appropriately and its role not to be mistaken with that of the reference medicinal product.

  14. The role of attention in item-item binding in visual working memory.

    Science.gov (United States)

    Peterson, Dwight J; Naveh-Benjamin, Moshe

    2017-09-01

    An important yet unresolved question regarding visual working memory (VWM) relates to whether or not binding processes within VWM require additional attentional resources compared with processing solely the individual components comprising these bindings. Previous findings indicate that binding of surface features (e.g., colored shapes) within VWM is not demanding of resources beyond what is required for single features. However, it is possible that other types of binding, such as the binding of complex, distinct items (e.g., faces and scenes), in VWM may require additional resources. In 3 experiments, we examined VWM item-item binding performance under no load, articulatory suppression, and backward counting using a modified change detection task. Binding performance declined to a greater extent than single-item performance under higher compared with lower levels of concurrent load. The findings from each of these experiments indicate that processing item-item bindings within VWM requires a greater amount of attentional resources compared with single items. These findings also highlight an important distinction between the role of attention in item-item binding within VWM and previous studies of long-term memory (LTM) where declines in single-item and binding test performance are similar under divided attention. The current findings provide novel evidence that the specific type of binding is an important determining factor regarding whether or not VWM binding processes require attention. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  15. Introduction: U.S. Homophile Internationalism.

    Science.gov (United States)

    Stein, Marc

    2017-01-01

    This article introduces "U.S. Homophile Internationalism," a special issue of the Journal of Homosexuality. The introduction provides a broad overview of the "U.S. Homophile Internationalism" archive and exhibit, which was published on the Outhistory Web site in 2015. The archive and exhibit consists of more than 800 U.S. homophile magazine articles, letters, and other items that referenced non-U.S. regions of the world from 1953 to 1964. The essays in the special issue focus on (1) Africa; (2) Asia and the Pacific; (3) Canada; (4) Latin America and the Caribbean; (5) the Middle East; and (6) Russia, the Soviet Union, and Eastern Europe. There is also an article that addresses the public history and digital humanities dimensions of the project. The introduction concludes by discussing the essays' common goals, themes, and concerns.

  16. Towards consistent and reliable Dutch and international energy statistics for the chemical industry

    International Nuclear Information System (INIS)

    Neelis, M.L.; Pouwelse, J.W.

    2008-01-01

    Consistent and reliable energy statistics are of vital importance for proper monitoring of energy-efficiency policies. In recent studies, irregularities have been reported in the Dutch energy statistics for the chemical industry. We studied in depth the company data that form the basis of the energy statistics in the Netherlands between 1995 and 2004 to find causes for these irregularities. We discovered that chemical products have occasionally been included, resulting in statistics with an inconsistent system boundary. Lack of guidance in the survey for the complex energy conversions in the chemical industry in the survey also resulted in large fluctuations for certain energy commodities. The findings of our analysis have been the basis for a new survey that has been used since 2007. We demonstrate that the annual questionnaire used for the international energy statistics can result in comparable problems as observed in the Netherlands. We suggest to include chemical residual gas as energy commodity in the questionnaire and to include the energy conversions in the chemical industry in the international energy statistics. In addition, we think the questionnaire should be explicit about the treatment of basic chemical products produced at refineries and in the petrochemical industry to avoid system boundary problems

  17. Factor Structure, Internal Consistency, and Screening Sensitivity of the GARS-2 in a Developmental Disabilities Sample

    OpenAIRE

    Martin A. Volker; Elissa H. Dua; Christopher Lopata; Marcus L. Thomeer; Jennifer A. Toomey; Audrey M. Smerbeck; Jonathan D. Rodgers; Joshua R. Popkin; Andrew T. Nelson; Gloria K. Lee

    2016-01-01

    The Gilliam Autism Rating Scale-Second Edition (GARS-2) is a widely used screening instrument that assists in the identification and diagnosis of autism. The purpose of this study was to examine the factor structure, internal consistency, and screening sensitivity of the GARS-2 using ratings from special education teaching staff for a sample of 240 individuals with autism or other significant developmental disabilities. Exploratory factor analysis yielded a correlated three-factor solution si...

  18. The Internalized Homophobia Scale for Vietnamese Sexual Minority Women: Conceptualization, Factor Structure, Reliability, and Associations With Hypothesized Correlates.

    Science.gov (United States)

    Nguyen, Trang Quynh; Poteat, Tonia; Bandeen-Roche, Karen; German, Danielle; Nguyen, Yen Hai; Vu, Loan Kieu-Chau; Nguyen, Nam Thi-Thu; Knowlton, Amy R

    2016-08-01

    We developed the first Vietnamese Internalized Homophobia (IH) scale for use with Vietnamese sexual minority women (SMW). Drawing from existing IH scales in the international literature and based on prior qualitative research about SMW in the Viet Nam context, the scale covers two domains: self-stigma (negative attitudes toward oneself as a sexual minority person) and sexual prejudice (negative attitudes toward homosexuality/same-sex relations in general). Scale items, including items borrowed from existing scales and items based on local expressions, were reviewed and confirmed by members of the target population. Quantitative evaluation used data from an anonymous web-based survey of Vietnamese SMW, including those who identified as lesbian (n = 1187), or as bisexual (n = 641) and those who were unsure about their sexual identity (n = 353). The scale was found to consist of two highly correlated factors reflecting self-stigma (not normal/wholesome and self-reproach and wishing away same-sex sexuality) and one factor reflecting sexual prejudice, and to have excellent internal consistency. Construct validity was evidenced by subscale associations with a wide range of hypothesized correlates, including perceived sexual stigma, outness, social support, connection to other SMW, relationship quality, psychological well-being, anticipation of heterosexual marriage, and endorsement of same-sex marriage legalization. Self-stigma was more strongly associated with psychosocial correlates, and sexual prejudice was more associated with endorsement of legal same-sex marriage. The variations in these associations across the hypothesized correlates and across sexual identity groups were consistent with the minority stress model and the IH literature, and exhibited context-specific features, which are discussed.

  19. Evaluating the quality of medical multiple-choice items created with automated processes.

    Science.gov (United States)

    Gierl, Mark J; Lai, Hollis

    2013-07-01

    distractors than the traditional method, medical experts cannot consistently distinguish AIG items from traditionally developed items in a blind review. © 2013 John Wiley & Sons Ltd.

  20. Exploring differential item functioning (DIF) with the Rasch model: a comparison of gender differences on eighth grade science items in the United States and Spain.

    Science.gov (United States)

    Babiar, Tasha Calvert

    2011-01-01

    Traditionally, women and minorities have not been fully represented in science and engineering. Numerous studies have attributed these differences to gaps in science achievement as measured by various standardized tests. Rather than describe mean group differences in science achievement across multiple cultures, this study focused on an in-depth item-level analysis across two countries: Spain and the United States. This study investigated eighth-grade gender differences on science items across the two countries. A secondary purpose of the study was to explore the nature of gender differences using the many-faceted Rasch Model as a way to estimate gender DIF. A secondary analysis of data from the Third International Mathematics and Science Study (TIMSS) was used to address three questions: 1) Does gender DIF in science achievement exist? 2) Is there a relationship between gender DIF and characteristics of the science items? 3) Do the relationships between item characteristics and gender DIF in science items replicate across countries. Participants included 7,087 eight grade students from the United States and 3,855 students from Spain who participated in TIMSS. The Facets program (Linacre and Wright, 1992) was used to estimate gender DIF. The results of the analysis indicate that the content of the item seemed to be related to gender DIF. The analysis also suggests that there is a relationship between gender DIF and item format. No pattern of gender DIF related to cognitive demand was found. The general pattern of gender DIF was similar across the two countries used in the analysis. The strength of item-level analysis as opposed to group mean difference analysis is that gender differences can be detected at the item level, even when no mean differences can be detected at the group level.

  1. Psychometric evaluation of the Patient-Reported Outcomes Measurement Information System (PROMIS) Nicotine Dependence Item Bank for use with electronic cigarettes.

    Science.gov (United States)

    Morean, Meghan; Krishnan-Sarin, Suchitra; Sussman, Steve; Foulds, Jonathan; Fishbein, Howard; Grana, Rachel; O'Malley, Stephanie S

    2018-01-02

    Psychometrically sound measures of e-cigarette dependence are lacking. We modified the PROMIS Nicotine Dependence Item Banks for use with e-cigarettes and evaluated the psychometrics of the 22-, 8- and 4-item adapted versions. 1009 adults who reported using e-cigarettes at least weekly completed an anonymous survey in Summer 2016 (50.2% male, 77.1% White, mean age 35.81 [10.71], 66.4% daily e-cigarette users, 72.6% current cigarette smokers). Psychometric analyses included confirmatory factor analysis, internal consistency, measurement invariance, examination of mean-level differences, convergent validity, and test-criterion relationships with e-cigarette use outcomes. All PROMIS-E versions had confirmable, internally consistent latent structures that were scalar invariant by sex, race, e-cigarette use (non-daily/daily), e-liquid nicotine content (no/yes), and current cigarette smoking status (no/yes). Daily e-cigarette users, nicotine e-liquid users, and cigarette smokers reported being more dependent on e-cigarettes than their counterparts. All PROMIS-E versions correlated strongly with one another, evidenced convergent validity with the Penn State E-cigarette Dependence Index and time to first e-cigarette use in the morning, and evidenced test-criterion relationships with vaping frequency, e-liquid nicotine concentration, and e-cigarette quit attempts. Similar results were observed when analyses were conducted within subsamples of exclusive e-cigarette users and duals-users of cigarettes and e-cigarettes. Each PROMIS-E version evidenced strong psychometric properties for assessing e-cigarette dependence in adults who either use e-cigarette exclusively or who are dual-users of cigarettes and e-cigarettes. However, results indicated little benefit of the longer versions over the 4-item PROMIS-E, which provides an efficient assessment of e-cigarette dependence. The availability of the novel, psychometrically sound PROMIS-E can further research on a wide range of

  2. Personalized recommendation based on unbiased consistence

    Science.gov (United States)

    Zhu, Xuzhen; Tian, Hui; Zhang, Ping; Hu, Zheng; Zhou, Tao

    2015-08-01

    Recently, in physical dynamics, mass-diffusion-based recommendation algorithms on bipartite network provide an efficient solution by automatically pushing possible relevant items to users according to their past preferences. However, traditional mass-diffusion-based algorithms just focus on unidirectional mass diffusion from objects having been collected to those which should be recommended, resulting in a biased causal similarity estimation and not-so-good performance. In this letter, we argue that in many cases, a user's interests are stable, and thus bidirectional mass diffusion abilities, no matter originated from objects having been collected or from those which should be recommended, should be consistently powerful, showing unbiased consistence. We further propose a consistence-based mass diffusion algorithm via bidirectional diffusion against biased causality, outperforming the state-of-the-art recommendation algorithms in disparate real data sets, including Netflix, MovieLens, Amazon and Rate Your Music.

  3. International migration to and from the United Kingdom, 1975-1999: consistency, change and implications for the labour market.

    Science.gov (United States)

    Dobson, J; McLaughlan, G

    2001-01-01

    This article presents some findings of a recent study carried out for the Home Office by the Migration Research Unit (MRU) in the Department of Geography at UCL. The study was concerned with patterns and trends in international migration to and from the United Kingdom since 1975, with a particular focus on those in employment, and drew on many sources. The statistics analysed here derive from the International Passenger Survey, including hitherto unpublished tables provided by the Office for National Statistics on migration of the employed by citizenship. They indicate remarkable consistency in some aspects of migration flows and major change in others.

  4. Review on 18th Revision of 'Notice on Export and Import of Strategic Items'

    International Nuclear Information System (INIS)

    Jeon, Jihye; Lee, Chansuh

    2014-01-01

    Nuclear Suppliers Group (NSG) has established a guideline and continued to revise it in accordance with ever-changing international situation and developing technology. The Part 1 of guideline, 'Guidelines of Nuclear Transfers' covers the Trigger List items which triggers safeguards as a condition of supply. Currently NSG has published the 12 th revised guideline (INFCIRC/254/Rev.12/Part1) in November 2013. Korean government fully reflected the guideline to its national legislation to implement in accordance with internationally agreed standard. The export control of nuclear strategic items in Korea is responsibility of Nuclear Safety and Security Commission (NSSC), which entrusted the technical review of the work to Korea Institute of Nonproliferation and Control (KINAC). The specific guidelines for the technical review are stipulated in Notice on Export and Import of Strategic Items with other strategic items usable to other Weapons of Mass Destruction. The Ministry of Trade, Industry and Energy approved the 18 th revision of Notice on Export and Import of Strategic Items on 31 January 2014 as Notice no. 2014-15, which strictly follows the NSG guideline. The 18 th revision of the notice reflects the final proposals agreed from the last Dedicated Meeting of Technical Experts (DMTE) of NSG's Consultative Group (CG) in April 2013. The 3-year-DMTE offered the 'fundamental, holistic approach to the technical review' within the international framework of NSG, rather than sporadic endeavors by individual states in the past. The 18 th version itself has meaning in that the final products of the international technical review were reflected in the Korean national legislation of nuclear export control. It addressed various changes in control text in technical, contextual, and editorial aspects. The revision is analyzed herein concentrating only on technical and semantic changes in control text

  5. Item response theory analysis of the Pain Self-Efficacy Questionnaire.

    Science.gov (United States)

    Costa, Daniel S J; Asghari, Ali; Nicholas, Michael K

    2017-01-01

    The Pain Self-Efficacy Questionnaire (PSEQ) is a 10-item instrument designed to assess the extent to which a person in pain believes s/he is able to accomplish various activities despite their pain. There is strong evidence for the validity and reliability of both the full-length PSEQ and a 2-item version. The purpose of this study is to further examine the properties of the PSEQ using an item response theory (IRT) approach. We used the two-parameter graded response model to examine the category probability curves, and location and discrimination parameters of the 10 PSEQ items. In item response theory, responses to a set of items are assumed to be probabilistically determined by a latent (unobserved) variable. In the graded-response model specifically, item response threshold (the value of the latent variable for which adjacent response categories are equally likely) and discrimination parameters are estimated for each item. Participants were 1511 mixed, chronic pain patients attending for initial assessment at a tertiary pain management centre. All items except item 7 ('I can cope with my pain without medication') performed well in IRT analysis, and the category probability curves suggested that participants used the 7-point response scale consistently. Items 6 ('I can still do many of the things I enjoy doing, such as hobbies or leisure activity, despite pain'), 8 ('I can still accomplish most of my goals in life, despite the pain') and 9 ('I can live a normal lifestyle, despite the pain') captured higher levels of the latent variable with greater precision. The results from this IRT analysis add to the body of evidence based on classical test theory illustrating the strong psychometric properties of the PSEQ. Despite the relatively poor performance of Item 7, its clinical utility warrants its retention in the questionnaire. The strong psychometric properties of the PSEQ support its use as an effective tool for assessing self-efficacy in people with pain

  6. Felder-Soloman's Index of Learning Styles: internal consistency, temporal stability, and factor structure.

    Science.gov (United States)

    Hosford, Charles C; Siders, William A

    2010-10-01

    Strategies to facilitate learning include using knowledge of students' learning style preferences to inform students and their teachers. Aims of this study were to evaluate the factor structure, internal consistency, and temporal stability of medical student responses to the Index of Learning Styles (ILS) and determine its appropriateness as an instrument for medical education. The ILS assesses preferences on four dimensions: sensing/intuitive information perceiving, visual/verbal information receiving, active/reflective information processing, and sequential/global information understanding. Students entering the 2002-2007 classes completed the ILS; some completed the ILS again after 2 and 4 years. Analyses of responses supported the ILS's intended structure and moderate reliability. Students had moderate preferences for sensing and visual learning. This study provides evidence supporting the appropriateness of the ILS for assessing learning style preferences in medical students.

  7. The role of interactive control systems in obtaining internal consistency in the management control system package

    DEFF Research Database (Denmark)

    Toldbod, Thomas; Israelsen, Poul

    2014-01-01

    Companies rely on multiple Management Control Systems to obtain their short and long term objectives. When applying a multifaceted perspective on Management Control System the concept of internal consistency has been found to be important in obtaining goal congruency in the company. However, to d...... management is aware of this shortcoming they use the cybernetic controls more interactively to overcome this shortcoming, whereby the cybernetic controls are also used as a learning platform and not just for performance control....

  8. Validation and psychometric properties of the Somatic and Psychological HEalth REport (SPHERE) in a young Australian-based population sample using non-parametric item response theory.

    Science.gov (United States)

    Couvy-Duchesne, Baptiste; Davenport, Tracey A; Martin, Nicholas G; Wright, Margaret J; Hickie, Ian B

    2017-08-01

    The Somatic and Psychological HEalth REport (SPHERE) is a 34-item self-report questionnaire that assesses symptoms of mental distress and persistent fatigue. As it was developed as a screening instrument for use mainly in primary care-based clinical settings, its validity and psychometric properties have not been studied extensively in population-based samples. We used non-parametric Item Response Theory to assess scale validity and item properties of the SPHERE-34 scales, collected through four waves of the Brisbane Longitudinal Twin Study (N = 1707, mean age = 12, 51% females; N = 1273, mean age = 14, 50% females; N = 1513, mean age = 16, 54% females, N = 1263, mean age = 18, 56% females). We estimated the heritability of the new scores, their genetic correlation, and their predictive ability in a sub-sample (N = 1993) who completed the Composite International Diagnostic Interview. After excluding items most responsible for noise, sex or wave bias, the SPHERE-34 questionnaire was reduced to 21 items (SPHERE-21), comprising a 14-item scale for anxiety-depression and a 10-item scale for chronic fatigue (3 items overlapping). These new scores showed high internal consistency (alpha > 0.78), moderate three months reliability (ICC = 0.47-0.58) and item scalability (Hi > 0.23), and were positively correlated (phenotypic correlations r = 0.57-0.70; rG = 0.77-1.00). Heritability estimates ranged from 0.27 to 0.51. In addition, both scores were associated with later DSM-IV diagnoses of MDD, social anxiety and alcohol dependence (OR in 1.23-1.47). Finally, a post-hoc comparison showed that several psychometric properties of the SPHERE-21 were similar to those of the Beck Depression Inventory. The scales of SPHERE-21 measure valid and comparable constructs across sex and age groups (from 9 to 28 years). SPHERE-21 scores are heritable, genetically correlated and show good predictive ability of mental health in an Australian-based population

  9. A multi-level differential item functioning analysis of trends in international mathematics and science study: Potential sources of gender and minority difference among U.S. eighth graders' science achievement

    Science.gov (United States)

    Qian, Xiaoyu

    Science is an area where a large achievement gap has been observed between White and minority, and between male and female students. The science minority gap has continued as indicated by the National Assessment of Educational Progress and the Trends in International Mathematics and Science Studies (TIMSS). TIMSS also shows a gender gap favoring males emerging at the eighth grade. Both gaps continue to be wider in the number of doctoral degrees and full professorships awarded (NSF, 2008). The current study investigated both minority and gender achievement gaps in science utilizing a multi-level differential item functioning (DIF) methodology (Kamata, 2001) within fully Bayesian framework. All dichotomously coded items from TIMSS 2007 science assessment at eighth grade were analyzed. Both gender DIF and minority DIF were studied. Multi-level models were employed to identify DIF items and sources of DIF at both student and teacher levels. The study found that several student variables were potential sources of achievement gaps. It was also found that gender DIF favoring male students was more noticeable in the content areas of physics and earth science than biology and chemistry. In terms of item type, the majority of these gender DIF items were multiple choice than constructed response items. Female students also performed less well on items requiring visual-spatial ability. Minority students performed significantly worse on physics and earth science items as well. A higher percentage of minority DIF items in earth science and biology were constructed response than multiple choice items, indicating that literacy may be the cause of minority DIF. Three-level model results suggested that some teacher variables may be the cause of DIF variations from teacher to teacher. It is essential for both middle school science teachers and science educators to find instructional methods that work more effectively to improve science achievement of both female and minority students

  10. Reliability, factor analysis and internal consistency calculation of the Insomnia Severity Index (ISI) in French and in English among Lebanese adolescents.

    Science.gov (United States)

    Chahoud, M; Chahine, R; Salameh, P; Sauleau, E A

    2017-06-01

    Our goal is to validate and to verify the reliability of the French and English versions of the Insomnia Severity Index (ISI) in Lebanese adolescents. A cross-sectional study was implemented. 104 Lebanese students aged between 14 and 19 years participated in the study. The English version of the questionnaire was distributed to English-speaking students and the French version was administered to French-speaking students. A scale (1 to 7 with 1 = very well understood and 7 = not at all) was used to identify the level of the students' understanding of each instruction, question and answer of the ISI. The scale's structural validity was assessed. The factor structure of ISI was evaluated by principal component analysis. The internal consistency of this scale was evaluated by Cronbach's alpha. To assess test-retest reliability the intraclass correlation coefficient (ICC) was used. The principal component analysis confirmed the presence of a two-component factor structure in the English version and a three-component factor structure in the French version with eigenvalues > 1. The English version of the ISI had an excellent internal consistency (α = 0.90), while the French version had a good internal consistency (α = 0.70). The ICC presented an excellent agreement in the French version (ICC = 0.914, CI = 0.856-0.949) and a good agreement in the English one (ICC = 0.762, CI = 0.481-890). The Bland-Altman plots of the two versions of the ISI showed that the responses over two weeks' were comparable and very few outliers were detected. The results of our analyses reveal that both English and French versions of the ISI scale have good internal consistency and are reproducible and reliable. Therefore, it can be used to assess the prevalence of insomnia in Lebanese adolescents.

  11. Assessment of mastication in healthy children and children with cerebral palsy: a validity and consistency study.

    Science.gov (United States)

    Remijn, L; Speyer, R; Groen, B E; Holtus, P C M; van Limbeek, J; Nijhuis-van der Sanden, M W G

    2013-05-01

    The aim of this study was to develop the Mastication Observation and Evaluation instrument for observing and assessing the chewing ability of children eating solid and lumpy foods. This study describes the process of item definition and item selection and reports the content validity, reproducibility and consistency of the instrument. In the developmental phase, 15 experienced speech therapists assessed item relevance and descriptions over three Delphi rounds. Potential items were selected based on the results from a literature review. At the initial Delphi round, 17 potential items were included. After three Delphi rounds, 14 items that regarded as providing distinctive value in assessment of mastication (consensus >75%) were included in the Mastication Observation and Evaluation instrument. To test item reproducibility and consistency, two experts and five students evaluated video recordings of 20 children (10 children with cerebral palsy aged 29-65 months and 10 healthy children aged 11-42 months) eating bread and a biscuit. Reproducibility was estimated by means of the intraclass correlation coefficient (ICC). With the exception of one item concerning chewing duration, all items showed good to excellent intra-observer agreement (ICC students: 0.73-1.0). With the exception of chewing duration and number of swallows, inter-observer agreement was fair to excellent for all items (ICC experts: 0.68-1.0 and ICC students: 0.42-1.0). Results indicate that this tool is a feasible instrument and could be used in clinical practice after further research is completed on the reliability of the tool. © 2013 Blackwell Publishing Ltd.

  12. Differential item functioning magnitude and impact measures from item response theory models.

    Science.gov (United States)

    Kleinman, Marjorie; Teresi, Jeanne A

    2016-01-01

    Measures of magnitude and impact of differential item functioning (DIF) at the item and scale level, respectively are presented and reviewed in this paper. Most measures are based on item response theory models. Magnitude refers to item level effect sizes, whereas impact refers to differences between groups at the scale score level. Reviewed are magnitude measures based on group differences in the expected item scores and impact measures based on differences in the expected scale scores. The similarities among these indices are demonstrated. Various software packages are described that provide magnitude and impact measures, and new software presented that computes all of the available statistics conveniently in one program with explanations of their relationships to one another.

  13. ITEM LEVEL DIAGNOSTICS AND MODEL - DATA FIT IN ITEM ...

    African Journals Online (AJOL)

    Global Journal

    Item response theory (IRT) is a framework for modeling and analyzing item response ... data. Though, there is an argument that the evaluation of fit in IRT modeling has been ... National Council on Measurement in Education ... model data fit should be based on three types of ... prediction should be assessed through the.

  14. Cross-cultural validity of the Spanish version of PHQ-9 among pregnant Peruvian women: a Rasch item response theory analysis.

    Science.gov (United States)

    Zhong, Qiuyue; Gelaye, Bizu; Fann, Jesse R; Sanchez, Sixto E; Williams, Michelle A

    2014-04-01

    We sought to evaluate the validity of the Spanish language version of the patient health questionnaire-9 (PHQ-9) depression scale in a large sample of pregnant Peruvian women using Rasch item response theory (IRT) approaches. We further sought to examine the appropriateness of the response formats, reliability and potential differential item functioning (DIF) by maternal age, educational attainment and employment status. This cross-sectional study was conducted among 1520 pregnant women in Lima, Peru. A structured interview was used to collect information on demographic characteristics and PHQ-9 items. Data from the PHQ-9 were fitted to the Rasch IRT model and tested for appropriate category ordering, the assumptions of unidimensionality and local independence, item fit, reliability and presence of DIF. The Spanish language version of PHQ-9 demonstrated unidimensionality, local independence, and acceptable fit for the Rasch IRT model. However, we detected disordered response categories for the original four response categories. After collapsing "more than half the days" and "nearly every day", the response categories ordered properly and the PHQ-9 fit the Rasch IRT model. The PHQ-9 had moderate internal consistency (person separation index, PSI=0.72). Additionally, the items of PHQ-9 were free of DIF with regard to age, educational attainment, and employment status. The Spanish language version of the PHQ-9 was shown to have item properties of an effective screening instrument. Collapsing rating scale categories and reconstructing three-point Likert scale for all items improved the fit of the instrument. Future studies are warranted to establish new cutoff scores and criterion validity of the three-point Likert scale response options for the Spanish language version of the PHQ-9. Copyright © 2014 Elsevier B.V. All rights reserved.

  15. Psychometric Properties of a 36-Item Version of the “Stress Management Competency Indicator Tool”

    Directory of Open Access Journals (Sweden)

    Stefano Toderi

    2016-11-01

    Full Text Available The development of supervisors’ behaviours has been proposed as an innovative approach for the reduction of employees’ work stress. The UK Health and Safety Executive (HSE developed the “Stress Management Competency Indicator Tool” (SMCIT, designed to be used within a learning and development intervention. However, its psychometric properties have never been evaluated, and the length of the questionnaire (66 items limits its practical applicability. We developed a brief 36-item version of the questionnaire, assessed its psychometric properties and studied the relationship with the employees’ psychosocial work environment. 353 employees filled in the brief SMCIT and the “Stress Management Indicator Tool”. The latter is a self-report questionnaire developed by the UK HSE, measuring workers’ perceptions of seven dimensions of the psychosocial work environment that if not properly managed can lead to harm. Data were analysed with structural equation modelling and multiple regressions. The results confirmed the factorial structure of the brief SMCIT questionnaire and mainly supported the convergent validity and internal consistency of the scales. Furthermore, with few exceptions, the relations hypothesized between supervisors’ competencies and the psychosocial work environment were confirmed, supporting the criterion validity of the revised questionnaire and the UK HSE framework. We conclude that the brief 36-item version of the SMCIT represents an important step toward the development of interventions directed at supervisors and we discuss the practical implications for work stress prevention.

  16. Force Concept Inventory-Based Multiple-Choice Test for Investigating Students' Representational Consistency

    Science.gov (United States)

    Nieminen, Pasi; Savinainen, Antti; Viiri, Jouni

    2010-01-01

    This study investigates students' ability to interpret multiple representations consistently (i.e., representational consistency) in the context of the force concept. For this purpose we developed the Representational Variant of the Force Concept Inventory (R-FCI), which makes use of nine items from the 1995 version of the Force Concept Inventory…

  17. MMPI-2 Item Endorsements in Dissociative Identity Disorder vs. Simulators.

    Science.gov (United States)

    Brand, Bethany L; Chasson, Gregory S; Palermo, Cori A; Donato, Frank M; Rhodes, Kyle P; Voorhees, Emily F

    2016-03-01

    Elevated scores on some MMPI-2 (Minnesota Multiphasic Inventory-2) validity scales are common among patients with dissociative identity disorder (DID), which raises questions about the validity of their responses. Such patients show elevated scores on atypical answers (F), F-psychopathology (Fp), atypical answers in the second half of the test (FB), schizophrenia (Sc), and depression (D) scales, with Fp showing the greatest utility in distinguishing them from coached and uncoached DID simulators. In the current study, we investigated the items on the MMPI-2 F, Fp, FB, Sc, and D scales that were most and least commonly endorsed by participants with DID in our 2014 study and compared these responses with those of coached and uncoached DID simulators. The comparisons revealed that patients with DID most frequently endorsed items related to dissociation, trauma, depression, fearfulness, conflict within family, and self-destructiveness. The coached group more successfully imitated item endorsements of the DID group than did the uncoached group. However, both simulating groups, especially the uncoached group, frequently endorsed items that were uncommonly endorsed by the DID group. The uncoached group endorsed items consistent with popular media portrayals of people with DID being violent, delusional, and unlawful. These results suggest that item endorsement patterns can provide useful information to clinicians making determinations about whether an individual is presenting with DID or feigning. © 2016 American Academy of Psychiatry and the Law.

  18. A Non-Parametric Item Response Theory Evaluation of the CAGE Instrument Among Older Adults.

    Science.gov (United States)

    Abdin, Edimansyah; Sagayadevan, Vathsala; Vaingankar, Janhavi Ajit; Picco, Louisa; Chong, Siow Ann; Subramaniam, Mythily

    2018-02-23

    The validity of the CAGE using item response theory (IRT) has not yet been examined in older adult population. This study aims to investigate the psychometric properties of the CAGE using both non-parametric and parametric IRT models, assess whether there is any differential item functioning (DIF) by age, gender and ethnicity and examine the measurement precision at the cut-off scores. We used data from the Well-being of the Singapore Elderly study to conduct Mokken scaling analysis (MSA), dichotomous Rasch and 2-parameter logistic IRT models. The measurement precision at the cut-off scores were evaluated using classification accuracy (CA) and classification consistency (CC). The MSA showed the overall scalability H index was 0.459, indicating a medium performing instrument. All items were found to be homogenous, measuring the same construct and able to discriminate well between respondents with high levels of the construct and the ones with lower levels. The item discrimination ranged from 1.07 to 6.73 while the item difficulty ranged from 0.33 to 2.80. Significant DIF was found for 2-item across ethnic group. More than 90% (CC and CA ranged from 92.5% to 94.3%) of the respondents were consistently and accurately classified by the CAGE cut-off scores of 2 and 3. The current study provides new evidence on the validity of the CAGE from the IRT perspective. This study provides valuable information of each item in the assessment of the overall severity of alcohol problem and the precision of the cut-off scores in older adult population.

  19. The utility of single-item readiness screeners in middle school.

    Science.gov (United States)

    Lewis, Crystal G; Herman, Keith C; Huang, Francis L; Stormont, Melissa; Grossman, Caroline; Eddy, Colleen; Reinke, Wendy M

    2017-10-01

    This study examined the benefit of utilizing one-item academic and one-item behavior readiness teacher-rated screeners at the beginning of the school year to predict end-of-school year outcomes for middle school students. The Middle School Academic and Behavior Readiness (M-ABR) screeners were developed to provide an efficient and effective way to assess readiness in students. Participants included 889 students in 62 middle school classrooms in an urban Missouri school district. Concurrent validity with the M-ABR items and other indicators of readiness in the fall were evaluated using Pearson product-moment correlation coefficients, with the academic readiness item having medium to strong correlations with other baseline academic indicators (r=±0.56 to 0.91) and the behavior readiness item having low to strong correlations with baseline behavior items (r=±0.20 to 0.79). Next, the predictive validity of the M-ABR items was analyzed with hierarchical linear regressions using end-of-year outcomes as the dependent variable. The academic and behavior readiness items demonstrated adequate validity for all outcomes with moderate effects (β=±0.31 to 0.73 for academic outcomes and β=±0.24 to 0.59 for behavioral outcomes) after controlling for baseline demographics. Even after controlling for baseline scores, the M-ABR items predicted unique variance in almost all outcome variables. Four conditional probability indices were calculated to obtain an optimal cut score, to determine ready vs. not ready, for both single-item M-ABR scales. The cut point of "fair" yielded the most acceptable values for the indices. The odd ratios (OR) of experiencing negative outcomes given a "fair" or lower readiness rating (2 or below on the M-ABR screeners) at the beginning of the year were significant and strong for all outcomes (OR=2.29 to OR=14.46), except for internalizing problems. These findings suggest promise for using single readiness items to screen for varying negative end

  20. Decree of the State Office for Nuclear Safety No. 147/1997 of 17 June 1997 specifying lists of selected nuclear-related items and dual-use nuclear-related items

    International Nuclear Information System (INIS)

    1997-01-01

    The core of the Decree consists of 2 lists, viz (a) the List of Selected Items (selected nuclear-related materials, equipment and technologies) which are subject to control regimes during imports, exports and transit; and (b) the List of Dual-Use Items (dual-use nuclear-related materials, equipment and technologies) which are subject to control regimes during imports and exports. Both Lists are based on the IAEA document INFCIRC/254/Rev.2/Part 2/Mod.1. (P.A.)

  1. The Internalized Homophobia Scale for Vietnamese Sexual Minority Women (IHVN-W): Conceptualization, factor structure, reliability, and associations with hypothesized correlates

    Science.gov (United States)

    Nguyen, Trang Quynh; Poteat, Tonia; Bandeen-Roche, Karen; German, Danielle; Nguyen, Yen Hai; Vu, Loan Kieu-Chau; Nguyen, Nam Thi-Thu; Knowlton, Amy R.

    2016-01-01

    We developed the first Vietnamese internalized homophobia (IH) scale, for use with Vietnamese sexual minority women (SMW). Drawing from existing IH scales in the international literature and based on prior qualitative research about SMW in the Viet Nam context, the scale covers two domains: self-stigma (negative attitudes toward oneself as a sexual minority person) and sexual prejudice (negative attitudes toward homosexuality/same-sex relations in general). Scale items, including items borrowed from existing scales and items based on local expressions, were reviewed and confirmed by members of the target population. Quantitative evaluation used data from an anonymous web-based survey of Vietnamese SMW, including those who identified as lesbian (n=1187), or as bisexual (n=641) and those who were unsure about their sexual identity (n=353). The scale was found to consist of two highly correlated factors reflecting self-stigma (not normal/wholesome and self-reproach and wishing away same-sex sexuality) and one factor reflecting sexual prejudice, and to have excellent internal consistency. Construct validity was evidenced by subscales’ associations with a wide range of hypothesized correlates including perceived sexual stigma, outness, social support, connection to other SMW, relationship quality, psychological well-being, anticipation of heterosexual marriage and endorsement of same-sex marriage legalization. Self-stigma was more strongly associated with psychosocial correlates and sexual prejudice was more associated with endorsement of legal same-sex marriage. The variations in these associations across the hypothesized correlates and across sexual identity groups were consistent with the Minority Stress Model and the IH literature, and exhibited context-specific features, which are discussed. PMID:27007469

  2. Screening for major and minor depression in a multiethnic sample of Asian primary care patients: a comparison of the nine-item Patient Health Questionnaire (PHQ-9) and the 16-item Quick Inventory of Depressive Symptomatology - Self-Report (QIDS-SR16 ).

    Science.gov (United States)

    Sung, Sharon Cohan; Low, Charity Cheng Hong; Fung, Daniel Shuen Sheng; Chan, Yiong Huak

    2013-12-01

    Depression is common, disabling, and the single most important factor leading to suicide, yet it is underdiagnosed in busy primary care settings. A key challenge facing primary care clinicians in Asia is the selection of instruments to facilitate depression screening. Although the nine-item Patient Health Questionnaire (PHQ-9) and 16-item Quick Inventory of Depressive Symptomatology - Self-Report (QIDS-SR16 ) are used internationally, they have not been directly compared or widely validated in Asian primary care populations. This study aimed to validate the PHQ-9 and QIDS-SR16 against a structured interview diagnosis of Diagnostic and Statistical Manual, 4th Edition, depression based on the Mini-International Neuropsychiatric Interview in a multiethnic Asian sample. From April through August 2011, we enrolled 400 English-speaking Singaporean primary care patients. Participants completed a demographic data form, the PHQ-9, and the QIDS-SR16 . They were assessed independently for major and minor depression using the Mini-International Neuropsychiatric Interview. Sensitivity and specificity for diagnosing major depression were 91.7% and 72.2%, respectively, for the PHQ-9 (optimal cutoff score of 6), and 83.3% and 84.7%, respectively, for the QIDS-SR16 (optimal cutoff score of 9). The QIDS-SR16 also detected minor depression at an optimal cutoff score of 7, with a sensitivity of 94.4% and specificity of 77.9%. The PHQ-9 and QIDS-SR16 showed good internal consistency (Cronbach's α: 0.87 and 0.79, respectively) and good convergent validity (correlation coefficient: r = 0.73, P depressive disorders was 9%. The PHQ-9 and QIDS-SR16 appear to be valid and reliable for depression screening in Asian primary care settings. Copyright © 2013 Wiley Publishing Asia Pty Ltd.

  3. Validation of a 4-item Negative Symptom Assessment (NSA-4): a short, practical clinical tool for the assessment of negative symptoms in schizophrenia.

    Science.gov (United States)

    Alphs, Larry; Morlock, Robert; Coon, Cheryl; Cazorla, Pilar; Szegedi, Armin; Panagides, John

    2011-06-01

    The 16-item Negative Symptom Assessment (NSA-16) scale is a validated tool for evaluating negative symptoms of schizophrenia. The psychometric properties and predictive power of a four-item version (NSA-4) were compared with the NSA-16. Baseline data from 561 patients with predominant negative symptoms of schizophrenia who participated in two identically designed clinical trials were evaluated. Ordered logistic regression analysis of ratings using NSA-4 and NSA-16 were compared with ratings using several other standard tools to determine predictive validity and construct validity. Internal consistency and test--retest reliability were also analyzed. NSA-16 and NSA-4 scores were both predictive of scores on the NSA global rating (odds ratio = 0.83-0.86) and the Clinical Global Impressions--Severity scale (odds ratio = 0.91-0.93). NSA-16 and NSA-4 showed high correlation with each other (Pearson r = 0.85), similar high correlation with other measures of negative symptoms (demonstrating convergent validity), and lesser correlations with measures of other forms of psychopathology (demonstrating divergent validity). NSA-16 and NSA-4 both showed acceptable internal consistency (Cronbach α, 0.85 and 0.64, respectively) and test--retest reliability (intraclass correlation coefficient, 0.87 and 0.82). This study demonstrates that NSA-4 offers accuracy comparable to the NSA-16 in rating negative symptoms in patients with schizophrenia. Copyright © 2011 John Wiley & Sons, Ltd.

  4. A note on "An economic order quantity (EOQ for items with imperfect quality and inspection errors"

    Directory of Open Access Journals (Sweden)

    Lie-Fern Hsu

    2012-08-01

    Full Text Available In a previously published paper by Khan et al. (2011 [Khan, M., Jaber, M.Y., & Bonney M. (2011. An economic order quantity (EOQ for items with imperfect quality and inspection errors, International Journal of Production Economics, 133, 113-118], we found that there is a contradiction between the cycle length and the holding cost per cycle. To obtain the cycle length, the authors assumed that the returned items from the market were replaced with good quality items. However, for the holding cost per cycle, the authors implicitly assumed that the returned items were not replaced by good quality items. In this note, we first point out the contradiction. Then we fix this flaw and develop a corrected EOQ.

  5. The Effects of Goal Relevance and Perceptual Features on Emotional Items and Associative Memory.

    Science.gov (United States)

    Mao, Wei B; An, Shu; Yang, Xiao F

    2017-01-01

    Showing an emotional item in a neutral background scene often leads to enhanced memory for the emotional item and impaired associative memory for background details. Meanwhile, both top-down goal relevance and bottom-up perceptual features played important roles in memory binding. We conducted two experiments and aimed to further examine the effects of goal relevance and perceptual features on emotional items and associative memory. By manipulating goal relevance (asking participants to categorize only each item image as living or non-living or to categorize each whole composite picture consisted of item image and background scene as natural scene or manufactured scene) and perceptual features (controlling visual contrast and visual familiarity) in two experiments, we found that both high goal relevance and salient perceptual features (high salience of items vs. high familiarity of items) could promote emotional item memory, but they had different effects on associative memory for emotional items and neutral backgrounds. Specifically, high goal relevance and high perceptual-salience of items could jointly impair the associative memory for emotional items and neutral backgrounds, while the effect of item familiarity on associative memory for emotional items would be modulated by goal relevance. High familiarity of items could increase associative memory for negative items and neutral backgrounds only in the low goal relevance condition. These findings suggest the effect of emotion on associative memory is not only related to attentional capture elicited by emotion, but also can be affected by goal relevance and perceptual features of stimulus.

  6. Science Literacy: How do High School Students Solve PISA Test Items?

    Science.gov (United States)

    Wati, F.; Sinaga, P.; Priyandoko, D.

    2017-09-01

    The Programme for International Students Assessment (PISA) does assess students’ science literacy in a real-life contexts and wide variety of situation. Therefore, the results do not provide adequate information for the teacher to excavate students’ science literacy because the range of materials taught at schools depends on the curriculum used. This study aims to investigate the way how junior high school students in Indonesia solve PISA test items. Data was collected by using PISA test items in greenhouse unit employed to 36 students of 9th grade. Students’ answer was analyzed qualitatively for each item based on competence tested in the problem. The way how students answer the problem exhibits their ability in particular competence which is influenced by a number of factors. Those are students’ unfamiliarity with test construction, low performance on reading, low in connecting available information and question, and limitation on expressing their ideas effectively and easy-read. As the effort, selected PISA test items can be used in accordance teaching topic taught to familiarize students with science literacy.

  7. Evolution of a Test Item

    Science.gov (United States)

    Spaan, Mary

    2007-01-01

    This article follows the development of test items (see "Language Assessment Quarterly", Volume 3 Issue 1, pp. 71-79 for the article "Test and Item Specifications Development"), beginning with a review of test and item specifications, then proceeding to writing and editing of items, pretesting and analysis, and finally selection of an item for a…

  8. Effect of individual thinking styles on item selection during study time allocation.

    Science.gov (United States)

    Jia, Xiaoyu; Li, Weijian; Cao, Liren; Li, Ping; Shi, Meiling; Wang, Jingjing; Cao, Wei; Li, Xinyu

    2018-04-01

    The influence of individual differences on learners' study time allocation has been emphasised in recent studies; however, little is known about the role of individual thinking styles (analytical versus intuitive). In the present study, we explored the influence of individual thinking styles on learners' application of agenda-based and habitual processes when selecting the first item during a study-time allocation task. A 3-item cognitive reflection test (CRT) was used to determine individuals' degree of cognitive reliance on intuitive versus analytical cognitive processing. Significant correlations between CRT scores and the choices of first item selection were observed in both Experiment 1a (study time was 5 seconds per triplet) and Experiment 1b (study time was 20 seconds per triplet). Furthermore, analytical decision makers constructed a value-based agenda (prioritised high-reward items), whereas intuitive decision makers relied more upon habitual responding (selected items from the leftmost of the array). The findings of Experiment 1a were replicated in Experiment 2 notwithstanding ruling out the possible effects from individual intelligence and working memory capacity. Overall, the individual thinking style plays an important role on learners' study time allocation and the predictive ability of CRT is reliable in learners' item selection strategy. © 2016 International Union of Psychological Science.

  9. Internal Consistency of General Outcome Measures in Grades 1-8. Technical Report # 0915

    Science.gov (United States)

    Anderson, Daniel; Tindal, Gerald; Alonzo, Julie

    2009-01-01

    We developed alternate forms of a math test for use in both screening students at risk of failure and monitoring their progress over time. In this technical report, we present results of the screener, used in the fall of 2009. The 48-item test was aligned to the National Council of Teachers of Mathematics (NCTM) Curriculum Focal Point Standards…

  10. Review on 18{sup th} Revision of 'Notice on Export and Import of Strategic Items'

    Energy Technology Data Exchange (ETDEWEB)

    Jeon, Jihye; Lee, Chansuh [Korea Institute of Nuclear Nonproliferation and Control, Daejeon (Korea, Republic of)

    2014-05-15

    Nuclear Suppliers Group (NSG) has established a guideline and continued to revise it in accordance with ever-changing international situation and developing technology. The Part 1 of guideline, 'Guidelines of Nuclear Transfers' covers the Trigger List items which triggers safeguards as a condition of supply. Currently NSG has published the 12{sup th} revised guideline (INFCIRC/254/Rev.12/Part1) in November 2013. Korean government fully reflected the guideline to its national legislation to implement in accordance with internationally agreed standard. The export control of nuclear strategic items in Korea is responsibility of Nuclear Safety and Security Commission (NSSC), which entrusted the technical review of the work to Korea Institute of Nonproliferation and Control (KINAC). The specific guidelines for the technical review are stipulated in Notice on Export and Import of Strategic Items with other strategic items usable to other Weapons of Mass Destruction. The Ministry of Trade, Industry and Energy approved the 18{sup th} revision of Notice on Export and Import of Strategic Items on 31 January 2014 as Notice no. 2014-15, which strictly follows the NSG guideline. The 18{sup th} revision of the notice reflects the final proposals agreed from the last Dedicated Meeting of Technical Experts (DMTE) of NSG's Consultative Group (CG) in April 2013. The 3-year-DMTE offered the 'fundamental, holistic approach to the technical review' within the international framework of NSG, rather than sporadic endeavors by individual states in the past. The 18{sup th} version itself has meaning in that the final products of the international technical review were reflected in the Korean national legislation of nuclear export control. It addressed various changes in control text in technical, contextual, and editorial aspects. The revision is analyzed herein concentrating only on technical and semantic changes in control text.

  11. Piecewise Polynomial Fitting with Trend Item Removal and Its Application in a Cab Vibration Test

    Directory of Open Access Journals (Sweden)

    Wu Ren

    2018-01-01

    Full Text Available The trend item of a long-term vibration signal is difficult to remove. This paper proposes a piecewise integration method to remove trend items. Examples of direct integration without trend item removal, global integration after piecewise polynomial fitting with trend item removal, and direct integration after piecewise polynomial fitting with trend item removal were simulated. The results showed that direct integration of the fitted piecewise polynomial provided greater acceleration and displacement precision than the other two integration methods. A vibration test was then performed on a special equipment cab. The results indicated that direct integration by piecewise polynomial fitting with trend item removal was highly consistent with the measured signal data. However, the direct integration method without trend item removal resulted in signal distortion. The proposed method can help with frequency domain analysis of vibration signals and modal parameter identification for such equipment.

  12. Recommended core items to assess e-cigarette use in population-based surveys

    OpenAIRE

    Pearson, Jennifer L; Hitchman, Sara C; Brose, Leonie S; Bauld, Linda; Glasser, Allison M; Villanti, Andrea C; McNeill, Ann; Abrams, David B; Cohen, Joanna E

    2017-01-01

    Background: A consistent approach using standardized items to assess e-cigarette use in both youth and adult populations will aid cross-survey and cross-national comparisons of the effect of e-cigarette (and tobacco) policies and improve our understanding of the population health impact of e-cigarette use. Focusing on adult behavior, we propose a set of e-cigarette use items, discuss their utility and potential adaptation, and highlight e-cigarette constructs that researchers should avoid wit...

  13. 26 CFR 301.6224(c)-3 - Consistent settlements.

    Science.gov (United States)

    2010-04-01

    ... 26 Internal Revenue 18 2010-04-01 2010-04-01 false Consistent settlements. 301.6224(c)-3 Section... settlements. (a) In general. If the Internal Revenue Service enters into a settlement agreement with any..., settlement terms consistent with those contained in the settlement agreement entered into. (b) Requirements...

  14. Methodology for the development and calibration of the SCI-QOL item banks.

    Science.gov (United States)

    Tulsky, David S; Kisala, Pamela A; Victorson, David; Choi, Seung W; Gershon, Richard; Heinemann, Allen W; Cella, David

    2015-05-01

    To develop a comprehensive, psychometrically sound, and conceptually grounded patient reported outcomes (PRO) measurement system for individuals with spinal cord injury (SCI). Individual interviews (n=44) and focus groups (n=65 individuals with SCI and n=42 SCI clinicians) were used to select key domains for inclusion and to develop PRO items. Verbatim items from other cutting-edge measurement systems (i.e. PROMIS, Neuro-QOL) were included to facilitate linkage and cross-population comparison. Items were field tested in a large sample of individuals with traumatic SCI (n=877). Dimensionality was assessed with confirmatory factor analysis. Local item dependence and differential item functioning were assessed, and items were calibrated using the item response theory (IRT) graded response model. Finally, computer adaptive tests (CATs) and short forms were administered in a new sample (n=245) to assess test-retest reliability and stability. A calibration sample of 877 individuals with traumatic SCI across five SCI Model Systems sites and one Department of Veterans Affairs medical center completed SCI-QOL items in interview format. We developed 14 unidimensional calibrated item banks and 3 calibrated scales across physical, emotional, and social health domains. When combined with the five Spinal Cord Injury--Functional Index physical function banks, the final SCI-QOL system consists of 22 IRT-calibrated item banks/scales. Item banks may be administered as CATs or short forms. Scales may be administered in a fixed-length format only. The SCI-QOL measurement system provides SCI researchers and clinicians with a comprehensive, relevant and psychometrically robust system for measurement of physical-medical, physical-functional, emotional, and social outcomes. All SCI-QOL instruments are freely available on Assessment CenterSM.

  15. 77 FR 25737 - Notice of Intent To Repatriate Cultural Items: Arizona State Museum, University of Arizona...

    Science.gov (United States)

    2012-05-01

    ... the appropriate Indian tribes, has determined that the cultural items meet the definition of..., Tucson, AZ, that meets the definition of unassociated funerary objects under 25 U.S.C. 3001. This notice... mortuary program, ceramic types, and other items of material culture are consistent with the Hohokam...

  16. On-line item control at a high enriched nuclear fuel fabrication facility

    International Nuclear Information System (INIS)

    Lewis, T.W.; Lewis, H.M.

    1984-01-01

    The on-line item control system at Nuclear Fuel Services, Inc., is a near-real time method capable of tracking uniquely identified items from creation through disposition. The system provides for improved control, timeliness, accuracy and usability of company information and the necessary data required to support the regulatory program for the protection against diversion of Special Nuclear Materials. The system consists of software applications (approximately 150 programs) with man/machine interface controls which provide facilities for correct data entry and for the protection of data integrity. This system went into stand-alone operation in September, 1983 after a twenty month parallel test run with the previous keybatched (manual forms) item control system

  17. The Effects of Goal Relevance and Perceptual Features on Emotional Items and Associative Memory

    Directory of Open Access Journals (Sweden)

    Wei B. Mao

    2017-07-01

    Full Text Available Showing an emotional item in a neutral background scene often leads to enhanced memory for the emotional item and impaired associative memory for background details. Meanwhile, both top–down goal relevance and bottom–up perceptual features played important roles in memory binding. We conducted two experiments and aimed to further examine the effects of goal relevance and perceptual features on emotional items and associative memory. By manipulating goal relevance (asking participants to categorize only each item image as living or non-living or to categorize each whole composite picture consisted of item image and background scene as natural scene or manufactured scene and perceptual features (controlling visual contrast and visual familiarity in two experiments, we found that both high goal relevance and salient perceptual features (high salience of items vs. high familiarity of items could promote emotional item memory, but they had different effects on associative memory for emotional items and neutral backgrounds. Specifically, high goal relevance and high perceptual-salience of items could jointly impair the associative memory for emotional items and neutral backgrounds, while the effect of item familiarity on associative memory for emotional items would be modulated by goal relevance. High familiarity of items could increase associative memory for negative items and neutral backgrounds only in the low goal relevance condition. These findings suggest the effect of emotion on associative memory is not only related to attentional capture elicited by emotion, but also can be affected by goal relevance and perceptual features of stimulus.

  18. Instructional Topics in Educational Measurement (ITEMS) Module: Using Automated Processes to Generate Test Items

    Science.gov (United States)

    Gierl, Mark J.; Lai, Hollis

    2013-01-01

    Changes to the design and development of our educational assessments are resulting in the unprecedented demand for a large and continuous supply of content-specific test items. One way to address this growing demand is with automatic item generation (AIG). AIG is the process of using item models to generate test items with the aid of computer…

  19. A 67-Item Stress Resilience item bank showing high content validity was developed in a psychosomatic sample.

    Science.gov (United States)

    Obbarius, Nina; Fischer, Felix; Obbarius, Alexander; Nolte, Sandra; Liegl, Gregor; Rose, Matthias

    2018-04-10

    To develop the first item bank to measure Stress Resilience (SR) in clinical populations. Qualitative item development resulted in an initial pool of 131 items covering a broad theoretical SR concept. These items were tested in n=521 patients at a psychosomatic outpatient clinic. Exploratory and Confirmatory Factor Analysis (CFA), as well as other state-of-the-art item analyses and IRT were used for item evaluation and calibration of the final item bank. Out of the initial item pool of 131 items, we excluded 64 items (54 factor loading .3, 2 non-discriminative Item Response Curves, 4 Differential Item Functioning). The final set of 67 items indicated sufficient model fit in CFA and IRT analyses. Additionally, a 10-item short form with high measurement precision (SE≤.32 in a theta range between -1.8 and +1.5) was derived. Both the SR item bank and the SR short form were highly correlated with an existing static legacy tool (Connor-Davidson Resilience Scale). The final SR item bank and 10-item short form showed good psychometric properties. When further validated, they will be ready to be used within a framework of Computer-Adaptive Tests for a comprehensive assessment of the Stress-Construct. Copyright © 2018. Published by Elsevier Inc.

  20. Teoria da Resposta ao Item Teoria de la respuesta al item Item response theory

    Directory of Open Access Journals (Sweden)

    Eutalia Aparecida Candido de Araujo

    2009-12-01

    Full Text Available A preocupação com medidas de traços psicológicos é antiga, sendo que muitos estudos e propostas de métodos foram desenvolvidos no sentido de alcançar este objetivo. Entre os trabalhos propostos, destaca-se a Teoria da Resposta ao Item (TRI que, a princípio, veio completar limitações da Teoria Clássica de Medidas, empregada em larga escala até hoje na medida de traços psicológicos. O ponto principal da TRI é que ela leva em consideração o item particularmente, sem relevar os escores totais; portanto, as conclusões não dependem apenas do teste ou questionário, mas de cada item que o compõe. Este artigo propõe-se a apresentar esta Teoria que revolucionou a teoria de medidas.La preocupación con las medidas de los rasgos psicológicos es antigua y muchos estudios y propuestas de métodos fueron desarrollados para lograr este objetivo. Entre estas propuestas de trabajo se incluye la Teoría de la Respuesta al Ítem (TRI que, en principio, vino a completar las limitaciones de la Teoría Clásica de los Tests, ampliamente utilizada hasta hoy en la medida de los rasgos psicológicos. El punto principal de la TRI es que se tiene en cuenta el punto concreto, sin relevar las puntuaciones totales; por lo tanto, los resultados no sólo dependen de la prueba o cuestionario, sino que de cada ítem que lo compone. En este artículo se propone presentar la Teoría que revolucionó la teoría de medidas.The concern with measures of psychological traits is old and many studies and proposals of methods were developed to achieve this goal. Among these proposed methods highlights the Item Response Theory (IRT that, in principle, came to complete limitations of the Classical Test Theory, which is widely used until nowadays in the measurement of psychological traits. The main point of IRT is that it takes into account the item in particular, not relieving the total scores; therefore, the findings do not only depend on the test or questionnaire

  1. Development of the PROMIS positive emotional and sensory expectancies of smoking item banks.

    Science.gov (United States)

    Tucker, Joan S; Shadel, William G; Edelen, Maria Orlando; Stucky, Brian D; Li, Zhen; Hansen, Mark; Cai, Li

    2014-09-01

    The positive emotional and sensory expectancies of cigarette smoking include improved cognitive abilities, positive affective states, and pleasurable sensorimotor sensations. This paper describes development of Positive Emotional and Sensory Expectancies of Smoking item banks that will serve to standardize the assessment of this construct among daily and nondaily cigarette smokers. Data came from daily (N = 4,201) and nondaily (N =1,183) smokers who completed an online survey. To identify a unidimensional set of items, we conducted item factor analyses, item response theory analyses, and differential item functioning analyses. Additionally, we evaluated the performance of fixed-item short forms (SFs) and computer adaptive tests (CATs) to efficiently assess the construct. Eighteen items were included in the item banks (15 common across daily and nondaily smokers, 1 unique to daily, 2 unique to nondaily). The item banks are strongly unidimensional, highly reliable (reliability = 0.95 for both), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.86). Results from simulated CATs indicated that, on average, less than 8 items are needed to assess the construct with adequate precision using the item banks. These analyses identified a new set of items that can assess the positive emotional and sensory expectancies of smoking in a reliable and standardized manner. Considerable efficiency in assessing this construct can be achieved by using the item bank SF, employing computer adaptive tests, or selecting subsets of items tailored to specific research or clinical purposes. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  2. Validating the 11-Item Revised University of California Los Angeles Scale to Assess Loneliness Among Older Adults: An Evaluation of Factor Structure and Other Measurement Properties.

    Science.gov (United States)

    Lee, Joonyup; Cagle, John G

    2017-11-01

    To examine the measurement properties and factor structure of the short version of the Revised University of California Los Angeles (R-UCLA) loneliness scale from the Health and Retirement Study (HRS). Based on data from 3,706 HRS participants aged 65 + who completed the 2012 wave of the HRS and its Psychosocial Supplement, the measurement properties and factorability of the R-UCLA were examined by conducting an exploratory factor analysis (EFA) and the confirmatory factor analysis (CFA) on randomly split halves. The average score for the 11-item loneliness scale was 16.4 (standard deviation: 4.5). An evaluation of the internal consistency produced a Cronbach's α of 0.87. Results from the EFA showed that two- and three-factor models were appropriate. However, based on the results of the CFA, only a two-factor model was determined to be suitable because there was a very high correlation between two factors identified in the three-factor model, available social connections and sense of belonging. This study provides important data on the properties of the 11-item R-UCLA scale by identifying a two-factor model of loneliness: feeling isolated and available social connections. Our findings suggest the 11-item R-UCLA has good factorability and internal reliability. Copyright © 2017 American Association for Geriatric Psychiatry. Published by Elsevier Inc. All rights reserved.

  3. Operating Experience Report: Counterfeit, Suspect and Fraudulent Items. Working Group on Operating Experience. Proceedings and Analysis on an Item of Generic Interest

    International Nuclear Information System (INIS)

    2011-01-01

    The NEA Committee on Nuclear Regulatory Activities (CNRA) believes that sharing operating experience from the national operating experience feedback programmes are a major element in the industry's and regulatory body's efforts to ensure the continued safe operation of nuclear facilities. Considering the importance of these issues, the Committee on the Safety of Nuclear Installations (CSNI) established a working group, PWG no.1 (Principle Working Group Number 1) to assess operating experience in the late 1970's, which was later renamed the Working Group on Operating Experience (WGOE). In 1978, the CSNI approved the establishment of a system to collect international operating experience data. The accident at Three Mile Island shortly after added impetus to this and led to the start of the Incident Reporting System (IRS). In 1983, the IRS database was moved to the International Agency for Atomic Energy (IAEA) to be operated as a joint database by IAEA and NEA for the benefit of all of the member countries of both organisations. In 2006, the WGOE was moved to be under the umbrella of the Committee on Nuclear Regulatory Activities (CNRA) in NEA. In 2009, the scope of the Incident Reporting System was expanded and re-named the International Reporting System for Operating Experience (although, the acronym remains the same). The purpose of WGOE is to facilitate the exchange of information, experience, and lessons learnt related to operating experience between member countries. The working group continues its mission to identify trending and issues that should be addressed in specialty areas of CNRA and CSNI working groups. The CSFI (Counterfeit, Suspect, and Fraudulent Items) issue was determined to be the Issue of Generic Interest at the April 2010 WGOE meeting. The Issue of Generic Interest is determined by the working group members for an in-depth discussion. They are often emerging issues in operating experience that a country or several countries would to the share

  4. Internalized Homophobia Scale for Gay Chinese Men: Conceptualization, Factor Structure, Reliability, and Associations With Hypothesized Correlates.

    Science.gov (United States)

    Ren, Zhengjia; Hood, Ralph W

    2018-04-01

    This study reports the development of an inventory to assess the perceived internalized homophobia of gay men in a collectivistic Chinese cultural context. The results of exploratory and confirmatory factor analyses using two samples suggested the viability and stability of a three-factor model: internalized heteronormativity (IHN), family-oriented identity (FOI), and socially oriented identity (SOI). The 11-item internalized homophobia inventory demonstrated good internal consistency and construct validity. Internalized homophobia was related positively to the extent of a sense of loneliness and negatively to self-evaluation and the discrepancy in self-identification as a gay man. In addition, the participants' internalized SOI consistently predicted their coming out choices in their social surroundings, while their FOI predicted their decisions to enter into heterosexual marriages. The findings suggest that sexual self-prejudice was correlated with IHN, family values, and social norms. The present research demonstrates that a culturally sensitive scale is necessary to understand the cultural and family-oriented values that influence gay Chinese men's everyday lives, self-constructs, and behavioral choices.

  5. Fuzzy prototype classifier based on items and its application in recommender system

    Directory of Open Access Journals (Sweden)

    Mei Cai

    2017-01-01

    Full Text Available Currently, recommender systems (RS are incorporating implicit information from social circle of the Internet. The implicit social information in human mind is not easy to reflect in appropriate decision making techniques. This paper consists of 2 contributions. First, we develop an item-based prototype classifier (IPC in which a prototype represents a social circlers preferences as a pattern classification technique. We assume the social circle which distinguishes with others by the items their members like. The prototype structure of the classifier is defined by two2-dimensional matrices. We use information gain and OWA aggregator to construct a feature space. The item-based classifier assigns a new item to some prototypes with different prototypicalities. We reform a typical data setmIris data set in UCI Machine Learning Repository to verify our fuzzy prototype classifier. The second proposition of this paper is to give the application of IPC in recommender system to solve new item cold-start problems. We modify the dataset of MovieLens to perform experimental demonstrations of the proposed ideas.

  6. Computerized Adaptive Test (CAT) Applications and Item Response Theory Models for Polytomous Items

    Science.gov (United States)

    Aybek, Eren Can; Demirtasli, R. Nukhet

    2017-01-01

    This article aims to provide a theoretical framework for computerized adaptive tests (CAT) and item response theory models for polytomous items. Besides that, it aims to introduce the simulation and live CAT software to the related researchers. Computerized adaptive test algorithm, assumptions of item response theory models, nominal response…

  7. Validation of the Malayalam version of the Internalized Stigma of Mental Illness (ISMI) scale.

    Science.gov (United States)

    James, Tintu; Kutty, V Raman; Boyd, Jennifer; Brzoska, Patrick

    2016-04-01

    Little is known about internalized stigma of mental illness in India. A reason for this could be the lack of valid assessment instruments adapted for the diverse cultures and languages of the country. One of the most widely used and accepted questionnaires to assess internalized stigma is the 29-item Internalized Stigma of Mental Illness (ISMI) scale. The aim of the present study was to translate and adapt the ISMI to the Malayalam-speaking population of Kerala, India and to assess its content and factorial validity. The content validity of the Malayalam-language ISMI was studied through interviews with 7 experts on stigma in India. Factorial validity was examined by means of a confirmatory factor analysis (CFA) based on a cross-sectional survey among 290 patients with mental illness attending follow-up outpatient and primary care clinics in Kerala, India. The expert panel concluded that the items of the translated questionnaire adequately represent internalized stigma in the Malayalam-speaking population of Kerala. The theorized factor structure of the ISMI consisting of five factors showed a suboptimal model fit (WRMR=0.940; TLI=0.971, CFI=0.948; RMSEA=0.059) which improved considerably after removal of the stigma resistance factor and three items with poor factor loadings (WRMR=0.819; TLI=0.982, CFI=0.966; RMSEA=0.051). Although our study identifies some sources of model ill-fit, it shows that a reduced version of the Malayalam-language ISMI can be a valuable tool for the study of internalized stigma in this cultural setting. Copyright © 2016 Elsevier B.V. All rights reserved.

  8. Language-related differential item functioning between English and German PROMIS Depression items is negligible.

    Science.gov (United States)

    Fischer, H Felix; Wahl, Inka; Nolte, Sandra; Liegl, Gregor; Brähler, Elmar; Löwe, Bernd; Rose, Matthias

    2017-12-01

    To investigate differential item functioning (DIF) of PROMIS Depression items between US and German samples we compared data from the US PROMIS calibration sample (n = 780), a German general population survey (n = 2,500) and a German clinical sample (n = 621). DIF was assessed in an ordinal logistic regression framework, with 0.02 as criterion for R 2 -change and 0.096 for Raju's non-compensatory DIF. Item parameters were initially fixed to the PROMIS Depression metric; we used plausible values to account for uncertainty in depression estimates. Only four items showed DIF. Accounting for DIF led to negligible effects for the full item bank as well as a post hoc simulated computer-adaptive test (German general population sample was considerably lower compared to the US reference value of 50. Overall, we found little evidence for language DIF between US and German samples, which could be addressed by either replacing the DIF items by items not showing DIF or by scoring the short form in German samples with the corrected item parameters reported. Copyright © 2016 John Wiley & Sons, Ltd.

  9. Students' Preference for Science Careers: International comparisons based on PISA 2006

    Science.gov (United States)

    Kjærnsli, Marit; Lie, Svein

    2011-01-01

    This article deals with 15-year-old students' tendencies to consider a future science-related career. Two aspects have been the focus of our investigation. The first is based on the construct called 'future science orientation', an affective construct consisting of four Likert scale items that measure students' consideration of being involved in future education and careers in science-related areas. Due to the well-known evidence for Likert scales providing culturally biased estimates, the aim has been to go beyond the comparison of simple country averages. In a series of regression and correlation analyses, we have investigated how well the variance of this construct in each of the participating countries can be accounted for by other Programme for International Student Assessment (PISA) student data. The second aspect is based on a question about students' future jobs. By separating science-related jobs into what we have called 'soft' and 'hard' science-related types of jobs, we have calculated and compared country percentages within each category. In particular, gender differences are discussed, and interesting international patterns have been identified. The results in this article have been reported not only for individual countries, but also for groups of countries. These cluster analyses of countries are based on item-by-item patterns of (residual values of) national average values for the combination of cognitive and affective items. The emerging cluster structure of countries has turned out to contribute to the literature of similarities and differences between countries and the factors behind the country clustering both in science education and more generally.

  10. Citizens' perceptions of political processes. A critical evaluation of preference consistency and survey items

    Directory of Open Access Journals (Sweden)

    Bengtsson, Åsa

    2012-12-01

    Full Text Available The current state of research does not tell us much about citizens’ expectations of political decision making. Most surveys allow respondents to evaluate how the current system is working, but do not inquire about alternative political decision-making procedures. The lack of established survey items can be explained by the fact that radical changes in decision-making procedures have been hard to envisage, but also by a general scepticism regarding people’s ability to form opinions on these matters. Political processes are, without doubt, complex matters that do not lend themselves very well to simplistic survey questions. Moreover, previous research has convincingly shown that most people in general have difficulties forming single, coherent and stable attitudes even towards far more straightforward political issues. In order to determine if trying to grasp attitudes towards political decision-making in future empirical studies can be considered a fruitful endeavour, this study sets out to critically assess the extent to which people express coherent preferences on these matters, and if preferences are in line with expectations in previous, rather scattered research. The study is based on the Finnish National Election Study 2011; a study which, contrary to most other election studies, includes a rich variety of survey items on the topic, and utilises a combination of strategies in order to explore patterns in the opinions held by citizens.

    El estado actual de las investigaciones no nos dice mucho sobre las expectativas de los ciudadanos con respecto a la toma de decisiones políticas. La mayoría de las encuestas permiten que quienes las responden evalúen cómo funciona el sistema actual, pero no preguntan por procedimientos alternativos de decisión política. La falta de preguntas de encuesta contrastadas se puede explicar tanto por el hecho de que los cambios en los procedimientos de toma de decisiones han resultado difíciles de

  11. Development and validation of an international appraisal instrument for assessing the quality of clinical practice guidelines: the AGREE project.

    Science.gov (United States)

    2003-02-01

    International interest in clinical practice guidelines has never been greater but many published guidelines do not meet the basic quality requirements. There have been renewed calls for validated criteria to assess the quality of guidelines. To develop and validate an international instrument for assessing the quality of the process and reporting of clinical practice guideline development. The instrument was developed through a multi-staged process of item generation, selection and scaling, field testing, and refinement procedures. 100 guidelines selected from 11 participating countries were evaluated independently by 194 appraisers with the instrument. Following refinement the instrument was further field tested on three guidelines per country by a new set of 70 appraisers. The final version of the instrument contained 23 items grouped into six quality domains with a 4 point Likert scale to score each item (scope and purpose, stakeholder involvement, rigour of development, clarity and presentation, applicability, editorial independence). 95% of appraisers found the instrument useful for assessing guidelines. Reliability was acceptable for most domains (Cronbach's alpha 0.64-0.88). Guidelines produced as part of an established guideline programme had significantly higher scores on editorial independence and, after the publication of a national policy, had significantly higher quality scores on rigour of development (pinternationally. The instrument is sensitive to differences in important aspects of guidelines and can be used consistently and easily by a wide range of professionals from different backgrounds. The adoption of common standards should improve the consistency and quality of the reporting of guideline development worldwide and provide a framework to encourage international comparison of clinical practice guidelines.

  12. Why Students Answer TIMSS Science Test Items the Way They Do

    Science.gov (United States)

    Harlow, Ann; Jones, Alister

    2004-04-01

    The purpose of this study was to explore how Year 8 students answered Third International Mathematics and Science Study (TIMSS) questions and whether the test questions represented the scientific understanding of these students. One hundred and seventy-seven students were tested using written test questions taken from the science test used in the Third International Mathematics and Science Study. The degree to which a sample of 38 children represented their understanding of the topics in a written test compared to the level of understanding that could be elicited by an interview is presented in this paper. In exploring student responses in the interview situation this study hoped to gain some insight into the science knowledge that students held and whether or not the test items had been able to elicit this knowledge successfully. We question the usefulness and quality of data from large-scale summative assessments on their own to represent student scientific understanding and conclude that large scale written test items, such as TIMSS, on their own are not a valid way of exploring students'' understanding of scientific concepts. Considerable caution is therefore needed in exploiting the outcomes of international achievement testing when considering educational policy changes or using TIMSS data on their own to represent student understanding.

  13. Selecting Items for Criterion-Referenced Tests.

    Science.gov (United States)

    Mellenbergh, Gideon J.; van der Linden, Wim J.

    1982-01-01

    Three item selection methods for criterion-referenced tests are examined: the classical theory of item difficulty and item-test correlation; the latent trait theory of item characteristic curves; and a decision-theoretic approach for optimal item selection. Item contribution to the standardized expected utility of mastery testing is discussed. (CM)

  14. Geriatric Anxiety Scale: item response theory analysis, differential item functioning, and creation of a ten-item short form (GAS-10).

    Science.gov (United States)

    Mueller, Anne E; Segal, Daniel L; Gavett, Brandon; Marty, Meghan A; Yochim, Brian; June, Andrea; Coolidge, Frederick L

    2015-07-01

    The Geriatric Anxiety Scale (GAS; Segal et al. (Segal, D. L., June, A., Payne, M., Coolidge, F. L. and Yochim, B. (2010). Journal of Anxiety Disorders, 24, 709-714. doi:10.1016/j.janxdis.2010.05.002) is a self-report measure of anxiety that was designed to address unique issues associated with anxiety assessment in older adults. This study is the first to use item response theory (IRT) to examine the psychometric properties of a measure of anxiety in older adults. A large sample of older adults (n = 581; mean age = 72.32 years, SD = 7.64 years, range = 60 to 96 years; 64% women; 88% European American) completed the GAS. IRT properties were examined. The presence of differential item functioning (DIF) or measurement bias by age and sex was assessed, and a ten-item short form of the GAS (called the GAS-10) was created. All GAS items had discrimination parameters of 1.07 or greater. Items from the somatic subscale tended to have lower discrimination parameters than items on the cognitive or affective subscales. Two items were flagged for DIF, but the impact of the DIF was negligible. Women scored significantly higher than men on the GAS and its subscales. Participants in the young-old group (60 to 79 years old) scored significantly higher on the cognitive subscale than participants in the old-old group (80 years old and older). Results from the IRT analyses indicated that the GAS and GAS-10 have strong psychometric properties among older adults. We conclude by discussing implications and future research directions.

  15. Asymptotic Standard Errors for Item Response Theory True Score Equating of Polytomous Items

    Science.gov (United States)

    Cher Wong, Cheow

    2015-01-01

    Building on previous works by Lord and Ogasawara for dichotomous items, this article proposes an approach to derive the asymptotic standard errors of item response theory true score equating involving polytomous items, for equivalent and nonequivalent groups of examinees. This analytical approach could be used in place of empirical methods like…

  16. MIMIC Methods for Assessing Differential Item Functioning in Polytomous Items

    Science.gov (United States)

    Wang, Wen-Chung; Shih, Ching-Lin

    2010-01-01

    Three multiple indicators-multiple causes (MIMIC) methods, namely, the standard MIMIC method (M-ST), the MIMIC method with scale purification (M-SP), and the MIMIC method with a pure anchor (M-PA), were developed to assess differential item functioning (DIF) in polytomous items. In a series of simulations, it appeared that all three methods…

  17. Evaluation of Northwest University, Kano Post-UTME Test Items Using Item Response Theory

    Science.gov (United States)

    Bichi, Ado Abdu; Hafiz, Hadiza; Bello, Samira Abdullahi

    2016-01-01

    High-stakes testing is used for the purposes of providing results that have important consequences. Validity is the cornerstone upon which all measurement systems are built. This study applied the Item Response Theory principles to analyse Northwest University Kano Post-UTME Economics test items. The developed fifty (50) economics test items was…

  18. The test of variables of attention (TOVA): Internal consistency (Q1 vs. Q2 and Q3 vs. Q4) in children with Attention Deficit/Hyperactivity Disorder (ADHD)

    Science.gov (United States)

    The internal consistency of the Test of Variables of Attention (TOVA) was examined in a cohort of 6- to 12-year-old children (N = 63) strictly diagnosed with ADHD. The internal consistency of errors of omission (OMM), errors of commission (COM), response time (RT), and response time variability (RTV...

  19. Using Differential Item Functioning Procedures to Explore Sources of Item Difficulty and Group Performance Characteristics.

    Science.gov (United States)

    Scheuneman, Janice Dowd; Gerritz, Kalle

    1990-01-01

    Differential item functioning (DIF) methodology for revealing sources of item difficulty and performance characteristics of different groups was explored. A total of 150 Scholastic Aptitude Test items and 132 Graduate Record Examination general test items were analyzed. DIF was evaluated for males and females and Blacks and Whites. (SLD)

  20. Using Item Analysis to Assess Objectively the Quality of the Calgary-Cambridge OSCE Checklist

    Directory of Open Access Journals (Sweden)

    Tyrone Donnon

    2011-06-01

    Full Text Available Background:  The purpose of this study was to investigate the use of item analysis to assess objectively the quality of items on the Calgary-Cambridge Communications OSCE checklist. Methods:  A total of 150 first year medical students were provided with extensive teaching on the use of the Calgary-Cambridge Guidelines for interviewing patients and participated in a final year end 20 minute communication OSCE station.  Grouped into either the upper half (50% or lower half (50% communication skills performance groups, discrimination, difficulty and point biserial values were calculated for each checklist item. Results:  The mean score on the 33 item communication checklist was 24.09 (SD = 4.46 and the internal reliability coefficient was ? = 0.77. Although most of the items were found to have moderate (k = 12, 36% or excellent (k = 10, 30% discrimination values, there were 6 (18% identified as ‘fair’ and 3 (9% as ‘poor’. A post-examination review focused on item analysis findings resulted in an increase in checklist reliability (? = 0.80. Conclusions:  Item analysis has been used with MCQ exams extensively. In this study, it was also found to be an objective and practical approach to use in evaluating the quality of a standardized OSCE checklist.

  1. Development of the Oxford Participation and Activities Questionnaire: constructing an item pool

    Directory of Open Access Journals (Sweden)

    Kelly L

    2015-05-01

    Full Text Available Laura Kelly, Crispin Jenkinson, Sarah Dummett, Jill Dawson, Ray Fitzpatrick, David Morley Health Services Research Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK Purpose: The Oxford Participation and Activities Questionnaire is a patient-reported outcome measure in development that is grounded on the World Health Organization International Classification of Functioning, Disability, and Health (ICF. The study reported here aimed to inform and generate an item pool for the new measure, which is specifically designed for the assessment of participation and activity in patients experiencing a range of health conditions. Methods: Items were informed through in-depth interviews conducted with 37 participants spanning a range of conditions. Interviews aimed to identify how their condition impacted their ability to participate in meaningful activities. Conditions included arthritis, cancer, chronic back pain, diabetes, motor neuron disease, multiple sclerosis, Parkinson's disease, and spinal cord injury. Transcripts were analyzed using the framework method. Statements relating to ICF themes were recast as questionnaire items and shown for review to an expert panel. Cognitive debrief interviews (n=13 were used to assess items for face and content validity. Results: ICF themes relevant to activities and participation in everyday life were explored, and a total of 222 items formed the initial item pool. This item pool was refined by the research team and 28 generic items were mapped onto all nine chapters of the ICF construct, detailing activity and participation. Cognitive interviewing confirmed the questionnaire instructions, items, and response options were acceptable to participants. Conclusion: Using a clear conceptual basis to inform item generation, 28 items have been identified as suitable to undergo further psychometric testing. A large-scale postal survey will follow in order to refine the instrument further and

  2. Item Banking with Embedded Standards

    Science.gov (United States)

    MacCann, Robert G.; Stanley, Gordon

    2009-01-01

    An item banking method that does not use Item Response Theory (IRT) is described. This method provides a comparable grading system across schools that would be suitable for low-stakes testing. It uses the Angoff standard-setting method to obtain item ratings that are stored with each item. An example of such a grading system is given, showing how…

  3. Short term memory bowing effect is consistent with presentation rate dependent decay.

    Science.gov (United States)

    Tarnow, Eugen

    2010-12-01

    I reanalyze the free recall data of Murdock, J Exp Psychol 64(5):482-488 (1962) and Murdock and Okada, J Verbal Learn and Verbal Behav 86:263-267 (1970) which show the famous bowing effect in which initial and recent items are recalled better than intermediate items (primacy and recency effects). Recent item recall probabilities follow a logarithmic decay with time of recall consistent with the tagging/retagging theory. The slope of the decay increases with increasing presentation rate. The initial items, with an effectively low presentation rate, decay with the slowest logarithmic slope, explaining the primacy effect. The finding that presentation rate limits the duration of short term memory suggests a basis for memory loss in busy adults, for the importance of slow music practice, for long term memory deficiencies for people with attention deficits who may be artificially increasing the presentation rates of their surroundings. A well-defined, quantitative measure of the primacy effect is introduced.

  4. Reliability, factor analysis and internal consistency calculation of the Insomnia Severity Index (ISI in French and in English among Lebanese adolescents

    Directory of Open Access Journals (Sweden)

    M. Chahoud

    2017-06-01

    Conclusion: The results of our analyses reveal that both English and French versions of the ISI scale have good internal consistency and are reproducible and reliable. Therefore, it can be used to assess the prevalence of insomnia in Lebanese adolescents.

  5. Effect of Clinically Discriminating, Evidence-Based Checklist Items on the Reliability of Scores from an Internal Medicine Residency OSCE

    Science.gov (United States)

    Daniels, Vijay J.; Bordage, Georges; Gierl, Mark J.; Yudkowsky, Rachel

    2014-01-01

    Objective structured clinical examinations (OSCEs) are used worldwide for summative examinations but often lack acceptable reliability. Research has shown that reliability of scores increases if OSCE checklists for medical students include only clinically relevant items. Also, checklists are often missing evidence-based items that high-achieving…

  6. Criticality Safety Support to a Project Addressing SNM Legacy Items at LLNL

    International Nuclear Information System (INIS)

    Pearson, J S; Burch, J G; Dodson, K E; Huang, S T

    2005-01-01

    The programmatic, facility and criticality safety support staffs at the LLNL Plutonium Facility worked together to successfully develop and implement a project to process legacy (DNFSB Recommendation 94-1 and non-Environmental, Safety, and Health (ES and H) labeled) materials in storage. Over many years, material had accumulated in storage that lacked information to adequately characterize the material for current criticality safety controls used in the facility. Generally, the fissionable material mass information was well known, but other information such as form, impurities, internal packaging, and presence of internal moderating or reflecting materials were not well documented. In many cases, the material was excess to programmatic need, but such a determination was difficult with the little information given on MC and A labels and in the MC and A database. The material was not packaged as efficiently as possible, so it also occupied much more valuable storage space than was necessary. Although safe as stored, the inadequately characterized material posed a risk for criticality safety noncompliances if moved within the facility under current criticality safety controls. A Legacy Item Implementation Plan was developed and implemented to deal with this problem. Reasonable bounding conditions were determined for the material involved, and criticality safety evaluations were completed. Two appropriately designated glove boxes were identified and criticality safety controls were developed to safely inspect the material. Inspecting the material involved identifying containers of legacy material, followed by opening, evaluating, processing if necessary, characterizing and repackaging the material. Material from multiple containers was consolidated more efficiently thus decreasing the total number of stored items to about one half of the highest count. Current packaging requirements were implemented. Detailed characterization of the material was captured in databases

  7. Crosslinguistic Developmental Consistency in the Composition of Toddlers’ Internal State Vocabulary: Evidence from Four Languages

    Directory of Open Access Journals (Sweden)

    Susanne Kristen

    2014-01-01

    Full Text Available Mental state language, emerging in the second and third years of life in typically developing children, is one of the first signs of an explicit psychological understanding. While mental state vocabulary may serve a variety of conversational functions in discourse and thus might not always indicate psychological comprehension, there is evidence for genuine references to mental states (desires, knowledge, beliefs, and emotions early in development across languages. This present study presents parental questionnaire data on the composition of 297 toddler-aged (30-to 32-month-olds children’s internal state vocabulary in four languages: Italian, German, English, and French. The results demonstrated that across languages expressions for physiological states (e.g., hungry and tired were among the most varied, while children’s vocabulary for cognitive entities (e.g., know and think proved to be least varied. Further, consistent with studies on children’s comprehension of these concepts, across languages children’s mastery of volition terms (e.g., like to do and want preceded their mastery of cognition terms. These findings confirm the cross-linguistic consistency of children’s emerging expression of abstract psychological concepts.

  8. TEDS-M 2008 User Guide for the International Database. Supplement 4: TEDS-M Released Mathematics and Mathematics Pedagogy Knowledge Assessment Items

    Science.gov (United States)

    Brese, Falk, Ed.

    2012-01-01

    The goal for selecting the released set of test items was to have approximately 25% of each of the full item sets for mathematics content knowledge (MCK) and mathematics pedagogical content knowledge (MPCK) that would represent the full range of difficulty, content, and item format used in the TEDS-M study. The initial step in the selection was to…

  9. Differential item functioning analysis with ordinal logistic regression techniques. DIFdetect and difwithpar.

    Science.gov (United States)

    Crane, Paul K; Gibbons, Laura E; Jolley, Lance; van Belle, Gerald

    2006-11-01

    We present an ordinal logistic regression model for identification of items with differential item functioning (DIF) and apply this model to a Mini-Mental State Examination (MMSE) dataset. We employ item response theory ability estimation in our models. Three nested ordinal logistic regression models are applied to each item. Model testing begins with examination of the statistical significance of the interaction term between ability and the group indicator, consistent with nonuniform DIF. Then we turn our attention to the coefficient of the ability term in models with and without the group term. If including the group term has a marked effect on that coefficient, we declare that it has uniform DIF. We examined DIF related to language of test administration in addition to self-reported race, Hispanic ethnicity, age, years of education, and sex. We used PARSCALE for IRT analyses and STATA for ordinal logistic regression approaches. We used an iterative technique for adjusting IRT ability estimates on the basis of DIF findings. Five items were found to have DIF related to language. These same items also had DIF related to other covariates. The ordinal logistic regression approach to DIF detection, when combined with IRT ability estimates, provides a reasonable alternative for DIF detection. There appear to be several items with significant DIF related to language of test administration in the MMSE. More attention needs to be paid to the specific criteria used to determine whether an item has DIF, not just the technique used to identify DIF.

  10. Development and Standardization of the Diagnostic Adaptive Behavior Scale: Application of Item Response Theory to the Assessment of Adaptive Behavior

    Science.gov (United States)

    Tassé, Marc J.; Schalock, Robert L.; Thissen, David; Balboni, Giulia; Bersani, Henry, Jr.; Borthwick-Duffy, Sharon A.; Spreat, Scott; Widaman, Keith F.; Zhang, Dalun; Navas, Patricia

    2016-01-01

    The Diagnostic Adaptive Behavior Scale (DABS) was developed using item response theory (IRT) methods and was constructed to provide the most precise and valid adaptive behavior information at or near the cutoff point of making a decision regarding a diagnosis of intellectual disability. The DABS initial item pool consisted of 260 items. Using IRT…

  11. Assessing the validity of single-item life satisfaction measures: results from three large samples.

    Science.gov (United States)

    Cheung, Felix; Lucas, Richard E

    2014-12-01

    The present paper assessed the validity of single-item life satisfaction measures by comparing single-item measures to the Satisfaction with Life Scale (SWLS)-a more psychometrically established measure. Two large samples from Washington (N = 13,064) and Oregon (N = 2,277) recruited by the Behavioral Risk Factor Surveillance System and a representative German sample (N = 1,312) recruited by the Germany Socio-Economic Panel were included in the present analyses. Single-item life satisfaction measures and the SWLS were correlated with theoretically relevant variables, such as demographics, subjective health, domain satisfaction, and affect. The correlations between the two life satisfaction measures and these variables were examined to assess the construct validity of single-item life satisfaction measures. Consistent across three samples, single-item life satisfaction measures demonstrated substantial degree of criterion validity with the SWLS (zero-order r = 0.62-0.64; disattenuated r = 0.78-0.80). Patterns of statistical significance for correlations with theoretically relevant variables were the same across single-item measures and the SWLS. Single-item measures did not produce systematically different correlations compared to the SWLS (average difference = 0.001-0.005). The average absolute difference in the magnitudes of the correlations produced by single-item measures and the SWLS was very small (average absolute difference = 0.015-0.042). Single-item life satisfaction measures performed very similarly compared to the multiple-item SWLS. Social scientists would get virtually identical answer to substantive questions regardless of which measure they use.

  12. Using PISA as an International Benchmark in Standard Setting.

    Science.gov (United States)

    Phillips, Gary W; Jiang, Tao

    2015-01-01

    This study describes how the Programme for International Student Assessment (PISA) can be used to internationally benchmark state performance standards. The process is accomplished in three steps. First, PISA items are embedded in the administration of the state assessment and calibrated on the state scale. Second, the international item calibrations are then used to link the state scale to the PISA scale through common item linking. Third, the statistical linking results are used as part of the state standard setting process to help standard setting panelists determine how high their state standards need to be in order to be internationally competitive. This process was carried out in Delaware, Hawaii, and Oregon, in three subjects-science, mathematics and reading with initial results reported by Phillips and Jiang (2011). An in depth discussion of methods and results are reported in this article for one subject (mathematics) and one state (Hawaii).

  13. Cultural Interchangeability? Culture-Specific Items in Translation

    Directory of Open Access Journals (Sweden)

    Ajtony Zsuzsanna

    2016-12-01

    Full Text Available This paper summarizes the results of the translation work carried out within an international project aiming to develop the language skills of staff working in hotel and catering services. As the topics touched upon in the English source texts are related to several European cultures, these cultural differences bring about several challenges related to the translation of realia, or culture-specific items (CSIs. In the first part of the paper, a series of translation strategies for rendering source-language CSIs into the target language are enlisted, while the second part presents the main strategies employed in the prepared translations.

  14. Cs-137 concentration in food items common to the Filipino dietary

    International Nuclear Information System (INIS)

    Cruz, B. de la

    1980-01-01

    The present investigation aims to determine the level of Cs-137 in various food items common to the Filipino dietary, consisting of cereals, fish, meat, vegetables and fruits and to estimate the average dose commitment of the average Filipino adult resulting from the aforementioned radionuclide. (author)

  15. Validation of a 10-item care-related regret intensity scale (RIS-10) for health care professionals.

    Science.gov (United States)

    Courvoisier, Delphine S; Cullati, Stéphane; Haller, Chiara S; Schmidt, Ralph E; Haller, Guy; Agoritsas, Thomas; Perneger, Thomas V

    2013-03-01

    Regret after one of the many decisions and interventions that health care professionals make every day can have an impact on their own health and quality of life, and on their patient care practices. To validate a new care-related regret intensity scale (RIS) for health care professionals. Retrospective cross-sectional cohort study with a 1-month follow-up (test-retest) in a French-speaking University Hospital. A total of 469 nurses and physicians responded to the survey, and 175 answered the retest. RIS, self-report questions on the context of the regret-inducing event, its consequences for the patient, involvement of the health care professionals, and changes in patient care practices after the event. We measured the impact of regret intensity on health care professionals with the satisfaction with life scale, the SF-36 first question (self-reported health), and a question on self-esteem. On the basis of factor analysis and item response analysis, the initial 19-item scale was shortened to 10 items. The resulting scale (RIS-10) was unidimensional and had high internal consistency (α=0.87) and acceptable test-retest reliability (0.70). Higher regret intensity was associated with (a) more consequences for the patient; (b) lower life satisfaction and poorer self-reported health in health care professionals; and (c) changes in patient care practices. Nurses reported analyzing the event and apologizing, whereas physicians reported talking preferentially to colleagues, rather than to their supervisor, about changing practices. The RIS is a valid and reliable measure of care-related regret intensity for hospital-based physicians and nurses.

  16. A signal detection-item response theory model for evaluating neuropsychological measures.

    Science.gov (United States)

    Thomas, Michael L; Brown, Gregory G; Gur, Ruben C; Moore, Tyler M; Patt, Virginie M; Risbrough, Victoria B; Baker, Dewleen G

    2018-02-05

    Models from signal detection theory are commonly used to score neuropsychological test data, especially tests of recognition memory. Here we show that certain item response theory models can be formulated as signal detection theory models, thus linking two complementary but distinct methodologies. We then use the approach to evaluate the validity (construct representation) of commonly used research measures, demonstrate the impact of conditional error on neuropsychological outcomes, and evaluate measurement bias. Signal detection-item response theory (SD-IRT) models were fitted to recognition memory data for words, faces, and objects. The sample consisted of U.S. Infantry Marines and Navy Corpsmen participating in the Marine Resiliency Study. Data comprised item responses to the Penn Face Memory Test (PFMT; N = 1,338), Penn Word Memory Test (PWMT; N = 1,331), and Visual Object Learning Test (VOLT; N = 1,249), and self-report of past head injury with loss of consciousness. SD-IRT models adequately fitted recognition memory item data across all modalities. Error varied systematically with ability estimates, and distributions of residuals from the regression of memory discrimination onto self-report of past head injury were positively skewed towards regions of larger measurement error. Analyses of differential item functioning revealed little evidence of systematic bias by level of education. SD-IRT models benefit from the measurement rigor of item response theory-which permits the modeling of item difficulty and examinee ability-and from signal detection theory-which provides an interpretive framework encompassing the experimentally validated constructs of memory discrimination and response bias. We used this approach to validate the construct representation of commonly used research measures and to demonstrate how nonoptimized item parameters can lead to erroneous conclusions when interpreting neuropsychological test data. Future work might include the

  17. Reliability and known-group validity of the Arabic version of the 8-item Morisky Medication Adherence Scale among type 2 diabetes mellitus patients.

    Science.gov (United States)

    Ashur, S T; Shamsuddin, K; Shah, S A; Bosseri, S; Morisky, D E

    2015-12-13

    No validation study has previously been made for the Arabic version of the 8-item Morisky Medication Adherence Scale (MMAS-8(©)) as a measure for medication adherence in diabetes. This study in 2013 tested the reliability and validity of the Arabic MMAS-8 for type 2 diabetes mellitus patients attending a referral centre in Tripoli, Libya. A convenience sample of 103 patients self-completed the questionnaire. Reliability was tested using Cronbach alpha, average inter-item correlation and Spearman-Brown coefficient. Known-group validity was tested by comparing MMAS-8 scores of patients grouped by glycaemic control. The Arabic version showed adequate internal consistency (α = 0.70) and moderate split-half reliability (r = 0.65). Known-group validity was supported as a significant association was found between medication adherence and glycaemic control, with a moderate effect size (ϕc = 0.34). The Arabic version displayed good psychometric properties and could support diabetes research and practice in Arab countries.

  18. International field testing of the psychometric properties of an EORTC quality of life module for oral health: the EORTC QLQ-OH15.

    Science.gov (United States)

    Hjermstad, Marianne J; Bergenmar, Mia; Bjordal, Kristin; Fisher, Sheila E; Hofmeister, Dirk; Montel, Sébastien; Nicolatou-Galitis, Ourania; Pinto, Monica; Raber-Durlacher, Judith; Singer, Susanne; Tomaszewska, Iwona M; Tomaszewski, Krzysztof A; Verdonck-de Leeuw, Irma; Yarom, Noam; Winstanley, Julie B; Herlofson, Bente B

    2016-09-01

    This international EORTC validation study (phase IV) is aimed at testing the psychometric properties of a quality of life (QoL) module related to oral health problems in cancer patients. The phase III module comprised 17 items with four hypothesized multi-item scales and three single items. In phase IV, patients with mixed cancers, in different treatment phases from 10 countries completed the EORTC QLQ-C30, the QLQ-OH module, and a debriefing interview. The hypothesized structure was tested using combinations of classical test theory and item response theory, following EORTC guidelines. Test-retest assessments and responsiveness to change analysis (RCA) were performed after 2 weeks. Five hundred seventy-two patients (median age 60.3, 54 % females) were analyzed. Completion took issues were addressed. Analyses suggested a revision of the phase III hypothesized scale structure. Two items were deleted based on a high degree of item misfit, together with negative patient feedback. The remaining 15 items formed one eight-item scale named OH-QoL score, a two-item information scale, a two-item scale regarding dentures, and three single items (sticky saliva/mouth soreness/sensitivity to food/drink). Face and convergent validity and internal consistency were confirmed. Test-retest reliability (n = 60) was demonstrated as was RCA for patients undergoing chemotherapy (n = 117; p = 0.06). The resulting QLQ-OH15 discriminated between clinically distinct patient groups, e.g., low performance status vs. higher (p < 000.1), and head-and-neck cancer versus other cancers (p < 0.03). The EORTC module QLQ-OH15 is a short, well-accepted assessment tool focusing on oral problems and QoL to improve clinical management. ClinicalTrials.gov Identifier: NCT01724333.

  19. Scoring best-worst data in unbalanced many-item designs, with applications to crowdsourcing semantic judgments.

    Science.gov (United States)

    Hollis, Geoff

    2018-04-01

    Best-worst scaling is a judgment format in which participants are presented with a set of items and have to choose the superior and inferior items in the set. Best-worst scaling generates a large quantity of information per judgment because each judgment allows for inferences about the rank value of all unjudged items. This property of best-worst scaling makes it a promising judgment format for research in psychology and natural language processing concerned with estimating the semantic properties of tens of thousands of words. A variety of different scoring algorithms have been devised in the previous literature on best-worst scaling. However, due to problems of computational efficiency, these scoring algorithms cannot be applied efficiently to cases in which thousands of items need to be scored. New algorithms are presented here for converting responses from best-worst scaling into item scores for thousands of items (many-item scoring problems). These scoring algorithms are validated through simulation and empirical experiments, and considerations related to noise, the underlying distribution of true values, and trial design are identified that can affect the relative quality of the derived item scores. The newly introduced scoring algorithms consistently outperformed scoring algorithms used in the previous literature on scoring many-item best-worst data.

  20. Internal Structure and Partial Invariance across Gender in the Spanish Version of the Reasoning Test Battery.

    Science.gov (United States)

    Elosua, Paula; Mujika, Josu

    2015-10-13

    The Reasoning Test Battery (BPR) is an instrument built on theories of the hierarchical organization of cognitive abilities and therefore consists of different tasks related with abstract, numerical, verbal, practical, spatial and mechanical reasoning. It was originally created in Belgium and later adapted to Portuguese. There are three forms of the battery consisting of different items and scales which cover an age range from 9 to 22. This paper focuses on the adaptation of the BPR to Spanish, and analyzes different aspects of its internal structure: (a) exploratory item factor analysis was applied to assess the presence of a dominant factor for each partial scale; (b) the general underlined model was evaluated through confirmatory factor analysis, and (c) factorial invariance across gender was studied. The sample consisted of 2624 Spanish students. The results concluded the presence of a general factor beyond the scales, with equivalent values for men and women, and gender differences in the factorial structure which affect the numerical reasoning, abstract reasoning and mechanical reasoning scales.

  1. Measuring attitude towards Buddhism and Sikhism : internal consistency reliability for two new instruments

    OpenAIRE

    Thanissaro, Phra Nicholas

    2011-01-01

    This paper describes and discusses the development and empirical properties of two new\\ud 24-item scales – one measuring attitude toward Buddhism and the other measuring attitude\\ud toward Sikhism. The scale is designed to facilitate inter-faith comparisons within the\\ud psychology of religion alongside the well-established Francis Scale of Attitude toward\\ud Christianity. Data were obtained from a multi-religious sample of 369 school pupils aged\\ud between 13 and 15 in London. Application of...

  2. The case for an international patient-reported outcomes measurement information system (PROMIS®) initiative.

    Science.gov (United States)

    Alonso, Jordi; Bartlett, Susan J; Rose, Matthias; Aaronson, Neil K; Chaplin, John E; Efficace, Fabio; Leplège, Alain; Lu, Aiping; Tulsky, David S; Raat, Hein; Ravens-Sieberer, Ulrike; Revicki, Dennis; Terwee, Caroline B; Valderas, Jose M; Cella, David; Forrest, Christopher B

    2013-12-20

    Patient-reported outcomes (PROs) play an increasingly important role in clinical practice and research. Modern psychometric methods such as item response theory (IRT) enable the creation of item banks that support fixed-length forms as well as computerized adaptive testing (CAT), often resulting in improved measurement precision and responsiveness. Here we describe and discuss the case for developing an international core set of PROs building from the US PROMIS® network.PROMIS is a U.S.-based cooperative group of research sites and centers of excellence convened to develop and standardize PRO measures across studies and settings. If extended to a global collaboration, PROMIS has the potential to transform PRO measurement by creating a shared, unifying terminology and metric for reporting of common symptoms and functional life domains. Extending a common set of standardized PRO measures to the international community offers great potential for improving patient-centered research, clinical trials reporting, population monitoring, and health care worldwide. Benefits of such standardization include the possibility of: international syntheses (such as meta-analyses) of research findings; international population monitoring and policy development; health services administrators and planners access to relevant information on the populations they serve; better assessment and monitoring of patients by providers; and improved shared decision making.The goal of the current PROMIS International initiative is to ensure that item banks are translated and culturally adapted for use in adults and children in as many countries as possible. The process includes 3 key steps: translation/cultural adaptation, calibration, and validation. A universal translation, an approach focusing on commonalities, rather than differences across versions developed in regions or countries speaking the same language, is proposed to ensure conceptual equivalence for all items. International item

  3. Good validity of the international spinal cord injury quality of life basic data set

    DEFF Research Database (Denmark)

    Post, M W M; Adriaansen, J J E; Charlifue, S

    2016-01-01

    STUDY DESIGN: Cross-sectional validation study. OBJECTIVES: To examine the construct and concurrent validity of the International Spinal Cord Injury (SCI) Quality of Life (QoL) Basic Data Set. SETTING: Dutch community. PARTICIPANTS: People 28-65 years of age, who obtained their SCI between 18...... and 35 years of age, were at least 10 years post SCI and were wheelchair users in daily life.Measure(s):The International SCI QoL Basic Data Set consists of three single items on satisfaction with life as a whole, physical health and psychological health (0=complete dissatisfaction; 10=complete...... and psychological health (0.70). CONCLUSIONS: This first validity study of the International SCI QoL Basic Data Set shows that it appears valid for persons with SCI....

  4. Order information is used to guide recall of long lists: Further evidence for the item-order account.

    Science.gov (United States)

    Forrin, Noah D; MacLeod, Colin M

    2016-06-01

    Differences in memory for item order have been used to explain the absence of between-subjects (i.e., pure-list) effects in free recall for several encoding techniques, including the production effect, the finding that reading aloud benefits memory compared with reading silently. Notably, however, evidence in support of the item-order account (Nairne, Riegler, & Serra, 1991) has derived primarily from short-list paradigms. We provide novel evidence that the item-order account also applies when recalling long lists. In Experiment 1, participants studied and then free recalled 3 different long lists of words: pure aloud, pure silent, and mixed (half aloud, half silent). A Bayesian analysis supported a null pure-list production effect, and subsequent order analyses were largely consistent with the item-order account. These findings indicate that order information is retained in long-term memory and is useful in guiding subsequent free recall. In Experiment 2, a distractor task was inserted between the study and test phases, ensuring that only long-term memory processes were involved in recall: The pattern of results remained consistent with the item-order account. Order information can be retained in long-term memory for long lists, and is useful in guiding subsequent free recall, extending the domain of the item-order account. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  5. The Technical Quality of Test Items Generated Using a Systematic Approach to Item Writing.

    Science.gov (United States)

    Siskind, Theresa G.; Anderson, Lorin W.

    The study was designed to examine the similarity of response options generated by different item writers using a systematic approach to item writing. The similarity of response options to student responses for the same item stems presented in an open-ended format was also examined. A non-systematic (subject matter expertise) approach and a…

  6. Order information and free recall: evaluating the item-order hypothesis.

    Science.gov (United States)

    Mulligan, Neil W; Lozito, Jeffrey P

    2007-05-01

    The item-order hypothesis proposes that order information plays an important role in recall from long-term memory, and it is commonly used to account for the moderating effects of experimental design in memory research. Recent research (Engelkamp, Jahn, & Seiler, 2003; McDaniel, DeLosh, & Merritt, 2000) raises questions about the assumptions underlying the item-order hypothesis. Four experiments tested these assumptions by examining the relationship between free recall and order memory for lists of varying length (8, 16, or 24 unrelated words or pictures). Some groups were given standard free-recall instructions, other groups were explicitly instructed to use order information in free recall, and other groups were given free-recall tests intermixed with tests of order memory (order reconstruction). The results for short lists were consistent with the assumptions of the item-order account. For intermediate-length lists, explicit order instructions and intermixed order tests made recall more reliant on order information, but under standard conditions, order information played little role in recall. For long lists, there was little evidence that order information contributed to recall. In sum, the assumptions of the item-order account held for short lists, received mixed support with intermediate lists, and received no support for longer lists.

  7. Cross-cultural adaptation and validation of the Danish consensus version of the 10-item Perceived Stress Scale

    DEFF Research Database (Denmark)

    Eskildsen, Anita; Dalgaard, Vita Ligaya; Nielsen, Kent Jacob

    2015-01-01

    with work-related stress complaints. METHODS: A consensus-building process was performed involving the authors of the three previous Danish translations and the consensus version was back-translated into English and pilot-tested. Psychometric properties of the final version were examined in a sample of 64...... patients with work-related stress complaints. RESULTS: The face validity, reliability, and internal consistency of the Danish consensus version of the PSS-10 were satisfactory, and convergent construct validity was confirmed. Receiver operating characteristic (ROC) curves of the change scores showed......OBJECTIVES: The aims of the present study were to (i) cross-culturally adapt a Danish consensus version of the 10-item Perceived Stress Scale (PSS-10) and (ii) evaluate its psychometric properties in terms of agreement, reliability, validity, responsiveness, and interpretability among patients...

  8. Comparing Two Types of Diagnostic Items to Evaluate Understanding of Heat and Temperature Concepts

    Science.gov (United States)

    Chu, Hye-Eun; Chandrasegaran, A. L.; Treagust, David F.

    2018-01-01

    The purpose of this research was to investigate an efficient method to assess year 8 (age 13-14) students' conceptual understanding of heat and temperature concepts. Two different types of instruments were used in this study: Type 1, consisting of multiple-choice items with open-ended justifications; and Type 2, consisting of two-tier…

  9. The influence of international and domestic events in the evolution of forest inventory and reporting consistency in the United States

    Science.gov (United States)

    W. Brad Smith

    2009-01-01

    This article takes a brief chronological look at resource inventory and reporting and links to international influences. It explores events as drivers of more consistent data within the United States and highlights key dates and events in the evolution of inventory policy and practice. From King George to L?Ecole nationale forestiere to the Food and Agriculture...

  10. Sharing the cost of redundant items

    DEFF Research Database (Denmark)

    Hougaard, Jens Leth; Moulin, Hervé

    2014-01-01

    We ask how to share the cost of finitely many public goods (items) among users with different needs: some smaller subsets of items are enough to serve the needs of each user, yet the cost of all items must be covered, even if this entails inefficiently paying for redundant items. Typical examples...... are network connectivity problems when an existing (possibly inefficient) network must be maintained. We axiomatize a family cost ratios based on simple liability indices, one for each agent and for each item, measuring the relative worth of this item across agents, and generating cost allocation rules...... additive in costs....

  11. Analysis of Chemical Composition of Non-Ferrous Metal Items from the Ananyino Burial Ground

    Directory of Open Access Journals (Sweden)

    Saprykina Irina А.

    2016-03-01

    Full Text Available The article presents results of an analysis conducted by the authors in order to study chemical composition of items from non-ferrous metals found on the Ananyino burial ground. A number of research methods, including OES, XRF and TXRF was applied to study a selection of 387 samples of arrow- and spearheads, celts, tail-pieces, warhammers, poleaxes, knives and daggers, as well as items of attire and jewelry, some sporadic details of harness and bridle. The fi ndings are quite comparable. The results were classifi ed by the geochemical principle of 1,0% alloyage threshold. It was found out that the sample primarily consists of copper items, including “pure” copper and copper with a wide range of trace elements (particularly, Ni, As, Sb. The core (48% consists of copper items with traces of antimony and arsenic, or “pure” copper (7%, tin or triple bronze (40%; it also includes some other types of alloys based on copper or silver (5%. As the analysis has shown, complex ores seem to be the most probable source of copper. Traditionally, the Urals, the Sayan and the Altay Mountains, Kazakhstan and the Northern Caucasus were regarded as the most probable minefi elds to supply ores to the barren regions of Eastern Europe. While ore sources for products made of metallurgical “pure” copper are localized within the Ural mining and metallurgical region, metal sources for items cast from different groups of alloys (rather than imports of ready-made products require further research.

  12. Assessing the Validity of Single-item Life Satisfaction Measures: Results from Three Large Samples

    Science.gov (United States)

    Cheung, Felix; Lucas, Richard E.

    2014-01-01

    Purpose The present paper assessed the validity of single-item life satisfaction measures by comparing single-item measures to the Satisfaction with Life Scale (SWLS) - a more psychometrically established measure. Methods Two large samples from Washington (N=13,064) and Oregon (N=2,277) recruited by the Behavioral Risk Factor Surveillance System (BRFSS) and a representative German sample (N=1,312) recruited by the Germany Socio-Economic Panel (GSOEP) were included in the present analyses. Single-item life satisfaction measures and the SWLS were correlated with theoretically relevant variables, such as demographics, subjective health, domain satisfaction, and affect. The correlations between the two life satisfaction measures and these variables were examined to assess the construct validity of single-item life satisfaction measures. Results Consistent across three samples, single-item life satisfaction measures demonstrated substantial degree of criterion validity with the SWLS (zero-order r = 0.62 – 0.64; disattenuated r = 0.78 – 0.80). Patterns of statistical significance for correlations with theoretically relevant variables were the same across single-item measures and the SWLS. Single-item measures did not produce systematically different correlations compared to the SWLS (average difference = 0.001 – 0.005). The average absolute difference in the magnitudes of the correlations produced by single-item measures and the SWLS were very small (average absolute difference = 0.015 −0.042). Conclusions Single-item life satisfaction measures performed very similarly compared to the multiple-item SWLS. Social scientists would get virtually identical answer to substantive questions regardless of which measure they use. PMID:24890827

  13. The emotion dysregulation inventory: Psychometric properties and item response theory calibration in an autism spectrum disorder sample.

    Science.gov (United States)

    Mazefsky, Carla A; Yu, Lan; White, Susan W; Siegel, Matthew; Pilkonis, Paul A

    2018-04-06

    Individuals with autism spectrum disorder (ASD) often present with prominent emotion dysregulation that requires treatment but can be difficult to measure. The Emotion Dysregulation Inventory (EDI) was created using methods developed by the Patient-Reported Outcomes Measurement Information System (PROMIS ® ) to capture observable indicators of poor emotion regulation. Caregivers of 1,755 youth with ASD completed 66 candidate EDI items, and the final 30 items were selected based on classical test theory and item response theory (IRT) analyses. The analyses identified two factors: (a) Reactivity, characterized by intense, rapidly escalating, sustained, and poorly regulated negative emotional reactions, and (b) Dysphoria, characterized by anhedonia, sadness, and nervousness. The final items did not show differential item functioning (DIF) based on gender, age, intellectual ability, or verbal ability. Because the final items were calibrated using IRT, even a small number of items offers high precision, minimizing respondent burden. IRT co-calibration of the EDI with related measures demonstrated its superiority in assessing the severity of emotion dysregulation with as few as seven items. Validity of the EDI was supported by expert review, its association with related constructs (e.g., anxiety and depression symptoms, aggression), higher scores in psychiatric inpatients with ASD compared to a community ASD sample, and demonstration of test-retest stability and sensitivity to change. In sum, the EDI provides an efficient and sensitive method to measure emotion dysregulation for clinical assessment, monitoring, and research in youth with ASD of any level of cognitive or verbal ability. Autism Res 2018. © 2018 International Society for Autism Research, Wiley Periodicals, Inc. This paper describes a new measure of poor emotional control called the Emotion Dysregulation Inventory (EDI). Caregivers of 1,755 youth with ASD completed candidate items, and advanced statistical

  14. Dissociating the neural correlates of intra-item and inter-item working-memory binding.

    Directory of Open Access Journals (Sweden)

    Carinne Piekema

    Full Text Available BACKGROUND: Integration of information streams into a unitary representation is an important task of our cognitive system. Within working memory, the medial temporal lobe (MTL has been conceptually linked to the maintenance of bound representations. In a previous fMRI study, we have shown that the MTL is indeed more active during working-memory maintenance of spatial associations as compared to non-spatial associations or single items. There are two explanations for this result, the mere presence of the spatial component activates the MTL, or the MTL is recruited to bind associations between neurally non-overlapping representations. METHODOLOGY/PRINCIPAL FINDINGS: The current fMRI study investigates this issue further by directly comparing intrinsic intra-item binding (object/colour, extrinsic intra-item binding (object/location, and inter-item binding (object/object. The three binding conditions resulted in differential activation of brain regions. Specifically, we show that the MTL is important for establishing extrinsic intra-item associations and inter-item associations, in line with the notion that binding of information processed in different brain regions depends on the MTL. CONCLUSIONS/SIGNIFICANCE: Our findings indicate that different forms of working-memory binding rely on specific neural structures. In addition, these results extend previous reports indicating that the MTL is implicated in working-memory maintenance, challenging the classic distinction between short-term and long-term memory systems.

  15. Relative amplitude preservation processing utilizing surface consistent amplitude correction. Part 4; Surface consistent amplitude correction wo mochiita sotai shinpuku hozon shori. 4

    Energy Technology Data Exchange (ETDEWEB)

    Saeki, T [Japan National Oil Corp., Tokyo (Japan). Technology Research Center

    1997-10-22

    Discussions were given on seismic exploration from the ground surface using the reflection method, for surface consistent amplitude correction from among effects imposed from the ground surface and a surface layer. Amplitude distribution on the reflection wave zone is complex. Therefore, items to be considered in making an analysis are multiple, such as estimation of spherical surface divergence effect and exponential attenuation effect, not only amplitude change through the surface layer. If all of these items are taken into consideration, burden of the work becomes excessive. As a method to solve this problem, utilization of amplitude in initial movement of a diffraction wave may be conceived. Distribution of the amplitude in initial movement of the diffraction wave shows a value relatively close to distribution of the vibration transmitting and receiving points. The reason for this is thought because characteristics of the vibration transmitting and receiving points related with waveline paths in the vicinity of the ground surface have no great difference both on the diffraction waves and on the reflection waves. The lecture described in this paper introduces an attempt of improving the efficiency of the surface consistent amplitude correction by utilizing the analysis of amplitude in initial movement of the diffraction wave. 4 refs., 2 figs.

  16. Assessment of test-retest reliability and internal consistency of the Wisconsin Gait Scale in hemiparetic post-stroke patients

    Directory of Open Access Journals (Sweden)

    Guzik Agnieszka

    2016-09-01

    Full Text Available Introduction: A proper assessment of gait pattern is a significant aspect in planning the process of teaching gait in hemiparetic post-stroke patients. The Wisconsin Gait Scale (WGS is an observational tool for assessing post-stroke patients’ gait. The aim of the study was to assess test-retest reliability and internal consistency of the WGS and examine correlations between gait assessment made with the WGS and gait speed, Brunnström scale, Ashworth’s scale and the Barthel Index.

  17. The 12-item World Health Organization Disability Assessment Schedule II (WHO-DAS II: a nonparametric item response analysis

    Directory of Open Access Journals (Sweden)

    Fernandez Ana

    2010-05-01

    Full Text Available Abstract Background Previous studies have analyzed the psychometric properties of the World Health Organization Disability Assessment Schedule II (WHO-DAS II using classical omnibus measures of scale quality. These analyses are sample dependent and do not model item responses as a function of the underlying trait level. The main objective of this study was to examine the effectiveness of the WHO-DAS II items and their options in discriminating between changes in the underlying disability level by means of item response analyses. We also explored differential item functioning (DIF in men and women. Methods The participants were 3615 adult general practice patients from 17 regions of Spain, with a first diagnosed major depressive episode. The 12-item WHO-DAS II was administered by the general practitioners during the consultation. We used a non-parametric item response method (Kernel-Smoothing implemented with the TestGraf software to examine the effectiveness of each item (item characteristic curves and their options (option characteristic curves in discriminating between changes in the underliying disability level. We examined composite DIF to know whether women had a higher probability than men of endorsing each item. Results Item response analyses indicated that the twelve items forming the WHO-DAS II perform very well. All items were determined to provide good discrimination across varying standardized levels of the trait. The items also had option characteristic curves that showed good discrimination, given that each increasing option became more likely than the previous as a function of increasing trait level. No gender-related DIF was found on any of the items. Conclusions All WHO-DAS II items were very good at assessing overall disability. Our results supported the appropriateness of the weights assigned to response option categories and showed an absence of gender differences in item functioning.

  18. Item Response Theory with Covariates (IRT-C): Assessing Item Recovery and Differential Item Functioning for the Three-Parameter Logistic Model

    Science.gov (United States)

    Tay, Louis; Huang, Qiming; Vermunt, Jeroen K.

    2016-01-01

    In large-scale testing, the use of multigroup approaches is limited for assessing differential item functioning (DIF) across multiple variables as DIF is examined for each variable separately. In contrast, the item response theory with covariate (IRT-C) procedure can be used to examine DIF across multiple variables (covariates) simultaneously. To…

  19. Adaptation and validation into Portuguese language of the six-item cognitive impairment test (6CIT).

    Science.gov (United States)

    Apóstolo, João Luís Alves; Paiva, Diana Dos Santos; Silva, Rosa Carla Gomes da; Santos, Eduardo José Ferreira Dos; Schultz, Timothy John

    2017-07-25

    The six-item cognitive impairment test (6CIT) is a brief cognitive screening tool that can be administered to older people in 2-3 min. To adapt the 6CIT for the European Portuguese and determine its psychometric properties based on a sample recruited from several contexts (nursing homes; universities for older people; day centres; primary health care units). The original 6CIT was translated into Portuguese and the draft Portuguese version (6CIT-P) was back-translated and piloted. The accuracy of the 6CIT-P was assessed by comparison with the Portuguese Mini-Mental State Examination (MMSE). A convenience sample of 550 older people from various geographical locations in the north and centre of the country was used. The test-retest reliability coefficient was high (r = 0.95). The 6CIT-P also showed good internal consistency (α = 0.88) and corrected item-total correlations ranged between 0.32 and 0.90. Total 6CIT-P and MMSE scores were strongly correlated. The proposed 6CIT-P threshold for cognitive impairment is ≥10 in the Portuguese population, which gives sensitivity of 82.78% and specificity of 84.84%. The accuracy of 6CIT-P, as measured by area under the ROC curve, was 0.91. The 6CIT-P has high reliability and validity and is accurate when used to screen for cognitive impairment.

  20. The randomly renewed general item and the randomly inspected item with exponential life distribution

    International Nuclear Information System (INIS)

    Schneeweiss, W.G.

    1979-01-01

    For a randomly renewed item the probability distributions of the time to failure and of the duration of down time and the expectations of these random variables are determined. Moreover, it is shown that the same theory applies to randomly checked items with exponential probability distribution of life such as electronic items. The case of periodic renewals is treated as an example. (orig.) [de

  1. Development and psychometric characteristics of the SCI-QOL Ability to Participate and Satisfaction with Social Roles and Activities item banks and short forms.

    Science.gov (United States)

    Heinemann, Allen W; Kisala, Pamela A; Hahn, Elizabeth A; Tulsky, David S

    2015-05-01

    To develop a spinal cord injury (SCI)-focused version of PROMIS and Neuro-QOL social domain item banks; evaluate the psychometric properties of items developed for adults with SCI; and report information to facilitate clinical and research use. We used a mixed-methods design to develop and evaluate Ability to Participate in Social Roles and Activities and Satisfaction with Social Roles and Activities items. Focus groups helped define the constructs; cognitive interviews helped revise items; and confirmatory factor analysis and item response theory methods helped calibrate item banks and evaluate differential item functioning related to demographic and injury characteristics. Five SCI Model System sites and one Veterans Administration medical center. The calibration sample consisted of 641 individuals; a reliability sample consisted of 245 individuals residing in the community. A subset of 27 Ability to Participate and 35 Satisfaction items demonstrated good measurement properties and negligible differential item functioning related to demographic and injury characteristics. The SCI-specific measures correlate strongly with the PROMIS and Neuro-QOL versions. Ten item short forms correlate >0.96 with the full banks. Variable-length CATs with a minimum of 4 items, variable-length CATs with a minimum of 8 items, fixed-length CATs of 10 items, and the 10-item short forms demonstrate construct coverage and measurement error that is comparable to the full item bank. The Ability to Participate and Satisfaction with Social Roles and Activities CATs and short forms demonstrate excellent psychometric properties and are suitable for clinical and research applications.

  2. Re-Examining Test Item Issues in the TIMSS Mathematics and Science Assessments

    Science.gov (United States)

    Wang, Jianjun

    2011-01-01

    As the largest international study ever taken in history, the Trend in Mathematics and Science Study (TIMSS) has been held as a benchmark to measure U.S. student performance in the global context. In-depth analyses of the TIMSS project are conducted in this study to examine key issues of the comparative investigation: (1) item flaws in mathematics…

  3. Validity of the SF-36 five-item Mental Health Index for major depression in functionally impaired, community-dwelling elderly patients.

    Science.gov (United States)

    Friedman, Bruce; Heisel, Marnin; Delavan, Rachel

    2005-11-01

    To examine criterion and construct validity of the five-item Mental Health Index (MHI-5) of the 36-item Short Form health survey (SF-36) in relation to the presence of major depression in functionally impaired, community-dwelling elderly patients and of eight subsamples defined by cognitive functioning, levels of functional impairment, and proxy report versus self-report. Cross-sectional observational. Nineteen counties in western New York, West Virginia, and Ohio. One thousand four hundred forty-four functionally impaired, community-dwelling Medicare beneficiaries aged 65 and older who participated in the Medicare Primary and Consumer-Directed Care Demonstration. MHI-5, Mini-International Neuropsychiatric Interview Major Depressive Episode (MINI-MDE) module. The MHI-5 demonstrated sufficient criterion validity (area under the receiver operating characteristic curve=0.837; sensitivity=78.7% and specificity=72.1% using a cutpoint of 59/60) with respect to the presence of depression for the entire sample. A significant correlation between MHI-5 scores and presence of major depression as identified using the MINI-MDE (Spearman correlation=-0.426, Pvalidity. Additional evidence is provided by decline in mean MHI-5 score as level of formal education and number of close friends and relatives decreased. All eight subsamples demonstrated similar criterion and construct validity. A Cronbach alpha of 0.794 demonstrated internal consistency reliability. This study provides evidence for adequate criterion and construct validity of the MHI-5 in relation to the presence of major depression among functionally impaired, community-dwelling elderly Medicare patients.

  4. 26 CFR 301.6231(a)(2)-1 - Persons whose tax liability is determined indirectly by partnership items.

    Science.gov (United States)

    2010-04-01

    ... in a partnership. (b) Shareholder of C corporation. A shareholder of a C corporation (as defined in... indirectly by partnership items. 301.6231(a)(2)-1 Section 301.6231(a)(2)-1 Internal Revenue INTERNAL REVENUE... Assessment In General § 301.6231(a)(2)-1 Persons whose tax liability is determined indirectly by partnership...

  5. Escala de Autoestima de Rosenberg (EAR: validade fatorial e consistência interna Rosenberg Self-Esteem Scale (RSS: factorial validity and internal consistency

    Directory of Open Access Journals (Sweden)

    Juliana Burges Sbicigo

    2010-12-01

    Full Text Available O objetivo deste estudo foi investigar as propriedades psicométricas da Escala de Autoestima de Rosenberg (EAR para adolescentes. Participaram 4.757 adolescentes, com idades entre 14 e 18 anos (M=15,77; DP=1,22, de nove cidades brasileiras. Os participantes responderam a uma versão da EAR adaptada para o Brasil. A análise fatorial exploratória apontou uma estrutura bidimensional, com 51.4% da variância explicada, que foi sustentada pela análise fatorial confirmatória. As análises de consistência interna realizadas por meio do coeficiente alfa de Cronbach, confiabilidade composta e variância extraída indicaram bons valores de fidedignidade. Diferenças nos escores de autoestima em função do sexo e da idade não foram encontradas. Conclui-se que a EAR apresenta qualidades psicométricas satisfatórias, mostrando-se um instrumento confiável para medir autoestima em adolescentes brasileiros.The aim of this study was to investigate the psychometrics properties of the Rosenberg Self-Esteem Scale (RSS for adolescents. The sample was composed of 4.757 adolescents, with ages between 14 and 18 years old (M=15.77; SD=1.22 in nine Brazilian cities. Participants responded to an adapted version of the RSS for Brazil. Exploratory factorial analysis showed a bidimensional structure, with 51.4% of explained variance. This result was supported by confirmatory factor analysis. The internal consistency analysis by Cronbach alpha coefficient, composite reliability and extracted variance indicated good reliability. Differences in self-esteem for gender and age were not found. These findings show that RSS has satisfactory psychometric qualities and it's a reliable instrument to assess self-esteem in Brazilian adolescents.

  6. The Internalized Stigma of Mental Illness (ISMI) scale: validation of the Japanese version.

    Science.gov (United States)

    Tanabe, Yosuke; Hayashi, Kunihiko; Ideno, Yuki

    2016-04-29

    The present study investigated the reliability and validity of a Japanese version of the Internalized Stigma of Mental Illness (ISMI) scale, designed to assess internalized stigma experienced by people with mental illness. A survey was conducted with 173 outpatients with mental illness who attended psychiatric clinics on a regular basis. A retest was conducted with 51 participants to evaluate the scale's psychometric properties. The alpha coefficient for the overall internal consistency was 0.91, and the coefficients of the individual ISMI subscales ranged from 0.57 to 0.81. The test-retest reliability was r = 0.85 (n = 51, P stigma resistance items excluded. The Japanese version of the ISMI scale demonstrated similar reliability and validity to the original English version. Therefore, the Japanese version of the ISMI scale may be an effective and valid tool to measure internalized stigma among Japanese people who have a mental illness.

  7. Development of Rasch-based item banks for the assessment of work performance in patients with musculoskeletal diseases.

    Science.gov (United States)

    Mueller, Evelyn A; Bengel, Juergen; Wirtz, Markus A

    2013-12-01

    This study aimed to develop a self-description assessment instrument to measure work performance in patients with musculoskeletal diseases. In terms of the International Classification of Functioning, Disability and Health (ICF), work performance is defined as the degree of meeting the work demands (activities) at the actual workplace (environment). To account for the fact that work performance depends on the work demands of the job, we strived to develop item banks that allow a flexible use of item subgroups depending on the specific work demands of the patients' jobs. Item development included the collection of work tasks from literature and content validation through expert surveys and patient interviews. The resulting 122 items were answered by 621 patients with musculoskeletal diseases. Exploratory factor analysis to ascertain dimensionality and Rasch analysis (partial credit model) for each of the resulting dimensions were performed. Exploratory factor analysis resulted in four dimensions, and subsequent Rasch analysis led to the following item banks: 'impaired productivity' (15 items), 'impaired cognitive performance' (18), 'impaired coping with stress' (13) and 'impaired physical performance' (low physical workload 20 items, high physical workload 10 items). The item banks exhibited person separation indices (reliability) between 0.89 and 0.96. The assessment of work performance adds the activities component to the more commonly employed participation component of the ICF-model. The four item banks can be adapted to specific jobs where necessary without losing comparability of person measures, as the item banks are based on Rasch analysis.

  8. Measuring everyday functional competence using the Rasch assessment of everyday activity limitations (REAL) item bank.

    Science.gov (United States)

    Oude Voshaar, Martijn A H; Ten Klooster, Peter M; Vonkeman, Harald E; van de Laar, Mart A F J

    2017-11-01

    Traditional patient-reported physical function instruments often poorly differentiate patients with mild-to-moderate disability. We describe the development and psychometric evaluation of a generic item bank for measuring everyday activity limitations in outpatient populations. Seventy-two items generated from patient interviews and mapped to the International Classification of Functioning, Disability and Health (ICF) domestic life chapter were administered to 1128 adults representative of the Dutch population. The partial credit model was fitted to the item responses and evaluated with respect to its assumptions, model fit, and differential item functioning (DIF). Measurement performance of a computerized adaptive testing (CAT) algorithm was compared with the SF-36 physical functioning scale (PF-10). A final bank of 41 items was developed. All items demonstrated acceptable fit to the partial credit model and measurement invariance across age, sex, and educational level. Five- and ten-item CAT simulations were shown to have high measurement precision, which exceeded that of SF-36 physical functioning scale across the physical function continuum. Floor effects were absent for a 10-item empirical CAT simulation, and ceiling effects were low (13.5%) compared with SF-36 physical functioning (38.1%). CAT also discriminated better than SF-36 physical functioning between age groups, number of chronic conditions, and respondents with or without rheumatic conditions. The Rasch assessment of everyday activity limitations (REAL) item bank will hopefully prove a useful instrument for assessing everyday activity limitations. T-scores obtained using derived measures can be used to benchmark physical function outcomes against the general Dutch adult population.

  9. Transcranial direct current stimulation over the parietal cortex alters bias in item and source memory tasks.

    Science.gov (United States)

    Pergolizzi, Denise; Chua, Elizabeth F

    2016-10-01

    Neuroimaging data have shown that activity in the lateral posterior parietal cortex (PPC) correlates with item recognition and source recollection, but there is considerable debate about its specific contributions. Performance on both item and source memory tasks were compared between participants who were given bilateral transcranial direct current stimulation (tDCS) over the parietal cortex to those given prefrontal or sham tDCS. The parietal tDCS group, but not the prefrontal group, showed decreased false recognition, and less bias in item and source discrimination tasks compared to sham stimulation. These results are consistent with a causal role of the PPC in item and source memory retrieval, likely based on attentional and decision-making biases. Copyright © 2016 Elsevier Inc. All rights reserved.

  10. 17 CFR 260.7a-16 - Inclusion of items, differentiation between items and answers, omission of instructions.

    Science.gov (United States)

    2010-04-01

    ... 17 Commodity and Securities Exchanges 3 2010-04-01 2010-04-01 false Inclusion of items, differentiation between items and answers, omission of instructions. 260.7a-16 Section 260.7a-16 Commodity and... INDENTURE ACT OF 1939 Formal Requirements § 260.7a-16 Inclusion of items, differentiation between items and...

  11. Combination of classical test theory (CTT) and item response theory (IRT) analysis to study the psychometric properties of the French version of the Quality of Life Enjoyment and Satisfaction Questionnaire-Short Form (Q-LES-Q-SF).

    Science.gov (United States)

    Bourion-Bédès, Stéphanie; Schwan, Raymund; Epstein, Jonathan; Laprevote, Vincent; Bédès, Alex; Bonnet, Jean-Louis; Baumann, Cédric

    2015-02-01

    The study aimed to examine the construct validity and reliability of the Quality of Life Enjoyment and Satisfaction Questionnaire-Short Form (Q-LES-Q-SF) according to both classical test and item response theories. The psychometric properties of the French version of this instrument were investigated in a cross-sectional, multicenter study. A total of 124 outpatients with a substance dependence diagnosis participated in the study. Psychometric evaluation included descriptive analysis, internal consistency, test-retest reliability, and validity. The dimensionality of the instrument was explored using a combination of the classical test, confirmatory factor analysis (CFA), and an item response theory analysis, the Person Separation Index (PSI), in a complementary manner. The results of the Q-LES-Q-SF revealed that the questionnaire was easy to administer and the acceptability was good. The internal consistency and the test-retest reliability were 0.9 and 0.88, respectively. All items were significantly correlated with the total score and the SF-12 used in the study. The CFA with one factor model was good, and for the unidimensional construct, the PSI was found to be 0.902. The French version of the Q-LES-Q-SF yielded valid and reliable clinical assessments of the quality of life for future research and clinical practice involving French substance abusers. In response to recent questioning regarding the unidimensionality or bidimensionality of the instrument and according to the underlying theoretical unidimensional construct used for its development, this study suggests the Q-LES-Q-SF as a one-dimension questionnaire in French QoL studies.

  12. Consistent guiding center drift theories

    International Nuclear Information System (INIS)

    Wimmel, H.K.

    1982-04-01

    Various guiding-center drift theories are presented that are optimized in respect of consistency. They satisfy exact energy conservation theorems (in time-independent fields), Liouville's theorems, and appropriate power balance equations. A theoretical framework is given that allows direct and exact derivation of associated drift-kinetic equations from the respective guiding-center drift-orbit theories. These drift-kinetic equations are listed. Northrop's non-optimized theory is discussed for reference, and internal consistency relations of G.C. drift theories are presented. (orig.)

  13. The Effects of Test Length and Sample Size on Item Parameters in Item Response Theory

    Science.gov (United States)

    Sahin, Alper; Anil, Duygu

    2017-01-01

    This study investigates the effects of sample size and test length on item-parameter estimation in test development utilizing three unidimensional dichotomous models of item response theory (IRT). For this purpose, a real language test comprised of 50 items was administered to 6,288 students. Data from this test was used to obtain data sets of…

  14. Approximation Preserving Reductions among Item Pricing Problems

    Science.gov (United States)

    Hamane, Ryoso; Itoh, Toshiya; Tomita, Kouhei

    When a store sells items to customers, the store wishes to determine the prices of the items to maximize its profit. Intuitively, if the store sells the items with low (resp. high) prices, the customers buy more (resp. less) items, which provides less profit to the store. So it would be hard for the store to decide the prices of items. Assume that the store has a set V of n items and there is a set E of m customers who wish to buy those items, and also assume that each item i ∈ V has the production cost di and each customer ej ∈ E has the valuation vj on the bundle ej ⊆ V of items. When the store sells an item i ∈ V at the price ri, the profit for the item i is pi = ri - di. The goal of the store is to decide the price of each item to maximize its total profit. We refer to this maximization problem as the item pricing problem. In most of the previous works, the item pricing problem was considered under the assumption that pi ≥ 0 for each i ∈ V, however, Balcan, et al. [In Proc. of WINE, LNCS 4858, 2007] introduced the notion of “loss-leader, ” and showed that the seller can get more total profit in the case that pi < 0 is allowed than in the case that pi < 0 is not allowed. In this paper, we derive approximation preserving reductions among several item pricing problems and show that all of them have algorithms with good approximation ratio.

  15. Which Statistic Should Be Used to Detect Item Preknowledge When the Set of Compromised Items Is Known?

    Science.gov (United States)

    Sinharay, Sandip

    2017-09-01

    Benefiting from item preknowledge is a major type of fraudulent behavior during educational assessments. Belov suggested the posterior shift statistic for detection of item preknowledge and showed its performance to be better on average than that of seven other statistics for detection of item preknowledge for a known set of compromised items. Sinharay suggested a statistic based on the likelihood ratio test for detection of item preknowledge; the advantage of the statistic is that its null distribution is known. Results from simulated and real data and adaptive and nonadaptive tests are used to demonstrate that the Type I error rate and power of the statistic based on the likelihood ratio test are very similar to those of the posterior shift statistic. Thus, the statistic based on the likelihood ratio test appears promising in detecting item preknowledge when the set of compromised items is known.

  16. Factoring handedness data: I. Item analysis.

    Science.gov (United States)

    Messinger, H B; Messinger, M I

    1995-12-01

    Recently in this journal Peters and Murphy challenged the validity of factor analyses done on bimodal handedness data, suggesting instead that right- and left-handers be studied separately. But bimodality may be avoidable if attention is paid to Oldfield's questionnaire format and instructions for the subjects. Two characteristics appear crucial: a two-column LEFT-RIGHT format for the body of the instrument and what we call Oldfield's Admonition: not to indicate strong preference for handedness item, such as write, unless "... the preference is so strong that you would never try to use the other hand unless absolutely forced to...". Attaining unimodality of an item distribution would seem to overcome the objections of Peters and Murphy. In a 1984 survey in Boston we used Oldfield's ten-item questionnaire exactly as published. This produced unimodal item distributions. With reflection of the five-point item scale and a logarithmic transformation, we achieved a degree of normalization for the items. Two surveys elsewhere based on Oldfield's 20-item list but with changes in the questionnaire format and the instructions, yielded markedly different item distributions with peaks at each extreme and sometimes in the middle as well.

  17. Item Modeling Concept Based on Multimedia Authoring

    Directory of Open Access Journals (Sweden)

    Janez Stergar

    2008-09-01

    Full Text Available In this paper a modern item design framework for computer based assessment based on Flash authoring environment will be introduced. Question design will be discussed as well as the multimedia authoring environment used for item modeling emphasized. Item type templates are a structured means of collecting and storing item information that can be used to improve the efficiency and security of the innovative item design process. Templates can modernize the item design, enhance and speed up the development process. Along with content creation, multimedia has vast potential for use in innovative testing. The introduced item design template is based on taxonomy of innovative items which have great potential for expanding the content areas and construct coverage of an assessment. The presented item design approach is based on GUI's – one for question design based on implemented item design templates and one for user interaction tracking/retrieval. The concept of user interfaces based on Flash technology will be discussed as well as implementation of the innovative approach of the item design forms with multimedia authoring. Also an innovative method for user interaction storage/retrieval based on PHP extending Flash capabilities in the proposed framework will be introduced.

  18. All projects related to | Page 536 | IDRC - International Development ...

    International Development Research Centre (IDRC) Digital Library (Canada)

    ... ORGANIZATIONAL CHANGE, AGRICULTURAL PRODUCTIVITY, ACCESS ... Building a Cross-sectoral, Multi-sectoral, Gender-sensitive Approach to ... in international trade and an important item on the international development agenda.

  19. Factors Influencing the Degree of Intrajudge Consistency during the Standard Setting Process.

    Science.gov (United States)

    Plake, Barbara S.; And Others

    The accuracy of standards obtained from judgmental methods is dependent on the quality of the judgments made by experts throughout the standard setting process. One important dimension of the quality of these judgments is the consistency of the judges' perceptions with item performance of minimally competent candidates. Several interrelated…

  20. Reliability and construct validity of the Spanish version of the 6-item CTS symptoms scale for outcomes assessment in carpal tunnel syndrome.

    Science.gov (United States)

    Rosales, Roberto S; Martin-Hidalgo, Yolanda; Reboso-Morales, Luis; Atroshi, Isam

    2016-03-03

    The purpose of this study was to assess the reliability and construct validity of the Spanish version of the 6-item carpal tunnel syndrome (CTS) symptoms scale (CTS-6). In this cross-sectional study 40 patients diagnosed with CTS based on clinical and neurophysiologic criteria, completed the standard Spanish versions of the CTS-6 and the disabilities of the arm, shoulder and hand (QuickDASH) scales on two occasions with a 1-week interval. Internal-consistency reliability was assessed with the Cronbach alpha coefficient and test-retest reliability with the intraclass correlation coefficient, two way random effect model and absolute agreement definition (ICC2,1). Cross-sectional precision was analyzed with the Standard Error of the Measurement (SEM). Longitudinal precision for test-retest reliability coefficient was assessed with the Standard Error of the Measurement difference (SEMdiff) and the Minimal Detectable Change at 95 % confidence level (MDC95). For assessing construct validity it was hypothesized that the CTS-6 would have a strong positive correlation with the QuickDASH, analyzed with the Pearson correlation coefficient (r). The standard Spanish version of the CTS-6 presented a Cronbach alpha of 0.81 with a SEM of 0.3. Test-retest reliability showed an ICC of 0.85 with a SRMdiff of 0.36 and a MDC95 of 0.7. The correlation between CTS-6 and the QuickDASH was concordant with the a priori formulated construct hypothesis (r 0.69) CONCLUSIONS: The standard Spanish version of the 6-item CTS symptoms scale showed good internal consistency, test-retest reliability and construct validity for outcomes assessment in CTS. The CTS-6 will be useful to clinicians and researchers in Spanish speaking parts of the world. The use of standardized outcome measures across countries also will facilitate comparison of research results in carpal tunnel syndrome.

  1. Differential item functioning of the patient-reported outcomes information system (PROMIS®) pain interference item bank by language (Spanish versus English).

    Science.gov (United States)

    Paz, Sylvia H; Spritzer, Karen L; Reise, Steven P; Hays, Ron D

    2017-06-01

    About 70% of Latinos, 5 years old or older, in the United States speak Spanish at home. Measurement equivalence of the PROMIS ® pain interference (PI) item bank by language of administration (English versus Spanish) has not been evaluated. A sample of 527 adult Spanish-speaking Latinos completed the Spanish version of the 41-item PROMIS ® pain interference item bank. We evaluate dimensionality, monotonicity and local independence of the Spanish-language items. Then we evaluate differential item functioning (DIF) using ordinal logistic regression with item response theory scores estimated from DIF-free "anchor" items. One of the 41 items in the Spanish version of the PROMIS ® PI item bank was identified as having significant uniform DIF. English- and Spanish-speaking subjects with the same level of pain interference responded differently to 1 of the 41 items in the PROMIS ® PI item bank. This item was not retained due to proprietary issues. The original English language item parameters can be used when estimating PROMIS ® PI scores.

  2. A strategy for optimizing item-pool management

    NARCIS (Netherlands)

    Ariel, A.; van der Linden, Willem J.; Veldkamp, Bernard P.

    2006-01-01

    Item-pool management requires a balancing act between the input of new items into the pool and the output of tests assembled from it. A strategy for optimizing item-pool management is presented that is based on the idea of a periodic update of an optimal blueprint for the item pool to tune item

  3. Using internal and external reviewers can help to optimise neonatal mortality and morbidity conferences.

    Science.gov (United States)

    Assaad, Michael-Andrew; Janvier, Annie; Lapointe, Anie

    2018-02-01

    This study determined whether there was a difference in the conclusions reached by neonatologists in morbidity and mortality conferences based on their level of involvement in a case. All neonatal deaths occurring between August 2014 and September 2015 at the neonatal intensive care unit of Sainte-Justine Hospital, Montreal, Quebec, Canada, were reviewed by internal physicians involved in the case and external physicians who were not. The reviewers were asked to identify positive and negative clinical practice items and provide written recommendations. These were classified into eight categories and compared for each case. During the study, 55 patients died leading to 110 reviews and a total of 590 positive and negative items. Most items were in the communication (25.2%), ethical decision-making (16.7%) and clinical management (14.8%) categories. Both the internal and external reviewers were in agreement 48.5% of the time for positive items and 44.8% for negative items. There were 242 written recommendations, which differed significantly among the internal and external reviewers. Reviews of neonatal deaths by two independent reviewers, internal physicians and external physicians, led to different positive and negative practice items and recommendations. This could allow for a richer discussion and improve recommendations for patient care. ©2017 Foundation Acta Paediatrica. Published by John Wiley & Sons Ltd.

  4. Effects of language dominance on item and order memory in free recall, serial recall and order reconstruction.

    Science.gov (United States)

    Francis, Wendy S; Baca, Yuzeth

    2014-01-01

    Spanish-English bilinguals (N = 144) performed free recall, serial recall and order reconstruction tasks in both English and Spanish. Long-term memory for both item and order information was worse in the less fluent language (L2) than in the more fluent language (L1). Item scores exhibited a stronger disadvantage for the L2 in serial recall than in free recall. Relative order scores were lower in the L2 for all three tasks, but adjusted scores for free and serial recall were equivalent across languages. Performance of English-speaking monolinguals (N = 72) was comparable to bilingual performance in the L1, except that monolinguals had higher adjusted order scores in free recall. Bilingual performance patterns in the L2 were consistent with the established effects of concurrent task performance on these memory tests, suggesting that the cognitive resources required for processing words in the L2 encroach on resources needed to commit item and order information to memory. These findings are also consistent with a model in which item memory is connected to the language system, order information is processed by separate mechanisms and attention can be allocated differentially to these two systems.

  5. Pattern analysis of total item score and item response of the Kessler Screening Scale for Psychological Distress (K6 in a nationally representative sample of US adults

    Directory of Open Access Journals (Sweden)

    Shinichiro Tomitaka

    2017-02-01

    Full Text Available Background Several recent studies have shown that total scores on depressive symptom measures in a general population approximate an exponential pattern except for the lower end of the distribution. Furthermore, we confirmed that the exponential pattern is present for the individual item responses on the Center for Epidemiologic Studies Depression Scale (CES-D. To confirm the reproducibility of such findings, we investigated the total score distribution and item responses of the Kessler Screening Scale for Psychological Distress (K6 in a nationally representative study. Methods Data were drawn from the National Survey of Midlife Development in the United States (MIDUS, which comprises four subsamples: (1 a national random digit dialing (RDD sample, (2 oversamples from five metropolitan areas, (3 siblings of individuals from the RDD sample, and (4 a national RDD sample of twin pairs. K6 items are scored using a 5-point scale: “none of the time,” “a little of the time,” “some of the time,” “most of the time,” and “all of the time.” The pattern of total score distribution and item responses were analyzed using graphical analysis and exponential regression model. Results The total score distributions of the four subsamples exhibited an exponential pattern with similar rate parameters. The item responses of the K6 approximated a linear pattern from “a little of the time” to “all of the time” on log-normal scales, while “none of the time” response was not related to this exponential pattern. Discussion The total score distribution and item responses of the K6 showed exponential patterns, consistent with other depressive symptom scales.

  6. Profile-likelihood Confidence Intervals in Item Response Theory Models.

    Science.gov (United States)

    Chalmers, R Philip; Pek, Jolynn; Liu, Yang

    2017-01-01

    Confidence intervals (CIs) are fundamental inferential devices which quantify the sampling variability of parameter estimates. In item response theory, CIs have been primarily obtained from large-sample Wald-type approaches based on standard error estimates, derived from the observed or expected information matrix, after parameters have been estimated via maximum likelihood. An alternative approach to constructing CIs is to quantify sampling variability directly from the likelihood function with a technique known as profile-likelihood confidence intervals (PL CIs). In this article, we introduce PL CIs for item response theory models, compare PL CIs to classical large-sample Wald-type CIs, and demonstrate important distinctions among these CIs. CIs are then constructed for parameters directly estimated in the specified model and for transformed parameters which are often obtained post-estimation. Monte Carlo simulation results suggest that PL CIs perform consistently better than Wald-type CIs for both non-transformed and transformed parameters.

  7. Archives: International Journal of Biological and Chemical Sciences

    African Journals Online (AJOL)

    Items 1 - 50 of 61 ... Archives: International Journal of Biological and Chemical Sciences. Journal Home > Archives: International Journal of Biological and Chemical Sciences. Log in or Register to get access to full text downloads.

  8. Examining the Effect of Reverse Worded Items on the Factor Structure of the Need for Cognition Scale.

    Directory of Open Access Journals (Sweden)

    Xijuan Zhang

    Full Text Available Reverse worded (RW items are often used to reduce or eliminate acquiescence bias, but there is a rising concern about their harmful effects on the covariance structure of the scale. Therefore, results obtained via traditional covariance analyses may be distorted. This study examined the effect of the RW items on the factor structure of the abbreviated 18-item Need for Cognition (NFC scale using confirmatory factor analysis. We modified the scale to create three revised versions, varying from no RW items to all RW items. We also manipulated the type of the RW items (polar opposite vs. negated. To each of the four scales, we fit four previously developed models. The four models included a 1-factor model, a 2-factor model distinguishing between positively worded (PW items and RW items, and two 2-factor models, each with one substantive factor and one method factor. Results showed that the number and type of the RW items affected the factor structure of the NFC scale. Consistent with previous research findings, for the original NFC scale, which contains both PW and RW items, the 1-factor model did not have good fit. In contrast, for the revised scales that had no RW items or all RW items, the 1-factor model had reasonably good fit. In addition, for the scale with polar opposite and negated RW items, the factor model with a method factor among the polar opposite items had considerably better fit than the 1-factor model.

  9. Dependability of technical items: Problems of standardization

    Science.gov (United States)

    Fedotova, G. A.; Voropai, N. I.; Kovalev, G. F.

    2016-12-01

    This paper is concerned with problems blown up in the development of a new version of the Interstate Standard GOST 27.002 "Industrial product dependability. Terms and definitions". This Standard covers a wide range of technical items and is used in numerous regulations, specifications, standard and technical documentation. A currently available State Standard GOST 27.002-89 was introduced in 1990. Its development involved a participation of scientists and experts from different technical areas, its draft was debated in different audiences and constantly refined, so it was a high quality document. However, after 25 years of its application it's become necessary to develop a new version of the Standard that would reflect the current understanding of industrial dependability, accounting for the changes taking place in Russia in the production, management and development of various technical systems and facilities. The development of a new version of the Standard makes it possible to generalize on a terminological level the knowledge and experience in the area of reliability of technical items, accumulated over a quarter of the century in different industries and reliability research schools, to account for domestic and foreign experience of standardization. Working on the new version of the Standard, we have faced a number of issues and problems on harmonization with the International Standard IEC 60500-192, caused first of all by different approaches to the use of terms and differences in the mentalities of experts from different countries. The paper focuses on the problems related to the chapter "Maintenance, restoration and repair", which caused difficulties for the developers to harmonize term definitions both with experts and the International Standard, which is mainly related to differences between the Russian concept and practice of maintenance and repair and foreign ones.

  10. Item response theory - A first approach

    Science.gov (United States)

    Nunes, Sandra; Oliveira, Teresa; Oliveira, Amílcar

    2017-07-01

    The Item Response Theory (IRT) has become one of the most popular scoring frameworks for measurement data, frequently used in computerized adaptive testing, cognitively diagnostic assessment and test equating. According to Andrade et al. (2000), IRT can be defined as a set of mathematical models (Item Response Models - IRM) constructed to represent the probability of an individual giving the right answer to an item of a particular test. The number of Item Responsible Models available to measurement analysis has increased considerably in the last fifteen years due to increasing computer power and due to a demand for accuracy and more meaningful inferences grounded in complex data. The developments in modeling with Item Response Theory were related with developments in estimation theory, most remarkably Bayesian estimation with Markov chain Monte Carlo algorithms (Patz & Junker, 1999). The popularity of Item Response Theory has also implied numerous overviews in books and journals, and many connections between IRT and other statistical estimation procedures, such as factor analysis and structural equation modeling, have been made repeatedly (Van der Lindem & Hambleton, 1997). As stated before the Item Response Theory covers a variety of measurement models, ranging from basic one-dimensional models for dichotomously and polytomously scored items and their multidimensional analogues to models that incorporate information about cognitive sub-processes which influence the overall item response process. The aim of this work is to introduce the main concepts associated with one-dimensional models of Item Response Theory, to specify the logistic models with one, two and three parameters, to discuss some properties of these models and to present the main estimation procedures.

  11. The Spanish version of the Self-Determination Inventory Student Report: application of item response theory to self-determination measurement.

    Science.gov (United States)

    Mumbardó-Adam, C; Guàrdia-Olmos, J; Giné, C; Raley, S K; Shogren, K A

    2018-04-01

    A new measure of self-determination, the Self-Determination Inventory: Student Report (Spanish version), has recently been adapted and empirically validated in Spanish language. As it is the first instrument intended to measure self-determination in youth with and without disabilities, there is a need to further explore and strengthen its psychometric analysis based on item response patterns. Through item response theory approach, this study examined item observed distributions across the essential characteristics of self-determination. The results demonstrated satisfactory to excellent item functioning patterns across characteristics, particularly within agentic action domains. Increased variability across items was also found within action-control beliefs dimensions, specifically within the self-realisation subdomain. These findings further support the instrument's psychometric properties and outline future research directions. © 2017 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.

  12. International conference on plasma physics

    International Nuclear Information System (INIS)

    Silin, V.P.; Sitenko, A.G.

    1985-01-01

    A brief report on the 6th International conference on plasma physics and on the 6th International Congress on plasma waves and plasma instabilities, which have taken place in summer 1984 in Losanne, is presented. Main items of the conference are enlightened, such as the general theory of a plasma, laboratory plasma, thermonuclear plasma, cosmic plasma and astrophysics

  13. The PROMIS Physical Function item bank was calibrated to a standardized metric and shown to improve measurement efficiency

    DEFF Research Database (Denmark)

    Rose, Matthias; Bjørner, Jakob; Gandek, Barbara

    2014-01-01

    OBJECTIVE: To document the development and psychometric evaluation of the Patient-Reported Outcomes Measurement Information System (PROMIS) Physical Function (PF) item bank and static instruments. STUDY DESIGN AND SETTING: The items were evaluated using qualitative and quantitative methods. A total...... response model was used to estimate item parameters, which were normed to a mean of 50 (standard deviation [SD]=10) in a US general population sample. RESULTS: The final bank consists of 124 PROMIS items covering upper, central, and lower extremity functions and instrumental activities of daily living...... to identify differences between age and disease groups. CONCLUSION: The item bank provides a common metric and can improve the measurement of PF by facilitating the standardization of patient-reported outcome measures and implementation of CATs for more efficient PF assessments over a larger range....

  14. Gender differences in national assessment of educational progress science items: What does i don't know really mean?

    Science.gov (United States)

    Linn, Marcia C.; de Benedictis, Tina; Delucchi, Kevin; Harris, Abigail; Stage, Elizabeth

    The National Assessment of Educational Progress Science Assessment has consistently revealed small gender differences on science content items but not on science inquiry items. This assessment differs from others in that respondents can choose I don't know rather than guessing. This paper examines explanations for the gender differences including (a) differential prior instruction, (b) differential response to uncertainty and use of the I don't know response, (c) differential response to figurally presented items, and (d) different attitudes towards science. Of these possible explanations, the first two received support. Females are more likely to use the I don't know response, especially for items with physical science content or masculine themes such as football. To ameliorate this situation we need more effective science instruction and more gender-neutral assessment items.

  15. Consistency of variables in PCS and JASTRO great area database

    International Nuclear Information System (INIS)

    Nishino, Tomohiro; Teshima, Teruki; Abe, Mitsuyuki

    1998-01-01

    To examine whether the Patterns of Care Study (PCS) reflects the data for the major areas in Japan, the consistency of variables in the PCS and in the major area database of the Japanese Society for Therapeutic Radiology and Oncology (JASTRO) were compared. Patients with esophageal or uterine cervical cancer were sampled from the PCS and JASTRO databases. From the JASTRO database, 147 patients with esophageal cancer and 95 patients with uterine cervical cancer were selected according to the eligibility criteria for the PCS. From the PCS, 455 esophageal and 432 uterine cervical cancer patients were surveyed. Six items for esophageal cancer and five items for uterine cervical cancer were selected for a comparative analysis of PCS and JASTRO databases. Esophageal cancer: Age (p=.0777), combination of radiation and surgery (p=.2136), and energy of the external beam (p=.6400) were consistent for PCS and JASTRO. However, the dose of the external beam for the non-surgery group showed inconsistency (p=.0467). Uterine cervical cancer: Age (p=.6301) and clinical stage (p=.8555) were consistent for the two sets of data. However, the energy of the external beam (p<.0001), dose rate of brachytherapy (p<.0001), and brachytherapy utilization by clinical stage (p<.0001) showed inconsistencies. It appears possible that the JASTRO major area database could not account for all patients' backgrounds and factors and that both surveys might have an imbalance in the stratification of institutions including differences in equipment and staffing patterns. (author)

  16. Investigating Separate and Concurrent Approaches for Item Parameter Drift in 3PL Item Response Theory Equating

    Science.gov (United States)

    Arce-Ferrer, Alvaro J.; Bulut, Okan

    2017-01-01

    This study examines separate and concurrent approaches to combine the detection of item parameter drift (IPD) and the estimation of scale transformation coefficients in the context of the common item nonequivalent groups design with the three-parameter item response theory equating. The study uses real and synthetic data sets to compare the two…

  17. Consistent force fields for saccharides

    DEFF Research Database (Denmark)

    Rasmussen, Kjeld

    1999-01-01

    Consistent force fields for carbohydrates were hitherto developed by extensive optimization ofpotential energy function parameters on experimental data and on ab initio results. A wide range of experimental data is used: internal structures obtained from gas phase electron diffraction and from x......-anomeric effects are accounted for without addition of specific terms. The work is done in the framework of the Consistent Force Field which originatedin Israel and was further developed in Denmark. The actual methods and strategies employed havebeen described previously. Extensive testing of the force field...

  18. Psychometric Properties of the Kidney Disease Quality of Life 36-Item Short-Form Survey (KDQOL-36) in the United States.

    Science.gov (United States)

    Peipert, John D; Bentler, Peter M; Klicko, Kristi; Hays, Ron D

    2018-04-01

    The Centers for Medicare & Medicaid Services require that dialysis patients' health-related quality of life be assessed annually. The primary instrument used for this purpose is the Kidney Disease Quality of Life 36-Item Short-Form Survey (KDQOL-36), which includes the SF-12 as its generic core and 3 kidney disease-targeted scales: Burden of Kidney Disease, Symptoms and Problems of Kidney Disease, and Effects of Kidney Disease. Despite its broad use, there has been limited evaluation of KDQOL-36's psychometric properties. Secondary analyses of data collected by the Medical Education Institute to evaluate the reliability and factor structure of the KDQOL-36 scales. KDQOL-36 responses from 70,786 dialysis patients in 1,381 US dialysis facilities that permitted data analysis were collected from June 1, 2015, through May 31, 2016, as part of routine clinical assessment. We assessed the KDQOL-36 scales' internal consistency reliability and dialysis facility-level reliability using coefficient alpha and 1-way analysis of variance. We evaluated the KDQOL-36's factor structure using item-to-total scale correlations and confirmatory factor analysis. Construct validity was examined using correlations between SF-12 and KDQOL-36 scales and "known groups" analyses. Each of the KDQOL-36's kidney disease-targeted scales had acceptable internal consistency reliability (α=0.83-0.85) and facility-level reliability (r=0.75-0.83). Item-scale correlations and a confirmatory factor analysis model evidenced the KDQOL-36's original factor structure. Construct validity was supported by large correlations between the SF-12 Physical Component Summary and Mental Component Summary (r=0.40-0.52) and the KDQOL-36 scale scores, as well as significant differences on the scale scores between patients receiving different types of dialysis, diabetic and nondiabetic patients, and patients who were employed full-time versus not. Use of secondary data from a clinical registry. The study provides

  19. Analysis of differential item functioning in the depression item bank from the Patient Reported Outcome Measurement Information System (PROMIS: An item response theory approach

    Directory of Open Access Journals (Sweden)

    JOSEPH P. EIMICKE

    2009-06-01

    Full Text Available The aims of this paper are to present findings related to differential item functioning (DIF in the Patient Reported Outcome Measurement Information System (PROMIS depression item bank, and to discuss potential threats to the validity of results from studies of DIF. The 32 depression items studied were modified from several widely used instruments. DIF analyses of gender, age and education were performed using a sample of 735 individuals recruited by a survey polling firm. DIF hypotheses were generated by asking content experts to indicate whether or not they expected DIF to be present, and the direction of the DIF with respect to the studied comparison groups. Primary analyses were conducted using the graded item response model (for polytomous, ordered response category data with likelihood ratio tests of DIF, accompanied by magnitude measures. Sensitivity analyses were performed using other item response models and approaches to DIF detection. Despite some caveats, the items that are recommended for exclusion or for separate calibration were "I felt like crying" and "I had trouble enjoying things that I used to enjoy." The item, "I felt I had no energy," was also flagged as evidencing DIF, and recommended for additional review. On the one hand, false DIF detection (Type 1 error was controlled to the extent possible by ensuring model fit and purification. On the other hand, power for DIF detection might have been compromised by several factors, including sparse data and small sample sizes. Nonetheless, practical and not just statistical significance should be considered. In this case the overall magnitude and impact of DIF was small for the groups studied, although impact was relatively large for some individuals.

  20. A leprosy clinical severity scale for erythema nodosum leprosum: An international, multicentre validation study of the ENLIST ENL Severity Scale.

    Science.gov (United States)

    Walker, Stephen L; Sales, Anna M; Butlin, C Ruth; Shah, Mahesh; Maghanoy, Armi; Lambert, Saba M; Darlong, Joydeepa; Rozario, Benjamin Jewel; Pai, Vivek V; Balagon, Marivic; Doni, Shimelis N; Hagge, Deanna A; Nery, José A C; Neupane, Kapil D; Baral, Suwash; Sangma, Biliom A; Alembo, Digafe T; Yetaye, Abeba M; Hassan, Belaynesh A; Shelemo, Mohammed B; Nicholls, Peter G; Lockwood, Diana N J

    2017-07-01

    We wished to validate our recently devised 16-item ENLIST ENL Severity Scale, a clinical tool for measuring the severity of the serious leprosy associated complication of erythema nodosum leprosum (ENL). We also wished to assess the responsiveness of the ENLIST ENL Severity Scale in detecting clinical change in patients with ENL. Participants, recruited from seven centres in six leprosy endemic countries, were assessed using the ENLIST ENL Severity Scale by two researchers, one of whom categorised the severity of ENL. At a subsequent visit a further assessment using the scale was made and both participant and physician rated the change in ENL using the subjective categories of "Much better", "somewhat better", "somewhat worse" and "much worse" compared with "No change" or "about the same". 447 participants were assessed with the ENLIST ENL Severity Scale. The Cronbach alpha of the scale and each item was calculated to determine the internal consistency of the scale. The ENLIST ENL Severity Scale had good internal consistency and this improved following removal of six items to give a Cronbach's alpha of 0.77. The cut off between mild ENL and more severe disease was 9 determined using ROC curves. The minimal important difference of the scale was determined to be 5 using both participant and physician ratings of change. The 10-item ENLIST ENL Severity Scale is the first valid, reliable and responsive measure of ENL severity and improves our ability to assess and compare patients and their treatments in this severe and difficult to manage complication of leprosy. The ENLIST ENL Severity Scale will assist physicians in the monitoring and treatment of patients with ENL. The ENLIST ENL Severity Scale is easy to apply and will be useful as an outcome measure in treatment studies and enable the standardisation of other clinical and laboratory ENL research.

  1. 76 FR 60474 - Commercial Item Handbook

    Science.gov (United States)

    2011-09-29

    ... DEPARTMENT OF DEFENSE Defense Acquisition Regulations System Commercial Item Handbook AGENCY.... SUMMARY: DoD has updated its Commercial Item Handbook. The purpose of the Handbook is to help acquisition personnel develop sound business strategies for procuring commercial items. DoD is seeking industry input on...

  2. Estimating True Short-Term Consistency in Vocational Interests: A Longitudinal SEM Approach

    Science.gov (United States)

    Gaudron, Jean-Philippe; Vautier, Stephane

    2007-01-01

    This study aimed at estimating the correlation between true scores (true consistency) of vocational interest over a short time span in a sample of 1089 adults. Participants were administered 54 items assessing vocational, family, and leisure interests twice over a 1-month period. Responses were analyzed with a multitrait (MT) model, which supposes…

  3. Spare Items validation

    International Nuclear Information System (INIS)

    Fernandez Carratala, L.

    1998-01-01

    There is an increasing difficulty for purchasing safety related spare items, with certifications by manufacturers for maintaining the original qualifications of the equipment of destination. The main reasons are, on the top of the logical evolution of technology, applied to the new manufactured components, the quitting of nuclear specific production lines and the evolution of manufacturers quality systems, originally based on nuclear codes and standards, to conventional industry standards. To face this problem, for many years different Dedication processes have been implemented to verify whether a commercial grade element is acceptable to be used in safety related applications. In the same way, due to our particular position regarding the spare part supplies, mainly from markets others than the american, C.N. Trillo has developed a methodology called Spare Items Validation. This methodology, which is originally based on dedication processes, is not a single process but a group of coordinated processes involving engineering, quality and management activities. These are to be performed on the spare item itself, its design control, its fabrication and its supply for allowing its use in destinations with specific requirements. The scope of application is not only focussed on safety related items, but also to complex design, high cost or plant reliability related components. The implementation in C.N. Trillo has been mainly curried out by merging, modifying and making the most of processes and activities which were already being performed in the company. (Author)

  4. Ordinal-To-Interval Scale Conversion Tables and National Items for the New Zealand Version of the WHOQOL-BREF.

    Directory of Open Access Journals (Sweden)

    Christian U Krägeloh

    Full Text Available The World Health Organisation Quality of Life (WHOQOL questionnaires are widely used around the world and can claim strong cross-cultural validity due to their development in collaboration with international field centres. To enhance conceptual equivalence of quality of life across cultures, optional national items are often developed for use alongside the core instrument. The present study outlines the development of national items for the New Zealand WHOQOL-BREF. Focus groups with members of the community as well as health experts discussed what constitutes quality of life in their opinion. Based on themes extracted of aspects not contained in the existing WHOQOL instrument, 46 candidate items were generated and subsequently rated for their importance by a random sample of 585 individuals from the general population. Applying importance criteria reduced these items to 24, which were then sent to another large random sample (n = 808 to be rated alongside the existing WHOQOL-BREF. A final set of five items met the criteria for national items. Confirmatory factor analysis identified four national items as belonging to the psychological domain of quality of life, and one item to the social domain. Rasch analysis validated these results and generated ordinal-to-interval conversion algorithms to allow use of parametric statistics for domain scores with and without national items.

  5. Test-retest reliability at the item level and total score level of the Norwegian version of the Spinal Cord Injury Falls Concern Scale (SCI-FCS).

    Science.gov (United States)

    Roaldsen, Kirsti Skavberg; Måøy, Åsa Blad; Jørgensen, Vivien; Stanghelle, Johan Kvalvik

    2016-05-01

    Translation of the Spinal Cord Injury Falls Concern Scale (SCI-FCS), and investigation of test-retest reliability on item-level and total-score-level. Translation, adaptation and test-retest study. A specialized rehabilitation setting in Norway. Fifty-four wheelchair users with a spinal cord injury. The median age of the cohort was 49 years, and the median number of years after injury was 13. Interventions/measurements: The SCI-FCS was translated and back-translated according to guidelines. Individuals answered the SCI-FCS twice over the course of one week. We investigated item-level test-retest reliability using Svensson's rank-based statistical method for disagreement analysis of paired ordinal data. For relative reliability, we analyzed the total-score-level test-retest reliability with intraclass correlation coefficients (ICC2.1), the standard error of measurement (SEM), and the smallest detectable change (SDC) for absolute reliability/measurement-error assessment and Cronbach's alpha for internal consistency. All items showed satisfactory percentage agreement (≥69%) between test and retest. There were small but non-negligible systematic disagreements among three items; we recovered an 11-13% higher chance for a lower second score. There was no disagreement due to random variance. The test-retest agreement (ICC2.1) was excellent (0.83). The SEM was 2.6 (12%), and the SDC was 7.1 (32%). The Cronbach's alpha was high (0.88). The Norwegian SCI-FCS is highly reliable for wheelchair users with chronic spinal cord injuries.

  6. Using Item Data for Evaluating Criterion Reference Measures with an Empirical Investigation of Index Consistency.

    Science.gov (United States)

    Meredith, Keith E.; Sabers, Darrell L.

    Data required for evaluating a Criterion Referenced Measurement (CRM) is described with a matrix. The information within the matrix consists of the "pass-fail" decisions of two CRMs. By differentially defining these two CRMs, different concepts of reliability and validity can be examined. Indices suggested for analyzing the matrix are listed with…

  7. Happiness as stable extraversion : internal consistency reliability and construct validity of the Oxford Happiness Questionnaire among undergraduate students\\ud \\ud

    OpenAIRE

    Robbins, Mandy; Francis, Leslie J.; Edwards, Bethan

    2010-01-01

    The Oxford Happiness Questionnaire (OHQ) was developed by Hills and Argyle (2002) to provide a more accessible equivalent measure of the Oxford Happiness Inventory (OHI). The aim of the present study was to examine the internal consistency reliability, and construct validity of this new instrument alongside the Eysenckian dimensional model of personality. The Oxford Happiness Questionnaire was completed by a sample of 131 undergraduate students together with the abbreviated form of the Revise...

  8. Archives: Nnamdi Azikiwe University Journal of International Law ...

    African Journals Online (AJOL)

    Items 1 - 11 of 11 ... Archives: Nnamdi Azikiwe University Journal of International Law and Jurisprudence. Journal Home > Archives: Nnamdi Azikiwe University Journal of International Law and Jurisprudence. Log in or Register to get access to full text downloads.

  9. Item Analysis in Introductory Economics Testing.

    Science.gov (United States)

    Tinari, Frank D.

    1979-01-01

    Computerized analysis of multiple choice test items is explained. Examples of item analysis applications in the introductory economics course are discussed with respect to three objectives: to evaluate learning; to improve test items; and to help improve classroom instruction. Problems, costs and benefits of the procedures are identified. (JMD)

  10. A review of the effects on IRT item parameter estimates with a focus on misbehaving common items in test equating

    Directory of Open Access Journals (Sweden)

    Michalis P Michaelides

    2010-10-01

    Full Text Available Many studies have investigated the topic of change or drift in item parameter estimates in the context of Item Response Theory. Content effects, such as instructional variation and curricular emphasis, as well as context effects, such as the wording, position, or exposure of an item have been found to impact item parameter estimates. The issue becomes more critical when items with estimates exhibiting differential behavior across test administrations are used as common for deriving equating transformations. This paper reviews the types of effects on IRT item parameter estimates and focuses on the impact of misbehaving or aberrant common items on equating transformations. Implications relating to test validity and the judgmental nature of the decision to keep or discard aberrant common items are discussed, with recommendations for future research into more informed and formal ways of dealing with misbehaving common items.

  11. A Review of the Effects on IRT Item Parameter Estimates with a Focus on Misbehaving Common Items in Test Equating.

    Science.gov (United States)

    Michaelides, Michalis P

    2010-01-01

    Many studies have investigated the topic of change or drift in item parameter estimates in the context of item response theory (IRT). Content effects, such as instructional variation and curricular emphasis, as well as context effects, such as the wording, position, or exposure of an item have been found to impact item parameter estimates. The issue becomes more critical when items with estimates exhibiting differential behavior across test administrations are used as common for deriving equating transformations. This paper reviews the types of effects on IRT item parameter estimates and focuses on the impact of misbehaving or aberrant common items on equating transformations. Implications relating to test validity and the judgmental nature of the decision to keep or discard aberrant common items are discussed, with recommendations for future research into more informed and formal ways of dealing with misbehaving common items.

  12. The PROMIS Physical Function item bank was calibrated to a standardized metric and shown to improve measurement efficiency.

    Science.gov (United States)

    Rose, Matthias; Bjorner, Jakob B; Gandek, Barbara; Bruce, Bonnie; Fries, James F; Ware, John E

    2014-05-01

    To document the development and psychometric evaluation of the Patient-Reported Outcomes Measurement Information System (PROMIS) Physical Function (PF) item bank and static instruments. The items were evaluated using qualitative and quantitative methods. A total of 16,065 adults answered item subsets (n>2,200/item) on the Internet, with oversampling of the chronically ill. Classical test and item response theory methods were used to evaluate 149 PROMIS PF items plus 10 Short Form-36 and 20 Health Assessment Questionnaire-Disability Index items. A graded response model was used to estimate item parameters, which were normed to a mean of 50 (standard deviation [SD]=10) in a US general population sample. The final bank consists of 124 PROMIS items covering upper, central, and lower extremity functions and instrumental activities of daily living. In simulations, a 10-item computerized adaptive test (CAT) eliminated floor and decreased ceiling effects, achieving higher measurement precision than any comparable length static tool across four SDs of the measurement range. Improved psychometric properties were transferred to the CAT's superior ability to identify differences between age and disease groups. The item bank provides a common metric and can improve the measurement of PF by facilitating the standardization of patient-reported outcome measures and implementation of CATs for more efficient PF assessments over a larger range. Copyright © 2014. Published by Elsevier Inc.

  13. A Comparison of the 27-Item and 12-Item Intolerance of Uncertainty Scales

    Science.gov (United States)

    Khawaja, Nigar G.; Yu, Lai Ngo Heidi

    2010-01-01

    The 27-item Intolerance of Uncertainty Scale (IUS) has become one of the most frequently used measures of Intolerance of Uncertainty. More recently, an abridged, 12-item version of the IUS has been developed. The current research used clinical (n = 50) and non-clinical (n = 56) samples to examine and compare the psychometric properties of both…

  14. Combining item response theory with multiple imputation to equate health assessment questionnaires.

    Science.gov (United States)

    Gu, Chenyang; Gutman, Roee

    2017-09-01

    The assessment of patients' functional status across the continuum of care requires a common patient assessment tool. However, assessment tools that are used in various health care settings differ and cannot be easily contrasted. For example, the Functional Independence Measure (FIM) is used to evaluate the functional status of patients who stay in inpatient rehabilitation facilities, the Minimum Data Set (MDS) is collected for all patients who stay in skilled nursing facilities, and the Outcome and Assessment Information Set (OASIS) is collected if they choose home health care provided by home health agencies. All three instruments or questionnaires include functional status items, but the specific items, rating scales, and instructions for scoring different activities vary between the different settings. We consider equating different health assessment questionnaires as a missing data problem, and propose a variant of predictive mean matching method that relies on Item Response Theory (IRT) models to impute unmeasured item responses. Using real data sets, we simulated missing measurements and compared our proposed approach to existing methods for missing data imputation. We show that, for all of the estimands considered, and in most of the experimental conditions that were examined, the proposed approach provides valid inferences, and generally has better coverages, relatively smaller biases, and shorter interval estimates. The proposed method is further illustrated using a real data set. © 2016, The International Biometric Society.

  15. More is not Always Better: The Relation between Item Response and Item Response Time in Raven’s Matrices

    Directory of Open Access Journals (Sweden)

    Frank Goldhammer

    2015-03-01

    Full Text Available The role of response time in completing an item can have very different interpretations. Responding more slowly could be positively related to success as the item is answered more carefully. However, the association may be negative if working faster indicates higher ability. The objective of this study was to clarify the validity of each assumption for reasoning items considering the mode of processing. A total of 230 persons completed a computerized version of Raven’s Advanced Progressive Matrices test. Results revealed that response time overall had a negative effect. However, this effect was moderated by items and persons. For easy items and able persons the effect was strongly negative, for difficult items and less able persons it was less negative or even positive. The number of rules involved in a matrix problem proved to explain item difficulty significantly. Most importantly, a positive interaction effect between the number of rules and item response time indicated that the response time effect became less negative with an increasing number of rules. Moreover, exploratory analyses suggested that the error type influenced the response time effect.

  16. Validation of a 15-item care-related regret coping scale for health-care professionals (RCS-HCP).

    Science.gov (United States)

    Courvoisier, Delphine Sophie; Cullati, Stephane; Ouchi, Rieko; Schmidt, Ralph Eric; Haller, Guy; Chopard, Pierre; Agoritsas, Thomas; Perneger, Thomas V

    2014-01-01

    Coping with difficult care-related situations is a common challenge for health-care professionals. How these professionals deal with the regrets they may experience following one of the many decisions and interventions they must make every day can have an impact on their own health and quality of life, and also on their patient care practices. To identify professionals most at need for extra support, development and validation of a tool measuring coping style are needed. We performed a survey of physicians and nurses of a French-speaking University hospital; 469 health-care professionals responded to the survey, and 175 responded to the same survey one-month later. Regret was assessed with the regret coping scale developed for this study, self-report questions on the frequency of regretted situations and the intensity of regret. Construct validity was assessed using measures of health-care professionals' quality of life (including job and life satisfaction, and self-reported health) as well as sleep problems and depression. Based on factor analysis and item response analysis, the initial 31-item scale was shortened to 15 items, which measured three types of strategies: problem-focused strategies (i.e., trying to find solutions, talking to colleagues) and two types of emotion-focused strategies, A (i.e., self-blame, rumination) and B (e.g., acceptance, emotional distance). All subscales showed high internal consistency (α >0.85). Overall, as expected, problem-focused and emotion-focused B strategies correlated with higher quality of life, fewer sleep problems and less depression, and emotion-focused A strategies showed the opposite pattern. The regret coping scale (RCS-HCP) is a valid and reliable measure of coping abilities of hospital-based health-care professionals.

  17. Sharing medicine: the candidacy of medicines and other household items for sharing, Dominican Republic.

    Directory of Open Access Journals (Sweden)

    Michael N Dohn

    Full Text Available People share medicines and problems can result from this behavior. Successful interventions to change sharing behavior will require understanding people's motives and purposes for sharing medicines. Better information about how medicines fit into the gifting and reciprocity system could be useful in designing interventions to modify medicine sharing behavior. However, it is uncertain how people situate medicines among other items that might be shared. This investigation is a descriptive study of how people sort medicines and other shareable items.This study in the Dominican Republic examined how a convenience sample (31 people sorted medicines and rated their shareability in relation to other common household items. We used non-metric multidimensional scaling to produce association maps in which the distances between items offer a visual representation of the collective opinion of the participants regarding the relationships among the items. In addition, from a pile sort constrained by four categories of whether sharing or loaning the item was acceptable (on a scale from not shareable to very shareable, we assessed the degree to which the participants rated the medicines as shareable compared to other items. Participants consistently grouped medicines together in all pile sort activities; yet, medicines were mixed with other items when rated by their candidacy to be shared. Compared to the other items, participants had more variability of opinion as to whether medicines should be shared.People think of medicines as a distinct group, suggesting that interventions might be designed to apply to medicines as a group. People's differing opinions as to whether it was appropriate to share medicines imply a degree of uncertainty or ambiguity that health promotion interventions might exploit to alter attitudes and behaviors. These findings have implications for the design of health promotion interventions to impact medicine sharing behavior.

  18. The relevance of the International Classification of Functioning, Disability and Health (ICF) in monitoring and evaluating Community-based Rehabilitation (CBR).

    Science.gov (United States)

    Madden, Rosamond H; Dune, Tinashe; Lukersmith, Sue; Hartley, Sally; Kuipers, Pim; Gargett, Alexandra; Llewellyn, Gwynnyth

    2014-01-01

    To examine the relevance of the International Classification of Functioning, Disability and Health (ICF) to CBR monitoring and evaluation by investigating the relationship between the ICF and information in published CBR monitoring and evaluation reports. A three-stage literature search and analysis method was employed. Studies were identified via online database searches for peer-reviewed journal articles, and hand-searching of CBR network resources, NGO websites and specific journals. From each study "information items" were extracted; extraction consistency among authors was established. Finally, the resulting information items were coded to ICF domains and categories, with consensus on coding being achieved. Thirty-six articles relating to monitoring and evaluating CBR were selected for analysis. Approximately one third of the 2495 information items identified in these articles (788 or 32%) related to concepts of functioning, disability and environment, and could be coded to the ICF. These information items were spread across the entire ICF classification with a concentration on Activities and Participation (49% of the 788 information items) and Environmental Factors (42%). The ICF is a relevant and potentially useful framework and classification, providing building blocks for the systematic recording of information pertaining to functioning and disability, for CBR monitoring and evaluation. Implications for Rehabilitation The application of the ICF, as one of the building blocks for CBR monitoring and evaluation, is a constructive step towards an evidence-base on the efficacy and outcomes of CBR programs. The ICF can be used to provide the infrastructure for functioning and disability information to inform service practitioners and enable national and international comparisons.

  19. Sensitivity of the Addiction Severity Index physical and sexual assault items: preliminary findings on gender differences

    NARCIS (Netherlands)

    Langeland, W.; van den Brink, W.; Draijer, N.; Hartgers, C.

    2001-01-01

    Evaluation of the Addiction Severity Index (ASI) as a screen for identifying sexual and physical assault histories. The sensitivity and specificity of the ASI assault items were examined in 146 alcoholic patients with the assault questions of the Composite International Diagnostic Interview

  20. Pengendalian Persediaan Primary Items dalam Logistik Konstruksi

    Directory of Open Access Journals (Sweden)

    Lady Lisya

    2016-09-01

    Full Text Available Construction logistics are activities that consist of ordering, storage and transportation of materials of construction projects. Storage material is logistics activity that ensure the availability of materials in project site. Generally, material storage activities have been conducted at the project site. Logistics construction is aimed to support the project activities that the completion schedule has been set. Construction logistics issues is determining the schedule of ordering materials so that the project can be implemented on schedule. The purpose of research is to determine the optimum ordering period for the primary items on the main building structure construction and designing inventory control cards as a mechanism for monitoring procurement of materials. This research has been obtained optimal ordering period for the primary items of main building structure with elements of the work using Fixed Period Requirement method. Inventories were already meet the material requirement of each period. Material management has been conducted based grouping approach as many as 31 groups. In addition, this research has proposed the inventory control cards as an instrument for material procurement monitoring. The implications of inventory control cards are coordinate contracting parties with vendors to plan the replenishment  of materials to meet the work schedule. Further research can be developed with other aspects such as integrated material order system between contractors and vendors to consider the safety stock. In addition, the information system for planning material is an important consideration for construction projects with large scale so that the companies can plan primary items inventory and other materials in the projects completion more easily, quickly and accurately.

  1. Examination of the PROMIS upper extremity item bank.

    Science.gov (United States)

    Hung, Man; Voss, Maren W; Bounsanga, Jerry; Crum, Anthony B; Tyser, Andrew R

    Clinical measurement. The psychometric properties of the PROMIS v1.2 UE item bank were tested on various samples prior to its release, but have not been fully evaluated among the orthopaedic population. This study assesses the performance of the UE item bank within the UE orthopaedic patient population. The UE item bank was administered to 1197 adult patients presenting to a tertiary orthopaedic clinic specializing in hand and UE conditions and was examined using traditional statistics and Rasch analysis. The UE item bank fits a unidimensional model (outfit MNSQ range from 0.64 to 1.70) and has adequate reliabilities (person = 0.84; item = 0.82) and local independence (item residual correlations range from -0.37 to 0.34). Only one item exhibits gender differential item functioning. Most items target low levels of function. The UE item bank is a useful clinical assessment tool. Additional items covering higher functions are needed to enhance validity. Supplemental testing is recommended for patients at higher levels of function until more high function UE items are developed. 2c. Copyright © 2016 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.

  2. Psychometric Consequences of Subpopulation Item Parameter Drift

    Science.gov (United States)

    Huggins-Manley, Anne Corinne

    2017-01-01

    This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…

  3. Location Indices for Ordinal Polytomous Items Based on Item Response Theory. Research Report. ETS RR-15-20

    Science.gov (United States)

    Ali, Usama S.; Chang, Hua-Hua; Anderson, Carolyn J.

    2015-01-01

    Polytomous items are typically described by multiple category-related parameters; situations, however, arise in which a single index is needed to describe an item's location along a latent trait continuum. Situations in which a single index would be needed include item selection in computerized adaptive testing or test assembly. Therefore single…

  4. 26 CFR 1.338-8 - Asset and stock consistency.

    Science.gov (United States)

    2010-04-01

    ... that are controlled foreign corporations. (6) Stock consistency. This section limits the application of... 26 Internal Revenue 4 2010-04-01 2010-04-01 false Asset and stock consistency. 1.338-8 Section 1... (CONTINUED) INCOME TAXES Effects on Corporation § 1.338-8 Asset and stock consistency. (a) Introduction—(1...

  5. Validation of the Chinese version 10-item Perceived Efficacy in Patient-Physician Interactions scale in patients with osteoarthritis

    Directory of Open Access Journals (Sweden)

    Zhao HW

    2016-10-01

    Full Text Available Huiwen Zhao,1 Wen Luo,1 Rose C Maly,2 Jun Liu,1 Junyi Lee,1 Yaning Cui1 1Joint Department, The 2nd Ward of Joint Surgery, Tianjin Hospital, Tianjin, the People’s Republic of China; 2Department of Family Medicine David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA Objectives: This study aimed to assess the reliability and validity of the Chinese version of the 10-item Perceived Efficacy in Patient–Physician Interaction (PEPPI-10 scale in hospitalized patients with severe knee osteoarthritis in the People’s Republic of China. Methods: Between January and March 2015, the Chinese versions of PEPPI, self-efficacy for exercise scale, osteoporosis self-efficacy scale, and modified fall efficacy scale were applied to assess 110 severe knee osteoarthritis patients who were hospitalized in the second ward of the department of arthroplasty surgery of Tianjin Hospital. Results: The Chinese version of the PEPPI-10 scale had a high coefficient of internal consistency (Cronbach’s α coefficient, 0.907. The score of the Chinese version of PEPPI was weakly correlated with the scores of the Chinese versions of self-efficacy for exercise scale, osteoporosis self-efficacy scale, and modified fall efficacy scale. Conclusion: The Chinese version of the PEPPI-10 scale exhibits sufficient internal consistency and convergent validity in hospitalized patients with severe knee osteoarthritis in the People’s Republic of China. Keywords: assessment of osteoarthritis, patient–physician communication, self-efficacy, instrument validation

  6. Few items in the thyroid-related quality of life instrument ThyPRO exhibited differential item functioning.

    Science.gov (United States)

    Watt, Torquil; Groenvold, Mogens; Hegedüs, Laszlo; Bonnema, Steen Joop; Rasmussen, Åse Krogh; Feldt-Rasmussen, Ulla; Bjorner, Jakob Bue

    2014-02-01

    To evaluate the extent of differential item functioning (DIF) within the thyroid-specific quality of life patient-reported outcome measure, ThyPRO, according to sex, age, education and thyroid diagnosis. A total of 838 patients with benign thyroid diseases completed the ThyPRO questionnaire (84 five-point items, 13 scales). Uniform and nonuniform DIF were investigated using ordinal logistic regression, testing for both statistical significance and magnitude (∆R(2) > 0.02). Scale level was estimated by the sum score, after purification. Twenty instances of DIF in 17 of the 84 items were found. Eight according to diagnosis, where the goiter scale was the one most affected, possibly due to differing perceptions in patients with auto-immune thyroid diseases compared to patients with simple goiter. Eight DIFs according to age were found, of which 5 were in positively worded items, which younger patients were more likely to endorse; one according to gender: women were more likely to report crying, and three according to educational level. The vast majority of DIF had only minor influence on the scale scores (0.1-2.3 points on the 0-100 scales), but two DIF corresponded to a difference of 4.6 and 9.8, respectively. Ordinal logistic regression identified DIF in 17 of 84 items. The potential impact of this on the present scales was low, but items displaying DIF could be avoided when developing abbreviated scales, where the potential impact of DIF (due to fewer items) will be larger.

  7. Delphi Method Validation of a Procedural Performance Checklist for Insertion of an Ultrasound-Guided Internal Jugular Central Line.

    Science.gov (United States)

    Hartman, Nicholas; Wittler, Mary; Askew, Kim; Manthey, David

    2016-01-01

    Placement of ultrasound-guided central lines is a critical skill for physicians in several specialties. Improving the quality of care delivered surrounding this procedure demands rigorous measurement of competency, and validated tools to assess performance are essential. Using the iterative, modified Delphi technique and experts in multiple disciplines across the United States, the study team created a 30-item checklist designed to assess competency in the placement of ultrasound-guided internal jugular central lines. Cronbach α was .94, indicating an excellent degree of internal consistency. Further validation of this checklist will require its implementation in simulated and clinical environments. © The Author(s) 2014.

  8. Loglinear multidimensional IRT models for polytomously scired Items

    NARCIS (Netherlands)

    Kelderman, Henk

    1988-01-01

    A loglinear item response theory (IRT) model is proposed that relates polytomously scored item responses to a multidimensional latent space. Each item may have a different response function where each item response may be explained by one or more latent traits. Item response functions may follow a

  9. 48 CFR 852.214-72 - Alternate item(s).

    Science.gov (United States)

    2010-10-01

    ... AND FORMS SOLICITATION PROVISIONS AND CONTRACT CLAUSES Texts of Provisions and Clauses 852.214-72... 2008) Bids on []* will be given equal consideration along with bids on []** and any such bids received... [].** * Contracting officer will insert an alternate item that is considered acceptable. ** Contracting officer will...

  10. The Australian Racism, Acceptance, and Cultural-Ethnocentrism Scale (RACES): item response theory findings.

    Science.gov (United States)

    Grigg, Kaine; Manderson, Lenore

    2016-03-17

    Racism and associated discrimination are pervasive and persistent challenges with multiple cumulative deleterious effects contributing to inequities in various health outcomes. Globally, research over the past decade has shown consistent associations between racism and negative health concerns. Such research confirms that race endures as one of the strongest predictors of poor health. Due to the lack of validated Australian measures of racist attitudes, RACES (Racism, Acceptance, and Cultural-Ethnocentrism Scale) was developed. Here, we examine RACES' psychometric properties, including the latent structure, utilising Item Response Theory (IRT). Unidimensional and Multidimensional Rating Scale Model (RSM) Rasch analyses were utilised with 296 Victorian primary school students and 182 adolescents and 220 adults from the Australian community. RACES was demonstrated to be a robust 24-item three-dimensional scale of Accepting Attitudes (12 items), Racist Attitudes (8 items), and Ethnocentric Attitudes (4 items). RSM Rasch analyses provide strong support for the instrument as a robust measure of racist attitudes in the Australian context, and for the overall factorial and construct validity of RACES across primary school children, adolescents, and adults. RACES provides a reliable and valid measure that can be utilised across the lifespan to evaluate attitudes towards all racial, ethnic, cultural, and religious groups. A core function of RACES is to assess the effectiveness of interventions to reduce community levels of racism and in turn inequities in health outcomes within Australia.

  11. Stability of the Spanish version of the five-item Francis Scale of Attitude toward Christianity.

    Science.gov (United States)

    Miranda-Tapia, Giskar Alonso; Cogollo, Zuleima; Herazo, Edwin; Campo-Arias, Adalberto

    2010-12-01

    The aim of this study was to establish test-retest reliability of a Spanish version of the Francis Scale of Attitude toward Christianity (Campo-Arias, Oviedo, & Cogollo, 2009) among adolescent students in Cartagena, Colombia. A group of ninth grade students from two public schools in Colombia (N = 157) completed the five-item scale. Cronbach's alphas were .74 and .76 in the first and second administrations, respectively. Both Pearson's rho and intra-class correlation coefficient were .69. A Spanish translation of the 5-item scale had consistent stability over four weeks.

  12. Search Results | Page 33 | IDRC - International Development ...

    International Development Research Centre (IDRC) Digital Library (Canada)

    Results 321 - 330 of 374 ... Negotiations around intellectual property rights (IPR) are increasingly a key factor in international trade and an important item on the international ... Private funding - including out-of-pocket expenditure - is a considerable source of health financing in Latin America, where families can face economic ...

  13. Development and Validation of the 34-Item Disability Screening Questionnaire (DSQ-34 for Use in Low and Middle Income Countries Epidemiological and Development Surveys.

    Directory of Open Access Journals (Sweden)

    Jean-François Trani

    Full Text Available Although 80% of persons with disabilities live in low and middle-income countries, there is still a lack of comprehensive, cross-culturally validated tools to identify persons facing activity limitations and functioning difficulties in these settings. In absence of such a tool, disability estimates vary considerably according to the methodology used, and policies are based on unreliable estimates.The Disability Screening Questionnaire composed of 27 items (DSQ-27 was initially designed by a group of international experts in survey development and disability in Afghanistan for a national survey. Items were selected based on major domains of activity limitations and functioning difficulties linked to an impairment as defined by the International Classification of Functioning, Disability and Health. Face, content and construct validity, as well as sensitivity and specificity were examined. Based on the results obtained, the tool was subsequently refined and expanded to 34 items, tested and validated in Darfur, Sudan. Internal consistency for the total DSQ-34 using a raw and standardized Cronbach's Alpha and within each domain using a standardized Cronbach's Alpha was examined in the Asian context (India and Nepal. Exploratory factor analysis (EFA using principal axis factoring (PAF evaluated the lowest number of factors to account for the common variance among the questions in the screen. Test-retest reliability was determined by calculating intraclass correlation (ICC and inter-rater reliability by calculating the kappa statistic; results were checked using Bland-Altman plots. The DSQ-34 was further tested for standard error of measurement (SEM and for the minimum detectable change (MDC. Good internal consistency was indicated by Cronbach's Alpha of 0.83/0.82 for India and 0.76/0.78 for Nepal. We confirmed our assumption for EFA using the Kaiser-Meyer-Olkin measure of sampling well above the accepted cutoff of 0.40 for India (0.82 and Nepal (0

  14. Development and Validation of the 34-Item Disability Screening Questionnaire (DSQ-34) for Use in Low and Middle Income Countries Epidemiological and Development Surveys.

    Science.gov (United States)

    Trani, Jean-François; Babulal, Ganesh Muneshwar; Bakhshi, Parul

    2015-01-01

    Although 80% of persons with disabilities live in low and middle-income countries, there is still a lack of comprehensive, cross-culturally validated tools to identify persons facing activity limitations and functioning difficulties in these settings. In absence of such a tool, disability estimates vary considerably according to the methodology used, and policies are based on unreliable estimates. The Disability Screening Questionnaire composed of 27 items (DSQ-27) was initially designed by a group of international experts in survey development and disability in Afghanistan for a national survey. Items were selected based on major domains of activity limitations and functioning difficulties linked to an impairment as defined by the International Classification of Functioning, Disability and Health. Face, content and construct validity, as well as sensitivity and specificity were examined. Based on the results obtained, the tool was subsequently refined and expanded to 34 items, tested and validated in Darfur, Sudan. Internal consistency for the total DSQ-34 using a raw and standardized Cronbach's Alpha and within each domain using a standardized Cronbach's Alpha was examined in the Asian context (India and Nepal). Exploratory factor analysis (EFA) using principal axis factoring (PAF) evaluated the lowest number of factors to account for the common variance among the questions in the screen. Test-retest reliability was determined by calculating intraclass correlation (ICC) and inter-rater reliability by calculating the kappa statistic; results were checked using Bland-Altman plots. The DSQ-34 was further tested for standard error of measurement (SEM) and for the minimum detectable change (MDC). Good internal consistency was indicated by Cronbach's Alpha of 0.83/0.82 for India and 0.76/0.78 for Nepal. We confirmed our assumption for EFA using the Kaiser-Meyer-Olkin measure of sampling well above the accepted cutoff of 0.40 for India (0.82) and Nepal (0.82). The

  15. The development of automaticity in short-term memory search: Item-response learning and category learning.

    Science.gov (United States)

    Cao, Rui; Nosofsky, Robert M; Shiffrin, Richard M

    2017-05-01

    In short-term-memory (STM)-search tasks, observers judge whether a test probe was present in a short list of study items. Here we investigated the long-term learning mechanisms that lead to the highly efficient STM-search performance observed under conditions of consistent-mapping (CM) training, in which targets and foils never switch roles across trials. In item-response learning, subjects learn long-term mappings between individual items and target versus foil responses. In category learning, subjects learn high-level codes corresponding to separate sets of items and learn to attach old versus new responses to these category codes. To distinguish between these 2 forms of learning, we tested subjects in categorized varied mapping (CV) conditions: There were 2 distinct categories of items, but the assignment of categories to target versus foil responses varied across trials. In cases involving arbitrary categories, CV performance closely resembled standard varied-mapping performance without categories and departed dramatically from CM performance, supporting the item-response-learning hypothesis. In cases involving prelearned categories, CV performance resembled CM performance, as long as there was sufficient practice or steps taken to reduce trial-to-trial category-switching costs. This pattern of results supports the category-coding hypothesis for sufficiently well-learned categories. Thus, item-response learning occurs rapidly and is used early in CM training; category learning is much slower but is eventually adopted and is used to increase the efficiency of search beyond that available from item-response learning. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  16. Macrostructural Treatment of Multi-word Lexical Items

    Directory of Open Access Journals (Sweden)

    Alenka Vrbinc

    2011-05-01

    Full Text Available The paper discusses the macrostructural treatment of multi-word lexical items in mono- and bilingual dictionaries. First, the classification of multi-word lexical items is presented, and special attention is paid to the discussion of compounds – a specific group of multi-word lexical items that is most commonly afforded headword status but whose inclusion in the headword list may also depend on spelling. Then the inclusion of multi-word lexical items in monolingual dictionaries is dealt with in greater detail, while the results of a short survey on the inclusion of five randomly chosen multi-word lexical items in seven English monolingual dictionaries are presented. The proposals as to how to treat these five multi-word lexical items in bilingual dictionaries are presented in the section about the inclusion of multi-word lexical items in bilingual dictionaries. The conclusion is that it is most important to take the users’ needs into consideration and to make any dictionary as user friendly as possible.

  17. Losing Items in the Psychogeriatric Nursing Home

    Directory of Open Access Journals (Sweden)

    J. van Hoof PhD

    2016-09-01

    Full Text Available Introduction: Losing items is a time-consuming occurrence in nursing homes that is ill described. An explorative study was conducted to investigate which items got lost by nursing home residents, and how this affects the residents and family caregivers. Method: Semi-structured interviews and card sorting tasks were conducted with 12 residents with early-stage dementia and 12 family caregivers. Thematic analysis was applied to the outcomes of the sessions. Results: The participants stated that numerous personal items and assistive devices get lost in the nursing home environment, which had various emotional, practical, and financial implications. Significant amounts of time are spent on trying to find items, varying from 1 hr up to a couple of weeks. Numerous potential solutions were identified by the interviewees. Discussion: Losing items often goes together with limitations to the participation of residents. Many family caregivers are reluctant to replace lost items, as these items may get lost again.

  18. The Computer Book of the Internal Medicine Resident: competence acquisition and achievement of learning objectives.

    Science.gov (United States)

    Oristrell, J; Oliva, J C; Casanovas, A; Comet, R; Jordana, R; Navarro, M

    2014-01-01

    The Computer Book of the Internal Medicine resident (CBIMR) is a computer program that was validated to analyze the acquisition of competences in teams of Internal Medicine residents. To analyze the characteristics of the rotations during the Internal Medicine residency and to identify the variables associated with the acquisition of clinical and communication skills, the achievement of learning objectives and resident satisfaction. All residents of our service (n=20) participated in the study during a period of 40 months. The CBIMR consisted of 22 self-assessment questionnaires specific for each rotation, with items on services (clinical workload, disease protocolization, resident responsibilities, learning environment, service organization and teamwork) and items on educational outcomes (acquisition of clinical and communication skills, achievement of learning objectives, overall satisfaction). Associations between services features and learning outcomes were analyzed using bivariate and multivariate analysis. An intense clinical workload, high resident responsibilities and disease protocolization were associated with the acquisition of clinical skills. High clinical competence and teamwork were both associated with better communication skills. Finally, an adequate learning environment was associated with increased clinical competence, the achievement of educational goals and resident satisfaction. Potentially modifiable variables related with the operation of clinical services had a significant impact on the acquisition of clinical and communication skills, the achievement of educational goals, and resident satisfaction during the specialized training in Internal Medicine. Copyright © 2013 Elsevier España, S.L. All rights reserved.

  19. Analyzing Multiple-Choice Questions by Model Analysis and Item Response Curves

    Science.gov (United States)

    Wattanakasiwich, P.; Ananta, S.

    2010-07-01

    In physics education research, the main goal is to improve physics teaching so that most students understand physics conceptually and be able to apply concepts in solving problems. Therefore many multiple-choice instruments were developed to probe students' conceptual understanding in various topics. Two techniques including model analysis and item response curves were used to analyze students' responses from Force and Motion Conceptual Evaluation (FMCE). For this study FMCE data from more than 1000 students at Chiang Mai University were collected over the past three years. With model analysis, we can obtain students' alternative knowledge and the probabilities for students to use such knowledge in a range of equivalent contexts. The model analysis consists of two algorithms—concentration factor and model estimation. This paper only presents results from using the model estimation algorithm to obtain a model plot. The plot helps to identify a class model state whether it is in the misconception region or not. Item response curve (IRC) derived from item response theory is a plot between percentages of students selecting a particular choice versus their total score. Pros and cons of both techniques are compared and discussed.

  20. Normative data for the Rappel libre/Rappel indicé à 16 items (16-item Free and Cued Recall) in the elderly Quebec-French population.

    Science.gov (United States)

    Dion, Mélissa; Potvin, Olivier; Belleville, Sylvie; Ferland, Guylaine; Renaud, Mélanie; Bherer, Louis; Joubert, Sven; Vallet, Guillaume T; Simard, Martine; Rouleau, Isabelle; Lecomte, Sarah; Macoir, Joël; Hudon, Carol

    2015-01-01

    Performance on verbal memory tests is generally associated with socio-demographic variables such as age, sex, and education level. Performance also varies between different cultural groups. The present study aimed to establish normative data for the Rappel libre/Rappel indicé à 16 items (16-item Free and Cued Recall; RL/RI-16), a French adaptation of the Free and Cued Selective Reminding Test (Buschke, 1984; Grober, Buschke, Crystal, Bang, & Dresner, 1988). The sample consisted of 566 healthy French-speaking older adults (50-88 years old) from the province of Quebec, Canada. Normative data for the RL/RI-16 were derived from 80% of the total sample (normative sample) and cross-validated using the remaining participants (20%; validation sample). The effects of participants' age, sex, and education level were assessed on different indices of memory performance. Results indicated that these variables were independently associated with performance. Normative data are presented as regression equations with standard deviations (symmetric distributions) and percentiles (asymmetric distributions).

  1. A Study on the Systematization of Classification Process for NSG Trigger List Items

    Energy Technology Data Exchange (ETDEWEB)

    Yang, Seunghyo; Tae, Jaewoong; Shin, Donghoon [Korea Institute of Nuclear Nonproliferation and Control/Nuclear Export Control Div., Daejeon (Korea, Republic of)

    2013-05-15

    In 1978, Nuclear Suppliers Group (NSG) was established to prevent nuclear items from being used for nuclear weapons. NSG drew up the NSG Guidelines (INFCIRC/254) that regulates export control items(so-called NSG trigger list items) and procedures. NSG recommends its member countries to reflect these guidelines on their export control systems and fulfill their obligations. Korea has carried out export controls on nuclear items by reflecting NSG Guidelines on Notice on Trade of Strategic Item of Foreign Trade Act since joining NSG in 1995. Nuclear export control starts with Classification that determines whether export items can be used for strategic items (goods and technologies that can be exclusively used for the manufacture, development and use of WMD). The standard of Classification is based on the NSG Guidelines. However, due to the qualitative characteristics of the Guidelines, there take place lots of difficulties in the Classification. Thus this study aims to suggest the systematic Classification process. Recently, the number of Classification requests is rapidly increasing due to the UAE commercial nuclear power plants and the Jordan reactors export. It is required to provide a more systematic Classification standard and process in order to provide an efficient and consistent Classification. Thus, this study analyzed limitations of EDP which causes difficulties in the process of classification due to its qualitative characteristics. Besides, it established systematic Classification process by quantitatively analyzing EDP. Consequently, it is expected that the results of this study will be used for as actual Classification. It still remains to establish a criterion of detailed information, which is one of the most important in the Classification for technology. Therefore, a further study will be conducted to establish a criterion of detailed information by analyzing Classification cases through the text mining techniques.

  2. A Study on the Systematization of Classification Process for NSG Trigger List Items

    International Nuclear Information System (INIS)

    Yang, Seunghyo; Tae, Jaewoong; Shin, Donghoon

    2013-01-01

    In 1978, Nuclear Suppliers Group (NSG) was established to prevent nuclear items from being used for nuclear weapons. NSG drew up the NSG Guidelines (INFCIRC/254) that regulates export control items(so-called NSG trigger list items) and procedures. NSG recommends its member countries to reflect these guidelines on their export control systems and fulfill their obligations. Korea has carried out export controls on nuclear items by reflecting NSG Guidelines on Notice on Trade of Strategic Item of Foreign Trade Act since joining NSG in 1995. Nuclear export control starts with Classification that determines whether export items can be used for strategic items (goods and technologies that can be exclusively used for the manufacture, development and use of WMD). The standard of Classification is based on the NSG Guidelines. However, due to the qualitative characteristics of the Guidelines, there take place lots of difficulties in the Classification. Thus this study aims to suggest the systematic Classification process. Recently, the number of Classification requests is rapidly increasing due to the UAE commercial nuclear power plants and the Jordan reactors export. It is required to provide a more systematic Classification standard and process in order to provide an efficient and consistent Classification. Thus, this study analyzed limitations of EDP which causes difficulties in the process of classification due to its qualitative characteristics. Besides, it established systematic Classification process by quantitatively analyzing EDP. Consequently, it is expected that the results of this study will be used for as actual Classification. It still remains to establish a criterion of detailed information, which is one of the most important in the Classification for technology. Therefore, a further study will be conducted to establish a criterion of detailed information by analyzing Classification cases through the text mining techniques

  3. Modified economic order quantity (EOQ model for items with imperfect quality: Game-theoretical approaches

    Directory of Open Access Journals (Sweden)

    Milad Elyasi

    2014-04-01

    Full Text Available In the recent decade, studying the economic order quantity (EOQ models with imperfect quality has appealed to many researchers. Only few papers are published discussing EOQ models with imperfect items in a supply chain. In this paper, a two-echelon decentralized supply chain consisting of a manufacture and a supplier that both face just in time (JIT inventory problem is considered. It is sought to find the optimal number of the shipments and the quantity of each shipment in a way that minimizes the both manufacturer’s and the supplier’s cost functions. To the authors’ best knowledge, this is the first paper that deals with imperfect items in a decentralized supply chain. Thereby, three different game theoretical solution approaches consisting of two non-cooperative games and a cooperative game are proposed. Comparing the results of three different scenarios with those of the centralized model, the conclusions are drawn to obtain the best approach.

  4. Re-Fitting for a Different Purpose: A Case Study of Item Writer Practices in Adapting Source Texts for a Test of Academic Reading

    Science.gov (United States)

    Green, Anthony; Hawkey, Roger

    2012-01-01

    The important yet under-researched role of item writers in the selection and adaptation of texts for high-stakes reading tests is investigated through a case study involving a group of trained item writers working on the International English Language Testing System (IELTS). In the first phase of the study, participants were invited to reflect in…

  5. Measurement properties of the WOMAC LK 3.1 pain scale.

    Science.gov (United States)

    Stratford, P W; Kennedy, D M; Woodhouse, L J; Spadoni, G F

    2007-03-01

    The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) is applied extensively to patients with osteoarthritis of the hip or knee. Previous work has challenged the validity of its physical function scale however an extensive evaluation of its pain scale has not been reported. Our purpose was to estimate internal consistency, factorial validity, test-retest reliability, and the standard error of measurement (SEM) of the WOMAC LK 3.1 pain scale. Four hundred and seventy-four patients with osteoarthritis of the hip or knee awaiting arthroplasty were administered the WOMAC. Estimates of internal consistency (coefficient alpha), factorial validity (confirmatory factor analysis), and the SEM based on internal consistency (SEM(IC)) were obtained. Test-retest reliability [Type 2,1 intraclass correlation coefficients (ICC)] and a corresponding SEM(TRT) were estimated on a subsample of 36 patients. Our estimates were: internal consistency alpha=0.84; SEM(IC)=1.48; Type 2,1 ICC=0.77; SEM(TRT)=1.69. Confirmatory factor analysis failed to support a single factor structure of the pain scale with uncorrelated error terms. Two comparable models provided excellent fit: (1) a model with correlated error terms between the walking and stairs items, and between night and sit items (chi2=0.18, P=0.98); (2) a two factor model with walking and stairs items loading on one factor, night and sit items loading on a second factor, and the standing item loading on both factors (chi2=0.18, P=0.98). Our examination of the factorial structure of the WOMAC pain scale failed to support a single factor and internal consistency analysis yielded a coefficient less than optimal for individual patient use. An alternate strategy to summing the five-item responses when considering individual patient application would be to interpret item responses separately or to sum only those items which display homogeneity.

  6. An Investigation of Invariance Properties of One, Two and Three Parameter Logistic Item Response Theory Models

    Directory of Open Access Journals (Sweden)

    O.A. Awopeju

    2017-12-01

    Full Text Available The study investigated the invariance properties of one, two and three parame-ter logistic item response theory models. It examined the best fit among one parameter logistic (1PL, two-parameter logistic (2PL and three-parameter logistic (3PL IRT models for SSCE, 2008 in Mathematics. It also investigated the degree of invariance of the IRT models based item difficulty parameter estimates in SSCE in Mathematics across different samples of examinees and examined the degree of invariance of the IRT models based item discrimination estimates in SSCE in Mathematics across different samples of examinees. In order to achieve the set objectives, 6000 students (3000 males and 3000 females were drawn from the population of 35262 who wrote the 2008 paper 1 Senior Secondary Certificate Examination (SSCE in Mathematics organized by National Examination Council (NECO. The item difficulty and item discrimination parameter estimates from CTT and IRT were tested for invariance using BLOG MG 3 and correlation analysis was achieved using SPSS version 20. The research findings were that two parameter model IRT item difficulty and discrimination parameter estimates exhibited invariance property consistently across different samples and that 2-parameter model was suitable for all samples of examinees unlike one-parameter model and 3-parameter model.

  7. Development of a Short Version of MSQOL-54 Using Factor Analysis and Item Response Theory.

    Directory of Open Access Journals (Sweden)

    Rosalba Rosato

    Full Text Available The Multiple Sclerosis Quality of Life-54 (MSQOL-54, 52 items grouped in 12 subscales plus two single items is the most used MS specific health related quality of life inventory.To develop a shortened version of the MSQOL-54.MSQOL-54 dimensionality and metric properties were investigated by confirmatory factor analysis (CFA and Rasch modelling (Partial Credit Model, PCM on MSQOL-54s completed by 473 MS patients. Their mean age was 41 years, 65% were women, and median Expanded Disability Status Scale (EDSS score was 2.0 (range 0-9.5. Differential item functioning (DIF was evaluated for gender, age and EDSS. Dimensionality of the resulting short version was assessed by exploratory factor analysis (EFA and CFA. Cognitive debriefing of the short instrument (vs. the original was then performed on 12 MS patients.CFA of MSQOL-54 subscales showed that the data fitted the overall model well. Two subscales (Role Limitations--Physical, Role Limitations--Emotional did not fit the PCM, and were removed; two other subscales (Health Perceptions, Social Function did not fit the model, but were retained as single items. Sexual Satisfaction (single-item subscale was also removed. The resulting MSQOL-29 consisted of 25 items grouped in 7 subscales, plus 4 single items. PCM fit statistics were within the acceptability range for all MSQOL-29 items except one which had significant DIF by age. EFA and CFA indicated adequate fit to the original two-factor (Physical and Mental Health Composites hypothesis. Cognitive debriefing confirmed that MSQOL-29 was acceptable and had lost no key items.The proposed MSQOL-29 is 50% shorter than MSQOL-54, yet preserves key quality of life dimensions. Prospective validation on a large, independent MS patient sample is ongoing.

  8. Detecting intrajudge inconsistency in standard setting using test items with a selected-response format

    NARCIS (Netherlands)

    van der Linden, Willem J.; Vos, Hendrik J.; Chang, Lei

    2002-01-01

    In judgmental standard setting experiments, it may be difficult to specify subjective probabilities that adequately take the properties of the items into account. As a result, these probabilities are not consistent with each other in the sense that they do not refer to the same borderline level of

  9. Item selection via Bayesian IRT models.

    Science.gov (United States)

    Arima, Serena

    2015-02-10

    With reference to a questionnaire that aimed to assess the quality of life for dysarthric speakers, we investigate the usefulness of a model-based procedure for reducing the number of items. We propose a mixed cumulative logit model, which is known in the psychometrics literature as the graded response model: responses to different items are modelled as a function of individual latent traits and as a function of item characteristics, such as their difficulty and their discrimination power. We jointly model the discrimination and the difficulty parameters by using a k-component mixture of normal distributions. Mixture components correspond to disjoint groups of items. Items that belong to the same groups can be considered equivalent in terms of both difficulty and discrimination power. According to decision criteria, we select a subset of items such that the reduced questionnaire is able to provide the same information that the complete questionnaire provides. The model is estimated by using a Bayesian approach, and the choice of the number of mixture components is justified according to information criteria. We illustrate the proposed approach on the basis of data that are collected for 104 dysarthric patients by local health authorities in Lecce and in Milan. Copyright © 2014 John Wiley & Sons, Ltd.

  10. Re-evaluating a vision-related quality of life questionnaire with item response theory (IRT and differential item functioning (DIF analyses

    Directory of Open Access Journals (Sweden)

    Knol Dirk L

    2011-09-01

    Full Text Available Abstract Background For the Low Vision Quality Of Life questionnaire (LVQOL it is unknown whether the psychometric properties are satisfactory when an item response theory (IRT perspective is considered. This study evaluates some essential psychometric properties of the LVQOL questionnaire in an IRT model, and investigates differential item functioning (DIF. Methods Cross-sectional data were used from an observational study among visually-impaired patients (n = 296. Calibration was performed for every dimension of the LVQOL in the graded response model. Item goodness-of-fit was assessed with the S-X2-test. DIF was assessed on relevant background variables (i.e. age, gender, visual acuity, eye condition, rehabilitation type and administration type with likelihood-ratio tests for DIF. The magnitude of DIF was interpreted by assessing the largest difference in expected scores between subgroups. Measurement precision was assessed by presenting test information curves; reliability with the index of subject separation. Results All items of the LVQOL dimensions fitted the model. There was significant DIF on several items. For two items the maximum difference between expected scores exceeded one point, and DIF was found on multiple relevant background variables. Item 1 'Vision in general' from the "Adjustment" dimension and item 24 'Using tools' from the "Reading and fine work" dimension were removed. Test information was highest for the "Reading and fine work" dimension. Indices for subject separation ranged from 0.83 to 0.94. Conclusions The items of the LVQOL showed satisfactory item fit to the graded response model; however, two items were removed because of DIF. The adapted LVQOL with 21 items is DIF-free and therefore seems highly appropriate for use in heterogeneous populations of visually impaired patients.

  11. Psychometric properties of the 25-item Work Limitations Questionnaire in Japan: factor structure, validity, and reliability in information and communication technology company employees.

    Science.gov (United States)

    Kono, Yuko; Matsushima, Eisuke; Uji, Masayo

    2014-02-01

    The 25-item Work Limitations Questionnaire (WLQ-25) measures presenteeism but has not been sufficiently validated in a Japanese population. A total of 451 employees from four information technology companies in Tokyo completed the WLQ-25 and questionnaires of other variables on two occasions, 2 weeks apart. The WLQ-25 yielded a two-factor structure: Cognitive Demand and Physical Demand. These subscales showed good internal consistency, and both were associated with adverse working conditions, greater perceived job strain, lower skill use, poorer workplace social support, and less satisfactory psychological adjustment. Intraclass correlation coefficients of the two WLQ-25 subscales between time 1 and time 2 were 0.78 and 0.55, respectively. This study suggests acceptable psychometric properties of the WLQ-25 in Japan.

  12. All projects related to | Page 528 | IDRC - International Development ...

    International Development Research Centre (IDRC) Digital Library (Canada)

    2008-01-01

    Negotiations around intellectual property rights (IPR) are increasingly a key factor in international trade and an important item on the international development agenda. Start Date: January 1, 2008. End Date: April 14, 2011. Topic: INTELLECTUAL PROPERTY, COMPUTER PROGRAMS, ACCESS TO INFORMATION.

  13. All projects related to | Page 551 | IDRC - International Development ...

    International Development Research Centre (IDRC) Digital Library (Canada)

    2008-01-01

    Negotiations around intellectual property rights (IPR) are increasingly a key factor in international trade and an important item on the international development agenda. Start Date: January 1, 2008. End Date: April 14, 2011. Topic: INTELLECTUAL PROPERTY, COMPUTER PROGRAMS, ACCESS TO INFORMATION.

  14. All projects related to | Page 552 | IDRC - International Development ...

    International Development Research Centre (IDRC) Digital Library (Canada)

    2008-01-01

    Negotiations around intellectual property rights (IPR) are increasingly a key factor in international trade and an important item on the international development agenda. Start Date: January 1, 2008. End Date: April 14, 2011. Topic: INTELLECTUAL PROPERTY, COMPUTER PROGRAMS, ACCESS TO INFORMATION.

  15. All projects related to | Page 522 | IDRC - International Development ...

    International Development Research Centre (IDRC) Digital Library (Canada)

    Region: Argentina, South America, Peru, Uruguay, North and Central America ... Topic: VIOLENCE AGAINST WOMEN, WOMEN'S RIGHTS, Gender ... Negotiations around intellectual property rights (IPR) are increasingly a key factor in international trade and an important item on the international development agenda.

  16. Determining an Imaging Literacy Curriculum for Radiation Oncologists: An International Delphi Study

    Energy Technology Data Exchange (ETDEWEB)

    Giuliani, Meredith E., E-mail: Meredith.Giuliani@rmp.uhn.on.ca [Radiation Medicine Program, Princess Margaret Cancer Centre, Toronto, Ontario (Canada); Department of Radiation Oncology, University of Toronto, Toronto, Ontario (Canada); Gillan, Caitlin [Radiation Medicine Program, Princess Margaret Cancer Centre, Toronto, Ontario (Canada); Department of Radiation Oncology, University of Toronto, Toronto, Ontario (Canada); Milne, Robin A. [Radiation Medicine Program, Princess Margaret Cancer Centre, Toronto, Ontario (Canada); Uchino, Minako; Millar, Barbara-Ann; Catton, Pamela [Radiation Medicine Program, Princess Margaret Cancer Centre, Toronto, Ontario (Canada); Department of Radiation Oncology, University of Toronto, Toronto, Ontario (Canada)

    2014-03-15

    Purpose: Rapid evolution of imaging technologies and their integration into radiation therapy practice demands that radiation oncology (RO) training curricula be updated. The purpose of this study was to develop an entry-to-practice image literacy competency profile. Methods and Materials: A list of 263 potential imaging competency items were assembled from international objectives of training. Expert panel eliminated redundant or irrelevant items to create a list of 97 unique potential competency items. An international 2-round Delphi process was conducted with experts in RO. In round 1, all experts scored, on a 9-point Likert scale, the degree to which they agreed an item should be included in the competency profile. Items with a mean score ≥7 were included, those 4 to 6 were reviewed in round 2, and items scored <4 were excluded. In round 2, items were discussed and subsequently ranked for inclusion or exclusion in the competency profile. Items with >75% voting for inclusion were included in the final competency profile. Results: Forty-nine radiation oncologists were invited to participate in round 1, and 32 (65%) did so. Participants represented 24 centers in 6 countries. Of the 97 items ranked in round 1, 80 had a mean score ≥7, 1 item had a score <4, and 16 items with a mean score of 4 to 6 were reviewed and rescored in round 2. In round 2, 4 items had >75% of participants voting for inclusion and were included; the remaining 12 were excluded. The final list of 84 items formed the final competency profile. The 84 enabling competency items were aggregated into the following 4 thematic groups of key competencies: (1) imaging fundamentals (42 items); (2) clinical application (27 items); (3) clinical management (5 items); and (4) professional practice (10 items). Conclusions: We present an imaging literacy competency profile which could constitute the minimum training standards in radiation oncology residency programs.

  17. Determining an Imaging Literacy Curriculum for Radiation Oncologists: An International Delphi Study

    International Nuclear Information System (INIS)

    Giuliani, Meredith E.; Gillan, Caitlin; Milne, Robin A.; Uchino, Minako; Millar, Barbara-Ann; Catton, Pamela

    2014-01-01

    Purpose: Rapid evolution of imaging technologies and their integration into radiation therapy practice demands that radiation oncology (RO) training curricula be updated. The purpose of this study was to develop an entry-to-practice image literacy competency profile. Methods and Materials: A list of 263 potential imaging competency items were assembled from international objectives of training. Expert panel eliminated redundant or irrelevant items to create a list of 97 unique potential competency items. An international 2-round Delphi process was conducted with experts in RO. In round 1, all experts scored, on a 9-point Likert scale, the degree to which they agreed an item should be included in the competency profile. Items with a mean score ≥7 were included, those 4 to 6 were reviewed in round 2, and items scored <4 were excluded. In round 2, items were discussed and subsequently ranked for inclusion or exclusion in the competency profile. Items with >75% voting for inclusion were included in the final competency profile. Results: Forty-nine radiation oncologists were invited to participate in round 1, and 32 (65%) did so. Participants represented 24 centers in 6 countries. Of the 97 items ranked in round 1, 80 had a mean score ≥7, 1 item had a score <4, and 16 items with a mean score of 4 to 6 were reviewed and rescored in round 2. In round 2, 4 items had >75% of participants voting for inclusion and were included; the remaining 12 were excluded. The final list of 84 items formed the final competency profile. The 84 enabling competency items were aggregated into the following 4 thematic groups of key competencies: (1) imaging fundamentals (42 items); (2) clinical application (27 items); (3) clinical management (5 items); and (4) professional practice (10 items). Conclusions: We present an imaging literacy competency profile which could constitute the minimum training standards in radiation oncology residency programs

  18. Knowledge of the ordinal position of list items in pigeons.

    Science.gov (United States)

    Scarf, Damian; Colombo, Michael

    2011-10-01

    Ordinal knowledge is a fundamental aspect of advanced cognition. It is self-evident that humans represent ordinal knowledge, and over the past 20 years it has become clear that nonhuman primates share this ability. In contrast, evidence that nonprimate species represent ordinal knowledge is missing from the comparative literature. To address this issue, in the present experiment we trained pigeons on three 4-item lists and then tested them with derived lists in which, relative to the training lists, the ordinal position of the items was either maintained or changed. Similar to the findings with human and nonhuman primates, our pigeons performed markedly better on the maintained lists compared to the changed lists, and displayed errors consistent with the view that they used their knowledge of ordinal position to guide responding on the derived lists. These findings demonstrate that the ability to acquire ordinal knowledge is not unique to the primate lineage. (PsycINFO Database Record (c) 2011 APA, all rights reserved).

  19. Conditioning factors of test-taking engagement in PIAAC: an exploratory IRT modelling approach considering person and item characteristics

    Directory of Open Access Journals (Sweden)

    Frank Goldhammer

    2017-11-01

    Full Text Available Abstract Background A potential problem of low-stakes large-scale assessments such as the Programme for the International Assessment of Adult Competencies (PIAAC is low test-taking engagement. The present study pursued two goals in order to better understand conditioning factors of test-taking disengagement: First, a model-based approach was used to investigate whether item indicators of disengagement constitute a continuous latent person variable by domain. Second, the effects of person and item characteristics were jointly tested using explanatory item response models. Methods Analyses were based on the Canadian sample of Round 1 of the PIAAC, with N = 26,683 participants completing test items in the domains of literacy, numeracy, and problem solving. Binary item disengagement indicators were created by means of item response time thresholds. Results The results showed that disengagement indicators define a latent dimension by domain. Disengagement increased with lower educational attainment, lower cognitive skills, and when the test language was not the participant’s native language. Gender did not exert any effect on disengagement, while age had a positive effect for problem solving only. An item’s location in the second of two assessment modules was positively related to disengagement, as was item difficulty. The latter effect was negatively moderated by cognitive skill, suggesting that poor test-takers are especially likely to disengage with more difficult items. Conclusions The negative effect of cognitive skill, the positive effect of item difficulty, and their negative interaction effect support the assumption that disengagement is the outcome of individual expectations about success (informed disengagement.

  20. Factor structure and internal reliability of an exercise health belief model scale in a Mexican population

    Directory of Open Access Journals (Sweden)

    Oscar Armando Esparza-Del Villar

    2017-03-01

    Full Text Available Abstract Background Mexico is one of the countries with the highest rates of overweight and obesity around the world, with 68.8% of men and 73% of women reporting both. This is a public health problem since there are several health related consequences of not exercising, like having cardiovascular diseases or some types of cancers. All of these problems can be prevented by promoting exercise, so it is important to evaluate models of health behaviors to achieve this goal. Among several models the Health Belief Model is one of the most studied models to promote health related behaviors. This study validates the first exercise scale based on the Health Belief Model (HBM in Mexicans with the objective of studying and analyzing this model in Mexico. Methods Items for the scale called the Exercise Health Belief Model Scale (EHBMS were developed by a health research team, then the items were applied to a sample of 746 participants, male and female, from five cities in Mexico. The factor structure of the items was analyzed with an exploratory factor analysis and the internal reliability with Cronbach’s alpha. Results The exploratory factor analysis reported the expected factor structure based in the HBM. The KMO index (0.92 and the Barlett’s sphericity test (p < 0.01 indicated an adequate and normally distributed sample. Items had adequate factor loadings, ranging from 0.31 to 0.92, and the internal consistencies of the factors were also acceptable, with alpha values ranging from 0.67 to 0.91. Conclusions The EHBMS is a validated scale that can be used to measure exercise based on the HBM in Mexican populations.

  1. Item response theory analysis of the life orientation test-revised: age and gender differential item functioning analyses.

    Science.gov (United States)

    Steca, Patrizia; Monzani, Dario; Greco, Andrea; Chiesi, Francesca; Primi, Caterina

    2015-06-01

    This study is aimed at testing the measurement properties of the Life Orientation Test-Revised (LOT-R) for the assessment of dispositional optimism by employing item response theory (IRT) analyses. The LOT-R was administered to a large sample of 2,862 Italian adults. First, confirmatory factor analyses demonstrated the theoretical conceptualization of the construct measured by the LOT-R as a single bipolar dimension. Subsequently, IRT analyses for polytomous, ordered response category data were applied to investigate the items' properties. The equivalence of the items across gender and age was assessed by analyzing differential item functioning. Discrimination and severity parameters indicated that all items were able to distinguish people with different levels of optimism and adequately covered the spectrum of the latent trait. Additionally, the LOT-R appears to be gender invariant and, with minor exceptions, age invariant. Results provided evidence that the LOT-R is a reliable and valid measure of dispositional optimism. © The Author(s) 2014.

  2. Using Item Response Theory to Develop Measures of Acquisitive and Protective Self-Monitoring From the Original Self-Monitoring Scale.

    Science.gov (United States)

    Wilmot, Michael P; Kostal, Jack W; Stillwell, David; Kosinski, Michal

    2017-07-01

    For the past 40 years, the conventional univariate model of self-monitoring has reigned as the dominant interpretative paradigm in the literature. However, recent findings associated with an alternative bivariate model challenge the conventional paradigm. In this study, item response theory is used to develop measures of the bivariate model of acquisitive and protective self-monitoring using original Self-Monitoring Scale (SMS) items, and data from two large, nonstudent samples ( Ns = 13,563 and 709). Results indicate that the new acquisitive (six-item) and protective (seven-item) self-monitoring scales are reliable, unbiased in terms of gender and age, and demonstrate theoretically consistent relations to measures of personality traits and cognitive ability. Additionally, by virtue of using original SMS items, previously collected responses can be reanalyzed in accordance with the alternative bivariate model. Recommendations for the reanalysis of archival SMS data, as well as directions for future research, are provided.

  3. Software Note: Using BILOG for Fixed-Anchor Item Calibration

    Science.gov (United States)

    DeMars, Christine E.; Jurich, Daniel P.

    2012-01-01

    The nonequivalent groups anchor test (NEAT) design is often used to scale item parameters from two different test forms. A subset of items, called the anchor items or common items, are administered as part of both test forms. These items are used to adjust the item calibrations for any differences in the ability distributions of the groups taking…

  4. Inventions on presenting textual items in Graphical User Interface

    OpenAIRE

    Mishra, Umakant

    2014-01-01

    Although a GUI largely replaces textual descriptions by graphical icons, the textual items are not completely removed. The textual items are inevitably used in window titles, message boxes, help items, menu items and popup items. Textual items are necessary for communicating messages that are beyond the limitation of graphical messages. However, it is necessary to harness the textual items on the graphical interface in such a way that they complement each other to produce the best effect. One...

  5. Science Library of Test Items. Volume Eighteen. A Collection of Multiple Choice Test Items Relating Mainly to Chemistry.

    Science.gov (United States)

    New South Wales Dept. of Education, Sydney (Australia).

    As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…

  6. Science Library of Test Items. Volume Seventeen. A Collection of Multiple Choice Test Items Relating Mainly to Biology.

    Science.gov (United States)

    New South Wales Dept. of Education, Sydney (Australia).

    As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…

  7. Science Library of Test Items. Volume Nineteen. A Collection of Multiple Choice Test Items Relating Mainly to Geology.

    Science.gov (United States)

    New South Wales Dept. of Education, Sydney (Australia).

    As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…

  8. Developing core elements and checklist items for global hospital antimicrobial stewardship programmes: a consensus approach.

    Science.gov (United States)

    Pulcini, C; Binda, F; Lamkang, A S; Trett, A; Charani, E; Goff, D A; Harbarth, S; Hinrichsen, S L; Levy-Hara, G; Mendelson, M; Nathwani, D; Gunturu, R; Singh, S; Srinivasan, A; Thamlikitkul, V; Thursky, K; Vlieghe, E; Wertheim, H; Zeng, M; Gandra, S; Laxminarayan, R

    2018-04-03

    With increasing global interest in hospital antimicrobial stewardship (AMS) programmes, there is a strong demand for core elements of AMS to be clearly defined on the basis of principles of effectiveness and affordability. To date, efforts to identify such core elements have been limited to Europe, Australia, and North America. The aim of this study was to develop a set of core elements and their related checklist items for AMS programmes that should be present in all hospitals worldwide, regardless of resource availability. A literature review was performed by searching Medline and relevant websites to retrieve a list of core elements and items that could have global relevance. These core elements and items were evaluated by an international group of AMS experts using a structured modified Delphi consensus procedure, using two-phased online in-depth questionnaires. The literature review identified seven core elements and their related 29 checklist items from 48 references. Fifteen experts from 13 countries in six continents participated in the consensus procedure. Ultimately, all seven core elements were retained, as well as 28 of the initial checklist items plus one that was newly suggested, all with ≥80% agreement; 20 elements and items were rephrased. This consensus on core elements for hospital AMS programmes is relevant to both high- and low-to-middle-income countries and could facilitate the development of national AMS stewardship guidelines and adoption by healthcare settings worldwide. Copyright © 2018 European Society of Clinical Microbiology and Infectious Diseases. All rights reserved.

  9. Feed mechanism and method for feeding minute items

    Science.gov (United States)

    Stringer, Timothy Kent [Bucyrus, KS; Yerganian, Simon Scott [Lee's Summit, MO

    2009-10-20

    A feeding mechanism and method for feeding minute items, such as capacitors, resistors, or solder preforms. The mechanism is adapted to receive a plurality of the randomly-positioned and randomly-oriented extremely small or minute items, and to isolate, orient, and position one or more of the items in a specific repeatable pickup location wherefrom they may be removed for use by, for example, a computer-controlled automated assembly machine. The mechanism comprises a sliding shelf adapted to receive and support the items; a wiper arm adapted to achieve a single even layer of the items; and a pushing arm adapted to push the items into the pickup location. The mechanism can be adapted for providing the items with a more exact orientation, and can also be adapted for use in a liquid environment.

  10. A Development of Group Decision Support System for Strategic Item Classification using Analytic Hierarchy Process

    International Nuclear Information System (INIS)

    Yoon, Sung Ho; Tae, Jae Woong; Yang, Seung Hyo; Shin, Dong Hoon

    2016-01-01

    Korea has carried out export controls on nuclear items that reflect the Nuclear Suppliers Group (NSG) guidelines (Notice on Trade of Strategic Item of Foreign Trade Act) since joining the NSG in 1995. Nuclear export control starts with classifications that determine whether export items are relevant to nuclear proliferation or not according to NSG guidelines. However, due to qualitative characteristics of nuclear item definition in the guidelines, classification spends a lot of time and effort to make a consensus. The aim of this study is to provide an analysis of an experts' group decision support system (GDSS) based on an analytic hierarchy process (AHP) for the classification of strategic items. The results of this study clearly demonstrated that a GDSS based on an AHP proved positive, systematically providing relative weight among the planning variables and objectives. By using an AHP we can quantify the subjective judgements of reviewers. An order of priority is derived from a numerical value. The verbal and fuzzy measurement of an AHP enables us reach a consensus among reviewers in a GDSS. An AHP sets common weight factors which are a priority of each attribute that represent the views of an entire group. It makes a consistency in decision-making that is important for classification

  11. A Development of Group Decision Support System for Strategic Item Classification using Analytic Hierarchy Process

    Energy Technology Data Exchange (ETDEWEB)

    Yoon, Sung Ho; Tae, Jae Woong; Yang, Seung Hyo; Shin, Dong Hoon [Korea Institute of Nuclear Nonproliferation and Control, Daejeon (Korea, Republic of)

    2016-05-15

    Korea has carried out export controls on nuclear items that reflect the Nuclear Suppliers Group (NSG) guidelines (Notice on Trade of Strategic Item of Foreign Trade Act) since joining the NSG in 1995. Nuclear export control starts with classifications that determine whether export items are relevant to nuclear proliferation or not according to NSG guidelines. However, due to qualitative characteristics of nuclear item definition in the guidelines, classification spends a lot of time and effort to make a consensus. The aim of this study is to provide an analysis of an experts' group decision support system (GDSS) based on an analytic hierarchy process (AHP) for the classification of strategic items. The results of this study clearly demonstrated that a GDSS based on an AHP proved positive, systematically providing relative weight among the planning variables and objectives. By using an AHP we can quantify the subjective judgements of reviewers. An order of priority is derived from a numerical value. The verbal and fuzzy measurement of an AHP enables us reach a consensus among reviewers in a GDSS. An AHP sets common weight factors which are a priority of each attribute that represent the views of an entire group. It makes a consistency in decision-making that is important for classification.

  12. Psychometric properties and factor structure of the 13-item satisfaction with daily occupations scale when used with people with mental health problems.

    Science.gov (United States)

    Eklund, Mona; Bäckström, Martin; Eakman, Aaron M

    2014-12-24

    In mental health care practice and research it is increasingly recognized that clients' subjective perceptions of everyday occupations, such as satisfaction, are important in recovery from mental illness. Instruments thus need to be developed to assess satisfaction with everyday occupations. The aim of the present study was to assess psychometric properties of the 13-item Satisfaction with Daily Occupation (SDO-13) when used with people with mental health problems, including its internal consistency, factor structure, construct validity and whether the scale produced ceiling or floor effects. An additional question concerned if the factor structure varied whether the participants were, or were not, presently engaged in the activity they rated. The interview-based SDO-13 includes items pertaining to work/studies, leisure, home maintenance, and self-care occupations. Whether the person currently performs an occupation or not, he/she is asked to indicate his/her satisfaction with that occupation. The SDO-13 was completed with 184 persons with mental illness. Residual variables were created to remove the variation linked with currently performing the targeted occupation or not and to assess the factor structure of the SDO-13. The indicators of general satisfaction with daily occupations, self-esteem and global functioning were used to assess construct validity. The statistical methods included tests of homogeneity, confirmatory factor analysis and Pearson correlations. The internal consistency was satisfactory at 0.79. A three-factor solution indicated that the construct behind the SDO-13 was composed of three facets; Taking care of oneself and the home, Work and studies, and Leisure and relaxation. The same factor structure was valid for both original scores and the residuals. An expected pattern of correlations with the indicators was mainly found, suggesting basic construct validity. No ceiling or floor effects were found. Taken together, the findings suggest the

  13. Applying Hierarchical Model Calibration to Automatically Generated Items.

    Science.gov (United States)

    Williamson, David M.; Johnson, Matthew S.; Sinharay, Sandip; Bejar, Isaac I.

    This study explored the application of hierarchical model calibration as a means of reducing, if not eliminating, the need for pretesting of automatically generated items from a common item model prior to operational use. Ultimately the successful development of automatic item generation (AIG) systems capable of producing items with highly similar…

  14. 41 CFR 101-27.404 - Review of items.

    Science.gov (United States)

    2010-07-01

    ... 41 Public Contracts and Property Management 2 2010-07-01 2010-07-01 true Review of items. 101-27.404 Section 101-27.404 Public Contracts and Property Management Federal Property Management...-Elimination of Items From Inventory § 101-27.404 Review of items. Except for standby or reserve stocks, items...

  15. Towards an authoring system for item construction

    NARCIS (Netherlands)

    Rikers, Jos H.A.N.

    1988-01-01

    The process of writing test items is analyzed, and a blueprint is presented for an authoring system for test item writing to reduce invalidity and to structure the process of item writing. The developmental methodology is introduced, and the first steps in the process are reported. A historical

  16. Item Analysis to Improve Reliability for an Internal Medicine Undergraduate OSCE

    Science.gov (United States)

    Auewarakul, Chirayu; Downing, Steven M.; Praditsuwan, Rungnirand; Jaturatamrong, Uapong

    2005-01-01

    Utilization of objective structured clinical examinations (OSCEs) for final assessment of medical students in Internal Medicine requires a representative sample of OSCE stations. The reliability and generalizability of OSCE scores provides validity evidence for OSCE scores and supports its contribution to the final clinical grade of medical…

  17. Modeling Local Item Dependence in Cloze and Reading Comprehension Test Items Using Testlet Response Theory

    Science.gov (United States)

    Baghaei, Purya; Ravand, Hamdollah

    2016-01-01

    In this study the magnitudes of local dependence generated by cloze test items and reading comprehension items were compared and their impact on parameter estimates and test precision was investigated. An advanced English as a foreign language reading comprehension test containing three reading passages and a cloze test was analyzed with a…

  18. 10 CFR 835.605 - Labeling items and containers.

    Science.gov (United States)

    2010-01-01

    ... 10 Energy 4 2010-01-01 2010-01-01 false Labeling items and containers. 835.605 Section 835.605... items and containers. Except as provided at § 835.606, each item or container of radioactive material... information to permit individuals handling, using, or working in the vicinity of the items or containers to...

  19. Internal Consistency of Reliability Assessment of the Persian version of the ‘Home Falls and Accident Screening Tool’

    Directory of Open Access Journals (Sweden)

    Afsoon Hassani Mehraban

    2013-10-01

    Full Text Available Objectives: Falling is a common problem among the elderly. Falling indoors and outdoors is highly prevalent among the Iranian elderly. Therefore, identification of the contributing factors at home and their modification can reduce falls and subsequent injuries inthe elderly. The goal of this study was to identify the elderly at risk of fall, using the ‘Home Falls and Accident Screening Tool’ (HOME FAST, and to determine the reliability of this tool. Methods: Sixty old people were selected from five geographical regions of Tehran through the Local Town Councils. Participants were aged 60 to 65 years, and HOME FAST was used to assess inter rater and test- retest reliability. Results: Test-retest reliability in the study showed that agreement between the items of the Persian version of HOME FAST was over 0.8, which is a very good reliability. The agreement between the domains was 0.65-1.00, indicative of moderate to high reliability. Moreover, the Inter rater reliability of the items was over 0.8, which is also very good. The correlation of each item between the domains was 0.01-1.00, which shows poor to high reliability. Discussion: This study showed that the reliability of the Persian version of HOME FAST is high. This tool can therefore be used as an appropriate screening tool by professionals to take necessary preventive measures for the Iranian elderly population.

  20. Obtaining a Proportional Allocation by Deleting Items

    NARCIS (Netherlands)

    Dorn, B.; de Haan, R.; Schlotter, I.; Röthe, J.

    2017-01-01

    We consider the following control problem on fair allocation of indivisible goods. Given a set I of items and a set of agents, each having strict linear preference over the items, we ask for a minimum subset of the items whose deletion guarantees the existence of a proportional allocation in the

  1. Item-Based Top-N Recommendation Algorithms

    Science.gov (United States)

    2003-01-20

    basket of items, utilized by many e-commerce sites, cannot take advantage of pre-computed user-to-user similarities. Finally, even though the...not discriminate between items that are present in frequent itemsets and items that are not, while still maintaining the computational advantages of...453219 0.02% 7.74 ccard 42629 68793 398619 0.01% 9.35 ecommerce 6667 17491 91222 0.08% 13.68 em 8002 1648 769311 5.83% 96.14 ml 943 1682 100000 6.31

  2. Internally consistent gamma ray burst time history phenomenology

    International Nuclear Information System (INIS)

    Cline, T.L.

    1985-01-01

    A phenomenology for gamma ray burst time histories is outlined. Order of their generally chaotic appearance is attempted, based on the speculation that any one burst event can be represented above 150 keV as a superposition of similarly shaped increases of varying intensity. The increases can generally overlap, however, confusing the picture, but a given event must at least exhibit its own limiting characteristic rise and decay times if the measurements are made with instruments having adequate temporal resolution. Most catalogued observations may be of doubtful or marginal utility to test this hypothesis, but some time histories from Helios-2, Pioneer Venus Orbiter and other instruments having one-to several-millisecond capabilities appear to provide consistency. Also, recent studies of temporally resolved Solar Maximum Mission burst energy spectra are entirely compatible with this picture. The phenomenology suggested here, if correct, may assist as an analytic tool for modelling of burst processes and possibly in the definition of burst source populations

  3. A Review of Classical Methods of Item Analysis.

    Science.gov (United States)

    French, Christine L.

    Item analysis is a very important consideration in the test development process. It is a statistical procedure to analyze test items that combines methods used to evaluate the important characteristics of test items, such as difficulty, discrimination, and distractibility of the items in a test. This paper reviews some of the classical methods for…

  4. Electronics. Criterion-Referenced Test (CRT) Item Bank.

    Science.gov (United States)

    Davis, Diane, Ed.

    This document contains 519 criterion-referenced multiple choice and true or false test items for a course in electronics. The test item bank is designed to work with both the Vocational Instructional Management System (VIMS) and the Vocational Administrative Management System (VAMS) in Missouri. The items are grouped into 15 units covering the…

  5. Developing consistent pronunciation models for phonemic variants

    CSIR Research Space (South Africa)

    Davel, M

    2006-09-01

    Full Text Available Pronunciation lexicons often contain pronunciation variants. This can create two problems: It can be difficult to define these variants in an internally consistent way and it can also be difficult to extract generalised grapheme-to-phoneme rule sets...

  6. Consolidation differentially modulates schema effects on memory for items and associations.

    Science.gov (United States)

    van Kesteren, Marlieke T R; Rijpkema, Mark; Ruiter, Dirk J; Fernández, Guillén

    2013-01-01

    Newly learned information that is congruent with a preexisting schema is often better remembered than information that is incongruent. This schema effect on memory has previously been associated to more efficient encoding and consolidation mechanisms. However, this effect is not always consistently supported in the literature, with differential schema effects reported for different types of memory, different retrieval cues, and the possibility of time-dependent effects related to consolidation processes. To examine these effects more directly, we tested participants on two different types of memory (item recognition and associative memory) for newly encoded visuo-tactile associations at different study-test intervals, thus probing memory retrieval accuracy for schema-congruent and schema-incongruent items and associations at different time points (t = 0, t = 20, and t = 48 hours) after encoding. Results show that the schema effect on visual item recognition only arises after consolidation, while the schema effect on associative memory is already apparent immediately after encoding, persisting, but getting smaller over time. These findings give further insight into different factors influencing the schema effect on memory, and can inform future schema experiments by illustrating the value of considering effects of memory type and consolidation on schema-modulated retrieval.

  7. Consolidation differentially modulates schema effects on memory for items and associations.

    Directory of Open Access Journals (Sweden)

    Marlieke T R van Kesteren

    Full Text Available Newly learned information that is congruent with a preexisting schema is often better remembered than information that is incongruent. This schema effect on memory has previously been associated to more efficient encoding and consolidation mechanisms. However, this effect is not always consistently supported in the literature, with differential schema effects reported for different types of memory, different retrieval cues, and the possibility of time-dependent effects related to consolidation processes. To examine these effects more directly, we tested participants on two different types of memory (item recognition and associative memory for newly encoded visuo-tactile associations at different study-test intervals, thus probing memory retrieval accuracy for schema-congruent and schema-incongruent items and associations at different time points (t = 0, t = 20, and t = 48 hours after encoding. Results show that the schema effect on visual item recognition only arises after consolidation, while the schema effect on associative memory is already apparent immediately after encoding, persisting, but getting smaller over time. These findings give further insight into different factors influencing the schema effect on memory, and can inform future schema experiments by illustrating the value of considering effects of memory type and consolidation on schema-modulated retrieval.

  8. Ten Items of Integrated Technology Developed by CNPC

    Institute of Scientific and Technical Information of China (English)

    2006-01-01

    @@ The technological work of China National Petroleum Corporation (CNPC) was based on the company's general development strategy to become a multinational giant with international competitiveness during the 10th FiveYear Plan Period (2001-2005). The technological efforts were focused on strengthening strategic management of technology to identify the technological development targets, optimizing allocation of technological resources and increasing technological investment to highlight creation of key technology. Aiming at the important and key technologies needed for main business development,CNPC launched 15 technological projects at the State level with a 100 percent completion rate and 379 other projects at the corporate level with a 92.8 percent completion rate. With a number of high-level results achieved, CNPC has developed 10 items of integrated technology.

  9. A Balance Sheet for Educational Item Banking.

    Science.gov (United States)

    Hiscox, Michael D.

    Educational item banking presents observers with a considerable paradox. The development of test items from scratch is viewed as wasteful, a luxury in times of declining resources. On the other hand, item banking has failed to become a mature technology despite large amounts of money and the efforts of talented professionals. The question of which…

  10. Promoting cold-start items in recommender systems.

    Science.gov (United States)

    Liu, Jin-Hu; Zhou, Tao; Zhang, Zi-Ke; Yang, Zimo; Liu, Chuang; Li, Wei-Min

    2014-01-01

    As one of the major challenges, cold-start problem plagues nearly all recommender systems. In particular, new items will be overlooked, impeding the development of new products online. Given limited resources, how to utilize the knowledge of recommender systems and design efficient marketing strategy for new items is extremely important. In this paper, we convert this ticklish issue into a clear mathematical problem based on a bipartite network representation. Under the most widely used algorithm in real e-commerce recommender systems, the so-called item-based collaborative filtering, we show that to simply push new items to active users is not a good strategy. Interestingly, experiments on real recommender systems indicate that to connect new items with some less active users will statistically yield better performance, namely, these new items will have more chance to appear in other users' recommendation lists. Further analysis suggests that the disassortative nature of recommender systems contributes to such observation. In a word, getting in-depth understanding on recommender systems could pave the way for the owners to popularize their cold-start products with low costs.

  11. Promoting Cold-Start Items in Recommender Systems

    Science.gov (United States)

    Liu, Jin-Hu; Zhou, Tao; Zhang, Zi-Ke; Yang, Zimo; Liu, Chuang; Li, Wei-Min

    2014-01-01

    As one of the major challenges, cold-start problem plagues nearly all recommender systems. In particular, new items will be overlooked, impeding the development of new products online. Given limited resources, how to utilize the knowledge of recommender systems and design efficient marketing strategy for new items is extremely important. In this paper, we convert this ticklish issue into a clear mathematical problem based on a bipartite network representation. Under the most widely used algorithm in real e-commerce recommender systems, the so-called item-based collaborative filtering, we show that to simply push new items to active users is not a good strategy. Interestingly, experiments on real recommender systems indicate that to connect new items with some less active users will statistically yield better performance, namely, these new items will have more chance to appear in other users' recommendation lists. Further analysis suggests that the disassortative nature of recommender systems contributes to such observation. In a word, getting in-depth understanding on recommender systems could pave the way for the owners to popularize their cold-start products with low costs. PMID:25479013

  12. Mixed-Format Test Score Equating: Effect of Item-Type Multidimensionality, Length and Composition of Common-Item Set, and Group Ability Difference

    Science.gov (United States)

    Wang, Wei

    2013-01-01

    Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…

  13. Negative affect impairs associative memory but not item memory.

    Science.gov (United States)

    Bisby, James A; Burgess, Neil

    2013-12-17

    The formation of associations between items and their context has been proposed to rely on mechanisms distinct from those supporting memory for a single item. Although emotional experiences can profoundly affect memory, our understanding of how it interacts with different aspects of memory remains unclear. We performed three experiments to examine the effects of emotion on memory for items and their associations. By presenting neutral and negative items with background contexts, Experiment 1 demonstrated that item memory was facilitated by emotional affect, whereas memory for an associated context was reduced. In Experiment 2, arousal was manipulated independently of the memoranda, by a threat of shock, whereby encoding trials occurred under conditions of threat or safety. Memory for context was equally impaired by the presence of negative affect, whether induced by threat of shock or a negative item, relative to retrieval of the context of a neutral item in safety. In Experiment 3, participants were presented with neutral and negative items as paired associates, including all combinations of neutral and negative items. The results showed both above effects: compared to a neutral item, memory for the associate of a negative item (a second item here, context in Experiments 1 and 2) is impaired, whereas retrieval of the item itself is enhanced. Our findings suggest that negative affect impairs associative memory while recognition of a negative item is enhanced. They support dual-processing models in which negative affect or stress impairs hippocampal-dependent associative memory while the storage of negative sensory/perceptual representations is spared or even strengthened.

  14. Validity and reliability of an adapted arabic version of the long international physical activity questionnaire.

    Science.gov (United States)

    Helou, Khalil; El Helou, Nour; Mahfouz, Maya; Mahfouz, Yara; Salameh, Pascale; Harmouche-Karaki, Mireille

    2017-07-24

    The International Physical Actvity Questionnaire (IPAQ) is a validated tool for physical activity assessment used in many countries however no Arabic version of the long-form of this questionnaire exists to this date. Hence, the aim of this study was to cross-culturally adapt and validate an Arabic version of the long International Physical Activity Questionnaire (AIPAQ) equivalent to the French version (F-IPAQ) in a Lebanese population. The guidelines for cross-cultural adaptation provided by the World Health Organization and the International Physical Activity Questionnaire committee were followed. One hundred fifty-nine students and staff members from Saint Joseph University of Beirut were randomly recruited to participate in the study. Items of the A-IPAQ were compared to those from the F-IPAQ for concurrent validity using Spearman's correlation coefficient. Content validity of the questionnaire was assessed using factor analysis for the A-IPAQ's items. The physical activity indicators derived from the A-IPAQ were compared with the body mass index (BMI) of the participants for construct validity. The instrument was also evaluated for internal consistency reliability using Cronbach's alpha and Intraclass Correlation Coefficient (ICC). Finally, thirty-one participants were asked to complete the A-IPAQ on two occasions three weeks apart to examine its test-retest reliability. Bland-Altman analyses were performed to evaluate the extent of agreement between the two versions of the questionnaire and its repeated administrations. A high correlation was observed between answers of the F-IPAQ and those of the A-IPAQ, with Spearman's correlation coefficients ranging from 0.91 to 1.00 (p reliability with Cronbach's alpha ranging from 0.769-1.00 (p reliability for most of its items (ICC ranging from 0.66-0.96; p validity and reliability for the assessment of physical activity among Lebanese adults. More studies are necessary in the future to assess its validity compared

  15. Non-ignorable missingness item response theory models for choice effects in examinee-selected items.

    Science.gov (United States)

    Liu, Chen-Wei; Wang, Wen-Chung

    2017-11-01

    Examinee-selected item (ESI) design, in which examinees are required to respond to a fixed number of items in a given set, always yields incomplete data (i.e., when only the selected items are answered, data are missing for the others) that are likely non-ignorable in likelihood inference. Standard item response theory (IRT) models become infeasible when ESI data are missing not at random (MNAR). To solve this problem, the authors propose a two-dimensional IRT model that posits one unidimensional IRT model for observed data and another for nominal selection patterns. The two latent variables are assumed to follow a bivariate normal distribution. In this study, the mirt freeware package was adopted to estimate parameters. The authors conduct an experiment to demonstrate that ESI data are often non-ignorable and to determine how to apply the new model to the data collected. Two follow-up simulation studies are conducted to assess the parameter recovery of the new model and the consequences for parameter estimation of ignoring MNAR data. The results of the two simulation studies indicate good parameter recovery of the new model and poor parameter recovery when non-ignorable missing data were mistakenly treated as ignorable. © 2017 The British Psychological Society.

  16. A Case Study on an Item Writing Process: Use of Test Specifications, Nature of Group Dynamics, and Individual Item Writers' Characteristics

    Science.gov (United States)

    Kim, Jiyoung; Chi, Youngshin; Huensch, Amanda; Jun, Heesung; Li, Hongli; Roullion, Vanessa

    2010-01-01

    This article discusses a case study on an item writing process that reflects on our practical experience in an item development project. The purpose of the article is to share our lessons from the experience aiming to demystify item writing process. The study investigated three issues that naturally emerged during the project: how item writers use…

  17. The Rucio Consistency Service

    CERN Document Server

    Serfon, Cedric; The ATLAS collaboration

    2016-01-01

    One of the biggest challenge with Large scale data management system is to ensure the consistency between the global file catalog and what is physically on all storage elements. To tackle this issue, the Rucio software which is used by the ATLAS Distributed Data Management system has been extended to automatically handle lost or unregistered files (aka Dark Data). This system automatically detects these inconsistencies and take actions like recovery or deletion of unneeded files in a central manner. In this talk, we will present this system, explain the internals and give some results.

  18. Automated Item Generation with Recurrent Neural Networks.

    Science.gov (United States)

    von Davier, Matthias

    2018-03-12

    Utilizing technology for automated item generation is not a new idea. However, test items used in commercial testing programs or in research are still predominantly written by humans, in most cases by content experts or professional item writers. Human experts are a limited resource and testing agencies incur high costs in the process of continuous renewal of item banks to sustain testing programs. Using algorithms instead holds the promise of providing unlimited resources for this crucial part of assessment development. The approach presented here deviates in several ways from previous attempts to solve this problem. In the past, automatic item generation relied either on generating clones of narrowly defined item types such as those found in language free intelligence tests (e.g., Raven's progressive matrices) or on an extensive analysis of task components and derivation of schemata to produce items with pre-specified variability that are hoped to have predictable levels of difficulty. It is somewhat unlikely that researchers utilizing these previous approaches would look at the proposed approach with favor; however, recent applications of machine learning show success in solving tasks that seemed impossible for machines not too long ago. The proposed approach uses deep learning to implement probabilistic language models, not unlike what Google brain and Amazon Alexa use for language processing and generation.

  19. Identification of items and activities important to waste form acceptance by Westinghouse GoCo sites

    International Nuclear Information System (INIS)

    Plodinec, M.J.; Marra, S.L.; Dempster, J.; Randklev, E.H.

    1993-01-01

    The Department of Energy has established specifications (Waste Acceptance Product Specifications for Vitrified High-Level Waste Forms, or WAPS) for canistered waste forms produced at Hanford, Savannah River, and West Valley. Compliance with these specifications requires that each waste form producer identify the items and activities which must be controlled to ensure compliance. As part of quality assurance oversight activities, reviewers have tried to compare the methodologies used by the waste form producers to identify items and activities important to waste form acceptance. Due to the lack of a documented comparison of the methods used by each producer, confusion has resulted over whether the methods being used are consistent. This confusion has been exacerbated by different systems of nomenclature used by each producer, and the different stages of development of each project. The waste form producers have met three times in the last two years, most recently on June 28, 1993, to exchange information on each producer's program. These meetings have been sponsored by the Westinghouse GoCo HLW Vitrification Committee. This document is the result of this most recent exchange. It fills the need for a documented comparison of the methodologies used to identify items and activities important to waste form acceptance. In this document, the methodology being used by each waste form producer is summarized, and the degree of consistency among the waste form producers is determined

  20. Science Library of Test Items. Volume Twenty-Two. A Collection of Multiple Choice Test Items Relating Mainly to Skills.

    Science.gov (United States)

    New South Wales Dept. of Education, Sydney (Australia).

    As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…