survey included items: Topics by WorldWideScience.org

Sample records for survey included items

Reduced-Item Food Audits Based on the Nutrition Environment Measures Surveys.

Science.gov (United States)

Partington, Susan N; Menzies, Tim J; Colburn, Trina A; Saelens, Brian E; Glanz, Karen

2015-10-01

The community food environment may contribute to obesity by influencing food choice. Store and restaurant audits are increasingly common methods for assessing food environments, but are time consuming and costly. A valid, reliable brief measurement tool is needed. The purpose of this study was to develop and validate reduced-item food environment audit tools for stores and restaurants. Nutrition Environment Measures Surveys for stores (NEMS-S) and restaurants (NEMS-R) were completed in 820 stores and 1,795 restaurants in West Virginia, San Diego, and Seattle. Data mining techniques (correlation-based feature selection and linear regression) were used to identify survey items highly correlated to total survey scores and produce reduced-item audit tools that were subsequently validated against full NEMS surveys. Regression coefficients were used as weights that were applied to reduced-item tool items to generate comparable scores to full NEMS surveys. Data were collected and analyzed in 2008-2013. The reduced-item tools included eight items for grocery, ten for convenience, seven for variety, and five for other stores; and 16 items for sit-down, 14 for fast casual, 19 for fast food, and 13 for specialty restaurants-10% of the full NEMS-S and 25% of the full NEMS-R. There were no significant differences in median scores for varying types of retail food outlets when compared to the full survey scores. Median in-store audit time was reduced 25%-50%. Reduced-item audit tools can reduce the burden and complexity of large-scale or repeated assessments of the retail food environment without compromising measurement quality. Copyright © 2015 American Journal of Preventive Medicine. Published by Elsevier Inc. All rights reserved.
A Comprehensive List of Items to be Included on a Pediatric Drug Monograph.

Science.gov (United States)

Kelly, Lauren E; Ito, Shinya; Woods, David; Nunn, Anthony J; Taketomo, Carol; de Hoog, Matthijs; Offringa, Martin

2017-01-01

Children require special considerations for drug prescribing. Drug information summarized in a formulary containing drug monographs is essential for safe and effective prescribing. Currently, little is known about the information needs of those who prescribe and administer medicines to children. Our primary objective was to identify a list of important and relevant items to be included in a pediatric drug monograph. Following the establishment of an expert steering committee and an environmental scan of adult and pediatric formulary monograph items, 46 participants from 25 countries were invited to complete a 2-round Delphi survey. Questions regarding source of prescribing information and importance of items were recorded. An international consensus meeting to vote on and finalize the items list with the steering committee followed. Pediatric formularies are most commonly the first resource consulted for information on medication used in children by 31 Delphi participants. After the Delphi rounds, 116 items were identified to be included in a comprehensive pediatric drug monograph, including general information, adverse drug reactions, dosages, precautions, drug-drug interactions, formulation, and drug properties. Health care providers identified 116 monograph items as important for prescribing medicines for children by an international consensus-based process. This information will assist in setting standards for the creation of new pediatric drug monographs for international application and for those involved in pediatric formulary development.
Factors affecting study efficiency and item non-response in health surveys in developing countries: the Jamaica national healthy lifestyle survey

Directory of Open Access Journals (Sweden)

Bennett Franklyn

2007-02-01

Full Text Available Abstract Background Health surveys provide important information on the burden and secular trends of risk factors and disease. Several factors including survey and item non-response can affect data quality. There are few reports on efficiency, validity and the impact of item non-response, from developing countries. This report examines factors associated with item non-response and study efficiency in a national health survey in a developing Caribbean island. Methods A national sample of participants aged 15–74 years was selected in a multi-stage sampling design accounting for 4 health regions and 14 parishes using enumeration districts as primary sampling units. Means and proportions of the variables of interest were compared between various categories. Non-response was defined as failure to provide an analyzable response. Linear and logistic regression models accounting for sample design and post-stratification weighting were used to identify independent correlates of recruitment efficiency and item non-response. Results We recruited 2012 15–74 year-olds (66.2% females at a response rate of 87.6% with significant variation between regions (80.9% to 97.6%; p Conclusion Informative health surveys are possible in developing countries. While survey response rates may be satisfactory, item non-response was high in respect of income and sexual practice. In contrast to developed countries, non-response to questions on income is higher and has different correlates. These findings can inform future surveys.
Harmonizing Measures of Cognitive Performance Across International Surveys of Aging Using Item Response Theory.

Science.gov (United States)

Chan, Kitty S; Gross, Alden L; Pezzin, Liliana E; Brandt, Jason; Kasper, Judith D

2015-12-01

To harmonize measures of cognitive performance using item response theory (IRT) across two international aging studies. Data for persons ≥65 years from the Health and Retirement Study (HRS, N = 9,471) and the English Longitudinal Study of Aging (ELSA, N = 5,444). Cognitive performance measures varied (HRS fielded 25, ELSA 13); 9 were in common. Measurement precision was examined for IRT scores based on (a) common items, (b) common items adjusted for differential item functioning (DIF), and (c) DIF-adjusted all items. Three common items (day of date, immediate word recall, and delayed word recall) demonstrated DIF by survey. Adding survey-specific items improved precision but mainly for HRS respondents at lower cognitive levels. IRT offers a feasible strategy for harmonizing cognitive performance measures across other surveys and for other multi-item constructs of interest in studies of aging. Practical implications depend on sample distribution and the difficulty mix of in-common and survey-specific items. © The Author(s) 2015.
Poisson and negative binomial item count techniques for surveys with sensitive question.

Science.gov (United States)

Tian, Guo-Liang; Tang, Man-Lai; Wu, Qin; Liu, Yin

2017-04-01

Although the item count technique is useful in surveys with sensitive questions, privacy of those respondents who possess the sensitive characteristic of interest may not be well protected due to a defect in its original design. In this article, we propose two new survey designs (namely the Poisson item count technique and negative binomial item count technique) which replace several independent Bernoulli random variables required by the original item count technique with a single Poisson or negative binomial random variable, respectively. The proposed models not only provide closed form variance estimate and confidence interval within [0, 1] for the sensitive proportion, but also simplify the survey design of the original item count technique. Most importantly, the new designs do not leak respondents' privacy. Empirical results show that the proposed techniques perform satisfactorily in the sense that it yields accurate parameter estimate and confidence interval.
Development and evaluation of CAHPS survey items assessing how well healthcare providers address health literacy.

Science.gov (United States)

Weidmer, Beverly A; Brach, Cindy; Hays, Ron D

2012-09-01

The complexity of health information often exceeds patients' skills to understand and use it. To develop survey items assessing how well healthcare providers communicate health information. Domains and items for the Consumer Assessment of Healthcare Providers and Systems (CAHPS) Item Set for Addressing Health Literacy were identified through an environmental scan and input from stakeholders. The draft item set was translated into Spanish and pretested in both English and Spanish. The revised item set was field tested with a randomly selected sample of adult patients from 2 sites using mail and telephonic data collection. Item-scale correlations, confirmatory factor analysis, and internal consistency reliability estimates were estimated to assess how well the survey items performed and identify composite measures. Finally, we regressed the CAHPS global rating of the provider item on the CAHPS core communication composite and the new health literacy composites. A total of 601 completed surveys were obtained (52% response rate). Two composite measures were identified: (1) Communication to Improve Health Literacy (16 items); and (2) How Well Providers Communicate About Medicines (6 items). These 2 composites were significantly uniquely associated with the global rating of the provider (communication to improve health literacy: PLiteracy composite accounted for 90% of the variance of the original 16-item composite. This study provides support for reliability and validity of the CAHPS Item Set for Addressing Health Literacy. These items can serve to assess whether healthcare providers have communicated effectively with their patients and as a tool for quality improvement.
Recommended core items to assess e-cigarette use in population-based surveys.

Science.gov (United States)

Pearson, Jennifer L; Hitchman, Sara C; Brose, Leonie S; Bauld, Linda; Glasser, Allison M; Villanti, Andrea C; McNeill, Ann; Abrams, David B; Cohen, Joanna E

2018-05-01

A consistent approach using standardised items to assess e-cigarette use in both youth and adult populations will aid cross-survey and cross-national comparisons of the effect of e-cigarette (and tobacco) policies and improve our understanding of the population health impact of e-cigarette use. Focusing on adult behaviour, we propose a set of e-cigarette use items, discuss their utility and potential adaptation, and highlight e-cigarette constructs that researchers should avoid without further item development. Reliable and valid items will strengthen the emerging science and inform knowledge synthesis for policy-making. Building on informal discussions at a series of international meetings of 65 experts from 15 countries, the authors provide recommendations for assessing e-cigarette use behaviour, relative perceived harm, device type, presence of nicotine, flavours and reasons for use. We recommend items assessing eight core constructs: e-cigarette ever use, frequency of use and former daily use; relative perceived harm; device type; primary flavour preference; presence of nicotine; and primary reason for use. These items should be standardised or minimally adapted for the policy context and target population. Researchers should be prepared to update items as e-cigarette device characteristics change. A minimum set of e-cigarette items is proposed to encourage consensus around items to allow for cross-survey and cross-jurisdictional comparisons of e-cigarette use behaviour. These proposed items are a starting point. We recognise room for continued improvement, and welcome input from e-cigarette users and scientific colleagues. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Development of the Quantitative Reasoning Items on the National Survey of Student Engagement

Directory of Open Access Journals (Sweden)

Amber D. Dumford

2015-01-01

Full Text Available As society’s needs for quantitative skills become more prevalent, college graduates require quantitative skills regardless of their career choices. Therefore, it is important that institutions assess students’ engagement in quantitative activities during college. This study chronicles the process taken by the National Survey of Student Engagement (NSSE to develop items that measure students’ participation in quantitative reasoning (QR activities. On the whole, findings across the quantitative and qualitative analyses suggest good overall properties for the developed QR items. The items show great promise to explore and evaluate the frequency with which college students participate in QR-related activities. Each year, hundreds of institutions across the United States and Canada participate in NSSE, and, with the addition of these new items on the core survey, every participating institution will have information on this topic. Our hope is that these items will spur conversations on campuses about students’ use of quantitative reasoning activities.
Item-level psychometrics of the ADL instrument of the Korean National Survey on persons with physical disabilities.

Science.gov (United States)

Hong, Ickpyo; Lee, Mi Jung; Kim, Moon Young; Park, Hae Yean

2017-10-01

The aim of this study is to investigate the psychometrics of the 12 items of an instrument assessing activities of daily living (ADL) using an item response theory model. A total of 648 adults with physical disabilities and having difficulties in ADLs were retrieved from the 2014 Korean National Survey on People with Disabilities. The psychometric testing included factor analysis, internal consistency, precision, and differential item functioning (DIF) across categories including sex, older age, marital status, and physical impairment area. The sample had a mean age of 69.7 years old (SD = 13.7). The majority of the sample had lower extremity impairments (62.0%) and had at least 2.1 chronic conditions. The instrument demonstrated unidimensional construct and good internal consistency (Cronbach's alpha = 0.95). The instrument precisely estimated person measures within a wide range of theta values (-2.22 logits 5.0%). Our findings indicate that the dressing item would need to be modified to improve its psychometrics. Overall, the ADL instrument demonstrates good psychometrics, and thus, it may be used as a standardized instrument for measuring disability in rehabilitation contexts. However, the findings are limited to adults with physical disabilities. Future studies should replicate psychometric testing for survey respondents with other disorders and for children.
Validity of Suicidality Items from the Youth Risk Behavior Survey in a High School Sample

Science.gov (United States)

May, Alexis; Klonsky, E. David

2011-01-01

The Youth Risk Behavior Survey (YRBS) is used by the United States Centers for Disease Control to estimate rates of suicidal thoughts and behaviors in adolescents. This study investigated the validity of the YRBS suicidality items by examining their relationship to criterion variables including loneliness, anxiety, depression, substance use, and…
5 CFR 591.212 - How does OPM select survey items?

Science.gov (United States)

2010-01-01

... 5 Administrative Personnel 1 2010-01-01 2010-01-01 false How does OPM select survey items? 591.212 Section 591.212 Administrative Personnel OFFICE OF PERSONNEL MANAGEMENT CIVIL SERVICE REGULATIONS ALLOWANCES AND DIFFERENTIALS Cost-of-Living Allowance and Post Differential-Nonforeign Areas Cost-Of-Living...
42 CFR 413.217 - Items and services included in the ESRD prospective payment system.

Science.gov (United States)

2010-10-01

... payment system. 413.217 Section 413.217 Public Health CENTERS FOR MEDICARE & MEDICAID SERVICES, DEPARTMENT....217 Items and services included in the ESRD prospective payment system. The following items and services are included in the ESRD prospective payment system effective January 1, 2011: (a) Renal dialysis...
Guideline appraisal with AGREE II: online survey of the potential influence of AGREE II items on overall assessment of guideline quality and recommendation for use.

Science.gov (United States)

Hoffmann-Eßer, Wiebke; Siering, Ulrich; Neugebauer, Edmund A M; Brockhaus, Anne Catharina; McGauran, Natalie; Eikermann, Michaela

2018-02-27

The AGREE II instrument is the most commonly used guideline appraisal tool. It includes 23 appraisal criteria (items) organized within six domains. AGREE II also includes two overall assessments (overall guideline quality, recommendation for use). Our aim was to investigate how strongly the 23 AGREE II items influence the two overall assessments. An online survey of authors of publications on guideline appraisals with AGREE II and guideline users from a German scientific network was conducted between 10th February 2015 and 30th March 2015. Participants were asked to rate the influence of the AGREE II items on a Likert scale (0 = no influence to 5 = very strong influence). The frequencies of responses and their dispersion were presented descriptively. Fifty-eight of the 376 persons contacted (15.4%) participated in the survey and the data of the 51 respondents with prior knowledge of AGREE II were analysed. Items 7-12 of Domain 3 (rigour of development) and both items of Domain 6 (editorial independence) had the strongest influence on the two overall assessments. In addition, Items 15-17 (clarity of presentation) had a strong influence on the recommendation for use. Great variations were shown for the other items. The main limitation of the survey is the low response rate. In guideline appraisals using AGREE II, items representing rigour of guideline development and editorial independence seem to have the strongest influence on the two overall assessments. In order to ensure a transparent approach to reaching the overall assessments, we suggest the inclusion of a recommendation in the AGREE II user manual on how to consider item and domain scores. For instance, the manual could include an a-priori weighting of those items and domains that should have the strongest influence on the two overall assessments. The relevance of these assessments within AGREE II could thereby be further specified.
Recommended core items to assess e-cigarette use in population-based surveys

OpenAIRE

Pearson, Jennifer L; Hitchman, Sara C; Brose, Leonie S; Bauld, Linda; Glasser, Allison M; Villanti, Andrea C; McNeill, Ann; Abrams, David B; Cohen, Joanna E

2017-01-01

Background: A consistent approach using standardized items to assess e-cigarette use in both youth and adult populations will aid cross-survey and cross-national comparisons of the effect of e-cigarette (and tobacco) policies and improve our understanding of the population health impact of e-cigarette use. Focusing on adult behavior, we propose a set of e-cigarette use items, discuss their utility and potential adaptation, and highlight e-cigarette constructs that researchers should avoid wit...
Examining Multiple Sources of Differential Item Functioning on the Clinician & Group CAHPS® Survey

Science.gov (United States)

Rodriguez, Hector P; Crane, Paul K

2011-01-01

Objective To evaluate psychometric properties of a widely used patient experience survey. Data Sources English-language responses to the Clinician & Group Consumer Assessment of Healthcare Providers and Systems (CG-CAHPS®) survey (n = 12,244) from a 2008 quality improvement initiative involving eight southern California medical groups. Methods We used an iterative hybrid ordinal logistic regression/item response theory differential item functioning (DIF) algorithm to identify items with DIF related to patient sociodemographic characteristics, duration of the physician–patient relationship, number of physician visits, and self-rated physical and mental health. We accounted for all sources of DIF and determined its cumulative impact. Principal Findings The upper end of the CG-CAHPS® performance range is measured with low precision. With sensitive settings, some items were found to have DIF. However, overall DIF impact was negligible, as 0.14 percent of participants had salient DIF impact. Latinos who spoke predominantly English at home had the highest prevalence of salient DIF impact at 0.26 percent. Conclusions The CG-CAHPS® functions similarly across commercially insured respondents from diverse backgrounds. Consequently, previously documented racial and ethnic group differences likely reflect true differences rather than measurement bias. The impact of low precision at the upper end of the scale should be clarified. PMID:22092021
The 1992 Pacific Northwest Residential Energy Survey : Phase 1 : Book 4 : Item-by-item Crosstabulations.

Energy Technology Data Exchange (ETDEWEB)

United States. Bonneville Power Administration. End-Use Research Section; Applied Management & Planning Group (Firm)

1993-06-01

This book constitutes a portion of the primary documentation for the 1992 Pacific Northwest Residential Energy Survey, Phase I. The complete 33-volume set of primary documentation provides information needed by energy analysts and interpreters with respect to planning, execution, data collection, and data management of the PNWRES92-I process. Thirty of these volumes are devoted to different ``views`` of the data themselves, with each view having a special purpose or interest as its focus. Analyses and interpretations of these data will be the subjects of forthcoming publications. Conducted during the late summer and fall months of 1992, PNWRES92-I had the over-arching goal of satisfying basic requirements for a variety of information about the stock of residential units in Bonneville`s service region. Surveys with a similar goal were conducted in 1979 and 1983. This volume discerns the information by state. ``Selected crosstabulations`` refers to a set of nine survey items of wide interest (Dwelling Type, Ownership Type, Year-of-Construction, Dwelling Size, Primary Space-Heating Fuel, Primary Water-Heating Fuel, Household Income for 1991, Utility Type, and Space-Heating Fuels: Systems and Equipment) that were crosstabulated among themselves.
Psychometric Evaluation of Chinese-Language 44-Item and 10-Item Big Five Personality Inventories, Including Correlations with Chronotype, Mindfulness and Mind Wandering.

Science.gov (United States)

Carciofo, Richard; Yang, Jiaoyan; Song, Nan; Du, Feng; Zhang, Kan

2016-01-01

The 44-item and 10-item Big Five Inventory (BFI) personality scales are widely used, but there is a lack of psychometric data for Chinese versions. Eight surveys (total N = 2,496, aged 18-82), assessed a Chinese-language BFI-44 and/or an independently translated Chinese-language BFI-10. Most BFI-44 items loaded strongly or predominantly on the expected dimension, and values of Cronbach's alpha ranged .698-.807. Test-retest coefficients ranged .694-.770 (BFI-44), and .515-.873 (BFI-10). The BFI-44 and BFI-10 showed good convergent and discriminant correlations, and expected associations with gender (females higher for agreeableness and neuroticism), and age (older age associated with more conscientiousness and agreeableness, and also less neuroticism and openness). Additionally, predicted correlations were found with chronotype (morningness positive with conscientiousness), mindfulness (negative with neuroticism, positive with conscientiousness), and mind wandering/daydreaming frequency (negative with conscientiousness, positive with neuroticism). Exploratory analysis found that the Self-discipline facet of conscientiousness positively correlated with morningness and mindfulness, and negatively correlated with mind wandering/daydreaming frequency. Furthermore, Self-discipline was found to be a mediator in the relationships between chronotype and mindfulness, and chronotype and mind wandering/daydreaming frequency. Overall, the results support the utility of the BFI-44 and BFI-10 for Chinese-language big five personality research.
Validation of the MOS Social Support Survey 6-item (MOS-SSS-6) measure with two large population-based samples of Australian women.

Science.gov (United States)

Holden, Libby; Lee, Christina; Hockey, Richard; Ware, Robert S; Dobson, Annette J

2014-12-01

This study aimed to validate a 6-item 1-factor global measure of social support developed from the Medical Outcomes Study Social Support Survey (MOS-SSS) for use in large epidemiological studies. Data were obtained from two large population-based samples of participants in the Australian Longitudinal Study on Women's Health. The two cohorts were aged 53-58 and 28-33 years at data collection (N = 10,616 and 8,977, respectively). Items selected for the 6-item 1-factor measure were derived from the factor structure obtained from unpublished work using an earlier wave of data from one of these cohorts. Descriptive statistics, including polychoric correlations, were used to describe the abbreviated scale. Cronbach's alpha was used to assess internal consistency and confirmatory factor analysis to assess scale validity. Concurrent validity was assessed using correlations between the new 6-item version and established 19-item version, and other concurrent variables. In both cohorts, the new 6-item 1-factor measure showed strong internal consistency and scale reliability. It had excellent goodness-of-fit indices, similar to those of the established 19-item measure. Both versions correlated similarly with concurrent measures. The 6-item 1-factor MOS-SSS measures global functional social support with fewer items than the established 19-item measure.
Test-retest reliability of selected items of Health Behaviour in School-aged Children (HBSC survey questionnaire in Beijing, China

Directory of Open Access Journals (Sweden)

Liu Yang

2010-08-01

Full Text Available Abstract Background Children's health and health behaviour are essential for their development and it is important to obtain abundant and accurate information to understand young people's health and health behaviour. The Health Behaviour in School-aged Children (HBSC study is among the first large-scale international surveys on adolescent health through self-report questionnaires. So far, more than 40 countries in Europe and North America have been involved in the HBSC study. The purpose of this study is to assess the test-retest reliability of selected items in the Chinese version of the HBSC survey questionnaire in a sample of adolescents in Beijing, China. Methods A sample of 95 male and female students aged 11 or 15 years old participated in a test and retest with a three weeks interval. Student Identity numbers of respondents were utilized to permit matching of test-retest questionnaires. 23 items concerning physical activity, sedentary behaviour, sleep and substance use were evaluated by using the percentage of response shifts and the single measure Intraclass Correlation Coefficients (ICC with 95% confidence interval (CI for all respondents and stratified by gender and age. Items on substance use were only evaluated for school children aged 15 years old. Results The percentage of no response shift between test and retest varied from 32% for the item on computer use at weekends to 92% for the three items on smoking. Of all the 23 items evaluated, 6 items (26% showed a moderate reliability, 12 items (52% displayed a substantial reliability and 4 items (17% indicated almost perfect reliability. No gender and age group difference of the test-retest reliability was found except for a few items on sedentary behaviour. Conclusions The overall findings of this study suggest that most selected indicators in the HBSC survey questionnaire have satisfactory test-retest reliability for the students in Beijing. Further test-retest studies in a large
Development of six PROMIS pediatrics proxy-report item banks.

Science.gov (United States)

Irwin, Debra E; Gross, Heather E; Stucky, Brian D; Thissen, David; DeWitt, Esi Morgan; Lai, Jin Shei; Amtmann, Dagmar; Khastou, Leyla; Varni, James W; DeWalt, Darren A

2012-02-22

Pediatric self-report should be considered the standard for measuring patient reported outcomes (PRO) among children. However, circumstances exist when the child is too young, cognitively impaired, or too ill to complete a PRO instrument and a proxy-report is needed. This paper describes the development process including the proxy cognitive interviews and large-field-test survey methods and sample characteristics employed to produce item parameters for the Patient Reported Outcomes Measurement Information System (PROMIS) pediatric proxy-report item banks. The PROMIS pediatric self-report items were converted into proxy-report items before undergoing cognitive interviews. These items covered six domains (physical function, emotional distress, social peer relationships, fatigue, pain interference, and asthma impact). Caregivers (n = 25) of children ages of 5 and 17 years provided qualitative feedback on proxy-report items to assess any major issues with these items. From May 2008 to March 2009, the large-scale survey enrolled children ages 8-17 years to complete the self-report version and caregivers to complete the proxy-report version of the survey (n = 1548 dyads). Caregivers of children ages 5 to 7 years completed the proxy report survey (n = 432). In addition, caregivers completed other proxy instruments, PedsQL™ 4.0 Generic Core Scales Parent Proxy-Report version, PedsQL™ Asthma Module Parent Proxy-Report version, and KIDSCREEN Parent-Proxy-52. Item content was well understood by proxies and did not require item revisions but some proxies clearly noted that determining an answer on behalf of their child was difficult for some items. Dyads and caregivers of children ages 5-17 years old were enrolled in the large-scale testing. The majority were female (85%), married (70%), Caucasian (64%) and had at least a high school education (94%). Approximately 50% had children with a chronic health condition, primarily asthma, which was diagnosed or treated within 6

[Wing 1 radiation survey and contamination report

International Nuclear Information System (INIS)

Olsen, K.

1991-01-01

We have completed the 5480.11 survey for Wing 1. All area(s)/item(s) requested by the 5480.11 committee have been thoroughly surveyed and documented. Decontamination/disposal of contaminated items has been accomplished. The wing 1 survey was started on 8/13/90 and completed 9/18/90. However, the follow-up surveys were not completed until 2/18/91. We received the final set of smear samples for wing 1 on 1/13/91. A total of 5,495 smears were taken from wing 1 and total of 465 smears were taken during the follow-up surveys. There were a total 122 items found to have fixed contamination and 4 items with smearable contamination in excess of the limits specified in DOE ORDER 5480.11 (AR 3-7). The following area(s)/item(s) were not included in the 5480.11 survey: Hallways, Access panels, Men's and women's change rooms, Janitor closets, Wall lockers and item(s) stored in wing 1 hallways and room 1116. If our contract is renewed, we will include those areas in our survey according to your request of April 15, 1991
Development of six PROMIS pediatrics proxy-report item banks

Directory of Open Access Journals (Sweden)

Irwin Debra E

2012-02-01

Full Text Available Abstract Background Pediatric self-report should be considered the standard for measuring patient reported outcomes (PRO among children. However, circumstances exist when the child is too young, cognitively impaired, or too ill to complete a PRO instrument and a proxy-report is needed. This paper describes the development process including the proxy cognitive interviews and large-field-test survey methods and sample characteristics employed to produce item parameters for the Patient Reported Outcomes Measurement Information System (PROMIS pediatric proxy-report item banks. Methods The PROMIS pediatric self-report items were converted into proxy-report items before undergoing cognitive interviews. These items covered six domains (physical function, emotional distress, social peer relationships, fatigue, pain interference, and asthma impact. Caregivers (n = 25 of children ages of 5 and 17 years provided qualitative feedback on proxy-report items to assess any major issues with these items. From May 2008 to March 2009, the large-scale survey enrolled children ages 8-17 years to complete the self-report version and caregivers to complete the proxy-report version of the survey (n = 1548 dyads. Caregivers of children ages 5 to 7 years completed the proxy report survey (n = 432. In addition, caregivers completed other proxy instruments, PedsQL™ 4.0 Generic Core Scales Parent Proxy-Report version, PedsQL™ Asthma Module Parent Proxy-Report version, and KIDSCREEN Parent-Proxy-52. Results Item content was well understood by proxies and did not require item revisions but some proxies clearly noted that determining an answer on behalf of their child was difficult for some items. Dyads and caregivers of children ages 5-17 years old were enrolled in the large-scale testing. The majority were female (85%, married (70%, Caucasian (64% and had at least a high school education (94%. Approximately 50% had children with a chronic health condition, primarily
Item validity vs. item discrimination index: a redundancy?

Science.gov (United States)

Panjaitan, R. L.; Irawati, R.; Sujana, A.; Hanifah, N.; Djuanda, D.

2018-03-01

In several literatures about evaluation and test analysis, it is common to find that there are calculations of item validity as well as item discrimination index (D) with different formula for each. Meanwhile, other resources said that item discrimination index could be obtained by calculating the correlation between the testee’s score in a particular item and the testee’s score on the overall test, which is actually the same concept as item validity. Some research reports, especially undergraduate theses tend to include both item validity and item discrimination index in the instrument analysis. It seems that these concepts might overlap for both reflect the test quality on measuring the examinees’ ability. In this paper, examples of some results of data processing on item validity and item discrimination index were compared. It would be discussed whether item validity and item discrimination index can be represented by one of them only or it should be better to present both calculations for simple test analysis, especially in undergraduate theses where test analyses were included.
Method of data mining including determining multidimensional coordinates of each item using a predetermined scalar similarity value for each item pair

Science.gov (United States)

Meyers, Charles E.; Davidson, George S.; Johnson, David K.; Hendrickson, Bruce A.; Wylie, Brian N.

1999-01-01

A method of data mining represents related items in a multidimensional space. Distance between items in the multidimensional space corresponds to the extent of relationship between the items. The user can select portions of the space to perceive. The user also can interact with and control the communication of the space, focusing attention on aspects of the space of most interest. The multidimensional spatial representation allows more ready comprehension of the structure of the relationships among the items.
Reliability of the Core Items in the General Social Survey: Estimates from the Three-Wave Panels, 2006–2014

Directory of Open Access Journals (Sweden)

Michael Hout

2016-11-01

Full Text Available We used standard and multilevel models to assess the reliability of core items in the General Social Survey panel studies spanning 2006 to 2014. Most of the 293 core items scored well on the measure of reliability: 62 items (21 percent had reliability measures greater than 0.85; another 71 (24 percent had reliability measures between 0.70 and 0.85. Objective items, especially facts about demography and religion, were generally more reliable than subjective items. The economic recession of 2007–2009, the slow recovery afterward, and the election of Barack Obama in 2008 altered the social context in ways that may look like unreliability of items. For example, unemployment status, hours worked, and weeks worked have lower reliability than most work-related items, reflecting the consequences of the recession on the facts of peoples lives. Items regarding racial and gender discrimination and racial stereotypes scored as particularly unreliable, accounting for most of the 15 items with reliability coefficients less than 0.40. Our results allow scholars to more easily take measurement reliability into consideration in their own research, while also highlighting the limitations of these approaches.
Item-focussed Trees for the Identification of Items in Differential Item Functioning.

Science.gov (United States)

Tutz, Gerhard; Berger, Moritz

2016-09-01

A novel method for the identification of differential item functioning (DIF) by means of recursive partitioning techniques is proposed. We assume an extension of the Rasch model that allows for DIF being induced by an arbitrary number of covariates for each item. Recursive partitioning on the item level results in one tree for each item and leads to simultaneous selection of items and variables that induce DIF. For each item, it is possible to detect groups of subjects with different item difficulties, defined by combinations of characteristics that are not pre-specified. The way a DIF item is determined by covariates is visualized in a small tree and therefore easily accessible. An algorithm is proposed that is based on permutation tests. Various simulation studies, including the comparison with traditional approaches to identify items with DIF, show the applicability and the competitive performance of the method. Two applications illustrate the usefulness and the advantages of the new method.
Factoring handedness data: I. Item analysis.

Science.gov (United States)

Messinger, H B; Messinger, M I

1995-12-01

Recently in this journal Peters and Murphy challenged the validity of factor analyses done on bimodal handedness data, suggesting instead that right- and left-handers be studied separately. But bimodality may be avoidable if attention is paid to Oldfield's questionnaire format and instructions for the subjects. Two characteristics appear crucial: a two-column LEFT-RIGHT format for the body of the instrument and what we call Oldfield's Admonition: not to indicate strong preference for handedness item, such as write, unless "... the preference is so strong that you would never try to use the other hand unless absolutely forced to...". Attaining unimodality of an item distribution would seem to overcome the objections of Peters and Murphy. In a 1984 survey in Boston we used Oldfield's ten-item questionnaire exactly as published. This produced unimodal item distributions. With reflection of the five-point item scale and a logarithmic transformation, we achieved a degree of normalization for the items. Two surveys elsewhere based on Oldfield's 20-item list but with changes in the questionnaire format and the instructions, yielded markedly different item distributions with peaks at each extreme and sometimes in the middle as well.
Phase I Marine and Terrestrial Cultural Resources Survey of 13 Project Items Located on Marsh Island, Iberia Parish, Louisiana

National Research Council Canada - National Science Library

Barr, William

1999-01-01

This report presents the results of Phase I cultural resources survey and archeological inventory of two marine and 11 terrestrial project items on and near Marsh Island in Iberia Parish, Louisiana...
The Role of Item Models in Automatic Item Generation

Science.gov (United States)

Gierl, Mark J.; Lai, Hollis

2012-01-01

Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…
A Study of General Education Astronomy Students' Understandings of Cosmology. Part III. Evaluating Four Conceptual Cosmology Surveys: An Item Response Theory Approach

Science.gov (United States)

Wallace, Colin S.; Prather, Edward E.; Duncan, Douglas K.

2012-01-01

This is the third of five papers detailing our national study of general education astronomy students' conceptual and reasoning difficulties with cosmology. In this paper, we use item response theory to analyze students' responses to three out of the four conceptual cosmology surveys we developed. The specific item response theory model we use is…
Analysis of differential item functioning in the depression item bank from the Patient Reported Outcome Measurement Information System (PROMIS: An item response theory approach

Directory of Open Access Journals (Sweden)

JOSEPH P. EIMICKE

2009-06-01

Full Text Available The aims of this paper are to present findings related to differential item functioning (DIF in the Patient Reported Outcome Measurement Information System (PROMIS depression item bank, and to discuss potential threats to the validity of results from studies of DIF. The 32 depression items studied were modified from several widely used instruments. DIF analyses of gender, age and education were performed using a sample of 735 individuals recruited by a survey polling firm. DIF hypotheses were generated by asking content experts to indicate whether or not they expected DIF to be present, and the direction of the DIF with respect to the studied comparison groups. Primary analyses were conducted using the graded item response model (for polytomous, ordered response category data with likelihood ratio tests of DIF, accompanied by magnitude measures. Sensitivity analyses were performed using other item response models and approaches to DIF detection. Despite some caveats, the items that are recommended for exclusion or for separate calibration were "I felt like crying" and "I had trouble enjoying things that I used to enjoy." The item, "I felt I had no energy," was also flagged as evidencing DIF, and recommended for additional review. On the one hand, false DIF detection (Type 1 error was controlled to the extent possible by ensuring model fit and purification. On the other hand, power for DIF detection might have been compromised by several factors, including sparse data and small sample sizes. Nonetheless, practical and not just statistical significance should be considered. In this case the overall magnitude and impact of DIF was small for the groups studied, although impact was relatively large for some individuals.
Relationship between handling heavy items during pregnancy and spontaneous abortion: a cross-sectional survey of working women in South Korea.

Science.gov (United States)

Lee, Bokim; Jung, Hye-Sun

2012-01-01

The researchers conducted a cross-sectional survey to determine the relationship between handling heavy items during pregnancy and spontaneous abortion among working women in South Korea. One thousand working women were selected from a database of those eligible for maternity benefits under the National Employment Insurance Plan. Study results showed that handling heavy items during pregnancy was associated with an increased risk of spontaneous abortion after adjusting for general characteristics of the participants and their work environment. A collective effort is needed on the parts of employers, employees, occupational health nurses, and the government to protect working women from lifting heavy items while pregnant. Copyright 2012, SLACK Incorporated.
41 CFR 302-7.20 - If my HHG shipment includes an item (e.g., boat, trailer, ultralight vehicle) for which a weight...

Science.gov (United States)

2010-07-01

... includes an item (e.g., boat, trailer, ultralight vehicle) for which a weight additive is assessed by the...) General Rules § 302-7.20 If my HHG shipment includes an item (e.g., boat, trailer, ultralight vehicle) for which a weight additive is assessed by the HHG carrier, am I responsible for payment? If your HHG...
Comparison of Self-Reported Telephone Interviewing and Web-Based Survey Responses: Findings From the Second Australian Young and Well National Survey.

Science.gov (United States)

Milton, Alyssa C; Ellis, Louise A; Davenport, Tracey A; Burns, Jane M; Hickie, Ian B

2017-09-26

Web-based self-report surveying has increased in popularity, as it can rapidly yield large samples at a low cost. Despite this increase in popularity, in the area of youth mental health, there is a distinct lack of research comparing the results of Web-based self-report surveys with the more traditional and widely accepted computer-assisted telephone interviewing (CATI). The Second Australian Young and Well National Survey 2014 sought to compare differences in respondent response patterns using matched items on CATI versus a Web-based self-report survey. The aim of this study was to examine whether responses varied as a result of item sensitivity, that is, the item's susceptibility to exaggeration on underreporting and to assess whether certain subgroups demonstrated this effect to a greater extent. A subsample of young people aged 16 to 25 years (N=101), recruited through the Second Australian Young and Well National Survey 2014, completed the identical items on two occasions: via CATI and via Web-based self-report survey. Respondents also rated perceived item sensitivity. When comparing CATI with the Web-based self-report survey, a Wilcoxon signed-rank analysis showed that respondents answered 14 of the 42 matched items in a significantly different way. Significant variation in responses (CATI vs Web-based) was more frequent if the item was also rated by the respondents as highly sensitive in nature. Specifically, 63% (5/8) of the high sensitivity items, 43% (3/7) of the neutral sensitivity items, and 0% (0/4) of the low sensitivity items were answered in a significantly different manner by respondents when comparing their matched CATI and Web-based question responses. The items that were perceived as highly sensitive by respondents and demonstrated response variability included the following: sexting activities, body image concerns, experience of diagnosis, and suicidal ideation. For high sensitivity items, a regression analysis showed respondents who were male
Language-related differential item functioning between English and German PROMIS Depression items is negligible.

Science.gov (United States)

Fischer, H Felix; Wahl, Inka; Nolte, Sandra; Liegl, Gregor; Brähler, Elmar; Löwe, Bernd; Rose, Matthias

2017-12-01

To investigate differential item functioning (DIF) of PROMIS Depression items between US and German samples we compared data from the US PROMIS calibration sample (n = 780), a German general population survey (n = 2,500) and a German clinical sample (n = 621). DIF was assessed in an ordinal logistic regression framework, with 0.02 as criterion for R 2 -change and 0.096 for Raju's non-compensatory DIF. Item parameters were initially fixed to the PROMIS Depression metric; we used plausible values to account for uncertainty in depression estimates. Only four items showed DIF. Accounting for DIF led to negligible effects for the full item bank as well as a post hoc simulated computer-adaptive test (German general population sample was considerably lower compared to the US reference value of 50. Overall, we found little evidence for language DIF between US and German samples, which could be addressed by either replacing the DIF items by items not showing DIF or by scoring the short form in German samples with the corrected item parameters reported. Copyright © 2016 John Wiley & Sons, Ltd.
Comparison of Self-Reported Telephone Interviewing and Web-Based Survey Responses: Findings From the Second Australian Young and Well National Survey

Science.gov (United States)

Davenport, Tracey A; Burns, Jane M; Hickie, Ian B

2017-01-01

Background Web-based self-report surveying has increased in popularity, as it can rapidly yield large samples at a low cost. Despite this increase in popularity, in the area of youth mental health, there is a distinct lack of research comparing the results of Web-based self-report surveys with the more traditional and widely accepted computer-assisted telephone interviewing (CATI). Objective The Second Australian Young and Well National Survey 2014 sought to compare differences in respondent response patterns using matched items on CATI versus a Web-based self-report survey. The aim of this study was to examine whether responses varied as a result of item sensitivity, that is, the item’s susceptibility to exaggeration on underreporting and to assess whether certain subgroups demonstrated this effect to a greater extent. Methods A subsample of young people aged 16 to 25 years (N=101), recruited through the Second Australian Young and Well National Survey 2014, completed the identical items on two occasions: via CATI and via Web-based self-report survey. Respondents also rated perceived item sensitivity. Results When comparing CATI with the Web-based self-report survey, a Wilcoxon signed-rank analysis showed that respondents answered 14 of the 42 matched items in a significantly different way. Significant variation in responses (CATI vs Web-based) was more frequent if the item was also rated by the respondents as highly sensitive in nature. Specifically, 63% (5/8) of the high sensitivity items, 43% (3/7) of the neutral sensitivity items, and 0% (0/4) of the low sensitivity items were answered in a significantly different manner by respondents when comparing their matched CATI and Web-based question responses. The items that were perceived as highly sensitive by respondents and demonstrated response variability included the following: sexting activities, body image concerns, experience of diagnosis, and suicidal ideation. For high sensitivity items, a regression
Health Information National Trends Survey in American Sign Language (HINTS-ASL): Protocol for the Cultural Adaptation and Linguistic Validation of a National Survey.

Science.gov (United States)

Kushalnagar, Poorna; Harris, Raychelle; Paludneviciene, Raylene; Hoglind, TraciAnn

2017-09-13

The Health Information National Trends Survey (HINTS) collects nationally representative data about the American's public use of health-related information. This survey is available in English and Spanish, but not in American Sign Language (ASL). Thus, the exclusion of ASL users from these national health information survey studies has led to a significant gap in knowledge of Internet usage for health information access in this underserved and understudied population. The objectives of this study are (1) to culturally adapt and linguistically translate the HINTS items to ASL (HINTS-ASL); and (2) to gather information about deaf people's health information seeking behaviors across technology-mediated platforms. We modified the standard procedures developed at the US National Center for Health Statistics Cognitive Survey Laboratory to culturally adapt and translate HINTS items to ASL. Cognitive interviews were conducted to assess clarity and delivery of these HINTS-ASL items. Final ASL video items were uploaded to a protected online survey website. The HINTS-ASL online survey has been administered to over 1350 deaf adults (ages 18 to 90 and up) who use ASL. Data collection is ongoing and includes deaf adult signers across the United States. Some items from HINTS item bank required cultural adaptation for use with deaf people who use accessible services or technology. A separate item bank for deaf-related experiences was created, reflecting deaf-specific technology such as sharing health-related ASL videos through social network sites and using video remote interpreting services in health settings. After data collection is complete, we will conduct a series of analyses on deaf people's health information seeking behaviors across technology-mediated platforms. HINTS-ASL is an accessible health information national trends survey, which includes a culturally appropriate set of items that are relevant to the experiences of deaf people who use ASL. The final HINTS
Citizens' perceptions of political processes. A critical evaluation of preference consistency and survey items

Directory of Open Access Journals (Sweden)

Bengtsson, Åsa

2012-12-01

Full Text Available The current state of research does not tell us much about citizens’ expectations of political decision making. Most surveys allow respondents to evaluate how the current system is working, but do not inquire about alternative political decision-making procedures. The lack of established survey items can be explained by the fact that radical changes in decision-making procedures have been hard to envisage, but also by a general scepticism regarding people’s ability to form opinions on these matters. Political processes are, without doubt, complex matters that do not lend themselves very well to simplistic survey questions. Moreover, previous research has convincingly shown that most people in general have difficulties forming single, coherent and stable attitudes even towards far more straightforward political issues. In order to determine if trying to grasp attitudes towards political decision-making in future empirical studies can be considered a fruitful endeavour, this study sets out to critically assess the extent to which people express coherent preferences on these matters, and if preferences are in line with expectations in previous, rather scattered research. The study is based on the Finnish National Election Study 2011; a study which, contrary to most other election studies, includes a rich variety of survey items on the topic, and utilises a combination of strategies in order to explore patterns in the opinions held by citizens.

El estado actual de las investigaciones no nos dice mucho sobre las expectativas de los ciudadanos con respecto a la toma de decisiones políticas. La mayoría de las encuestas permiten que quienes las responden evalúen cómo funciona el sistema actual, pero no preguntan por procedimientos alternativos de decisión política. La falta de preguntas de encuesta contrastadas se puede explicar tanto por el hecho de que los cambios en los procedimientos de toma de decisiones han resultado difíciles de
Item response theory analysis of Centers for Disease Control and Prevention Health-Related Quality of Life (CDC HRQOL) items in adults with arthritis.

Science.gov (United States)

Mielenz, Thelma J; Callahan, Leigh F; Edwards, Michael C

2016-03-12

Examine the feasibility of performing an item response theory (IRT) analysis on two of the Centers for Disease Control and Prevention health-related quality of life (CDC HRQOL) modules - the 4-item Healthy Days Core Module (HDCM) and the 5-item Healthy days Symptoms Module (HDSM). Previous principal components analyses confirm that the two scales both assess a mix of mental (CDC-MH) and physical health (CDC-PH). The purpose is to conduct item response theory (IRT) analysis on the CDC-MH and CDC-PH scales separately. 2182 patients with self-reported or physician-diagnosed arthritis completed a cross-sectional survey including HDCM and HDSM items. Besides global health, the other 8 items ask the number of days that some statement was true; we chose to recode the data into 8 categories based on observed clustering. The IRT assumptions were assessed using confirmatory factor analysis and the data could be modeled using an unidimensional IRT model. The graded response model was used for IRT analyses and CDC-MH and CDC-PH scales were analyzed separately in flexMIRT. The IRT parameter estimates for the five-item CDC-PH all appeared reasonable. The three-item CDC-MH did not have reasonable parameter estimates. The CDC-PH scale is amenable to IRT analysis but the existing The CDC-MH scale is not. We suggest either using the 4-item Healthy Days Core Module (HDCM) and the 5-item Healthy days Symptoms Module (HDSM) as they currently stand or the CDC-PH scale alone if the primary goal is to measure physical health related HRQOL.
Better assessment of physical function: item improvement is neglected but essential.

Science.gov (United States)

Bruce, Bonnie; Fries, James F; Ambrosini, Debbie; Lingala, Bharathi; Gandek, Barbara; Rose, Matthias; Ware, John E

2009-01-01

Physical function is a key component of patient-reported outcome (PRO) assessment in rheumatology. Modern psychometric methods, such as Item Response Theory (IRT) and Computerized Adaptive Testing, can materially improve measurement precision at the item level. We present the qualitative and quantitative item-evaluation process for developing the Patient Reported Outcomes Measurement Information System (PROMIS) Physical Function item bank. The process was stepwise: we searched extensively to identify extant Physical Function items and then classified and selectively reduced the item pool. We evaluated retained items for content, clarity, relevance and comprehension, reading level, and translation ease by experts and patient surveys, focus groups, and cognitive interviews. We then assessed items by using classic test theory and IRT, used confirmatory factor analyses to estimate item parameters, and graded response modeling for parameter estimation. We retained the 20 Legacy (original) Health Assessment Questionnaire Disability Index (HAQ-DI) and the 10 SF-36's PF-10 items for comparison. Subjects were from rheumatoid arthritis, osteoarthritis, and healthy aging cohorts (n = 1,100) and a national Internet sample of 21,133 subjects. We identified 1,860 items. After qualitative and quantitative evaluation, 124 newly developed PROMIS items composed the PROMIS item bank, which included revised Legacy items with good fit that met IRT model assumptions. Results showed that the clearest and best-understood items were simple, in the present tense, and straightforward. Basic tasks (like dressing) were more relevant and important versus complex ones (like dancing). Revised HAQ-DI and PF-10 items with five response options had higher item-information content than did comparable original Legacy items with fewer response options. IRT analyses showed that the Physical Function domain satisfied general criteria for unidimensionality with one-, two-, three-, and four-factor models

A survey of anatomical items relevant to the practice of rheumatology: upper extremity, head, neck, spine, and general concepts.

Science.gov (United States)

Villaseñor-Ovies, Pablo; Navarro-Zarza, José Eduardo; Saavedra, Miguel Ángel; Hernández-Díaz, Cristina; Canoso, Juan J; Biundo, Joseph J; Kalish, Robert A; de Toro Santos, Francisco Javier; McGonagle, Dennis; Carette, Simon; Alvarez-Nemegyei, José

2016-12-01

This study aimed to identify the anatomical items of the upper extremity and spine that are potentially relevant to the practice of rheumatology. Ten rheumatologists interested in clinical anatomy who published, taught, and/or participated as active members of Clinical Anatomy Interest groups (six seniors, four juniors), participated in a one-round relevance Delphi exercise. An initial, 560-item list that included 45 (8.0 %) general concepts items; 138 (24.8 %) hand items; 100 (17.8 %) forearm and elbow items; 147 (26.2 %) shoulder items; and 130 (23.2 %) head, neck, and spine items was compiled by 5 of the participants. Each item was graded for importance with a Likert scale from 1 (not important) to 5 (very important). Thus, scores could range from 10 (1 × 10) to 50 (5 × 10). An item score of ≥40 was considered most relevant to competent practice as a rheumatologist. Mean item Likert scores ranged from 2.2 ± 0.5 to 4.6 ± 0.7. A total of 115 (20.5 %) of the 560 initial items reached relevance. Broken down by categories, this final relevant item list was composed by 7 (6.1 %) general concepts items; 32 (27.8 %) hand items; 20 (17.4 %) forearm and elbow items; 33 (28.7 %) shoulder items; and 23 (17.6 %) head, neck, and spine items. In this Delphi exercise, a group of practicing academic rheumatologists with an interest in clinical anatomy compiled a list of anatomical items that were deemed important to the practice of rheumatology. We suggest these items be considered curricular priorities when training rheumatology fellows in clinical anatomy skills and in programs of continuing rheumatology education.
Eating Well While Dining Out: Collaborating with Local Restaurants to Promote Heart Healthy Menu Items

Science.gov (United States)

Thayer, Linden M.; Pimentel, Daniela C.; Smith, Janice C.; Garcia, Beverly A.; Lee Sylvester, Laura; Kelly, Tammy; Johnston, Larry F.; Ammerman, Alice S.; Keyserling, Thomas C.

2017-01-01

Background As Americans commonly consume restaurant foods with poor dietary quality, effective interventions are needed to improve food choices at restaurants. Purpose To design and evaluate a restaurant-based intervention to help customers select and restaurants promote heart healthy menu items with healthful fats and high quality carbohydrates. Methods The intervention included table tents outlining 10 heart healthy eating tips, coupons promoting healthy menu items, an information brochure, and link to study website. Pre and post intervention surveys were completed by restaurant managers and customers completed a brief “intercept” survey. Results Managers (n = 10) reported the table tents and coupons were well received, and several noted improved personal nutrition knowledge. Overall, 4214 coupons were distributed with 1244 (30%) redeemed. Of 300 customers surveyed, 126 (42%) noticed the table tents and of these, 115 (91%) considered the nutrition information helpful, 42 (33%) indicated the information influenced menu items purchased, and 91 (72%) reported the information will influence what they order in the future. Discussion The intervention was well-received by restaurant managers and positively influenced menu item selection by many customers. Translation to Health Education Practice Further research is needed to assess effective strategies for scaling up and sustaining this intervention approach. PMID:28947925
Grouping of Items in Mobile Web Questionnaires

Science.gov (United States)

Mavletova, Aigul; Couper, Mick P.

2016-01-01

There is some evidence that a scrolling design may reduce breakoffs in mobile web surveys compared to a paging design, but there is little empirical evidence to guide the choice of the optimal number of items per page. We investigate the effect of the number of items presented on a page on data quality in two types of questionnaires: with or…
Intake of natural radioactivity through dietary items: a prelude to preoperational environmental survey at Kudankulam

International Nuclear Information System (INIS)

Varughese, K.G.; Kumar, M.; George, Thomas; Sunder Rajan, P.; Vijay Kumar, B.; Rajan, M.P.

2008-01-01

High background radiation are found in nature at some parts of Australia, Brazil, China, Iran, India etc. Kanyakumari district in the southern peninsular India is such a NHBRA (Natural high background radiation area) having monazite placers along the coast. Although general radiation levels in this area has been investigated by many researchers in the past, the impact of this high background radioactivity on the flora and fauna is scarce. In the present investigations radiation survey has been done at high background areas with special attention to vegetables and crops grown in this area. The studies are centered at the 2x1000 MWe, Kudankulam Nuclear Power Project site which is about 25 km from Kanyakumari. Samples of soil, sand, vegetations and other food items are collected from the 30 km radial zone of KKNPP site and analysed for naturally occurring radionuclides such as 238 U, 232 Th and 40 K. The intake of natural radioactivity through food items produced in this area is found to be very small, and the internal dose to general population staying at this high natural background area is insignificant. (author)
Macrostructural Treatment of Multi-word Lexical Items

Directory of Open Access Journals (Sweden)

Alenka Vrbinc

2011-05-01

Full Text Available The paper discusses the macrostructural treatment of multi-word lexical items in mono- and bilingual dictionaries. First, the classification of multi-word lexical items is presented, and special attention is paid to the discussion of compounds – a specific group of multi-word lexical items that is most commonly afforded headword status but whose inclusion in the headword list may also depend on spelling. Then the inclusion of multi-word lexical items in monolingual dictionaries is dealt with in greater detail, while the results of a short survey on the inclusion of five randomly chosen multi-word lexical items in seven English monolingual dictionaries are presented. The proposals as to how to treat these five multi-word lexical items in bilingual dictionaries are presented in the section about the inclusion of multi-word lexical items in bilingual dictionaries. The conclusion is that it is most important to take the users’ needs into consideration and to make any dictionary as user friendly as possible.
A randomised trial and economic evaluation of the effect of response mode on response rate, response bias, and item non-response in a survey of doctors

Directory of Open Access Journals (Sweden)

Witt Julia

2011-09-01

Full Text Available Abstract Background Surveys of doctors are an important data collection method in health services research. Ways to improve response rates, minimise survey response bias and item non-response, within a given budget, have not previously been addressed in the same study. The aim of this paper is to compare the effects and costs of three different modes of survey administration in a national survey of doctors. Methods A stratified random sample of 4.9% (2,702/54,160 of doctors undertaking clinical practice was drawn from a national directory of all doctors in Australia. Stratification was by four doctor types: general practitioners, specialists, specialists-in-training, and hospital non-specialists, and by six rural/remote categories. A three-arm parallel trial design with equal randomisation across arms was used. Doctors were randomly allocated to: online questionnaire (902; simultaneous mixed mode (a paper questionnaire and login details sent together (900; or, sequential mixed mode (online followed by a paper questionnaire with the reminder (900. Analysis was by intention to treat, as within each primary mode, doctors could choose either paper or online. Primary outcome measures were response rate, survey response bias, item non-response, and cost. Results The online mode had a response rate 12.95%, followed by the simultaneous mixed mode with 19.7%, and the sequential mixed mode with 20.7%. After adjusting for observed differences between the groups, the online mode had a 7 percentage point lower response rate compared to the simultaneous mixed mode, and a 7.7 percentage point lower response rate compared to sequential mixed mode. The difference in response rate between the sequential and simultaneous modes was not statistically significant. Both mixed modes showed evidence of response bias, whilst the characteristics of online respondents were similar to the population. However, the online mode had a higher rate of item non-response compared
Development of a Microsoft Excel tool for one-parameter Rasch model of continuous items: an application to a safety attitude survey.

Science.gov (United States)

Chien, Tsair-Wei; Shao, Yang; Kuo, Shu-Chun

2017-01-10

Many continuous item responses (CIRs) are encountered in healthcare settings, but no one uses item response theory's (IRT) probabilistic modeling to present graphical presentations for interpreting CIR results. A computer module that is programmed to deal with CIRs is required. To present a computer module, validate it, and verify its usefulness in dealing with CIR data, and then to apply the model to real healthcare data in order to show how the CIR that can be applied to healthcare settings with an example regarding a safety attitude survey. Using Microsoft Excel VBA (Visual Basic for Applications), we designed a computer module that minimizes the residuals and calculates model's expected scores according to person responses across items. Rasch models based on a Wright map and on KIDMAP were demonstrated to interpret results of the safety attitude survey. The author-made CIR module yielded OUTFIT mean square (MNSQ) and person measures equivalent to those yielded by professional Rasch Winsteps software. The probabilistic modeling of the CIR module provides messages that are much more valuable to users and show the CIR advantage over classic test theory. Because of advances in computer technology, healthcare users who are familiar to MS Excel can easily apply the study CIR module to deal with continuous variables to benefit comparisons of data with a logistic distribution and model fit statistics.
Development of the PROMIS positive emotional and sensory expectancies of smoking item banks.

Science.gov (United States)

Tucker, Joan S; Shadel, William G; Edelen, Maria Orlando; Stucky, Brian D; Li, Zhen; Hansen, Mark; Cai, Li

2014-09-01

The positive emotional and sensory expectancies of cigarette smoking include improved cognitive abilities, positive affective states, and pleasurable sensorimotor sensations. This paper describes development of Positive Emotional and Sensory Expectancies of Smoking item banks that will serve to standardize the assessment of this construct among daily and nondaily cigarette smokers. Data came from daily (N = 4,201) and nondaily (N =1,183) smokers who completed an online survey. To identify a unidimensional set of items, we conducted item factor analyses, item response theory analyses, and differential item functioning analyses. Additionally, we evaluated the performance of fixed-item short forms (SFs) and computer adaptive tests (CATs) to efficiently assess the construct. Eighteen items were included in the item banks (15 common across daily and nondaily smokers, 1 unique to daily, 2 unique to nondaily). The item banks are strongly unidimensional, highly reliable (reliability = 0.95 for both), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.86). Results from simulated CATs indicated that, on average, less than 8 items are needed to assess the construct with adequate precision using the item banks. These analyses identified a new set of items that can assess the positive emotional and sensory expectancies of smoking in a reliable and standardized manner. Considerable efficiency in assessing this construct can be achieved by using the item bank SF, employing computer adaptive tests, or selecting subsets of items tailored to specific research or clinical purposes. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Evaluating construct validity of the second version of the Copenhagen Psychosocial Questionnaire through analysis of differential item functioning and differential item effect

DEFF Research Database (Denmark)

Bjorner, Jakob Bue; Pejtersen, Jan Hyld

2010-01-01

AIMS: To evaluate the construct validity of the Copenhagen Psychosocial Questionnaire II (COPSOQ II) by means of tests for differential item functioning (DIF) and differential item effect (DIE). METHODS: We used a Danish general population postal survey (n = 4,732 with 3,517 wage earners) with a ...
Engaging Community Leaders in the Development of a Cardiovascular Health Behavior Survey Using Focus Group–Based Cognitive Interviewing

Directory of Open Access Journals (Sweden)

Gwenyth R Wallen

2017-04-01

Full Text Available Establishing the validity of health behavior surveys used in community-based participatory research (CBPR in diverse populations is often overlooked. A novel, group-based cognitive interviewing method was used to obtain qualitative data for tailoring a survey instrument designed to identify barriers to improved cardiovascular health in at-risk populations in Washington, DC. A focus group–based cognitive interview was conducted to assess item comprehension, recall, and interpretation and to establish the initial content validity of the survey. Thematic analysis of verbatim transcripts yielded 5 main themes for which participants (n = 8 suggested survey modifications, including survey item improvements, suggestions for additional items, community-specific issues, changes in the skip logic of the survey items, and the identification of typographical errors. Population-specific modifications were made, including the development of more culturally appropriate questions relevant to the community. Group-based cognitive interviewing provided an efficient and effective method for piloting a cardiovascular health survey instrument using CBPR.
Development of a self-report physical function instrument for disability assessment: item pool construction and factor analysis.

Science.gov (United States)

McDonough, Christine M; Jette, Alan M; Ni, Pengsheng; Bogusz, Kara; Marfeo, Elizabeth E; Brandt, Diane E; Chan, Leighton; Meterko, Mark; Haley, Stephen M; Rasch, Elizabeth K

2013-09-01

To build a comprehensive item pool representing work-relevant physical functioning and to test the factor structure of the item pool. These developmental steps represent initial outcomes of a broader project to develop instruments for the assessment of function within the context of Social Security Administration (SSA) disability programs. Comprehensive literature review; gap analysis; item generation with expert panel input; stakeholder interviews; cognitive interviews; cross-sectional survey administration; and exploratory and confirmatory factor analyses to assess item pool structure. In-person and semistructured interviews and Internet and telephone surveys. Sample of SSA claimants (n=1017) and a normative sample of adults from the U.S. general population (n=999). Not applicable. Model fit statistics. The final item pool consisted of 139 items. Within the claimant sample, 58.7% were white; 31.8% were black; 46.6% were women; and the mean age was 49.7 years. Initial factor analyses revealed a 4-factor solution, which included more items and allowed separate characterization of: (1) changing and maintaining body position, (2) whole body mobility, (3) upper body function, and (4) upper extremity fine motor. The final 4-factor model included 91 items. Confirmatory factor analyses for the 4-factor models for the claimant and the normative samples demonstrated very good fit. Fit statistics for claimant and normative samples, respectively, were: Comparative Fit Index=.93 and .98; Tucker-Lewis Index=.92 and .98; and root mean square error approximation=.05 and .04. The factor structure of the physical function item pool closely resembled the hypothesized content model. The 4 scales relevant to work activities offer promise for providing reliable information about claimant physical functioning relevant to work disability. Copyright © 2013 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Development of a Microsoft Excel tool for one-parameter Rasch model of continuous items: an application to a safety attitude survey

Directory of Open Access Journals (Sweden)

Tsair-Wei Chien

2017-01-01

Full Text Available Abstract Background Many continuous item responses (CIRs are encountered in healthcare settings, but no one uses item response theory’s (IRT probabilistic modeling to present graphical presentations for interpreting CIR results. A computer module that is programmed to deal with CIRs is required. To present a computer module, validate it, and verify its usefulness in dealing with CIR data, and then to apply the model to real healthcare data in order to show how the CIR that can be applied to healthcare settings with an example regarding a safety attitude survey. Methods Using Microsoft Excel VBA (Visual Basic for Applications, we designed a computer module that minimizes the residuals and calculates model’s expected scores according to person responses across items. Rasch models based on a Wright map and on KIDMAP were demonstrated to interpret results of the safety attitude survey. Results The author-made CIR module yielded OUTFIT mean square (MNSQ and person measures equivalent to those yielded by professional Rasch Winsteps software. The probabilistic modeling of the CIR module provides messages that are much more valuable to users and show the CIR advantage over classic test theory. Conclusions Because of advances in computer technology, healthcare users who are familiar to MS Excel can easily apply the study CIR module to deal with continuous variables to benefit comparisons of data with a logistic distribution and model fit statistics.
ABORTION ATTITUDES, 1984-1987-1988 - EFFECTS OF ITEM ORDER AND DIMENSIONALITY

NARCIS (Netherlands)

TENVERGERT, E; GILLESPIE, MW; KINGMA, J; KLASEN, H

The comparability of surveys is often hampered by differences in the item order of presentation. The major focus of the present study was to investigate whether a general item or a specific item at the beginning of the questionnaire would affect the endorsement as well as the scalability of a set of
Development and Validation of the Poverty Attributions Survey

Science.gov (United States)

Bennett, Robert M.; Raiz, Lisa; Davis, Tamara S.

2016-01-01

This article describes the process of developing and testing the Poverty Attribution Survey (PAS), a measure of poverty attributions. The PAS is theory based and includes original items as well as items from previously tested poverty attribution instruments. The PAS was electronically administered to a sample of state-licensed professional social…
The Long-Term Conditions Questionnaire: conceptual framework and item development.

Science.gov (United States)

Peters, Michele; Potter, Caroline M; Kelly, Laura; Hunter, Cheryl; Gibbons, Elizabeth; Jenkinson, Crispin; Coulter, Angela; Forder, Julien; Towers, Ann-Marie; A'Court, Christine; Fitzpatrick, Ray

2016-01-01

To identify the main issues of importance when living with long-term conditions to refine a conceptual framework for informing the item development of a patient-reported outcome measure for long-term conditions. Semi-structured qualitative interviews (n=48) were conducted with people living with at least one long-term condition. Participants were recruited through primary care. The interviews were transcribed verbatim and analyzed by thematic analysis. The analysis served to refine the conceptual framework, based on reviews of the literature and stakeholder consultations, for developing candidate items for a new measure for long-term conditions. Three main organizing concepts were identified: impact of long-term conditions, experience of services and support, and self-care. The findings helped to refine a conceptual framework, leading to the development of 23 items that represent issues of importance in long-term conditions. The 23 candidate items formed the first draft of the measure, currently named the Long-Term Conditions Questionnaire. The aim of this study was to refine the conceptual framework and develop items for a patient-reported outcome measure for long-term conditions, including single and multiple morbidities and physical and mental health conditions. Qualitative interviews identified the key themes for assessing outcomes in long-term conditions, and these underpinned the development of the initial draft of the measure. These initial items will undergo cognitive testing to refine the items prior to further validation in a survey.
Including Item Characteristics in the Probabilistic Latent Semantic Analysis Model for Collaborative Filtering

NARCIS (Netherlands)

M. Kagie (Martijn); M.J.H.M. van der Loos (Matthijs); M.C. van Wezel (Michiel)

2008-01-01

textabstractWe propose a new hybrid recommender system that combines some advantages of collaborative and content-based recommender systems. While it uses ratings data of all users, as do collaborative recommender systems, it is also able to recommend new items and provide an explanation of its
An emotional functioning item bank of 24 items for computerized adaptive testing (CAT) was established

DEFF Research Database (Denmark)

Petersen, Morten Aa.; Gamper, Eva-Maria; Costantini, Anna

2016-01-01

of the widely used EORTC Quality of Life questionnaire (QLQ-C30). STUDY DESIGN AND SETTING: On the basis of literature search and evaluations by international samples of experts and cancer patients, 38 candidate items were developed. The psychometric properties of the items were evaluated in a large...... international sample of cancer patients. This included evaluations of dimensionality, item response theory (IRT) model fit, differential item functioning (DIF), and of measurement precision/statistical power. RESULTS: Responses were obtained from 1,023 cancer patients from four countries. The evaluations showed...... that 24 items could be included in a unidimensional IRT model. DIF did not seem to have any significant impact on the estimation of EF. Evaluations indicated that the CAT measure may reduce sample size requirements by up to 50% compared to the QLQ-C30 EF scale without reducing power. CONCLUSION...
Development of the Chicago Food Allergy Research Surveys: assessing knowledge, attitudes, and beliefs of parents, physicians, and the general public

Directory of Open Access Journals (Sweden)

Pongracic Jacqueline A

2009-08-01

Full Text Available Abstract Background Parents of children with food allergy, primary care physicians, and members of the general public play a critical role in the health and well-being of food-allergic children, though little is known about their knowledge and perceptions of food allergy. The purpose of this paper is to detail the development of the Chicago Food Allergy Research Surveys to assess food allergy knowledge, attitudes, and beliefs among these three populations. Methods From 2006–2008, parents of food-allergic children, pediatricians, family physicians, and adult members of the general public were recruited to assist in survey development. Preliminary analysis included literature review, creation of initial content domains, expert panel review, and focus groups. Survey validation included creation of initial survey items, expert panel ratings, cognitive interviews, reliability testing, item reduction, and final validation. National administration of the surveys is ongoing. Results Nine experts were assembled to oversee survey development. Six focus groups were held: 2/survey population, 4–9 participants/group; transcripts were reviewed via constant comparative methods to identify emerging themes and inform item creation. At least 220 participants per population were recruited to assess the relevance, reliability, and utility of each survey item as follows: cognitive interviews, 10 participants; reliability testing ≥ 10; item reduction ≥ 50; and final validation, 150 respondents. Conclusion The Chicago Food Allergy Research surveys offer validated tools to assess food allergy knowledge and perceptions among three distinct populations: a 42 item parent tool, a 50 item physician tool, and a 35 item general public tool. No such tools were previously available.
P values in display items are ubiquitous and almost invariably significant: A survey of top science journals.

Science.gov (United States)

Cristea, Ioana Alina; Ioannidis, John P A

2018-01-01

P values represent a widely used, but pervasively misunderstood and fiercely contested method of scientific inference. Display items, such as figures and tables, often containing the main results, are an important source of P values. We conducted a survey comparing the overall use of P values and the occurrence of significant P values in display items of a sample of articles in the three top multidisciplinary journals (Nature, Science, PNAS) in 2017 and, respectively, in 1997. We also examined the reporting of multiplicity corrections and its potential influence on the proportion of statistically significant P values. Our findings demonstrated substantial and growing reliance on P values in display items, with increases of 2.5 to 14.5 times in 2017 compared to 1997. The overwhelming majority of P values (94%, 95% confidence interval [CI] 92% to 96%) were statistically significant. Methods to adjust for multiplicity were almost non-existent in 1997, but reported in many articles relying on P values in 2017 (Nature 68%, Science 48%, PNAS 38%). In their absence, almost all reported P values were statistically significant (98%, 95% CI 96% to 99%). Conversely, when any multiplicity corrections were described, 88% (95% CI 82% to 93%) of reported P values were statistically significant. Use of Bayesian methods was scant (2.5%) and rarely (0.7%) articles relied exclusively on Bayesian statistics. Overall, wider appreciation of the need for multiplicity corrections is a welcome evolution, but the rapid growth of reliance on P values and implausibly high rates of reported statistical significance are worrisome.
Calibration of context-specific survey items to assess youth physical activity behaviour.

Science.gov (United States)

Saint-Maurice, Pedro F; Welk, Gregory J; Bartee, R Todd; Heelan, Kate

2017-05-01

This study tests calibration models to re-scale context-specific physical activity (PA) items to accelerometer-derived PA. A total of 195 4th-12th grades children wore an Actigraph monitor and completed the Physical Activity Questionnaire (PAQ) one week later. The relative time spent in moderate-to-vigorous PA (MVPA % ) obtained from the Actigraph at recess, PE, lunch, after-school, evening and weekend was matched with a respective item score obtained from the PAQ's. Item scores from 145 participants were calibrated against objective MVPA % using multiple linear regression with age, and sex as additional predictors. Predicted minutes of MVPA for school, out-of-school and total week were tested in the remaining sample (n = 50) using equivalence testing. The results showed that PAQ β-weights ranged from 0.06 (lunch) to 4.94 (PE) MVPA % (P PAQ and accelerometer MVPA at school and out-of-school ranged from -15.6 to +3.8 min and the PAQ was within 10-15% of accelerometer measured activity. This study demonstrated that context-specific items can be calibrated to predict minutes of MVPA in groups of youth during in- and out-of-school periods.

Survey indicated that core outcome set development is increasingly including patients, being conducted internationally and using Delphi surveys.

Science.gov (United States)

Biggane, Alice M; Brading, Lucy; Ravaud, Philippe; Young, Bridget; Williamson, Paula R

2018-02-17

There are numerous challenges in including patients in a core outcome set (COS) study, these can vary depending on the patient group. This study describes current efforts to include patients in the development of COS, with the aim of identifying areas for further improvement and study. Using the COMET database, corresponding authors of COS projects registered or published from 1 January 2013 to 2 February 2017 were invited via a personalised email to participate in a short online survey. The survey and emails were constructed to maximise the response rate by following the academic literature on enhancing survey responses. Personalised reminder emails were sent to non-responders. This survey explored the frequency of patient input in COS studies, who was involved, what methods were used and whether or not the COS development was international. One hundred and ninety-two COS developers were sent the survey. Responses were collected from 21 February 2017 until 7 May 2017. One hundred and forty-six unique developers responded, yielding a 76% response rate and data in relation to 195 unique COSs (as some developers had worked on multiple COSs). Of focus here are their responses regarding 162 COSs at the published, completed or ongoing stages of development. Inclusion of patient participants was indicated in 87% (141/162) of COSs in the published completed or ongoing stages and over 94% (65/69) of ongoing COS projects. Nearly half (65/135) of COSs included patient participants from two or more countries and 22% (30/135) included patient participants from five or more countries. The Delphi survey was reported as being used singularly or in combination with other methods in 85% (119/140) of projects. Almost a quarter (16/65) of ongoing studies reported using a combination of qualitative interviews, Delphi survey and consensus meeting. These findings indicated that the Delphi survey is the most popular method of facilitating patient participation, while the combination of
Communicating Quantitative Literacy: An Examination of Open-Ended Assessment Items in TIMSS, NALS, IALS, and PISA

Directory of Open Access Journals (Sweden)

Karl W. Kosko

2011-07-01

Full Text Available Quantitative Literacy (QL has been described as the skill set an individual uses when interacting with the world in a quantitative manner. A necessary component of this interaction is communication. To this end, assessments of QL have included open-ended items as a means of including communicative aspects of QL. The present study sought to examine whether such open-ended items typically measured aspects of quantitative communication, as compared to mathematical communication, or mathematical skills. We focused on public-released items and rubrics from four of the most widely referenced assessments: the Third International Mathematics and Science Study (TIMSS-95: the National Adult Literacy Survey (NALS; now the National Assessment of Adult Literacy, NAAL in 1985 and 1992, the International Adult Literacy Skills (IALS beginning in 1994; and the Program for International Student Assessment (PISA beginning in 2000. We found that open-ended item rubrics in these QL assessments showed a strong tendency to assess answer-only responses. Therefore, while some open-ended items may have required certain levels of quantitative reasoning to find a solution, it is the solution rather than the reasoning that was often assessed.
Ninth Triennial Toxicology Salary Survey.

Science.gov (United States)

Gad, Shayne Cox; Sullivan, Dexter Wayne

2016-01-01

This survey serves as the ninth in a series of toxicology salary surveys conducted at 3-year intervals and beginning in 1988. An electronic survey instrument was distributed to 5919 individuals including members of the Society of Toxicology, American College of Toxicology, and 23 additional professional organizations. Question items inquired about gender, age, degree, years of experience, certifications held, areas of specialization, society membership, employment and income. Overall, 1293 responses were received (response rate 21.8%). The results of the 2014 survey provide insight into the job market and career path for current and future toxicologists. © The Author(s) 2016.
Normative data for the 12 item WHO Disability Assessment Schedule 2.0.

Directory of Open Access Journals (Sweden)

Gavin Andrews

Full Text Available BACKGROUND: The World Health Organization Disability Assessment Schedule (WHODAS 2.0 measures disability due to health conditions including diseases, illnesses, injuries, mental or emotional problems, and problems with alcohol or drugs. METHOD: The 12 Item WHODAS 2.0 was used in the second Australian Survey of Mental Health and Well-being. We report the overall factor structure and the distribution of scores and normative data (means and SDs for people with any physical disorder, any mental disorder and for people with neither. FINDINGS: A single second order factor justifies the use of the scale as a measure of global disability. People with mental disorders had high scores (mean 6.3, SD 7.1, people with physical disorders had lower scores (mean 4.3, SD 6.1. People with no disorder covered by the survey had low scores (mean 1.4, SD 3.6. INTERPRETATION: The provision of normative data from a population sample of adults will facilitate use of the WHODAS 2.0 12 item scale in clinical and epidemiological research.
Further Investigating Method Effects Associated with Negatively Worded Items on Self-Report Surveys

Science.gov (United States)

DiStefano, Christine; Motl, Robert W.

2006-01-01

This article used multitrait-multimethod methodology and covariance modeling for an investigation of the presence and correlates of method effects associated with negatively worded items on the Rosenberg Self-Esteem (RSE) scale (Rosenberg, 1989) using a sample of 757 adults. Results showed that method effects associated with negative item phrasing…
Shortening a Patient Experiences Survey for Medical Homes

Directory of Open Access Journals (Sweden)

Judy H. Ng

2015-12-01

Full Text Available The Consumer Assessment of Healthcare Providers and Systems—Patient-Centered Medical Home (CAHPS PCMH Survey assesses patient experiences reflecting domains of care related to general patient experience (access to care, communication with providers, office staff interaction, provider rating and PCMH-specific aspects of patient care (comprehensiveness of care, self-management support, shared decision making. The current work compares psychometric properties of the current survey and a proposed shortened version of the survey (from 52 to 26 adult survey items, from 66 to 31 child survey items. The revisions were based on initial psychometric analysis and stakeholder input regarding survey length concerns. A total of 268 practices voluntarily submitted adult surveys and 58 submitted child survey data to the National Committee for Quality Assurance in 2013. Mean unadjusted scores, practice-level item and composite reliability, and item-to-scale correlations were calculated. Results show that the shorter adult survey has lower reliability, but still it still meets general definitions of a sound survey for the adult version, and resulted in few changes to mean scores. The impact was more problematic for the pediatric version. Further testing is needed to investigate approaches to improving survey response and the relevance of survey items in informing quality improvement.
Location Indices for Ordinal Polytomous Items Based on Item Response Theory. Research Report. ETS RR-15-20

Science.gov (United States)

Ali, Usama S.; Chang, Hua-Hua; Anderson, Carolyn J.

2015-01-01

Polytomous items are typically described by multiple category-related parameters; situations, however, arise in which a single index is needed to describe an item's location along a latent trait continuum. Situations in which a single index would be needed include item selection in computerized adaptive testing or test assembly. Therefore single…
Using Likert-type and ipsative/forced choice items in sequence to generate a preference.

Science.gov (United States)

Ried, L Douglas

2014-01-01

Collaboration and implementation of a minimum, standardized set of core global educational and professional competencies seems appropriate given the expanding international evolution of pharmacy practice. However, winnowing down hundreds of competencies from a plethora of local, national and international competency frameworks to select the most highly preferred to be included in the core set is a daunting task. The objective of this paper is to describe a combination of strategies used to ascertain the most highly preferred items among a large number of disparate items. In this case, the items were >100 educational and professional competencies that might be incorporated as the core components of new and existing competency frameworks. Panelists (n = 30) from the European Union (EU) and United States (USA) were chosen to reflect a variety of practice settings. Each panelist completed two electronic surveys. The first survey presented competencies in a Likert-type format and the second survey presented many of the same competencies in an ipsative/forced choice format. Item mean scores were calculated for each competency, the competencies were ranked, and non-parametric statistical tests were used to ascertain the consistency in the rankings achieved by the two strategies. This exploratory study presented over 100 competencies to the panelists in the beginning. The two methods provided similar results, as indicated by the significant correlation between the rankings (Spearman's rho = 0.30, P < 0.09). A two-step strategy using Likert-type and ipsative/forced choice formats in sequence, appears to be useful in a situation where a clear preference is required from among a large number of choices. The ipsative/forced choice format resulted in some differences in the competency preferences because the panelists could not rate them equally by design. While this strategy was used for the selection of professional educational competencies in this exploratory study, it is
Analyzing force concept inventory with item response theory

Science.gov (United States)

Wang, Jing; Bao, Lei

2010-10-01

Item response theory is a popular assessment method used in education. It rests on the assumption of a probability framework that relates students' innate ability and their performance on test questions. Item response theory transforms students' raw test scores into a scaled proficiency score, which can be used to compare results obtained with different test questions. The scaled score also addresses the issues of ceiling effects and guessing, which commonly exist in quantitative assessment. We used item response theory to analyze the force concept inventory (FCI). Our results show that item response theory can be useful for analyzing physics concept surveys such as the FCI and produces results about the individual questions and student performance that are beyond the capability of classical statistics. The theory yields detailed measurement parameters regarding the difficulty, discrimination features, and probability of correct guess for each of the FCI questions.
Using the LOINC Semantic Structure to Integrate Community-based Survey Items into a Concept-based Enterprise Data Dictionary to Support Comparative Effectiveness Research.

Science.gov (United States)

Co, Manuel C; Boden-Albala, Bernadette; Quarles, Leigh; Wilcox, Adam; Bakken, Suzanne

2012-01-01

In designing informatics infrastructure to support comparative effectiveness research (CER), it is necessary to implement approaches for integrating heterogeneous data sources such as clinical data typically stored in clinical data warehouses and those that are normally stored in separate research databases. One strategy to support this integration is the use of a concept-oriented data dictionary with a set of semantic terminology models. The aim of this paper is to illustrate the use of the semantic structure of Clinical LOINC (Logical Observation Identifiers, Names, and Codes) in integrating community-based survey items into the Medical Entities Dictionary (MED) to support the integration of survey data with clinical data for CER studies.
The Iranian version of 12-item Short Form Health Survey (SF-12): factor structure, internal consistency and construct validity.

Science.gov (United States)

Montazeri, Ali; Vahdaninia, Mariam; Mousavi, Sayed Javad; Omidvari, Speideh

2009-09-16

The 12-item Short Form Health Survey (SF-12) as a shorter alternative of the SF-36 is largely used in health outcomes surveys. The aim of this study was to validate the SF-12 in Iran. A random sample of the general population aged 15 years and over living in Tehran, Iran completed the SF-12. Reliability was estimated using internal consistency and validity was assessed using known groups comparison and convergent validity. In addition, the factor structure of the questionnaire was extracted by performing both exploratory factor analysis (EFA) and confirmatory factor analysis (CFA). In all, 5587 individuals were studied (2721 male and 2866 female). The mean age and formal education of the respondents were 35.1 (SD = 15.4) and 10.2 (SD = 4.4) years respectively. The results showed satisfactory internal consistency for both summary measures, that are the Physical Component Summary (PCS) and the Mental Component Summary (MCS); Cronbach's alpha for PCS-12 and MCS-12 was 0.73 and 0.72, respectively. Known-groups comparison showed that the SF-12 discriminated well between men and women and those who differed in age and educational status (P < 0.001). In addition, correlations between the SF-12 scales and single items showed that the physical functioning, role physical, bodily pain and general health subscales correlated higher with the PCS-12 score, while the vitality, social functioning, role emotional and mental health subscales more correlated with the MCS-12 score lending support to its good convergent validity. Finally the principal component analysis indicated a two-factor structure (physical and mental health) that jointly accounted for 57.8% of the variance. The confirmatory factory analysis also indicated a good fit to the data for the two-latent structure (physical and mental health). In general the findings suggest that the SF-12 is a reliable and valid measure of health related quality of life among Iranian population. However, further studies are needed to
Survey of Public Understanding on Energy Resources including Nuclear Energy (I)

International Nuclear Information System (INIS)

Park, Se-Moon; Song, Sun-Ja

2007-01-01

Women in Nuclear-Korea (WINK) surveyed the public understanding on various energy resources in early September 2006 to offer the result for establishment of the nuclear communication policy. The reason why this survey includes other energy resources is because the previous works are only limited on nuclear energy, and also aimed to know the public's opinion on the present communication skill of nuclear energy for the public understanding. The present study is purposed of having data how public understands nuclear energy compared to other energies, such as fossil fuels, hydro power, and other sustainable energies. The data obtained from this survey have shown different results according to the responded group; age, gender, residential area, etc. Responded numbers are more than 2,000 of general public and university students. The survey result shows that nuclear understanding is more negative in women than in men, and is more negative in young than older age
Development of the Oxford Participation and Activities Questionnaire: constructing an item pool

Directory of Open Access Journals (Sweden)

Kelly L

2015-05-01

Full Text Available Laura Kelly, Crispin Jenkinson, Sarah Dummett, Jill Dawson, Ray Fitzpatrick, David Morley Health Services Research Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK Purpose: The Oxford Participation and Activities Questionnaire is a patient-reported outcome measure in development that is grounded on the World Health Organization International Classification of Functioning, Disability, and Health (ICF. The study reported here aimed to inform and generate an item pool for the new measure, which is specifically designed for the assessment of participation and activity in patients experiencing a range of health conditions. Methods: Items were informed through in-depth interviews conducted with 37 participants spanning a range of conditions. Interviews aimed to identify how their condition impacted their ability to participate in meaningful activities. Conditions included arthritis, cancer, chronic back pain, diabetes, motor neuron disease, multiple sclerosis, Parkinson's disease, and spinal cord injury. Transcripts were analyzed using the framework method. Statements relating to ICF themes were recast as questionnaire items and shown for review to an expert panel. Cognitive debrief interviews (n=13 were used to assess items for face and content validity. Results: ICF themes relevant to activities and participation in everyday life were explored, and a total of 222 items formed the initial item pool. This item pool was refined by the research team and 28 generic items were mapped onto all nine chapters of the ICF construct, detailing activity and participation. Cognitive interviewing confirmed the questionnaire instructions, items, and response options were acceptable to participants. Conclusion: Using a clear conceptual basis to inform item generation, 28 items have been identified as suitable to undergo further psychometric testing. A large-scale postal survey will follow in order to refine the instrument further and
Using automatic item generation to create multiple-choice test items.

Science.gov (United States)

Gierl, Mark J; Lai, Hollis; Turner, Simon R

2012-08-01

Many tests of medical knowledge, from the undergraduate level to the level of certification and licensure, contain multiple-choice items. Although these are efficient in measuring examinees' knowledge and skills across diverse content areas, multiple-choice items are time-consuming and expensive to create. Changes in student assessment brought about by new forms of computer-based testing have created the demand for large numbers of multiple-choice items. Our current approaches to item development cannot meet this demand. We present a methodology for developing multiple-choice items based on automatic item generation (AIG) concepts and procedures. We describe a three-stage approach to AIG and we illustrate this approach by generating multiple-choice items for a medical licensure test in the content area of surgery. To generate multiple-choice items, our method requires a three-stage process. Firstly, a cognitive model is created by content specialists. Secondly, item models are developed using the content from the cognitive model. Thirdly, items are generated from the item models using computer software. Using this methodology, we generated 1248 multiple-choice items from one item model. Automatic item generation is a process that involves using models to generate items using computer technology. With our method, content specialists identify and structure the content for the test items, and computer technology systematically combines the content to generate new test items. By combining these outcomes, items can be generated automatically. © Blackwell Publishing Ltd 2012.
Quality of life assessed with the medical outcomes study short form 36-item health survey of patients on renal replacement therapy: A systematic review and meta-analysis

NARCIS (Netherlands)

Y.S. Liem (Ylian Serina); J.L. Bosch (Johanna); L.R. Arends (Lidia); M.H. Heijenbrok-Kal (Majanka); M.G.M. Hunink (Myriam)

2007-01-01

textabstractObjectives: The Medical Outcomes Study Short Form 36-Item Health Survey (SF-36) is the most widely used generic instrument to estimate quality of life of patients on renal replacement therapy. Purpose of this study was to summarize and compare the published literature on quality of
A Preliminary Analysis of the 1999 USMC Web-Based Exit Survey

National Research Council Canada - National Science Library

Hocevar, Susan

2000-01-01

.... Items included in the survey represented such factors as: pay and benefits, job characteristics, career issues, family and personal life, leadership, culture, standards, unit morale, personal freedom, and optempo...
Development of Rasch-based item banks for the assessment of work performance in patients with musculoskeletal diseases.

Science.gov (United States)

Mueller, Evelyn A; Bengel, Juergen; Wirtz, Markus A

2013-12-01

This study aimed to develop a self-description assessment instrument to measure work performance in patients with musculoskeletal diseases. In terms of the International Classification of Functioning, Disability and Health (ICF), work performance is defined as the degree of meeting the work demands (activities) at the actual workplace (environment). To account for the fact that work performance depends on the work demands of the job, we strived to develop item banks that allow a flexible use of item subgroups depending on the specific work demands of the patients' jobs. Item development included the collection of work tasks from literature and content validation through expert surveys and patient interviews. The resulting 122 items were answered by 621 patients with musculoskeletal diseases. Exploratory factor analysis to ascertain dimensionality and Rasch analysis (partial credit model) for each of the resulting dimensions were performed. Exploratory factor analysis resulted in four dimensions, and subsequent Rasch analysis led to the following item banks: 'impaired productivity' (15 items), 'impaired cognitive performance' (18), 'impaired coping with stress' (13) and 'impaired physical performance' (low physical workload 20 items, high physical workload 10 items). The item banks exhibited person separation indices (reliability) between 0.89 and 0.96. The assessment of work performance adds the activities component to the more commonly employed participation component of the ICF-model. The four item banks can be adapted to specific jobs where necessary without losing comparability of person measures, as the item banks are based on Rasch analysis.
Using Localized Survey Items to Augment Standardized Benchmarking Measures: A LibQUAL+[TM] Study

Science.gov (United States)

Thompson, Bruce; Cook, Colleen; Kyrillidou, Martha

2006-01-01

The LibQUAL+[TM] protocol solicits open-ended comments from users with regard to library service quality, gathers data on 22 core items, and, at the option of individual libraries, also garners ratings on five items drawn from a pool of more than 100 choices selected by libraries. In this article, the relationship of scores on these locally…
Procurement Engineering Process for Commercial Grade Item Dedication

International Nuclear Information System (INIS)

Park, Jong-Hyuck; Park, Jong-Eun; Kwak, Tack-Hun; Yoo, Keun-Bae; Lee, Sang-Guk; Hong, Sung-Yull

2006-01-01

Procurement Engineering Process for commercial grade item dedication plays an increasingly important role in operation management of Korea Nuclear Power Plants. The purpose of the Procurement Engineering Process is the provision and assurance of a high quality and quantity of spare, replacement, retrofit and new parts and equipment while maximizing plant availability, minimizing downtime due to parts unavailability and providing reasonable overall program and inventory cost. In this paper, we will review the overview requirements, responsibilities and the process for demonstrating with reasonable assurance that a procured item for potential nuclear safety related services or other essential plant service is adequate with reasonable assurance for its application. This paper does not cover the details of technical evaluation, selecting critical characteristics, selecting acceptance methods, performing failure modes and effects analysis, performing source surveillance, performing quality surveys, performing special tests and inspections, and the other aspects of effective Procurement Engineering and Commercial Grade Item Dedication. The main contribution of this paper is to provide the provision of an overview of Procurement Engineering Process for commercial grade item
Structural Validation of a French Food Frequency Questionnaire of 94 Items.

Science.gov (United States)

Gazan, Rozenn; Vieux, Florent; Darmon, Nicole; Maillot, Matthieu

2017-01-01

Food frequency questionnaires (FFQs) are used to estimate the usual food and nutrient intakes over a period of time. Such estimates can suffer from measurement errors, either due to bias induced by respondent's answers or to errors induced by the structure of the questionnaire (e.g., using a limited number of food items and an aggregated food database with average portion sizes). The "structural validation" presented in this study aims to isolate and quantify the impact of the inherent structure of a FFQ on the estimation of food and nutrient intakes, independently of respondent's perception of the questionnaire. A semi-quantitative FFQ ( n = 94 items, including 50 items with questions on portion sizes) and an associated aggregated food composition database (named the item-composition database) were developed, based on the self-reported weekly dietary records of 1918 adults (18-79 years-old) in the French Individual and National Dietary Survey 2 (INCA2), and the French CIQUAL 2013 food-composition database of all the foods ( n = 1342 foods) declared as consumed in the population. Reference intakes of foods ("REF_FOOD") and nutrients ("REF_NUT") were calculated for each adult using the food-composition database and the amounts of foods self-reported in his/her dietary record. Then, answers to the FFQ were simulated for each adult based on his/her self-reported dietary record. "FFQ_FOOD" and "FFQ_NUT" intakes were estimated using the simulated answers and the item-composition database. Measurement errors (in %), spearman correlations and cross-classification were used to compare "REF_FOOD" with "FFQ_FOOD" and "REF_NUT" with "FFQ_NUT". Compared to "REF_NUT," "FFQ_NUT" total quantity and total energy intake were underestimated on average by 198 g/day and 666 kJ/day, respectively. "FFQ_FOOD" intakes were well estimated for starches, underestimated for most of the subgroups, and overestimated for some subgroups, in particular vegetables. Underestimation were

2012 Workplace and Gender Relations Survey of Active Duty Members. Survey Note and Briefing

Science.gov (United States)

2013-03-15

items regarding unwanted attempts to establish a sexual relationship – Sexual Coercion – four items regarding classic quid pro quo instances of special...continues to emphasize sexual assault and sexual harassment response and prevention in the military. This survey note discusses findings from the... harassment in the active duty force. This survey note and accompanying briefing (Appendix) provide information on the prevalence rates of sexual
Assessing item fit for unidimensional item response theory models using residuals from estimated item response functions.

Science.gov (United States)

Haberman, Shelby J; Sinharay, Sandip; Chon, Kyong Hee

2013-07-01

Residual analysis (e.g. Hambleton & Swaminathan, Item response theory: principles and applications, Kluwer Academic, Boston, 1985; Hambleton, Swaminathan, & Rogers, Fundamentals of item response theory, Sage, Newbury Park, 1991) is a popular method to assess fit of item response theory (IRT) models. We suggest a form of residual analysis that may be applied to assess item fit for unidimensional IRT models. The residual analysis consists of a comparison of the maximum-likelihood estimate of the item characteristic curve with an alternative ratio estimate of the item characteristic curve. The large sample distribution of the residual is proved to be standardized normal when the IRT model fits the data. We compare the performance of our suggested residual to the standardized residual of Hambleton et al. (Fundamentals of item response theory, Sage, Newbury Park, 1991) in a detailed simulation study. We then calculate our suggested residuals using data from an operational test. The residuals appear to be useful in assessing the item fit for unidimensional IRT models.
Problems with the factor analysis of items: Solutions based on item response theory and item parcelling

Directory of Open Access Journals (Sweden)

Gideon P. De Bruin

2004-10-01

Full Text Available The factor analysis of items often produces spurious results in the sense that unidimensional scales appear multidimensional. This may be ascribed to failure in meeting the assumptions of linearity and normality on which factor analysis is based. Item response theory is explicitly designed for the modelling of the non-linear relations between ordinal variables and provides a strong alternative to the factor analysis of items. Items may also be combined in parcels that are more likely to satisfy the assumptions of factor analysis than do the items. The use of the Rasch rating scale model and the factor analysis of parcels is illustrated with data obtained with the Locus of Control Inventory. The results of these analyses are compared with the results obtained through the factor analysis of items. It is shown that the Rasch rating scale model and the factoring of parcels produce superior results to the factor analysis of items. Recommendations for the analysis of scales are made. Opsomming Die faktorontleding van items lewer dikwels misleidende resultate op, veral in die opsig dat eendimensionele skale as meerdimensioneel voorkom. Hierdie resultate kan dikwels daaraan toegeskryf word dat daar nie aan die aannames van lineariteit en normaliteit waarop faktorontleding berus, voldoen word nie. Itemresponsteorie, wat eksplisiet vir die modellering van die nie-liniêre verbande tussen ordinale items ontwerp is, bied ’n aantreklike alternatief vir die faktorontleding van items. Items kan ook in pakkies gegroepeer word wat meer waarskynlik aan die aannames van faktorontleding voldoen as individuele items. Die gebruik van die Rasch beoordelingskaalmodel en die faktorontleding van pakkies word aan die hand van data wat met die Lokus van Beheervraelys verkry is, gedemonstreer. Die resultate van hierdie ontledings word vergelyk met die resultate wat deur ‘n faktorontleding van die individuele items verkry is. Die resultate dui daarop dat die Rasch
Item information and discrimination functions for trinary PCM items

NARCIS (Netherlands)

Akkermans, Wies; Muraki, Eiji

1997-01-01

For trinary partial credit items the shape of the item information and the item discrimination function is examined in relation to the item parameters. In particular, it is shown that these functions are unimodal if δ2 – δ1 < 4 ln 2 and bimodal otherwise. The locations and values of the maxima are
38 CFR 3.1606 - Transportation items.

Science.gov (United States)

2010-07-01

... 38 Pensions, Bonuses, and Veterans' Relief 1 2010-07-01 2010-07-01 false Transportation items. 3... Burial Benefits § 3.1606 Transportation items. The transportation costs of those persons who come within... shipment. (6) Cost of transportation by common carrier including amounts paid as Federal taxes. (7) Cost of...
Effect of Differential Item Functioning on Test Equating

Science.gov (United States)

Kabasakal, Kübra Atalay; Kelecioglu, Hülya

2015-01-01

This study examines the effect of differential item functioning (DIF) items on test equating through multilevel item response models (MIRMs) and traditional IRMs. The performances of three different equating models were investigated under 24 different simulation conditions, and the variables whose effects were examined included sample size, test…
Merit Principles Survey 2016 Data

Data.gov (United States)

Merit Systems Protection Board — MPS contains a combination of core items that MSPB tracks over time and special-purpose items developed to support a particular special study. This survey differs...
2012 Workplace and Gender Relations Survey of Reserve Component Members (Survey Note No. 2013-002)

Science.gov (United States)

2013-01-18

items regarding unwanted attempts to establish a sexual relationship – Sexual Coercion – four items regarding classic quid pro quo instances of...Department of Defense (DoD) continues to emphasize sexual assault and sexual harassment response and prevention in the Reserve components. This survey...survey assesses the prevalence of sexual assault and sexual harassment and other gender-related issues in the National Guard and Reserves. This
2012 Workplace and Gender Relations Survey of Active Duty Members (Survey Note No. 2013-002)

Science.gov (United States)

2013-01-18

Attention – four items regarding unwanted attempts to establish a sexual relationship – Sexual Coercion – four items regarding classic quid pro quo ...of Defense (DoD) continues to emphasize sexual assault and sexual harassment response and prevention in the military. This survey note discusses...assault and sexual harassment in the active duty force. This survey note and accompanying briefing (Appendix) provide information on the prevalence
The medial temporal lobes distinguish between within-item and item-context relations during autobiographical memory retrieval.

Science.gov (United States)

Sheldon, Signy; Levine, Brian

2015-12-01

During autobiographical memory retrieval, the medial temporal lobes (MTL) relate together multiple event elements, including object (within-item relations) and context (item-context relations) information, to create a cohesive memory. There is consistent support for a functional specialization within the MTL according to these relational processes, much of which comes from recognition memory experiments. In this study, we compared brain activation patterns associated with retrieving within-item relations (i.e., associating conceptual and sensory-perceptual object features) and item-context relations (i.e., spatial relations among objects) with respect to naturalistic autobiographical retrieval. We developed a novel paradigm that cued participants to retrieve information about past autobiographical events, non-episodic within-item relations, and non-episodic item-context relations with the perceptuomotor aspects of retrieval equated across these conditions. We used multivariate analysis techniques to extract common and distinct patterns of activity among these conditions within the MTL and across the whole brain, both in terms of spatial and temporal patterns of activity. The anterior MTL (perirhinal cortex and anterior hippocampus) was preferentially recruited for generating within-item relations later in retrieval whereas the posterior MTL (posterior parahippocampal cortex and posterior hippocampus) was preferentially recruited for generating item-context relations across the retrieval phase. These findings provide novel evidence for functional specialization within the MTL with respect to naturalistic memory retrieval. © 2015 Wiley Periodicals, Inc.
Item level diagnostics and model - data fit in item response theory ...

African Journals Online (AJOL)

Item response theory (IRT) is a framework for modeling and analyzing item response data. Item-level modeling gives IRT advantages over classical test theory. The fit of an item score pattern to an item response theory (IRT) models is a necessary condition that must be assessed for further use of item and models that best fit ...
Multiple sensitive estimation and optimal sample size allocation in the item sum technique.

Science.gov (United States)

Perri, Pier Francesco; Rueda García, María Del Mar; Cobo Rodríguez, Beatriz

2018-01-01

For surveys of sensitive issues in life sciences, statistical procedures can be used to reduce nonresponse and social desirability response bias. Both of these phenomena provoke nonsampling errors that are difficult to deal with and can seriously flaw the validity of the analyses. The item sum technique (IST) is a very recent indirect questioning method derived from the item count technique that seeks to procure more reliable responses on quantitative items than direct questioning while preserving respondents' anonymity. This article addresses two important questions concerning the IST: (i) its implementation when two or more sensitive variables are investigated and efficient estimates of their unknown population means are required; (ii) the determination of the optimal sample size to achieve minimum variance estimates. These aspects are of great relevance for survey practitioners engaged in sensitive research and, to the best of our knowledge, were not studied so far. In this article, theoretical results for multiple estimation and optimal allocation are obtained under a generic sampling design and then particularized to simple random sampling and stratified sampling designs. Theoretical considerations are integrated with a number of simulation studies based on data from two real surveys and conducted to ascertain the efficiency gain derived from optimal allocation in different situations. One of the surveys concerns cannabis consumption among university students. Our findings highlight some methodological advances that can be obtained in life sciences IST surveys when optimal allocation is achieved. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Guide to good practices for the development of test items

Energy Technology Data Exchange (ETDEWEB)

NONE

1997-01-01

While the methodology used in developing test items can vary significantly, to ensure quality examinations, test items should be developed systematically. Test design and development is discussed in the DOE Guide to Good Practices for Design, Development, and Implementation of Examinations. This guide is intended to be a supplement by providing more detailed guidance on the development of specific test items. This guide addresses the development of written examination test items primarily. However, many of the concepts also apply to oral examinations, both in the classroom and on the job. This guide is intended to be used as guidance for the classroom and laboratory instructor or curriculum developer responsible for the construction of individual test items. This document focuses on written test items, but includes information relative to open-reference (open book) examination test items, as well. These test items have been categorized as short-answer, multiple-choice, or essay. Each test item format is described, examples are provided, and a procedure for development is included. The appendices provide examples for writing test items, a test item development form, and examples of various test item formats.
Measurement Equivalence in ADL and IADL Difficulty Across International Surveys of Aging: Findings From the HRS, SHARE, and ELSA

Science.gov (United States)

Kasper, Judith D.; Brandt, Jason; Pezzin, Liliana E.

2012-01-01

Objective. To examine the measurement equivalence of items on disability across three international surveys of aging. Method. Data for persons aged 65 and older were drawn from the Health and Retirement Survey (HRS, n = 10,905), English Longitudinal Study of Aging (ELSA, n = 5,437), and Survey of Health, Ageing and Retirement in Europe (SHARE, n = 13,408). Differential item functioning (DIF) was assessed using item response theory (IRT) methods for activities of daily living (ADL) and instrumental activities of daily living (IADL) items. Results. HRS and SHARE exhibited measurement equivalence, but 6 of 11 items in ELSA demonstrated meaningful DIF. At the scale level, this item-level DIF affected scores reflecting greater disability. IRT methods also spread out score distributions and shifted scores higher (toward greater disability). Results for mean disability differences by demographic characteristics, using original and DIF-adjusted scores, were the same overall but differed for some subgroup comparisons involving ELSA. Discussion. Testing and adjusting for DIF is one means of minimizing measurement error in cross-national survey comparisons. IRT methods were used to evaluate potential measurement bias in disability comparisons across three international surveys of aging. The analysis also suggested DIF was mitigated for scales including both ADL and IADL and that summary indexes (counts of limitations) likely underestimate mean disability in these international populations. PMID:22156662
Methodological quality of diagnostic accuracy studies on non-invasive coronary CT angiography: influence of QUADAS (Quality Assessment of Diagnostic Accuracy Studies included in systematic reviews) items on sensitivity and specificity

International Nuclear Information System (INIS)

Schueler, Sabine; Walther, Stefan; Schuetz, Georg M.; Schlattmann, Peter; Dewey, Marc

2013-01-01

To evaluate the methodological quality of diagnostic accuracy studies on coronary computed tomography (CT) angiography using the QUADAS (Quality Assessment of Diagnostic Accuracy Studies included in systematic reviews) tool. Each QUADAS item was individually defined to adapt it to the special requirements of studies on coronary CT angiography. Two independent investigators analysed 118 studies using 12 QUADAS items. Meta-regression and pooled analyses were performed to identify possible effects of methodological quality items on estimates of diagnostic accuracy. The overall methodological quality of coronary CT studies was merely moderate. They fulfilled a median of 7.5 out of 12 items. Only 9 of the 118 studies fulfilled more than 75 % of possible QUADAS items. One QUADAS item (''Uninterpretable Results'') showed a significant influence (P = 0.02) on estimates of diagnostic accuracy with ''no fulfilment'' increasing specificity from 86 to 90 %. Furthermore, pooled analysis revealed that each QUADAS item that is not fulfilled has the potential to change estimates of diagnostic accuracy. The methodological quality of studies investigating the diagnostic accuracy of non-invasive coronary CT is only moderate and was found to affect the sensitivity and specificity. An improvement is highly desirable because good methodology is crucial for adequately assessing imaging technologies. (orig.)
Methodological quality of diagnostic accuracy studies on non-invasive coronary CT angiography: influence of QUADAS (Quality Assessment of Diagnostic Accuracy Studies included in systematic reviews) items on sensitivity and specificity

Energy Technology Data Exchange (ETDEWEB)

Schueler, Sabine; Walther, Stefan; Schuetz, Georg M. [Humboldt-Universitaet zu Berlin, Freie Universitaet Berlin, Charite Medical School, Department of Radiology, Berlin (Germany); Schlattmann, Peter [University Hospital of Friedrich Schiller University Jena, Department of Medical Statistics, Informatics, and Documentation, Jena (Germany); Dewey, Marc [Humboldt-Universitaet zu Berlin, Freie Universitaet Berlin, Charite Medical School, Department of Radiology, Berlin (Germany); Charite, Institut fuer Radiologie, Berlin (Germany)

2013-06-15

To evaluate the methodological quality of diagnostic accuracy studies on coronary computed tomography (CT) angiography using the QUADAS (Quality Assessment of Diagnostic Accuracy Studies included in systematic reviews) tool. Each QUADAS item was individually defined to adapt it to the special requirements of studies on coronary CT angiography. Two independent investigators analysed 118 studies using 12 QUADAS items. Meta-regression and pooled analyses were performed to identify possible effects of methodological quality items on estimates of diagnostic accuracy. The overall methodological quality of coronary CT studies was merely moderate. They fulfilled a median of 7.5 out of 12 items. Only 9 of the 118 studies fulfilled more than 75 % of possible QUADAS items. One QUADAS item (''Uninterpretable Results'') showed a significant influence (P = 0.02) on estimates of diagnostic accuracy with ''no fulfilment'' increasing specificity from 86 to 90 %. Furthermore, pooled analysis revealed that each QUADAS item that is not fulfilled has the potential to change estimates of diagnostic accuracy. The methodological quality of studies investigating the diagnostic accuracy of non-invasive coronary CT is only moderate and was found to affect the sensitivity and specificity. An improvement is highly desirable because good methodology is crucial for adequately assessing imaging technologies. (orig.)
Preferred Reporting Items for Systematic Review and Meta-Analyses of individual participant data: the PRISMA-IPD Statement.

Science.gov (United States)

Stewart, Lesley A; Clarke, Mike; Rovers, Maroeska; Riley, Richard D; Simmonds, Mark; Stewart, Gavin; Tierney, Jayne F

2015-04-28

Systematic reviews and meta-analyses of individual participant data (IPD) aim to collect, check, and reanalyze individual-level data from all studies addressing a particular research question and are therefore considered a gold standard approach to evidence synthesis. They are likely to be used with increasing frequency as current initiatives to share clinical trial data gain momentum and may be particularly important in reviewing controversial therapeutic areas. To develop PRISMA-IPD as a stand-alone extension to the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) Statement, tailored to the specific requirements of reporting systematic reviews and meta-analyses of IPD. Although developed primarily for reviews of randomized trials, many items will apply in other contexts, including reviews of diagnosis and prognosis. Development of PRISMA-IPD followed the EQUATOR Network framework guidance and used the existing standard PRISMA Statement as a starting point to draft additional relevant material. A web-based survey informed discussion at an international workshop that included researchers, clinicians, methodologists experienced in conducting systematic reviews and meta-analyses of IPD, and journal editors. The statement was drafted and iterative refinements were made by the project, advisory, and development groups. The PRISMA-IPD Development Group reached agreement on the PRISMA-IPD checklist and flow diagram by consensus. Compared with standard PRISMA, the PRISMA-IPD checklist includes 3 new items that address (1) methods of checking the integrity of the IPD (such as pattern of randomization, data consistency, baseline imbalance, and missing data), (2) reporting any important issues that emerge, and (3) exploring variation (such as whether certain types of individual benefit more from the intervention than others). A further additional item was created by reorganization of standard PRISMA items relating to interpreting results. Wording
Development and Preliminary Validation of Refugee Trauma History Checklist (RTHC—A Brief Checklist for Survey Studies

Directory of Open Access Journals (Sweden)

Erika Sigvardsdotter

2017-10-01

Full Text Available A high proportion of refugees have been subjected to potentially traumatic experiences (PTEs, including torture. PTEs, and torture in particular, are powerful predictors of mental ill health. This paper reports the development and preliminary validation of a brief refugee trauma checklist applicable for survey studies. Methods: A pool of 232 items was generated based on pre-existing instruments. Conceptualization, item selection and item refinement was conducted based on existing literature and in collaboration with experts. Ten cognitive interviews using a Think Aloud Protocol (TAP were performed in a clinical setting, and field testing of the proposed checklist was performed in a total sample of n = 137 asylum seekers from Syria. Results: The proposed refugee trauma history checklist (RTHC consists of 2 × 8 items, concerning PTEs that occurred before and during the respondents’ flight, respectively. Results show low item non-response and adequate psychometric properties Conclusion: RTHC is a usable tool for providing self-report data on refugee trauma history surveys of community samples. The core set of included events can be augmented and slight modifications can be applied to RTHC for use also in other refugee populations and settings.
Development and Preliminary Validation of Refugee Trauma History Checklist (RTHC)-A Brief Checklist for Survey Studies.

Science.gov (United States)

Sigvardsdotter, Erika; Nilsson, Henrik; Malm, Andreas; Tinghög, Petter; Gottvall, Maria; Vaez, Marjan; Saboonchi, Fredrik

2017-10-04

A high proportion of refugees have been subjected to potentially traumatic experiences (PTEs), including torture. PTEs, and torture in particular, are powerful predictors of mental ill health. This paper reports the development and preliminary validation of a brief refugee trauma checklist applicable for survey studies. A pool of 232 items was generated based on pre-existing instruments. Conceptualization, item selection and item refinement was conducted based on existing literature and in collaboration with experts. Ten cognitive interviews using a Think Aloud Protocol (TAP) were performed in a clinical setting, and field testing of the proposed checklist was performed in a total sample of n = 137 asylum seekers from Syria. The proposed refugee trauma history checklist (RTHC) consists of 2 × 8 items, concerning PTEs that occurred before and during the respondents' flight, respectively. Results show low item non-response and adequate psychometric properties Conclusion: RTHC is a usable tool for providing self-report data on refugee trauma history surveys of community samples. The core set of included events can be augmented and slight modifications can be applied to RTHC for use also in other refugee populations and settings.
Structural Validation of a French Food Frequency Questionnaire of 94 Items

Directory of Open Access Journals (Sweden)

Rozenn Gazan

2017-12-01

Full Text Available BackgroundFood frequency questionnaires (FFQs are used to estimate the usual food and nutrient intakes over a period of time. Such estimates can suffer from measurement errors, either due to bias induced by respondent’s answers or to errors induced by the structure of the questionnaire (e.g., using a limited number of food items and an aggregated food database with average portion sizes. The “structural validation” presented in this study aims to isolate and quantify the impact of the inherent structure of a FFQ on the estimation of food and nutrient intakes, independently of respondent’s perception of the questionnaire.MethodsA semi-quantitative FFQ (n = 94 items, including 50 items with questions on portion sizes and an associated aggregated food composition database (named the item-composition database were developed, based on the self-reported weekly dietary records of 1918 adults (18–79 years-old in the French Individual and National Dietary Survey 2 (INCA2, and the French CIQUAL 2013 food-composition database of all the foods (n = 1342 foods declared as consumed in the population. Reference intakes of foods (“REF_FOOD” and nutrients (“REF_NUT” were calculated for each adult using the food-composition database and the amounts of foods self-reported in his/her dietary record. Then, answers to the FFQ were simulated for each adult based on his/her self-reported dietary record. “FFQ_FOOD” and “FFQ_NUT” intakes were estimated using the simulated answers and the item-composition database. Measurement errors (in %, spearman correlations and cross-classification were used to compare “REF_FOOD” with “FFQ_FOOD” and “REF_NUT” with “FFQ_NUT”.ResultsCompared to “REF_NUT,” “FFQ_NUT” total quantity and total energy intake were underestimated on average by 198 g/day and 666 kJ/day, respectively. “FFQ_FOOD” intakes were well estimated for starches, underestimated for most of the subgroups, and

An evaluation of computerized adaptive testing for general psychological distress: combining GHQ-12 and Affectometer-2 in an item bank for public mental health research.

Science.gov (United States)

Stochl, Jan; Böhnke, Jan R; Pickett, Kate E; Croudace, Tim J

2016-05-20

Recent developments in psychometric modeling and technology allow pooling well-validated items from existing instruments into larger item banks and their deployment through methods of computerized adaptive testing (CAT). Use of item response theory-based bifactor methods and integrative data analysis overcomes barriers in cross-instrument comparison. This paper presents the joint calibration of an item bank for researchers keen to investigate population variations in general psychological distress (GPD). Multidimensional item response theory was used on existing health survey data from the Scottish Health Education Population Survey (n = 766) to calibrate an item bank consisting of pooled items from the short common mental disorder screen (GHQ-12) and the Affectometer-2 (a measure of "general happiness"). Computer simulation was used to evaluate usefulness and efficacy of its adaptive administration. A bifactor model capturing variation across a continuum of population distress (while controlling for artefacts due to item wording) was supported. The numbers of items for different required reliabilities in adaptive administration demonstrated promising efficacy of the proposed item bank. Psychometric modeling of the common dimension captured by more than one instrument offers the potential of adaptive testing for GPD using individually sequenced combinations of existing survey items. The potential for linking other item sets with alternative candidate measures of positive mental health is discussed since an optimal item bank may require even more items than these.
The Iranian version of 12-item Short Form Health Survey (SF-12: factor structure, internal consistency and construct validity

Directory of Open Access Journals (Sweden)

Mousavi Sayed

2009-09-01

Full Text Available Abstract Background The 12-item Short Form Health Survey (SF-12 as a shorter alternative of the SF-36 is largely used in health outcomes surveys. The aim of this study was to validate the SF-12 in Iran. Methods A random sample of the general population aged 15 years and over living in Tehran, Iran completed the SF-12. Reliability was estimated using internal consistency and validity was assessed using known groups comparison and convergent validity. In addition, the factor structure of the questionnaire was extracted by performing both exploratory factor analysis (EFA and confirmatory factor analysis (CFA. Results: In all, 5587 individuals were studied (2721 male and 2866 female. The mean age and formal education of the respondents were 35.1 (SD = 15.4 and 10.2 (SD = 4.4 years respectively. The results showed satisfactory internal consistency for both summary measures, that are the Physical Component Summary (PCS and the Mental Component Summary (MCS; Cronbach's α for PCS-12 and MCS-12 was 0.73 and 0.72, respectively. Known-groups comparison showed that the SF-12 discriminated well between men and women and those who differed in age and educational status (P Conclusion In general the findings suggest that the SF-12 is a reliable and valid measure of health related quality of life among Iranian population. However, further studies are needed to establish stronger psychometric properties for this alternative form of the SF-36 Health Survey in Iran.
A Bifactor Multidimensional Item Response Theory Model for Differential Item Functioning Analysis on Testlet-Based Items

Science.gov (United States)

Fukuhara, Hirotaka; Kamata, Akihito

2011-01-01

A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into…
Gender-Based Differential Item Performance in Mathematics Achievement Items.

Science.gov (United States)

Doolittle, Allen E.; Cleary, T. Anne

1987-01-01

Eight randomly equivalent samples of high school seniors were each given a unique form of the ACT Assessment Mathematics Usage Test (ACTM). Signed measures of differential item performance (DIP) were obtained for each item in the eight ACTM forms. DIP estimates were analyzed and a significant item category effect was found. (Author/LMO)
Evaluation of psychometric properties and differential item functioning of 8-item Child Perceptions Questionnaires using item response theory.

Science.gov (United States)

Yau, David T W; Wong, May C M; Lam, K F; McGrath, Colman

2015-08-19

Four-factor structure of the two 8-item short forms of Child Perceptions Questionnaire CPQ11-14 (RSF:8 and ISF:8) has been confirmed. However, the sum scores are typically reported in practice as a proxy of Oral health-related Quality of Life (OHRQoL), which implied a unidimensional structure. This study first assessed the unidimensionality of 8-item short forms of CPQ11-14. Item response theory (IRT) was employed to offer an alternative and complementary approach of validation and to overcome the limitations of classical test theory assumptions. A random sample of 649 12-year-old school children in Hong Kong was analyzed. Unidimensionality of the scale was tested by confirmatory factor analysis (CFA), principle component analysis (PCA) and local dependency (LD) statistic. Graded response model was fitted to the data. Contribution of each item to the scale was assessed by item information function (IIF). Reliability of the scale was assessed by test information function (TIF). Differential item functioning (DIF) across gender was identified by Wald test and expected score functions. Both CPQ11-14 RSF:8 and ISF:8 did not deviate much from the unidimensionality assumption. Results from CFA indicated acceptable fit of the one-factor model. PCA indicated that the first principle component explained >30 % of the total variation with high factor loadings for both RSF:8 and ISF:8. Almost all LD statistic items suggesting little contribution of information to the scale and item removal caused little practical impact. Comparing the TIFs, RSF:8 showed slightly better information than ISF:8. In addition to oral symptoms items, the item "Concerned with what other people think" demonstrated a uniform DIF (p Items related to oral symptoms were not informative to OHRQoL and deletion of these items is suggested. The impact of DIF across gender on the overall score was minimal. CPQ11-14 RSF:8 performed slightly better than ISF:8 in measurement precision. The 6-item short forms
Reliability of a computer and Internet survey (Computer User Profile) used by adults with and without traumatic brain injury (TBI).

Science.gov (United States)

Kilov, Andrea M; Togher, Leanne; Power, Emma

2015-01-01

To determine test-re-test reliability of the 'Computer User Profile' (CUP) in people with and without TBI. The CUP was administered on two occasions to people with and without TBI. The CUP investigated the nature and frequency of participants' computer and Internet use. Intra-class correlation coefficients and kappa coefficients were conducted to measure reliability of individual CUP items. Descriptive statistics were used to summarize content of responses. Sixteen adults with TBI and 40 adults without TBI were included in the study. All participants were reliable in reporting demographic information, frequency of social communication and leisure activities and computer/Internet habits and usage. Adults with TBI were reliable in 77% of their responses to survey items. Adults without TBI were reliable in 88% of their responses to survey items. The CUP was practical and valuable in capturing information about social, leisure, communication and computer/Internet habits of people with and without TBI. Adults without TBI scored more items with satisfactory reliability overall in their surveys. Future studies may include larger samples and could also include an exploration of how people with/without TBI use other digital communication technologies. This may provide further information on determining technology readiness for people with TBI in therapy programmes.
Binary classification of items of interest in a repeatable process

Science.gov (United States)

Abell, Jeffrey A.; Spicer, John Patrick; Wincek, Michael Anthony; Wang, Hui; Chakraborty, Debejyo

2014-06-24

A system includes host and learning machines in electrical communication with sensors positioned with respect to an item of interest, e.g., a weld, and memory. The host executes instructions from memory to predict a binary quality status of the item. The learning machine receives signals from the sensor(s), identifies candidate features, and extracts features from the candidates that are more predictive of the binary quality status relative to other candidate features. The learning machine maps the extracted features to a dimensional space that includes most of the items from a passing binary class and excludes all or most of the items from a failing binary class. The host also compares the received signals for a subsequent item of interest to the dimensional space to thereby predict, in real time, the binary quality status of the subsequent item of interest.
Validation of a short qualitative food frequency list used in several German large scale surveys.

Science.gov (United States)

Winkler, G; Döring, A

1998-09-01

Our study aimed to test the validity of a short, qualitative food frequency list (FFL) used in several German large scale surveys. In the surveys of the MONICA project Augsburg, the FFL was used in randomly selected adults. In 1984/85, a dietary survey with 7-day records (DR) was conducted within the subsample of men aged 45 to 64 (response 70%). The 899 DR were used to validate the FFL. Mean weekly food intake frequency and mean daily food intake were compared and Spearman rank order correlation coefficients and classification into tertiles with values of the statistic Kappa were calculated. Spearman correlations range between 0.15 for the item "Other sweets (candies, compote)" and 0.60 for the items "Curds, yoghurt, sour milk", "Milk including butter milk" and "Mineral water"; values for statistic Kappa vary between 0.04 ("White bread, brown bread, crispbread") and 0.41 ("Flaked oats, muesli, cornflakes" and "milk including butter milk"). With the exception of two items, FFL data can be used for analysis on group level. Analysis on individual level should be done with caution. It seems, as if some food groups are generally easier to ask for in FFL than others.
Clinically important deterioration in patients undergoing lumbar spine surgery: a choice of evaluation methods using the Oswestry Disability Index, 36-Item Short Form Health Survey, and pain scales: clinical article.

Science.gov (United States)

Gum, Jeffrey L; Glassman, Steven D; Carreon, Leah Y

2013-11-01

Health-related quality of life (HRQOL) measures have become the mainstay for outcome appraisal in spine surgery. Clinically meaningful interpretation of HRQOL improvement has centered on the minimum clinically important difference (MCID). The purpose of this study was to calculate clinically important deterioration (CIDET) thresholds and determine a CIDET value for each HRQOL measure for patients undergoing lumbar fusion. Seven hundred twenty-two patients (248 males, 127 smokers, mean age 60.8 years) were identified with complete preoperative and 1-year postoperative HRQOLs including the Oswestry Disability Index (ODI), 36-Item Short Form Health Survey (SF-36), and numeric rating scales (0-10) for back and leg pain following primary, instrumented, posterior lumbar fusion. Anchor-based and distribution-based methods were used to calculate CIDET for each HRQOL. Anchor-based methods included change score, change difference, and receiver operating characteristic curve analysis. The Health Transition Item, an independent item of the SF-36, was used as the external anchor. Patients who responded "somewhat worse" and "much worse" were combined and compared with patients responding "about the same." Distribution-based methods were minimum detectable change and effect size. Diagnoses included spondylolisthesis (n = 332), scoliosis (n = 54), instability (n = 37), disc pathology (n = 146), and stenosis (n = 153). There was a statistically significant change (p < 0.0001) for each HRQOL measure from preoperatively to 1-year postoperatively. Only 107 patients (15%) reported being "somewhat worse" (n = 81) or "much worse" (n = 26). Calculation methods yielded a range of CIDET values for ODI (0.17-9.06), SF-36 physical component summary (-0.32 to 4.43), back pain (0.02-1.50), and leg pain (0.02-1.50). A threshold for clinical deterioration was difficult to identify. This may be due to the small number of patients reporting being worse after surgery and the variability across
Transgender-inclusive measures of sex/gender for population surveys: Mixed-methods evaluation and recommendations.

Science.gov (United States)

Bauer, Greta R; Braimoh, Jessica; Scheim, Ayden I; Dharma, Christoffer

2017-01-01

Given that an estimated 0.6% of the U.S. population is transgender (trans) and that large health disparities for this population have been documented, government and research organizations are increasingly expanding measures of sex/gender to be trans inclusive. Options suggested for trans community surveys, such as expansive check-all-that-apply gender identity lists and write-in options that offer maximum flexibility, are generally not appropriate for broad population surveys. These require limited questions and a small number of categories for analysis. Limited evaluation has been undertaken of trans-inclusive population survey measures for sex/gender, including those currently in use. Using an internet survey and follow-up of 311 participants, and cognitive interviews from a maximum-diversity sub-sample (n = 79), we conducted a mixed-methods evaluation of two existing measures: a two-step question developed in the United States and a multidimensional measure developed in Canada. We found very low levels of item missingness, and no indicators of confusion on the part of cisgender (non-trans) participants for both measures. However, a majority of interview participants indicated problems with each question item set. Agreement between the two measures in assessment of gender identity was very high (K = 0.9081), but gender identity was a poor proxy for other dimensions of sex or gender among trans participants. Issues to inform measure development or adaptation that emerged from analysis included dimensions of sex/gender measured, whether non-binary identities were trans, Indigenous and cultural identities, proxy reporting, temporality concerns, and the inability of a single item to provide a valid measure of sex/gender. Based on this evaluation, we recommend that population surveys meant for multi-purpose analysis consider a new Multidimensional Sex/Gender Measure for testing that includes three simple items (one asked only of a small sub-group) to assess gender
Transgender-inclusive measures of sex/gender for population surveys: Mixed-methods evaluation and recommendations.

Directory of Open Access Journals (Sweden)

Greta R Bauer

Full Text Available Given that an estimated 0.6% of the U.S. population is transgender (trans and that large health disparities for this population have been documented, government and research organizations are increasingly expanding measures of sex/gender to be trans inclusive. Options suggested for trans community surveys, such as expansive check-all-that-apply gender identity lists and write-in options that offer maximum flexibility, are generally not appropriate for broad population surveys. These require limited questions and a small number of categories for analysis. Limited evaluation has been undertaken of trans-inclusive population survey measures for sex/gender, including those currently in use. Using an internet survey and follow-up of 311 participants, and cognitive interviews from a maximum-diversity sub-sample (n = 79, we conducted a mixed-methods evaluation of two existing measures: a two-step question developed in the United States and a multidimensional measure developed in Canada. We found very low levels of item missingness, and no indicators of confusion on the part of cisgender (non-trans participants for both measures. However, a majority of interview participants indicated problems with each question item set. Agreement between the two measures in assessment of gender identity was very high (K = 0.9081, but gender identity was a poor proxy for other dimensions of sex or gender among trans participants. Issues to inform measure development or adaptation that emerged from analysis included dimensions of sex/gender measured, whether non-binary identities were trans, Indigenous and cultural identities, proxy reporting, temporality concerns, and the inability of a single item to provide a valid measure of sex/gender. Based on this evaluation, we recommend that population surveys meant for multi-purpose analysis consider a new Multidimensional Sex/Gender Measure for testing that includes three simple items (one asked only of a small sub-group to
Transgender-inclusive measures of sex/gender for population surveys: Mixed-methods evaluation and recommendations

Science.gov (United States)

Bauer, Greta R.; Braimoh, Jessica; Scheim, Ayden I.; Dharma, Christoffer

2017-01-01

Given that an estimated 0.6% of the U.S. population is transgender (trans) and that large health disparities for this population have been documented, government and research organizations are increasingly expanding measures of sex/gender to be trans inclusive. Options suggested for trans community surveys, such as expansive check-all-that-apply gender identity lists and write-in options that offer maximum flexibility, are generally not appropriate for broad population surveys. These require limited questions and a small number of categories for analysis. Limited evaluation has been undertaken of trans-inclusive population survey measures for sex/gender, including those currently in use. Using an internet survey and follow-up of 311 participants, and cognitive interviews from a maximum-diversity sub-sample (n = 79), we conducted a mixed-methods evaluation of two existing measures: a two-step question developed in the United States and a multidimensional measure developed in Canada. We found very low levels of item missingness, and no indicators of confusion on the part of cisgender (non-trans) participants for both measures. However, a majority of interview participants indicated problems with each question item set. Agreement between the two measures in assessment of gender identity was very high (K = 0.9081), but gender identity was a poor proxy for other dimensions of sex or gender among trans participants. Issues to inform measure development or adaptation that emerged from analysis included dimensions of sex/gender measured, whether non-binary identities were trans, Indigenous and cultural identities, proxy reporting, temporality concerns, and the inability of a single item to provide a valid measure of sex/gender. Based on this evaluation, we recommend that population surveys meant for multi-purpose analysis consider a new Multidimensional Sex/Gender Measure for testing that includes three simple items (one asked only of a small sub-group) to assess gender
CTTITEM: SAS macro and SPSS syntax for classical item analysis.

Science.gov (United States)

Lei, Pui-Wa; Wu, Qiong

2007-08-01

This article describes the functions of a SAS macro and an SPSS syntax that produce common statistics for conventional item analysis including Cronbach's alpha, item difficulty index (p-value or item mean), and item discrimination indices (D-index, point biserial and biserial correlations for dichotomous items and item-total correlation for polytomous items). These programs represent an improvement over the existing SAS and SPSS item analysis routines in terms of completeness and user-friendliness. To promote routine evaluations of item qualities in instrument development of any scale, the programs are available at no charge for interested users. The program codes along with a brief user's manual that contains instructions and examples are downloadable from suen.ed.psu.edu/-pwlei/plei.htm.
Suspect/Counterfeit Items Information Guide for Subcontractors/Suppliers

Energy Technology Data Exchange (ETDEWEB)

Tessmar, Nancy D. [Los Alamos National Laboratory; Salazar, Michael J. [Los Alamos National Laboratory

2012-09-18

Counterfeiting of industrial and commercial grade items is an international problem that places worker safety, program objectives, expensive equipment, and security at risk. In order to prevent the introduction of Suspect/Counterfeit Items (S/CI), this information sheet is being made available as a guide to assist in the implementation of S/CI awareness and controls, in conjunction with subcontractor's/supplier's quality assurance programs. When it comes to counterfeit goods, including industrial materials, items, and equipment, no market is immune. Some manufactures have been known to misrepresent their products and intentionally use inferior materials and processes to manufacture substandard items, whose properties can significantly cart from established standards and specifications. These substandard items termed by the Department of Energy (DOE) as S/CI, pose immediate and potential threats to the safety of DOE and contractor workers, the public, and the environment. Failure of certain systems and processes caused by an S/CI could also have national security implications at Los Alamos National Laboratory (LANL). Nuclear Safety Rules (federal Laws), DOE Orders, and other regulations set forth requirements for DOE contractors to implement effective controls to assure that items and services meet specified requirements. This includes techniques to implement and thereby minimizing the potential threat of entry of S/CI to LANL. As a qualified supplier of goods or services to the LANL, your company will be required to establish and maintain effective controls to prevent the introduction of S/CI to LANL. This will require that your company warrant that all items (including their subassemblies, components, and parts) sold to LANL are genuine (i.e. not counterfeit), new, and unused, and conform to the requirements of the LANL purchase orders/contracts unless otherwise approved in writing to the Los Alamos National Security (LANS) contract administrator
‘Forget me (not?’ – Remembering forget-items versus un-cued items in directed forgetting

Directory of Open Access Journals (Sweden)

Bastian eZwissler

2015-11-01

Full Text Available Humans need to be able to selectively control their memories. Here, we investigate the underlying processes in item-method directed forgetting and compare the classic active memory cues in this paradigm with a passive instruction. Typically, individual items are presented and each is followed by either a forget- or remember-instruction. On a surprise test of all items, memory is then worse for to-be-forgotten items (TBF compared to to-be-remembered items (TBR. This is thought to result from selective rehearsal of TBR, or from active inhibition of TBF, or from both. However, evidence suggests that if a forget instruction initiates active processing, paradoxical effects may also arise. To investigate the underlying mechanisms, four experiments were conducted where un-cued items (UI were introduced and recognition performance was compared between TBR, TBF and UI stimuli. Accuracy was encouraged via a performance-dependent monetary bonus. Across all experiments, including perceptually fully matched variants, memory accuracy for TBF was reduced compared to TBR, but better than for UI. Moreover, participants used a more conservative response criterion when responding to TBF stimuli. Thus, ironically, the F cue results in active processing, but this does not have inhibitory effects that would impair recognition memory beyond a un-cued baseline condition. This casts doubts on inhibitory accounts of item-method directed forgetting and is also difficult to reconcile with pure selective rehearsal of TBR. While the F-cue does induce active processing, this does not result in particularly successful forgetting. The pattern seems most consistent with the notion of ironic processing.
The anticipated costs analysis and benefit items survey against performing the maintenance rule

International Nuclear Information System (INIS)

Hwang, M. J.; Kim, K. Y.; Yang, Z. A.

2002-01-01

In this paper, we surveyed the cost and benefit items and evaluated the costs against performing the Maintenance Rule. In the past, only one electric power company had provided the electricity without free competition in Korea. In these days, however, the electric power company was divided into two parts by the sources: atomic and hydraulic generation and thermal-power generation. Therefore, the generation sources that done have competitiveness at the price will be weeded out in the electric power market. Although the preferential goal is on the safe operation at the Nuclear power Plants (NPPs), if too much money is required to maintain or improve the safety of the NPP, the licensee could hesitate to adopt the program related to the safety even though it is a good one. Since the Risk-Informed Applications (RIA) have been using for a plant operation in recent, the condition of a plant might be changed. Therefore, considering the affects of the RIA, a method to keep the capability through the monitoring the maintenance effectiveness has been proposed. However, to perform this, a number of works, continuous collecting data and monitoring the maintenance effectiveness and understanding the reason of degrading capability, should be preceded. Therefore, a lot of man-hour is needed to develop and to manage the application method, and the licensee should pay the costs. Therefore, in the domestic circumstance, it is necessary to evaluate the cost to monitor the maintenance effectiveness. Hence, we are going to examine the cost to perform the MR and its anticipated benefit lists
Psychometric Properties of the Kidney Disease Quality of Life 36-Item Short-Form Survey (KDQOL-36) in the United States.

Science.gov (United States)

Peipert, John D; Bentler, Peter M; Klicko, Kristi; Hays, Ron D

2018-04-01

The Centers for Medicare & Medicaid Services require that dialysis patients' health-related quality of life be assessed annually. The primary instrument used for this purpose is the Kidney Disease Quality of Life 36-Item Short-Form Survey (KDQOL-36), which includes the SF-12 as its generic core and 3 kidney disease-targeted scales: Burden of Kidney Disease, Symptoms and Problems of Kidney Disease, and Effects of Kidney Disease. Despite its broad use, there has been limited evaluation of KDQOL-36's psychometric properties. Secondary analyses of data collected by the Medical Education Institute to evaluate the reliability and factor structure of the KDQOL-36 scales. KDQOL-36 responses from 70,786 dialysis patients in 1,381 US dialysis facilities that permitted data analysis were collected from June 1, 2015, through May 31, 2016, as part of routine clinical assessment. We assessed the KDQOL-36 scales' internal consistency reliability and dialysis facility-level reliability using coefficient alpha and 1-way analysis of variance. We evaluated the KDQOL-36's factor structure using item-to-total scale correlations and confirmatory factor analysis. Construct validity was examined using correlations between SF-12 and KDQOL-36 scales and "known groups" analyses. Each of the KDQOL-36's kidney disease-targeted scales had acceptable internal consistency reliability (α=0.83-0.85) and facility-level reliability (r=0.75-0.83). Item-scale correlations and a confirmatory factor analysis model evidenced the KDQOL-36's original factor structure. Construct validity was supported by large correlations between the SF-12 Physical Component Summary and Mental Component Summary (r=0.40-0.52) and the KDQOL-36 scale scores, as well as significant differences on the scale scores between patients receiving different types of dialysis, diabetic and nondiabetic patients, and patients who were employed full-time versus not. Use of secondary data from a clinical registry. The study provides
Psychometric Properties of the Heart Disease Knowledge Scale: Evidence from Item and Confirmatory Factor Analyses.

Science.gov (United States)

Lim, Bee Chiu; Kueh, Yee Cheng; Arifin, Wan Nor; Ng, Kok Huan

2016-07-01

Heart disease knowledge is an important concept for health education, yet there is lack of evidence on proper validated instruments used to measure levels of heart disease knowledge in the Malaysian context. A cross-sectional, survey design was conducted to examine the psychometric properties of the adapted English version of the Heart Disease Knowledge Questionnaire (HDKQ). Using proportionate cluster sampling, 788 undergraduate students at Universiti Sains Malaysia, Malaysia, were recruited and completed the HDKQ. Item analysis and confirmatory factor analysis (CFA) were used for the psychometric evaluation. Construct validity of the measurement model was included. Most of the students were Malay (48%), female (71%), and from the field of science (51%). An acceptable range was obtained with respect to both the difficulty and discrimination indices in the item analysis results. The difficulty index ranged from 0.12-0.91 and a discrimination index of ≥ 0.20 were reported for the final retained 23 items. The final CFA model showed an adequate fit to the data, yielding a 23-item, one-factor model [weighted least squares mean and variance adjusted scaled chi-square difference = 1.22, degrees of freedom = 2, P-value = 0.544, the root mean square error of approximation = 0.03 (90% confidence interval = 0.03, 0.04); close-fit P-value = > 0.950]. Adequate psychometric values were obtained for Malaysian undergraduate university students using the 23-item, one-factor model of the adapted HDKQ.
Nurse Religiosity and Spiritual Care: An Online Survey.

Science.gov (United States)

Taylor, Elizabeth Johnston; Gober-Park, Carla; Schoonover-Shoffner, Kathy; Mamier, Iris; Somaiya, Chintan K; Bahjri, Khaled

2017-08-01

This study measured the frequency of nurse-provided spiritual care and how it is associated with various facets of nurse religiosity. Data were collected using an online survey accessed from the home page of the Journal of Christian Nursing. The survey included the Nurse Spiritual Care Therapeutics Scale, six scales quantifying facets of religiosity, and demographic and work-related items. Respondents ( N = 358) indicated high religiosity yet reported neutral responses to items about sharing personal beliefs and tentativeness of belief. Findings suggested spiritual care was infrequent. Multivariate analysis showed prayer frequency, employer support of spiritual care, and non-White ethnicity were significantly associated with spiritual care frequency (adjusted R 2 = .10). Results not only provide an indication of spiritual care frequency but empirical encouragement for nurse managers to provide a supportive environment for spiritual care. Findings expose the reality that nurse religiosity is directly related, albeit weakly, to spiritual care frequency.
Development and psychometric testing of the childhood obesity perceptions (COP) survey among African American caregivers: A tool for obesity prevention program planning.

Science.gov (United States)

Alexander, Dayna S; Alfonso, Moya L; Cao, Chunhua

2016-12-01

Currently, public health practitioners are analyzing the role that caregivers play in childhood obesity efforts. Assessing African American caregiver's perceptions of childhood obesity in rural communities is an important prevention effort. This article's objective is to describe the development and psychometric testing of a survey tool to assess childhood obesity perceptions among African American caregivers in a rural setting, which can be used for obesity prevention program development or evaluation. The Childhood Obesity Perceptions (COP) survey was developed to reflect the multidimensional nature of childhood obesity including risk factors, health complications, weight status, built environment, and obesity prevention strategies. A 97-item survey was pretested and piloted with the priority population. After pretesting and piloting, the survey was reduced to 59-items and administered to 135 African American caregivers. An exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) was conducted to test how well the survey items represented the number of Social Cognitive Theory constructs. Twenty items were removed from the original 59-item survey and acceptable internal consistency of the six factors (α=0.70-0.85) was documented for all scales in the final COP instrument. CFA resulted in a less than adequate fit; however, a multivariate Lagrange multiplier test identified modifications to improve the model fit. The COP survey represents a promising approach as a potentially comprehensive assessment for implementation or evaluation of childhood obesity programs. Copyright © 2016 Elsevier Ltd. All rights reserved.

Development of a survey tool to assess and monitor the influence of food budget restraint on healthy eating, food related climate impact and quality of life

DEFF Research Database (Denmark)

Nielsen, Annemette Ljungdalh; Holm, Lotte; Lund, Thomas Bøker

This documentation describes the development of a survey tool designed to: 1) measure how different levels of constraints on food budgets are associated to outcomes of healthy eating, environmental sustainability and life quality for individuals in Denmark, and 2) explore how these different...... outcomes are related to strategies people employ to cope with restricted food budgets. The resulting survey consists of a total of 63 question items. The paper lays out the various steps involved in the process of developing the survey tool, presents the final survey items included in the tool...
Item response theory, computerized adaptive testing, and PROMIS: assessment of physical function.

Science.gov (United States)

Fries, James F; Witter, James; Rose, Matthias; Cella, David; Khanna, Dinesh; Morgan-DeWitt, Esi

2014-01-01

Patient-reported outcome (PRO) questionnaires record health information directly from research participants because observers may not accurately represent the patient perspective. Patient-reported Outcomes Measurement Information System (PROMIS) is a US National Institutes of Health cooperative group charged with bringing PRO to a new level of precision and standardization across diseases by item development and use of item response theory (IRT). With IRT methods, improved items are calibrated on an underlying concept to form an item bank for a "domain" such as physical function (PF). The most informative items can be combined to construct efficient "instruments" such as 10-item or 20-item PF static forms. Each item is calibrated on the basis of the probability that a given person will respond at a given level, and the ability of the item to discriminate people from one another. Tailored forms may cover any desired level of the domain being measured. Computerized adaptive testing (CAT) selects the best items to sharpen the estimate of a person's functional ability, based on prior responses to earlier questions. PROMIS item banks have been improved with experience from several thousand items, and are calibrated on over 21,000 respondents. In areas tested to date, PROMIS PF instruments are superior or equal to Health Assessment Questionnaire and Medical Outcome Study Short Form-36 Survey legacy instruments in clarity, translatability, patient importance, reliability, and sensitivity to change. Precise measures, such as PROMIS, efficiently incorporate patient self-report of health into research, potentially reducing research cost by lowering sample size requirements. The advent of routine IRT applications has the potential to transform PRO measurement.
Creating a Screening Measure of Health Literacy for the Health Information National Trends Survey.

Science.gov (United States)

Champlin, Sara; Mackert, Michael

2016-03-01

Create a screening measure of health literacy for use with the Health Information National Trends Survey (HINTS). Participants completed a paper-based survey. Items from the survey were used to construct a health literacy screening measure. A population-based survey conducted in geographic areas of high and low minority frequency and in Central Appalachia. Two thousand nine hundred four English-speaking participants were included in this study: 66% white, 93% completed high school, mean age = 52.53 years (SD = 16.24). A health literacy screening measure was created using four items included in the HINTS survey. Scores could range from 0 (no questions affirmative/correct) to 4 (all questions answered affirmatively/correctly). Multiple regression analysis was used to determine whether demographic variables known to predict health literacy were indeed associated with the constructed health literacy screening measure. The weighted average health literacy score was 2.63 (SD = 1.00). Those who were nonwhite (p = .0005), were older (p literacy screening measure scores. This study highlights the need to assess health literacy in national surveys, but also serves as evidence that screening measures can be created within existing datasets to give researchers the ability to consider the impact of health literacy. © The Author(s) 2016.
Predicting survey responses: how and why semantics shape survey statistics on organizational behaviour.

Directory of Open Access Journals (Sweden)

Jan Ketil Arnulf

Full Text Available Some disciplines in the social sciences rely heavily on collecting survey responses to detect empirical relationships among variables. We explored whether these relationships were a priori predictable from the semantic properties of the survey items, using language processing algorithms which are now available as new research methods. Language processing algorithms were used to calculate the semantic similarity among all items in state-of-the-art surveys from Organisational Behaviour research. These surveys covered areas such as transformational leadership, work motivation and work outcomes. This information was used to explain and predict the response patterns from real subjects. Semantic algorithms explained 60-86% of the variance in the response patterns and allowed remarkably precise prediction of survey responses from humans, except in a personality test. Even the relationships between independent and their purported dependent variables were accurately predicted. This raises concern about the empirical nature of data collected through some surveys if results are already given a priori through the way subjects are being asked. Survey response patterns seem heavily determined by semantics. Language algorithms may suggest these prior to administering a survey. This study suggests that semantic algorithms are becoming new tools for the social sciences, opening perspectives on survey responses that prevalent psychometric theory cannot explain.
Validation of a 15-item care-related regret coping scale for health-care professionals (RCS-HCP).

Science.gov (United States)

Courvoisier, Delphine Sophie; Cullati, Stephane; Ouchi, Rieko; Schmidt, Ralph Eric; Haller, Guy; Chopard, Pierre; Agoritsas, Thomas; Perneger, Thomas V

2014-01-01

Coping with difficult care-related situations is a common challenge for health-care professionals. How these professionals deal with the regrets they may experience following one of the many decisions and interventions they must make every day can have an impact on their own health and quality of life, and also on their patient care practices. To identify professionals most at need for extra support, development and validation of a tool measuring coping style are needed. We performed a survey of physicians and nurses of a French-speaking University hospital; 469 health-care professionals responded to the survey, and 175 responded to the same survey one-month later. Regret was assessed with the regret coping scale developed for this study, self-report questions on the frequency of regretted situations and the intensity of regret. Construct validity was assessed using measures of health-care professionals' quality of life (including job and life satisfaction, and self-reported health) as well as sleep problems and depression. Based on factor analysis and item response analysis, the initial 31-item scale was shortened to 15 items, which measured three types of strategies: problem-focused strategies (i.e., trying to find solutions, talking to colleagues) and two types of emotion-focused strategies, A (i.e., self-blame, rumination) and B (e.g., acceptance, emotional distance). All subscales showed high internal consistency (α >0.85). Overall, as expected, problem-focused and emotion-focused B strategies correlated with higher quality of life, fewer sleep problems and less depression, and emotion-focused A strategies showed the opposite pattern. The regret coping scale (RCS-HCP) is a valid and reliable measure of coping abilities of hospital-based health-care professionals.
Psychometric properties of the PROMIS Physical Function item bank in patients receiving physical therapy.

Directory of Open Access Journals (Sweden)

Martine H P Crins

Full Text Available The Patient-Reported Outcomes Measurement Information System (PROMIS is a universally applicable set of instruments, including item banks, short forms and computer adaptive tests (CATs, measuring patient-reported health across different patient populations. PROMIS CATs are highly efficient and the use in practice is considered feasible with little administration time, offering standardized and routine patient monitoring. Before an item bank can be used as CAT, the psychometric properties of the item bank have to be examined. Therefore, the objective was to assess the psychometric properties of the Dutch-Flemish PROMIS Physical Function item bank (DF-PROMIS-PF in Dutch patients receiving physical therapy.Cross-sectional study.805 patients >18 years, who received any kind of physical therapy in primary care in the past year, completed the full DF-PROMIS-PF (121 items.Unidimensionality was examined by Confirmatory Factor Analysis and local dependence and monotonicity were evaluated. A Graded Response Model was fitted. Construct validity was examined with correlations between DF-PROMIS-PF T-scores and scores on two legacy instruments (SF-36 Health Survey Physical Functioning scale [SF36-PF10] and the Health Assessment Questionnaire Disability-Index [HAQ-DI]. Reliability (standard errors of theta was assessed.The results for unidimensionality were mixed (scaled CFI = 0.924, TLI = 0.923, RMSEA = 0.045, 1th factor explained 61.5% of variance. Some local dependence was found (8.2% of item pairs. The item bank showed a broad coverage of the physical function construct (threshold-parameters range: -4.28-2.33 and good construct validity (correlation with SF36-PF10 = 0.84 and HAQ-DI = -0.85. Furthermore, the DF-PROMIS-PF showed greater reliability over a broader score-range than the SF36-PF10 and HAQ-DI.The psychometric properties of the DF-PROMIS-PF item bank are sufficient. The DF-PROMIS-PF can now be used as short forms or CAT to measure the level of
An Introduction to Item Response Theory for Health Behavior Researchers

Science.gov (United States)

Warne, Russell T.; McKyer, E. J. Lisako; Smith, Matthew L.

2012-01-01

Objective: To introduce item response theory (IRT) to health behavior researchers by contrasting it with classical test theory and providing an example of IRT in health behavior. Method: Demonstrate IRT by fitting the 2PL model to substance-use survey data from the Adolescent Health Risk Behavior questionnaire (n = 1343 adolescents). Results: An…
The Case to Include Brand of Moist Snuff in Health Surveys.

Science.gov (United States)

Timberlake, David S

2016-08-01

Brand of smokeless tobacco was added to the most recent Tobacco Use Supplement to the Current Population Survey (TUS-CPS), but deleted from the Centers for Disease Control's National Adult Tobacco Survey. The objective of this study was to assess the utility of brand in distinguishing users of moist snuff. The sample consisted of participants from the 2010-2011 TUS-CPS who reported having used one of 14 brands of moist snuff in the past month (n = 2334). The brands were categorized into one of three types: snus, discount snuff, premium snuff. Multinomial logistic regression was employed for testing for associations between brand type and a series of demographic and tobacco use measures. Females, metropolitan residents, current smokers, and moderate users of snuff had significantly greater odds of using snus relative to premium snuff in the adjusted model (P discount versus premium snuff. Separate analyses among current smokers (n = 470) and former smokers (n = 70) revealed positive associations between smoking cessation attempts and smokers' switch to discount snuff. Differences among the three categories of snuff users are likely attributed to variations in marketing campaigns. The differences are sufficient to warrant inclusion of snuff brand in health surveys because brand type could serve as a proxy measure for snuff use and dependence. Inclusion of brand of moist snuff in health surveys will enable researchers to categorize snuff users by brand type. Findings from this study indicate that brand type, defined according to cost (ie, discount vs. premium brands) and type of preferred snuff (ie, snus vs. other moist snuff), can distinguish snuff users by various demographic and tobacco use measures. Consequently, categorization by brand type could be used as a proxy measure for studies whose surveys do not include detailed information on snuff use and behavior. © The Author 2016. Published by Oxford University Press on behalf of the Society for Research on
A survey of the praying mantises of Rwanda, including new records (Insecta, Mantodea).

Science.gov (United States)

Tedrow, Riley; Nathan, Kabanguka; Richard, Nasasira; Svenson, Gavin J

2015-10-01

We report the results of two surveys targeting praying mantises in four localities in Rwanda, specifically Akagera National Park, Nyungwe National Park, Volcanoes National Park, and the Arboretum de Ruhande at the National University of Rwanda. Using an assortment of collecting techniques, including metal halide light traps, sweep netting vegetation and general searching, we obtained 387 adult and 352 juvenile specimens, representing 41 species. A total of 28 novel species records for Rwanda are added to the 18 previously recorded species for the country, in addition to 20 novel species records for the broader region, including neighbouring Uganda and Burundi. This study provides high resolution images of the dorsal habitus of both sexes of representative species, both pinned and living. Species distribution records are presented and discussed. With a 155% increase in species recorded from Rwanda, this survey illustrates the need for further taxonomic work in the region.
Development and psychometric evaluation of an information literacy self-efficacy survey and an information literacy knowledge test.

Science.gov (United States)

Tepe, Rodger; Tepe, Chabha

2015-03-01

To develop and psychometrically evaluate an information literacy (IL) self-efficacy survey and an IL knowledge test. In this test-retest reliability study, a 25-item IL self-efficacy survey and a 50-item IL knowledge test were developed and administered to a convenience sample of 53 chiropractic students. Item analyses were performed on all questions. The IL self-efficacy survey demonstrated good reliability (test-retest correlation = 0.81) and good/very good internal consistency (mean κ = .56 and Cronbach's α = .92). A total of 25 questions with the best item analysis characteristics were chosen from the 50-item IL knowledge test, resulting in a 25-item IL knowledge test that demonstrated good reliability (test-retest correlation = 0.87), very good internal consistency (mean κ = .69, KR20 = 0.85), and good item discrimination (mean point-biserial = 0.48). This study resulted in the development of three instruments: a 25-item IL self-efficacy survey, a 50-item IL knowledge test, and a 25-item IL knowledge test. The information literacy self-efficacy survey and the 25-item version of the information literacy knowledge test have shown preliminary evidence of adequate reliability and validity to justify continuing study with these instruments.
Assessment of health surveys: fitting a multidimensional graded response model.

Science.gov (United States)

Depaoli, Sarah; Tiemensma, Jitske; Felt, John M

The multidimensional graded response model, an item response theory (IRT) model, can be used to improve the assessment of surveys, even when sample sizes are restricted. Typically, health-based survey development utilizes classical statistical techniques (e.g. reliability and factor analysis). In a review of four prominent journals within the field of Health Psychology, we found that IRT-based models were used in less than 10% of the studies examining scale development or assessment. However, implementing IRT-based methods can provide more details about individual survey items, which is useful when determining the final item content of surveys. An example using a quality of life survey for Cushing's syndrome (CushingQoL) highlights the main components for implementing the multidimensional graded response model. Patients with Cushing's syndrome (n = 397) completed the CushingQoL. Results from the multidimensional graded response model supported a 2-subscale scoring process for the survey. All items were deemed as worthy contributors to the survey. The graded response model can accommodate unidimensional or multidimensional scales, be used with relatively lower sample sizes, and is implemented in free software (example code provided in online Appendix). Use of this model can help to improve the quality of health-based scales being developed within the Health Sciences.
An Investigation of Item Type in a Standards-Based Assessment.

Directory of Open Access Journals (Sweden)

Liz Hollingworth

2007-12-01

Full Text Available Large-scale state assessment programs use both multiple-choice and open-ended items on tests for accountability purposes. Certainly, there is an intuitive belief among some educators and policy makers that open-ended items measure something different than multiple-choice items. This study examined two item formats in custom-built, standards-based tests of achievement in Reading and Mathematics at grades 3-8. In this paper, we raise questions about the value of including open-ended items, given scoring costs, time constraints, and the higher probability of missing data from test-takers.
Single-Item Measurement of Suicidal Behaviors: Validity and Consequences of Misclassification.

Directory of Open Access Journals (Sweden)

Alexander J Millner

Full Text Available Suicide is a leading cause of death worldwide. Although research has made strides in better defining suicidal behaviors, there has been less focus on accurate measurement. Currently, the widespread use of self-report, single-item questions to assess suicide ideation, plans and attempts may contribute to measurement problems and misclassification. We examined the validity of single-item measurement and the potential for statistical errors. Over 1,500 participants completed an online survey containing single-item questions regarding a history of suicidal behaviors, followed by questions with more precise language, multiple response options and narrative responses to examine the validity of single-item questions. We also conducted simulations to test whether common statistical tests are robust against the degree of misclassification produced by the use of single-items. We found that 11.3% of participants that endorsed a single-item suicide attempt measure engaged in behavior that would not meet the standard definition of a suicide attempt. Similarly, 8.8% of those who endorsed a single-item measure of suicide ideation endorsed thoughts that would not meet standard definitions of suicide ideation. Statistical simulations revealed that this level of misclassification substantially decreases statistical power and increases the likelihood of false conclusions from statistical tests. Providing a wider range of response options for each item reduced the misclassification rate by approximately half. Overall, the use of single-item, self-report questions to assess the presence of suicidal behaviors leads to misclassification, increasing the likelihood of statistical decision errors. Improving the measurement of suicidal behaviors is critical to increase understanding and prevention of suicide.
Surveillance indicators for potential reduced exposure products (PREPs: developing survey items to measure awareness

Directory of Open Access Journals (Sweden)

McNeill Ann

2009-10-01

Full Text Available Abstract Background Over the past decade, tobacco companies have introduced cigarettes and smokeless tobacco products (known as Potential Reduced Exposure Products, PREPs with purportedly lower levels of some toxins than conventional cigarettes and smokeless products. It is essential that public health agencies monitor awareness, interest, use, and perceptions of these products so that their impact on population health can be detected at the earliest stages. Methods This paper reviews and critiques existing strategies for measuring awareness of PREPs from 16 published and unpublished studies. From these measures, we developed new surveillance items and subjected them to two rounds of cognitive testing, a common and accepted method for evaluating questionnaire wording. Results Our review suggests that high levels of awareness of PREPs reported in some studies are likely to be inaccurate. Two likely sources of inaccuracy in awareness measures were identified: 1 the tendency of respondents to misclassify "no additive" and "natural" cigarettes as PREPs and 2 the tendency of respondents to mistakenly report awareness as a result of confusion between PREPs brands and similarly named familiar products, for example, Eclipse chewing gum and Accord automobiles. Conclusion After evaluating new measures with cognitive interviews, we conclude that as of winter 2006, awareness of reduced exposure products among U.S. smokers was likely to be between 1% and 8%, with the higher estimates for some products occurring in test markets. Recommended measurement strategies for future surveys are presented.
Surveillance indicators for potential reduced exposure products (PREPs): developing survey items to measure awareness

Science.gov (United States)

Bogen, Karen; Biener, Lois; Garrett, Catherine A; Allen, Jane; Cummings, K Michael; Hartman, Anne; Marcus, Stephen; McNeill, Ann; O'Connor, Richard J; Parascandola, Mark; Pederson, Linda

2009-01-01

Background Over the past decade, tobacco companies have introduced cigarettes and smokeless tobacco products (known as Potential Reduced Exposure Products, PREPs) with purportedly lower levels of some toxins than conventional cigarettes and smokeless products. It is essential that public health agencies monitor awareness, interest, use, and perceptions of these products so that their impact on population health can be detected at the earliest stages. Methods This paper reviews and critiques existing strategies for measuring awareness of PREPs from 16 published and unpublished studies. From these measures, we developed new surveillance items and subjected them to two rounds of cognitive testing, a common and accepted method for evaluating questionnaire wording. Results Our review suggests that high levels of awareness of PREPs reported in some studies are likely to be inaccurate. Two likely sources of inaccuracy in awareness measures were identified: 1) the tendency of respondents to misclassify "no additive" and "natural" cigarettes as PREPs and 2) the tendency of respondents to mistakenly report awareness as a result of confusion between PREPs brands and similarly named familiar products, for example, Eclipse chewing gum and Accord automobiles. Conclusion After evaluating new measures with cognitive interviews, we conclude that as of winter 2006, awareness of reduced exposure products among U.S. smokers was likely to be between 1% and 8%, with the higher estimates for some products occurring in test markets. Recommended measurement strategies for future surveys are presented. PMID:19840394
Memory for Items and Relationships among Items Embedded in Realistic Scenes: Disproportionate Relational Memory Impairments in Amnesia

Science.gov (United States)

Hannula, Deborah E.; Tranel, Daniel; Allen, John S.; Kirchhoff, Brenda A.; Nickel, Allison E.; Cohen, Neal J.

2014-01-01

Objective The objective of this study was to examine the dependence of item memory and relational memory on medial temporal lobe (MTL) structures. Patients with amnesia, who either had extensive MTL damage or damage that was relatively restricted to the hippocampus, were tested, as was a matched comparison group. Disproportionate relational memory impairments were predicted for both patient groups, and those with extensive MTL damage were also expected to have impaired item memory. Method Participants studied scenes, and were tested with interleaved two-alternative forced-choice probe trials. Probe trials were either presented immediately after the corresponding study trial (lag 1), five trials later (lag 5), or nine trials later (lag 9) and consisted of the studied scene along with a manipulated version of that scene in which one item was replaced with a different exemplar (item memory test) or was moved to a new location (relational memory test). Participants were to identify the exact match of the studied scene. Results As predicted, patients were disproportionately impaired on the test of relational memory. Item memory performance was marginally poorer among patients with extensive MTL damage, but both groups were impaired relative to matched comparison participants. Impaired performance was evident at all lags, including the shortest possible lag (lag 1). Conclusions The results are consistent with the proposed role of the hippocampus in relational memory binding and representation, even at short delays, and suggest that the hippocampus may also contribute to successful item memory when items are embedded in complex scenes. PMID:25068665
Evaluation of item candidates for a diabetic retinopathy quality of life item bank.

Science.gov (United States)

Fenwick, Eva K; Pesudovs, Konrad; Khadka, Jyoti; Rees, Gwyn; Wong, Tien Y; Lamoureux, Ecosse L

2013-09-01

We are developing an item bank assessing the impact of diabetic retinopathy (DR) on quality of life (QoL) using a rigorous multi-staged process combining qualitative and quantitative methods. We describe here the first two qualitative phases: content development and item evaluation. After a comprehensive literature review, items were generated from four sources: (1) 34 previously validated patient-reported outcome measures; (2) five published qualitative articles; (3) eight focus groups and 18 semi-structured interviews with 57 DR patients; and (4) seven semi-structured interviews with diabetes or ophthalmic experts. Items were then evaluated during 3 stages, namely binning (grouping) and winnowing (reduction) based on key criteria and panel consensus; development of item stems and response options; and pre-testing of items via cognitive interviews with patients. The content development phase yielded 1,165 unique items across 7 QoL domains. After 3 sessions of binning and winnowing, items were reduced to a minimally representative set (n = 312) across 9 domains of QoL: visual symptoms; ocular surface symptoms; activity limitation; mobility; emotional; health concerns; social; convenience; and economic. After 8 cognitive interviews, 42 items were amended resulting in a final set of 314 items. We have employed a systematic approach to develop items for a DR-specific QoL item bank. The psychometric properties of the nine QoL subscales will be assessed using Rasch analysis. The resulting validated item bank will allow clinicians and researchers to better understand the QoL impact of DR and DR therapies from the patient's perspective.
The role of attention in item-item binding in visual working memory.

Science.gov (United States)

Peterson, Dwight J; Naveh-Benjamin, Moshe

2017-09-01

An important yet unresolved question regarding visual working memory (VWM) relates to whether or not binding processes within VWM require additional attentional resources compared with processing solely the individual components comprising these bindings. Previous findings indicate that binding of surface features (e.g., colored shapes) within VWM is not demanding of resources beyond what is required for single features. However, it is possible that other types of binding, such as the binding of complex, distinct items (e.g., faces and scenes), in VWM may require additional resources. In 3 experiments, we examined VWM item-item binding performance under no load, articulatory suppression, and backward counting using a modified change detection task. Binding performance declined to a greater extent than single-item performance under higher compared with lower levels of concurrent load. The findings from each of these experiments indicate that processing item-item bindings within VWM requires a greater amount of attentional resources compared with single items. These findings also highlight an important distinction between the role of attention in item-item binding within VWM and previous studies of long-term memory (LTM) where declines in single-item and binding test performance are similar under divided attention. The current findings provide novel evidence that the specific type of binding is an important determining factor regarding whether or not VWM binding processes require attention. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Survey of drug use among young people in Ife, Nigeria | Afolabi ...

African Journals Online (AJOL)

Relevant data were obtained using a modified version of a questionnaire designed by the United Nations for conducting school surveys on drug abuse. The toolkit had been previously validated in Nigeria. The questionnaire items solicited information on students' drug use practices including the types of drugs, sources, ...
The computation of equating errors in international surveys in education.

Science.gov (United States)

Monseur, Christian; Berezner, Alla

2007-01-01

Since the IEA's Third International Mathematics and Science Study, one of the major objectives of international surveys in education has been to report trends in achievement. The names of the two current IEA surveys reflect this growing interest: Trends in International Mathematics and Science Study (TIMSS) and Progress in International Reading Literacy Study (PIRLS). Similarly a central concern of the OECD's PISA is with trends in outcomes over time. To facilitate trend analyses these studies link their tests using common item equating in conjunction with item response modelling methods. IEA and PISA policies differ in terms of reporting the error associated with trends. In IEA surveys, the standard errors of the trend estimates do not include the uncertainty associated with the linking step while PISA does include a linking error component in the standard errors of trend estimates. In other words, PISA implicitly acknowledges that trend estimates partly depend on the selected common items, while the IEA's surveys do not recognise this source of error. Failing to recognise the linking error leads to an underestimation of the standard errors and thus increases the Type I error rate, thereby resulting in reporting of significant changes in achievement when in fact these are not significant. The growing interest of policy makers in trend indicators and the impact of the evaluation of educational reforms appear to be incompatible with such underestimation. However, the procedure implemented by PISA raises a few issues about the underlying assumptions for the computation of the equating error. After a brief introduction, this paper will describe the procedure PISA implemented to compute the linking error. The underlying assumptions of this procedure will then be discussed. Finally an alternative method based on replication techniques will be presented, based on a simulation study and then applied to the PISA 2000 data.

The effect of sociodemographic (mis)match between interviewers and respondents on unit and item nonresponse in Belgium.

Science.gov (United States)

Vercruyssen, Anina; Wuyts, Celine; Loosveldt, Geert

2017-09-01

Interviewer characteristics affect nonresponse and measurement errors in face-to-face surveys. Some studies have shown that mismatched sociodemographic characteristics - for example gender - affect people's behavior when interacting with an interviewer at the door and during the survey interview, resulting in more nonresponse. We investigate the effect of sociodemographic (mis)matching on nonresponse in two successive rounds of the European Social Survey in Belgium. As such, we replicate the analyses of the effect of (mis)matching gender and age on unit nonresponse on the one hand, and of gender, age and education level (mis)matching on item nonresponse on the other hand. Recurring effects of sociodemographic (mis)match are found for both unit and item nonresponse. Copyright © 2017 Elsevier Inc. All rights reserved.
Item Response Theory in the context of Improving Student Reasoning

Science.gov (United States)

Goddard, Chase; Davis, Jeremy; Pyper, Brian

2011-10-01

We are interested to see if Item Response Theory can help to better inform the development of reasoning ability in introductory physics. A first pass through our latest batch of data from the Heat and Temperature Conceptual Evaluation, the Lawson Classroom Test of Scientific Reasoning, and the Epistemological Beliefs About Physics Survey may help in this effort.
The variety, popularity and nutritional quality of tuck shop items ...

African Journals Online (AJOL)

Method: A cross-sectional tuck shop survey. Nutritional analyses were conducted using the ... Results: Savoury pies were the most popular lunch item for all learners for both breaks (n = 5, 45%, and n = 3, 27.3%), selling the most number of units (43) per day at eight schools (72.7%). Iced popsicles were sold at almost every ...
Three controversies over item disclosure in medical licensure examinations

Directory of Open Access Journals (Sweden)

Yoon Soo Park

2015-09-01

Full Text Available In response to views on public's right to know, there is growing attention to item disclosure – release of items, answer keys, and performance data to the public – in medical licensure examinations and their potential impact on the test's ability to measure competence and select qualified candidates. Recent debates on this issue have sparked legislative action internationally, including South Korea, with prior discussions among North American countries dating over three decades. The purpose of this study is to identify and analyze three issues associated with item disclosure in medical licensure examinations – 1 fairness and validity, 2 impact on passing levels, and 3 utility of item disclosure – by synthesizing existing literature in relation to standards in testing. Historically, the controversy over item disclosure has centered on fairness and validity. Proponents of item disclosure stress test takers’ right to know, while opponents argue from a validity perspective. Item disclosure may bias item characteristics, such as difficulty and discrimination, and has consequences on setting passing levels. To date, there has been limited research on the utility of item disclosure for large scale testing. These issues requires ongoing and careful consideration.
Trends in Sexual Orientation Missing Data Over a Decade of the California Health Interview Survey

Science.gov (United States)

Viana, Joseph; Grant, David; Cochran, Susan D.; Lee, Annie C.; Ponce, Ninez A.

2015-01-01

Objectives. We explored changes in sexual orientation question item completion in a large statewide health survey. Methods. We used 2003 to 2011 California Health Interview Survey data to investigate sexual orientation item nonresponse and sexual minority self-identification trends in a cross-sectional sample representing the noninstitutionalized California household population aged 18 to 70 years (n = 182 812 adults). Results. Asians, Hispanics, limited-English-proficient respondents, and those interviewed in non-English languages showed the greatest declines in sexual orientation item nonresponse. Asian women, regardless of English-proficiency status, had the highest odds of item nonresponse. Spanish interviews produced more nonresponse than English interviews and Asian-language interviews produced less nonresponse when we controlled for demographic factors and survey cycle. Sexual minority self-identification increased in concert with the item nonresponse decline. Conclusions. Sexual orientation nonresponse declines and the increase in sexual minority identification suggest greater acceptability of sexual orientation assessment in surveys. Item nonresponse rate convergence among races/ethnicities, language proficiency groups, and interview languages shows that sexual orientation can be measured in surveys of diverse populations. PMID:25790399
Differential item functioning of the UWES-17 in South Africa

Directory of Open Access Journals (Sweden)

Leanne Goliath-Yarde

2011-11-01

Research purpose: This study assesses the Differential Item Functioning (DIF of the Utrecht Work Engagement Scale (UWES-17 for different South African cultural groups in a South African company. Motivation for the study: Organisations are using the UWES-17 more and more in South Africa to assess work engagement. Therefore, research evidence from psychologists or assessment practitioners on its DIF across different cultural groups is necessary. Research design, approach and method: The researchers conducted a Secondary Data Analysis (SDA on the UWES-17 sample (n = 2429 that they obtained from a cross-sectional survey undertaken in a South African Information and Communication Technology (ICT sector company (n = 24 134. Quantitative item data on the UWES-17 scale enabled the authors to address the research question. Main findings: The researchers found uniform and/or non-uniform DIF on five of the vigour items, four of the dedication items and two of the absorption items. This also showed possible Differential Test Functioning (DTF on the vigour and dedication dimensions. Practical/managerial implications: Based on the DIF, the researchers suggested that organisations should not use the UWES-17 comparatively for different cultural groups or employment decisions in South Africa. Contribution/value add: The study provides evidence on DIF and possible DTF for the UWES-17. However, it also raises questions about possible interaction effects that need further investigation.
Negative affect impairs associative memory but not item memory.

Science.gov (United States)

Bisby, James A; Burgess, Neil

2013-12-17

The formation of associations between items and their context has been proposed to rely on mechanisms distinct from those supporting memory for a single item. Although emotional experiences can profoundly affect memory, our understanding of how it interacts with different aspects of memory remains unclear. We performed three experiments to examine the effects of emotion on memory for items and their associations. By presenting neutral and negative items with background contexts, Experiment 1 demonstrated that item memory was facilitated by emotional affect, whereas memory for an associated context was reduced. In Experiment 2, arousal was manipulated independently of the memoranda, by a threat of shock, whereby encoding trials occurred under conditions of threat or safety. Memory for context was equally impaired by the presence of negative affect, whether induced by threat of shock or a negative item, relative to retrieval of the context of a neutral item in safety. In Experiment 3, participants were presented with neutral and negative items as paired associates, including all combinations of neutral and negative items. The results showed both above effects: compared to a neutral item, memory for the associate of a negative item (a second item here, context in Experiments 1 and 2) is impaired, whereas retrieval of the item itself is enhanced. Our findings suggest that negative affect impairs associative memory while recognition of a negative item is enhanced. They support dual-processing models in which negative affect or stress impairs hippocampal-dependent associative memory while the storage of negative sensory/perceptual representations is spared or even strengthened.
The surveys to the companies: A tool for the improvement of degrees

Directory of Open Access Journals (Sweden)

Montserrat Cruells Cadevall

2017-03-01

Full Text Available In scientific and technical degrees, the opinion of the final employers on the given subjects is really important. For this reason, the Quality Committee (CQ of the Faculty of Chemistry of the University of Barcelona prepared a survey for chemical, engineering and pharmaceutical companies asking about the academic training required by the companies. The survey consists of nine sections including items related to laboratory operations, chemical processes, calculation methods, management systems (quality, environment and safety or general management information. In addition, at the end of each section, a question inquires the companies about the competences shown by students in the items of the section. The results were compared with that of a similar survey carried out in 2007. The scores obtained, between 2 and 3, for all the items (score: 1, not important; 2, unimportant; 3, important; 4, very important, show that companies accept the training given to our students and the competences achieved by them. However, according to their opinion, it is possible to improve this training, especially in the subjects related to management (time, information, environment, quality, safety, etc.. Therefore, surveys are a good tool for the evaluation of the training achieved in our degrees and, consequently, for improving degrees and the teaching task, according the Quality Management System implemented in the Faculty of Chemistry.
Overview of Classical Test Theory and Item Response Theory for Quantitative Assessment of Items in Developing Patient-Reported Outcome Measures

Science.gov (United States)

Cappelleri, Joseph C.; Lundy, J. Jason; Hays, Ron D.

2014-01-01

Introduction The U.S. Food and Drug Administration’s patient-reported outcome (PRO) guidance document defines content validity as “the extent to which the instrument measures the concept of interest” (FDA, 2009, p. 12). “Construct validity is now generally viewed as a unifying form of validity for psychological measurements, subsuming both content and criterion validity” (Strauss & Smith, 2009, p. 7). Hence both qualitative and quantitative information are essential in evaluating the validity of measures. Methods We review classical test theory and item response theory approaches to evaluating PRO measures including frequency of responses to each category of the items in a multi-item scale, the distribution of scale scores, floor and ceiling effects, the relationship between item response options and the total score, and the extent to which hypothesized “difficulty” (severity) order of items is represented by observed responses. Conclusion Classical test theory and item response theory can be useful in providing a quantitative assessment of items and scales during the content validity phase of patient-reported outcome measures. Depending on the particular type of measure and the specific circumstances, either one or both approaches should be considered to help maximize the content validity of PRO measures. PMID:24811753
Clusters of cultures: diversity in meaning of family value and gender role items across Europe.

Science.gov (United States)

van Vlimmeren, Eva; Moors, Guy B D; Gelissen, John P T M

2017-01-01

Survey data are often used to map cultural diversity by aggregating scores of attitude and value items across countries. However, this procedure only makes sense if the same concept is measured in all countries. In this study we argue that when (co)variances among sets of items are similar across countries, these countries share a common way of assigning meaning to the items. Clusters of cultures can then be observed by doing a cluster analysis on the (co)variance matrices of sets of related items. This study focuses on family values and gender role attitudes. We find four clusters of cultures that assign a distinct meaning to these items, especially in the case of gender roles. Some of these differences reflect response style behavior in the form of acquiescence. Adjusting for this style effect impacts on country comparisons hence demonstrating the usefulness of investigating the patterns of meaning given to sets of items prior to aggregating scores into cultural characteristics.
Survey Development to Assess College Students' Perceptions of the Campus Environment.

Science.gov (United States)

Sowers, Morgan F; Colby, Sarah; Greene, Geoffrey W; Pickett, Mackenzie; Franzen-Castle, Lisa; Olfert, Melissa D; Shelnutt, Karla; Brown, Onikia; Horacek, Tanya M; Kidd, Tandalayo; Kattelmann, Kendra K; White, Adrienne A; Zhou, Wenjun; Riggsbee, Kristin; Yan, Wangcheng; Byrd-Bredbenner, Carol

2017-11-01

We developed and tested a College Environmental Perceptions Survey (CEPS) to assess college students' perceptions of the healthfulness of their campus. CEPS was developed in 3 stages: questionnaire development, validity testing, and reliability testing. Questionnaire development was based on an extensive literature review and input from an expert panel to establish content validity. Face validity was established with the target population using cognitive interviews with 100 college students. Concurrent-criterion validity was established with in-depth interviews (N = 30) of college students compared to surveys completed by the same 30 students. Surveys completed by college students from 8 universities (N = 1147) were used to test internal structure (factor analysis) and internal consistency (Cronbach's alpha). After development and testing, 15 items remained from the original 48 items. A 5-factor solution emerged: physical activity (4 items, α = .635), water (3 items, α = .773), vending (2 items, α = .680), healthy food (2 items, α = .631), and policy (2 items, α = .573). The mean total score for all universities was 62.71 (±11.16) on a 100-point scale. CEPS appears to be a valid and reliable tool for assessing college students' perceptions of their health-related campus environment.
Method of locating related items in a geometric space for data mining

Science.gov (United States)

Hendrickson, Bruce A.

1999-01-01

A method for locating related items in a geometric space transforms relationships among items to geometric locations. The method locates items in the geometric space so that the distance between items corresponds to the degree of relatedness. The method facilitates communication of the structure of the relationships among the items. The method is especially beneficial for communicating databases with many items, and with non-regular relationship patterns. Examples of such databases include databases containing items such as scientific papers or patents, related by citations or keywords. A computer system adapted for practice of the present invention can include a processor, a storage subsystem, a display device, and computer software to direct the location and display of the entities. The method comprises assigning numeric values as a measure of similarity between each pairing of items. A matrix is constructed, based on the numeric values. The eigenvectors and eigenvalues of the matrix are determined. Each item is located in the geometric space at coordinates determined from the eigenvectors and eigenvalues. Proper construction of the matrix and proper determination of coordinates from eigenvectors can ensure that distance between items in the geometric space is representative of the numeric value measure of the items' similarity.
Factors affecting physical therapists' job satisfaction: questionnaire survey targeting first-year physical therapists.

Science.gov (United States)

Kota, Munetsugu; Kudo, Hiroyuki; Okita, Kazuhiko

2018-04-01

[Purpose] The survey aimed to clarify the factors that affect physiotherapists' job satisfaction. [Subjects and Methods] To examine factors affecting physical therapists' job satisfaction using a cross-sectional study with a questionnaire survey. Subjects were 193 first-year physical therapists who participated in a newcomer orientation at Hiroshima Prefectural Physical Therapy Association. The questionnaire comprised items concerning physical therapists' satisfaction with their work, motives for becoming physical therapists, education in school, internships, the workplace, and comfort in the workplace. [Results] Subjects were divided into two groups according to their satisfaction with their occupation. The "high satisfaction" group included 157 subjects, and the group "low satisfaction" group included 36 subjects. Using logistic regression analysis, items concerning comfort in the workplace, motives for becoming physical therapists, and learning in school were analysed. [Conclusion] Factors affecting physical therapists' job satisfaction were primarily influenced by previous experience and working conditions.
Use of indicator items to monitor marine debris on a New Jersey beach from 1991 to 1996

Science.gov (United States)

Ribic, C.A.

1998-01-01

The US National Marine Debris Monitoring Program is using indicator items from beach surveys to identify whether amounts of marine debris are changing over time. Indicator items were selected through expert opinion and assumed to reflect the trend of all debris. We used monthly data from a 1991-1996 study of debris on a New Jersey beach to determine if indicator and non-indicator items showed similar trends. Total indicator debris levels did not change; this was true regardless of probable source. Non-indicator debris increased about 40% annually. Plastic non-indicator items increased regardless of whether items were whole items, cigarette filters, or pieces. Of the whole items, almost 50% were plastic lids, cups, and utensils, and about 25% were drug-related paraphernalia, tobacco-related products, plastic stirrers, pull rings, and fireworks. When indicator items are used in a monitoring programme to reflect total debris patterns, concordance of trends in indicator and non-indicator debris should be checked.
Improving ability measurement in surveys by following the principles of IRT: The Wordsum vocabulary test in the General Social Survey.

Science.gov (United States)

Cor, M Ken; Haertel, Edward; Krosnick, Jon A; Malhotra, Neil

2012-09-01

Survey researchers often administer batteries of questions to measure respondents' abilities, but these batteries are not always designed in keeping with the principles of optimal test construction. This paper illustrates one instance in which following these principles can improve a measurement tool used widely in the social and behavioral sciences: the GSS's vocabulary test called "Wordsum". This ten-item test is composed of very difficult items and very easy items, and item response theory (IRT) suggests that the omission of moderately difficult items is likely to have handicapped Wordsum's effectiveness. Analyses of data from national samples of thousands of American adults show that after adding four moderately difficult items to create a 14-item battery, "Wordsumplus" (1) outperformed the original battery in terms of quality indicators suggested by classical test theory; (2) reduced the standard error of IRT ability estimates in the middle of the latent ability dimension; and (3) exhibited higher concurrent validity. These findings show how to improve Wordsum and suggest that analysts should use a score based on all 14 items instead of using the summary score provided by the GSS, which is based on only the original 10 items. These results also show more generally how surveys measuring abilities (and other constructs) can benefit from careful application of insights from the contemporary educational testing literature. Copyright © 2012 Elsevier Inc. All rights reserved.
Assessment of the psychometrics of a PROMIS item bank: self-efficacy for managing daily activities.

Science.gov (United States)

Hong, Ickpyo; Velozo, Craig A; Li, Chih-Ying; Romero, Sergio; Gruber-Baldini, Ann L; Shulman, Lisa M

2016-09-01

The aim of this study is to investigate the psychometrics of the Patient-Reported Outcomes Measurement Information System self-efficacy for managing daily activities item bank. The item pool was field tested on a sample of 1087 participants via internet (n = 250) and in-clinic (n = 837) surveys. All participants reported having at least one chronic health condition. The 35 item pool was investigated for dimensionality (confirmatory factor analyses, CFA and exploratory factor analysis, EFA), item-total correlations, local independence, precision, and differential item functioning (DIF) across gender, race, ethnicity, age groups, data collection modes, and neurological chronic conditions (McFadden Pseudo R (2) less than 10 %). The item pool met two of the four CFA fit criteria (CFI = 0.952 and SRMR = 0.07). EFA analysis found a dominant first factor (eigenvalue = 24.34) and the ratio of first to second eigenvalue was 12.4. The item pool demonstrated good item-total correlations (0.59-0.85) and acceptable internal consistency (Cronbach's alpha = 0.97). The item pool maintained its precision (reliability over 0.90) across a wide range of theta (3.70), and there was no significant DIF. The findings indicated the item pool has sound psychometric properties and the test items are eligible for development of computerized adaptive testing and short forms.
A Third-Order Item Response Theory Model for Modeling the Effects of Domains and Subdomains in Large-Scale Educational Assessment Surveys

Science.gov (United States)

Rijmen, Frank; Jeon, Minjeong; von Davier, Matthias; Rabe-Hesketh, Sophia

2014-01-01

Second-order item response theory models have been used for assessments consisting of several domains, such as content areas. We extend the second-order model to a third-order model for assessments that include subdomains nested in domains. Using a graphical model framework, it is shown how the model does not suffer from the curse of…
Differential item functioning magnitude and impact measures from item response theory models.

Science.gov (United States)

Kleinman, Marjorie; Teresi, Jeanne A

2016-01-01

Measures of magnitude and impact of differential item functioning (DIF) at the item and scale level, respectively are presented and reviewed in this paper. Most measures are based on item response theory models. Magnitude refers to item level effect sizes, whereas impact refers to differences between groups at the scale score level. Reviewed are magnitude measures based on group differences in the expected item scores and impact measures based on differences in the expected scale scores. The similarities among these indices are demonstrated. Various software packages are described that provide magnitude and impact measures, and new software presented that computes all of the available statistics conveniently in one program with explanations of their relationships to one another.
Identifying the ‘red flags’ for unhealthy weight control among adolescents: Findings from an item response theory analysis of a national survey

Directory of Open Access Journals (Sweden)

Utter Jennifer

2012-08-01

Full Text Available Abstract Background Weight control behaviors are common among young people and are associated with poor health outcomes. Yet clinicians rarely ask young people about their weight control; this may be due to uncertainty about which questions to ask, specifically around whether certain weight loss strategies are healthier or unhealthy or about what weight loss behaviors are more likely to lead to adverse outcomes. Thus, the aims of the current study are: to confirm, using item response theory analysis, that the underlying latent constructs of healthy and unhealthy weight control exist; to determine the ‘red flag’ weight loss behaviors that may discriminate unhealthy from healthy weight loss; to determine the relationships between healthy and unhealthy weight loss and mental health; and to examine how weight control may vary among demographic groups. Methods Data were collected as part of a national health and wellbeing survey of secondary school students in New Zealand (n = 9,107 in 2007. Item response theory analyses were conducted to determine the underlying constructs of weight control behaviors and the behaviors that discriminate unhealthy from healthy weight control. Results The current study confirms that there are two underlying constructs of weight loss behaviors which can be described as healthy and unhealthy weight control. Unhealthy weight control was positively correlated with depressive mood. Fasting and skipping meals for weight loss had the lowest item thresholds on the unhealthy weight control continuum, indicating that they act as ‘red flags’ and warrant further discussion in routine clinical assessments. Conclusions Routine assessments of weight control strategies by clinicians are warranted, particularly for screening for meal skipping and fasting for weight loss as these behaviors appear to ‘flag’ behaviors that are associated with poor mental wellbeing.
Understanding and quantifying cognitive complexity level in mathematical problem solving items

Directory of Open Access Journals (Sweden)

SUSAN E. EMBRETSON

2008-09-01

Full Text Available The linear logistic test model (LLTM; Fischer, 1973 has been applied to a wide variety of new tests. When the LLTM application involves item complexity variables that are both theoretically interesting and empirically supported, several advantages can result. These advantages include elaborating construct validity at the item level, defining variables for test design, predicting parameters of new items, item banking by sources of complexity and providing a basis for item design and item generation. However, despite the many advantages of applying LLTM to test items, it has been applied less often to understand the sources of complexity for large-scale operational test items. Instead, previously calibrated item parameters are modeled using regression techniques because raw item response data often cannot be made available. In the current study, both LLTM and regression modeling are applied to mathematical problem solving items from a widely used test. The findings from the two methods are compared and contrasted for their implications for continued development of ability and achievement tests based on mathematical problem solving items.

ITEM LEVEL DIAGNOSTICS AND MODEL - DATA FIT IN ITEM ...

African Journals Online (AJOL)

Global Journal

Item response theory (IRT) is a framework for modeling and analyzing item response ... data. Though, there is an argument that the evaluation of fit in IRT modeling has been ... National Council on Measurement in Education ... model data fit should be based on three types of ... prediction should be assessed through the.
Sources of interference in item and associative recognition memory.

Science.gov (United States)

Osth, Adam F; Dennis, Simon

2015-04-01

A powerful theoretical framework for exploring recognition memory is the global matching framework, in which a cue's memory strength reflects the similarity of the retrieval cues being matched against the contents of memory simultaneously. Contributions at retrieval can be categorized as matches and mismatches to the item and context cues, including the self match (match on item and context), item noise (match on context, mismatch on item), context noise (match on item, mismatch on context), and background noise (mismatch on item and context). We present a model that directly parameterizes the matches and mismatches to the item and context cues, which enables estimation of the magnitude of each interference contribution (item noise, context noise, and background noise). The model was fit within a hierarchical Bayesian framework to 10 recognition memory datasets that use manipulations of strength, list length, list strength, word frequency, study-test delay, and stimulus class in item and associative recognition. Estimates of the model parameters revealed at most a small contribution of item noise that varies by stimulus class, with virtually no item noise for single words and scenes. Despite the unpopularity of background noise in recognition memory models, background noise estimates dominated at retrieval across nearly all stimulus classes with the exception of high frequency words, which exhibited equivalent levels of context noise and background noise. These parameter estimates suggest that the majority of interference in recognition memory stems from experiences acquired before the learning episode. (c) 2015 APA, all rights reserved).
Refinement of the Brazilian Household Food Insecurity Measurement Scale: Recommendation for a 14-item EBIA

Directory of Open Access Journals (Sweden)

Ana Maria Segall-Corrêa

2014-04-01

Full Text Available OBJECTIVE: To review and refine Brazilian Household Food Insecurity Measurement Scale structure. METHODS: The study analyzed the impact of removing the item "adult lost weight" and one of two possibly redundant items on Brazilian Household Food Insecurity Measurement Scale psychometric behavior using the one-parameter logistic (Rasch model. Brazilian Household Food Insecurity Measurement Scale psychometric behavior was analyzed with respect to acceptable adjustment values ranging from 0.7 to 1.3, and to severity scores of the items with theoretically expected gradients. The socioeconomic and food security indicators came from the 2004 National Household Sample Survey, which obtained complete answers to Brazilian Household Food Insecurity Measurement Scale items from 112,665 households. RESULTS: Removing the items "adult reduced amount..." followed by "adult ate less..." did not change the infit of the remaining items, except for "adult lost weight", whose infit increased from 1.21 to 1.56. The internal consistency and item severity scores did not change when "adult ate less" and one of the two redundant items were removed. CONCLUSION: Brazilian Household Food Insecurity Measurement Scale reanalysis reduced the number of scale items from 16 to 14 without changing its internal validity. Its use as a nationwide household food security measure is strongly recommended.
7 CFR 65.220 - Processed food item.

Science.gov (United States)

2010-01-01

... extruding). Examples of items excluded include teriyaki flavored pork loin, roasted peanuts, breaded chicken... OF BEEF, PORK, LAMB, CHICKEN, GOAT MEAT, PERISHABLE AGRICULTURAL COMMODITIES, MACADAMIA NUTS, PECANS... includes cooking (e.g., frying, broiling, grilling, boiling, steaming, baking, roasting), curing (e.g...
Developing, testing, and implementing a survey of scientist mentoring teachers as part of an RET: The GABI RET mentor survey.

Science.gov (United States)

Davey, B.

2017-12-01

The impacts of mentoring in education have been well established. Mentors have a large impact on their mentees and have been show to affect mentee attitudes towards learning, interest in subjects, future success, and more. While mentoring has a well-documented impact on the mentees, mentoring also has an impact on the mentors themselves. However, little has been studied empirically about these impacts. When we looked for a validated instrument that measured the impact of mentoring on the scientists working with the teachers, we found many anecdotal reports but no instruments that meet our specific needs. To this end, we developed, tested, and implemented our own instrument for measuring the impacts of mentoring on our scientist mentors. Our instrument contained both quantitative and qualitative items designed to reveal the effects of mentoring in two areas: 1) cognitive domain (mentoring, teaching, understanding K-12) and 2) affective domain (professional, personal, participation). We first shared our survey with experts in survey development and mentoring, gathered their feedback, and incorporated their suggestions into our instrument. We then had a subsection of our mentors complete the survey and then complete it again three to four days later (test-retest). Our survey has a high correlation for the test-retest quantitative items (0.93) and a high correlation (0.90) between the three reviewers of the qualitative items. From our findings, we feel we have a validated instrument (face, content, and contruct validity) that answers our research questions reliably. Our contribution to the study of mentoring of science teachers reveals a broad range of impacts on the mentors themselves including an improved understanding of the challenges of classroom teaching, a recognition of the importance of scientists working with science teachers, an enhanced ability to communicate their research and findings, and an increased interest and excitement for their own work.
Design of Web Questionnaires : A Test for Number of Items per Screen

NARCIS (Netherlands)

Toepoel, V.; Das, J.W.M.; van Soest, A.H.O.

2005-01-01

This paper presents results from an experimental manipulation of one versus multiple-items per screen format in a Web survey.The purpose of the experiment was to find out if a questionnaire s format influences how respondents provide answers in online questionnaires and if this is depending on
Using item response theory to measure extreme response style in marketing research

NARCIS (Netherlands)

de Jong, Martijn G.; Steenkamp, Jan-Benedict E.M.; Fox, Gerardus J.A.; Baumgartner, Hans

2008-01-01

Extreme response style (ERS) is an important threat to the validity of survey-based marketing research. In this article, the authors present a new item response theory–based model for measuring ERS. This model contributes to the ERS literature in two ways. First, the method improves on existing
Nonparametric Bounds in the Presence of Item Nonresponse, Unfolding Brackets and Anchoring

NARCIS (Netherlands)

Vazquez-Alvarez, R.; Melenberg, B.; van Soest, A.H.O.

2001-01-01

Household surveys often suffer from nonresponse on variables such as income, savings or wealth.Recent work by Manski shows how bounds on conditional quantiles of the variable of interest can be derived, allowing for any type of nonrandom item nonresponse.The width between these bounds can be reduced
The development and initial assessment of the strategy and leadership systems capability evaluation survey.

Science.gov (United States)

Coon, Cheryl D; Bokowy, Kay L; Horblyuk, Ruslan; Zisman, Robert S; McLeod, Lori D; Brown, T Michelle

2012-01-01

Hospital management and leadership systems are associated with organizational success and quality care. The Strategy and Leadership Systems Capability Evaluation (CE) survey was developed by GE Healthcare to assess management and leadership systems at health care institutions, serve as a benchmark for improvement, and measure progress. To assess the psychometric properties of the 29-item CE survey, including the factor structure, scoring algorithm, reliability, and discriminant validity, an online survey was completed by 3450 employees at 15 US hospitals. Of these employees, 609 worked at a hospital where a leadership and management intervention occurred after the initial survey administration. Data were also collected on job level, number of hospital beds, hospital ownership, location, community type, and the implementation of hospital interventions. Item response frequencies showed no floor or ceiling effects and limited missing data. Interitem correlations were strong without obvious redundancies, and factor analysis suggested a unidimensional scale. The resulting scale had strong internal consistency and was able to discriminate among known groups. The CE survey was developed to evaluate management and leadership systems at health care institutions. This study provides psychometric evidence in support of the reliability, validity, and scoring structure of this survey.
Improving Inpatient Surveys: Web-Based Computer Adaptive Testing Accessed via Mobile Phone QR Codes.

Science.gov (United States)

Chien, Tsair-Wei; Lin, Weir-Sen

2016-03-02

The National Health Service (NHS) 70-item inpatient questionnaire surveys inpatients on their perceptions of their hospitalization experience. However, it imposes more burden on the patient than other similar surveys. The literature shows that computerized adaptive testing (CAT) based on item response theory can help shorten the item length of a questionnaire without compromising its precision. Our aim was to investigate whether CAT can be (1) efficient with item reduction and (2) used with quick response (QR) codes scanned by mobile phones. After downloading the 2008 inpatient survey data from the Picker Institute Europe website and analyzing the difficulties of this 70-item questionnaire, we used an author-made Excel program using the Rasch partial credit model to simulate 1000 patients' true scores followed by a standard normal distribution. The CAT was compared to two other scenarios of answering all items (AAI) and the randomized selection method (RSM), as we investigated item length (efficiency) and measurement accuracy. The author-made Web-based CAT program for gathering patient feedback was effectively accessed from mobile phones by scanning the QR code. We found that the CAT can be more efficient for patients answering questions (ie, fewer items to respond to) than either AAI or RSM without compromising its measurement accuracy. A Web-based CAT inpatient survey accessed by scanning a QR code on a mobile phone was viable for gathering inpatient satisfaction responses. With advances in technology, patients can now be offered alternatives for providing feedback about hospitalization satisfaction. This Web-based CAT is a possible option in health care settings for reducing the number of survey items, as well as offering an innovative QR code access.
Evolution of a Test Item

Science.gov (United States)

Spaan, Mary

2007-01-01

This article follows the development of test items (see "Language Assessment Quarterly", Volume 3 Issue 1, pp. 71-79 for the article "Test and Item Specifications Development"), beginning with a review of test and item specifications, then proceeding to writing and editing of items, pretesting and analysis, and finally selection of an item for a…
Comparative survey of comprehensiveness of literature collection between two information systems

International Nuclear Information System (INIS)

Narui, Shigeko; Habara, Tadashi; Izawa, Michiyo; Naramoto, Miyoko; Kajiro, Tadashi

1983-01-01

To make clear a feature of INIS database for the subject areas of plasma physics and thermonuclear reactions (the INIS category A14), the overlap literature was surveyedwhich had been collected into both INIS and INSPEC databases. All of 4,454 items of literature inputted into that category of INIS during 1980 were checked on whether they had also been included in INSPEC or not. The overlap ratio of the items to those in INIS was found to be 50 % and the ratios for types of literature were 96%, 42%, 28%, 3% and 0% for journals, books, pamphlets, technical reports, and patent dissertations, respectively. Journal articles not included into INSPEC were found to be derived from the journals which were not central to INSPEC. These facts show that INIS covers various types of literature, which INSPEC collected mainly journal literature. For more warrantable conclusion, it needs further survey on those items which are collected into INSPEC but not into INIS. (author)
2005 AdvanceVT Work/Life Survey Leadership Report

OpenAIRE

Glass, Valerie Q.

2005-01-01

The AdvanceVT Faculty Work-Life Survey, distributed to all teaching and research faculty in January 2005, addressed, among other things, leadership issues at Virginia Tech. This report presents findings from tenured and tenure- track faculty members (N=816) about items on the questionnaire related to leadership including: aspirations of Virginia Tech faculty members towards leadership positions, their views about the possibility of maintaining a balance between leadership and other responsibi...
Report on technological survey in fiscal 1999. Demonstration test for smoothing grid interconnection (Collection of information by surveys in overseas countries); 1999 nendo keito renkei enkatsuka jissho shiken chosa hokokusho. Kaigai chosa ni yoru joho shushu

Energy Technology Data Exchange (ETDEWEB)

NONE

2000-03-01

Surveys were performed on the institutional aspects of establishment and operation of grid interconnection guidelines in the countries advanced in introduction of discrete power supply systems. The survey items for America include: (1) summary of the status related to grid interconnection, (2) grid interconnection process, (3) methods for paying expenses for increasing power transmission facilities by means of grid interconnection, (4) dispute processing, (5) information release, and (6) software. The survey items for England, Germany, and France include: (1) summary of electricity business, (2) regulation patterns in electricity business, (3) summary of grid operating organizations, (4) connection to grid interconnection systems, and (5) the future liberalization programs. America is establishing standards for grid interconnection in discrete power supplies including photovoltaic power generation and energy storage under SCC21 of IEEE, whose conclusion will be drawn in the end of 2000. The Energy Department has an intention to give the standards the legal bases to operate them under unified requirements. Germany, England and France have all established standards for operating the grid interconnection. Market liberalization for electric power retailing is advancing in the order of America, England, Germany, and France. (NEDO)
A hierarchy of distress and invariant item ordering in the General Health Questionnaire-12.

Science.gov (United States)

Doyle, F; Watson, R; Morgan, K; McBride, O

2012-06-01

Invariant item ordering (IIO) is defined as the extent to which items have the same ordering (in terms of item difficulty/severity - i.e. demonstrating whether items are difficult [rare] or less difficult [common]) for each respondent who completes a scale. IIO is therefore crucial for establishing a scale hierarchy that is replicable across samples, but no research has demonstrated IIO in scales of psychological distress. We aimed to determine if a hierarchy of distress with IIO exists in a large general population sample who completed a scale measuring distress. Data from 4107 participants who completed the 12-item General Health Questionnaire (GHQ-12) from the Northern Ireland Health and Social Wellbeing Survey 2005-6 were analysed. Mokken scaling was used to determine the dimensionality and hierarchy of the GHQ-12, and items were investigated for IIO. All items of the GHQ-12 formed a single, strong unidimensional scale (H=0.58). IIO was found for six of the 12 items (H-trans=0.55), and these symptoms reflected the following hierarchy: anhedonia, concentration, participation, coping, decision-making and worthlessness. The cross-sectional analysis needs replication. The GHQ-12 showed a hierarchy of distress, but IIO is only demonstrated for six of the items, and the scale could therefore be shortened. Adopting brief, hierarchical scales with IIO may be beneficial in both clinical and research contexts. Copyright © 2011 Elsevier B.V. All rights reserved.
Survey research.

Science.gov (United States)

Alderman, Amy K; Salem, Barbara

2010-10-01

Survey research is a unique methodology that can provide insight into individuals' perspectives and experiences and can be collected on a large population-based sample. Specifically, in plastic surgery, survey research can provide patients and providers with accurate and reproducible information to assist with medical decision-making. When using survey methods in research, researchers should develop a conceptual model that explains the relationships of the independent and dependent variables. The items of the survey are of primary importance. Collected data are only useful if they accurately measure the concepts of interest. In addition, administration of the survey must follow basic principles to ensure an adequate response rate and representation of the intended target sample. In this article, the authors review some general concepts important for successful survey research and discuss the many advantages this methodology has for obtaining limitless amounts of valuable information.
Development of an assessment tool to measure students′ perceptions of respiratory care education programs: Item generation, item reduction, and preliminary validation

Directory of Open Access Journals (Sweden)

Ghazi Alotaibi

2013-01-01

Full Text Available Objectives: Students who perceived their learning environment positively are more likely to develop effective learning strategies, and adopt a deep learning approach. Currently, there is no validated instrument for measuring the educational environment of educational programs on respiratory care (RC. The aim of this study was to develop an instrument to measure students′ perception of the RC educational environment. Materials and Methods: Based on the literature review and an assessment of content validity by multiple focus groups of RC educationalists, potential items of the instrument relevant to RC educational environment construct were generated by the research group. The initial 71 item questionnaire was then field-tested on all students from the 3 RC programs in Saudi Arabia and was subjected to multi-trait scaling analysis. Cronbach′s alpha was used to assess internal consistency reliabilities. Results: Two hundred and twelve students (100% completed the survey. The initial instrument of 71 items was reduced to 65 across 5 scales. Convergent and discriminant validity assessment demonstrated that the majority of items correlated more highly with their intended scale than a competing one. Cronbach′s alpha exceeded the standard criterion of >0.70 in all scales except one. There was no floor or ceiling effect for scale or overall score. Conclusions: This instrument is the first assessment tool developed to measure the RC educational environment. There was evidence of its good feasibility, validity, and reliability. This first validation of the instrument supports its use by RC students to evaluate educational environment.
An item-response theory approach to safety climate measurement: The Liberty Mutual Safety Climate Short Scales.

Science.gov (United States)

Huang, Yueng-Hsiang; Lee, Jin; Chen, Zhuo; Perry, MacKenna; Cheung, Janelle H; Wang, Mo

2017-06-01

Zohar and Luria's (2005) safety climate (SC) scale, measuring organization- and group- level SC each with 16 items, is widely used in research and practice. To improve the utility of the SC scale, we shortened the original full-length SC scales. Item response theory (IRT) analysis was conducted using a sample of 29,179 frontline workers from various industries. Based on graded response models, we shortened the original scales in two ways: (1) selecting items with above-average discriminating ability (i.e. offering more than 6.25% of the original total scale information), resulting in 8-item organization-level and 11-item group-level SC scales; and (2) selecting the most informative items that together retain at least 30% of original scale information, resulting in 4-item organization-level and 4-item group-level SC scales. All four shortened scales had acceptable reliability (≥0.89) and high correlations (≥0.95) with the original scale scores. The shortened scales will be valuable for academic research and practical survey implementation in improving occupational safety. Copyright © 2017 The Author(s). Published by Elsevier Ltd.. All rights reserved.
Using Item Response Theory to Develop a 60-Item Representation of the NEO PI-R Using the International Personality Item Pool: Development of the IPIP-NEO-60.

Science.gov (United States)

Maples-Keller, Jessica L; Williamson, Rachel L; Sleep, Chelsea E; Carter, Nathan T; Campbell, W Keith; Miller, Joshua D

2017-10-31

Given advantages of freely available and modifiable measures, an increase in the use of measures developed from the International Personality Item Pool (IPIP), including the 300-item representation of the Revised NEO Personality Inventory (NEO PI-R; Costa & McCrae, 1992a ) has occurred. The focus of this study was to use item response theory to develop a 60-item, IPIP-based measure of the Five-Factor Model (FFM) that provides equal representation of the FFM facets and to test the reliability and convergent and criterion validity of this measure compared to the NEO Five Factor Inventory (NEO-FFI). In an undergraduate sample (n = 359), scores from the NEO-FFI and IPIP-NEO-60 demonstrated good reliability and convergent validity with the NEO PI-R and IPIP-NEO-300. Additionally, across criterion variables in the undergraduate sample as well as a community-based sample (n = 757), the NEO-FFI and IPIP-NEO-60 demonstrated similar nomological networks across a wide range of external variables (r ICC = .96). Finally, as expected, in an MTurk sample the IPIP-NEO-60 demonstrated advantages over the Big Five Inventory-2 (Soto & John, 2017 ; n = 342) with regard to the Agreeableness domain content. The results suggest strong reliability and validity of the IPIP-NEO-60 scores.
Assessing the Equivalence of Paper, Mobile Phone, and Tablet Survey Responses at a Community Mental Health Center Using Equivalent Halves of a 'Gold-Standard' Depression Item Bank.

Science.gov (United States)

Brodey, Benjamin B; Gonzalez, Nicole L; Elkin, Kathryn Ann; Sasiela, W Jordan; Brodey, Inger S

2017-09-06

The computerized administration of self-report psychiatric diagnostic and outcomes assessments has risen in popularity. If results are similar enough across different administration modalities, then new administration technologies can be used interchangeably and the choice of technology can be based on other factors, such as convenience in the study design. An assessment based on item response theory (IRT), such as the Patient-Reported Outcomes Measurement Information System (PROMIS) depression item bank, offers new possibilities for assessing the effect of technology choice upon results. To create equivalent halves of the PROMIS depression item bank and to use these halves to compare survey responses and user satisfaction among administration modalities-paper, mobile phone, or tablet-with a community mental health care population. The 28 PROMIS depression items were divided into 2 halves based on content and simulations with an established PROMIS response data set. A total of 129 participants were recruited from an outpatient public sector mental health clinic based in Memphis. All participants took both nonoverlapping halves of the PROMIS IRT-based depression items (Part A and Part B): once using paper and pencil, and once using either a mobile phone or tablet. An 8-cell randomization was done on technology used, order of technologies used, and order of PROMIS Parts A and B. Both Parts A and B were administered as fixed-length assessments and both were scored using published PROMIS IRT parameters and algorithms. All 129 participants received either Part A or B via paper assessment. Participants were also administered the opposite assessment, 63 using a mobile phone and 66 using a tablet. There was no significant difference in item response scores for Part A versus B. All 3 of the technologies yielded essentially identical assessment results and equivalent satisfaction levels. Our findings show that the PROMIS depression assessment can be divided into 2 equivalent

Students' approaches to learning in a clinical practicum: A psychometric evaluation based on item response theory.

Science.gov (United States)

Zhao, Yue; Kuan, Hoi Kei; Chung, Joyce O K; Chan, Cecilia K Y; Li, William H C

2018-07-01

The investigation of learning approaches in the clinical workplace context has remained an under-researched area. Despite the validation of learning approach instruments and their applications in various clinical contexts, little is known about the extent to which an individual item, that reflects a specific learning strategy and motive, effectively contributes to characterizing students' learning approaches. This study aimed to measure nursing students' approaches to learning in a clinical practicum using the Approaches to Learning at Work Questionnaire (ALWQ). Survey research design was used in the study. A sample of year 3 nursing students (n = 208) who undertook a 6-week clinical practicum course participated in the study. Factor analyses were conducted, followed by an item response theory analysis, including model assumption evaluation (unidimensionality and local independence), item calibration and goodness-of-fit assessment. Two subscales, deep and surface, were derived. Findings suggested that: (a) items measuring the deep motive from intrinsic interest and deep strategies of relating new ideas to similar situations, and that of concept mapping served as the strongest discriminating indicators; (b) the surface strategy of memorizing facts and details without an overall picture exhibited the highest discriminating power among all surface items; and, (c) both subscales appeared to be informative in assessing a broad range of the corresponding latent trait. The 21-item ALWQ derived from this study presented an efficient, internally consistent and precise measure. Findings provided a useful psychometric evaluation of the ALWQ in the clinical practicum context, added evidence to the utility of the ALWQ for nursing education practice and research, and echoed the discussions from previous studies on the role of the contextual factors in influencing student choices of different learning strategies. They provided insights for clinical educators to measure
17 CFR 210.3A-04 - Intercompany items and transactions.

Science.gov (United States)

2010-04-01

... Financial Statements § 210.3A-04 Intercompany items and transactions. In general, there shall be eliminated intercompany items and transactions between persons included in the (a) consolidated financial statements being... FORM AND CONTENT OF AND REQUIREMENTS FOR FINANCIAL STATEMENTS, SECURITIES ACT OF 1933, SECURITIES...
Instructional Topics in Educational Measurement (ITEMS) Module: Using Automated Processes to Generate Test Items

Science.gov (United States)

Gierl, Mark J.; Lai, Hollis

2013-01-01

Changes to the design and development of our educational assessments are resulting in the unprecedented demand for a large and continuous supply of content-specific test items. One way to address this growing demand is with automatic item generation (AIG). AIG is the process of using item models to generate test items with the aid of computer…
A 67-Item Stress Resilience item bank showing high content validity was developed in a psychosomatic sample.

Science.gov (United States)

Obbarius, Nina; Fischer, Felix; Obbarius, Alexander; Nolte, Sandra; Liegl, Gregor; Rose, Matthias

2018-04-10

To develop the first item bank to measure Stress Resilience (SR) in clinical populations. Qualitative item development resulted in an initial pool of 131 items covering a broad theoretical SR concept. These items were tested in n=521 patients at a psychosomatic outpatient clinic. Exploratory and Confirmatory Factor Analysis (CFA), as well as other state-of-the-art item analyses and IRT were used for item evaluation and calibration of the final item bank. Out of the initial item pool of 131 items, we excluded 64 items (54 factor loading .3, 2 non-discriminative Item Response Curves, 4 Differential Item Functioning). The final set of 67 items indicated sufficient model fit in CFA and IRT analyses. Additionally, a 10-item short form with high measurement precision (SE≤.32 in a theta range between -1.8 and +1.5) was derived. Both the SR item bank and the SR short form were highly correlated with an existing static legacy tool (Connor-Davidson Resilience Scale). The final SR item bank and 10-item short form showed good psychometric properties. When further validated, they will be ready to be used within a framework of Computer-Adaptive Tests for a comprehensive assessment of the Stress-Construct. Copyright © 2018. Published by Elsevier Inc.
Using item response theory to address vulnerabilities in FFQ.

Science.gov (United States)

Kazman, Josh B; Scott, Jonathan M; Deuster, Patricia A

2017-09-01

The limitations for self-reporting of dietary patterns are widely recognised as a major vulnerability of FFQ and the dietary screeners/scales derived from FFQ. Such instruments can yield inconsistent results to produce questionable interpretations. The present article discusses the value of psychometric approaches and standards in addressing these drawbacks for instruments used to estimate dietary habits and nutrient intake. We argue that a FFQ or screener that treats diet as a 'latent construct' can be optimised for both internal consistency and the value of the research results. Latent constructs, a foundation for item response theory (IRT)-based scales (e.g. Patient Reported Outcomes Measurement Information System) are typically introduced in the design stage of an instrument to elicit critical factors that cannot be observed or measured directly. We propose an iterative approach that uses such modelling to refine FFQ and similar instruments. To that end, we illustrate the benefits of psychometric modelling by using items and data from a sample of 12 370 Soldiers who completed the 2012 US Army Global Assessment Tool (GAT). We used factor analysis to build the scale incorporating five out of eleven survey items. An IRT-driven assessment of response category properties indicates likely problems in the ordering or wording of several response categories. Group comparisons, examined with differential item functioning (DIF), provided evidence of scale validity across each Army sub-population (sex, service component and officer status). Such an approach holds promise for future FFQ.
Teoria da Resposta ao Item Teoria de la respuesta al item Item response theory

Directory of Open Access Journals (Sweden)

Eutalia Aparecida Candido de Araujo

2009-12-01

Full Text Available A preocupação com medidas de traços psicológicos é antiga, sendo que muitos estudos e propostas de métodos foram desenvolvidos no sentido de alcançar este objetivo. Entre os trabalhos propostos, destaca-se a Teoria da Resposta ao Item (TRI que, a princípio, veio completar limitações da Teoria Clássica de Medidas, empregada em larga escala até hoje na medida de traços psicológicos. O ponto principal da TRI é que ela leva em consideração o item particularmente, sem relevar os escores totais; portanto, as conclusões não dependem apenas do teste ou questionário, mas de cada item que o compõe. Este artigo propõe-se a apresentar esta Teoria que revolucionou a teoria de medidas.La preocupación con las medidas de los rasgos psicológicos es antigua y muchos estudios y propuestas de métodos fueron desarrollados para lograr este objetivo. Entre estas propuestas de trabajo se incluye la Teoría de la Respuesta al Ítem (TRI que, en principio, vino a completar las limitaciones de la Teoría Clásica de los Tests, ampliamente utilizada hasta hoy en la medida de los rasgos psicológicos. El punto principal de la TRI es que se tiene en cuenta el punto concreto, sin relevar las puntuaciones totales; por lo tanto, los resultados no sólo dependen de la prueba o cuestionario, sino que de cada ítem que lo compone. En este artículo se propone presentar la Teoría que revolucionó la teoría de medidas.The concern with measures of psychological traits is old and many studies and proposals of methods were developed to achieve this goal. Among these proposed methods highlights the Item Response Theory (IRT that, in principle, came to complete limitations of the Classical Test Theory, which is widely used until nowadays in the measurement of psychological traits. The main point of IRT is that it takes into account the item in particular, not relieving the total scores; therefore, the findings do not only depend on the test or questionnaire
Barriers in the path of yoga practice: An online survey

Directory of Open Access Journals (Sweden)

H V Dayananda

2014-01-01

Full Text Available Context: Clinical benefits of yoga have been well explored, but factors contributing to adherence to regular yoga practice are not well studied. Aims: To study the factors influencing adherence to yoga practices on those participants who have completed 1-month Yoga Instructors′ course from a yoga university. Settings and Design: Online survey was conducted on participants who had finished 1-month Yoga Instructors′ course at a yoga university. Materials and Methods: Online survey was conducted using Survey Monkey web portal with response rate of 42.5%. A total of 1355 participants were approached. Demographic items and a checklist of 21 items on a 5-point likert scale were prepared based on traditional yoga texts. A few items to assess modern lifestyle barriers were also included. Statistical Analysis: One-sample proportion test with chi square statistics was used for analysis. Results: Irregularity in lifestyle, family commitments, and occupational commitments are perceived as significant strong barriers. Dullness, excessive talking, strictly adhering to rules, laziness, physical and mental overexertion, fickleness and wandering of mind, unsteadiness of mind, procrastination, and oversleeping are considered as significant barriers of moderate nature. Conclusions: Modern lifestyle is the major challenge for yoga practitioners to adhere to regular practice of yoga. To address this, attention is required in strengthening the lifestyle management and the spiritual dimension of yoga practice as the spiritual component seems to be side-tracked.
[General survey and protection of intangible cultural heritage in traditional medicine in Zhejiang Province].

Science.gov (United States)

Zhu, D M

2017-07-28

From January 2003 to October 2008, the Zhejiang Provincial Department of Culture, together with the Intangible Cultural Heritage Management Department of 11 cities and counties, including Hangzhou, Ningbo, Wenzhou, Huzhou, Jiaxing, Shaoxing, Jinhua, Quzhou, Zhoushan, Taizhou, Lishui, surveyed the Province's intangible cultural heritage in traditional medicine, with a total of 7849 items, including 7 kinds of traditional medicine in 8 major categories: living Chinese medicine culture, ethnic medicine, acu-moxibustion, osteopathic therapy, unique therapies, and Chinese crude drugs, herbal medicine and traditional Chinese medicine preparation, TCM processing.Among them, 9 items have been included in the Representative Project List of National Traditional Medicine Intangible Cultural Heritage, 18 items were listed in Representative Project Directory of Zhejiang Traditional Medicine Intangible Cultural Heritage.Theprotection and inheritance of traditional of the intangible heritage of traditional medicine in Zhejiang province are mainly through the 4 batches of master guidance apprentices.In addition, protection is carried out through organizational support, literature systematization and other measures.
Dissociation between source and item memory in Parkinson's disease

Institute of Scientific and Technical Information of China (English)

Hu Panpan; Li Youhai; Ma Huijuan; Xi Chunhua; Chen Xianwen; Wang Kai

2014-01-01

Background Episodic memory includes information about item memory and source memory.Many researches support the hypothesis that these two memory systems are implemented by different brain structures.The aim of this study was to investigate the characteristics of item memory and source memory processing in patients with Parkinson's disease (PD),and to further verify the hypothesis of dual-process model of source and item memory.Methods We established a neuropsychological battery to measure the performance of item memory and source memory.Totally 35 PD individuals and 35 matched healthy controls (HC) were administrated with the battery.Item memory task consists of the learning and recognition of high-frequency national Chinese characters; source memory task consists of the learning and recognition of three modes (character,picture,and image) of objects.Results Compared with the controls,the idiopathic PD patients have been impaired source memory (PD vs.HC:0.65±0.06 vs.0.72±0.09,P=0.001),but not impaired in item memory (PD vs.HC:0.65±0.07 vs.0.67±0.08,P=0.240).Conclusions The present experiment provides evidence for dissociation between item and source memory in PD patients,thereby strengthening the claim that the item or source memory rely on different brain structures.PD patients show poor source memory,in which dopamine plays a critical role.
Computerized Adaptive Test (CAT) Applications and Item Response Theory Models for Polytomous Items

Science.gov (United States)

Aybek, Eren Can; Demirtasli, R. Nukhet

2017-01-01

This article aims to provide a theoretical framework for computerized adaptive tests (CAT) and item response theory models for polytomous items. Besides that, it aims to introduce the simulation and live CAT software to the related researchers. Computerized adaptive test algorithm, assumptions of item response theory models, nominal response…
Can Item Keyword Feedback Help Remediate Knowledge Gaps?

Science.gov (United States)

Feinberg, Richard A; Clauser, Amanda L

2016-10-01

In graduate medical education, assessment results can effectively guide professional development when both assessment and feedback support a formative model. When individuals cannot directly access the test questions and responses, a way of using assessment results formatively is to provide item keyword feedback. The purpose of the following study was to investigate whether exposure to item keyword feedback aids in learner remediation. Participants included 319 trainees who completed a medical subspecialty in-training examination (ITE) in 2012 as first-year fellows, and then 1 year later in 2013 as second-year fellows. Performance on 2013 ITE items in which keywords were, or were not, exposed as part of the 2012 ITE score feedback was compared across groups based on the amount of time studying (preparation). For the same items common to both 2012 and 2013 ITEs, response patterns were analyzed to investigate changes in answer selection. Test takers who indicated greater amounts of preparation on the 2013 ITE did not perform better on the items in which keywords were exposed compared to those who were not exposed. The response pattern analysis substantiated overall growth in performance from the 2012 ITE. For items with incorrect responses on both attempts, examinees selected the same option 58% of the time. Results from the current study were unsuccessful in supporting the use of item keywords in aiding remediation. Unfortunately, the results did provide evidence of examinees retaining misinformation.
Safety classification of items in Tianwan Nuclear Power Plant

International Nuclear Information System (INIS)

Sun Yongbin

2005-01-01

The principle of integrality, moderation and equilibrium should be considered in the safety classification of items in nuclear power plant. The basic ways for safety classification of items is to classify the safety function based on the effect of the outside enclosure damage of the items (parts) on the safety. Tianwan Nuclear Power Plant adopts Russian VVER-1000/428 type reactor, it safety classification mainly refers to Russian Guidelines and standards. The safety classification of the electric equipment refers to IEEE-308(80) standard, including 1E and Non 1E classification. The safety classification of the instrumentation and control equipment refers to GB/T 15474-1995 standard, including safety 1E, safety-related SR and NC non-safety classification. The safety classification of Tianwan Nuclear Power Plant has to be approved by NNSA and satisfy Chinese Nuclear Safety Guidelines. (authors)
Selecting Items for Criterion-Referenced Tests.

Science.gov (United States)

Mellenbergh, Gideon J.; van der Linden, Wim J.

1982-01-01

Three item selection methods for criterion-referenced tests are examined: the classical theory of item difficulty and item-test correlation; the latent trait theory of item characteristic curves; and a decision-theoretic approach for optimal item selection. Item contribution to the standardized expected utility of mastery testing is discussed. (CM)
Geriatric Anxiety Scale: item response theory analysis, differential item functioning, and creation of a ten-item short form (GAS-10).

Science.gov (United States)

Mueller, Anne E; Segal, Daniel L; Gavett, Brandon; Marty, Meghan A; Yochim, Brian; June, Andrea; Coolidge, Frederick L

2015-07-01

The Geriatric Anxiety Scale (GAS; Segal et al. (Segal, D. L., June, A., Payne, M., Coolidge, F. L. and Yochim, B. (2010). Journal of Anxiety Disorders, 24, 709-714. doi:10.1016/j.janxdis.2010.05.002) is a self-report measure of anxiety that was designed to address unique issues associated with anxiety assessment in older adults. This study is the first to use item response theory (IRT) to examine the psychometric properties of a measure of anxiety in older adults. A large sample of older adults (n = 581; mean age = 72.32 years, SD = 7.64 years, range = 60 to 96 years; 64% women; 88% European American) completed the GAS. IRT properties were examined. The presence of differential item functioning (DIF) or measurement bias by age and sex was assessed, and a ten-item short form of the GAS (called the GAS-10) was created. All GAS items had discrimination parameters of 1.07 or greater. Items from the somatic subscale tended to have lower discrimination parameters than items on the cognitive or affective subscales. Two items were flagged for DIF, but the impact of the DIF was negligible. Women scored significantly higher than men on the GAS and its subscales. Participants in the young-old group (60 to 79 years old) scored significantly higher on the cognitive subscale than participants in the old-old group (80 years old and older). Results from the IRT analyses indicated that the GAS and GAS-10 have strong psychometric properties among older adults. We conclude by discussing implications and future research directions.
Asymptotic Standard Errors for Item Response Theory True Score Equating of Polytomous Items

Science.gov (United States)

Cher Wong, Cheow

2015-01-01

Building on previous works by Lord and Ogasawara for dichotomous items, this article proposes an approach to derive the asymptotic standard errors of item response theory true score equating involving polytomous items, for equivalent and nonequivalent groups of examinees. This analytical approach could be used in place of empirical methods like…
Fiscal 1995 geothermal development promotion survey. Natural environment survey report; 1995 nendo chinetsu kaihatsu sokushin chosa. Shizen kankyo chosa hokokusho

Energy Technology Data Exchange (ETDEWEB)

NONE

1996-03-01

In Candidate C area for the geothermal development survey, the natural environment was surveyed and `the secondary landscape assessment` was summed up in which places proposed for drilling of large-size wells and for construction of power generation facilities are extracted and a simulation of the landscape is conducted. The area for survey is the Shiramizu-gawa region in the south of Lake Akan, Akan-cho, Akan-gun, Hokkaido. The field survey was carried out about three items of landscape, plants and animals during the June-November period, 1995. As to the flora, diverse florae including vegetation unique to alpine areas, wetlands, and fumarole surrounding areas were found in the region, which is covered with summer-green broad-leaved forests or mixed forests of coniferous and broad-leaved trees. As to the fauna, faunae inhabitant of the highly natural forests were found including black woodpeckers and mountain hawk eagles. As a result of studying the places proposed for geothermal development from the above-mentioned survey, two places were picked up in the west of the survey area, where geothermal development is comparatively less influential in the natural environment and landscape and there is a high locational adaptability. 19 refs., 56 figs., 49 tabs.
MIMIC Methods for Assessing Differential Item Functioning in Polytomous Items

Science.gov (United States)

Wang, Wen-Chung; Shih, Ching-Lin

2010-01-01

Three multiple indicators-multiple causes (MIMIC) methods, namely, the standard MIMIC method (M-ST), the MIMIC method with scale purification (M-SP), and the MIMIC method with a pure anchor (M-PA), were developed to assess differential item functioning (DIF) in polytomous items. In a series of simulations, it appeared that all three methods…
A Comparison of the Alcohol Use Disorder Identification Test (AUDIT) in General Population Surveys in nine European Countries

DEFF Research Database (Denmark)

Bloomfield, Kim; Knibbe, Ronald; Derickx, Mieke

2006-01-01

Aims: This study explored the suitability of the Alcohol Use Disorder Identification Test (AUDIT) for cross-national comparable estimates of problem drinking in general populations. On the item level the focus is on responsiveness to cross-national and gender differences. For the set of items...... the focus is on intercorrelations between items, indicating to what extent the AUDIT constitutes a scale. Methods: General population surveys from nine European countries were included. Cross-tabulations were used to analyse cross-national and gender differences in scores on the items. Reliability analysis...... was used to analyse intercorrelations between the items. Results: The items ‘blackouts' (men and women) and ‘guilt and remorse' (women) are the most frequently reported consequences. Gender differences tended to be smaller for ‘guilt and remorse' and ‘concern of others', and largest for ‘morning drinking...
Evaluation of Northwest University, Kano Post-UTME Test Items Using Item Response Theory

Science.gov (United States)

Bichi, Ado Abdu; Hafiz, Hadiza; Bello, Samira Abdullahi

2016-01-01

High-stakes testing is used for the purposes of providing results that have important consequences. Validity is the cornerstone upon which all measurement systems are built. This study applied the Item Response Theory principles to analyse Northwest University Kano Post-UTME Economics test items. The developed fifty (50) economics test items was…
Using Differential Item Functioning Procedures to Explore Sources of Item Difficulty and Group Performance Characteristics.

Science.gov (United States)

Scheuneman, Janice Dowd; Gerritz, Kalle

1990-01-01

Differential item functioning (DIF) methodology for revealing sources of item difficulty and performance characteristics of different groups was explored. A total of 150 Scholastic Aptitude Test items and 132 Graduate Record Examination general test items were analyzed. DIF was evaluated for males and females and Blacks and Whites. (SLD)

Item Response Data Analysis Using Stata Item Response Theory Package

Science.gov (United States)

Yang, Ji Seung; Zheng, Xiaying

2018-01-01

The purpose of this article is to introduce and review the capability and performance of the Stata item response theory (IRT) package that is available from Stata v.14, 2015. Using a simulated data set and a publicly available item response data set extracted from Programme of International Student Assessment, we review the IRT package from…
Psychometric properties of the Centers for Disease Control and Prevention Health-Related Quality of Life (CDC HRQOL items in adults with arthritis

Directory of Open Access Journals (Sweden)

DeVellis Robert

2006-09-01

Full Text Available Abstract Background Measuring health-related quality of life (HRQOL is important in arthritis and the SF-36v2 is the current state-of-the-art. It is only emerging how well the Centers for Disease Control and Prevention (CDC HRQOL measures HRQOL for people with arthritis. This study's purpose is to assess the psychometric properties of the 9-item CDC HRQOL (4-item Healthy Days Core Module and 5-item Healthy Days Symptoms Module in an arthritis sample using the SF-36v2 as a comparison. Methods In Fall 2002, a cross-sectional study acquired survey data including the CDC HRQOL and SF-36v2 from 2 North Carolina populations of adult patients reporting osteoarthritis, rheumatoid arthritis, and fibromyalgia; 2182 (52% responded. The first item of both the CDC HRQOL and the SF-36v2 was general health (GEN. All 8 other CDC HRQOL items ask for the number of days in the past 30 days that respondents experienced various aspects of HRQOL. Exploratory principal components analyses (PCA were conducted on each sample and the combined samples of the CDC HRQOL. The multitrait-multimethod matrix (MTMM was used to compute correlations between each trait (physical health and mental health and between each method of measurement (CDC HRQOL and SF36v2. The relative contribution of the CDC HRQOL in predicting the physical component summary (PCS and the mental component summary (MCS was determined by regressing the CDC HRQOL items on the PCS and MCS scales. Results All 9 CDC HRQOL items loaded primarily onto 1 factor (explaining 57% of the item variance representing a reasonable solution for capturing overall HRQOL. After rotation a 2 factor interpretation for the 9 items was clear, with 4 items capturing physical health (physical, activity, pain, and energy days and 3 items capturing mental health (mental, depression, and anxiety days. All of the loadings for these two factors were greater than 0.70. The CDC HRQOL physical health factor correlated with PCS (r = -.78, p 2
SU-F-T-244: Radiotherapy Risk Estimation Based On Expert Group Survey

International Nuclear Information System (INIS)

Koo, J; Yoon, M; Chung, W; Chung, M; Kim, D

2016-01-01

Purpose: To evaluate the reliability of RPN (Risk Priority Number) decided by expert group and to provide preliminary data for adapting FMEA in Korea. Methods: 1163 Incidents reported in ROSIS for 11 years were used as a real data to be compared with, and were categorized into 146 items. The questionnaire was composed of the 146 items and respondents had to valuate ‘occurrence (O)’, ‘severity (S)’, ‘detectability (D)’ of each item on a scale from 1 to 10 according to the proposed AAPM TG-100 rating scales. 19 medical physicists from 19 different organizations in Korea had participated in the survey. Because the number of ROSIS items was not evenly spread enough to be classified into 10 grades, 1–5 scale was chosen instead of 1–10 and survey result was also fit to 5 grades to compare. Results: The average O,S,D were 1.77, 3.50, 2.13, respectively and the item which had the highest RPN(32) was ‘patient movement during treatment’ in the survey. When comparing items ranked in the top 10 of each survey(O) and ROSIS database, two items were duplicated and ‘Simulation’ and ’Treatment’ were the most frequently ranked RT process in top 10 of survey and ROSIS each. The Chronbach α of each RT process were ranged from 0.74 to 0.99 and p-value was <0.001. When comparing O*D, the average difference was 1.4. Conclusion: This work indicates the deviation between actual risk and expectation. Considering that the respondents were Korean and ROSIS is mainly composed of incidents happened in European countries and some of the top 10 items of ROSIS cannot be applied in radiotherapy procedure in Korea, the deviation could have been came from procedural difference. Moreover, if expert group was consisted of experts from various parts, expectation might have been more accurate. Therefore, further research on radiotherapy risk estimation is needed.
SU-F-T-244: Radiotherapy Risk Estimation Based On Expert Group Survey

Energy Technology Data Exchange (ETDEWEB)

Koo, J; Yoon, M [Korea University, Seoul (Korea, Republic of); Chung, W; Chung, M; Kim, D [Kyung Hee University Hospital at Gangdong, Gangdong-gu, Seoul (Korea, Republic of)

2016-06-15

Purpose: To evaluate the reliability of RPN (Risk Priority Number) decided by expert group and to provide preliminary data for adapting FMEA in Korea. Methods: 1163 Incidents reported in ROSIS for 11 years were used as a real data to be compared with, and were categorized into 146 items. The questionnaire was composed of the 146 items and respondents had to valuate ‘occurrence (O)’, ‘severity (S)’, ‘detectability (D)’ of each item on a scale from 1 to 10 according to the proposed AAPM TG-100 rating scales. 19 medical physicists from 19 different organizations in Korea had participated in the survey. Because the number of ROSIS items was not evenly spread enough to be classified into 10 grades, 1–5 scale was chosen instead of 1–10 and survey result was also fit to 5 grades to compare. Results: The average O,S,D were 1.77, 3.50, 2.13, respectively and the item which had the highest RPN(32) was ‘patient movement during treatment’ in the survey. When comparing items ranked in the top 10 of each survey(O) and ROSIS database, two items were duplicated and ‘Simulation’ and ’Treatment’ were the most frequently ranked RT process in top 10 of survey and ROSIS each. The Chronbach α of each RT process were ranged from 0.74 to 0.99 and p-value was <0.001. When comparing O*D, the average difference was 1.4. Conclusion: This work indicates the deviation between actual risk and expectation. Considering that the respondents were Korean and ROSIS is mainly composed of incidents happened in European countries and some of the top 10 items of ROSIS cannot be applied in radiotherapy procedure in Korea, the deviation could have been came from procedural difference. Moreover, if expert group was consisted of experts from various parts, expectation might have been more accurate. Therefore, further research on radiotherapy risk estimation is needed.
A study on the establishment of safety assessment guidelines of commercial grade item dedication in digitalized safety systems

International Nuclear Information System (INIS)

Hwang, H. S.; Kim, B. R.; Oh, S. H.

1999-01-01

Because of obsolescing the components used in safety related systems of nuclear power plants, decreasing the number of suppliers qualified for the nuclear QA program and increasing maintenance costs of them, utilities have been considering to use commercial grade digital computers as an alternative for resolving such issues. However, commercial digital computers use the embedded pre-existing software, including operating system software, which are not developed by using nuclear grade QA program. Thus, it is necessary for utilities to establish processes for dedicating digital commercial grade items. A regulatory body also needs guidance to evaluate the digital commercial products properly. This paper surveyed the regulations and their regulatory guides, which establish the requirements for commercial grade items dedication, industry standards and guidances applicable to safety related systems. This paper provides some guidelines to be applied in evaluating the safety of digital upgrades and new digital plant protection systems in Korea
Item Banking with Embedded Standards

Science.gov (United States)

MacCann, Robert G.; Stanley, Gordon

2009-01-01

An item banking method that does not use Item Response Theory (IRT) is described. This method provides a comparable grading system across schools that would be suitable for low-stakes testing. It uses the Angoff standard-setting method to obtain item ratings that are stored with each item. An example of such a grading system is given, showing how…
Qualitative Development and Content Validation of the PROMIS Pediatric Sleep Health Items.

Science.gov (United States)

Bevans, Katherine B; Meltzer, Lisa J; De La Motte, Anna; Kratchman, Amy; Viél, Dominique; Forrest, Christopher B

2018-04-25

To develop the Patient Reported Outcome Measurement Information System (PROMIS) Pediatric Sleep Health item pool and evaluate its content validity. Participants included 8 expert sleep clinician-researchers, 64 children ages 8-17 years, and 54 parents of children ages 5-17 years. We started with item concepts and expressions from the PROMIS Sleep Disturbance and Sleep Related Impairment adult measures. Additional pediatric sleep health concepts were generated by expert (n = 8), child (n = 28), and parent (n = 33) concept elicitation interviews and a systematic review of existing pediatric sleep health questionnaires. Content validity of the item pool was evaluated with item translatability review, readability analysis, and child (n = 36) and parent (n = 21) cognitive interviews. The final pediatric Sleep Health item pool includes 43 items that assess sleep disturbance (children's capacity to fall and stay asleep, sleep quality, dreams, and parasomnias) and sleep-related impairments (daytime sleepiness, low energy, difficulty waking up, and the impact of sleep and sleepiness on cognition, affect, behavior, and daily activities). Items are translatable and relevant and well understood by children ages 8-17 and parents of children ages 5-17. Rigorous qualitative procedures were used to develop and evaluate the content validity of the PROMIS Pediatric Sleep Health item pool. Once the item pool's psychometric properties are established, the scales will be useful for measuring children's subjective experiences of sleep.
A symptom profile of depression among Asian Americans: is there evidence for differential item functioning of depressive symptoms?

Science.gov (United States)

Kalibatseva, Z; Leong, F T L; Ham, E H

2014-09-01

Theoretical and clinical publications suggest the existence of cultural differences in the expression and experience of depression. Measurement non-equivalence remains a potential methodological explanation for the lower prevalence of depression among Asian Americans compared to European Americans. This study compared DSM-IV depressive symptoms among Asian Americans and European Americans using secondary data analysis of the Collaborative Psychiatric Epidemiology Surveys (CPES). The Composite International Diagnostic Interview (CIDI) was used for the assessment of depressive symptoms. Of the entire sample, 310 Asian Americans and 1974 European Americans reported depressive symptoms and were included in the analyses. Measurement variance was examined with an item response theory differential item functioning (IRT DIF) analysis. χ2 analyses indicated that, compared to Asian Americans, European American participants more frequently endorsed affective symptoms such as 'feeling depressed', 'feeling discouraged' and 'cried more often'. The IRT analysis detected DIF for four out of the 15 depression symptom items. At equal levels of depression, Asian Americans endorsed feeling worthless and appetite changes more easily than European Americans, and European Americans endorsed feeling nervous and crying more often than Asian Americans. Asian Americans did not seem to over-report somatic symptoms; however, European Americans seemed to report more affective symptoms than Asian Americans. The results suggest that there was measurement variance in a few of the depression items.
A score for measuring health risk perception in environmental surveys.

Science.gov (United States)

Marcon, Alessandro; Nguyen, Giang; Rava, Marta; Braggion, Marco; Grassi, Mario; Zanolin, Maria Elisabetta

2015-09-15

In environmental surveys, risk perception may be a source of bias when information on health outcomes is reported using questionnaires. Using the data from a survey carried out in the largest chipboard industrial district in Italy (Viadana, Mantova), we devised a score of health risk perception and described its determinants in an adult population. In 2006, 3697 parents of children were administered a questionnaire that included ratings on 7 environmental issues. Items dimensionality was studied by factor analysis. After testing equidistance across response options by homogeneity analysis, a risk perception score was devised by summing up item ratings. Factor analysis identified one latent factor, which we interpreted as health risk perception, that explained 65.4% of the variance of five items retained after scaling. The scale (range 0-10, mean ± SD 9.3 ± 1.9) had a good internal consistency (Cronbach's alpha 0.87). Most subjects (80.6%) expressed maximum risk perception (score = 10). Italian mothers showed significantly higher risk perception than foreign fathers. Risk perception was higher for parents of young children, and for older parents with a higher education, than for their counterparts. Actual distance to major roads was not associated with the score, while self-reported intense traffic and frequent air refreshing at home predicted higher risk perception. When investigating health effects of environmental hazards using questionnaires, care should be taken to reduce the possibility of awareness bias at the stage of study planning and data analysis. Including appropriate items in study questionnaires can be useful to derive a measure of health risk perception, which can help to identify confounding of association estimates by risk perception. Copyright © 2015 Elsevier B.V. All rights reserved.
Surveying Turkish high school and university students’ attitudes and approaches to physics problem solving

Directory of Open Access Journals (Sweden)

Nuri Balta

2016-04-01

Full Text Available Students’ attitudes and approaches to physics problem solving can impact how well they learn physics and how successful they are in solving physics problems. Prior research in the U.S. using a validated Attitude and Approaches to Problem Solving (AAPS survey suggests that there are major differences between students in introductory physics and astronomy courses and physics experts in terms of their attitudes and approaches to physics problem solving. Here we discuss the validation, administration, and analysis of data for the Turkish version of the AAPS survey for high school and university students in Turkey. After the validation and administration of the Turkish version of the survey, the analysis of the data was conducted by grouping the data by grade level, school type, and gender. While there are no statistically significant differences between the averages of various groups on the survey, overall, the university students in Turkey were more expertlike than vocational high school students. On an item by item basis, there are statistically differences between the averages of the groups on many items. For example, on average, the university students demonstrated less expertlike attitudes about the role of equations and formulas in problem solving, in solving difficult problems, and in knowing when the solution is not correct, whereas they displayed more expertlike attitudes and approaches on items related to metacognition in physics problem solving. A principal component analysis on the data yields item clusters into which the student responses on various survey items can be grouped. A comparison of the responses of the Turkish and American university students enrolled in algebra-based introductory physics courses shows that on more than half of the items, the responses of these two groups were statistically significantly different, with the U.S. students on average responding to the items in a more expertlike manner.
P2-19: The Effect of item Repetition on Item-Context Association Depends on the Prior Exposure of Items

Directory of Open Access Journals (Sweden)

Hongmi Lee

2012-10-01

Full Text Available Previous studies have reported conflicting findings on whether item repetition has beneficial or detrimental effects on source memory. To reconcile such contradictions, we investigated whether the degree of pre-exposure of items can be a potential modulating factor. The experimental procedures spanned two consecutive days. On Day 1, participants were exposed to a set of unfamiliar faces. On Day 2, the same faces presented on the previous day were used again in half of the participants, whereas novel faces were used for the other half. Day 2 procedures consisted of three successive phases: item repetition, source association, and source memory test. In the item repetition phase, half of the face stimuli were repeatedly presented while participants were making male/female judgments. During the source association phase, both the repeated and the unrepeated faces appeared in one of the four locations on the screen. Finally, participants were tested on the location in which a given face was presented during the previous phase and reported the confidence of their memory. Source memory accuracy was measured as the percentage of correct non-guess trials. As results, we found a significant interaction between prior exposure and repetition. Repetition impaired source memory when the items had been pre-exposed on Day 1, while it led to greater accuracy in novel ones. These results show that pre-experimental exposure can modulate the effects of repetition on associative binding between an item and its contextual information, suggesting that pre-existing representation and novelty signal interact to form new episodic memory.
Comparison of Alternate and Original Items on the Montreal Cognitive Assessment.

Science.gov (United States)

Lebedeva, Elena; Huang, Mei; Koski, Lisa

2016-03-01

The Montreal Cognitive Assessment (MoCA) is a screening tool for mild cognitive impairment (MCI) in elderly individuals. We hypothesized that measurement error when using the new alternate MoCA versions to monitor change over time could be related to the use of items that are not of comparable difficulty to their corresponding originals of similar content. The objective of this study was to compare the difficulty of the alternate MoCA items to the original ones. Five selected items from alternate versions of the MoCA were included with items from the original MoCA administered adaptively to geriatric outpatients (N = 78). Rasch analysis was used to estimate the difficulty level of the items. None of the five items from the alternate versions matched the difficulty level of their corresponding original items. This study demonstrates the potential benefits of a Rasch analysis-based approach for selecting items during the process of development of parallel forms. The results suggest that better match of the items from different MoCA forms by their difficulty would result in higher sensitivity to changes in cognitive function over time.
Pattern analysis of total item score and item response of the Kessler Screening Scale for Psychological Distress (K6 in a nationally representative sample of US adults

Directory of Open Access Journals (Sweden)

Shinichiro Tomitaka

2017-02-01

Full Text Available Background Several recent studies have shown that total scores on depressive symptom measures in a general population approximate an exponential pattern except for the lower end of the distribution. Furthermore, we confirmed that the exponential pattern is present for the individual item responses on the Center for Epidemiologic Studies Depression Scale (CES-D. To confirm the reproducibility of such findings, we investigated the total score distribution and item responses of the Kessler Screening Scale for Psychological Distress (K6 in a nationally representative study. Methods Data were drawn from the National Survey of Midlife Development in the United States (MIDUS, which comprises four subsamples: (1 a national random digit dialing (RDD sample, (2 oversamples from five metropolitan areas, (3 siblings of individuals from the RDD sample, and (4 a national RDD sample of twin pairs. K6 items are scored using a 5-point scale: “none of the time,” “a little of the time,” “some of the time,” “most of the time,” and “all of the time.” The pattern of total score distribution and item responses were analyzed using graphical analysis and exponential regression model. Results The total score distributions of the four subsamples exhibited an exponential pattern with similar rate parameters. The item responses of the K6 approximated a linear pattern from “a little of the time” to “all of the time” on log-normal scales, while “none of the time” response was not related to this exponential pattern. Discussion The total score distribution and item responses of the K6 showed exponential patterns, consistent with other depressive symptom scales.
Investigating the Impact of Item Parameter Drift for Item Response Theory Models with Mixture Distributions

Directory of Open Access Journals (Sweden)

Yoon Soo ePark

2016-02-01

Full Text Available This study investigates the impact of item parameter drift (IPD on parameter and ability estimation when the underlying measurement model fits a mixture distribution, thereby violating the item invariance property of unidimensional item response theory (IRT models. An empirical study was conducted to demonstrate the occurrence of both IPD and an underlying mixture distribution using real-world data. Twenty-one trended anchor items from the 1999, 2003, and 2007 administrations of Trends in International Mathematics and Science Study (TIMSS were analyzed using unidimensional and mixture IRT models. TIMSS treats trended anchor items as invariant over testing administrations and uses pre-calibrated item parameters based on unidimensional IRT. However, empirical results showed evidence of two latent subgroups with IPD. Results showed changes in the distribution of examinee ability between latent classes over the three administrations. A simulation study was conducted to examine the impact of IPD on the estimation of ability and item parameters, when data have underlying mixture distributions. Simulations used data generated from a mixture IRT model and estimated using unidimensional IRT. Results showed that data reflecting IPD using mixture IRT model led to IPD in the unidimensional IRT model. Changes in the distribution of examinee ability also affected item parameters. Moreover, drift with respect to item discrimination and distribution of examinee ability affected estimates of examinee ability. These findings demonstrate the need to caution and evaluate IPD using a mixture IRT framework to understand its effect on item parameters and examinee ability.
Investigating the Impact of Item Parameter Drift for Item Response Theory Models with Mixture Distributions.

Science.gov (United States)

Park, Yoon Soo; Lee, Young-Sun; Xing, Kuan

2016-01-01

This study investigates the impact of item parameter drift (IPD) on parameter and ability estimation when the underlying measurement model fits a mixture distribution, thereby violating the item invariance property of unidimensional item response theory (IRT) models. An empirical study was conducted to demonstrate the occurrence of both IPD and an underlying mixture distribution using real-world data. Twenty-one trended anchor items from the 1999, 2003, and 2007 administrations of Trends in International Mathematics and Science Study (TIMSS) were analyzed using unidimensional and mixture IRT models. TIMSS treats trended anchor items as invariant over testing administrations and uses pre-calibrated item parameters based on unidimensional IRT. However, empirical results showed evidence of two latent subgroups with IPD. Results also showed changes in the distribution of examinee ability between latent classes over the three administrations. A simulation study was conducted to examine the impact of IPD on the estimation of ability and item parameters, when data have underlying mixture distributions. Simulations used data generated from a mixture IRT model and estimated using unidimensional IRT. Results showed that data reflecting IPD using mixture IRT model led to IPD in the unidimensional IRT model. Changes in the distribution of examinee ability also affected item parameters. Moreover, drift with respect to item discrimination and distribution of examinee ability affected estimates of examinee ability. These findings demonstrate the need to caution and evaluate IPD using a mixture IRT framework to understand its effects on item parameters and examinee ability.
Applying modern psychometric techniques to melodic discrimination testing: Item response theory, computerised adaptive testing, and automatic item generation.

Science.gov (United States)

Harrison, Peter M C; Collins, Tom; Müllensiefen, Daniel

2017-06-15

Modern psychometric theory provides many useful tools for ability testing, such as item response theory, computerised adaptive testing, and automatic item generation. However, these techniques have yet to be integrated into mainstream psychological practice. This is unfortunate, because modern psychometric techniques can bring many benefits, including sophisticated reliability measures, improved construct validity, avoidance of exposure effects, and improved efficiency. In the present research we therefore use these techniques to develop a new test of a well-studied psychological capacity: melodic discrimination, the ability to detect differences between melodies. We calibrate and validate this test in a series of studies. Studies 1 and 2 respectively calibrate and validate an initial test version, while Studies 3 and 4 calibrate and validate an updated test version incorporating additional easy items. The results support the new test's viability, with evidence for strong reliability and construct validity. We discuss how these modern psychometric techniques may also be profitably applied to other areas of music psychology and psychological science in general.
Comparative survey of comprehensiveness of literature collection between two information systems, 2

International Nuclear Information System (INIS)

Narui, Shigeko; Habara, Tadashi; Izawa, Michiyo; Naramoto, Miyoko; Kajiro, Tadashi

1984-01-01

To make clear a feature of nuclear fusion area of the INIS database in comparison with the INSPEC, a survey has been made of overlap literature included in both databases, and of unique literature included only in the INSPEC. All of the 5,774 items in the categories a50.00 and a28.50R of the INSPEC in 1980 were checked on whether each item was also included in the fusion category A14 of the INIS during four years of 1979 to 1982 or not. The ratios of literature included in the INIS were 52 % and 84 % for journal and technical report, respectively. The ratio for journal was considered in connection with differences in the scope and coverage as well as input system between the two databases. High comprehensiveness for technical report is achievable in the INIS. Comparison of language of literature included in both databases, and time lags for publication are briefly described. (author)
Development and Validation of Culture-Sensitive Physics Learning Environment Survey (CS-PLES

Directory of Open Access Journals (Sweden)

Marie Paz E. Morales

2014-05-01

Full Text Available The study combined qualitative approaches with quantitative research design to come up with a survey instrument called Culture-Sensitive Physics Learning Environment Survey (CS-PLES.This survey instrument is intended to extract the learners’ beliefs and expectations on the integration of culture and language in the teaching and learning process of physics concepts. Significant contribution of the instrument can be traced to establishing and defining the constructs and categories on how curriculum localization and context-based science learning can be developed aligned with students’ expectations and beliefs. The development process employed non-conventional processes adopted from literature which included pilot study to identify pre-deterministic constructs and specific categories for the items to be included in the survey. Data analysis included descriptive statistics and factor analysis to establish the categories or constructs of the survey instruments. Reliability measures of the instrument and its respective constructs were established for standardization. These categories were intended to aid researchers for an in-depth analysis when the instrument is administered for its purpose. The raw statistical categories were qualitatively paralleled with the pre-deterministic constructs to establish congruence of the survey tool to Instructional Congruence Framework (ICF.
An improved non-Markovian degradation model with long-term dependency and item-to-item uncertainty

Science.gov (United States)

Xi, Xiaopeng; Chen, Maoyin; Zhang, Hanwen; Zhou, Donghua

2018-05-01

It is widely noted in the literature that the degradation should be simplified into a memoryless Markovian process for the purpose of predicting the remaining useful life (RUL). However, there actually exists the long-term dependency in the degradation processes of some industrial systems, including electromechanical equipments, oil tankers, and large blast furnaces. This implies the new degradation state depends not only on the current state, but also on the historical states. Such dynamic systems cannot be accurately described by traditional Markovian models. Here we present an improved non-Markovian degradation model with both the long-term dependency and the item-to-item uncertainty. As a typical non-stationary process with dependent increments, fractional Brownian motion (FBM) is utilized to simulate the fractal diffusion of practical degradations. The uncertainty among multiple items can be represented by a random variable of the drift. Based on this model, the unknown parameters are estimated through the maximum likelihood (ML) algorithm, while a closed-form solution to the RUL distribution is further derived using a weak convergence theorem. The practicability of the proposed model is fully verified by two real-world examples. The results demonstrate that the proposed method can effectively reduce the prediction error.
Development and Validation of a Novel Generic Health-related Quality of Life Instrument With 20 Items (HINT-20

Directory of Open Access Journals (Sweden)

Min-Woo Jo

2017-01-01

Full Text Available Objectives Few attempts have been made to develop a generic health-related quality of life (HRQoL instrument and to examine its validity and reliability in Korea. We aimed to do this in our present study. Methods After a literature review of existing generic HRQoL instruments, a focus group discussion, in-depth interviews, and expert consultations, we selected 30 tentative items for a new HRQoL measure. These items were evaluated by assessing their ceiling effects, difficulty, and redundancy in the first survey. To validate the HRQoL instrument that was developed, known-groups validity and convergent/discriminant validity were evaluated and its test-retest reliability was examined in the second survey. Results Of the 30 items originally assessed for the HRQoL instrument, four were excluded due to high ceiling effects and six were removed due to redundancy. We ultimately developed a HRQoL instrument with a reduced number of 20 items, known as the Health-related Quality of Life Instrument with 20 items (HINT-20, incorporating physical, mental, social, and positive health dimensions. The results of the HINT-20 for known-groups validity were poorer in women, the elderly, and those with a low income. For convergent/discriminant validity, the correlation coefficients of items (except vitality in the physical health dimension with the physical component summary of the Short Form 36 version 2 (SF-36v2 were generally higher than the correlations of those items with the mental component summary of the SF-36v2, and vice versa. Regarding test-retest reliability, the intraclass correlation coefficient of the total HINT-20 score was 0.813 (p<0.001. Conclusions A novel generic HRQoL instrument, the HINT-20, was developed for the Korean general population and showed acceptable validity and reliability.

Combining item and bulk material loss-detection uncertainties

International Nuclear Information System (INIS)

Eggers, R.F.

1982-01-01

Loss detection requirements, such as five formula kilograms with 99% probability of detection, which apply to the sum of losses from material in both item and bulk form, constitute a special problem for the nuclear material statistician. Requirements of this type are included in the Material Control and Accounting Reform Amendments described in the Advance Notice of Proposed Rule Making (Federal Register, 46(175):45144-46151). Attribute test sampling of items is the method used to detect gross defects in the inventory of items in a given control unit. Attribute sampling plans are designed to detect a loss of a specificed goal quantity of material with a given probability. In contrast to the methods and statistical models used for item loss detection, bulk material loss detection requires all the material entering and leaving a control unit to be measured and the calculation of a loss estimator that will be tested against an appropriate alarm threshold. The alarm threshold is determined from an estimate of the error inherent in the components of the loss estimator. In this paper a simple grahical method of evaluating the combined capabilities of bulk material loss detection methods and item attribute testing procedures will be described. Quantitative results will be given for several cases, indicating how a decrease in the precision of the item loss detection method tends to force an increase in the precision of the bulk loss detection procedure in order to meet the overall detection requirement. 4 figures
Benthic marine debris, with an emphasis on fishery-related items, surrounding Kodiak Island, Alaska, 1994-1996

Science.gov (United States)

Hess, N.A.; Ribic, C.A.; Vining, I.

1999-01-01

Composition and abundance of benthic marine debris were investigated during three bottom trawl surveys in inlet and offshore locations surrounding Kodiak Island, Alaska, 1994-1996. Debris items were primarily plastic and metal regardless of trawl location. Plastic bait jars, fishing line, and crab pots were the most common fishery-related debris items and were encountered in large amounts in inlets (20-25 items km-2), but were less abundant outside of inlets (4.5-11 items km-2). Overall density of debris was also significantly greater in inlets than outside of inlets. Plastic debris densities in inlets ranged 22-31.5 items km-2, 7.8-18.8 items km-2 outside of inlets. Trawls in inlets contained almost as much metal debris as plastic debris. Density of metal debris ranged from 21.2 to 23.7 items km-2 in inlets, a maximum of 2.7 items km-2 outside of inlets. Inlets around the town of Kodiak had the highest densities of fishery-related and total benthic debris. Differences in benthic debris density between inlets and outside of inlets and differences by area may be due to differences in fishing activity and water circulation patterns. At the current reduced levels of fishing activity, however, yearly monitoring of benthic debris appears unnecessary. Copyright (C) 1999.
Establishing key components of yoga interventions for musculoskeletal conditions: a Delphi survey

Science.gov (United States)

2014-01-01

Background Evidence suggests yoga is a safe and effective intervention for the management of physical and psychosocial symptoms associated with musculoskeletal conditions. However, heterogeneity in the components and reporting of clinical yoga trials impedes both the generalization of study results and the replication of study protocols. The aim of this Delphi survey was to address these issues of heterogeneity, by developing a list of recommendations of key components for the design and reporting of yoga interventions for musculoskeletal conditions. Methods Recognised experts involved in the design, conduct, and teaching of yoga for musculoskeletal conditions were identified from a systematic review, and invited to contribute to the Delphi survey. Forty-one of the 58 experts contacted, representing six countries, agreed to participate. A three-round Delphi was conducted via electronic surveys. Round 1 presented an open-ended question, allowing panellists to individually identify components they considered key to the design and reporting of yoga interventions for musculoskeletal conditions. Thematic analysis of Round 1 identified items for quantitative rating in Round 2; items not reaching consensus were forwarded to Round 3 for re-rating. Results Thirty-six panellists (36/41; 88%) completed the three rounds of the Delphi survey. Panellists provided 348 comments to the Round 1 question. These comments were reduced to 49 items, grouped under five themes, for rating in subsequent rounds. A priori group consensus of ≥80% was reached on 28 items related to five themes concerning defining the yoga intervention, types of yoga practices to include in an intervention, delivery of the yoga protocol, domains of outcome measures, and reporting of yoga interventions for musculoskeletal conditions. Additionally, a priori consensus of ≥50% was reached on five items relating to minimum values for intervention parameters. Conclusions Expert consensus has provided a non
Exploring differential item functioning in the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC

Directory of Open Access Journals (Sweden)

Pollard Beth

2012-12-01

Full Text Available Abstract Background The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC is a widely used patient reported outcome in osteoarthritis. An important, but frequently overlooked, aspect of validating health outcome measures is to establish if items exhibit differential item functioning (DIF. That is, if respondents have the same underlying level of an attribute, does the item give the same score in different subgroups or is it biased towards one subgroup or another. The aim of the study was to explore DIF in the Likert format WOMAC for the first time in a UK osteoarthritis population with respect to demographic, social, clinical and psychological factors. Methods The sample comprised a community sample of 763 people with osteoarthritis who participated in the Somerset and Avon Survey of Health. The WOMAC was explored for DIF by gender, age, social deprivation, social class, employment status, distress, body mass index and clinical factors. Ordinal regression models were used to identify DIF items. Results After adjusting for age, two items were identified for the physical functioning subscale as having DIF with age identified as the DIF factor for 2 items, gender for 1 item and body mass index for 1 item. For the WOMAC pain subscale, for people with hip osteoarthritis one item was identified with age-related DIF. The impact of the DIF items rarely had a significant effect on the conclusions of group comparisons. Conclusions Overall, the WOMAC performed well with only a small number of DIF items identified. However, as DIF items were identified in for the WOMAC physical functioning subscale it would be advisable to analyse data taking into account the possible impact of the DIF items when weight, gender or especially age effects, are the focus of interest in UK-based osteoarthritis studies. Similarly for the WOMAC pain subscale in people with hip osteoarthritis it would be worthwhile to analyse data taking into account the
The Technical Quality of Test Items Generated Using a Systematic Approach to Item Writing.

Science.gov (United States)

Siskind, Theresa G.; Anderson, Lorin W.

The study was designed to examine the similarity of response options generated by different item writers using a systematic approach to item writing. The similarity of response options to student responses for the same item stems presented in an open-ended format was also examined. A non-systematic (subject matter expertise) approach and a…
Using Cognitive Testing to Develop Items for Surveying Asian American Cancer Patients and Their Caregivers as a Pathway to Culturally Competent Care.

Science.gov (United States)

Bolcic-Jankovic, Dragana; Lu, Fengxin; Colten, Mary Ellen; McCarthy, Ellen P

2016-02-01

We report the results from cognitive interviews with Asian American patients and their caregivers. We interviewed seven caregivers and six patients who were all bilingual Asian Americans. The main goal of the cognitive interviews was to test a survey instrument developed for a study about perspectives of Asian American patients with advanced cancer who are facing decisions around end-of-life care. We were particularly interested to see whether items commonly used in White and Black populations are culturally meaningful and equivalent in Asian populations, primarily those of Chinese and Vietnamese ethnicity. Our exploration shows that understanding respondents' language proficiency, degree of acculturation, and cultural context of receiving, processing, and communicating information about medical care can help design questions that are appropriate for Asian American patients and caregivers, and therefore can help researchers obtain quality data about the care Asian American cancer patients receive. © The Author(s) 2016.
Human factors survey of advanced instrumentation and controls

International Nuclear Information System (INIS)

Carter, R.J.

1989-01-01

A survey oriented towards identifying the human factors issues in regard to the use of advanced instrumentation and controls (I ampersand C) in the nuclear industry was conducted. A number of United States (US) and Canadian nuclear vendors and utilities were participants in the survey. Human factors items, subsumed under the categories of computer-generated displays (CGD), controls, organizational support, training, and related topics, were discussed. The survey found the industry to be concerned about the human factors issues related to the implementation of advanced I ampersand C. Fifteen potential human factors problems were identified. They include: the need for an advanced I ampersand C guideline equivalent to NUREG-0700; a role change in the control room from operator to supervisor; information overload; adequacy of existing training technology for advanced I ampersand C; and operator acceptance and trust. 11 refs., 1 tab
Characterization of the human talent in health that serves people with chronic disease: construction of a survey

Directory of Open Access Journals (Sweden)

Sonia Carreño-Moreno

2016-02-01

Full Text Available The objective of this work, is provide conceptual elements that constitute an integrated vision of care conditions required by the human talent in health HTH that caters to people with chronic disease (CD and their families, and that are translated into a tool for gathering information of survey type that allow characterization. This research was conducted in three phases: 1 Review of the literature. 2 Structuring a proposed survey 3 Refinement of the final version of the survey. As results, based on the conceptual framework it was possible to reach a comprehensive vision that served as the basis for the development of a survey to identify the conditions of HTH to care for people with chronic illness and their families. This instrument, called GCPC-A-THS (in Spanish, contains 37 items distributed in 6 additional dimensions that include aspects of care such as: sociodemographic variables of HTH, caring ability, information and communication technologies (ICTs as a means of support to care, continuity, security and also includes some items related to the level of professional satisfaction. The work done made it possible to achieve a comprehensive view of the characteristics and conditions required by the HTH for care to people with chronic illness and their families.
Chromospheric activity of periodic variable stars (including eclipsing binaries) observed in DR2 LAMOST stellar spectral survey

Science.gov (United States)

Zhang, Liyun; Lu, Hongpeng; Han, Xianming L.; Jiang, Linyan; Li, Zhongmu; Zhang, Yong; Hou, Yonghui; Wang, Yuefei; Cao, Zihuang

2018-05-01

The LAMOST spectral survey provides a rich databases for studying stellar spectroscopic properties and chromospheric activity. We cross-matched a total of 105,287 periodic variable stars from several photometric surveys and databases (CSS, LINEAR, Kepler, a recently updated eclipsing star catalogue, ASAS, NSVS, some part of SuperWASP survey, variable stars from the Tsinghua University-NAOC Transient Survey, and other objects from some new references) with four million stellar spectra published in the LAMOST data release 2 (DR2). We found 15,955 spectra for 11,469 stars (including 5398 eclipsing binaries). We calculated their equivalent widths (EWs) of their Hα, Hβ, Hγ, Hδ and Caii H lines. Using the Hα line EW, we found 447 spectra with emission above continuum for a total of 316 stars (178 eclipsing binaries). We identified 86 active stars (including 44 eclipsing binaries) with repeated LAMOST spectra. A total of 68 stars (including 34 eclipsing binaries) show chromospheric activity variability. We also found LAMOST spectra of 12 cataclysmic variables, five of which show chromospheric activity variability. We also made photometric follow-up studies of three short period targets (DY CVn, HAT-192-0001481, and LAMOST J164933.24+141255.0) using the Xinglong 60-cm telescope and the SARA 90-cm and 1-m telescopes, and obtained new BVRI CCD light curves. We analyzed these light curves and obtained orbital and starspot parameters. We detected the first flare event with a huge brightness increase of more than about 1.5 magnitudes in R filter in LAMOST J164933.24+141255.0.
Survey nonresponse among ethnic minorities in a national health survey--a mixed-method study of participation, barriers, and potentials.

Science.gov (United States)

Ahlmark, Nanna; Algren, Maria Holst; Holmberg, Teresa; Norredam, Marie Louise; Nielsen, Signe Smith; Blom, Astrid Benedikte; Bo, Anne; Juel, Knud

2015-01-01

The participation rate in the Danish National Health Survey (DNHS) 2010 was significantly lower among ethnic minorities than ethnic Danes. The purpose was to characterize nonresponse among ethnic minorities in DNHS, analyze variations in item nonresponse, and investigate barriers and incentives to participation. This was a mixed-method study. Logistic regression was used to analyze nonresponse using data from DNHS (N = 177,639 and chi-square tests in item nonresponse analyses. We explored barriers and incentives regarding participation through focus groups and cognitive interviews. Informants included immigrants and their descendants of both sexes, with and without higher education. The highest nonresponse rate was for non-Western descendants (80.0%) and immigrants 25 (72.3%) with basic education. Immigrants and descendants had higher odds ratios (OR = 3.07 and OR = 3.35, respectively) for nonresponse than ethnic Danes when adjusted for sex, age, marital status, and education. Non-Western immigrants had higher item nonresponse in several question categories. Barriers to non-participation related to the content, language, format, and layout of both the questionnaire and the cover letter. The sender and setting in which to receive the questionnaire also influenced answering incentives. We observed differences in barriers and incentives between immigrants and descendants. Nonresponse appears related to linguistic and/or educational limitations, to alienation generated by the questions' focus on disease and cultural assumptions, or mistrust regarding anonymity. Ethnic minorities seem particularly affected by such barriers. To increase survey participation, questions could be sensitized to reflect multicultural traditions, and the impact of sender and setting considered.
Public priorities for osteoporosis and fracture research: results from a general population survey.

Science.gov (United States)

Paskins, Zoe; Jinks, Clare; Mahmood, Waheed; Jayakumar, Prakash; Sangan, Caroline B; Belcher, John; Gwilym, Stephen

2017-12-01

This is the first national study of public and patient research priorities in osteoporosis and fracture. We have identified new research areas of importance to members of the public, particularly 'access to information from health professionals'. The findings are being incorporated into the research strategy of the National Osteoporosis Society. This study aimed to prioritise, with patients and public members, research topics for the osteoporosis research agenda. An e-survey to identify topics for research was co-designed with patient representatives. A link to the e-survey was disseminated to supporters of the UK National Osteoporosis Society (NOS) in a monthly e-newsletter. Responders were asked to indicate their top priority for research across four topics (understanding and preventing osteoporosis, living with osteoporosis, treating osteoporosis and treating fractures) and their top three items within each topic. Descriptive statistics were used to describe demographics and item ranking. A latent class analysis was applied to identify a substantive number of clusters with different combinations of binary responses. One thousand one hundred eighty-eight (7.4%) respondents completed the e-survey. The top three items overall were 'Having easy access to advice and information from health professionals' (63.8%), 'Understanding further the safety and benefit of osteoporosis drug treatments' (49.9%) and 'Identifying the condition early by screening' (49.2%). Latent class analysis revealed distinct clusters of responses within each topic including primary care management and self-management. Those without a history of prior fracture or aged under 70 were more likely to rate items within the cluster of self-management as important (21.0 vs 12.9 and 19.8 vs 13.3%, respectively). This is the first study of public research priorities in osteoporosis and has identified new research areas of importance to members of the public including access to information. The findings
Drug-use pattern of Chinese herbal medicines in insomnia: a 4-year survey in Taiwan.

Science.gov (United States)

Chen, L-C; Chen, I-C; Wang, B-R; Shao, C-H

2009-10-01

Insomnia is a common complaint in the general population. Interest in the use of alternative treatments for insomnia is increasing exponentially and is fairly common in Taiwan. We undertook a survey to define the drug utilization patterns of Chinese herbal medicines (CM) for insomnia in Taiwan. The survey was conducted over a period of 4 years, from January 2003 to December 2006. Outpatients with primary insomnia and being treated with CM were studied. Core drug-use indicators were the number of CM items per prescription, the dosing frequency and duration of CM prescriptions, the most common prescribed CM herbs and CM formulae used. Six thousand eight hundred and sixty patients, using 37,046 CM herb items, were screened during the study period. The average CM items per prescription was 5.40. Most of prescriptions (95.23%) were prescribed for administration three times a day. The most often prescribed Chinese herbal products were Hong-Hwa (Carthamus tinctorius) and Jia-Wey-Shiau-Yau-San, which includes Angelica sinensis, Atractylodes macrocephala, Paeonia lactiflora, Bupleurum chinense, and Poria coco. This is the first extensive survey examining the drug utilization patterns of Chinese herbal medicines in the treatment of insomnia. Although the data were generated in Taiwan, the herbs and practices identified are likely to be widely generalizable wherever Chinese herbal remedies are used for insomnia. Multiple herbs and complex formulae were commonly used. The baseline data generated should be of use in informing subsequent studies, including those aimed at a thorough evaluation of the herbs' effectiveness.
Generalizability theory and item response theory

NARCIS (Netherlands)

Glas, Cornelis A.W.; Eggen, T.J.H.M.; Veldkamp, B.P.

2012-01-01

Item response theory is usually applied to items with a selected-response format, such as multiple choice items, whereas generalizability theory is usually applied to constructed-response tasks assessed by raters. However, in many situations, raters may use rating scales consisting of items with a
Sharing the cost of redundant items

DEFF Research Database (Denmark)

Hougaard, Jens Leth; Moulin, Hervé

2014-01-01

We ask how to share the cost of finitely many public goods (items) among users with different needs: some smaller subsets of items are enough to serve the needs of each user, yet the cost of all items must be covered, even if this entails inefficiently paying for redundant items. Typical examples...... are network connectivity problems when an existing (possibly inefficient) network must be maintained. We axiomatize a family cost ratios based on simple liability indices, one for each agent and for each item, measuring the relative worth of this item across agents, and generating cost allocation rules...... additive in costs....
Individuals with knee impairments identify items in need of clarification in the Patient Reported Outcomes Measurement Information System (PROMIS®) pain interference and physical function item banks - a qualitative study.

Science.gov (United States)

Lynch, Andrew D; Dodds, Nathan E; Yu, Lan; Pilkonis, Paul A; Irrgang, James J

2016-05-11

The content and wording of the Patient Reported Outcome Measurement Information System (PROMIS) Physical Function and Pain Interference item banks have not been qualitatively assessed by individuals with knee joint impairments. The purpose of this investigation was to identify items in the PROMIS Physical Function and Pain Interference Item Banks that are irrelevant, unclear, or otherwise difficult to respond to for individuals with impairment of the knee and to suggest modifications based on cognitive interviews. Twenty-nine individuals with knee joint impairments qualitatively assessed items in the Pain Interference and Physical Function Item Banks in a mixed-methods cognitive interview. Field notes were analyzed to identify themes and frequency counts were calculated to identify items not relevant to individuals with knee joint impairments. Issues with clarity were identified in 23 items in the Physical Function Item Bank, resulting in the creation of 43 new or modified items, typically changing words within the item to be clearer. Interpretation issues included whether or not the knee joint played a significant role in overall health and age/gender differences in items. One quarter of the original items (31 of 124) in the Physical Function Item Bank were identified as irrelevant to the knee joint. All 41 items in the Pain Interference Item Bank were identified as clear, although individuals without significant pain substituted other symptoms which interfered with their life. The Physical Function Item Bank would benefit from additional items that are relevant to individuals with knee joint impairments and, by extension, to other lower extremity impairments. Several issues in clarity were identified that are likely to be present in other patient cohorts as well.
Dissociating the neural correlates of intra-item and inter-item working-memory binding.

Directory of Open Access Journals (Sweden)

Carinne Piekema

Full Text Available BACKGROUND: Integration of information streams into a unitary representation is an important task of our cognitive system. Within working memory, the medial temporal lobe (MTL has been conceptually linked to the maintenance of bound representations. In a previous fMRI study, we have shown that the MTL is indeed more active during working-memory maintenance of spatial associations as compared to non-spatial associations or single items. There are two explanations for this result, the mere presence of the spatial component activates the MTL, or the MTL is recruited to bind associations between neurally non-overlapping representations. METHODOLOGY/PRINCIPAL FINDINGS: The current fMRI study investigates this issue further by directly comparing intrinsic intra-item binding (object/colour, extrinsic intra-item binding (object/location, and inter-item binding (object/object. The three binding conditions resulted in differential activation of brain regions. Specifically, we show that the MTL is important for establishing extrinsic intra-item associations and inter-item associations, in line with the notion that binding of information processed in different brain regions depends on the MTL. CONCLUSIONS/SIGNIFICANCE: Our findings indicate that different forms of working-memory binding rely on specific neural structures. In addition, these results extend previous reports indicating that the MTL is implicated in working-memory maintenance, challenging the classic distinction between short-term and long-term memory systems.
Generalizability theory and item response theory

OpenAIRE

Glas, Cornelis A.W.; Eggen, T.J.H.M.; Veldkamp, B.P.

2012-01-01

Item response theory is usually applied to items with a selected-response format, such as multiple choice items, whereas generalizability theory is usually applied to constructed-response tasks assessed by raters. However, in many situations, raters may use rating scales consisting of items with a selected-response format. This chapter presents a short overview of how item response theory and generalizability theory were integrated to model such assessments. Further, the precision of the esti...
What should be included in the assessment of laypersons' paediatric basic life support skills? Results from a Delphi consensus study.

Science.gov (United States)

Hasselager, Asbjørn Børch; Lauritsen, Torsten; Kristensen, Tim; Bohnstedt, Cathrine; Sønderskov, Claus; Østergaard, Doris; Tolsgaard, Martin Grønnebæk

2018-01-18

Assessment of laypersons' Paediatric Basic Life Support (PBLS) skills is important to ensure acquisition of effective PBLS competencies. However limited evidence exists on which PBLS skills are essential for laypersons. The same challenges exist with respect to the assessment of foreign body airway obstruction management (FBAOM) skills. We aimed to establish international consensus on how to assess laypersons' PBLS and FBAOM skills. A Delphi consensus survey was conducted. Out of a total of 84 invited experts, 28 agreed to participate. During the first Delphi round experts suggested items to assess laypersons' PBLS and FBAOM skills. In the second round, the suggested items received comments from and were rated by 26 experts (93%) on a 5-point scale (1 = not relevant to 5 = essential). Revised items were anonymously presented in a third round for comments and 23 (82%) experts completed a re-rating. Items with a score above 3 by more than 80% of the experts in the third round were included in an assessment instrument. In the first round, 19 and 15 items were identified to assess PBLS and FBAOM skills, respectively. The ratings and comments from the last two rounds resulted in nine and eight essential assessment items for PBLS and FBAOM skills, respectively. The PBLS items included: "Responsiveness"," Call for help", "Open airway"," Check breathing", "Rescue breaths", "Compressions", "Ventilations", "Time factor" and "Use of AED". The FBAOM items included: "Identify different stages of foreign body airway obstruction", "Identify consciousness", "Call for help", "Back blows", "Chest thrusts/abdominal thrusts according to age", "Identify loss of consciousness and change to CPR", "Assessment of breathing" and "Ventilation". For assessment of laypersons some PBLS and FBAOM skills described in guidelines are more important than others. Four out of nine of PBLS skills focus on airway and breathing skills, supporting the major importance of these skills for
Secondary Psychometric Examination of the Dimensional Obsessive-Compulsive Scale: Classical Testing, Item Response Theory, and Differential Item Functioning.

Science.gov (United States)

Thibodeau, Michel A; Leonard, Rachel C; Abramowitz, Jonathan S; Riemann, Bradley C

2015-12-01

The Dimensional Obsessive-Compulsive Scale (DOCS) is a promising measure of obsessive-compulsive disorder (OCD) symptoms but has received minimal psychometric attention. We evaluated the utility and reliability of DOCS scores. The study included 832 students and 300 patients with OCD. Confirmatory factor analysis supported the originally proposed four-factor structure. DOCS total and subscale scores exhibited good to excellent internal consistency in both samples (α = .82 to α = .96). Patient DOCS total scores reduced substantially during treatment (t = 16.01, d = 1.02). DOCS total scores discriminated between students and patients (sensitivity = 0.76, 1 - specificity = 0.23). The measure did not exhibit gender-based differential item functioning as tested by Mantel-Haenszel chi-square tests. Expected response options for each item were plotted as a function of item response theory and demonstrated that DOCS scores incrementally discriminate OCD symptoms ranging from low to extremely high severity. Incremental differences in DOCS scores appear to represent unbiased and reliable differences in true OCD symptom severity. © The Author(s) 2014.
The physical examination content of the Japanese National Health and Nutrition Survey: temporal changes.

Science.gov (United States)

Tanaka, Hisako; Imai, Shino; Nakade, Makiko; Imai, Eri; Takimoto, Hidemi

2016-12-01

Survey items of the Japan National Nutrition Survey (J-NNS) have changed over time. Several papers on dietary surveys have been published; however, to date, there are no in-depth papers regarding physical examinations. Therefore, we investigated changes in the survey items in the physical examinations performed in the J-NNS and the National Health and Nutrition Survey (NHNS), with the aim of incorporating useful data for future policy decisions. We summarized the description of physical examinations and marshalled the changes of survey items from the J-NNS and NHNS from 1946 to 2012. The physical examination is roughly classified into the following six components: some are relevant to anthropometric measurements, clinical measurements, physical symptoms, blood tests, lifestyle and medication by interview, and others. Items related to nutritional deficiency, such as anaemia and tendon reflex disappearance, and body weight measurements were collected during the early period, according to the instructions of the General Headquarters. From 1989, blood tests and measurement of physical activity were added, and serum total protein, total cholesterol, triglycerides, HDL-cholesterol, blood glucose, red blood corpuscles and haemoglobin measurements have been performed continuously for more than 20 years. This is the first report on the items of physical examination in the J-NNS and NHNS. Our research results provide basic information for the utilization of the J-NNS and NHNS, to researchers, clinicians or policy makers. Monitoring the current state correctly is essential for national health promotion, and also for improvement of the investigation methods to apply country-by-country comparisons.

The 12-item World Health Organization Disability Assessment Schedule II (WHO-DAS II: a nonparametric item response analysis

Directory of Open Access Journals (Sweden)

Fernandez Ana

2010-05-01

Full Text Available Abstract Background Previous studies have analyzed the psychometric properties of the World Health Organization Disability Assessment Schedule II (WHO-DAS II using classical omnibus measures of scale quality. These analyses are sample dependent and do not model item responses as a function of the underlying trait level. The main objective of this study was to examine the effectiveness of the WHO-DAS II items and their options in discriminating between changes in the underlying disability level by means of item response analyses. We also explored differential item functioning (DIF in men and women. Methods The participants were 3615 adult general practice patients from 17 regions of Spain, with a first diagnosed major depressive episode. The 12-item WHO-DAS II was administered by the general practitioners during the consultation. We used a non-parametric item response method (Kernel-Smoothing implemented with the TestGraf software to examine the effectiveness of each item (item characteristic curves and their options (option characteristic curves in discriminating between changes in the underliying disability level. We examined composite DIF to know whether women had a higher probability than men of endorsing each item. Results Item response analyses indicated that the twelve items forming the WHO-DAS II perform very well. All items were determined to provide good discrimination across varying standardized levels of the trait. The items also had option characteristic curves that showed good discrimination, given that each increasing option became more likely than the previous as a function of increasing trait level. No gender-related DIF was found on any of the items. Conclusions All WHO-DAS II items were very good at assessing overall disability. Our results supported the appropriateness of the weights assigned to response option categories and showed an absence of gender differences in item functioning.
Item Response Theory with Covariates (IRT-C): Assessing Item Recovery and Differential Item Functioning for the Three-Parameter Logistic Model

Science.gov (United States)

Tay, Louis; Huang, Qiming; Vermunt, Jeroen K.

2016-01-01

In large-scale testing, the use of multigroup approaches is limited for assessing differential item functioning (DIF) across multiple variables as DIF is examined for each variable separately. In contrast, the item response theory with covariate (IRT-C) procedure can be used to examine DIF across multiple variables (covariates) simultaneously. To…
Safety climate in Swiss hospital units: Swiss version of the Safety Climate Survey

Science.gov (United States)

Gehring, Katrin; Mascherek, Anna C.; Bezzola, Paula

2015-01-01

Abstract Rationale, aims and objectives Safety climate measurements are a broadly used element of improvement initiatives. In order to provide a sound and easy‐to‐administer instrument for the use in Swiss hospitals, we translated the Safety Climate Survey into German and French. Methods After translating the Safety Climate Survey into French and German, a cross‐sectional survey study was conducted with health care professionals (HCPs) in operating room (OR) teams and on OR‐related wards in 10 Swiss hospitals. Validity of the instrument was examined by means of Cronbach's alpha and missing rates of the single items. Item‐descriptive statistics group differences and percentage of ‘problematic responses’ (PPR) were calculated. Results 3153 HCPs completed the survey (response rate: 63.4%). 1308 individuals were excluded from the analyses because of a profession other than doctor or nurse or invalid answers (n = 1845; nurses = 1321, doctors = 523). Internal consistency of the translated Safety Climate Survey was good (Cronbach's alpha G erman = 0.86; Cronbach's alpha F rench = 0.84). Missing rates at item level were rather low (0.23–4.3%). We found significant group differences in safety climate values regarding profession, managerial function, work area and time spent in direct patient care. At item level, 14 out of 21 items showed a PPR higher than 10%. Conclusions Results indicate that the French and German translations of the Safety Climate Survey might be a useful measurement instrument for safety climate in Swiss hospital units. Analyses at item level allow for differentiating facets of safety climate into more positive and critical safety climate aspects. PMID:25656302
The randomly renewed general item and the randomly inspected item with exponential life distribution

International Nuclear Information System (INIS)

Schneeweiss, W.G.

1979-01-01

For a randomly renewed item the probability distributions of the time to failure and of the duration of down time and the expectations of these random variables are determined. Moreover, it is shown that the same theory applies to randomly checked items with exponential probability distribution of life such as electronic items. The case of periodic renewals is treated as an example. (orig.) [de
Improving Measurement Efficiency of the Inner EAR Scale with Item Response Theory.

Science.gov (United States)

Jessen, Annika; Ho, Andrew D; Corrales, C Eduardo; Yueh, Bevan; Shin, Jennifer J

2018-02-01

Objectives (1) To assess the 11-item Inner Effectiveness of Auditory Rehabilitation (Inner EAR) instrument with item response theory (IRT). (2) To determine whether the underlying latent ability could also be accurately represented by a subset of the items for use in high-volume clinical scenarios. (3) To determine whether the Inner EAR instrument correlates with pure tone thresholds and word recognition scores. Design IRT evaluation of prospective cohort data. Setting Tertiary care academic ambulatory otolaryngology clinic. Subjects and Methods Modern psychometric methods, including factor analysis and IRT, were used to assess unidimensionality and item properties. Regression methods were used to assess prediction of word recognition and pure tone audiometry scores. Results The Inner EAR scale is unidimensional, and items varied in their location and information. Information parameter estimates ranged from 1.63 to 4.52, with higher values indicating more useful items. The IRT model provided a basis for identifying 2 sets of items with relatively lower information parameters. Item information functions demonstrated which items added insubstantial value over and above other items and were removed in stages, creating a 8- and 3-item Inner EAR scale for more efficient assessment. The 8-item version accurately reflected the underlying construct. All versions correlated moderately with word recognition scores and pure tone averages. Conclusion The 11-, 8-, and 3-item versions of the Inner EAR scale have strong psychometric properties, and there is correlational validity evidence for the observed scores. Modern psychometric methods can help streamline care delivery by maximizing relevant information per item administered.
Identifying predictors of physics item difficulty: A linear regression approach

Science.gov (United States)

Mesic, Vanes; Muratovic, Hasnija

2011-06-01

Large-scale assessments of student achievement in physics are often approached with an intention to discriminate students based on the attained level of their physics competencies. Therefore, for purposes of test design, it is important that items display an acceptable discriminatory behavior. To that end, it is recommended to avoid extraordinary difficult and very easy items. Knowing the factors that influence physics item difficulty makes it possible to model the item difficulty even before the first pilot study is conducted. Thus, by identifying predictors of physics item difficulty, we can improve the test-design process. Furthermore, we get additional qualitative feedback regarding the basic aspects of student cognitive achievement in physics that are directly responsible for the obtained, quantitative test results. In this study, we conducted a secondary analysis of data that came from two large-scale assessments of student physics achievement at the end of compulsory education in Bosnia and Herzegovina. Foremost, we explored the concept of “physics competence” and performed a content analysis of 123 physics items that were included within the above-mentioned assessments. Thereafter, an item database was created. Items were described by variables which reflect some basic cognitive aspects of physics competence. For each of the assessments, Rasch item difficulties were calculated in separate analyses. In order to make the item difficulties from different assessments comparable, a virtual test equating procedure had to be implemented. Finally, a regression model of physics item difficulty was created. It has been shown that 61.2% of item difficulty variance can be explained by factors which reflect the automaticity, complexity, and modality of the knowledge structure that is relevant for generating the most probable correct solution, as well as by the divergence of required thinking and interference effects between intuitive and formal physics knowledge
Identifying predictors of physics item difficulty: A linear regression approach

Directory of Open Access Journals (Sweden)

Hasnija Muratovic

2011-06-01

Full Text Available Large-scale assessments of student achievement in physics are often approached with an intention to discriminate students based on the attained level of their physics competencies. Therefore, for purposes of test design, it is important that items display an acceptable discriminatory behavior. To that end, it is recommended to avoid extraordinary difficult and very easy items. Knowing the factors that influence physics item difficulty makes it possible to model the item difficulty even before the first pilot study is conducted. Thus, by identifying predictors of physics item difficulty, we can improve the test-design process. Furthermore, we get additional qualitative feedback regarding the basic aspects of student cognitive achievement in physics that are directly responsible for the obtained, quantitative test results. In this study, we conducted a secondary analysis of data that came from two large-scale assessments of student physics achievement at the end of compulsory education in Bosnia and Herzegovina. Foremost, we explored the concept of “physics competence” and performed a content analysis of 123 physics items that were included within the above-mentioned assessments. Thereafter, an item database was created. Items were described by variables which reflect some basic cognitive aspects of physics competence. For each of the assessments, Rasch item difficulties were calculated in separate analyses. In order to make the item difficulties from different assessments comparable, a virtual test equating procedure had to be implemented. Finally, a regression model of physics item difficulty was created. It has been shown that 61.2% of item difficulty variance can be explained by factors which reflect the automaticity, complexity, and modality of the knowledge structure that is relevant for generating the most probable correct solution, as well as by the divergence of required thinking and interference effects between intuitive and formal
The impact of item order on ratings of cancer risk perception.

Science.gov (United States)

Taylor, Kathryn L; Shelby, Rebecca A; Schwartz, Marc D; Ackerman, Josh; LaSalle, V Holland; Gelmann, Edward P; McGuire, Colleen

2002-07-01

Although perceived risk is central to most theories of health behavior, there is little consensus on its measurement with regard to item wording, response set, or the number of items to include. In a methodological assessment of perceived risk, we assessed the impact of changing the order of three commonly used perceived risk items: quantitative personal risk, quantitative population risk, and comparative risk. Participants were 432 men and women enrolled in an ancillary study of the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial. Three groups of consecutively enrolled participants responded to the three items in one of three question orders. Results indicated that item order was related to the perceived risk ratings of both ovarian (P Perceptions of risk were significantly lower when the comparative rating was made first. The findings suggest that compelling participants to consider their own risk relative to the risk of others results in lower ratings of perceived risk. Although the use of multiple items may provide more information than when only a single method is used, different conclusions may be reached depending on the context in which an item is assessed.
Development of the pediatric quality of life inventory neurofibromatosis type 1 module items for children, adolescents and young adults: qualitative methods.

Science.gov (United States)

Nutakki, Kavitha; Varni, James W; Steinbrenner, Sheila; Draucker, Claire B; Swigonski, Nancy L

2017-03-01

Health-related quality of life (HRQOL) is arguably one of the most important measures in evaluating effectiveness of clinical treatments. At present, there is no disease-specific outcome measure to assess the HRQOL of children, adolescents and young adults with Neurofibromatosis Type 1 (NF1). This study aimed to develop the items and support the content validity for the Pediatric Quality of Life Inventory™ (PedsQL™) NF1 Module for children, adolescents and young adults. The iterative process included multiphase qualitative methods including a literature review, survey of expert opinions, semi-structured interviews, cognitive interviews and pilot testing. Fifteen domains were derived from the qualitative methods, with content saturation achieved, resulting in 115 items. The domains include skin, pain, pain impact, pain management, cognitive functioning, speech, fine motor, balance, vision, perceived physical appearance, communication, worry, treatment, medicines and gastrointestinal symptoms. This study is limited because all participants are recruited from a single-site. Qualitative methods support the content validity for the PedsQL™ NF1 Module for children, adolescents and young adults. The PedsQL™ NF1 Module is now undergoing national multisite field testing for the psychometric validation of the instrument development.
Designing, Testing, and Validating an Attitudinal Survey on an Environmental Topic: A Groundwater Pollution Survey Instrument for Secondary School Students

Science.gov (United States)

Lacosta-Gabari, Idoya; Fernandez-Manzanal, Rosario; Sanchez-Gonzalez, Dolores

2009-01-01

Research in environmental attitudes' assessment has significantly increased in recent years. The development of specific attitude scales for specific environmental problems has often been proposed. This paper describes the Groundwater Pollution Test (GPT), a 19-item survey instrument using a Likert-type scale. The survey has been used with…
17 CFR 260.7a-16 - Inclusion of items, differentiation between items and answers, omission of instructions.

Science.gov (United States)

2010-04-01

... 17 Commodity and Securities Exchanges 3 2010-04-01 2010-04-01 false Inclusion of items, differentiation between items and answers, omission of instructions. 260.7a-16 Section 260.7a-16 Commodity and... INDENTURE ACT OF 1939 Formal Requirements § 260.7a-16 Inclusion of items, differentiation between items and...
From Ethnography to Items: A Mixed Methods Approach to Developing a Survey to Examine Graduate Engineering Student Retention

Science.gov (United States)

Crede, Erin; Borrego, Maura

2013-01-01

As part of a sequential exploratory mixed methods study, 9 months of ethnographically guided observations and interviews were used to develop a survey examining graduate engineering student retention. Findings from the ethnographic fieldwork yielded several themes, including international diversity, research group organization and climate,…
Quality Control in Survey Design: Evaluating a Survey of Educators’ Attitudes Concerning Differentiated Compensation

OpenAIRE

Kelly D. Bradley; Michael Peabody; Shannon O. Sampson

2015-01-01

This study utilized the Rasch model to assess the quality of a survey instrument designed to measure attitudes of administrators and teachers concerning a differentiated teacher compensation program piloted in Kentucky. Researchers addressing potentially contentious issues should ensure their methods stand up to rigorous criticism. The results indicate that the rating scale does not function as expected, with items being too easy to endorse. Future iterations of this survey should be revis...
17 CFR 229.903 - (Item 903) Summary.

Science.gov (United States)

2010-04-01

... effect on investors, including, but not limited to: (i) Changes in the business plan, voting rights, cash... 17 Commodity and Securities Exchanges 2 2010-04-01 2010-04-01 false (Item 903) Summary. 229.903 Section 229.903 Commodity and Securities Exchanges SECURITIES AND EXCHANGE COMMISSION STANDARD...
Differential item functional analysis on pedagogic and content knowledge (PCK) questionnaire for Indonesian teachers using RASCH model

Science.gov (United States)

Rahmani, B. D.

2018-01-01

The purpose of this paper is to evaluate Indonesian senior high school teacher’s pedagogical content knowledge also their perception toward curriculum changing in West Java Indonesia. The data used in this study were derived from a questionnaire survey conducted among teachers in Bandung, West Java. A total of 61 usable responses were collected. The Differential Item Functioning (DIFF) was used to analyze the data whether the item had a difference or not toward gender, education background also on school location. However, the result showed that there was no any significant difference on gender and school location toward the item response but educational background. As a conclusion, the teacher’s educational background influence on giving the response to the questionnaire. Therefore, it is suggested in the future to construct the items on the questionnaire which is coped the differences of the participant particularly the educational background.
Reliability and validity of the International Spinal Cord Injury Basic Pain Data Set items as self-report measures

DEFF Research Database (Denmark)

Jensen, M P; Widerström-Noga, E; Richards, J S

2010-01-01

To evaluate the psychometric properties of a subset of International Spinal Cord Injury Basic Pain Data Set (ISCIBPDS) items that could be used as self-report measures in surveys, longitudinal studies and clinical trials....
The Effects of Test Length and Sample Size on Item Parameters in Item Response Theory

Science.gov (United States)

Sahin, Alper; Anil, Duygu

2017-01-01

This study investigates the effects of sample size and test length on item-parameter estimation in test development utilizing three unidimensional dichotomous models of item response theory (IRT). For this purpose, a real language test comprised of 50 items was administered to 6,288 students. Data from this test was used to obtain data sets of…
Approximation Preserving Reductions among Item Pricing Problems

Science.gov (United States)

Hamane, Ryoso; Itoh, Toshiya; Tomita, Kouhei

When a store sells items to customers, the store wishes to determine the prices of the items to maximize its profit. Intuitively, if the store sells the items with low (resp. high) prices, the customers buy more (resp. less) items, which provides less profit to the store. So it would be hard for the store to decide the prices of items. Assume that the store has a set V of n items and there is a set E of m customers who wish to buy those items, and also assume that each item i ∈ V has the production cost di and each customer ej ∈ E has the valuation vj on the bundle ej ⊆ V of items. When the store sells an item i ∈ V at the price ri, the profit for the item i is pi = ri - di. The goal of the store is to decide the price of each item to maximize its total profit. We refer to this maximization problem as the item pricing problem. In most of the previous works, the item pricing problem was considered under the assumption that pi ≥ 0 for each i ∈ V, however, Balcan, et al. [In Proc. of WINE, LNCS 4858, 2007] introduced the notion of “loss-leader, ” and showed that the seller can get more total profit in the case that pi < 0 is allowed than in the case that pi < 0 is not allowed. In this paper, we derive approximation preserving reductions among several item pricing problems and show that all of them have algorithms with good approximation ratio.
Which Statistic Should Be Used to Detect Item Preknowledge When the Set of Compromised Items Is Known?

Science.gov (United States)

Sinharay, Sandip

2017-09-01

Benefiting from item preknowledge is a major type of fraudulent behavior during educational assessments. Belov suggested the posterior shift statistic for detection of item preknowledge and showed its performance to be better on average than that of seven other statistics for detection of item preknowledge for a known set of compromised items. Sinharay suggested a statistic based on the likelihood ratio test for detection of item preknowledge; the advantage of the statistic is that its null distribution is known. Results from simulated and real data and adaptive and nonadaptive tests are used to demonstrate that the Type I error rate and power of the statistic based on the likelihood ratio test are very similar to those of the posterior shift statistic. Thus, the statistic based on the likelihood ratio test appears promising in detecting item preknowledge when the set of compromised items is known.
A confirmative clinimetric analysis of the 36-item Family Assessment Device.

Science.gov (United States)

Timmerby, Nina; Cosci, Fiammetta; Watson, Maggie; Csillag, Claudio; Schmitt, Florence; Steck, Barbara; Bech, Per; Thastum, Mikael

2018-02-07

The Family Assessment Device (FAD) is a 60-item questionnaire widely used to evaluate self-reported family functioning. However, the factor structure as well as the number of items has been questioned. A shorter and more user-friendly version of the original FAD-scale, the 36-item FAD, has therefore previously been proposed, based on findings in a nonclinical population of adults. We aimed in this study to evaluate the brief 36-item version of the FAD in a clinical population. Data from a European multinational study, examining factors associated with levels of family functioning in adult cancer patients' families, were used. Both healthy and ill parents completed the 60-item version FAD. The psychometric analyses conducted were Principal Component Analysis and Mokken-analysis. A total of 564 participants were included. Based on the psychometric analysis we confirmed that the 36-item version of the FAD has robust psychometric properties and can be used in clinical populations. The present analysis confirmed that the 36-item version of the FAD (18 items assessing 'well-being' and 18 items assessing 'dysfunctional' family function) is a brief scale where the summed total score is a valid measure of the dimensions of family functioning. This shorter version of the FAD is, in accordance with the concept of 'measurement-based care', an easy to use scale that could be considered when the aim is to evaluate self-reported family functioning.

Validation of a mobility item bank for older patients in primary care.

Science.gov (United States)

Cabrero-García, Julio; Ramos-Pichardo, Juan Diego; Muñoz-Mendoza, Carmen Luz; Cabañero-Martínez, María José; González-Llopis, Lorena; Reig-Ferrer, Abilio

2012-12-05

To develop and validate an item bank to measure mobility in older people in primary care and to analyse differential item functioning (DIF) and differential bundle functioning (DBF) by sex. A pool of 48 mobility items was administered by interview to 593 older people attending primary health care practices. The pool contained four domains based on the International Classification of Functioning: changing and maintaining body position, carrying, lifting and pushing, walking and going up and down stairs. The Late Life Mobility item bank consisted of 35 items, and measured with a reliability of 0.90 or more across the full spectrum of mobility, except at the higher end of better functioning. No evidence was found of non-uniform DIF but uniform DIF was observed, mainly for items in the changing and maintaining body position and carrying, lifting and pushing domains. The walking domain did not display DBF, but the other three domains did, principally the carrying, lifting and pushing items. During the design and validation of an item bank to measure mobility in older people, we found that strength (carrying, lifting and pushing) items formed a secondary dimension that produced DBF. More research is needed to determine how best to include strength items in a mobility measure, or whether it would be more appropriate to design separate measures for each construct.
Research on the re-establishment of the classification criteria of strategic items

Energy Technology Data Exchange (ETDEWEB)

Han, Seong Mi; Yang, Seunghyo; Shin, Dong Hoon [Korea Institute of Nuclear Nonproliferation and Control, Daejeon (Korea, Republic of)

2014-05-15

According to these export control laws and regulations, the exporters have to apply the review for classification and export licensing to their own government. In this process, a technical review institute such as Korea Institute of Nuclear Nonproliferation and Control (institute under the NSSC) are referring to Minister's Regulation for the Export and Import of Strategic Goods. In this regulation, there are many criteria to classify the strategic items to be exported. But there are some problems in these criteria. At Typical problem is that classification criteria of Trigger List Items generally is very qualitative and very obscure in contrast with Dual Use Items. So, in most cases, this characteristics of classification criteria of trigger list items have caused much trouble for stakeholders such as government and nuclear related companies. So, there were needs that the classification criteria had to be more correct, obvious and objective. To solve these problems, the past classification cases for technology were re-analyzed and the general criteria were deducted in this study. Previously mentioned, the classification process and criteria were very qualitative and very obscure for the Trigger List Items. So, the re-establishment of the classification criteria was done to solve these problems in this study. Each extracted results were shown in Tables I and II. This re-established criteria are expected to contribute to quantification, disambiguation and objectification of the classification review process. As the future works, we will establish the probability or numerical factor for the extracted criteria through statistical surveys, to make better use of these criteria. And we will push ahead with the NSSC approval to use as the classification guidelines of the trigger list items in review processes.
Universal Authenticated Item Monitoring System (AIMS) second generation equipment

International Nuclear Information System (INIS)

Schoeneman, J.L.; Baumann, M.J.; Fox, L.J.; Jenkins, C.D.; Perlinsk, A.W.

1992-01-01

Sandia National Laboratories (SNL) is in the final stages of developing a Universal Authenticated Item Monitoring System (AIMS). When completed, AIMS will provide applicable agencies in the US government, and those in the International arena, with a secure and convenient method of monitoring the physical status of selected items. The benefit derived from this development activity will be the commercial availability of an item monitoring system with the capability for ''quick set-up'' monitoring, as well as long-term unattended monitoring. The AIMS includes a variety of sensors, a robust and authenticated radio frequency (RF) communication link, a Receiver Processing Unit (RPU), and an inspector-friendly personal computer (PC) interface for collecting, sorting, viewing and archiving pertinent event histories. The system will provide the capability to monitor selected items in a real-time mode, a remotely interrogated mode, and a stand-alone, unattended data collection mode. The sensor suite under development includes advanced motion sensors, interior volumetric intrusion sensors, Re-usable, In-situ Verifiable Authenticated (RIVA) fiber-optic seal sensors, generic utility sensors (to accommodate contact closure inputs), and radiation and environmental sensors. A new generation authentication algorithm recently has been developed that provides a high degree of system security 121. The AIMS has potential safeguards applications in the areas of arms control and treaty verification military asset control, International Atomic Energy Agency (IAEA) and Euratom safeguards verification activities, as well as domestic nuclear safeguard activities. Commercial applications could include high-value inventory control and security systems. This paper describes the second-generation AIMS along with its recently expanded sensor suite and enhanced data collection capabilities
Item Modeling Concept Based on Multimedia Authoring

Directory of Open Access Journals (Sweden)

Janez Stergar

2008-09-01

Full Text Available In this paper a modern item design framework for computer based assessment based on Flash authoring environment will be introduced. Question design will be discussed as well as the multimedia authoring environment used for item modeling emphasized. Item type templates are a structured means of collecting and storing item information that can be used to improve the efficiency and security of the innovative item design process. Templates can modernize the item design, enhance and speed up the development process. Along with content creation, multimedia has vast potential for use in innovative testing. The introduced item design template is based on taxonomy of innovative items which have great potential for expanding the content areas and construct coverage of an assessment. The presented item design approach is based on GUI's – one for question design based on implemented item design templates and one for user interaction tracking/retrieval. The concept of user interfaces based on Flash technology will be discussed as well as implementation of the innovative approach of the item design forms with multimedia authoring. Also an innovative method for user interaction storage/retrieval based on PHP extending Flash capabilities in the proposed framework will be introduced.
Using Cognitive Interviews to Pilot an International Survey of Principal Preparation: A Western Australian Perspective

Science.gov (United States)

Wildy, Helen; Clarke, Simon

2009-01-01

This paper provides an example of the application of the cognitive interview, a qualitative tool for pre-testing a survey instrument to check its cognitive validity, that is, whether the items mean to respondents what they mean to the item designers. The instrument is the survey used in the final phase of the International Study of Principal…
Differential item functioning of the patient-reported outcomes information system (PROMIS®) pain interference item bank by language (Spanish versus English).

Science.gov (United States)

Paz, Sylvia H; Spritzer, Karen L; Reise, Steven P; Hays, Ron D

2017-06-01

About 70% of Latinos, 5 years old or older, in the United States speak Spanish at home. Measurement equivalence of the PROMIS ® pain interference (PI) item bank by language of administration (English versus Spanish) has not been evaluated. A sample of 527 adult Spanish-speaking Latinos completed the Spanish version of the 41-item PROMIS ® pain interference item bank. We evaluate dimensionality, monotonicity and local independence of the Spanish-language items. Then we evaluate differential item functioning (DIF) using ordinal logistic regression with item response theory scores estimated from DIF-free "anchor" items. One of the 41 items in the Spanish version of the PROMIS ® PI item bank was identified as having significant uniform DIF. English- and Spanish-speaking subjects with the same level of pain interference responded differently to 1 of the 41 items in the PROMIS ® PI item bank. This item was not retained due to proprietary issues. The original English language item parameters can be used when estimating PROMIS ® PI scores.
Investigation of the Performance of Multidimensional Equating Procedures for Common-Item Nonequivalent Groups Design

Directory of Open Access Journals (Sweden)

Burcu ATAR

2017-12-01

Full Text Available In this study, the performance of the multidimensional extentions of Stocking-Lord, mean/mean, and mean/sigma equating procedures under common-item nonequivalent groups design was investigated. The performance of those three equating procedures was examined under the combination of various conditions including sample size, ability distribution, correlation between two dimensions, and percentage of anchor items in the test. Item parameter recovery was evaluated calculating RMSE (root man squared error and BIAS values. It was found that Stocking-Lord procedure provided the smaller RMSE and BIAS values for both item discrimination and item difficulty parameter estimates across most conditions.
Calorie Changes in Large Chain Restaurants: Declines in New Menu Items but Room for Improvement.

Science.gov (United States)

Bleich, Sara N; Wolfson, Julia A; Jarlenski, Marian P

2016-01-01

Large chain restaurants reduced the number of calories in newly introduced menu items in 2013 by about 60 calories (or 12%) relative to 2012. This paper describes trends in calories available in large U.S. chain restaurants to understand whether previously documented patterns persist. Data (a census of items for included restaurants) were obtained from the MenuStat project. This analysis included 66 of the 100 largest U.S. restaurants that are available in all three of the data years (2012-2014; N=23,066 items). Generalized linear models were used to examine: (1) per-item calorie changes from 2012 to 2014 among items on the menu in all years; and (2) mean calories in new items in 2013 and 2014 compared with items on the menu in 2012 only. Data were analyzed in 2014. Overall, calories in newly introduced menu items declined by 71 (or 15%) from 2012 to 2013 (p=0.001) and by 69 (or 14%) from 2012 to 2014 (p=0.03). These declines were concentrated mainly in new main course items (85 fewer calories in 2013 and 55 fewer calories in 2014; p=0.01). Although average calories in newly introduced menu items are declining, they are higher than items common to the menu in all 3 years. No differences in mean calories among items on menus in 2012, 2013, or 2014 were found. The previously observed declines in newly introduced menu items among large restaurant chains have been maintained, which suggests the beginning of a trend toward reducing calories. Copyright © 2016 American Journal of Preventive Medicine. Published by Elsevier Inc. All rights reserved.
U.S. Naval Unit Behavioral Health Needs Assessment Survey, Overview of Survey Items and Measures

Science.gov (United States)

2014-05-20

all Soldiers. The BHNAS and MHAT surveys have yielded valuable information regarding the effects of combat and deployment on service members...and Barriers to Care • Amount of Sleep and Sleep Deficit • Sleep Difficulties • Military Specialty • Positive Effects of Assignment • Contribution...nonopioid prescription painkillers was added; (3) the definition of “constantly and frequent” was omitted in the question; and (4) the NUBHNAS
A strategy for optimizing item-pool management

NARCIS (Netherlands)

Ariel, A.; van der Linden, Willem J.; Veldkamp, Bernard P.

2006-01-01

Item-pool management requires a balancing act between the input of new items into the pool and the output of tests assembled from it. A strategy for optimizing item-pool management is presented that is based on the idea of a periodic update of an optimal blueprint for the item pool to tune item
Overview of classical test theory and item response theory for the quantitative assessment of items in developing patient-reported outcomes measures.

Science.gov (United States)

Cappelleri, Joseph C; Jason Lundy, J; Hays, Ron D

2014-05-01

The US Food and Drug Administration's guidance for industry document on patient-reported outcomes (PRO) defines content validity as "the extent to which the instrument measures the concept of interest" (FDA, 2009, p. 12). According to Strauss and Smith (2009), construct validity "is now generally viewed as a unifying form of validity for psychological measurements, subsuming both content and criterion validity" (p. 7). Hence, both qualitative and quantitative information are essential in evaluating the validity of measures. We review classical test theory and item response theory (IRT) approaches to evaluating PRO measures, including frequency of responses to each category of the items in a multi-item scale, the distribution of scale scores, floor and ceiling effects, the relationship between item response options and the total score, and the extent to which hypothesized "difficulty" (severity) order of items is represented by observed responses. If a researcher has few qualitative data and wants to get preliminary information about the content validity of the instrument, then descriptive assessments using classical test theory should be the first step. As the sample size grows during subsequent stages of instrument development, confidence in the numerical estimates from Rasch and other IRT models (as well as those of classical test theory) would also grow. Classical test theory and IRT can be useful in providing a quantitative assessment of items and scales during the content-validity phase of PRO-measure development. Depending on the particular type of measure and the specific circumstances, the classical test theory and/or the IRT should be considered to help maximize the content validity of PRO measures. Copyright © 2014 Elsevier HS Journals, Inc. All rights reserved.
Item-Level Psychometrics of the Glasgow Outcome Scale: Extended Structured Interviews.

Science.gov (United States)

Hong, Ickpyo; Li, Chih-Ying; Velozo, Craig A

2016-04-01

The Glasgow Outcome Scale-Extended (GOSE) structured interview captures critical components of activities and participation, including home, shopping, work, leisure, and family/friend relationships. Eighty-nine community dwelling adults with mild-moderate traumatic brain injury (TBI) were recruited (average = 2.7 year post injury). Nine items of the 19 items were used for the psychometrics analysis purpose. Factor analysis and item-level psychometrics were investigated using the Rasch partial-credit model. Although the principal components analysis of residuals suggests that a single measurement factor dominates the measure, the instrument did not meet the factor analysis criteria. Five items met the rating scale criteria. Eight items fit the Rasch model. The instrument demonstrated low person reliability (0.63), low person strata (2.07), and a slight ceiling effect. The GOSE demonstrated limitations in precisely measuring activities/participation for individuals after TBI. Future studies should examine the impact of the low precision of the GOSE on effect size. © The Author(s) 2016.
Development and pilot of an international survey: 'Radiation Therapists and Psychosocial Support'.

Science.gov (United States)

Elsner, Kelly L; Naehrig, Diana; Halkett, Georgia K B; Dhillon, Haryana M

2018-06-07

Up to one third of radiation therapy patients are reported to have unmet psychosocial needs. Radiation therapists (RTs) have daily contact with patients and can provide daily psychosocial support to reduce patient anxiety, fear and loneliness. However, RTs vary in their values, skills, training, knowledge and involvement in providing psychosocial support. The aims of this study were to: (1) develop an online survey instrument to explore RT values, skills, training and knowledge regarding patient anxiety and psychosocial support, and (2) pilot the instrument with RT professionals to assess content validity, functionality and length. An online cross-sectional survey, titled 'Radiation therapists and psychosocial support' was developed. Items included patient vignettes, embedded items from RT research, and the Professional Quality of Life Scale (ProQOL5). Four radiation oncology departments volunteered to pilot the survey; each nominated four RT staff to participate. Survey data were analysed descriptively and qualitative feedback grouped and coded to determine whether the survey needed to be refined. Thirteen of sixteen RTs completed the pilot survey and feedback form. Median time to completion was 35 mins, with 54% of respondents stating this was too long. Respondents reported content, questions and response options were relevant and appropriate. Feedback was used to: refine the survey instrument, minimise responder burden and drop out and improve functionality and quality of data collection. This pilot of the 'Radiation therapists and psychosocial support' survey instrument demonstrated content validity and usability. The main survey will be circulated to a representative sample of RTs for completion. © 2018 The Authors. Journal of Medical Radiation Sciences published by John Wiley & Sons Australia, Ltd on behalf of Australian Society of Medical Imaging and Radiation Therapy and New Zealand Institute of Medical Radiation Technology.
Development and validation of a ten-item questionnaire with explanatory illustrations to assess upper extremity disorders: favorable effect of illustrations in the item reduction process.

Science.gov (United States)

Kurimoto, Shigeru; Suzuki, Mikako; Yamamoto, Michiro; Okui, Nobuyuki; Imaeda, Toshihiko; Hirata, Hitoshi

2011-11-01

The purpose of this study is to develop a short and valid measure for upper extremity disorders and to assess the effect of attached illustrations in item reduction of a self-administered disability questionnaire while retaining psychometric properties. A validated questionnaire used to assess upper extremity disorders, the Hand20, was reduced to ten items using two item-reduction techniques. The psychometric properties of the abbreviated form, the Hand10, were evaluated on an independent sample that was used for the shortening process. Validity, reliability, and responsiveness of the Hand10 were retained in the item reduction process. It was possible that the use of explanatory illustrations attached to the Hand10 helped with its reproducibility. The illustrations for the Hand10 promoted text comprehension and motivation to answer the items. These changes resulted in high acceptability; more than 99.3% of patients, including 98.5% of elderly patients, could complete the Hand10 properly. The illustrations had favorable effects on the item reduction process and made it possible to retain precision of the instrument. The Hand10 is a reliable and valid instrument for individual-level applications with the advantage of being compact and broadly applicable, even in elderly individuals.
No evidence for an item limit in change detection.

Directory of Open Access Journals (Sweden)

Shaiyan Keshvari

Full Text Available Change detection is a classic paradigm that has been used for decades to argue that working memory can hold no more than a fixed number of items ("item-limit models". Recent findings force us to consider the alternative view that working memory is limited by the precision in stimulus encoding, with mean precision decreasing with increasing set size ("continuous-resource models". Most previous studies that used the change detection paradigm have ignored effects of limited encoding precision by using highly discriminable stimuli and only large changes. We conducted two change detection experiments (orientation and color in which change magnitudes were drawn from a wide range, including small changes. In a rigorous comparison of five models, we found no evidence of an item limit. Instead, human change detection performance was best explained by a continuous-resource model in which encoding precision is variable across items and trials even at a given set size. This model accounts for comparison errors in a principled, probabilistic manner. Our findings sharply challenge the theoretical basis for most neural studies of working memory capacity.
Reliability and validity of the Spanish version of the 10-item Connor-Davidson Resilience Scale (10-item CD-RISC in young adults

Directory of Open Access Journals (Sweden)

García-Campayo Javier

2011-08-01

Full Text Available Abstract Background The 10-item Connor-Davidson Resilience Scale (10-item CD-RISC is an instrument for measuring resilience that has shown good psychometric properties in its original version in English. The aim of this study was to evaluate the validity and reliability of the Spanish version of the 10-item CD-RISC in young adults and to verify whether it is structured in a single dimension as in the original English version. Findings Cross-sectional observational study including 681 university students ranging in age from 18 to 30 years. The number of latent factors in the 10 items of the scale was analyzed by exploratory factor analysis. Confirmatory factor analysis was used to verify whether a single factor underlies the 10 items of the scale as in the original version in English. The convergent validity was analyzed by testing whether the mean of the scores of the mental component of SF-12 (MCS and the quality of sleep as measured with the Pittsburgh Sleep Index (PSQI were higher in subjects with better levels of resilience. The internal consistency of the 10-item CD-RISC was estimated using the Cronbach α test and test-retest reliability was estimated with the intraclass correlation coefficient. The Cronbach α coefficient was 0.85 and the test-retest intraclass correlation coefficient was 0.71. The mean MCS score and the level of quality of sleep in both men and women were significantly worse in subjects with lower resilience scores. Conclusions The Spanish version of the 10-item CD-RISC showed good psychometric properties in young adults and thus can be used as a reliable and valid instrument for measuring resilience. Our study confirmed that a single factor underlies the resilience construct, as was the case of the original scale in English.
Why we eat what we eat. The Eating Motivation Survey (TEMS).

Science.gov (United States)

Renner, Britta; Sproesser, Gudrun; Strohbach, Stefanie; Schupp, Harald T

2012-08-01

Understanding why people select certain food items in everyday life is crucial for the creation of interventions to promote normal eating and to prevent the development of obesity and eating disorders. The Eating Motivation Survey (TEMS) was developed within a frame of three different studies. In Study 1, a total of 331 motives for eating behavior were generated on the basis of different data sources (previous research, nutritionist interviews, and expert discussions). In Study 2, 1250 respondents were provided with a set of motives from Study 1 and the Eating Motivation Survey was finalized. In Study 3, a sample of 1040 participants filled in the Eating Motivation Survey. Confirmatory factor analysis with fifteen factors for food choice yielded a satisfactory model fit for a full (78 items) and brief survey version (45 items) with RMSEA .048 and .037, 90% CI .047-.049 and .035-.039, respectively. Factor structure was generally invariant across random selected groups, gender, and BMI, which indicates a high stability for the Eating Motivation Survey. On the mean level, however, significant differences in motivation for food choice associated with gender, age, and BMI emerged. Implications of the fifteen distinct motivations to choose foods in everyday life are discussed. Copyright © 2012 Elsevier Ltd. All rights reserved.
Summary, the 16th quality control survey for radioisotope in vitro tests in Japan, 1994

Energy Technology Data Exchange (ETDEWEB)

NONE

1995-11-01

The results of the 16th quality control survey for radioisotope in vitro tests in Japan (1994) are summarized. Of 399 medical facilities conducting radioisotope in vitro tests, 201 were enrolled in this study. Forty items including ACTH and {alpha}-fetoprotein were selected as the subjects. Freeze-drying samples were sent to the facilities. The quality of assay tubes, duration between fusion of the samples and assay, and the condition of preservation were examined, and those influence on the assay values were studied. Radioimmunoassay, immunoradiometric assay, and other procedures using enzymes, fluorescence, and chemiluminescense were conducted. The assay values of some of the items were significantly influenced by repeated freezing and fusion of the samples. Data were collected from individual items and kits used, and analyzed. The significant difference of values between different facilities and kits used were considered due to difference of assay principle, antibodies used, and standard items. The concentration of the samples needs to be improved. (S.Y.).
Development and Validation of the 34-Item Disability Screening Questionnaire (DSQ-34 for Use in Low and Middle Income Countries Epidemiological and Development Surveys.

Directory of Open Access Journals (Sweden)

Jean-François Trani

Full Text Available Although 80% of persons with disabilities live in low and middle-income countries, there is still a lack of comprehensive, cross-culturally validated tools to identify persons facing activity limitations and functioning difficulties in these settings. In absence of such a tool, disability estimates vary considerably according to the methodology used, and policies are based on unreliable estimates.The Disability Screening Questionnaire composed of 27 items (DSQ-27 was initially designed by a group of international experts in survey development and disability in Afghanistan for a national survey. Items were selected based on major domains of activity limitations and functioning difficulties linked to an impairment as defined by the International Classification of Functioning, Disability and Health. Face, content and construct validity, as well as sensitivity and specificity were examined. Based on the results obtained, the tool was subsequently refined and expanded to 34 items, tested and validated in Darfur, Sudan. Internal consistency for the total DSQ-34 using a raw and standardized Cronbach's Alpha and within each domain using a standardized Cronbach's Alpha was examined in the Asian context (India and Nepal. Exploratory factor analysis (EFA using principal axis factoring (PAF evaluated the lowest number of factors to account for the common variance among the questions in the screen. Test-retest reliability was determined by calculating intraclass correlation (ICC and inter-rater reliability by calculating the kappa statistic; results were checked using Bland-Altman plots. The DSQ-34 was further tested for standard error of measurement (SEM and for the minimum detectable change (MDC. Good internal consistency was indicated by Cronbach's Alpha of 0.83/0.82 for India and 0.76/0.78 for Nepal. We confirmed our assumption for EFA using the Kaiser-Meyer-Olkin measure of sampling well above the accepted cutoff of 0.40 for India (0.82 and Nepal (0
Development and Validation of the 34-Item Disability Screening Questionnaire (DSQ-34) for Use in Low and Middle Income Countries Epidemiological and Development Surveys.

Science.gov (United States)

Trani, Jean-François; Babulal, Ganesh Muneshwar; Bakhshi, Parul

2015-01-01

Although 80% of persons with disabilities live in low and middle-income countries, there is still a lack of comprehensive, cross-culturally validated tools to identify persons facing activity limitations and functioning difficulties in these settings. In absence of such a tool, disability estimates vary considerably according to the methodology used, and policies are based on unreliable estimates. The Disability Screening Questionnaire composed of 27 items (DSQ-27) was initially designed by a group of international experts in survey development and disability in Afghanistan for a national survey. Items were selected based on major domains of activity limitations and functioning difficulties linked to an impairment as defined by the International Classification of Functioning, Disability and Health. Face, content and construct validity, as well as sensitivity and specificity were examined. Based on the results obtained, the tool was subsequently refined and expanded to 34 items, tested and validated in Darfur, Sudan. Internal consistency for the total DSQ-34 using a raw and standardized Cronbach's Alpha and within each domain using a standardized Cronbach's Alpha was examined in the Asian context (India and Nepal). Exploratory factor analysis (EFA) using principal axis factoring (PAF) evaluated the lowest number of factors to account for the common variance among the questions in the screen. Test-retest reliability was determined by calculating intraclass correlation (ICC) and inter-rater reliability by calculating the kappa statistic; results were checked using Bland-Altman plots. The DSQ-34 was further tested for standard error of measurement (SEM) and for the minimum detectable change (MDC). Good internal consistency was indicated by Cronbach's Alpha of 0.83/0.82 for India and 0.76/0.78 for Nepal. We confirmed our assumption for EFA using the Kaiser-Meyer-Olkin measure of sampling well above the accepted cutoff of 0.40 for India (0.82) and Nepal (0.82). The

Item response theory - A first approach

Science.gov (United States)

Nunes, Sandra; Oliveira, Teresa; Oliveira, Amílcar

2017-07-01

The Item Response Theory (IRT) has become one of the most popular scoring frameworks for measurement data, frequently used in computerized adaptive testing, cognitively diagnostic assessment and test equating. According to Andrade et al. (2000), IRT can be defined as a set of mathematical models (Item Response Models - IRM) constructed to represent the probability of an individual giving the right answer to an item of a particular test. The number of Item Responsible Models available to measurement analysis has increased considerably in the last fifteen years due to increasing computer power and due to a demand for accuracy and more meaningful inferences grounded in complex data. The developments in modeling with Item Response Theory were related with developments in estimation theory, most remarkably Bayesian estimation with Markov chain Monte Carlo algorithms (Patz & Junker, 1999). The popularity of Item Response Theory has also implied numerous overviews in books and journals, and many connections between IRT and other statistical estimation procedures, such as factor analysis and structural equation modeling, have been made repeatedly (Van der Lindem & Hambleton, 1997). As stated before the Item Response Theory covers a variety of measurement models, ranging from basic one-dimensional models for dichotomously and polytomously scored items and their multidimensional analogues to models that incorporate information about cognitive sub-processes which influence the overall item response process. The aim of this work is to introduce the main concepts associated with one-dimensional models of Item Response Theory, to specify the logistic models with one, two and three parameters, to discuss some properties of these models and to present the main estimation procedures.
Psychometric aspects of item mapping for criterion-referenced interpretation and bookmark standard setting.

Science.gov (United States)

Huynh, Huynh

2010-01-01

Locating an item on an achievement continuum (item mapping) is well-established in technical work for educational/psychological assessment. Applications of item mapping may be found in criterion-referenced (CR) testing (or scale anchoring, Beaton and Allen, 1992; Huynh, 1994, 1998a, 2000a, 2000b, 2006), computer-assisted testing, test form assembly, and in standard setting methods based on ordered test booklets. These methods include the bookmark standard setting originally used for the CTB/TerraNova tests (Lewis, Mitzel, Green, and Patz, 1999), the item descriptor process (Ferrara, Perie, and Johnson, 2002) and a similar process described by Wang (2003) for multiple-choice licensure and certification examinations. While item response theory (IRT) models such as the Rasch and two-parameter logistic (2PL) models traditionally place a binary item at its location, Huynh has argued in the cited papers that such mapping may not be appropriate in selecting items for CR interpretation and scale anchoring.
A Descriptive, Cross-sectional Survey of Turkish Nurses' Knowledge of Pressure Ulcer Risk, Prevention, and Staging.

Science.gov (United States)

Gul, Asiye; Andsoy, Isil Isik; Ozkaya, Birgul; Zeydan, Ayten

2017-06-01

Nurses' knowledge of pressure ulcer (PU) prevention and management is an important first step in the provision of optimal care. To evaluate PU prevention/risk, staging, and wound description knowledge, a descriptive, cross-sectional survey was conducted among nurses working in an acute care Turkish hospital. The survey instrument was a modified and translated version of the Pieper Pressure Ulcer Knowledge Test (PUKT), and its validity and reliability were established. Nurses completed a Personal Characteristics Form, including sociodemographic information and exposure to educational presentations and information about and experience with PUs, followed by the 49-item modified PUKT which includes 33 prevention/risk items, 9 staging items, and 7 wound description items. All items are true/false questions with an I don't know option (scoring: minimum 0, maximum 49). Correct answers received 1 point and incorrect/unknown answers received 0 points. The paper-pencil questionnaires were distributed by 2 researchers to all nurses in the participating hospital and completed by those willing to be included. Responses were analyzed using descriptive statistics. Pearson's correlation test was used to examine the relationship between quantitative variables, and mean scores were compared using the Mann-Whitney U and Kruskal-Wallis tests. Among the 308 participating nurses (mean age 29.5 ± 8.1 [range 19-56] years) most were women (257, 83.4%) with 7.3 ± 7.8 (range 1-36) years of experience. The mean knowledge score for the entire sample was 29.7 ± 6.7 (range 8-42). The overall percentage of correct answers was 60.6% to 61.8% for PU prevention/risk assessment, 60% for wound description, and 56.6% for PU staging. Knowledge scores were significantly (P pressure on the heels" (22, 7.1%). The results of this study suggest education and experience caring for patients who are at risk for or have a PU affect nurses' knowledge. This study, and additional research examining nurse
[A self administered survey to assess bullying in schools].

Science.gov (United States)

Lecannelier, Felipe; Varela, Jorge; Rodríguez, Jorge; Hoffmann, Marianela; Flores, Fernanda; Ascanio, Lorena

2011-04-01

Bullying is common in schools and has negative consequences. It can be assessed using a self-reported instrument. To validate a Spanish self-reporting tool called "Survey of High School Bullying Abuse of Power" (MIAP). The instrument has 13 questions, of which 7 are multiple choice, rendering a total of 49 items. It was applied to 2.341 children of seventh and eighth grade attending private, subsidized and municipal schools in the city of Concepción, Chile. Expert judge analysis and estimated reliability using the Cronbach Alpha were used to validate the survey. The instrument obtained a Cronbach Alpha coefficient of 0.8892, classified as good. This analysis generated four scales that explained 30.9% of the variance. They were called "Witness Bullying" with 18 items, accounting for 11.4% of the variance, "Bullying Victim" with 12 items, accounting for 7.5% of the variance, "Bullying Perpetrator and Severe bullying Victim", with 10 items explaining 6.4% of the variance and "Aggressor Bullying" with 6 items accounting for 5.7% of the variance. The MIAP can recognize four basic factors that facilitate the analysis and understanding of bullying, with good levels of reliability and validity. The remaining questions also deliver valuable information.
48 CFR 46.709 - Warranties of commercial items.

Science.gov (United States)

2010-10-01

... 48 Federal Acquisition Regulations System 1 2010-10-01 2010-10-01 false Warranties of commercial... CONTRACT MANAGEMENT QUALITY ASSURANCE Warranties 46.709 Warranties of commercial items. The contracting officer should take advantage of commercial warranties, including extended warranties, where appropriate...
Telephone versus face-to-face interviewing for household drug use surveys.

Science.gov (United States)

Aquilino, W S

1992-01-01

This research investigated the use of telephone versus face-to-face interviewing to gather data on the use of tobacco, alcohol, and illicit drugs. Telephone and personal drug use surveys of the 18-34 year-old household population were conducted in the state of New Jersey in 1986-1987. Survey modes were compared in terms of unit and item nonresponse rates, sample coverage, and levels of self-reported drug use. Results showed that the telephone survey achieved response rates lower than the personal survey, but comparable to telephone surveys of less threatening topics. Item nonresponse to sensitive drug questions was lower by phone than with the self-administered answer sheets in the personal mode. The exclusion of households without telephones in the telephone survey is a potential source of bias, and may lead to underestimation of alcohol and drug use for minority populations. After controlling for telephone status, the telephone survey furnished significantly lower drug use estimates on several indicators than the personal survey, with the largest mode differences found for Blacks.
Current educational status of paediatric rheumatology in Europe: the results of PReS survey.

Science.gov (United States)

Demirkaya, E; Ozen, S; Türker, T; Kuis, W; Saurenmann, R K

2009-01-01

To understand the status of education and problems in paediatric rheumatology practice in Europe, through a survey. A 26-item questionnaire was conducted during the 14th Congress of the Paediatric Rheumatology European Society in Istanbul, 2007. Physicians who were practicing or studying within the field of paediatric rheumatology for at least one year were included in the survey. One hundred and twenty eight physicians, 79 paediatric rheumatologists (including 5 paediatric immunologists and 10 paediatric nephrologists), 34 paediatric rheumatology fellows and 15 adult rheumatologists completed the survey. The physicians were from: Europe 95 (81.9%), South America 12 (10.4%), Middle East 5 (4.3%), Asia 2 (1.7%), Africa 2 (1.7%). The duration of training for paediatric rheumatology ranged between 1-5 years (mean: 3.12+/-1.11). Sixty physicians scored their education as unsatisfactory and among those, 48 physicians were from Europe. Physicians reported good skills in the following items; intraarticular injections (83.3%); soft tissue injections (47.6%); evaluation of radiographs (67.5%); whereas competence in the evaluation of computed tomography/magnetic resonance imaging (30.5%); and musculoskeletal sonography (16.7%) was much lower. A need for improved basic science and rotations among relevant fields were specifically expressed. Being a relatively new speciality in the realm of paediatrics, paediatric rheumatology education at the European level needs to be further discussed, revised and uniformed.
Cleaning and disinfection of patient care items, in relation to small animals.

Science.gov (United States)

Weese, J Scott

2015-03-01

Patient care involves several medical and surgical items, including those that come into contact with sterile or other high-risk body sites and items that have been used on other patients. These situations create a risk for infection if items are contaminated, and the implications can range from single infections to large outbreaks. To minimize the risk, proper equipment cleaning, disinfection/sterilization, storage, and monitoring practices are required. Risks posed by different items; the required level of cleaning, disinfection, or sterilization; the methods that are available and appropriate; and how to ensure efficacy, must be considered when designing and implementing an infection control program. Copyright © 2015 Elsevier Inc. All rights reserved.
47 CFR 65.820 - Included items.

Science.gov (United States)

2010-10-01

...) Cash working capital. The average amount of investor-supplied capital needed to provide funds for a carrier's day-to-day interstate operations. Class A carriers may calculate a cash working capital... study or using the formula in paragraph (e) of this section, may calculate the cash working capital...
Exploring differential item functioning (DIF) with the Rasch model: a comparison of gender differences on eighth grade science items in the United States and Spain.

Science.gov (United States)

Babiar, Tasha Calvert

2011-01-01

Traditionally, women and minorities have not been fully represented in science and engineering. Numerous studies have attributed these differences to gaps in science achievement as measured by various standardized tests. Rather than describe mean group differences in science achievement across multiple cultures, this study focused on an in-depth item-level analysis across two countries: Spain and the United States. This study investigated eighth-grade gender differences on science items across the two countries. A secondary purpose of the study was to explore the nature of gender differences using the many-faceted Rasch Model as a way to estimate gender DIF. A secondary analysis of data from the Third International Mathematics and Science Study (TIMSS) was used to address three questions: 1) Does gender DIF in science achievement exist? 2) Is there a relationship between gender DIF and characteristics of the science items? 3) Do the relationships between item characteristics and gender DIF in science items replicate across countries. Participants included 7,087 eight grade students from the United States and 3,855 students from Spain who participated in TIMSS. The Facets program (Linacre and Wright, 1992) was used to estimate gender DIF. The results of the analysis indicate that the content of the item seemed to be related to gender DIF. The analysis also suggests that there is a relationship between gender DIF and item format. No pattern of gender DIF related to cognitive demand was found. The general pattern of gender DIF was similar across the two countries used in the analysis. The strength of item-level analysis as opposed to group mean difference analysis is that gender differences can be detected at the item level, even when no mean differences can be detected at the group level.
Plutonium Finishing Plant (PFP) Criticality Alarm System Commercial Grade Item (CGI) Critical Characteristics

International Nuclear Information System (INIS)

WHITE, W.F.

1999-01-01

This document specifies the critical characteristics for Commercial Grade Items (CGI) procured for PFP's criticality alarm system as required by HNF-PRO-268 and HNF-PRO-1819. These are the minimum specifications that the equipment must meet in order to properly perform its safety function. There may be several manufacturers or models that meet the critical characteristics for any one item. PFP's Criticality Alarm System includes the nine criticality alarm system panels and their associated hardware. This includes all parts up to the first breaker in the electrical distribution system. Specific system boundaries and justifications are contained in HNF-SD-CP-SDD-003, ''Definition and Means of Maintaining the Criticality Detectors and Alarms Portion of the PFP Safety Envelope.'' The procurement requirements associated with the system necessitates procurement of some system equipment as Commercial Grade Items in accordance with HNF-PRO-268, ''Control of Purchased Items and Services.''
Investigating Separate and Concurrent Approaches for Item Parameter Drift in 3PL Item Response Theory Equating

Science.gov (United States)

Arce-Ferrer, Alvaro J.; Bulut, Okan

2017-01-01

This study examines separate and concurrent approaches to combine the detection of item parameter drift (IPD) and the estimation of scale transformation coefficients in the context of the common item nonequivalent groups design with the three-parameter item response theory equating. The study uses real and synthetic data sets to compare the two…
Evaluation of Patient Satisfaction Surveys in Pediatric Orthopaedics.

Science.gov (United States)

Segal, Lee S; Plantikow, Carla; Hall, Randon; Wilson, Kristina; Shrader, M Wade

2015-01-01

Patient satisfaction survey scores are increasingly being tied to incentive compensation, impact how we practice medicine, influence decisions on where patients seek care, and in the future may be required for accreditation. The goal of this study is to compare the results of an internal distribution of patient satisfaction surveys at the point of care to responses received by mail in a hospital-based, high-volume pediatric orthopaedic practice. A pediatric outpatient survey is used at our institution to evaluate patient satisfaction. Surveys are randomly mailed out to families seen in our clinic by the survey vendor, and the results are determined on a quarterly basis. We distributed the same survey in a similar manner in our clinic. The results of the surveys, external/mailed (EXM) versus internal/point of care (INP) over the same 3-month time period (second quarter 2013) were compared. The survey questions are dichotomized from an ordinal scale into either excellent (9 to 10) or not excellent (0 to 8) commonly used in patient satisfaction methodology. We evaluated the raw data from the INP surveys for the question on provider rating by evaluating the mean score, the standard excellent response (9 to 10), and an expanded excellent response (8 to 10). Response rate was 72/469 (15.4%) for EXM, and 231/333 (69.4%) for INP. An excellent response for the "rating your provider" question was 72.2% (EXM) versus 84.8% (INP) (P=0.015). Our analysis of the raw data (INP) has a mean rating of 9.42. The expanded scale (8 to 10) for an excellent response increased the provider rating to 94.4% (P=0.001). Waiting time response within 15 minutes was the only item that correlated with rating of provider (P=0.02). For the majority of the items, the INP responses were consistently higher than the EXM responses, including 6/7 responses that were statistically significant (Ppatient satisfaction surveys will be important in determining health care outcomes. Properly designed and
Household food security in Isfahan based on current population survey adapted questionnaire

Directory of Open Access Journals (Sweden)

Morteza Rafiei

2013-01-01

Full Text Available Background: Food security is a state in which all people at every time have physical and economic access to adequate food to obviate nutritional needs and live a healthy and active life. Therefore, this study was performed to quantitatively evaluate the household food security in Esfahan using the localized version of US Household Food Security Survey Module (US HFSSM. Methods: This descriptive cross-sectional study was performed in year 2006 on 3000 households of Esfahan. The study instrument used in this work is 18-item US food security module, which is developed into a localized 15-item questionnaire. This study is performed in two stages of families with no children (under 18 years old and families with children over 18 years old. Results: The results showed that item severity coefficient, ratio of responses given by households and item infit and outfit coefficient in adult′s and children′s questionnaire respectively. According to obtained data, scale score of +3 in adults group is described as determination limit of slight food insecurity and +6 is stated as the limit for severe food insecurity. For children′s group, scale score of +2 is defined to be the limit of slight food insecurity and +5 is the determination limit of severe food insecurity. Conclusions: The main hypothesis of this survey analysis is based on the raw scale score of USFSSM The item of "lack of enough money for buying food" (item 2 and the item of "lack of balanced meal" (3 rd item have the lowest severity coefficient. Then, the ascending rate of item severity continues in first item, 4 th item and keeps increasing into 10 th item.
Identification of metallic items that caused nickel dermatitis in Danish patients.

Science.gov (United States)

Thyssen, Jacob P; Menné, Torkil; Johansen, Jeanne D

2010-09-01

Nickel allergy is prevalent as assessed by epidemiological studies. In an attempt to further identify and characterize sources that may result in nickel allergy and dermatitis, we analysed items identified by nickel-allergic dermatitis patients as causative of nickel dermatitis by using the dimethylglyoxime (DMG) test. Dermatitis patients with nickel allergy of current relevance were identified over a 2-year period in a tertiary referral patch test centre. When possible, their work tools and personal items were examined with the DMG test. Among 95 nickel-allergic dermatitis patients, 70 (73.7%) had metallic items investigated for nickel release. A total of 151 items were investigated, and 66 (43.7%) gave positive DMG test reactions. Objects were nearly all purchased or acquired after the introduction of the EU Nickel Directive. Only one object had been inherited, and only two objects had been purchased outside of Denmark. DMG testing is valuable as a screening test for nickel release and should be used to identify relevant exposures in nickel-allergic patients. Mainly consumer items, but also work tools used in an occupational setting, released nickel in dermatitis patients. This study confirmed 'risk items' from previous studies, including mobile phones.
INVESTIGATION OF MIS ITEM 011589A AND 3013 CONTAINERS HAVING SIMILAR CHARACTERISTICS

Energy Technology Data Exchange (ETDEWEB)

Friday, G

2006-08-23

Recent testing has identified the presence of hydrogen and oxygen in MIS Item 011589A. This isolated observation has effectuated concern regarding the potential for flammable gas mixtures in containers in the storage inventory. This study examines the known physicochemical characteristics of MIS Item 011589A and queries the ISP Database for items that are most similar or potentially similar. Items identified as most similar are believed to have the highest probability of being chemically and structurally identical to MIS Item 011589A. Items identified as potentially like MIS Item 011589A have some attributes in common, have the potential to generate gases, but have a lower probability of having similar gas generating characteristics. MIS Item 011589A is an oxide that was generated prior to 1990 at Rocky Flats in Building 707. It was associated with foundry processing and had an actinide assay of approximately 77%. Prompt gamma analysis of MIS Item 011589A indicated the presence of chloride, fluorine, magnesium, sodium, and aluminum. Queries based on MIS representation classification and process of origin were applied to the ISP Database. Evaluation criteria included binning classification (i.e., innocuous, pressure, or pressure and corrosion), availability of prompt gamma analyses, presence of chlorine and magnesium, percentage of chlorine by weight, peak ratios (i.e., Na:Cl and Mg:Na), moisture, and percent assay. These queries identified 15 items that were most similar and 106 items that were potentially like MIS Item 011589A. Although these queries identified containers that could potentially generate flammable gases, verification and confirmation can only be accomplished by destructive evaluation and testing of containers from the storage inventory.
Development and Preliminary Testing of the Food Choice Priorities Survey (FCPS): Assessing the Importance of Multiple Factors on College Students' Food Choices.

Science.gov (United States)

Vilaro, Melissa J; Zhou, Wenjun; Colby, Sarah E; Byrd-Bredbenner, Carol; Riggsbee, Kristin; Olfert, Melissa D; Barnett, Tracey E; Mathews, Anne E

2017-12-01

Understanding factors that influence food choice may help improve diet quality. Factors that commonly affect adults' food choices have been described, but measures that identify and assess food choice factors specific to college students are lacking. This study developed and tested the Food Choice Priorities Survey (FCPS) among college students. Thirty-seven undergraduates participated in two focus groups ( n = 19; 11 in the male-only group, 8 in the female-only group) and interviews ( n = 18) regarding typical influences on food choice. Qualitative data informed the development of survey items with a 5-point Likert-type scale (1 = not important, 5 = extremely important). An expert panel rated FCPS items for clarity, relevance, representativeness, and coverage using a content validity form. To establish test-retest reliability, 109 first-year college students completed the 14-item FCPS at two time points, 0-48 days apart ( M = 13.99, SD = 7.44). Using Cohen's weighted κ for responses within 20 days, 11 items demonstrated moderate agreement and 3 items had substantial agreement. Factor analysis revealed a three-factor structure (9 items). The FCPS is designed for college students and provides a way to determine the factors of greatest importance regarding food choices among this population. From a public health perspective, practical applications include using the FCPS to tailor health communications and behavior change interventions to factors most salient for food choices of college students.
Nickel and cobalt release from jewellery and metal clothing items in Korea.

Science.gov (United States)

Cheong, Seung Hyun; Choi, You Won; Choi, Hae Young; Byun, Ji Yeon

2014-01-01

In Korea, the prevalence of nickel allergy has shown a sharply increasing trend. Cobalt contact allergy is often associated with concomitant reactions to nickel, and is more common in Korea than in western countries. The aim of the present study was to investigate the prevalence of items that release nickel and cobalt on the Korean market. A total of 471 items that included 193 branded jewellery, 202 non-branded jewellery and 76 metal clothing items were sampled and studied with a dimethylglyoxime (DMG) test and a cobalt spot test to detect nickel and cobalt release, respectively. Nickel release was detected in 47.8% of the tested items. The positive rates in the DMG test were 12.4% for the branded jewellery, 70.8% for the non-branded jewellery, and 76.3% for the metal clothing items. Cobalt release was found in 6.2% of items. Among the types of jewellery, belts and hair pins showed higher positive rates in both the DMG test and the cobalt spot test. Our study shows that the prevalence of items that release nickel or cobalt among jewellery and metal clothing items is high in Korea. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Behavioral Health Needs Assessment Survey (BHNAS): Overview of Survey Items and Measures

Science.gov (United States)

2013-02-12

medication use • Personal and unit morale • Unit cohesion • Attitudes toward leadership • Positive effects of deployment • Navy support during deployment...to select any of the following: • Over-the-counter drugs (including Aspirin, Tylenol, Motrin, Ibuprofen, Aleve) • Prescription painkillers that...are not opioids (including Celebrex, Vioxx, Bextra, topical lidocaine) • Prescription opioid/narcotic painkiller (including OxyContin, Percocet
Survey nonresponse among ethnic minorities in a national health survey - a mixed-method study of participation, barriers, and potentials

DEFF Research Database (Denmark)

Ahlmark, Nanna; Algren, Maria Holst; Holmberg, Teresa

2015-01-01

, to alienation generated by the questions' focus on disease and cultural assumptions, or mistrust regarding anonymity. Ethnic minorities seem particularly affected by such barriers. To increase survey participation, questions could be sensitized to reflect multicultural traditions, and the impact of sender......Objectives. The participation rate in the Danish National Health Survey (DNHS) 2010 was significantly lower among ethnic minorities than ethnic Danes. The purpose was to characterize nonresponse among ethnic minorities in DNHS, analyze variations in item nonresponse, and investigate barriers...... and incentives to participation. Design. This was a mixed-method study. Logistic regression was used to analyze nonresponse using data from DNHS (N = 177,639 and chi-square tests in item nonresponse analyses. We explored barriers and incentives regarding participation through focus groups and cognitive...

76 FR 60474 - Commercial Item Handbook

Science.gov (United States)

2011-09-29

... DEPARTMENT OF DEFENSE Defense Acquisition Regulations System Commercial Item Handbook AGENCY.... SUMMARY: DoD has updated its Commercial Item Handbook. The purpose of the Handbook is to help acquisition personnel develop sound business strategies for procuring commercial items. DoD is seeking industry input on...
Using the STROBE statement to assess reporting in blindness prevalence surveys in low and middle income countries.

Science.gov (United States)

Ramke, Jacqueline; Palagyi, Anna; Jordan, Vanessa; Petkovic, Jennifer; Gilbert, Clare E

2017-01-01

Cross-sectional blindness prevalence surveys are essential to plan and monitor eye care services. Incomplete or inaccurate reporting can prevent effective translation of research findings. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement is a 32 item checklist developed to improve reporting of observational studies. The aim of this study was to assess the completeness of reporting in blindness prevalence surveys in low and middle income countries (LMICs) using STROBE. MEDLINE, EMBASE and Web of Science databases were searched on April 8 2016 to identify cross-sectional blindness prevalence surveys undertaken in LMICs and published after STROBE was published in December 2007. The STROBE tool was applied to all included studies, and each STROBE item was categorized as 'yes' (met criteria), 'no' (did not meet criteria) or 'not applicable'. The 'Completeness of reporting (COR) score' for each manuscript was calculated: COR score = yes / [yes + no]. In journals with included studies the instructions to authors and reviewers were checked for reference to STROBE. The 89 included studies were undertaken in 32 countries and published in 37 journals. The mean COR score was 60.9% (95% confidence interval [CI] 58.1-63.7%; range 30.8-88.9%). The mean COR score did not differ between surveys published in journals with author instructions referring to STROBE (10/37 journals; 61.1%, 95%CI 56.4-65.8%) or in journals where STROBE was not mentioned (60.9%, 95%CI 57.4-64.3%; p = 0.93). While reporting in blindness prevalence surveys is strong in some areas, others need improvement. We recommend that more journals adopt the STROBE checklist and ensure it is used by authors and reviewers.
A Survey of Equipment in the Singing Voice Studio and Its Perceived Effectiveness by Vocologists and Student Singers.

Science.gov (United States)

Gerhard, Julia; Rosow, David E

2016-05-01

Speech-language pathologists have long used technology for the clinical measurement of the speaking voice, but present research shows that vocal pedagogues and voice students are becoming more accepting of technology in the studio. As a result, the equipment and technology used in singing voice studios by speech-language pathologists and vocal pedagogues are changing. Although guides exist regarding equipment and technology necessary for developing a voice laboratory and private voice studio, there are no data documenting the current implementation of these items and their perceived effectiveness. This study seeks to document current trends in equipment used in voice laboratories and studios. Two separate surveys were distributed to 60 vocologists and approximately 300 student singers representative of the general singing student population. The surveys contained questions about the inventory of items found in voice studios and perceived effectiveness of these items. Data were analyzed using descriptive analyses and statistical analyses when applicable. Twenty-six of 60 potential vocologists responded, and 66 student singers responded. The vocologists reported highly uniform inventories and ratings of studio items. There were wide-ranging differences between the inventories reported by the vocologist and student singer groups. Statistically significant differences between ratings of effectiveness of studio items were found for 11 of the 17 items. In all significant cases, vocologists rated usefulness to be higher than student singers. Although the order of rankings of vocologists and student singers was similar, a much higher percentage of vocologists report the items as being efficient and effective than students. The historically typical studio items, including the keyboard and mirror, were ranked as most effective by both vocologists and student singers. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Spare Items validation

International Nuclear Information System (INIS)

Fernandez Carratala, L.

1998-01-01

There is an increasing difficulty for purchasing safety related spare items, with certifications by manufacturers for maintaining the original qualifications of the equipment of destination. The main reasons are, on the top of the logical evolution of technology, applied to the new manufactured components, the quitting of nuclear specific production lines and the evolution of manufacturers quality systems, originally based on nuclear codes and standards, to conventional industry standards. To face this problem, for many years different Dedication processes have been implemented to verify whether a commercial grade element is acceptable to be used in safety related applications. In the same way, due to our particular position regarding the spare part supplies, mainly from markets others than the american, C.N. Trillo has developed a methodology called Spare Items Validation. This methodology, which is originally based on dedication processes, is not a single process but a group of coordinated processes involving engineering, quality and management activities. These are to be performed on the spare item itself, its design control, its fabrication and its supply for allowing its use in destinations with specific requirements. The scope of application is not only focussed on safety related items, but also to complex design, high cost or plant reliability related components. The implementation in C.N. Trillo has been mainly curried out by merging, modifying and making the most of processes and activities which were already being performed in the company. (Author)
Development and implementation of a local government survey to measure community supports for healthy eating and active living

Directory of Open Access Journals (Sweden)

Latetia V Moore

2017-06-01

Full Text Available The ability to make healthy choices is influenced by where one lives, works, shops, and plays. Locally enacted policies and standards can influence these surroundings but little is known about the prevalence of such policies and standards that support healthier behaviors. In this paper, we describe the development of a survey questionnaire designed to capture local level policy supports for healthy eating and active living and findings and lessons learned from a 2012 pilot in two states, Minnesota and California, including respondent burden, survey sampling and administration methods, and survey item feasibility issues. A 38-item, web-based, self-administered survey and sampling frame were developed to assess the prevalence of 22 types of healthy eating and active living policies in a representative sample of local governments in the two states. The majority of respondents indicated the survey required minimal effort to complete with half taking <20 min to complete the survey. A non-response follow-up plan including emails and phone calls was required to achieve a 68% response rate (versus a 37% response rate for email only reminders. Local governments with larger residential populations reported having healthy eating and active living policies and standards more often than smaller governments. Policies that support active living were more common than those that support healthy eating and varied within the two states. The methods we developed are a feasible data collection tool for estimating the prevalence of municipal healthy eating and active living policies and standards at the state and national level.
The utility of single-item readiness screeners in middle school.

Science.gov (United States)

Lewis, Crystal G; Herman, Keith C; Huang, Francis L; Stormont, Melissa; Grossman, Caroline; Eddy, Colleen; Reinke, Wendy M

2017-10-01

This study examined the benefit of utilizing one-item academic and one-item behavior readiness teacher-rated screeners at the beginning of the school year to predict end-of-school year outcomes for middle school students. The Middle School Academic and Behavior Readiness (M-ABR) screeners were developed to provide an efficient and effective way to assess readiness in students. Participants included 889 students in 62 middle school classrooms in an urban Missouri school district. Concurrent validity with the M-ABR items and other indicators of readiness in the fall were evaluated using Pearson product-moment correlation coefficients, with the academic readiness item having medium to strong correlations with other baseline academic indicators (r=±0.56 to 0.91) and the behavior readiness item having low to strong correlations with baseline behavior items (r=±0.20 to 0.79). Next, the predictive validity of the M-ABR items was analyzed with hierarchical linear regressions using end-of-year outcomes as the dependent variable. The academic and behavior readiness items demonstrated adequate validity for all outcomes with moderate effects (β=±0.31 to 0.73 for academic outcomes and β=±0.24 to 0.59 for behavioral outcomes) after controlling for baseline demographics. Even after controlling for baseline scores, the M-ABR items predicted unique variance in almost all outcome variables. Four conditional probability indices were calculated to obtain an optimal cut score, to determine ready vs. not ready, for both single-item M-ABR scales. The cut point of "fair" yielded the most acceptable values for the indices. The odd ratios (OR) of experiencing negative outcomes given a "fair" or lower readiness rating (2 or below on the M-ABR screeners) at the beginning of the year were significant and strong for all outcomes (OR=2.29 to OR=14.46), except for internalizing problems. These findings suggest promise for using single readiness items to screen for varying negative end
Probing University Students' Pre-Knowledge in Quantum Physics with QPCS Survey

Science.gov (United States)

Asikainen, Mervi A.

2017-01-01

The study investigated the use of Quantum Physics Conceptual Survey (QPCS) in probing student understanding of quantum physics. Altogether 103 Finnish university students responded to QPCS. The mean scores of the student responses were calculated and the test was evaluated using common five indices: Item difficulty index, Item discrimination…
Item Analysis in Introductory Economics Testing.

Science.gov (United States)

Tinari, Frank D.

1979-01-01

Computerized analysis of multiple choice test items is explained. Examples of item analysis applications in the introductory economics course are discussed with respect to three objectives: to evaluate learning; to improve test items; and to help improve classroom instruction. Problems, costs and benefits of the procedures are identified. (JMD)
A review of the effects on IRT item parameter estimates with a focus on misbehaving common items in test equating

Directory of Open Access Journals (Sweden)

Michalis P Michaelides

2010-10-01

Full Text Available Many studies have investigated the topic of change or drift in item parameter estimates in the context of Item Response Theory. Content effects, such as instructional variation and curricular emphasis, as well as context effects, such as the wording, position, or exposure of an item have been found to impact item parameter estimates. The issue becomes more critical when items with estimates exhibiting differential behavior across test administrations are used as common for deriving equating transformations. This paper reviews the types of effects on IRT item parameter estimates and focuses on the impact of misbehaving or aberrant common items on equating transformations. Implications relating to test validity and the judgmental nature of the decision to keep or discard aberrant common items are discussed, with recommendations for future research into more informed and formal ways of dealing with misbehaving common items.
A Review of the Effects on IRT Item Parameter Estimates with a Focus on Misbehaving Common Items in Test Equating.

Science.gov (United States)

Michaelides, Michalis P

2010-01-01

Many studies have investigated the topic of change or drift in item parameter estimates in the context of item response theory (IRT). Content effects, such as instructional variation and curricular emphasis, as well as context effects, such as the wording, position, or exposure of an item have been found to impact item parameter estimates. The issue becomes more critical when items with estimates exhibiting differential behavior across test administrations are used as common for deriving equating transformations. This paper reviews the types of effects on IRT item parameter estimates and focuses on the impact of misbehaving or aberrant common items on equating transformations. Implications relating to test validity and the judgmental nature of the decision to keep or discard aberrant common items are discussed, with recommendations for future research into more informed and formal ways of dealing with misbehaving common items.
A Comparison of the 27-Item and 12-Item Intolerance of Uncertainty Scales

Science.gov (United States)

Khawaja, Nigar G.; Yu, Lai Ngo Heidi

2010-01-01

The 27-item Intolerance of Uncertainty Scale (IUS) has become one of the most frequently used measures of Intolerance of Uncertainty. More recently, an abridged, 12-item version of the IUS has been developed. The current research used clinical (n = 50) and non-clinical (n = 56) samples to examine and compare the psychometric properties of both…
PENGEMBANGAN TES BERPIKIR KRITIS DENGAN PENDEKATAN ITEM RESPONSE THEORY

Directory of Open Access Journals (Sweden)

Fajrianthi Fajrianthi

2016-06-01

Full Text Available Penelitian ini bertujuan untuk menghasilkan sebuah alat ukur (tes berpikir kritis yang valid dan reliabel untuk digunakan, baik dalam lingkup pendidikan maupun kerja di Indonesia. Tahapan penelitian dilakukan berdasarkan tahap pengembangan tes menurut Hambleton dan Jones (1993. Kisi-kisi dan pembuatan butir didasarkan pada konsep dalam tes Watson-Glaser Critical Thinking Appraisal (WGCTA. Pada WGCTA, berpikir kritis terdiri dari lima dimensi yaitu Inference, Recognition Assumption, Deduction, Interpretation dan Evaluation of arguments. Uji coba tes dilakukan pada 1.453 peserta tes seleksi karyawan di Surabaya, Gresik, Tuban, Bojonegoro, Rembang. Data dikotomi dianalisis dengan menggunakan model IRT dengan dua parameter yaitu daya beda dan tingkat kesulitan butir. Analisis dilakukan dengan menggunakan program statistik Mplus versi 6.11 Sebelum melakukan analisis dengan IRT, dilakukan pengujian asumsi yaitu uji unidimensionalitas, independensi lokal dan Item Characteristic Curve (ICC. Hasil analisis terhadap 68 butir menghasilkan 15 butir dengan daya beda yang cukup baik dan tingkat kesulitan butir yang berkisar antara –4 sampai dengan 2.448. Sedikitnya jumlah butir yang berkualitas baik disebabkan oleh kelemahan dalam menentukan subject matter experts di bidang berpikir kritis dan pemilihan metode skoring. Kata kunci: Pengembangan tes, berpikir kritis, item response theory DEVELOPING CRITICAL THINKING TEST UTILISING ITEM RESPONSE THEORY Abstract The present study was aimed to develop a valid and reliable instrument in assesing critical thinking which can be implemented both in educational and work settings in Indonesia. Following the Hambleton and Jones’s (1993 procedures on test development, the study developed the instrument by employing the concept of critical thinking from Watson-Glaser Critical Thinking Appraisal (WGCTA. The study included five dimensions of critical thinking as adopted from the WGCTA: Inference, Recognition
More is not Always Better: The Relation between Item Response and Item Response Time in Raven’s Matrices

Directory of Open Access Journals (Sweden)

Frank Goldhammer

2015-03-01

Full Text Available The role of response time in completing an item can have very different interpretations. Responding more slowly could be positively related to success as the item is answered more carefully. However, the association may be negative if working faster indicates higher ability. The objective of this study was to clarify the validity of each assumption for reasoning items considering the mode of processing. A total of 230 persons completed a computerized version of Raven’s Advanced Progressive Matrices test. Results revealed that response time overall had a negative effect. However, this effect was moderated by items and persons. For easy items and able persons the effect was strongly negative, for difficult items and less able persons it was less negative or even positive. The number of rules involved in a matrix problem proved to explain item difficulty significantly. Most importantly, a positive interaction effect between the number of rules and item response time indicated that the response time effect became less negative with an increasing number of rules. Moreover, exploratory analyses suggested that the error type influenced the response time effect.
Ethical imperatives against item restriction in the Supplemental Nutrition Assistance Program.

Science.gov (United States)

Chrisinger, Benjamin W

2017-07-01

The Supplemental Nutrition Assistance Program (SNAP, formerly known as food stamps) is the federal government's largest form of food assistance, and a frequent focus of political and scholarly debate. Previous discourse in the public health community and recent proposals in state legislatures have suggested limiting the use of SNAP benefits on unhealthy food items, such as sugar-sweetened beverages (SSBs). This paper identifies two possible underlying motivations for item restriction, health and morals, and analyzes the level of empirical support for claims about the current state of the program, as well as expectations about how item restriction would change participant outcomes. It also assesses how item restriction would reduce individual agency of low-income individuals, and identifies mechanisms by which this may adversely affect program participants. Finally, this paper offers alternative policies to promote healthier purchasing and eating among SNAP participants that can be pursued without reducing individual agency. Health advocates and officials must more fully weigh the attendant risks of implementing SNAP item restrictions, including the reduction of individual agency of a vulnerable population. Copyright © 2017 Elsevier Inc. All rights reserved.
Methodology for the development and calibration of the SCI-QOL item banks.

Science.gov (United States)

Tulsky, David S; Kisala, Pamela A; Victorson, David; Choi, Seung W; Gershon, Richard; Heinemann, Allen W; Cella, David

2015-05-01

To develop a comprehensive, psychometrically sound, and conceptually grounded patient reported outcomes (PRO) measurement system for individuals with spinal cord injury (SCI). Individual interviews (n=44) and focus groups (n=65 individuals with SCI and n=42 SCI clinicians) were used to select key domains for inclusion and to develop PRO items. Verbatim items from other cutting-edge measurement systems (i.e. PROMIS, Neuro-QOL) were included to facilitate linkage and cross-population comparison. Items were field tested in a large sample of individuals with traumatic SCI (n=877). Dimensionality was assessed with confirmatory factor analysis. Local item dependence and differential item functioning were assessed, and items were calibrated using the item response theory (IRT) graded response model. Finally, computer adaptive tests (CATs) and short forms were administered in a new sample (n=245) to assess test-retest reliability and stability. A calibration sample of 877 individuals with traumatic SCI across five SCI Model Systems sites and one Department of Veterans Affairs medical center completed SCI-QOL items in interview format. We developed 14 unidimensional calibrated item banks and 3 calibrated scales across physical, emotional, and social health domains. When combined with the five Spinal Cord Injury--Functional Index physical function banks, the final SCI-QOL system consists of 22 IRT-calibrated item banks/scales. Item banks may be administered as CATs or short forms. Scales may be administered in a fixed-length format only. The SCI-QOL measurement system provides SCI researchers and clinicians with a comprehensive, relevant and psychometrically robust system for measurement of physical-medical, physical-functional, emotional, and social outcomes. All SCI-QOL instruments are freely available on Assessment CenterSM.
Negative effects of item repetition on source memory.

Science.gov (United States)

Kim, Kyungmi; Yi, Do-Joon; Raye, Carol L; Johnson, Marcia K

2012-08-01

In the present study, we explored how item repetition affects source memory for new item-feature associations (picture-location or picture-color). We presented line drawings varying numbers of times in Phase 1. In Phase 2, each drawing was presented once with a critical new feature. In Phase 3, we tested memory for the new source feature of each item from Phase 2. Experiments 1 and 2 demonstrated and replicated the negative effects of item repetition on incidental source memory. Prior item repetition also had a negative effect on source memory when different source dimensions were used in Phases 1 and 2 (Experiment 3) and when participants were explicitly instructed to learn source information in Phase 2 (Experiments 4 and 5). Importantly, when the order between Phases 1 and 2 was reversed, such that item repetition occurred after the encoding of critical item-source combinations, item repetition no longer affected source memory (Experiment 6). Overall, our findings did not support predictions based on item predifferentiation, within-dimension source interference, or general interference from multiple traces of an item. Rather, the findings were consistent with the idea that prior item repetition reduces attention to subsequent presentations of the item, decreasing the likelihood that critical item-source associations will be encoded.
Comparing Lay Community and Academic Survey Center Interviewers in Conducting Household Interviews in Latino Communities.

Science.gov (United States)

Chan-Golston, Alec M; Friedlander, Scott; Glik, Deborah C; Prelip, Michael L; Belin, Thomas R; Brookmeyer, Ron; Santos, Robert; Chen, Jie; Ortega, Alexander N

2016-01-01

The employment of professional interviewers from academic survey centers to conduct surveys has been standard practice. Because one goal of community-engaged research is to provide professional skills to community residents, this paper considers whether employing locally trained lay interviewers from within the community may be as effective as employing interviewers from an academic survey center with regard to unit and item nonresponse rates and cost. To study a nutrition-focused intervention, 1035 in-person household interviews were conducted in East Los Angeles and Boyle Heights, 503 of which were completed by lay community interviewers. A chi-square test was used to assess differences in unit nonresponse rates between professional and community interviewers and Welch's t tests were used to assess differences in item nonresponse rates. A cost comparison analysis between the two interviewer groups was also conducted. Interviewers from the academic survey center had lower unit nonresponse rates than the lay community interviewers (16.2% vs. 23.3%; p < 0.01). However, the item nonresponse rates were lower for the community interviewers than the professional interviewers (1.4% vs. 3.3%; p < 0.01). Community interviewers cost approximately $415.38 per survey whereas professional interviewers cost approximately $537.29 per survey. With a lower cost per completed survey and lower item nonresponse rates, lay community interviewers are a viable alternative to professional interviewers for fieldwork in community-based research. Additional research is needed to assess other important aspects of data quality interviewer such as interviewer effects and response error.
Examination of the PROMIS upper extremity item bank.

Science.gov (United States)

Hung, Man; Voss, Maren W; Bounsanga, Jerry; Crum, Anthony B; Tyser, Andrew R

Clinical measurement. The psychometric properties of the PROMIS v1.2 UE item bank were tested on various samples prior to its release, but have not been fully evaluated among the orthopaedic population. This study assesses the performance of the UE item bank within the UE orthopaedic patient population. The UE item bank was administered to 1197 adult patients presenting to a tertiary orthopaedic clinic specializing in hand and UE conditions and was examined using traditional statistics and Rasch analysis. The UE item bank fits a unidimensional model (outfit MNSQ range from 0.64 to 1.70) and has adequate reliabilities (person = 0.84; item = 0.82) and local independence (item residual correlations range from -0.37 to 0.34). Only one item exhibits gender differential item functioning. Most items target low levels of function. The UE item bank is a useful clinical assessment tool. Additional items covering higher functions are needed to enhance validity. Supplemental testing is recommended for patients at higher levels of function until more high function UE items are developed. 2c. Copyright © 2016 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Assessing nicotine dependence in adolescent E-cigarette users: The 4-item Patient-Reported Outcomes Measurement Information System (PROMIS) Nicotine Dependence Item Bank for electronic cigarettes.

Science.gov (United States)

Morean, Meghan E; Krishnan-Sarin, Suchitra; S O'Malley, Stephanie

2018-04-26

Adolescent e-cigarette use (i.e., "vaping") likely confers risk for developing nicotine dependence. However, there have been no studies assessing e-cigarette nicotine dependence in youth. We evaluated the psychometric properties of the 4-item Patient-Reported Outcomes Measurement Information System Nicotine Dependence Item Bank for E-cigarettes (PROMIS-E) for assessing youth e-cigarette nicotine dependence and examined risk factors for experiencing stronger dependence symptoms. In 2017, 520 adolescent past-month e-cigarette users completed the PROMIS-E during a school-based survey (50.5% female, 84.8% White, 16.22[1.19] years old). Adolescents also reported on sex, grade, race, age at e-cigarette use onset, vaping frequency, nicotine e-liquid use, and past-month cigarette smoking. Analyses included conducting confirmatory factor analysis and examining the internal consistency of the PROMIS-E. Bivariate correlations and independent-samples t-tests were used to examine unadjusted relationships between e-cigarette nicotine dependence and the proposed risk factors. Regression models were run in which all potential risk factors were entered as simultaneous predictors of PROMIS-E scores. The single-factor structure of the PROMIS-E was confirmed and evidenced good internal consistency. Across models, larger PROMIS-E scores were associated with being in a higher grade, initiating e-cigarette use at an earlier age, vaping more frequently, using nicotine e-liquid (and higher nicotine concentrations), and smoking cigarettes. Adolescent e-cigarette users reported experiencing nicotine dependence, which was assessed using the psychometrically sound PROMIS-E. Experiencing stronger nicotine dependence symptoms was associated with characteristics that previously have been shown to confer risk for frequent vaping and tobacco cigarette dependence. Copyright © 2018 Elsevier B.V. All rights reserved.
Psychometric Consequences of Subpopulation Item Parameter Drift

Science.gov (United States)

Huggins-Manley, Anne Corinne

2017-01-01

This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…

A new Integrated Negative Symptom structure of the Positive and Negative Syndrome Scale (PANSS) in schizophrenia using item response analysis.

Science.gov (United States)

Khan, Anzalee; Lindenmayer, Jean-Pierre; Opler, Mark; Yavorsky, Christian; Rothman, Brian; Lucic, Luka

2013-10-01

Debate persists with regard to how best to categorize the syndromal dimension of negative symptoms in schizophrenia. The aim was to first review published Principle Components Analysis (PCA) of the PANSS, and extract items most frequently included in the negative domain, and secondly, to examine the quality of items using Item Response Theory (IRT) to select items that best represent a measurable dimension (or dimensions) of negative symptoms. First, 22 factor analyses and PCA met were included. Second, using a large dataset (n=7187) of participants in clinical trials with chronic schizophrenia, we extracted items loading on one or more PCA. Third, items not loading with a value of ≥ 0.5, or loading on more than one component with values of ≥ 0.5 were discarded. Fourth, resulting items were included in a non-parametric IRT and retained based on Option Characteristic Curves (OCCs) and Item Characteristic Curves (ICCs). 15 items loaded on a negative domain in at least one study, with Emotional Withdrawal loading on all studies. Non-parametric IRT retained nine items as an Integrated Negative Factor: Emotional Withdrawal, Blunted Affect, Passive/Apathetic Social Withdrawal, Poor Rapport, Lack of Spontaneity/Conversation Flow, Active Social Avoidance, Disturbance of Volition, Stereotyped Thinking and Difficulty in Abstract Thinking. This is the first study to use a psychometric IRT process to arrive at a set of negative symptom items. Future steps will include further examination of these nine items in terms of their stability, sensitivity to change, and correlations with functional and cognitive outcomes. © 2013 Elsevier B.V. All rights reserved.
The 12 item Social and Economic Conservatism Scale (SECS).

Science.gov (United States)

Everett, Jim A C

2013-01-01

Recent years have seen a surge in psychological research on the relationship between political ideology (particularly conservatism) and cognition, affect, behaviour, and even biology. Despite this flurry of investigation, however, there is as yet no accepted, validated, and widely used multi-item scale of conservatism that is concise, that is modern in its conceptualisation, and that includes both social and economic conservatism subscales. In this paper the 12-Item Social and Economic Conservatism Scale (SECS) is proposed and validated to help fill this gap. The SECS is suggested to be an important and useful tool for researchers working in political psychology.
Few items in the thyroid-related quality of life instrument ThyPRO exhibited differential item functioning.

Science.gov (United States)

Watt, Torquil; Groenvold, Mogens; Hegedüs, Laszlo; Bonnema, Steen Joop; Rasmussen, Åse Krogh; Feldt-Rasmussen, Ulla; Bjorner, Jakob Bue

2014-02-01

To evaluate the extent of differential item functioning (DIF) within the thyroid-specific quality of life patient-reported outcome measure, ThyPRO, according to sex, age, education and thyroid diagnosis. A total of 838 patients with benign thyroid diseases completed the ThyPRO questionnaire (84 five-point items, 13 scales). Uniform and nonuniform DIF were investigated using ordinal logistic regression, testing for both statistical significance and magnitude (∆R(2) > 0.02). Scale level was estimated by the sum score, after purification. Twenty instances of DIF in 17 of the 84 items were found. Eight according to diagnosis, where the goiter scale was the one most affected, possibly due to differing perceptions in patients with auto-immune thyroid diseases compared to patients with simple goiter. Eight DIFs according to age were found, of which 5 were in positively worded items, which younger patients were more likely to endorse; one according to gender: women were more likely to report crying, and three according to educational level. The vast majority of DIF had only minor influence on the scale scores (0.1-2.3 points on the 0-100 scales), but two DIF corresponded to a difference of 4.6 and 9.8, respectively. Ordinal logistic regression identified DIF in 17 of 84 items. The potential impact of this on the present scales was low, but items displaying DIF could be avoided when developing abbreviated scales, where the potential impact of DIF (due to fewer items) will be larger.
Converging evidence for control of color-word Stroop interference at the item level.

Science.gov (United States)

Bugg, Julie M; Hutchison, Keith A

2013-04-01

Prior studies have shown that cognitive control is implemented at the list and context levels in the color-word Stroop task. At first blush, the finding that Stroop interference is reduced for mostly incongruent items as compared with mostly congruent items (i.e., the item-specific proportion congruence [ISPC] effect) appears to provide evidence for yet a third level of control, which modulates word reading at the item level. However, evidence to date favors the view that ISPC effects reflect the rapid prediction of high-contingency responses and not item-specific control. In Experiment 1, we first show that an ISPC effect is obtained when the relevant dimension (i.e., color) signals proportion congruency, a problematic pattern for theories based on differential response contingencies. In Experiment 2, we replicate and extend this pattern by showing that item-specific control settings transfer to new stimuli, ruling out alternative frequency-based accounts. In Experiment 3, we revert to the traditional design in which the irrelevant dimension (i.e., word) signals proportion congruency. Evidence for item-specific control, including transfer of the ISPC effect to new stimuli, is apparent when 4-item sets are employed but not when 2-item sets are employed. We attribute this pattern to the absence of high-contingency responses on incongruent trials in the 4-item set. These novel findings provide converging evidence for reactive control of color-word Stroop interference at the item level, reveal theoretically important factors that modulate reliance on item-specific control versus contingency learning, and suggest an update to the item-specific control account (Bugg, Jacoby, & Chanani, 2011).
Loglinear multidimensional IRT models for polytomously scired Items

NARCIS (Netherlands)

Kelderman, Henk

1988-01-01

A loglinear item response theory (IRT) model is proposed that relates polytomously scored item responses to a multidimensional latent space. Each item may have a different response function where each item response may be explained by one or more latent traits. Item response functions may follow a
48 CFR 852.214-72 - Alternate item(s).

Science.gov (United States)

2010-10-01

... AND FORMS SOLICITATION PROVISIONS AND CONTRACT CLAUSES Texts of Provisions and Clauses 852.214-72... 2008) Bids on []* will be given equal consideration along with bids on []** and any such bids received... [].** * Contracting officer will insert an alternate item that is considered acceptable. ** Contracting officer will...
Mokken scale analysis of mental health and well-being questionnaire item responses: a non-parametric IRT method in empirical research for applied health researchers.

Science.gov (United States)

Stochl, Jan; Jones, Peter B; Croudace, Tim J

2012-06-11

Mokken scaling techniques are a useful tool for researchers who wish to construct unidimensional tests or use questionnaires that comprise multiple binary or polytomous items. The stochastic cumulative scaling model offered by this approach is ideally suited when the intention is to score an underlying latent trait by simple addition of the item response values. In our experience, the Mokken model appears to be less well-known than for example the (related) Rasch model, but is seeing increasing use in contemporary clinical research and public health. Mokken's method is a generalisation of Guttman scaling that can assist in the determination of the dimensionality of tests or scales, and enables consideration of reliability, without reliance on Cronbach's alpha. This paper provides a practical guide to the application and interpretation of this non-parametric item response theory method in empirical research with health and well-being questionnaires. Scalability of data from 1) a cross-sectional health survey (the Scottish Health Education Population Survey) and 2) a general population birth cohort study (the National Child Development Study) illustrate the method and modeling steps for dichotomous and polytomous items respectively. The questionnaire data analyzed comprise responses to the 12 item General Health Questionnaire, under the binary recoding recommended for screening applications, and the ordinal/polytomous responses to the Warwick-Edinburgh Mental Well-being Scale. After an initial analysis example in which we select items by phrasing (six positive versus six negatively worded items) we show that all items from the 12-item General Health Questionnaire (GHQ-12)--when binary scored--were scalable according to the double monotonicity model, in two short scales comprising six items each (Bech's "well-being" and "distress" clinical scales). An illustration of ordinal item analysis confirmed that all 14 positively worded items of the Warwick-Edinburgh Mental
Mokken scale analysis of mental health and well-being questionnaire item responses: a non-parametric IRT method in empirical research for applied health researchers

Directory of Open Access Journals (Sweden)

Stochl Jan

2012-06-01

Full Text Available Abstract Background Mokken scaling techniques are a useful tool for researchers who wish to construct unidimensional tests or use questionnaires that comprise multiple binary or polytomous items. The stochastic cumulative scaling model offered by this approach is ideally suited when the intention is to score an underlying latent trait by simple addition of the item response values. In our experience, the Mokken model appears to be less well-known than for example the (related Rasch model, but is seeing increasing use in contemporary clinical research and public health. Mokken's method is a generalisation of Guttman scaling that can assist in the determination of the dimensionality of tests or scales, and enables consideration of reliability, without reliance on Cronbach's alpha. This paper provides a practical guide to the application and interpretation of this non-parametric item response theory method in empirical research with health and well-being questionnaires. Methods Scalability of data from 1 a cross-sectional health survey (the Scottish Health Education Population Survey and 2 a general population birth cohort study (the National Child Development Study illustrate the method and modeling steps for dichotomous and polytomous items respectively. The questionnaire data analyzed comprise responses to the 12 item General Health Questionnaire, under the binary recoding recommended for screening applications, and the ordinal/polytomous responses to the Warwick-Edinburgh Mental Well-being Scale. Results and conclusions After an initial analysis example in which we select items by phrasing (six positive versus six negatively worded items we show that all items from the 12-item General Health Questionnaire (GHQ-12 – when binary scored – were scalable according to the double monotonicity model, in two short scales comprising six items each (Bech’s “well-being” and “distress” clinical scales. An illustration of ordinal item analysis
Purchases of Consumable Items Transferred to the Defense Logistics Agency

National Research Council Canada - National Science Library

Young, Shelton

1995-01-01

Defense Management Report Decision 926, "Consolidation of Inventory Control Points," included a recommendation to transfer all consumable items managed by the Military Departments to the Defense Logistics Agency (DLA...
Predictive validity of the Work Ability Index and its individual items in the general population.

Science.gov (United States)

Lundin, Andreas; Leijon, Ola; Vaez, Marjan; Hallgren, Mats; Torgén, Margareta

2017-06-01

This study assesses the predictive ability of the full Work Ability Index (WAI) as well as its individual items in the general population. The Work, Health and Retirement Study (WHRS) is a stratified random national sample of 25-75-year-olds living in Sweden in 2000 that received a postal questionnaire ( n = 6637, response rate = 53%). Current and subsequent sickness absence was obtained from registers. The ability of the WAI to predict long-term sickness absence (LTSA; ⩾ 90 consecutive days) during a period of four years was analysed by logistic regression, from which the Area Under the Receiver Operating Characteristic curve (AUC) was computed. There were 313 incident LTSA cases among 1786 employed individuals. The full WAI had acceptable ability to predict LTSA during the 4-year follow-up (AUC = 0.79; 95% CI 0.76 to 0.82). Individual items were less stable in their predictive ability. However, three of the individual items: current work ability compared with lifetime best, estimated work impairment due to diseases, and number of diagnosed current diseases, exceeded AUC > 0.70. Excluding the WAI item on number of days on sickness absence did not result in an inferior predictive ability of the WAI. The full WAI has acceptable predictive validity, and is superior to its individual items. For public health surveys, three items may be suitable proxies of the full WAI; current work ability compared with lifetime best, estimated work impairment due to diseases, and number of current diseases diagnosed by a physician.
Losing Items in the Psychogeriatric Nursing Home

Directory of Open Access Journals (Sweden)

J. van Hoof PhD

2016-09-01

Full Text Available Introduction: Losing items is a time-consuming occurrence in nursing homes that is ill described. An explorative study was conducted to investigate which items got lost by nursing home residents, and how this affects the residents and family caregivers. Method: Semi-structured interviews and card sorting tasks were conducted with 12 residents with early-stage dementia and 12 family caregivers. Thematic analysis was applied to the outcomes of the sessions. Results: The participants stated that numerous personal items and assistive devices get lost in the nursing home environment, which had various emotional, practical, and financial implications. Significant amounts of time are spent on trying to find items, varying from 1 hr up to a couple of weeks. Numerous potential solutions were identified by the interviewees. Discussion: Losing items often goes together with limitations to the participation of residents. Many family caregivers are reluctant to replace lost items, as these items may get lost again.
Dutch-Flemish translation of nine pediatric item banks from the Patient-Reported Outcomes Measurement Information System (PROMIS)®.

Science.gov (United States)

Haverman, Lotte; Grootenhuis, Martha A; Raat, Hein; van Rossum, Marion A J; van Dulmen-den Broeder, Eline; Hoppenbrouwers, Karel; Correia, Helena; Cella, David; Roorda, Leo D; Terwee, Caroline B

2016-03-01

The Patient-Reported Outcomes Measurement Information System (PROMIS(®)) is a new, state-of-the-art assessment system for measuring patient-reported health and well-being of adults and children. It has the potential to be more valid, reliable, and responsive than existing PROMs. The items banks are designed to be self-reported and completed by children aged 8-18 years. The PROMIS items can be administered in short forms or through computerized adaptive testing. This paper describes the translation and cultural adaption of nine PROMIS item banks (151 items) for children in Dutch-Flemish. The translation was performed by FACITtrans using standardized PROMIS methodology and approved by the PROMIS Statistical Center. The translation included four forward translations, two back-translations, three independent reviews (at least two Dutch, one Flemish), and pretesting in 24 children from the Netherlands and Flanders. For some items, it was necessary to have separate translations for Dutch and Flemish: physical function-mobility (three items), anger (one item), pain interference (two items), and asthma impact (one item). Challenges faced in the translation process included scarcity or overabundance of possible translations, unclear item descriptions, constructs broader/smaller in the target language, difficulties in rank ordering items, differences in unit of measurement, irrelevant items, or differences in performance of activities. By addressing these challenges, acceptable translations were obtained for all items. The Dutch-Flemish PROMIS items are linguistically equivalent to the original USA version. Short forms are now available for use, and entire item banks are ready for cross-cultural validation in the Netherlands and Flanders.
Assessing the specificity of posttraumatic stress disorder's dysphoric items within the dysphoria model.

Science.gov (United States)

Armour, Cherie; Shevlin, Mark

2013-10-01

The factor structure of posttraumatic stress disorder (PTSD) currently used by the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV), has received limited support. A four-factor dysphoria model is widely supported. However, the dysphoria factor of this model has been hailed as a nonspecific factor of PTSD. The present study investigated the specificity of the dysphoria factor within the dysphoria model by conducting a confirmatory factor analysis while statistically controlling for the variance attributable to depression. The sample consisted of 429 individuals who met the diagnostic criteria for PTSD in the National Comorbidity Survey. The results concluded that there was no significant attenuation in any of the PTSD items. This finding is pertinent given several proposals for the removal of dysphoric items from the diagnostic criteria set of PTSD in the upcoming DSM-5.
Multi-item economic production quantity model for imperfect items with multiple production setups and rework under the effect of preservation technology and learning environment

Directory of Open Access Journals (Sweden)

Preeti Jawla

2016-09-01

Full Text Available This study aims to investigate the multi-item inventory model in a production/rework system with multiple production setups. Rework can be depicted as the transformation of production rejects, failed, or non-conforming items into re-usable products of the same or lower quality during or after inspection. Rework is very valuable and profitable, especially if materials are limited in availability and also pricey. Moreover, rework can be a good contribution to a ‘green image environment’. In this paper, we establish a multi-item inventory model to determine the optimal inventory replenishment policy for the economic production quantity (EPQ model for imperfect, deteriorating items with multiple productions and rework under inflation and learning environment. In inventory modelling, Inflation plays a very important role. In one cycle, production system produces items in n production setups and one rework setup, i.e. system follows (n, 1 policy. To reduce the deterioration of products preservation technology investment is also considered in this model. Holding cost is taken as time dependent. We develop expressions for the average profit per time unit, including procurement of input materials, costs for production, rework, deterioration cost and storage of serviceable and reworkable lots. Using those expressions, the proposed model is demonstrated numerically and the sensitivity analysis is also performed to study the behaviour of the model.
Adult exposures from MDCT including multiphase studies: first Italian nationwide survey

Energy Technology Data Exchange (ETDEWEB)

Palorini, Federica; Origgi, Daniela [Fisica Sanitaria Istituto Europeo di Oncologia, Milan (Italy); Granata, Claudio [UOC di Radiologia Istituto Giannina Gaslini, Genoa (Italy); Matranga, Domenica [Universita degli Studi di Palermo, Dipartimento di Scienze per la Promozione della Salute e Materno-infantile ' ' G. D' Alessandro' ' , Palermo (Italy); Salerno, Sergio [Policlinico Universita di Palermo, Dipartimento di Scienze Radiologiche, Palermo (Italy)

2014-02-15

To evaluate the radiation dose in routine multidetector computed tomography (MDCT) examinations in Italian population. This was a retrospective multicentre study included 5,668 patients from 65 radiology departments who had undergone common CT protocols: head, chest, abdomen, chest-abdomen-pelvis (CAP), spine and cardiac. Data included patient characteristics, CT parameters, volumetric CT dose index (CTDI{sub vol}) and dose length product (DLP) for each CT acquisition phase. Descriptive statistics were calculated, and a multi-regression analysis was used to outline the main factors affecting exposure. The 75th percentiles of CTDI{sub vol} (mGy) and DLP (mGy cm) for whole head were 69 mGy and 1,312 mGy cm, respectively; for chest, 15 mGy and 569 mGy cm; spine, 42 mGy and 888 mGy cm; cardiac, 7 mGy and 131 mGy cm for calcium score, and 61 mGy and 1,208 mGy cm for angiographic CT studies. High variability was present in the DLP of abdomen and CAP protocols, where multiphase examinations dominated (71 % and 73 % respectively): for abdomen, 18 mGy, with 555 and 920 mGy cm in abdomen and abdomen-pelvis acquisitions respectively; for CAP, 17 mGy, with 508, 850 and 1,200 mGy cm in abdomen, abdomen-pelvis and CAP acquisitions respectively. The results of this survey could help in the definition of updated diagnostic reference levels (DRL). (orig.)
Behavioral decoding of working memory items inside and outside the focus of attention.

Science.gov (United States)

Mallett, Remington; Lewis-Peacock, Jarrod A

2018-03-31

How we attend to our thoughts affects how we attend to our environment. Holding information in working memory can automatically bias visual attention toward matching information. By observing attentional biases on reaction times to visual search during a memory delay, it is possible to reconstruct the source of that bias using machine learning techniques and thereby behaviorally decode the content of working memory. Can this be done when more than one item is held in working memory? There is some evidence that multiple items can simultaneously bias attention, but the effects have been inconsistent. One explanation may be that items are stored in different states depending on the current task demands. Recent models propose functionally distinct states of representation for items inside versus outside the focus of attention. Here, we use behavioral decoding to evaluate whether multiple memory items-including temporarily irrelevant items outside the focus of attention-exert biases on visual attention. Only the single item in the focus of attention was decodable. The other item showed a brief attentional bias that dissipated until it returned to the focus of attention. These results support the idea of dynamic, flexible states of working memory across time and priority. © 2018 New York Academy of Sciences.
A survey of evidence users about the information need of acupuncture clinical evidence.

Science.gov (United States)

Shi, Xiue; Wang, Xiaoqin; Liu, Yali; Li, Xiuxia; Wei, Dang; Zhao, Xu; Gu, Jing; Yang, Kehu

2016-11-10

The PRISMA statement was rarely used in the field of acupuncture, possibly because of knowledge gaps and the lack of items tailored for characteristics of acupuncture. And with an increasing number of systematic reviews in acupuncture, it is necessary to develop an extension of PRISMA for acupuncture. And this study was the first step of our project, of which the aim was to investigate the need for information of clinical evidence on acupuncture from the perspectives of evidence users. We designed a questionnaire based on a pilot survey and a literature review of acupuncture systematic review or meta-analysis(SR/MA). Participants from five cities (Lanzhou, Chengdu, Shanghai, Nanjing and Beijing) representing the different regions of China, including clinicians, researchers and postgraduates in their second year of Master studies or higher level, were surveyed. A total of 269 questionnaires were collected in 18 hospitals, medical universities and research agencies, and 251 (93 %) with complete data were used for analysis. The average age of respondents was 33 years (SD 8.959, range 25-58) with male 43 % and female 57 %. Most respondents had less than 5 years of working experience on acupuncture, and read only one to five articles per month. Electronic databases, search engines and academic conferences were the most common sources for obtaining information. Fifty-six percent of the respondents expressed low satisfaction of the completeness of information from the literature. The eight items proposed for acupuncture SR/MAs received all high scores, and five of the items scored higher than eight on a scale zero to ten. The differences for the scores of most items between postgraduates and non-postgraduates were not statistically significant. The majority of the respondents were not very satisfied with the information provided in acupuncture SRs. Most of the items proposed in this questionnaire received high scores, and opinions from postgraduates and non
A survey of resilience, burnout, and tolerance of uncertainty in Australian general practice registrars

Directory of Open Access Journals (Sweden)

Cooke Georga PE

2013-01-01

Full Text Available Abstract Background Burnout and intolerance of uncertainty have been linked to low job satisfaction and lower quality patient care. While resilience is related to these concepts, no study has examined these three concepts in a cohort of doctors. The objective of this study was to measure resilience, burnout, compassion satisfaction, personal meaning in patient care and intolerance of uncertainty in Australian general practice (GP registrars. Methods We conducted a paper-based cross-sectional survey of GP registrars in Australia from June to July 2010, recruited from a newsletter item or registrar education events. Survey measures included the Resilience Scale-14, a single-item scale for burnout, Professional Quality of Life (ProQOL scale, Personal Meaning in Patient Care scale, Intolerance of Uncertainty-12 scale, and Physician Response to Uncertainty scale. Results 128 GP registrars responded (response rate 90%. Fourteen percent of registrars were found to be at risk of burnout using the single-item scale for burnout, but none met the criteria for burnout using the ProQOL scale. Secondary traumatic stress, general intolerance of uncertainty, anxiety due to clinical uncertainty and reluctance to disclose uncertainty to patients were associated with being at higher risk of burnout, but sex, age, practice location, training duration, years since graduation, and reluctance to disclose uncertainty to physicians were not. Only ten percent of registrars had high resilience scores. Resilience was positively associated with compassion satisfaction and personal meaning in patient care. Resilience was negatively associated with burnout, secondary traumatic stress, inhibitory anxiety, general intolerance to uncertainty, concern about bad outcomes and reluctance to disclose uncertainty to patients. Conclusions GP registrars in this survey showed a lower level of burnout than in other recent surveys of the broader junior doctor population in both Australia
An Item Bank for Abuse of Prescription Pain Medication from the Patient-Reported Outcomes Measurement Information System (PROMIS®).

Science.gov (United States)

Pilkonis, Paul A; Yu, Lan; Dodds, Nathan E; Johnston, Kelly L; Lawrence, Suzanne M; Hilton, Thomas F; Daley, Dennis C; Patkar, Ashwin A; McCarty, Dennis

2017-08-01

There is a need to monitor patients receiving prescription opioids to detect possible signs of abuse. To address this need, we developed and calibrated an item bank for severity of abuse of prescription pain medication as part of the Patient-Reported Outcomes Measurement Information System (PROMIS ® ). Comprehensive literature searches yielded an initial bank of 5,310 items relevant to substance use and abuse, including abuse of prescription pain medication, from over 80 unique instruments. After qualitative item analysis (i.e., focus groups, cognitive interviewing, expert review, and item revision), 25 items for abuse of prescribed pain medication were included in field testing. Items were written in a first-person, past-tense format, with a three-month time frame and five response options reflecting frequency or severity. The calibration sample included 448 respondents, 367 from the general population (ascertained through an internet panel) and 81 from community treatment programs participating in the National Drug Abuse Treatment Clinical Trials Network. A final bank of 22 items was calibrated using the two-parameter graded response model from item response theory. A seven-item static short form was also developed. The test information curve showed that the PROMIS ® item bank for abuse of prescription pain medication provided substantial information in a broad range of severity. The initial psychometric characteristics of the item bank support its use as a computerized adaptive test or short form, with either version providing a brief, precise, and efficient measure relevant to both clinical and community samples. © 2016 American Academy of Pain Medicine. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com
An Effective Multimedia Item Shell Design for Individualized Education: The Crome Project

Directory of Open Access Journals (Sweden)

Irene Cheng

2008-01-01

Full Text Available There are several advantages to creating multimedia item types and applying computer-based adaptive testing in education. First is the capability to motivate learning by making the learners feel more engaged and in an interactive environment. Second is a better concept representation, which is not possible in conventional multiple-choice tests. Third is the advantage of individualized curriculum design, rather than a curriculum designed for an average student. Fourth is a good choice of the next question, associated with the appropriate difficulty level based on a student's response to the current question. However, many issues need to be addressed when achieving these goals, including: (a the large number of item types required to represent the current multiple-choice questions in multimedia formats, (b the criterion used to determine the difficulty level of a multimedia question item, and (c the methodology applied to the question selection process for individual students. In this paper, we propose a multimedia item shell design that not only reduces the number of item types required, but also computes difficulty level of an item automatically. The concept of question seed is introduced to make content creation more cost-effective. The proposed item shell framework facilitates efficient communication between user responses at the client, and the scoring agents integrated with a student ability assessor at the server. We also describe approaches for automatically estimating difficulty level of questions, and discuss preliminary evaluation of multimedia item types by students.

Adult Attachment Ratings (AAR): an item response theory analysis.

Science.gov (United States)

Pilkonis, Paul A; Kim, Yookyung; Yu, Lan; Morse, Jennifer Q

2014-01-01

The Adult Attachment Ratings (AAR) include 3 scales for anxious, ambivalent attachment (excessive dependency, interpersonal ambivalence, and compulsive care-giving), 3 for avoidant attachment (rigid self-control, defensive separation, and emotional detachment), and 1 for secure attachment. The scales include items (ranging from 6-16 in their original form) scored by raters using a 3-point format (0 = absent, 1 = present, and 2 = strongly present) and summed to produce a total score. Item response theory (IRT) analyses were conducted with data from 414 participants recruited from psychiatric outpatient, medical, and community settings to identify the most informative items from each scale. The IRT results allowed us to shorten the scales to 5-item versions that are more precise and easier to rate because of their brevity. In general, the effective range of measurement for the scales was 0 to +2 SDs for each of the attachment constructs; that is, from average to high levels of attachment problems. Evidence for convergent and discriminant validity of the scales was investigated by comparing them with the Experiences of Close Relationships-Revised (ECR-R) scale and the Kobak Attachment Q-sort. The best consensus among self-reports on the ECR-R, informant ratings on the ECR-R, and expert judgments on the Q-sort and the AAR emerged for anxious, ambivalent attachment. Given the good psychometric characteristics of the scale for secure attachment, however, this measure alone might provide a simple alternative to more elaborate procedures for some measurement purposes. Conversion tables are provided for the 7 scales to facilitate transformation from raw scores to IRT-calibrated (theta) scores.
Large Item Disposal At The Drigg Low Level Waste Repository, United Kingdom

International Nuclear Information System (INIS)

Griffiths, Steve

2012-01-01

Currently the UK operates only one repository for low level radioactive waste, the LLWR near Drigg in Cumbria. It is located on the West Cumbrian coast near the village of Drigg. LLWR is designed for the management of solid LLW and has operated as the principal national disposal facility for LLW since 1959. LLWR is managed and operated on behalf of the Nuclear Decommissioning Authority (NDA) by UK Nuclear Waste Management Ltd. (UKNWM), parent body of LLW Repository Ltd. UKNWM is a consortium led by URS, Studsvik and AREVA. Waste is accepted at LLWR based on conditions for acceptance (1). Although there is some history of disposal of non-containerised 'large items' at the Drigg site these are anecdotally described as 'not quite fitting into an ISO container (2)' and enquiries indicate that their disposal was restricted to the legacy times when items were tumble-tipped into open trenches at the site, a practise now long ceased. The feasibility of true single large item disposal at the LLWR presents complex problems arising from the poor suitability of both rail and road infrastructure in UK. LLWR is serviced both by road and rail links. The static weight of large items being taken nominally as up to ∼300 metric tons would not necessarily preclude transportation by rail but the practicalities of this route are limited. The ageing rail infrastructure includes numerous tunnels, bridges and sections of line with overhead electrification. All these would require either careful justification or significant work to ensure the safe transit of large loads. Nuclear facilities in UK are by design in remote locations, not all of which are serviced by rail connections and the rail network itself has evolved to service inter-city transportation rather than heavy freight and as such tends to route through town centres, exacerbating the tunnel, bridge and pantograph concerns already identified. Within only a few miles of the LLWR itself there are requirements to pass both over and
The 12 item Social and Economic Conservatism Scale (SECS.

Directory of Open Access Journals (Sweden)

Jim A C Everett

Full Text Available Recent years have seen a surge in psychological research on the relationship between political ideology (particularly conservatism and cognition, affect, behaviour, and even biology. Despite this flurry of investigation, however, there is as yet no accepted, validated, and widely used multi-item scale of conservatism that is concise, that is modern in its conceptualisation, and that includes both social and economic conservatism subscales. In this paper the 12-Item Social and Economic Conservatism Scale (SECS is proposed and validated to help fill this gap. The SECS is suggested to be an important and useful tool for researchers working in political psychology.
[Impact of passing items above the ceiling on the assessment results of Peabody developmental motor scales].

Science.gov (United States)

Zhao, Gai; Bian, Yang; Li, Ming

2013-12-18

To analyze the impact of passing items above the roof level in the gross motor subtest of Peabody development motor scales (PDMS-2) on its assessment results. In the subtests of PDMS-2, 124 children from 1.2 to 71 months were administered. Except for the original scoring method, a new scoring method which includes passing items above the ceiling were developed. The standard scores and quotients of the two scoring methods were compared using the independent-samples t test. Only one child could pass the items above the ceiling in the stationary subtest, 19 children in the locomotion subtest, and 17 children in the visual-motor integration subtest. When the scores of these passing items were included in the raw scores, the total raw scores got the added points of 1-12, the standard scores added 0-1 points and the motor quotients added 0-3 points. The diagnostic classification was changed only in two children. There was no significant difference between those two methods about motor quotients or standard scores in the specific subtest (P>0.05). The passing items above a ceiling of PDMS-2 isn't a rare situation. It usually takes place in the locomotion subtest and visual-motor integration subtest. Including these passing items into the scoring system will not make significant difference in the standard scores of the subtests or the developmental motor quotients (DMQ), which supports the original setting of a ceiling established by upassing 3 items in a row. However, putting the passing items above the ceiling into the raw score will improve tracking of children's developmental trajectory and intervention effects.
Three-item Direct Observation Screen (TIDOS) for autism spectrum disorder.

Science.gov (United States)

Oner, Pinar; Oner, Ozgur; Munir, Kerim

2014-08-01

We compared ratings on the Three-Item Direct Observation Screen test for autism spectrum disorders completed by pediatric residents with the Social Communication Questionnaire parent reports as an augmentative tool for improving autism spectrum disorder screening performance. We examined three groups of children (18-60 months) comparable in age (18-24 month, 24-36 month, 36-60 preschool subgroups) and gender distribution: n = 86 with Diagnostic and Statistical Manual of Mental Disorders (4th ed., text rev.) autism spectrum disorders; n = 76 with developmental delay without autism spectrum disorders; and n = 97 with typical development. The Three-Item Direct Observation Screen test included the following (a) Joint Attention, (b) Eye Contact, and (c) Responsiveness to Name. The parent Social Communication Questionnaire ratings had a sensitivity of .73 and specificity of .70 for diagnosis of autism spectrum disorders. The Three-Item Direct Observation Screen test item Joint Attention had a sensitivity of .82 and specificity of .90, Eye Contact had a sensitivity of .89 and specificity of .91, and Responsiveness to Name had a sensitivity of .67 and specificity of .87. In the Three-Item Direct Observation Screen test, having at least one of the three items positive had a sensitivity of .95 and specificity of .85. Age, diagnosis of autism spectrum disorder, and developmental level were important factors affecting sensitivity and specificity. The results indicate that augmentation of autism spectrum disorder screening by observational items completed by trained pediatric-oriented professionals can be a highly effective tool in improving screening performance. If supported by future population studies, the results suggest that primary care practitioners will be able to be trained to use this direct procedure to augment screening for autism spectrum disorders in the community. © The Author(s) 2013.
Item selection via Bayesian IRT models.

Science.gov (United States)

Arima, Serena

2015-02-10

With reference to a questionnaire that aimed to assess the quality of life for dysarthric speakers, we investigate the usefulness of a model-based procedure for reducing the number of items. We propose a mixed cumulative logit model, which is known in the psychometrics literature as the graded response model: responses to different items are modelled as a function of individual latent traits and as a function of item characteristics, such as their difficulty and their discrimination power. We jointly model the discrimination and the difficulty parameters by using a k-component mixture of normal distributions. Mixture components correspond to disjoint groups of items. Items that belong to the same groups can be considered equivalent in terms of both difficulty and discrimination power. According to decision criteria, we select a subset of items such that the reduced questionnaire is able to provide the same information that the complete questionnaire provides. The model is estimated by using a Bayesian approach, and the choice of the number of mixture components is justified according to information criteria. We illustrate the proposed approach on the basis of data that are collected for 104 dysarthric patients by local health authorities in Lecce and in Milan. Copyright © 2014 John Wiley & Sons, Ltd.
Re-evaluating a vision-related quality of life questionnaire with item response theory (IRT and differential item functioning (DIF analyses

Directory of Open Access Journals (Sweden)

Knol Dirk L

2011-09-01

Full Text Available Abstract Background For the Low Vision Quality Of Life questionnaire (LVQOL it is unknown whether the psychometric properties are satisfactory when an item response theory (IRT perspective is considered. This study evaluates some essential psychometric properties of the LVQOL questionnaire in an IRT model, and investigates differential item functioning (DIF. Methods Cross-sectional data were used from an observational study among visually-impaired patients (n = 296. Calibration was performed for every dimension of the LVQOL in the graded response model. Item goodness-of-fit was assessed with the S-X2-test. DIF was assessed on relevant background variables (i.e. age, gender, visual acuity, eye condition, rehabilitation type and administration type with likelihood-ratio tests for DIF. The magnitude of DIF was interpreted by assessing the largest difference in expected scores between subgroups. Measurement precision was assessed by presenting test information curves; reliability with the index of subject separation. Results All items of the LVQOL dimensions fitted the model. There was significant DIF on several items. For two items the maximum difference between expected scores exceeded one point, and DIF was found on multiple relevant background variables. Item 1 'Vision in general' from the "Adjustment" dimension and item 24 'Using tools' from the "Reading and fine work" dimension were removed. Test information was highest for the "Reading and fine work" dimension. Indices for subject separation ranged from 0.83 to 0.94. Conclusions The items of the LVQOL showed satisfactory item fit to the graded response model; however, two items were removed because of DIF. The adapted LVQOL with 21 items is DIF-free and therefore seems highly appropriate for use in heterogeneous populations of visually impaired patients.
36-Item Short Form Survey (SF-36) Versus Gait Speed As Predictor of Preclinical Mobility Disability in Older Women: The Women's Health Initiative.

Science.gov (United States)

Laddu, Deepika R; Wertheim, Betsy C; Garcia, David O; Woods, Nancy F; LaMonte, Michael J; Chen, Bertha; Anton-Culver, Hoda; Zaslavsky, Oleg; Cauley, Jane A; Chlebowski, Rowan; Manson, JoAnn E; Thomson, Cynthia A; Stefanick, Marcia L

2018-04-01

To compare the value of clinically measured gait speed with that of the self-reported Medical Outcomes Study 36-item Short-Form Survey Physical Function Index (SF-36 PF) in predicting future preclinical mobility disability (PCMD) in older women. Prospective cohort study. Forty clinical centers in the United States. Women aged 65 to 79 enrolled in the Women's Health Initiative Clinical Trials with gait speed and SF-36 assessed at baseline (1993-1998) and follow-up Years 1, 3, and 6 (N = 3,587). Women were categorized as nondecliners or decliners based on changes (from baseline to Year 1) in gait speed and SF-36 PF scores. Logistic regression models were used to estimate incident PCMD (gait speed 36 PF with that of measured gait speed. Slower baseline gait speed and lower SF-36 PF scores were associated with higher adjusted odds of PCMD at Years 3 and 6 (all P 36, decliners were 1.42 times as likely to have developed PCMD by Year 3 and 1.49 times as likely by Year 6. Baseline gait speed (AUC = 0.713) was nonsignificantly better than SF-36 (AUC = 0.705) at predicting PCMD over 6 years (P = .21); including measures at a second time point significantly improved model discrimination for predicting PCMD (all P 36 PF did, although the results may be limited given that gait speed served as a predictor and to define the PCMD outcome. Nonetheless, monitoring trajectories of change in mobility are better predictors of future mobility disability than single measures. © 2018, Copyright the Authors Journal compilation © 2018, The American Geriatrics Society.
FY 1992 Report on results of the survey/research project commissioned by Sunshine Project. Surveys on hydrogen-fired turbines; 1992 nendo suiso nensho turbine no chosa seika hokokusho

Energy Technology Data Exchange (ETDEWEB)

NONE

1993-03-01

Summarized herein are results of comprehensive surveys on hydrogen energy supply/utilization systems, centered by hydrogen-fired turbines for power generation. The surveyed items include hydrogen energy supply/utilization systems on an international scale, current state of power generation techniques and utilization of hydrogen, hydrogen-fired turbines for power generation, materials techniques for hydrogen-fired turbines, studies on and evaluation of economic viability of each system, expected effects, and problems involved in development. The surveys on the hydrogen production techniques pick up electrolysis with a solid polymer electrolyte as a promising candidate, and extract the scaling-up techniques, improvement of membrane durability, etc. as the research themes. The surveys on the hydrogen storage/transportation techniques indicate that hydrogen can be carried by a chemical medium for transportation/storage at normal temperature and pressure, for which the problems associated with medium loss and safety must be studied, and that the research themes for hydrogen-occluding alloys should include increasing quantities of hydrogen occluded for bulk transportation/storage at low energy, and decreasing cost. The surveys on hydrogen-fired turbines extract a number of problems to be solved, e.g., controlling hydrogen combustion, turbine designs, materials withstanding superhigh temperature for high-temperature combustion of hydrogen, and optimization of the power generation systems. (NEDO)
Development of an item bank for food parenting practices based on published instruments and reports from Canadian and US parents.

Science.gov (United States)

O'Connor, Teresia M; Pham, Truc; Watts, Allison W; Tu, Andrew W; Hughes, Sheryl O; Beauchamp, Mark R; Baranowski, Tom; Mâsse, Louise C

2016-08-01

Research to understand how parents influence their children's dietary intake and eating behaviors has expanded in the past decades and a growing number of instruments are available to assess food parenting practices. Unfortunately, there is no consensus on how constructs should be defined or operationalized, making comparison of results across studies difficult. The aim of this study was to develop a food parenting practice item bank with items from published scales and supplement with parenting practices that parents report using. Items from published scales were identified from two published systematic reviews along with an additional systematic review conducted for this study. Parents (n = 135) with children 5-12 years old from the US and Canada, stratified to represent the demographic distribution of each country, were recruited to participate in an online semi-qualitative survey on food parenting. Published items and parent responses were coded using the same framework to reduce the number of items into representative concepts using a binning and winnowing process. The literature contributed 1392 items and parents contributed 1985 items, which were reduced to 262 different food parenting concepts (26% exclusive from literature, 12% exclusive from parents, and 62% represented in both). Food parenting practices related to 'Structure of Food Environment' and 'Behavioral and Educational' were emphasized more by parent responses, while practices related to 'Consistency of Feeding Environment' and 'Emotional Regulation' were more represented among published items. The resulting food parenting item bank should next be calibrated with item response modeling for scientists to use in the future. Copyright © 2016 Elsevier Ltd. All rights reserved.
Item response theory analysis of the life orientation test-revised: age and gender differential item functioning analyses.

Science.gov (United States)

Steca, Patrizia; Monzani, Dario; Greco, Andrea; Chiesi, Francesca; Primi, Caterina

2015-06-01

This study is aimed at testing the measurement properties of the Life Orientation Test-Revised (LOT-R) for the assessment of dispositional optimism by employing item response theory (IRT) analyses. The LOT-R was administered to a large sample of 2,862 Italian adults. First, confirmatory factor analyses demonstrated the theoretical conceptualization of the construct measured by the LOT-R as a single bipolar dimension. Subsequently, IRT analyses for polytomous, ordered response category data were applied to investigate the items' properties. The equivalence of the items across gender and age was assessed by analyzing differential item functioning. Discrimination and severity parameters indicated that all items were able to distinguish people with different levels of optimism and adequately covered the spectrum of the latent trait. Additionally, the LOT-R appears to be gender invariant and, with minor exceptions, age invariant. Results provided evidence that the LOT-R is a reliable and valid measure of dispositional optimism. © The Author(s) 2014.
Development and testing of a novel survey to assess Stakeholder-driven Community Diffusion of childhood obesity prevention efforts.

Science.gov (United States)

Korn, Ariella R; Hennessy, Erin; Hammond, Ross A; Allender, Steven; Gillman, Matthew W; Kasman, Matt; McGlashan, Jaimie; Millar, Lynne; Owen, Brynle; Pachucki, Mark C; Swinburn, Boyd; Tovar, Alison; Economos, Christina D

2018-05-31

Involving groups of community stakeholders (e.g., steering committees) to lead community-wide health interventions appears to support multiple outcomes ranging from policy and systems change to individual biology. While numerous tools are available to measure stakeholder characteristics, many lack detail on reliability and validity, are not context specific, and may not be sensitive enough to capture change over time. This study describes the development and reliability of a novel survey to measure Stakeholder-driven Community Diffusion via assessment of stakeholders' social networks, knowledge, and engagement about childhood obesity prevention. This study was completed in three phases. Phase 1 included conceptualization and online survey development through literature reviews and expert input. Phase 2 included a retrospective study with stakeholders from two completed whole-of-community interventions. Between May-October 2015, 21 stakeholders from the Shape Up Somerville and Romp & Chomp interventions recalled their social networks, knowledge, and engagement pre-post intervention. We also assessed one-week test-retest reliability of knowledge and engagement survey modules among Shape Up Somerville respondents. Phase 3 included survey modifications and a second prospective reliability assessment. Test-retest reliability was assessed in May 2016 among 13 stakeholders involved in ongoing interventions in Victoria, Australia. In Phase 1, we developed a survey with 7, 20 and 50 items for the social networks, knowledge, and engagement survey modules, respectively. In the Phase 2 retrospective study, Shape Up Somerville and Romp & Chomp networks included 99 and 54 individuals. Pre-post Shape Up Somerville and Romp & Chomp mean knowledge scores increased by 3.5 points (95% CI: 0.35-6.72) and (- 0.42-7.42). Engagement scores did not change significantly (Shape Up Somerville: 1.1 points (- 0.55-2.73); Romp & Chomp: 0.7 points (- 0.43-1.73)). Intraclass correlation
Assessing Psychopathy Among Justice Involved Adolescents with the PCL: YV: An Item Response Theory Examination Across Gender

Science.gov (United States)

Tsang, Siny; Schmidt, Karen M.; Vincent, Gina M.; Salekin, Randall T.; Moretti, Marlene M.; Odgers, Candice L.

2014-01-01

This study used an item response theory (IRT) model and a large adolescent sample of justice involved youth (N = 1,007, 38% female) to examine the item functioning of the Psychopathy Checklist – Youth Version (PCL: YV). Items that were most discriminating (or most sensitive to changes) of the latent trait (thought to be psychopathy) among adolescents included “Glibness/superficial charm”, “Lack of remorse”, and “Need for stimulation”, whereas items that were least discriminating included “Pathological lying”, “Failure to accept responsibility”, and “Lacks goals.” The items “Impulsivity” and “Irresponsibility” were the most likely to be rated high among adolescents, whereas “Parasitic lifestyle”, and “Glibness/superficial charm” were the most likely to be rated low. Evidence of differential item functioning (DIF) on four of the 13 items was found between boys and girls. “Failure to accept responsibility” and “Impulsivity” were endorsed more frequently to describe adolescent girls than boys at similar levels of the latent trait, and vice versa for “Grandiose sense of self-worth” and “Lacks goals.” The DIF findings suggest that four PCL: YV items function differently between boys and girls. PMID:25580672
A signal detection-item response theory model for evaluating neuropsychological measures.

Science.gov (United States)

Thomas, Michael L; Brown, Gregory G; Gur, Ruben C; Moore, Tyler M; Patt, Virginie M; Risbrough, Victoria B; Baker, Dewleen G

2018-02-05

Models from signal detection theory are commonly used to score neuropsychological test data, especially tests of recognition memory. Here we show that certain item response theory models can be formulated as signal detection theory models, thus linking two complementary but distinct methodologies. We then use the approach to evaluate the validity (construct representation) of commonly used research measures, demonstrate the impact of conditional error on neuropsychological outcomes, and evaluate measurement bias. Signal detection-item response theory (SD-IRT) models were fitted to recognition memory data for words, faces, and objects. The sample consisted of U.S. Infantry Marines and Navy Corpsmen participating in the Marine Resiliency Study. Data comprised item responses to the Penn Face Memory Test (PFMT; N = 1,338), Penn Word Memory Test (PWMT; N = 1,331), and Visual Object Learning Test (VOLT; N = 1,249), and self-report of past head injury with loss of consciousness. SD-IRT models adequately fitted recognition memory item data across all modalities. Error varied systematically with ability estimates, and distributions of residuals from the regression of memory discrimination onto self-report of past head injury were positively skewed towards regions of larger measurement error. Analyses of differential item functioning revealed little evidence of systematic bias by level of education. SD-IRT models benefit from the measurement rigor of item response theory-which permits the modeling of item difficulty and examinee ability-and from signal detection theory-which provides an interpretive framework encompassing the experimentally validated constructs of memory discrimination and response bias. We used this approach to validate the construct representation of commonly used research measures and to demonstrate how nonoptimized item parameters can lead to erroneous conclusions when interpreting neuropsychological test data. Future work might include the
Software Note: Using BILOG for Fixed-Anchor Item Calibration

Science.gov (United States)

DeMars, Christine E.; Jurich, Daniel P.

2012-01-01

The nonequivalent groups anchor test (NEAT) design is often used to scale item parameters from two different test forms. A subset of items, called the anchor items or common items, are administered as part of both test forms. These items are used to adjust the item calibrations for any differences in the ability distributions of the groups taking…
Differential item functioning analysis with ordinal logistic regression techniques. DIFdetect and difwithpar.

Science.gov (United States)

Crane, Paul K; Gibbons, Laura E; Jolley, Lance; van Belle, Gerald

2006-11-01

We present an ordinal logistic regression model for identification of items with differential item functioning (DIF) and apply this model to a Mini-Mental State Examination (MMSE) dataset. We employ item response theory ability estimation in our models. Three nested ordinal logistic regression models are applied to each item. Model testing begins with examination of the statistical significance of the interaction term between ability and the group indicator, consistent with nonuniform DIF. Then we turn our attention to the coefficient of the ability term in models with and without the group term. If including the group term has a marked effect on that coefficient, we declare that it has uniform DIF. We examined DIF related to language of test administration in addition to self-reported race, Hispanic ethnicity, age, years of education, and sex. We used PARSCALE for IRT analyses and STATA for ordinal logistic regression approaches. We used an iterative technique for adjusting IRT ability estimates on the basis of DIF findings. Five items were found to have DIF related to language. These same items also had DIF related to other covariates. The ordinal logistic regression approach to DIF detection, when combined with IRT ability estimates, provides a reasonable alternative for DIF detection. There appear to be several items with significant DIF related to language of test administration in the MMSE. More attention needs to be paid to the specific criteria used to determine whether an item has DIF, not just the technique used to identify DIF.
Inventions on presenting textual items in Graphical User Interface

OpenAIRE

Mishra, Umakant

2014-01-01

Although a GUI largely replaces textual descriptions by graphical icons, the textual items are not completely removed. The textual items are inevitably used in window titles, message boxes, help items, menu items and popup items. Textual items are necessary for communicating messages that are beyond the limitation of graphical messages. However, it is necessary to harness the textual items on the graphical interface in such a way that they complement each other to produce the best effect. One...
Science Library of Test Items. Volume Eighteen. A Collection of Multiple Choice Test Items Relating Mainly to Chemistry.

Science.gov (United States)

New South Wales Dept. of Education, Sydney (Australia).

As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…
Science Library of Test Items. Volume Seventeen. A Collection of Multiple Choice Test Items Relating Mainly to Biology.

Science.gov (United States)

New South Wales Dept. of Education, Sydney (Australia).

As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…
Science Library of Test Items. Volume Nineteen. A Collection of Multiple Choice Test Items Relating Mainly to Geology.

Science.gov (United States)

New South Wales Dept. of Education, Sydney (Australia).

As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…

Survey of Water Chemistry and Corrosion of NPP

Energy Technology Data Exchange (ETDEWEB)

Jung, Ki Sok; Hong, Bong Geon

2008-06-15

Status of water chemistry of nuclear power plant and materials corrosion has been surveyed. For PWR, system chemistry of primary coolant and secondary coolant as well as the related corrosion of materials was surveyed. For BWR, system chemistry as whole has been surveyed with its accompanying corrosion problems. Radiolysis of coolant water and activation of corrosion products also was surveyed. Future NPP such as supercritical water cooled reactor and fusion reactor has also been surveyed for their water chemistry and corrosion problems. As a result, proposal for some research items has been suggested. Some related corrosion research techniques and electrochemical fundamentals are also presented.
Survey of Water Chemistry and Corrosion of NPP

International Nuclear Information System (INIS)

Jung, Ki Sok; Hong, Bong Geon

2008-06-01

Status of water chemistry of nuclear power plant and materials corrosion has been surveyed. For PWR, system chemistry of primary coolant and secondary coolant as well as the related corrosion of materials was surveyed. For BWR, system chemistry as whole has been surveyed with its accompanying corrosion problems. Radiolysis of coolant water and activation of corrosion products also was surveyed. Future NPP such as supercritical water cooled reactor and fusion reactor has also been surveyed for their water chemistry and corrosion problems. As a result, proposal for some research items has been suggested. Some related corrosion research techniques and electrochemical fundamentals are also presented
Feed mechanism and method for feeding minute items

Science.gov (United States)

Stringer, Timothy Kent [Bucyrus, KS; Yerganian, Simon Scott [Lee's Summit, MO

2009-10-20

A feeding mechanism and method for feeding minute items, such as capacitors, resistors, or solder preforms. The mechanism is adapted to receive a plurality of the randomly-positioned and randomly-oriented extremely small or minute items, and to isolate, orient, and position one or more of the items in a specific repeatable pickup location wherefrom they may be removed for use by, for example, a computer-controlled automated assembly machine. The mechanism comprises a sliding shelf adapted to receive and support the items; a wiper arm adapted to achieve a single even layer of the items; and a pushing arm adapted to push the items into the pickup location. The mechanism can be adapted for providing the items with a more exact orientation, and can also be adapted for use in a liquid environment.
Patient Safety Culture Survey in Pediatric Complex Care Settings: A Factor Analysis.

Science.gov (United States)

Hessels, Amanda J; Murray, Meghan; Cohen, Bevin; Larson, Elaine L

2017-04-19

Children with complex medical needs are increasing in number and demanding the services of pediatric long-term care facilities (pLTC), which require a focus on patient safety culture (PSC). However, no tool to measure PSC has been tested in this unique hybrid acute care-residential setting. The objective of this study was to evaluate the psychometric properties of the Nursing Home Survey on Patient Safety Culture tool slightly modified for use in the pLTC setting. Factor analyses were performed on data collected from 239 staff at 3 pLTC in 2012. Items were screened by principal axis factoring, and the original structure was tested using confirmatory factor analysis. Exploratory factor analysis was conducted to identify the best model fit for the pLTC data, and factor reliability was assessed by Cronbach alpha. The extracted, rotated factor solution suggested items in 4 (staffing, nonpunitive response to mistakes, communication openness, and organizational learning) of the original 12 dimensions may not be a good fit for this population. Nevertheless, in the pLTC setting, both the original and the modified factor solutions demonstrated similar reliabilities to the published consistencies of the survey when tested in adult nursing homes and the items factored nearly identically as theorized. This study demonstrates that the Nursing Home Survey on Patient Safety Culture with minimal modification may be an appropriate instrument to measure PSC in pLTC settings. Additional psychometric testing is recommended to further validate the use of this instrument in this setting, including examining the relationship to safety outcomes. Increased use will yield data for benchmarking purposes across these specialized settings to inform frontline workers and organizational leaders of areas of strength and opportunity for improvement.
What should be included in the assessment of laypersons' paediatric basic life support skills?

DEFF Research Database (Denmark)

Hasselager, Asbjørn Børch; Lauritsen, Torsten; Kristensen, Tim

2018-01-01

BACKGROUND: Assessment of laypersons' Paediatric Basic Life Support (PBLS) skills is important to ensure acquisition of effective PBLS competencies. However limited evidence exists on which PBLS skills are essential for laypersons. The same challenges exist with respect to the assessment of foreign...... body airway obstruction management (FBAOM) skills. We aimed to establish international consensus on how to assess laypersons' PBLS and FBAOM skills. METHODS: A Delphi consensus survey was conducted. Out of a total of 84 invited experts, 28 agreed to participate. During the first Delphi round experts...... suggested items to assess laypersons' PBLS and FBAOM skills. In the second round, the suggested items received comments from and were rated by 26 experts (93%) on a 5-point scale (1 = not relevant to 5 = essential). Revised items were anonymously presented in a third round for comments and 23 (82%) experts...
Development and validation of Neonatal Satisfaction Survey--NSS-13.

Science.gov (United States)

Hagen, Inger H; Vadset, Tove B; Barstad, Johan; Svindseth, Marit F

2015-06-01

The purpose of this study was to develop and validate a survey to investigate parents' satisfaction with neonatal wards in a population of parents of children with a gestation age of ≥24 weeks to 3 months after full-term birth. We explored the literature and conducted three focus groups: two with expert health personnel and one with parents. We tested the survey in a parent population (N = 105) and report the different stages in the validation process along with the full survey, the Neonatal Satisfaction Survey - 13 categories (NSS-13). We found 13 subcategories in the Neonatal Satisfaction Survey. The subcategories measure parents' satisfaction with neonatal units based on staff, admission, nurses, anxiety, siblings (parents' perceptions of caring for the siblings of the newborn), information, timeout, doctors, facilities, nutrition, preparation for discharge, trust and visitors. Each subcategory showed acceptable internal consistency. The full version of the Neonatal Satisfaction Survey presents 69 items, and each subcategory contains two to eleven items. The Neonatal Satisfaction Survey seems suitable to measure parents' satisfaction with neonatal units and can be used in full, but it can also measure subcategories. Parents' satisfaction with neonatal units can be used to improve the quality in such wards. We consider this study as the first in a series to validate the NSS-13. The full survey with subcategories is presented in this paper. © 2014 Nordic College of Caring Science.
Achievement report for fiscal 1982 on Sunshine Program-entrusted research and development. Survey on patent and information (Hydrogen energy); 1982 nendo tokkyo joho chosa kenkyu seika hokokokusho. Suiso energy

Energy Technology Data Exchange (ETDEWEB)

NONE

1983-03-01

Patents related to the research under the Sunshine Program are surveyed so as to ensure that the program be promoted smoothly and efficiently. Since the scope of the hydrogen energy technology is extensive, branches supposed to be relatively important only are surveyed, which include the production of hydrogen (thermochemical process, photochemical process, and electrolysis), storage and transportation of hydrogen, safety of hydrogen, hydrogen fuel cells, hydrogen-fueled engines, and hydrogen combustion devices. The basic policy to follow in the extraction of necessary patents is that all related to the hydrogen energy technology be collected from as many fields as possible. However, it is impossible to read all the laid-open patents. Under such circumstances, out of the items in IPC (International Patent Classification) used by the Patent Agency, those deemed to be closely related to the hydrogen energy technology are designated and, when the classification item attached to the official gazette matches one of the IPC classification items, it is extracted as a desired item after deliberation of its relationship with the hydrogen energy technology. (NEDO)
Applying Hierarchical Model Calibration to Automatically Generated Items.

Science.gov (United States)

Williamson, David M.; Johnson, Matthew S.; Sinharay, Sandip; Bejar, Isaac I.

This study explored the application of hierarchical model calibration as a means of reducing, if not eliminating, the need for pretesting of automatically generated items from a common item model prior to operational use. Ultimately the successful development of automatic item generation (AIG) systems capable of producing items with highly similar…
41 CFR 101-27.404 - Review of items.

Science.gov (United States)

2010-07-01

... 41 Public Contracts and Property Management 2 2010-07-01 2010-07-01 true Review of items. 101-27.404 Section 101-27.404 Public Contracts and Property Management Federal Property Management...-Elimination of Items From Inventory § 101-27.404 Review of items. Except for standby or reserve stocks, items...
Towards an authoring system for item construction

NARCIS (Netherlands)

Rikers, Jos H.A.N.

1988-01-01

The process of writing test items is analyzed, and a blueprint is presented for an authoring system for test item writing to reduce invalidity and to structure the process of item writing. The developmental methodology is introduced, and the first steps in the process are reported. A historical
Modeling Local Item Dependence in Cloze and Reading Comprehension Test Items Using Testlet Response Theory

Science.gov (United States)

Baghaei, Purya; Ravand, Hamdollah

2016-01-01

In this study the magnitudes of local dependence generated by cloze test items and reading comprehension items were compared and their impact on parameter estimates and test precision was investigated. An advanced English as a foreign language reading comprehension test containing three reading passages and a cloze test was analyzed with a…
A systematic review of childhood maltreatment assessments in population-representative surveys since 1990.

Science.gov (United States)

Hovdestad, Wendy; Campeau, Aimée; Potter, Dawn; Tonmyr, Lil

2015-01-01

Population-representative surveys that assess childhood maltreatment and health are a valuable resource to explore the implications of child maltreatment for population health. Systematic identification and evaluation of such surveys is needed to facilitate optimal use of their data and to inform future research. To inform researchers of the existence and nature of population-representative surveys relevant to understanding links between childhood maltreatment and health; to evaluate the assessment of childhood maltreatment in this body of work. We included surveys that: 1) were representative of the non-institutionalized population of any size nation or of any geopolitical region ≥ 10 million people; 2) included a broad age range (≥ 40 years); 3) measured health; 4) assessed childhood maltreatment retrospectively; and 5) were conducted since 1990. We used Internet and database searching (including CINAHL, Embase, ERIC, Global Health, MEDLINE, PsycINFO, Scopus, Social Policy and Practice: January 1990 to March 2014), expert consultation, and other means to identify surveys and associated documentation. Translations of non-English survey content were verified by fluent readers of survey languages. We developed checklists to abstract and evaluate childhood maltreatment content. Fifty-four surveys from 39 countries met inclusion criteria. Sample sizes ranged from 1,287-51,945 and response rates from 15%-96%. Thirteen surveys assessed neglect, 15 emotional abuse; 18 exposure to family violence; 26 physical abuse; 48 sexual abuse. Fourteen surveys assessed more than three types; six of these were conducted since 2010. In nine surveys childhood maltreatment assessments were detailed (+10 items for at least one type of maltreatment). Seven surveys' assessments had known reliability and/or validity. Data from 54 surveys can be used to explore the population health relevance of child maltreatment. Assessment of childhood maltreatment is not comprehensive but there is
Lateral Violence in Nursing Survey: Instrument Development and Validation

Directory of Open Access Journals (Sweden)

Lynne S. Nemeth

2017-07-01

Full Text Available An examination of the psychometric properties of the Lateral Violence in Nursing Survey (LVNS, an instrument previously developed to measure the perceived incidence and severity of lateral violence (LV in the nursing workplace, was carried out. Conceptual clustering and principal components analysis were used with survey responses from 663 registered nurses and ancillary nursing staff in a southeastern tertiary care medical center. Where appropriate, Cronbach’s alpha (α evaluated internal consistency. The prevalence/severity of lateral violence items constitute two distinct subscales (LV by self and others with Cronbach’s alpha of 0.74 and 0.86, respectively. The items asking about potential causes of LV are unidimensional and internally consistent (alpha = 0.77 but there is no conceptually coherent theme underlying the various causes. Respondents rating a potential LV cause as “major” scored higher on both prevalence/severity subscales than those rating it a “minor” cause or not a cause. Subsets of items on the LVNS are internally reliable, supporting construct validity. Revisions of the original LVNS instrument will improve its use in future work.
Report on surveys in fiscal 2000 on the survey on foundation to establish industrial technology strategy. Survey on lifetime education for mechanical engineers; 2000 nendo sangyo gijutsu senryaku sakusei kiban chosa hokokusho. Kikai gijutsusha shogai kyoiku chosa

Energy Technology Data Exchange (ETDEWEB)

NONE

2001-03-01

With an objective to set up a system for full-scale lifetime education, a feasibility study has been performed on preparing new educational materials and performing seminars. The continuing education program developed by the Japan Society of Mechanical Engineers is intended to assist engineers studied under the new engineer system to accumulate actual work experience after having acquired the qualification, and motivate ability maintenance and self-enlightenment after having acquired the engineer qualification, mainly through the training meetings and lecture meetings held by the Japan Society of Mechanical Engineers. The education items in the major field course include dynamics of materials and machines, energy and flow, information and control, and design and production. The items in the general common course include presentation and communication, kinetics learned from actual cases, engineer ethics, product liability and product safety. The seminar was held for two days for engineers graduated from undergraduate departments and experienced in actual works for two years or longer as the object. The actual status survey related to the continuing education by corporations revealed that 83% of the corporations has implemented intra-corporation training, and encouraged positive participation to outside seminars. (NEDO)
10 CFR 835.605 - Labeling items and containers.

Science.gov (United States)

2010-01-01

... 10 Energy 4 2010-01-01 2010-01-01 false Labeling items and containers. 835.605 Section 835.605... items and containers. Except as provided at § 835.606, each item or container of radioactive material... information to permit individuals handling, using, or working in the vicinity of the items or containers to...
Assessing the validity of single-item life satisfaction measures: results from three large samples.

Science.gov (United States)

Cheung, Felix; Lucas, Richard E

2014-12-01

The present paper assessed the validity of single-item life satisfaction measures by comparing single-item measures to the Satisfaction with Life Scale (SWLS)-a more psychometrically established measure. Two large samples from Washington (N = 13,064) and Oregon (N = 2,277) recruited by the Behavioral Risk Factor Surveillance System and a representative German sample (N = 1,312) recruited by the Germany Socio-Economic Panel were included in the present analyses. Single-item life satisfaction measures and the SWLS were correlated with theoretically relevant variables, such as demographics, subjective health, domain satisfaction, and affect. The correlations between the two life satisfaction measures and these variables were examined to assess the construct validity of single-item life satisfaction measures. Consistent across three samples, single-item life satisfaction measures demonstrated substantial degree of criterion validity with the SWLS (zero-order r = 0.62-0.64; disattenuated r = 0.78-0.80). Patterns of statistical significance for correlations with theoretically relevant variables were the same across single-item measures and the SWLS. Single-item measures did not produce systematically different correlations compared to the SWLS (average difference = 0.001-0.005). The average absolute difference in the magnitudes of the correlations produced by single-item measures and the SWLS was very small (average absolute difference = 0.015-0.042). Single-item life satisfaction measures performed very similarly compared to the multiple-item SWLS. Social scientists would get virtually identical answer to substantive questions regardless of which measure they use.
Identifying group-sensitive physical activities: a differential item functioning analysis of NHANES data.

Science.gov (United States)

Gao, Yong; Zhu, Weimo

2011-05-01

The purpose of this study was to identify subgroup-sensitive physical activities (PA) using differential item functioning (DIF) analysis. A sub-unweighted sample of 1857 (men=923 and women=934) from the 2003-2004 National Health and Nutrition Examination Survey PA questionnaire data was used for the analyses. Using the Mantel-Haenszel, the simultaneous item bias test, and the ANOVA DIF methods, 33 specific leisure-time moderate and/or vigorous PA (MVPA) items were analyzed for DIF across race/ethnicity, gender, education, income, and age groups. Many leisure-time MVPA items were identified as large DIF items. When participating in the same amount of leisure-time MVPA, non-Hispanic blacks were more likely to participate in basketball and dance activities than non-Hispanic whites (NHW); NHW were more likely to participated in golf and hiking than non-Hispanic blacks; Hispanics were more likely to participate in dancing, hiking, and soccer than NHW, whereas NHW were more likely to engage in bicycling, golf, swimming, and walking than Hispanics; women were more likely to participate in aerobics, dancing, stretching, and walking than men, whereas men were more likely to engage in basketball, fishing, golf, running, soccer, weightlifting, and hunting than women; educated persons were more likely to participate in jogging and treadmill exercise than less educated persons; persons with higher incomes were more likely to engage in golf than those with lower incomes; and adults (20-59 yr) were more likely to participate in basketball, dancing, jogging, running, and weightlifting than older adults (60+ yr), whereas older adults were more likely to participate in walking and golf than younger adults. DIF methods are able to identify subgroup-sensitive PA and thus provide useful information to help design group-sensitive, targeted interventions for disadvantaged PA subgroups. © 2011 by the American College of Sports Medicine
Obtaining a Proportional Allocation by Deleting Items

NARCIS (Netherlands)

Dorn, B.; de Haan, R.; Schlotter, I.; Röthe, J.

2017-01-01

We consider the following control problem on fair allocation of indivisible goods. Given a set I of items and a set of agents, each having strict linear preference over the items, we ask for a minimum subset of the items whose deletion guarantees the existence of a proportional allocation in the
Item-Based Top-N Recommendation Algorithms

Science.gov (United States)

2003-01-20

basket of items, utilized by many e-commerce sites, cannot take advantage of pre-computed user-to-user similarities. Finally, even though the...not discriminate between items that are present in frequent itemsets and items that are not, while still maintaining the computational advantages of...453219 0.02% 7.74 ccard 42629 68793 398619 0.01% 9.35 ecommerce 6667 17491 91222 0.08% 13.68 em 8002 1648 769311 5.83% 96.14 ml 943 1682 100000 6.31
Self-assessment of competencies in dental education in Germany - a multicentred survey.

Science.gov (United States)

Bitter, K; Rüttermann, S; Lippmann, M; Hahn, P; Giesler, M

2016-11-01

The aim was to assess the competencies of undergraduate dental students in Germany in the domains team competence, communicative competence, learning competence and scholarship. The survey was conducted at 11 dental schools that are equally distributed all over Germany. Competencies were assessed with the Freiburg Questionnaire to Assess Competencies in Medicine (FCM). A short version of the FCM was used in this study. This short form included the four domains: team competence (three items), communicative competence (eight items), learning competence (five items) and scholarship (four items). Students had to rate each item twice: first with regard to the respondent's current level of competence and second with regard to the level of competence that respondents think is required by their job. All items were rated on a five-point Likert scale (1 'very much' and 5 'not at all'). Responsible lecturers from all selected dental schools received another questionnaire to answer the questions whether the FCM domain corresponding learning objectives were taught at the respective dental school. A total of 317 undergraduate students from 11 dental schools in their last clinical year participated. The response rate varied between 48% and 92%. Cronbach's α for the FCM scales addressing the current level of competencies ranged from 0.70 to 0.89 and for the scales measuring the presumed level of competencies demanded by their job ranged from 0.72 to 0.82. The mean values of the scales for the assessment of the presumed level of competencies demanded by the job were significantly lower compared to the mean values of the scales for the current level of competencies (P competence (SRM 1.34), learning competence (SRM 1.27) and communicative competence (SRM 1.18). Overall, the learning objectives that correspond to the assessed domains of competencies were taught to 19.6% completely, to 55.4% partially and to 25% not at all at the participating dental schools. The results of the

Interprofessional Education in the Internal Medicine Clerkship Post-LCME Standard Issuance: Results of a National Survey.

Science.gov (United States)

Alexandraki, Irene; Hernandez, Caridad A; Torre, Dario M; Chretien, Katherine C

2017-08-01

Several decades of work have detailed the value and goals of interprofessional education (IPE) within the health professions, defining IPE competencies and best practices. In 2013, the Liaison Committee for Medical Education (LCME) elevated IPE to a U.S. medical school accreditation standard. To examine the status of IPE within internal medicine (IM) clerkships including perspectives, curricular content, barriers, and assessment a year after the LCME standard issuance. Anonymous online survey. IM clerkship directors from each of the Clerkship Directors in Internal Medicine's 121 U.S. and Canadian member medical schools in 2014. In 2014, a section on IPE (18 items) was included in the Clerkship Directors in Internal Medicine annual survey of its 121 U.S. and Canadian member medical schools. Items (18) assessed clerkship director (CD) perspectives, status of IPE curricula in IM clerkships, and barriers to IPE implementation. Data were analyzed using descriptive statistics and qualitative analysis of free-text responses to one of the survey questions. The overall survey response rate was 78% (94/121). The majority (88%) agreed that IPE is important to the practice of IM, and 71% believed IPE should be part of the IM clerkship. Most (76%) CDs agreed there is need for faculty development programs in IPE; 27% had such a program at their institution. Lack of curricular time, scheduling conflicts, and lack of faculty trained in IPE were the most frequently cited barriers. Twenty-nine percent had formal IPE activities within their IM clerkships, and 38% were planning to make changes. Of those with formal IPE activities, over a third (37%) did not involve student assessment. Since LCME standard issuance, only a minority of IM clerkships have included formal IPE activities, with lectures as the predominant method. Opportunities exist for enhancing educational methods as well as IPE faculty development.
A Review of Classical Methods of Item Analysis.

Science.gov (United States)

French, Christine L.

Item analysis is a very important consideration in the test development process. It is a statistical procedure to analyze test items that combines methods used to evaluate the important characteristics of test items, such as difficulty, discrimination, and distractibility of the items in a test. This paper reviews some of the classical methods for…
Development and validation of a new survey: Perceptions of Teaching as a Profession (PTaP)

Science.gov (United States)

Adams, Wendy

2017-01-01

To better understand the impact of efforts to train more science teachers such as the PhysTEC Project and to help with early identification of future teachers, we are developing the survey of Perceptions of Teaching as a Profession (PTaP) to measure students' views of teaching as a career, their interest in teaching and the perceived climate of physics departments towards teaching as a profession. The instrument consists of a series of statements which require a response using a 5-point Likert-scale and can be easily administered online. The survey items were drafted by a team of researchers and physics teacher candidates and then reviewed by an advisory committee of 20 physics teacher educators and practicing teachers. We conducted 27 interviews with both teacher candidates and non-teaching STEM majors. The survey was refined through an iterative process of student interviews and item clarification until all items were interpreted consistently and answered for consistent reasons. In this presentation the preliminary results from the student interviews as well as the results of item analysis and a factor analysis on 900 student responses will be shared.
Electronics. Criterion-Referenced Test (CRT) Item Bank.

Science.gov (United States)

Davis, Diane, Ed.

This document contains 519 criterion-referenced multiple choice and true or false test items for a course in electronics. The test item bank is designed to work with both the Vocational Instructional Management System (VIMS) and the Vocational Administrative Management System (VAMS) in Missouri. The items are grouped into 15 units covering the…
What Do K-12 Teachers Think about Including Student Surveys in Their Performance Ratings?

Science.gov (United States)

Dretzke, Beverly J.; Sheldon, Timothy D.; Lim, Alicia

2015-01-01

This study investigated K-12 teachers' opinions about the use of student surveys as a component of a teacher evaluation system. Surveys were administered to teachers at the beginning of the school year and again in the spring. Analyses of teachers' responses on the fall survey indicated tentative support for the inclusion of student feedback in…
Readability and Comprehension of the Geriatric Depression Scale and PROMIS® Physical Function Items in Older African Americans and Latinos.

Science.gov (United States)

Paz, Sylvia H; Jones, Loretta; Calderón, José L; Hays, Ron D

2017-02-01

Depression and physical function are particularly important health domains for the elderly. The Geriatric Depression Scale (GDS) and the Patient-Reported Outcomes Measurement Information System (PROMIS ® ) physical function item bank are two surveys commonly used to measure these domains. It is unclear if these two instruments adequately measure these aspects of health in minority elderly. The aim of this study was to estimate the readability of the GDS and PROMIS ® physical function items and to assess their comprehensibility using a sample of African American and Latino elderly. Readability was estimated using the Flesch-Kincaid and Flesch Reading Ease (FRE) formulae for English versions, and a Spanish adaptation of the FRE formula for the Spanish versions. Comprehension of the GDS and PROMIS ® items by minority elderly was evaluated with 30 cognitive interviews. Readability estimates of a number of items in English and Spanish of the GDS and PROMIS ® physical functioning items exceed the U.S. recommended 5th-grade threshold for vulnerable populations, or were rated as 'fairly difficult', 'difficult', or 'very difficult' to read. Cognitive interviews revealed that many participants felt that more than the two (yes/no) GDS response options were needed to answer the questions. Wording of several PROMIS ® items was considered confusing, and interpreting responses was problematic because they were based on using physical aids. Problems with item wording and response options of the GDS and PROMIS ® physical function items may reduce reliability and validity of measurement when used with minority elderly.
26 CFR 301.6501(o)-3 - Partnership items.

Science.gov (United States)

2010-04-01

... 26 Internal Revenue 18 2010-04-01 2010-04-01 false Partnership items. 301.6501(o)-3 Section 301... § 301.6501(o)-3 Partnership items. (a) Partnership item defined. For purposes of section 6501(o) (as it..., and § 301.6511(g)-1, the term “partnership item” means— (1) Any item required to be taken into account...
Safety Gear Decontamination Practices Among Florida Firefighters: Analysis of a Text-Based Survey Methodology.

Science.gov (United States)

Moore, Kevin J; Koru-Sengul, Tulay; Alvarez, Armando; Schaefer-Solle, Natasha; Harrison, Tyler R; Kobetz, Erin N; Caban-Martinez, Alberto J

2018-02-01

Despite the National Fire Protection Association (NFPA) 1851 Personal Protective Equipment Care and Maintenance guidelines, little is known about the routine cleaning of firefighter bunker gear. In collaboration with a large Florida firefighter union, a mobile phone text survey was administered, which included eight questions in an item logic format. In total, 250 firefighters participated in the survey of which 65% reported cleaning their bunker gear in the past 12 months. Approximately 32% ( n = 52) indicated that they had above average confidence in gear cleaning procedures. Arriving at a fire incident response was a significant predictor of gear cleaning in the 12 months preceding survey administration. Using mobile phone-based texting for periodic queries on adherence to NFPA cleaning guidelines and safety message distribution may assist firefighters to increase decontamination procedure frequency.
Uncontrolled Web-based administration of surveys on factual health-related knowledge: a randomized study of untimed versus timed quizzing.

Science.gov (United States)

Domnich, Alexander; Panatto, Donatella; Signori, Alessio; Bragazzi, Nicola Luigi; Cristina, Maria Luisa; Amicizia, Daniela; Gasparini, Roberto

2015-04-13

Health knowledge and literacy are among the main determinants of health. Assessment of these issues via Web-based surveys is growing continuously. Research has suggested that approximately one-fifth of respondents submit cribbed answers, or cheat, on factual knowledge items, which may lead to measurement error. However, little is known about methods of discouraging cheating in Web-based surveys on health knowledge. This study aimed at exploring the usefulness of imposing a survey time limit to prevent help-seeking and cheating. On the basis of sample size estimation, 94 undergraduate students were randomly assigned in a 1:1 ratio to complete a Web-based survey on nutrition knowledge, with or without a time limit of 15 minutes (30 seconds per item); the topic of nutrition was chosen because of its particular relevance to public health. The questionnaire consisted of two parts. The first was the validated consumer-oriented nutrition knowledge scale (CoNKS) consisting of 20 true/false items; the second was an ad hoc questionnaire (AHQ) containing 10 questions that would be very difficult for people without health care qualifications to answer correctly. It therefore aimed at measuring cribbing and not nutrition knowledge. AHQ items were somewhat encyclopedic and amenable to Web searching, while CoNKS items had more complex wording, so that simple copying/pasting of a question in a search string would not produce an immediate correct answer. A total of 72 of the 94 subjects started the survey. Dropout rates were similar in both groups (11%, 4/35 and 14%, 5/37 in the untimed and timed groups, respectively). Most participants completed the survey from portable devices, such as mobile phones and tablets. To complete the survey, participants in the untimed group took a median 2.3 minutes longer than those in the timed group; the effect size was small (Cohen's r=.29). Subjects in the untimed group scored significantly higher on CoNKS (mean difference of 1.2 points, P=.008
A Balance Sheet for Educational Item Banking.

Science.gov (United States)

Hiscox, Michael D.

Educational item banking presents observers with a considerable paradox. The development of test items from scratch is viewed as wasteful, a luxury in times of declining resources. On the other hand, item banking has failed to become a mature technology despite large amounts of money and the efforts of talented professionals. The question of which…
School nutritional capacity, resources and practices are associated with availability of food/beverage items in schools.

Science.gov (United States)

Mâsse, Louise C; de Niet, Judith E

2013-02-19

The school food environment is important to target as less healthful food and beverages are widely available at schools. This study examined whether the availability of specific food/beverage items was associated with a number of school environmental factors. Principals from elementary (n=369) and middle/high schools (n=118) in British Columbia (BC), Canada completed a survey measuring characteristics of the school environment. Our measurement framework integrated constructs from the Theories of Organizational Change and elements from Stillman's Tobacco Policy Framework adapted for obesity prevention. Our measurement framework included assessment of policy institutionalization of nutritional guidelines at the district and school levels, climate, nutritional capacity and resources (nutritional resources and participation in nutritional programs), nutritional practices, and school community support for enacting stricter nutritional guidelines. We used hierarchical mixed-effects logistic regression analyses to examine associations with the availability of fruit, vegetables, pizza/hamburgers/hot dogs, chocolate candy, sugar-sweetened beverages, and french fried potatoes. In elementary schools, fruit and vegetable availability was more likely among schools that have more nutritional resources (OR=6.74 and 5.23, respectively). In addition, fruit availability in elementary schools was highest in schools that participated in the BC School Fruit and Vegetable Nutritional Program and the BC Milk program (OR=4.54 and OR=3.05, respectively). In middle/high schools, having more nutritional resources was associated with vegetable availability only (OR=5.78). Finally, middle/high schools that have healthier nutritional practices (i.e., which align with upcoming provincial/state guidelines) were less likely to have the following food/beverage items available at school: chocolate candy (OR= .80) and sugar-sweetened beverages (OR= .76). School nutritional capacity, resources
Optimization of detector size and scan rate for beta/gamma material release surveys

International Nuclear Information System (INIS)

Bishop, R.V.

1993-01-01

DOE facilities are required to offer for sale to the public items of salvageable value when they are no longer required by the facilities. These items have to be surveyed to ensure radioactive contamination levels do not exceed the values listed in DOE Order 5400.5. Most facilities use portable contamination monitoring.equipment with probe areas between 20 and 100 cm 2 to check for fixed contamination. This procedure is very labor intensive and results in survey costs that often exceed the costs recovered from selling the items. A solution would be to use large area (> 100 cm 2 ) detectors to find and quantify contamination. Large area scintillation detectors that can be used for beta and alpha detection simultaneously are becoming available commercially. Combining these with a rate meter that can differentiate between alpha and beta events can result in a survey that takes considerably less time to do and will save a proportional amount of money in doing so. The use and limitations of this combination of detectors and rate meters will be discussed
Promoting cold-start items in recommender systems.

Science.gov (United States)

Liu, Jin-Hu; Zhou, Tao; Zhang, Zi-Ke; Yang, Zimo; Liu, Chuang; Li, Wei-Min

2014-01-01

As one of the major challenges, cold-start problem plagues nearly all recommender systems. In particular, new items will be overlooked, impeding the development of new products online. Given limited resources, how to utilize the knowledge of recommender systems and design efficient marketing strategy for new items is extremely important. In this paper, we convert this ticklish issue into a clear mathematical problem based on a bipartite network representation. Under the most widely used algorithm in real e-commerce recommender systems, the so-called item-based collaborative filtering, we show that to simply push new items to active users is not a good strategy. Interestingly, experiments on real recommender systems indicate that to connect new items with some less active users will statistically yield better performance, namely, these new items will have more chance to appear in other users' recommendation lists. Further analysis suggests that the disassortative nature of recommender systems contributes to such observation. In a word, getting in-depth understanding on recommender systems could pave the way for the owners to popularize their cold-start products with low costs.
Promoting Cold-Start Items in Recommender Systems

Science.gov (United States)

Liu, Jin-Hu; Zhou, Tao; Zhang, Zi-Ke; Yang, Zimo; Liu, Chuang; Li, Wei-Min

2014-01-01

As one of the major challenges, cold-start problem plagues nearly all recommender systems. In particular, new items will be overlooked, impeding the development of new products online. Given limited resources, how to utilize the knowledge of recommender systems and design efficient marketing strategy for new items is extremely important. In this paper, we convert this ticklish issue into a clear mathematical problem based on a bipartite network representation. Under the most widely used algorithm in real e-commerce recommender systems, the so-called item-based collaborative filtering, we show that to simply push new items to active users is not a good strategy. Interestingly, experiments on real recommender systems indicate that to connect new items with some less active users will statistically yield better performance, namely, these new items will have more chance to appear in other users' recommendation lists. Further analysis suggests that the disassortative nature of recommender systems contributes to such observation. In a word, getting in-depth understanding on recommender systems could pave the way for the owners to popularize their cold-start products with low costs. PMID:25479013
Control of Suspect/Counterfeit and Defective Items

Energy Technology Data Exchange (ETDEWEB)

Sheriff, Marnelle L.

2013-09-03

This procedure implements portions of the requirements of MSC-MP-599, Quality Assurance Program Description. It establishes the Mission Support Alliance (MSA) practices for minimizing the introduction of and identifying, documenting, dispositioning, reporting, controlling, and disposing of suspect/counterfeit and defective items (S/CIs). employees whose work scope relates to Safety Systems (i.e., Safety Class [SC] or Safety Significant [SS] items), non-safety systems and other applications (i.e., General Service [GS]) where engineering has determined that their use could result in a potential safety hazard. MSA implements an effective Quality Assurance (QA) Program providing a comprehensive network of controls and verification providing defense-in-depth by preventing the introduction of S/CIs through the design, procurement, construction, operation, maintenance, and modification of processes. This procedure focuses on those safety systems, and other systems, including critical load paths of lifting equipment, where the introduction of S/CIs would have the greatest potential for creating unsafe conditions.
Mixed-Format Test Score Equating: Effect of Item-Type Multidimensionality, Length and Composition of Common-Item Set, and Group Ability Difference

Science.gov (United States)

Wang, Wei

2013-01-01

Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…
Non-ignorable missingness item response theory models for choice effects in examinee-selected items.

Science.gov (United States)

Liu, Chen-Wei; Wang, Wen-Chung

2017-11-01

Examinee-selected item (ESI) design, in which examinees are required to respond to a fixed number of items in a given set, always yields incomplete data (i.e., when only the selected items are answered, data are missing for the others) that are likely non-ignorable in likelihood inference. Standard item response theory (IRT) models become infeasible when ESI data are missing not at random (MNAR). To solve this problem, the authors propose a two-dimensional IRT model that posits one unidimensional IRT model for observed data and another for nominal selection patterns. The two latent variables are assumed to follow a bivariate normal distribution. In this study, the mirt freeware package was adopted to estimate parameters. The authors conduct an experiment to demonstrate that ESI data are often non-ignorable and to determine how to apply the new model to the data collected. Two follow-up simulation studies are conducted to assess the parameter recovery of the new model and the consequences for parameter estimation of ignoring MNAR data. The results of the two simulation studies indicate good parameter recovery of the new model and poor parameter recovery when non-ignorable missing data were mistakenly treated as ignorable. © 2017 The British Psychological Society.
Differential Item Functioning of Pathological Gambling Criteria: An Examination of Gender, Race/Ethnicity, and Age

OpenAIRE

Sacco, Paul; Torres, Luis R.; Cunningham-Williams, Renee M.; Woods, Carol; Unick, G. Jay

2011-01-01

This study tested for the presence of differential item functioning (DIF) in DSM-IV Pathological Gambling Disorder (PGD) criteria based on gender, race/ethnicity and age. Using a nationally representative sample of adults from the National Epidemiologic Survey on Alcohol and Related Conditions (NESARC), indicating current gambling (n = 10,899), Multiple Indicator-Multiple Cause (MIMIC) models tested for DIF, controlling for income, education, and marital status. Compared to the reference grou...
Item bias in self-reported functional ability among 75-year-old men and women in three Nordic localities

DEFF Research Database (Denmark)

Avlund, K; Era, P; Davidsen, M

1996-01-01

to geographical locality and gender. Information about self-reported functional ability was gathered from surveys on 75-year-old men and women in Glostrup (Denmark), Göteborg (Sweden) and Jyväskylä (Finland). The data were collected by structured home interviews about mobility and Physical activities of daily......The purpose of this article is to analyse item bias in a measure of self-reported functional ability among 75-year-old people in three Nordic localities. The present item bias analysis examines whether the construction of a functional ability index from several variables results in bias in relation...... living (PADL) in relation to tiredness, reduced speed and dependency and combined into three tiredness-scales, three reduced speed-scales and two dependency-scales. The analysis revealed item bias regarding geographical locality in seven out of eight of the functional ability scales, but nearly no bias...
A Case Study on an Item Writing Process: Use of Test Specifications, Nature of Group Dynamics, and Individual Item Writers' Characteristics

Science.gov (United States)

Kim, Jiyoung; Chi, Youngshin; Huensch, Amanda; Jun, Heesung; Li, Hongli; Roullion, Vanessa

2010-01-01

This article discusses a case study on an item writing process that reflects on our practical experience in an item development project. The purpose of the article is to share our lessons from the experience aiming to demystify item writing process. The study investigated three issues that naturally emerged during the project: how item writers use…

Attitudes of non-practicing chiropractors: a pilot survey concerning factors related to attrition

Directory of Open Access Journals (Sweden)

Wyatt Lawrence H

2010-11-01

Full Text Available Abstract Background Research into attitudes about chiropractors who are no longer engaged in active clinical practice is non-existent. Yet non-practicing chiropractors (NPCs represent a valid sub-group worthy of study. Aim The purpose of this research was to assess attrition attitudes of NPCs about the chiropractic profession and develop a scale to assess such attitudes. Methods A 48 item survey was developed using the PsychData software. This survey included 35 Likert-style items assessing various aspects of the profession namely financial, educational, psychosocial and political. An internet discussion site where NPCs may be members was accessed for recruitment purposes. Results A total of 70 valid responses were received for analysis. A majority of respondents were male with 66% being in non-practice status for 3 to 5 years and less with 43% indicating that they had graduated since the year 2000. Most respondents were employed either in other healthcare professions and non-chiropractic education. A majority of NPCs believed that business ethics in chiropractic were questionable and that overhead expense and student loans were factors in practice success. A majority of NPCs were in associate practice at one time with many believing that associates were encouraged to prolong the care of patients and that associate salaries were not fair. Most NPCs surveyed believed that chiropractic was not a good career choice and would not recommend someone to become a chiropractor. From this survey, a 12 item scale was developed called the "chiropractor attrition attitude scale" for future research. Reliability analysis of this novel scale demonstrated a coefficient alpha of 0.90. Conclusion The low response rate indicates that findings cannot be generalized to the NPC population. This study nonetheless demonstrates that NPCs attrition attitudes can be assessed. The lack of a central database of NPCs is a challenge to future research. Appropriate
FY 1992 report on the survey of geothermal development promotion. Supplementary survey on data processing (No.38 - West area of Mt. Aso); 1991 nendo chinetsu kaihatsu sokushin chosa. Data shori ni kakawaru hosoku chosa hokokusho (No.38 Asosan seibu chiiki)

Energy Technology Data Exchange (ETDEWEB)

NONE

1993-09-01

As a part of the survey of geothermal development promotion in FY 1992, chemical/isotopic analysis of fumarolic gas in the Yoshioka district was made to elucidate the underground geothermal structure in the west area of Mt. Aso in Kumamoto Prefecture. Items for analysis of fumarolic gas were 16 items including the temperature, concentration of non-condensable gas, CO2, H2O, CH{sub 4}, {delta}D(CH{sub 4}) and {delta}{sup 13}C(CO2). Items for analysis of condensed water were 9 items including pH, Na, NH{sub 4}, {delta}D(H2O) and {delta}{sup 18}O. As a result of the analysis, the main component of non-condensable gas of fumarolic gas was CO2, and the composition was similar to that of the fumarolic gas in the Yunoya/Tarutama district in the periphery. It was presumed that the origin and formation mechanism of fumarolic gas were also similar to those in the Yunoya/Tarutama district. It was presumed that the deep geothermal reservoir which is the source of vapor/gas generation was composed of the neutral or alkalescent geothermal water, and a possibility that the reservoir is connected with the deep geothermal reservoir in the Yunoya district was presumed from a viewpoint of geographical location. (NEDO)
Guideline for the seismic technical evaluation of replacement items for nuclear power plants

International Nuclear Information System (INIS)

Harris, S.P.; Cushing, R.W.; Johnson, H.W.; Abeles, J.M.

1993-02-01

Seismic qualification for equipment originally installed in nuclear power plants was typically performed by the original equipment suppliers or manufactures (OES/OEM). Many of the OES/OEM no longer maintain quality assurance programs with adequate controls for supplying nuclear equipment. Utilities themselves must provide reasonable assurance in the continued seismic adequacy of such replacement items. This guideline provides practical, cost-effective techniques which can be used to provide reasonable assurance that replacement items will meet seismic performance requirements necessary to maintain the seismic design basis of commercial nuclear power plants. It also provides a method for determining when a seismic technical evaluation of replacement items (STERI) is required as part of the procurement process for spare and replacement items. Guidance on supplier program requirements necessary to maintain continued seismic adequacy and on documentation of maintaining required seismic adequacy is also included
The Effects of Item Format and Cognitive Domain on Students' Science Performance in TIMSS 2011

Science.gov (United States)

Liou, Pey-Yan; Bulut, Okan

2017-12-01

The purpose of this study was to examine eighth-grade students' science performance in terms of two test design components, item format, and cognitive domain. The portion of Taiwanese data came from the 2011 administration of the Trends in International Mathematics and Science Study (TIMSS), one of the major international large-scale assessments in science. The item difficulty analysis was initially applied to show the proportion of correct items. A regression-based cumulative link mixed modeling (CLMM) approach was further utilized to estimate the impact of item format, cognitive domain, and their interaction on the students' science scores. The results of the proportion-correct statistics showed that constructed-response items were more difficult than multiple-choice items, and that the reasoning cognitive domain items were more difficult compared to the items in the applying and knowing domains. In terms of the CLMM results, students tended to obtain higher scores when answering constructed-response items as well as items in the applying cognitive domain. When the two predictors and the interaction term were included together, the directions and magnitudes of the predictors on student science performance changed substantially. Plausible explanations for the complex nature of the effects of the two test-design predictors on student science performance are discussed. The results provide practical, empirical-based evidence for test developers, teachers, and stakeholders to be aware of the differential function of item format, cognitive domain, and their interaction in students' science performance.
Analysis of Chemical Composition of Non-Ferrous Metal Items from the Ananyino Burial Ground

Directory of Open Access Journals (Sweden)

Saprykina Irina А.

2016-03-01

Full Text Available The article presents results of an analysis conducted by the authors in order to study chemical composition of items from non-ferrous metals found on the Ananyino burial ground. A number of research methods, including OES, XRF and TXRF was applied to study a selection of 387 samples of arrow- and spearheads, celts, tail-pieces, warhammers, poleaxes, knives and daggers, as well as items of attire and jewelry, some sporadic details of harness and bridle. The fi ndings are quite comparable. The results were classifi ed by the geochemical principle of 1,0% alloyage threshold. It was found out that the sample primarily consists of copper items, including “pure” copper and copper with a wide range of trace elements (particularly, Ni, As, Sb. The core (48% consists of copper items with traces of antimony and arsenic, or “pure” copper (7%, tin or triple bronze (40%; it also includes some other types of alloys based on copper or silver (5%. As the analysis has shown, complex ores seem to be the most probable source of copper. Traditionally, the Urals, the Sayan and the Altay Mountains, Kazakhstan and the Northern Caucasus were regarded as the most probable minefi elds to supply ores to the barren regions of Eastern Europe. While ore sources for products made of metallurgical “pure” copper are localized within the Ural mining and metallurgical region, metal sources for items cast from different groups of alloys (rather than imports of ready-made products require further research.
Cross-cultural equivalence of the organisational culture survey in Australia

Directory of Open Access Journals (Sweden)

R. Erwee

2001-09-01

Full Text Available The aim of this study is to assess whether the cross-cultural equivalence of the Organisational Culture Survey (OCS persist in an Australian context. The nature of the instrument is presented which includes a clear statement of its South African origin and its’ place within a logical positivist paradigm. The sample consisted of 326 respondents from a population of managers of the Australian Institute of Management. This study confirms the instrument’s validity and internal consistency within an Australian context, but that further research is required into the functional and conceptual equivalence of the survey items and dimensions underpinning the items to conclusively establish its utility. Finally, aspects of the ‘organisational culture’ construct underlying the survey need revision given recent trends in related systems, complexity and chaos theories. Opsomming Technikons propageer die beoefening van ko˛peratiewe onderwys,’n opvoedkundige strategiewat leer deur produktiewewerkservaring integreermet die teoretiese kurrikulum. ByTechnikon SAegter, het slegs sowat 35% van die formele programme ’n verpligte leerervarings komponent.Teoretiese-begrondings navorsingsmetodologie is gebruik omsekere basiese veronderstellings van akademiese personeel te bepaal. Eerder as om’n spesifieke navorsingsprobleemas vertrekpunt te gebruik, ondersoek teoretiese-begronding’n areavan belang en laat die metodiek die relevante sake toe omte voorskyn te kom. Semi-gestruktureerde onderhoude,met vier ope vrae, is gevoer met ’n gestratifiseerde eweskansige steekproef van 25 akademiese personeellede vanTechnikon SA. Daar is bevind dat alhoewel daar beperkte oortuiging en gewillige uitlewing van kooperatiewe onderwys is, is dit nie beduidend as kenmerkend van die organisasie kultuur vanTechnikon SA nie.
Automated Item Generation with Recurrent Neural Networks.

Science.gov (United States)

von Davier, Matthias

2018-03-12

Utilizing technology for automated item generation is not a new idea. However, test items used in commercial testing programs or in research are still predominantly written by humans, in most cases by content experts or professional item writers. Human experts are a limited resource and testing agencies incur high costs in the process of continuous renewal of item banks to sustain testing programs. Using algorithms instead holds the promise of providing unlimited resources for this crucial part of assessment development. The approach presented here deviates in several ways from previous attempts to solve this problem. In the past, automatic item generation relied either on generating clones of narrowly defined item types such as those found in language free intelligence tests (e.g., Raven's progressive matrices) or on an extensive analysis of task components and derivation of schemata to produce items with pre-specified variability that are hoped to have predictable levels of difficulty. It is somewhat unlikely that researchers utilizing these previous approaches would look at the proposed approach with favor; however, recent applications of machine learning show success in solving tasks that seemed impossible for machines not too long ago. The proposed approach uses deep learning to implement probabilistic language models, not unlike what Google brain and Amazon Alexa use for language processing and generation.
Assessing the Validity of Single-item Life Satisfaction Measures: Results from Three Large Samples

Science.gov (United States)

Cheung, Felix; Lucas, Richard E.

2014-01-01

Purpose The present paper assessed the validity of single-item life satisfaction measures by comparing single-item measures to the Satisfaction with Life Scale (SWLS) - a more psychometrically established measure. Methods Two large samples from Washington (N=13,064) and Oregon (N=2,277) recruited by the Behavioral Risk Factor Surveillance System (BRFSS) and a representative German sample (N=1,312) recruited by the Germany Socio-Economic Panel (GSOEP) were included in the present analyses. Single-item life satisfaction measures and the SWLS were correlated with theoretically relevant variables, such as demographics, subjective health, domain satisfaction, and affect. The correlations between the two life satisfaction measures and these variables were examined to assess the construct validity of single-item life satisfaction measures. Results Consistent across three samples, single-item life satisfaction measures demonstrated substantial degree of criterion validity with the SWLS (zero-order r = 0.62 – 0.64; disattenuated r = 0.78 – 0.80). Patterns of statistical significance for correlations with theoretically relevant variables were the same across single-item measures and the SWLS. Single-item measures did not produce systematically different correlations compared to the SWLS (average difference = 0.001 – 0.005). The average absolute difference in the magnitudes of the correlations produced by single-item measures and the SWLS were very small (average absolute difference = 0.015 −0.042). Conclusions Single-item life satisfaction measures performed very similarly compared to the multiple-item SWLS. Social scientists would get virtually identical answer to substantive questions regardless of which measure they use. PMID:24890827
Science Library of Test Items. Volume Twenty-Two. A Collection of Multiple Choice Test Items Relating Mainly to Skills.

Science.gov (United States)

New South Wales Dept. of Education, Sydney (Australia).

As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…
Science Library of Test Items. Volume Twenty. A Collection of Multiple Choice Test Items Relating Mainly to Physics, 1.

Science.gov (United States)

New South Wales Dept. of Education, Sydney (Australia).

As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…
Burnout among Canadian Psychiatry Residents: A National Survey

Science.gov (United States)

Halli, Priyanka; Ogrodniczuk, John S.; Hadjipavlou, George

2016-01-01

Objective: Burnout is a serious problem for health care providers that has implications for clinical practice and personal health. While burnout is known to affect residents, no studies have examined the prevalence or impact of burnout among Canadian psychiatry residents. Method: Residents in all Canadian psychiatry training programs were surveyed between May 1, 2014, and July 1, 2014. The survey included a well-validated, single-item measure to assess symptoms of burnout, several demographic questions, and Likert-scale items to assess residents’ appraisals of empathic functioning and strategies for coping with stress from patient encounters. Results: Responses were obtained from 400 residents, for a response rate of 48%. Twenty-one percent (N = 84) of residents reported symptoms of burnout. Burnout was reported more frequently by residents in postgraduate year 2 than by those in other years and was associated with engagement in personal psychotherapy during residency. No association was found between burnout and age, gender, or location of residency program. Residents who endorsed symptoms of burnout reported higher levels of compromised empathic functioning, were less likely to consult with supervisors about stressful clinical experiences, and were more likely to engage in unhealthy coping strategies. Conclusions: Symptoms of burnout affect one-fifth of Canadian psychiatry residents. The associations between burnout symptoms and problematic clinical and personal functioning suggest areas of concern for those involved in the training of Canadian psychiatry residents. PMID:27310237
Optimal pricing and marketing planning for deteriorating items.

Science.gov (United States)

Moosavi Tabatabaei, Seyed Reza; Sadjadi, Seyed Jafar; Makui, Ahmad

2017-01-01

Optimal pricing and marketing planning plays an essential role in production decisions on deteriorating items. This paper presents a mathematical model for a three-level supply chain, which includes one producer, one distributor and one retailer. The proposed study considers the production of a deteriorating item where demand is influenced by price, marketing expenditure, quality of product and after-sales service expenditures. The proposed model is formulated as a geometric programming with 5 degrees of difficulty and the problem is solved using the recent advances in optimization techniques. The study is supported by several numerical examples and sensitivity analysis is performed to analyze the effects of the changes in different parameters on the optimal solution. The preliminary results indicate that with the change in parameters influencing on demand, inventory holding, inventory deteriorating and set-up costs change and also significantly affect total revenue.
Measuring psychological trauma after spinal cord injury: Development and psychometric characteristics of the SCI-QOL Psychological Trauma item bank and short form.

Science.gov (United States)

Kisala, Pamela A; Victorson, David; Pace, Natalie; Heinemann, Allen W; Choi, Seung W; Tulsky, David S

2015-05-01

To describe the development and psychometric properties of the SCI-QOL Psychological Trauma item bank and short form. Using a mixed-methods design, we developed and tested a Psychological Trauma item bank with patient and provider focus groups, cognitive interviews, and item response theory based analytic approaches, including tests of model fit, differential item functioning (DIF) and precision. We tested a 31-item pool at several medical institutions across the United States, including the University of Michigan, Kessler Foundation, Rehabilitation Institute of Chicago, the University of Washington, Craig Hospital and the James J. Peters/Bronx Veterans Administration hospital. A total of 716 individuals with SCI completed the trauma items The 31 items fit a unidimensional model (CFI=0.952; RMSEA=0.061) and demonstrated good precision (theta range between 0.6 and 2.5). Nine items demonstrated negligible DIF with little impact on score estimates. The final calibrated item bank contains 19 items The SCI-QOL Psychological Trauma item bank is a psychometrically robust measurement tool from which a short form and a computer adaptive test (CAT) version are available.
Psychometric properties of the Global Operative Assessment of Laparoscopic Skills (GOALS) using item response theory.

Science.gov (United States)

Watanabe, Yusuke; Madani, Amin; Ito, Yoichi M; Bilgic, Elif; McKendy, Katherine M; Feldman, Liane S; Fried, Gerald M; Vassiliou, Melina C

2017-02-01

The extent to which each item assessed using the Global Operative Assessment of Laparoscopic Skills (GOALS) contributes to the total score remains unknown. The purpose of this study was to evaluate the level of difficulty and discriminative ability of each of the 5 GOALS items using item response theory (IRT). A total of 396 GOALS assessments for a variety of laparoscopic procedures over a 12-year time period were included. Threshold parameters of item difficulty and discrimination power were estimated for each item using IRT. The higher slope parameters seen with "bimanual dexterity" and "efficiency" are indicative of greater discriminative ability than "depth perception", "tissue handling", and "autonomy". IRT psychometric analysis indicates that the 5 GOALS items do not demonstrate uniform difficulty and discriminative power, suggesting that they should not be scored equally. "Bimanual dexterity" and "efficiency" seem to have stronger discrimination. Weighted scores based on these findings could improve the accuracy of assessing individual laparoscopic skills. Copyright © 2016 Elsevier Inc. All rights reserved.
Identifying the Source of Misfit in Item Response Theory Models.

Science.gov (United States)

Liu, Yang; Maydeu-Olivares, Alberto

2014-01-01

When an item response theory model fails to fit adequately, the items for which the model provides a good fit and those for which it does not must be determined. To this end, we compare the performance of several fit statistics for item pairs with known asymptotic distributions under maximum likelihood estimation of the item parameters: (a) a mean and variance adjustment to bivariate Pearson's X(2), (b) a bivariate subtable analog to Reiser's (1996) overall goodness-of-fit test, (c) a z statistic for the bivariate residual cross product, and (d) Maydeu-Olivares and Joe's (2006) M2 statistic applied to bivariate subtables. The unadjusted Pearson's X(2) with heuristically determined degrees of freedom is also included in the comparison. For binary and ordinal data, our simulation results suggest that the z statistic has the best Type I error and power behavior among all the statistics under investigation when the observed information matrix is used in its computation. However, if one has to use the cross-product information, the mean and variance adjusted X(2) is recommended. We illustrate the use of pairwise fit statistics in 2 real-data examples and discuss possible extensions of the current research in various directions.
Item Banks for Substance Use from the Patient-Reported Outcomes Measurement Information System (PROMIS®): Severity of Use and Positive Appeal of Use*

Science.gov (United States)

Pilkonis, Paul A.; Yu, Lan; Dodds, Nathan E.; Johnston, Kelly L.; Lawrence, Suzanne; Hilton, Thomas F.; Daley, Dennis C.; Patkar, Ashwin A.; McCarty, Dennis

2015-01-01

Background Two item banks for substance use were developed as part of the Patient-Reported Outcomes Measurement Information System (PROMIS®): severity of substance use and positive appeal of substance use. Methods Qualitative item analysis (including focus groups, cognitive interviewing, expert review, and item revision) reduced an initial pool of more than 5,300 items for substance use to 119 items included in field testing. Items were written in a first-person, past-tense format, with 5 response options reflecting frequency or severity. Both 30-day and 3-month time frames were tested. The calibration sample of 1,336 respondents included 875 individuals from the general population (ascertained through an internet panel) and 461patients from addiction treatment centers participating in the National Drug Abuse Treatment Clinical Trials Network. Results Final banks of 37 and 18 items were calibrated for severity of substance use and positive appeal of substance use, respectively, using the two-parameter graded response model from item response theory (IRT). Initial calibrations were similar for the 30-day and 3-month time frames, and final calibrations used data combined across the time frames, making the items applicable with either interval. Seven-item static short forms were also developed from each item bank. Conclusions Test information curves showed that the PROMIS item banks provided substantial information in a broad range of severity, making them suitable for treatment, observational, and epidemiological research in both clinical and community settings. PMID:26423364
Does remembering emotional items impair recall of same-emotion items?

Science.gov (United States)

Sison, Jo Ann G; Mather, Mara

2007-04-01

In the part-set cuing effect, cuing a subset of previously studied items impairs recall of the remaining noncued items. This experiment reveals that cuing participants with previously-studied emotional pictures (e.g., fear-evoking pictures of people) can impair recall of pictures involving the same emotion but different content (e.g., fear-evoking pictures of animals). This indicates that new events can be organized in memory using emotion as a grouping function to create associations. However, whether new information is organized in memory along emotional or nonemotional lines appears to be a flexible process that depends on people's current focus. Mentioning in the instructions that the pictures were either amusement- or fear-related led to memory impairment for pictures with the same emotion as cued pictures, whereas mentioning that the pictures depicted either animals or people led to memory impairment for pictures with the same type of actor.
Modeling Item-Level and Step-Level Invariance Effects in Polytomous Items Using the Partial Credit Model

Science.gov (United States)

Gattamorta, Karina A.; Penfield, Randall D.; Myers, Nicholas D.

2012-01-01

Measurement invariance is a common consideration in the evaluation of the validity and fairness of test scores when the tested population contains distinct groups of examinees, such as examinees receiving different forms of a translated test. Measurement invariance in polytomous items has traditionally been evaluated at the item-level,…
Item Information in the Rasch Model

NARCIS (Netherlands)

Engelen, Ron J.H.; van der Linden, Willem J.; Oosterloo, Sebe J.

1988-01-01

Fisher's information measure for the item difficulty parameter in the Rasch model and its marginal and conditional formulations are investigated. It is shown that expected item information in the unconditional model equals information in the marginal model, provided the assumption of sampling
The InVEST Volcanic Concept Survey: Exploring Student Understanding about Volcanoes

Science.gov (United States)

Parham, Thomas L., Jr.; Cervato, Cinzia; Gallus, William A., Jr.; Larsen, Michael; Hobbs, Jon; Stelling, Pete; Greenbowe, Thomas; Gupta, Tanya; Knox, John A.; Gill, Thomas E.

2010-01-01

Results from the Volcanic Concept Survey (VCS) indicated that many undergraduates do not fully understand volcanic systems and plate tectonics. During the 2006 academic year, a ten-item conceptual survey was distributed to undergraduate students enrolled in Earth science courses at five U.S. colleges and universities. A trained team of graders…

48 CFR 1845.7210-1 - Utilization surveys.

Science.gov (United States)

2010-10-01

... report Government-owned plant equipment in accordance with FAR 45.502(g) and 45.509-2(b)(4). Items that... ADMINISTRATION CONTRACT MANAGEMENT GOVERNMENT PROPERTY Contract Property Management 1845.7210-1 Utilization surveys. (a) The property administrator is responsible for ensuring that the contractor has effective...
Work ability as prognostic risk marker of disability pension : Single-item work ability score versus multi-item work ability index

NARCIS (Netherlands)

Roelen, C.A.M.; Rhenen, van W.; Groothoff, J.W.; Klink, van der J.J.L.; Twisk, W.R.; Heymans, M.W.

2014-01-01

Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP.
Psychometric properties of the revised Malay version Medical Outcome Study Social Support Survey using confirmatory factor analysis among postpartum mothers.

Science.gov (United States)

Norhayati, Mohd Noor; Aniza, Abd Aziz; Nik Hazlina, Nik Hussain; Azman, Mohd Yacob

2015-12-01

Social support is an essential component for the physical and emotional well-being of postpartum mothers. The objective of this study is to determine the psychometric properties of the revised Malay version Medical Outcome Study (MOS) Social Support Survey using a confirmatory validity approach. A cross-sectional study was conducted involving 144 postpartum mothers attending Obstetric and Gynecology Clinic, Universiti Sains Malaysia Hospital. Construct validity and internal consistency assessment was performed after the translation, content validity and face validity process. The data were analyzed using SPSS 20.0 (SPSS Inc., Chicago, IL, USA) and AMOS 20.0 (SPSS Inc., Chicago, IL, USA). The original questionnaire consists of four domains (emotional/informational support, tangible support, affectionate support and positive social interaction) and 19 items. Affectionate support domain with three items only was treated as a separate construct and was not included in the factor analysis. The final confirmatory model with three constructs and 13 items demonstrated acceptable factor loadings, domain to domain correlation and best fit; (χ2[df]=1.665 [61]; P-value=0.001; Tucker-Lewis Index=0.944; comparative fit index=0.956; root mean square error of approximation=0.068). Composite reliability, average variance extracted and Cronbach's α of the domains ranged from 0.649 to 0.903; 0.390 to 0.699; 0.616 to 0.902, respectively. The study suggested that the four-factor model with 16 items (including one separate factor of affectionate) of the revised Malay version MOS Social Support Survey was acceptable to be used to measure social support after childbirth because it is valid, reliable and simple. © 2015 Wiley Publishing Asia Pty Ltd.
The patient safety climate in healthcare organizations (PSCHO) survey: Short-form development.

Science.gov (United States)

Benzer, Justin K; Meterko, Mark; Singer, Sara J

2017-08-01

Measures of safety climate are increasingly used to guide safety improvement initiatives. However, cost and respondent burden may limit the use of safety climate surveys. The purpose of this study was to develop a 15- to 20-item safety climate survey based on the Patient Safety Climate in Healthcare Organizations survey, a well-validated 38-item measure of safety climate. The Patient Safety Climate in Healthcare Organizations was administered to all senior managers, all physicians, and a 10% random sample of all other hospital personnel in 69 private sector hospitals and 30 Veterans Health Administration hospitals. Both samples were randomly divided into a derivation sample to identify a short-form subset and a confirmation sample to assess the psychometric properties of the proposed short form. The short form consists of 15 items represented 3 overarching domains in the long-form scale-organization, work unit, and interpersonal. The proposed short form efficiently captures 3 important sources of variance in safety climate: organizational, work-unit, and interpersonal. The short-form development process was a practical method that can be applied to other safety climate surveys. This safety climate short form may increase response rates in studies that involve busy clinicians or repeated measures. Published 2017. This article is a U.S. Government work and is in the public domain in the USA.
School nutrition survey.

Science.gov (United States)

O'Connor, M; Kiely, D; Mulvihill, M; Winters, A; Bollard, C; Hamilton, A; Corrigan, C; Moore, E

1993-05-01

Food we eat has an important influence on health and well-being. Many eating habits are established in childhood. 456 children aged eight to 12 years participated in this survey of food eaten at school. Of all the food items eaten as a snack, 48.6% were categorised as junk. 75.8% of the sandwiches brought to school for lunch were made with white bread. Of the remaining food items brought for lunch 63.5% were of the junk variety. Compared with those who brought a snack or lunch from home, those given money to buy their own were more likely to eat junk (p daily food intake but health food practises for even a third of food intake may be of a value for health and long term eating habits. Nutritional education with the reinforcement of high nutritional standards in schools could improve the situation.
CERN Running Club – Sale of Items

CERN Multimedia

CERN Running club

2018-01-01

The CERN Running Club is organising a sale of items on 26 June from 11:30 – 13:00 in the entry area of Restaurant 2 (504 R-202). The items for sale are souvenir prizes of past Relay Races and comprise: Backpacks, thermos, towels, gloves & caps, lamps, long sleeve winter shirts and windproof vest. All items will be sold at 5 CHF.
Results of the staff survey: your priorities

CERN Multimedia

Staff Association

2014-01-01

This is the first in a series of articles which will give some details about the results of the Staff Association staff survey To know your priorities and the evolution of your concerns over the last decade we study how, in each of our latest three surveys, you chose from a list of 15 items the five most important and classified them by assigning them a priority, from the most important to the fifth most important. The list of fifteen items, and a short description, follows. Career evolution (classification, level of recruitment, advancement, promotion) Salary level Family policy (recognition of partners, allowances, school fees, kindergarten, nursery, crèche, parental leave) Health insurance Non-residence and international indemnity Annual salary adjustment (cost variation index) Contract policy (duration, recruitment, award of IC, conditions of the beginning and ending of the contract) Motivation at work (interest, team, supervision, mobility, reward scheme) Pensions (retirement, disability, o...
Collaboration challenges in systematic reviews: a survey of health sciences librarians.

Science.gov (United States)

Nicholson, Joey; McCrillis, Aileen; Williams, Jeff D

2017-10-01

While many librarians have been asked to participate in systematic reviews with researchers, often these researchers are not familiar with the systematic review process or the appropriate role for librarians. The purpose of this study was to identify the challenges and barriers that librarians face when collaborating on systematic reviews. To take a wider view of the whole process of collaborating on systematic reviews, the authors deliberately focused on interpersonal and methodological issues other than searching itself. To characterize the biggest challenges that librarians face while collaborating on systematic review projects, we used a web-based survey. The thirteen-item survey included seventeen challenges grouped into two categories: methodological and interpersonal. Participants were required to indicate the frequency and difficulty of the challenges listed. Open-ended questions allowed survey participants to describe challenges not listed in the survey and to describe strategies used to overcome challenges. Of the 17 challenges listed in the survey, 8 were reported as common by over 40% of respondents. These included methodological issues around having too broad or narrow research questions, lacking eligibility criteria, having unclear research questions, and not following established methods. The remaining challenges were interpersonal, including issues around student-led projects and the size of the research team. Of the top 8 most frequent challenges, 5 were also ranked as most difficult to handle. Open-ended responses underscored many of the challenges included in the survey and revealed several additional challenges. These results suggest that the most frequent and challenging issues relate to development of the research question and general communication with team members. Clear protocols for collaboration on systematic reviews, as well as a culture of mentorship, can help librarians prevent and address these challenges.
The Protective Behavioral Strategies for Marijuana Scale: Further examination using item response theory.

Science.gov (United States)

Pedersen, Eric R; Huang, Wenjing; Dvorak, Robert D; Prince, Mark A; Hummer, Justin F

2017-08-01

Given recent state legislation legalizing marijuana for recreational purposes and majority popular opinion favoring these laws, we developed the Protective Behavioral Strategies for Marijuana scale (PBSM) to identify strategies that may mitigate the harms related to marijuana use among those young people who choose to use the drug. In the current study, we expand on the initial exploratory study of the PBSM to further validate the measure with a large and geographically diverse sample (N = 2,117; 60% women, 30% non-White) of college students from 11 different universities across the United States. We sought to develop a psychometrically sound item bank for the PBSM and to create a short assessment form that minimizes respondent burden and time. Quantitative item analyses, including exploratory and confirmatory factor analyses with item response theory (IRT) and evaluation of differential item functioning (DIF), revealed an item bank of 36 items that was examined for unidimensionality and good content coverage, as well as a short form of 17 items that is free of bias in terms of gender (men vs. women), race (White vs. non-White), ethnicity (Hispanic vs. non-Hispanic), and recreational marijuana use legal status (state recreational marijuana was legal for 25.5% of participants). We also provide a scoring table for easy transformation from sum scores to IRT scale scores. The PBSM item bank and short form associated strongly and negatively with past month marijuana use and consequences. The measure may be useful to researchers and clinicians conducting intervention and prevention programs with young adults. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Teaching implementation science in a new Master of Science Program in Germany: a survey of stakeholder expectations.

Science.gov (United States)

Ullrich, Charlotte; Mahler, Cornelia; Forstner, Johanna; Szecsenyi, Joachim; Wensing, Michel

2017-04-27

Implementation science in healthcare is an evolving discipline in German-speaking countries. In 2015, the Medical Faculty of the University of Heidelberg, Germany, implemented a two-year full-time Master of Science program Health Services Research and Implementation Science. The curriculum introduces implementation science in the context of a broader program that also covers health services research, healthcare systems, research methods, and generic academic skills. Our aim was to assess the expectations of different stakeholder groups regarding the master's program. An online survey listing desired competencies of prospective graduates was developed and administered to four groups: national experts in the field (including potential employers of graduates), teaching staff, enrolled students, and prospective students (N = 169). Competencies were extracted from the curriculum's module handbook. A five-point Likert scale was used for the assessment of 42 specific items. Data were analyzed descriptively. A total of 83 people participated in the survey (response rate 49%). The online survey showed a strong agreement across the groups concerning the desired competencies of graduates. About two-thirds of the listed competencies (27 items) were felt to be crucial or very important by 80% or more of participants, with little difference between stakeholder groups. Of the eight items specifically related to implementation in practice, six were in this category. Knowledge of implementation strategies (90% very important), knowledge of barriers and enablers of implementation (89%), and knowledge of evidence-based practice (89%) were the top priorities. The master's program is largely orientated towards the desired competencies of graduates according to students, teaching staff, and national experts.
A New Extension of the Binomial Error Model for Responses to Items of Varying Difficulty in Educational Testing and Attitude Surveys.

Directory of Open Access Journals (Sweden)

James A Wiley

Full Text Available We put forward a new item response model which is an extension of the binomial error model first introduced by Keats and Lord. Like the binomial error model, the basic latent variable can be interpreted as a probability of responding in a certain way to an arbitrarily specified item. For a set of dichotomous items, this model gives predictions that are similar to other single parameter IRT models (such as the Rasch model but has certain advantages in more complex cases. The first is that in specifying a flexible two-parameter Beta distribution for the latent variable, it is easy to formulate models for randomized experiments in which there is no reason to believe that either the latent variable or its distribution vary over randomly composed experimental groups. Second, the elementary response function is such that extensions to more complex cases (e.g., polychotomous responses, unfolding scales are straightforward. Third, the probability metric of the latent trait allows tractable extensions to cover a wide variety of stochastic response processes.
Practice of radiation dose control for tech-modification items in Qinshan Nuclear Power Plant

International Nuclear Information System (INIS)

Zhang Yong; Chen Zhongyu; Xu Hongming; Fan Liguang; Jiang Jianqi; Bu Weidong

2006-01-01

In order to improve the safety and reliability of nuclear power plant operation, many tech-modifications related to system or equipment have been completed since operation in Qinshan NPP. this paper introduces radiation dose control for mainly tech-modifications items related to radiation, including radiation protection optimization measures and experience in aspects of item planning, program writing, process control, etc. (authors)
Work ability as prognostic risk marker of disability pension: single-item work ability score versus multi-item work ability index

NARCIS (Netherlands)

Roelen, C.A.M.; van Rhenen, W.; Groothoff, J.W.; van der Klink, J.J.L.; Twisk, J.W.R.; Heymans, M.W.

2014-01-01

Objectives Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP. Methods This
Work ability as prognostic risk marker of disability pension : single-item work ability score versus multi-item work ability index

NARCIS (Netherlands)

Roelen, Corne A. M.; van Rhenen, Willem; Groothoff, Johan W.; van der Klink, Jac J. L.; Twisk, Jos W. R.; Heymans, Martijn W.

Objectives Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP. Methods This
Development and Evaluation of the PROMIS® Pediatric Positive Affect Item Bank, Child-Report and Parent-Proxy Editions.

Science.gov (United States)

Forrest, Christopher B; Ravens-Sieberer, Ulrike; Devine, Janine; Becker, Brandon D; Teneralli, Rachel; Moon, JeanHee; Carle, Adam; Tucker, Carole A; Bevans, Katherine B

2018-03-01

The purpose of this study is to describe the psychometric evaluation and item response theory calibration of the PROMIS Pediatric Positive Affect item bank, child-report and parent-proxy editions. The initial item pool comprising 53 items, previously developed using qualitative methods, was administered to 1,874 children 8-17 years old and 909 parents of children 5-17 years old. Analyses included descriptive statistics, reliability, factor analysis, differential item functioning, and construct validity. A total of 14 items were deleted, because of poor psychometric performance, and an 8-item short form constructed from the remaining 39 items was administered to a national sample of 1,004 children 8-17 years old, and 1,306 parents of children 5-17 years old. The combined sample was used in item response theory (IRT) calibration analyses. The final item bank appeared unidimensional, the items appeared locally independent, and the items were free from differential item functioning. The scales showed excellent reliability and convergent and discriminant validity. Positive affect decreased with children's age and was lower for those with a special health care need. After IRT calibration, we found that 4 and 8 item short forms had a high degree of precision (reliability) across a wide range of the latent trait (>4 SD units). The PROMIS Pediatric Positive Affect item bank and its short forms provide an efficient, precise, and valid assessment of positive affect in children and youth.
Validation Study for the Brief Measure of Quality of Life and Quality of Care: A Questionnaire for the National Random Sampling Hospital Survey.

Science.gov (United States)

Shimizu, Megumi; Fujisawa, Daisuke; Kurihara, Miho; Sato, Kazuki; Morita, Tatsuya; Kato, Masashi; Miyashita, Mitsunori

2017-08-01

To monitor quality of life (QOL) for patients with cancer in a large population-based survey, we developed a short QOL and quality-of-care (QOC) questionnaire. To determine the validity and reliability of this new questionnaire for evaluating QOL in patients with cancer. Outpatients and inpatients at National Cancer Center Hospital East were administered a questionnaire, including the following items-the short QOL and QOC questionnaire (physical distress, pain, emotional distress, walk burden, and need for help with self-care; perceived general health status; and satisfaction with medical care and treatment by doctor, communication with doctor, support by health-care staff other than doctor, care for physical symptoms such as pain, and psychological care), the Functional Assessment of Cancer Therapy-General (FACT-G), the Cancer Care Evaluation Scale (CCES) for patients, and demographic and medical data. We then readministered the short QOL and QOC questionnaire. In total, 329 outpatients and 239 inpatients completed the survey (response rates: 80% and 90%, respectively). Total Cronbach α for the short QOL and QOC questionnaire was 0.83 for outpatients and 0.82 for inpatients. Items of the questionnaire correlated with cancer-specific measurements, FACT-G, and CCES. Intraclass correlation coefficients for all items of the questionnaire were 0.79 and 0.89 in each setting. Items of QOL and QOC did not correlate with each other. The validity and reliability of the short QOL and QOC questionnaire appear sufficient. This questionnaire enables continuous monitoring of patient QOL in large population-based surveys.
5 CFR 532.221 - Industries included in regular nonappropriated fund surveys.

Science.gov (United States)

2010-01-01

... nonappropriated fund surveys. 532.221 Section 532.221 Administrative Personnel OFFICE OF PERSONNEL MANAGEMENT... wholesalers. 44132 Tire dealers. 44311 Appliance, television, and other electronic stores. 44411 Home centers. 44611 Pharmacies and drug stores. 4471 Gasoline stations. 44814 Family clothing stores. 4521 Department...
Acceptance issues for large items and difficult waste

International Nuclear Information System (INIS)

Palmer, J.; Lock, Peter

2002-01-01

Peter Lock described some particular cases which had given rise to difficult acceptance issues at NIREX, ranging from large size items to the impacts of chemicals used during decontamination on the mobility of radionuclides in a disposal facility: The UK strategy for intermediate level and certain low level radioactive waste disposal is based on production of cementitious waste-forms packaged in a standard range of containers as follows: 500 litre Drum - the normal container for most operational ILW (0.8 m diameter x 1.2 m high); 3 m"3 Box - a larger container for solid wastes (1.72 m x 1.72 m plan x 1.2 m high); 3 m"3 Drum - a larger container for in-drum mixing and immobilisation of sludge waste-forms (1.72 m diameter x 1.2 m high); 4 m Box - for large items of waste, especially from decommissioning (4.0 m x 2.4 m plan x 2.2 m high); 2 m LLW Box - for higher-density wastes (2.0 m x 2.4 m plan x 2.2 m high). In addition the majority of LLW is packaged by supercompaction followed by grouting in modified ISO freight containers (6 m x 2.5 m x 2.5 m). Some wastes do not fit easily into this strategy. These wastes include: very large items, (too big for the 4 m box) which, if dealt with whole, pose transport and disposal problems. These items are discussed further in Section 2; waste whose characteristics make packaging difficult. Such wastes are described in more detail in Section 3
Factorial Structure and Age-Related Psychometrics of the MIDUS Personality Adjective Items across the Lifespan

Science.gov (United States)

Zimprich, Daniel; Allemand, Mathias; Lachman, Margie E.

2014-01-01

The present study addresses issues of measurement invariance and comparability of factor parameters of Big Five personality adjective items across age. Data from the Midlife in the United States (MIDUS) survey were used to investigate age-related developmental psychometrics of the MIDUS personality adjective items in two large cross-sectional samples (exploratory sample: N = 862; analysis sample: N = 3,000). After having established and replicated a comprehensive five-factor structure of the measure, increasing levels of measurement invariance were tested across ten age groups. Results indicate that the measure demonstrates strict measurement invariance in terms of number of factors and factor loadings. Also, we found that factor variances and covariances were equal across age groups. By contrast, a number of age-related factor mean differences emerged. The practical implications of these results are discussed and future research is suggested. PMID:21910548
Binomial test models and item difficulty

NARCIS (Netherlands)

van der Linden, Willem J.

1979-01-01

In choosing a binomial test model, it is important to know exactly what conditions are imposed on item difficulty. In this paper these conditions are examined for both a deterministic and a stochastic conception of item responses. It appears that they are more restrictive than is generally

Geophex Airborne Unmanned Survey System

International Nuclear Information System (INIS)

Won, I.L.; Keiswetter, D.

1995-01-01

Ground-based surveys place personnel at risk due to the proximity of buried unexploded ordnance (UXO) items or by exposure to radioactive materials and hazardous chemicals. The purpose of this effort is to design, construct, and evaluate a portable, remotely-piloted, airborne, geophysical survey system. This non-intrusive system will provide stand-off capability to conduct surveys and detect buried objects, structures, and conditions of interest at hazardous locations. During a survey, the operators remain remote from, but within visual distance of, the site. The sensor system never contacts the Earth, but can be positioned near the ground so that weak geophysical anomalies can be detected. The Geophex Airborne Unmanned Survey System (GAUSS) is designed to detect and locate small-scale anomalies at hazardous sites using magnetic and electromagnetic survey techniques. The system consists of a remotely-piloted, radio-controlled, model helicopter (RCH) with flight computer, light-weight geophysical sensors, an electronic positioning system, a data telemetry system, and a computer base-station. The report describes GAUSS and its test results
Geophex Airborne Unmanned Survey System

Energy Technology Data Exchange (ETDEWEB)

Won, I.L.; Keiswetter, D.

1995-12-31

Ground-based surveys place personnel at risk due to the proximity of buried unexploded ordnance (UXO) items or by exposure to radioactive materials and hazardous chemicals. The purpose of this effort is to design, construct, and evaluate a portable, remotely-piloted, airborne, geophysical survey system. This non-intrusive system will provide stand-off capability to conduct surveys and detect buried objects, structures, and conditions of interest at hazardous locations. During a survey, the operators remain remote from, but within visual distance of, the site. The sensor system never contacts the Earth, but can be positioned near the ground so that weak geophysical anomalies can be detected. The Geophex Airborne Unmanned Survey System (GAUSS) is designed to detect and locate small-scale anomalies at hazardous sites using magnetic and electromagnetic survey techniques. The system consists of a remotely-piloted, radio-controlled, model helicopter (RCH) with flight computer, light-weight geophysical sensors, an electronic positioning system, a data telemetry system, and a computer base-station. The report describes GAUSS and its test results.
[Survey of student pharmacists' attitudes toward new procedures expected for future pharmacists].

Science.gov (United States)

Tokunaga, Jin; Takamura, Norito; Ogata, Kenji; Yoshida, Hiroki; Setoguchi, Nao; Sato, Keizo

2010-06-01

Bedsides conventional bedside training the Department of Pharmacy of Kyushu University of Health and Welfare covers advanced practices focused on new procedures expected for future pharmacists. A questionnaire survey was conducted among the 4th year students of the 6-year curriculum of the department in order to retrospectively evaluate their attitudes toward basic life support, and the necessity and feasibility of items related to the training. Sixty-nine percent of the students responded that they would provide appropriate treatment under a situation where basic life support was needed. The item regarded as most necessary and feasible before training was "treatment for basic life support--cardiopulmonary resuscitation." After training, however, "checking vital signs," "physical assessment," and "pharmacist's assistance in medication" were the items rated as equal to or higher than "treatment for basic life support--cardiopulmonary resuscitation." The lowest ranked item in terms of necessity and feasibility both before and after training was "intramuscular/subcutaneous injection," followed by "intravenous injection" and "normal intravenous collection of blood" in that order. The results of this attitude survey demonstrated that many students were willing to perform such operations as part of checking vital signs and physical assessment.
Measuring resilience after spinal cord injury: Development, validation and psychometric characteristics of the SCI-QOL Resilience item bank and short form.

Science.gov (United States)

Victorson, David; Tulsky, David S; Kisala, Pamela A; Kalpakjian, Claire Z; Weiland, Brian; Choi, Seung W

2015-05-01

To describe the development and psychometric properties of the Spinal Cord Injury--Quality of Life (SCI-QOL) Resilience item bank and short form. Using a mixed-methods design, we developed and tested a resilience item bank through the use of focus groups with individuals with SCI and clinicians with expertise in SCI, cognitive interviews, and item-response theory based analytic approaches, including tests of model fit and differential item functioning (DIF). We tested a 32-item pool at several medical institutions across the United States, including the University of Michigan, Kessler Foundation, the Rehabilitation Institute of Chicago, the University of Washington, Craig Hospital and the James J. Peters/Bronx Department of Veterans Affairs medical center. A total of 717 individuals with SCI completed the Resilience items. A unidimensional model was observed (CFI=0.968; RMSEA=0.074) and measurement precision was good (theta range between -3.1 and 0.9). Ten items were flagged for DIF, however, after examination of effect sizes we found this to be negligible with little practical impact on score estimates. The final calibrated item bank resulted in 21 retained items. This study indicates that the SCI-QOL Resilience item bank represents a psychometrically robust measurement tool. Short form items are also suggested and computer adaptive tests are available.
Vegetable parenting practices scale: Item response modeling analyses

Science.gov (United States)

Our objective was to evaluate the psychometric properties of a vegetable parenting practices scale using multidimensional polytomous item response modeling which enables assessing item fit to latent variables and the distributional characteristics of the items in comparison to the respondents. We al...
Boundary curves of individual items in the distribution of total depressive symptom scores approximate an exponential pattern in a general population.

Science.gov (United States)

Tomitaka, Shinichiro; Kawasaki, Yohei; Ide, Kazuki; Akutagawa, Maiko; Yamada, Hiroshi; Furukawa, Toshiaki A; Ono, Yutaka

2016-01-01

Previously, we proposed a model for ordinal scale scoring in which individual thresholds for each item constitute a distribution by each item. This lead us to hypothesize that the boundary curves of each depressive symptom score in the distribution of total depressive symptom scores follow a common mathematical model, which is expressed as the product of the frequency of the total depressive symptom scores and the probability of the cumulative distribution function of each item threshold. To verify this hypothesis, we investigated the boundary curves of the distribution of total depressive symptom scores in a general population. Data collected from 21,040 subjects who had completed the Center for Epidemiologic Studies Depression Scale (CES-D) questionnaire as part of a national Japanese survey were analyzed. The CES-D consists of 20 items (16 negative items and four positive items). The boundary curves of adjacent item scores in the distribution of total depressive symptom scores for the 16 negative items were analyzed using log-normal scales and curve fitting. The boundary curves of adjacent item scores for a given symptom approximated a common linear pattern on a log normal scale. Curve fitting showed that an exponential fit had a markedly higher coefficient of determination than either linear or quadratic fits. With negative affect items, the gap between the total score curve and boundary curve continuously increased with increasing total depressive symptom scores on a log-normal scale, whereas the boundary curves of positive affect items, which are not considered manifest variables of the latent trait, did not exhibit such increases in this gap. The results of the present study support the hypothesis that the boundary curves of each depressive symptom score in the distribution of total depressive symptom scores commonly follow the predicted mathematical model, which was verified to approximate an exponential mathematical pattern.
Boundary curves of individual items in the distribution of total depressive symptom scores approximate an exponential pattern in a general population

Directory of Open Access Journals (Sweden)

Shinichiro Tomitaka

2016-10-01

Full Text Available Background Previously, we proposed a model for ordinal scale scoring in which individual thresholds for each item constitute a distribution by each item. This lead us to hypothesize that the boundary curves of each depressive symptom score in the distribution of total depressive symptom scores follow a common mathematical model, which is expressed as the product of the frequency of the total depressive symptom scores and the probability of the cumulative distribution function of each item threshold. To verify this hypothesis, we investigated the boundary curves of the distribution of total depressive symptom scores in a general population. Methods Data collected from 21,040 subjects who had completed the Center for Epidemiologic Studies Depression Scale (CES-D questionnaire as part of a national Japanese survey were analyzed. The CES-D consists of 20 items (16 negative items and four positive items. The boundary curves of adjacent item scores in the distribution of total depressive symptom scores for the 16 negative items were analyzed using log-normal scales and curve fitting. Results The boundary curves of adjacent item scores for a given symptom approximated a common linear pattern on a log normal scale. Curve fitting showed that an exponential fit had a markedly higher coefficient of determination than either linear or quadratic fits. With negative affect items, the gap between the total score curve and boundary curve continuously increased with increasing total depressive symptom scores on a log-normal scale, whereas the boundary curves of positive affect items, which are not considered manifest variables of the latent trait, did not exhibit such increases in this gap. Discussion The results of the present study support the hypothesis that the boundary curves of each depressive symptom score in the distribution of total depressive symptom scores commonly follow the predicted mathematical model, which was verified to approximate an
Efficient Algorithms for Segmentation of Item-Set Time Series

Science.gov (United States)

Chundi, Parvathi; Rosenkrantz, Daniel J.

We propose a special type of time series, which we call an item-set time series, to facilitate the temporal analysis of software version histories, email logs, stock market data, etc. In an item-set time series, each observed data value is a set of discrete items. We formalize the concept of an item-set time series and present efficient algorithms for segmenting a given item-set time series. Segmentation of a time series partitions the time series into a sequence of segments where each segment is constructed by combining consecutive time points of the time series. Each segment is associated with an item set that is computed from the item sets of the time points in that segment, using a function which we call a measure function. We then define a concept called the segment difference, which measures the difference between the item set of a segment and the item sets of the time points in that segment. The segment difference values are required to construct an optimal segmentation of the time series. We describe novel and efficient algorithms to compute segment difference values for each of the measure functions described in the paper. We outline a dynamic programming based scheme to construct an optimal segmentation of the given item-set time series. We use the item-set time series segmentation techniques to analyze the temporal content of three different data sets—Enron email, stock market data, and a synthetic data set. The experimental results show that an optimal segmentation of item-set time series data captures much more temporal content than a segmentation constructed based on the number of time points in each segment, without examining the item set data at the time points, and can be used to analyze different types of temporal data.
Optimising the selection of food items for food frequency questionnaires using Mixed Integer Linear Programming

NARCIS (Netherlands)

Lemmen-Gerdessen, van J.C.; Souverein, O.W.; Veer, van 't P.; Vries, de J.H.M.

2015-01-01

Objective To support the selection of food items for FFQs in such a way that the amount of information on all relevant nutrients is maximised while the food list is as short as possible. Design Selection of the most informative food items to be included in FFQs was modelled as a Mixed Integer Linear
Contamination of clothing and other items by sweat during exercise 201Tl myocardial perfusion scintigraphy

International Nuclear Information System (INIS)

Yokoo, Shigeki; Niio, Yasuo; Yamamoto, Tomoaki; Miyashita, Makoto

1999-01-01

We measured the radioactivity on patient's upper and lower garments, towels, broad sashes for the bust, and electrodes contaminated by sweat due to exercise 201 Tl myocardial perfusion scintigraphy. In measuring activity, a scintillation survey meter adjusted to the energy of 201 Tl was used. In measuring the radioactivity of clothing, more than 4 Bq/cm 2 was considered to be a significant level of contamination. We detected contamination in 30% of upper garments and towels, 19% of broad sashes, 8% of lower garments and 4% of electrodes. Among these materials, several items of clothing and other items showed contamination exceeding 40 Bq/cm 2 . Towels were remarkably contaminated, with one towel showing a maximum contamination level of 420 Bq/cm 2 . Examinations done by exercise 201 Tl myocardial perfusion scintigraphy often result in the contamination of clothing and other items through sweating. This contamination is especially common in summer, particularly in upper garments and towels. The contamination ratio for towels was over 50%. The contamination ratio increased as the level of exercise became more difficult. When the exercise load was more than 100 W, the contamination ratio was 50%. In cases of extreme contamination, images of contaminated upper garments could be obtained by the scintigraphy camera. The areas of high activity on the images seemed to correspond to areas of the body where sweating was profuse. Based on these results, we should pay close attention to the handling of clothing and other items used in exercise testing by 201 Tl myocardial perfusion scintigraphy and the points used in measuring contaminated clothing and other items after testing. (author)
Optimal pricing and marketing planning for deteriorating items.

Directory of Open Access Journals (Sweden)

Seyed Reza Moosavi Tabatabaei

Full Text Available Optimal pricing and marketing planning plays an essential role in production decisions on deteriorating items. This paper presents a mathematical model for a three-level supply chain, which includes one producer, one distributor and one retailer. The proposed study considers the production of a deteriorating item where demand is influenced by price, marketing expenditure, quality of product and after-sales service expenditures. The proposed model is formulated as a geometric programming with 5 degrees of difficulty and the problem is solved using the recent advances in optimization techniques. The study is supported by several numerical examples and sensitivity analysis is performed to analyze the effects of the changes in different parameters on the optimal solution. The preliminary results indicate that with the change in parameters influencing on demand, inventory holding, inventory deteriorating and set-up costs change and also significantly affect total revenue.
Optimal pricing and marketing planning for deteriorating items

Science.gov (United States)

Moosavi Tabatabaei, Seyed Reza; Sadjadi, Seyed Jafar; Makui, Ahmad

2017-01-01

Optimal pricing and marketing planning plays an essential role in production decisions on deteriorating items. This paper presents a mathematical model for a three-level supply chain, which includes one producer, one distributor and one retailer. The proposed study considers the production of a deteriorating item where demand is influenced by price, marketing expenditure, quality of product and after-sales service expenditures. The proposed model is formulated as a geometric programming with 5 degrees of difficulty and the problem is solved using the recent advances in optimization techniques. The study is supported by several numerical examples and sensitivity analysis is performed to analyze the effects of the changes in different parameters on the optimal solution. The preliminary results indicate that with the change in parameters influencing on demand, inventory holding, inventory deteriorating and set-up costs change and also significantly affect total revenue. PMID:28306750
The Eating Motivation Survey: results from the USA, India and Germany.

Science.gov (United States)

Sproesser, Gudrun; Ruby, Matthew B; Arbit, Naomi; Rozin, Paul; Schupp, Harald T; Renner, Britta

2018-02-01

Research has shown that there is a large variety of different motives underlying why people eat what they eat, which can be assessed with The Eating Motivation Survey (TEMS). The present study investigates the consistency and measurement invariance of the fifteen basic motives included in TEMS in countries with greatly differing eating environments. The fifteen-factor structure of TEMS (brief version: forty-six items) was tested in confirmatory factor analyses. An online survey was conducted. US-American, Indian and German adults (total N 749) took part. Despite the complexity of the model, fit indices indicated a reasonable model fit (for the total sample: χ 2/df=4·03; standardized root-mean-squared residual (SRMR)=0·063; root-mean-square error of approximation (RMSEA)=0·064 (95 % CI 0·062, 0·066)). Only the comparative fit index (CFI) was below the recommended threshold (for the total sample: CFI=0·84). Altogether, 181 out of 184 item loadings were above the recommended threshold of 0·30. Furthermore, the factorial structure of TEMS was invariant across countries with respect to factor configuration and factor loadings (configural v. metric invariance model: ΔCFI=0·009; ΔRMSEA=0·001; ΔSRMR=0·001). Moreover, forty-three out of forty-six items showed invariant intercepts across countries. The fifteen-factor structure of TEMS was, in general, confirmed across countries despite marked differences in eating environments. Moreover, latent means of fourteen out of fifteen motive factors can be compared across countries in future studies. This is a first step towards determining generalizability of the fifteen basic eating motives of TEMS across eating environments.
Negative effects of item repetition on source memory

OpenAIRE

Kim, Kyungmi; Yi, Do-Joon; Raye, Carol L.; Johnson, Marcia K.

2012-01-01

In the present study, we explored how item repetition affects source memory for new item–feature associations (picture–location or picture–color). We presented line drawings varying numbers of times in Phase 1. In Phase 2, each drawing was presented once with a critical new feature. In Phase 3, we tested memory for the new source feature of each item from Phase 2. Experiments 1 and 2 demonstrated and replicated the negative effects of item repetition on incidental source memory. Prior item re...
A Survey of Secondary School Students' Reading Strategy Use ...

African Journals Online (AJOL)

A Survey of Secondary School Students' Reading Strategy Use, Teachers' ... Jimma Zone as well as their English teachers' perceived use of reading strategies ... 16 items that deal with the reading strategies they use when they teach reading ...
Assessing difference between classical test theory and item ...

African Journals Online (AJOL)

Assessing difference between classical test theory and item response theory methods in scoring primary four multiple choice objective test items. ... All research participants were ranked on the CTT number correct scores and the corresponding IRT item pattern scores from their performance on the PRISMADAT. Wilcoxon ...
The basics of item response theory using R

CERN Document Server

Baker, Frank B

2017-01-01

This graduate-level textbook is a tutorial for item response theory that covers both the basics of item response theory and the use of R for preparing graphical presentation in writings about the theory. Item response theory has become one of the most powerful tools used in test construction, yet one of the barriers to learning and applying it is the considerable amount of sophisticated computational effort required to illustrate even the simplest concepts. This text provides the reader access to the basic concepts of item response theory freed of the tedious underlying calculations. It is intended for those who possess limited knowledge of educational measurement and psychometrics. Rather than presenting the full scope of item response theory, this textbook is concise and practical and presents basic concepts without becoming enmeshed in underlying mathematical and computational complexities. Clearly written text and succinct R code allow anyone familiar with statistical concepts to explore and apply item re...
Science Library of Test Items. Volume Twenty-One. A Collection of Multiple Choice Test Items Relating Mainly to Physics, 2.

Science.gov (United States)

New South Wales Dept. of Education, Sydney (Australia).

As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…
Sonographer practitioner development in Australia: Qualitative analysis of an Australian sonographers' survey

International Nuclear Information System (INIS)

McGregor, Rodney; O'Loughlin, Kate; Cox, Jennifer; Clarke, Jill; Snowden, Adrian

2009-01-01

Sonographer practitioner development involves the expansion and extension of the sonographer role to include reporting on ultrasound examinations. Australian sonographers have not seen the same degree of role extension and expansion as their counterparts in the United Kingdom, despite increasing levels of discussion regarding sonographer practitioner development. The aim of this study was to determine if Australian sonographers want to extend their professional role and what they consider are the important issues associated with role extension. This paper reports on qualitative data derived from a survey of Australian sonographers and investigates if Australian sonographers are interested in extending and expanding their professional role and responsibilities and, if they do, what might be necessary or desirable from a professional point of view for this change to occur. A survey was mailed to all members of the Australian Sonographers Association (ASA) in October 2006. The 31-item survey included 28 closed-ended and 3 opened-ended items to provide both quantitative and qualitative data. The quantitative data will be reported separately. Qualitative data was derived from responses to the opened-ended questions, which asked respondents to elaborate on their attitudes and feelings about role extension and development. Analysis used Nvivo7 software to aid in uncovering common themes from the qualitative data. The analysis focused on the reported incentives or motivations for becoming a sonographer practitioner as well as disincentives or perceived hurdles that would discourage respondents from becoming sonographer practitioners. The three most reported incentives or motivations for becoming a sonographer practitioner were professional recognition, remuneration and increased knowledge. The three most commonly reported disincentives or perceived hurdles that would discourage respondents from becoming sonographer practitioners were legal issues, insurance and further
Method effects: the problem with negatively versus positively keyed items.

Science.gov (United States)

Lindwall, Magnus; Barkoukis, Vassilis; Grano, Caterina; Lucidi, Fabio; Raudsepp, Lennart; Liukkonen, Jarmo; Thøgersen-Ntoumani, Cecilie

2012-01-01

Using confirmatory factor analyses, we examined method effects on Rosenberg's Self-Esteem Scale (RSES; Rosenberg, 1965) in a sample of older European adults. Nine hundred forty nine community-dwelling adults 60 years of age or older from 5 European countries completed the RSES as well as measures of depression and life satisfaction. The 2 models that had an acceptable fit with the data included method effects. The method effects were associated with both positively and negatively worded items. Method effects models were invariant across gender and age, but not across countries. Both depression and life satisfaction predicted method effects. Individuals with higher depression scores and lower life satisfaction scores were more likely to endorse negatively phrased items.

Few items in the thyroid-related quality of life instrument ThyPRO exhibited differential item functioning

DEFF Research Database (Denmark)

Watt, Torquil; Grønvold, Mogens; Hegedüs, Laszlo

2014-01-01

To evaluate the extent of differential item functioning (DIF) within the thyroid-specific quality of life patient-reported outcome measure, ThyPRO, according to sex, age, education and thyroid diagnosis.......To evaluate the extent of differential item functioning (DIF) within the thyroid-specific quality of life patient-reported outcome measure, ThyPRO, according to sex, age, education and thyroid diagnosis....
Psychometric evaluation of an inpatient consumer survey measuring satisfaction with psychiatric care.

Science.gov (United States)

Ortiz, Glorimar; Schacht, Lucille

2012-01-01

Measurement of consumers' satisfaction in psychiatric settings is important because it has been correlated with improved clinical outcomes and administrative measures of high-quality care. These consumer satisfaction measurements are actively used as performance measures required by the accreditation process and for quality improvement activities. Our objectives were (i) to re-evaluate, through exploratory factor analysis (EFA) and confirmatory factor analysis (CFA), the structure of an instrument intended to measure consumers' satisfaction with care in psychiatric settings and (ii) to examine and publish the psychometric characteristics, validity and reliability, of the Inpatient Consumer Survey (ICS). To psychometrically test the structure of the ICS, 34 878 survey results, submitted by 90 psychiatric hospitals in 2008, were extracted from the Behavioral Healthcare Performance Measurement System (BHPMS). Basic descriptive item-response and correlation analyses were performed for total surveys. Two datasets were randomly created for analysis. A random sample of 8229 survey results was used for EFA. Another random sample of 8261 consumer survey results was used for CFA. This same sample was used to perform validity and reliability analyses. The item-response analysis showed that the mean range for a disagree/agree five-point scale was 3.10-3.94. Correlation analysis showed a strong relationship between items. Six domains (dignity, rights, environment, empowerment, participation, and outcome) with internal reliabilities between good to moderate (0.87-0.73) were shown to be related to overall care satisfaction. Overall reliability for the instrument was excellent (0.94). Results from CFA provided support for the domains structure of the ICS proposed through EFA. The overall findings from this study provide evidence that the ICS is a reliable measure of consumer satisfaction in psychiatric inpatient settings. The analysis has shown the ICS to provide valid and
Exploring the importance of different items as reasons for leaving emergency medical services between fully compensated, partially compensated, and non-compensated/volunteer samples.

Science.gov (United States)

Blau, Gary; Chapman, Susan; Gibson, Gregory; Bentley, Melissa A

2011-01-01

The purpose of our study was to investigate the importance of different items as reasons for leaving the Emergency Medical Service (EMS) profession. An exit survey was returned by three distinct EMS samples: 127 full compensated, 45 partially compensated and 72 non-compensated/volunteer respondents, who rated the importance of 17 different items for affecting their decision to leave EMS. Unfortunately, there were a high percentage of "not applicable" responses for 10 items. We focused on those seven items that had a majority of useable responses across the three samples. Results showed that the desire for better pay and benefits was a more important reason for leaving EMS for the partially compensated versus fully compensated respondents. Perceived lack of advancement opportunity was a more important reason for leaving for the partially compensated and volunteer groups versus the fully compensated group. Study limitations are discussed and suggestions for future research offered.
Item bias detection in the Hospital Anxiety and Depression Scale using structural equation modeling: comparison with other item bias detection methods

NARCIS (Netherlands)

Verdam, M.G.E.; Oort, F.J.; Sprangers, M.A.G.

Purpose Comparison of patient-reported outcomes may be invalidated by the occurrence of item bias, also known as differential item functioning. We show two ways of using structural equation modeling (SEM) to detect item bias: (1) multigroup SEM, which enables the detection of both uniform and
Calibration of Automatically Generated Items Using Bayesian Hierarchical Modeling.

Science.gov (United States)

Johnson, Matthew S.; Sinharay, Sandip

For complex educational assessments, there is an increasing use of "item families," which are groups of related items. However, calibration or scoring for such an assessment requires fitting models that take into account the dependence structure inherent among the items that belong to the same item family. C. Glas and W. van der Linden…
ACER Chemistry Test Item Collection. ACER Chemtic Year 12.

Science.gov (United States)

Australian Council for Educational Research, Hawthorn.

The chemistry test item banks contains 225 multiple-choice questions suitable for diagnostic and achievement testing; a three-page teacher's guide; answer key with item facilities; an answer sheet; and a 45-item sample achievement test. Although written for the new grade 12 chemistry course in Victoria, Australia, the items are widely applicable.…
Web survey of sleep problems associated with early-onset bipolar spectrum disorders.

Science.gov (United States)

Lofthouse, Nicholas; Fristad, Mary; Splaingard, Mark; Kelleher, Kelly; Hayes, John; Resko, Susan

2008-05-01

As research on sleep difficulties associated with Early-Onset Bipolar Spectrum Disorders (EBSD) is limited, a web-based survey was developed to further explore these problems. 494 parents of 4-to-12 year-olds, identified by parents as being diagnosed with EBSD, completed a web survey about past and current EBSD-related sleep problems. The survey included Children's Sleep Habits Questionnaire (CSHQ) items and sleep problems from the International Classification of Sleep Disorders 2nd edition. Nearly all parents reported some type of past or current EBSD-sleep problem. Most occurred during a worst mood period, particularly with mixed manic-depressive symptoms. Symptoms caused impairments at home, school, or with peers in 96.9% of the sample and across all three contexts in 64.0% of children. Sleep problems were also noted after three-day weekends and Spring and Fall Daylight Savings time changes. Findings, study limitations, and implications for treatment and etiology are discussed.
FY 1992 Survey report of the technologies for creating/processing advanced biomaterials. Research and development of the technologies for creating/processing advanced biomaterials (Comprehensive survey and research); 1992 nendo senshin bio zairyo no sosei kako gijutsu chosa hokokusho. Senshin bio zairyo no sosei kako gijutsu no kenkyu kaihatsu (sogo chosa kenkyu)

Energy Technology Data Exchange (ETDEWEB)

NONE

1993-03-01

This project is aimed at development of the materials which show functions in a living body by coating a substrate of, e.g., silica or glass, with layered novel peptide synthesized to include unusual amino acid required to have the functions. The existing peptide-related technologies are reviewed and the natural peptide list is prepared. A total of 15 literature is surveyed, and the contents are pigeonholed into 8 items; (1) prospects of peptide engineering, (2) designs of peptide structures, (3) technologies of peptide synthesis, (4) synthesis of unusual amino acid and inclusion into peptide, (5) analysis of peptide structures, (6) physiological activity of peptide, (7) development of peptide materials and function manifestation, and (8) information retrieval of natural peptide (comprising 30 amino acids or less). The item (2) involves analysis and prediction of hydrophobicity of oligopeptide, item (3) chemical synthesis of protein, and protease-aided condensation of dipeptide, item (6) peptide having activity with plant, and item (7) solar cells based on a photoelectric conversion material and pigment-sensitized colloidal titanium oxide. (NEDO)
Counterfeit and Fraudulent Items - Mitigating the risk

International Nuclear Information System (INIS)

Tannenbaum, Marc

2011-01-01

This presentation (slides) provides an overview of the industry's challenges and activities. Firstly, it outlines the differences between counterfeit, fraudulent, suspect, and also substandard items. Notice is given that items could be found not to meet the standard, but the difference in the intent to deceive with counterfeit and fraudulent items is the critical element. Examples from other industries are used which also rely heavily on the assurance of quality for safety. It also informs that EPRI has just completed a report in October 2009 in coordination with other US government agencies and industry organizations; this report, entitled Counterfeit, Substandard and Fraudulent Items, number 1019163, is available for free on the EPRI web site. As a follow-up to this report, EPRI is developing a CFSI Database; any country interested in a collaborative agreement is invited to use and contribute to the database information. Finally, it stresses the importance of the oversight of contractors, training to raise the awareness of the employees and the inspectors, and having a response plan for identified items
School nutrition survey.

LENUS (Irish Health Repository)

O'Connor, M

1993-05-01

Food we eat has an important influence on health and well-being. Many eating habits are established in childhood. 456 children aged eight to 12 years participated in this survey of food eaten at school. Of all the food items eaten as a snack, 48.6% were categorised as junk. 75.8% of the sandwiches brought to school for lunch were made with white bread. Of the remaining food items brought for lunch 63.5% were of the junk variety. Compared with those who brought a snack or lunch from home, those given money to buy their own were more likely to eat junk (p < 0.01). Food eaten at school reflects approximately one third of a child\\'s daily food intake but health food practises for even a third of food intake may be of a value for health and long term eating habits. Nutritional education with the reinforcement of high nutritional standards in schools could improve the situation.
The GP Patient Survey for use in primary care in the National Health Service in the UK--development and psychometric characteristics.

Science.gov (United States)

Campbell, John; Smith, Patten; Nissen, Sonja; Bower, Peter; Elliott, Marc; Roland, Martin

2009-08-22

The UK National GP Patient Survey is one of the largest ever survey programmes of patients registered to receive primary health care, inviting five million respondents to report their experience of NHS primary healthcare. The third such annual survey (2008/9) involved the development of a new survey instrument. We describe the process of that development, and the findings of an extensive pilot survey in UK primary healthcare. The survey was developed following recognised guidelines and involved expert and stakeholder advice, cognitive testing of early versions of the survey instrument, and piloting of the questionnaire in a cross sectional pilot survey of 1,500 randomly selected individuals from the UK electoral register with two reminders to non-respondents. The questionnaire comprises 66 items addressing a range of aspects of UK primary healthcare. A response rate of 590/1500 (39.3%) was obtained. Non response to individual items ranged from 0.8% to 15.3% (median 5.2%). Participants did not always follow internal branching instructions in the questionnaire although electronic controls allow for correction of this problem in analysis. There was marked skew in the distribution of responses to a number of items indicating an overall favourable impression of care. Principal components analysis of 23 items offering evaluation of various aspects of primary care identified three components (relating to doctor or nurse care, or addressing access to care) accounting for 68.3% of the variance in the sample. The GP Patient Survey has been carefully developed and pilot-tested. Survey findings, aggregated at practice level, will be used to inform the distribution of pound sterling 65 million ($107 million) of UK NHS resource in 2008/9 and this offers the opportunity for NHS service planners and providers to take account of users' experiences of health care in planning and delivering primary healthcare in the UK.
Utilizing Response Time Distributions for Item Selection in CAT

Science.gov (United States)

Fan, Zhewen; Wang, Chun; Chang, Hua-Hua; Douglas, Jeffrey

2012-01-01

Traditional methods for item selection in computerized adaptive testing only focus on item information without taking into consideration the time required to answer an item. As a result, some examinees may receive a set of items that take a very long time to finish, and information is not accrued as efficiently as possible. The authors propose two…
大型教育調查研究中的差別試題功能：次級分析中的核心概念及建模方法 Differential Item Functioning Analyses in Large-Scale Educational Surveys: Key Concepts and Modeling Approaches for Secondary Analysts

Directory of Open Access Journals (Sweden)

朱小姝 Xiao-Shu Zhu

2011-03-01

Full Text Available 大型教育評量研究常採用多階段抽樣的設計（multi-stage sampling design），透過對母群體之抽樣單位進行分層以抽取受測者。此外，還會採用複雜題本設計（complex booklet design）的方式將題目組成多份測驗題本。在此情況下，欲確保公正測量出不同受測群體的能力，關鍵在於能夠有效偵測所採用的題目是否具差別試題功能（differential item functioning, DIF）。本文旨在介紹探討在大型教育評量複雜設計之下能用以偵測差別試題功能的建模方法，並應用六種可用於偵測DIF 的多階層廣義線性模式（hierarchical generalized linear models, HGLMs），再透過電腦模擬比較它們偵測DIF 的效力。接著又將這些模式應用到國際數學與科學教育成就趨勢調查研究（TIMSS）的實證數據上，藉以探測是否存在一致性的性別DIF（uniform gender DIF）。 Many educational surveys employ a multi-stage sampling design for students, which makes use of stratification and/or clustering of population units, as well as a complex booklet design for items from an item pool. In these surveys, the reliable detection of item bias or differential item functioning (DIF across student groups is a key component for ensuring fair representations of different student groups. In this paper, we describe several modeling approaches that can be useful for detecting DIF in educational surveys. We illustrate the key ideas by investigating the performance of six hierarchical generalized linear models (HGLMs using a small simulation study and by applying them to real data from the Trends in Mathematics and Science Study (TIMSS study where we use them to investigate potential uniform gender DIF.
Relationship between Future Time Orientation and Item Nonresponse on Subjective Probability Questions: A Cross-Cultural Analysis.

Science.gov (United States)

Lee, Sunghee; Liu, Mingnan; Hu, Mengyao

2017-06-01

Time orientation is an unconscious yet fundamental cognitive process that provides a framework for organizing personal experiences in temporal categories of past, present and future, reflecting the relative emphasis given to these categories. Culture lies central to individuals' time orientation, leading to cultural variations in time orientation. For example, people from future-oriented cultures tend to emphasize the future and store information relevant for the future more than those from present- or past-oriented cultures. For survey questions that ask respondents to report expected probabilities of future events, this may translate into culture-specific question difficulties, manifested through systematically varying "I don't know" item nonresponse rates. This study drew on the time orientation theory and examined culture-specific nonresponse patterns on subjective probability questions using methodologically comparable population-based surveys from multiple countries. The results supported our hypothesis. Item nonresponse rates on these questions varied significantly in the way that future-orientation at the group as well as individual level was associated with lower nonresponse rates. This pattern did not apply to non-probability questions. Our study also suggested potential nonresponse bias. Examining culture-specific constructs, such as time orientation, as a framework for measurement mechanisms may contribute to improving cross-cultural research.
Item analysis and evaluation in the examinations in the faculty of ...

African Journals Online (AJOL)

2014-11-05

Nov 5, 2014 ... Key words: Classical test theory, item analysis, item difficulty, item discrimination, item response theory, reliability ... the probability of answering an item correctly or of attaining ..... A Monte Carlo comparison of item and person.
Examining the Effect of Reverse Worded Items on the Factor Structure of the Need for Cognition Scale.

Directory of Open Access Journals (Sweden)

Xijuan Zhang

Full Text Available Reverse worded (RW items are often used to reduce or eliminate acquiescence bias, but there is a rising concern about their harmful effects on the covariance structure of the scale. Therefore, results obtained via traditional covariance analyses may be distorted. This study examined the effect of the RW items on the factor structure of the abbreviated 18-item Need for Cognition (NFC scale using confirmatory factor analysis. We modified the scale to create three revised versions, varying from no RW items to all RW items. We also manipulated the type of the RW items (polar opposite vs. negated. To each of the four scales, we fit four previously developed models. The four models included a 1-factor model, a 2-factor model distinguishing between positively worded (PW items and RW items, and two 2-factor models, each with one substantive factor and one method factor. Results showed that the number and type of the RW items affected the factor structure of the NFC scale. Consistent with previous research findings, for the original NFC scale, which contains both PW and RW items, the 1-factor model did not have good fit. In contrast, for the revised scales that had no RW items or all RW items, the 1-factor model had reasonably good fit. In addition, for the scale with polar opposite and negated RW items, the factor model with a method factor among the polar opposite items had considerably better fit than the 1-factor model.
Are great apes able to reason from multi-item samples to populations of food items?

Science.gov (United States)

Eckert, Johanna; Rakoczy, Hannes; Call, Josep

2017-10-01

Inductive learning from limited observations is a cognitive capacity of fundamental importance. In humans, it is underwritten by our intuitive statistics, the ability to draw systematic inferences from populations to randomly drawn samples and vice versa. According to recent research in cognitive development, human intuitive statistics develops early in infancy. Recent work in comparative psychology has produced first evidence for analogous cognitive capacities in great apes who flexibly drew inferences from populations to samples. In the present study, we investigated whether great apes (Pongo abelii, Pan troglodytes, Pan paniscus, Gorilla gorilla) also draw inductive inferences in the opposite direction, from samples to populations. In two experiments, apes saw an experimenter randomly drawing one multi-item sample from each of two populations of food items. The populations differed in their proportion of preferred to neutral items (24:6 vs. 6:24) but apes saw only the distribution of food items in the samples that reflected the distribution of the respective populations (e.g., 4:1 vs. 1:4). Based on this observation they were then allowed to choose between the two populations. Results show that apes seemed to make inferences from samples to populations and thus chose the population from which the more favorable (4:1) sample was drawn in Experiment 1. In this experiment, the more attractive sample not only contained proportionally but also absolutely more preferred food items than the less attractive sample. Experiment 2, however, revealed that when absolute and relative frequencies were disentangled, apes performed at chance level. Whether these limitations in apes' performance reflect true limits of cognitive competence or merely performance limitations due to accessory task demands is still an open question. © 2017 Wiley Periodicals, Inc.
Assessing Impact, DIF, and DFF in Accommodated Item Scores: A Comparison of Multilevel Measurement Model Parameterizations

Science.gov (United States)

Beretvas, S. Natasha; Cawthon, Stephanie W.; Lockhart, L. Leland; Kaye, Alyssa D.

2012-01-01

This pedagogical article is intended to explain the similarities and differences between the parameterizations of two multilevel measurement model (MMM) frameworks. The conventional two-level MMM that includes item indicators and models item scores (Level 1) clustered within examinees (Level 2) and the two-level cross-classified MMM (in which item…
An NCME Instructional Module on Polytomous Item Response Theory Models

Science.gov (United States)

Penfield, Randall David

2014-01-01

A polytomous item is one for which the responses are scored according to three or more categories. Given the increasing use of polytomous items in assessment practices, item response theory (IRT) models specialized for polytomous items are becoming increasingly common. The purpose of this ITEMS module is to provide an accessible overview of…
Item response theory analysis of the Lichtenberg Financial Decision Screening Scale.

Science.gov (United States)

Teresi, Jeanne A; Ocepek-Welikson, Katja; Lichtenberg, Peter A

2017-01-01

The focus of these analyses was to examine the psychometric properties of the Lichtenberg Financial Decision Screening Scale (LFDSS). The purpose of the screen was to evaluate the decisional abilities and vulnerability to exploitation of older adults. Adults aged 60 and over were interviewed by social, legal, financial, or health services professionals who underwent in-person training on the administration and scoring of the scale. Professionals provided a rating of the decision-making abilities of the older adult. The analytic sample included 213 individuals with an average age of 76.9 (SD = 10.1). The majority (57%) were female. Data were analyzed using item response theory (IRT) methodology. The results supported the unidimensionality of the item set. Several IRT models were tested. Ten ordinal and binary items evidenced a slightly higher reliability estimate (0.85) than other versions and better coverage in terms of the range of reliable measurement across the continuum of financial incapacity.

The Carnegie Dietary Survey of Interwar Britain.

Science.gov (United States)

Shave, Samantha A

2015-01-01

This research note describes an under-used collection of papers which document interwar income, nutrition and health in Britain which were created in the administration of the Carnegie Dietary Survey by John Boyd-Orr in the Rowett Institute with funding from the Carnegie United Kingdom Trust. The survey was conducted in 16 rural and urban places across England and Scotland between 1937-9, and are now held at the Specialist Collections Centre at the University of Aberdeen. While the importance of the survey in informing knowledge about nutrition and the development of rationing has been acknowledged in the field of social medicine, the survey data has primarily been used by epidemiological scientists and economic historians. After outlining the survey's past influences and uses, this item details the possible ways the data could be used by social, economic and local population historians.
Behaviors in Advance Care Planning and ACtions Survey (BACPACS): development and validation part 1.

Science.gov (United States)

Kassam, Aliya; Douglas, Maureen L; Simon, Jessica; Cunningham, Shannon; Fassbender, Konrad; Shaw, Marta; Davison, Sara N

2017-11-22

Although advance care planning (ACP) is fairly well understood, significant barriers to patient participation remain. As a result, tools to assess patient behaviour are required. The objective of this study was to improve the measurement of patient engagement in ACP by detecting existing survey design issues and establishing content and response process validity for a new survey entitled Behaviours in Advance Care Planning and ACtions Survey (BACPACS). We based our new tool on that of an existing ACP engagement survey. Initial item reduction was carried out using behavior change theories by content and design experts to help reduce response burden and clarify questions. Thirty-two patients with chronic diseases (cancer, heart failure or renal failure) were recruited for the think aloud cognitive interviewing with the new, shortened survey evaluating patient engagement with ACP. Of these, n = 27 had data eligible for analysis (n = 8 in round 1 and n = 19 in rounds 2 and 3). Interviews were audio-recorded and analyzed using the constant comparison method. Three reviewers independently listened to the interviews, summarized findings and discussed discrepancies until consensus was achieved. Item reduction from key content expert review and conversation analysis helped decrease number of items from 116 in the original ACP Engagement Survey to 24-38 in the new BACPACS depending on branching of responses. For the think aloud study, three rounds of interviews were needed until saturation for patient clarity was achieved. The understanding of ACP as a construct, survey response options, instructions and terminology pertaining to patient engagement in ACP warranted further clarification. Conversation analysis, content expert review and think aloud cognitive interviewing were useful in refining the new survey instrument entitled BACPACS. We found evidence for both content and response process validity for this new tool.
Development and psychometric evaluation of the PROMIS Pediatric Life Satisfaction item banks, child-report, and parent-proxy editions.

Science.gov (United States)

Forrest, Christopher B; Devine, Janine; Bevans, Katherine B; Becker, Brandon D; Carle, Adam C; Teneralli, Rachel E; Moon, JeanHee; Tucker, Carole A; Ravens-Sieberer, Ulrike

2018-01-01

To describe the psychometric evaluation and item response theory calibration of the PROMIS Pediatric Life Satisfaction item banks, child-report, and parent-proxy editions. A pool of 55 life satisfaction items was administered to 1992 children 8-17 years old and 964 parents of children 5-17 years old. Analyses included descriptive statistics, reliability, factor analysis, differential item functioning, and assessment of construct validity. Thirteen items were deleted because of poor psychometric performance. An 8-item short form was administered to a national sample of 996 children 8-17 years old, and 1294 parents of children 5-17 years old. The combined sample (2988 children and 2258 parents) was used in item response theory (IRT) calibration analyses. The final item banks were unidimensional, the items were locally independent, and the items were free from impactful differential item functioning. The 8-item and 4-item short form scales showed excellent reliability, convergent validity, and discriminant validity. Life satisfaction decreased with declining socio-economic status, presence of a special health care need, and increasing age for girls, but not boys. After IRT calibration, we found that 4- and 8-item short forms had a high degree of precision (reliability) across a wide range (>4 SD units) of the latent variable. The PROMIS Pediatric Life Satisfaction item banks and their short forms provide efficient, precise, and valid assessments of life satisfaction in children and youth.
Characterization of Disability in Canadians with Mental Disorders Using an Abbreviated Version of a DSM-5 Emerging Measure: The 12-Item WHO Disability Assessment Schedule (WHODAS) 2.0.

Science.gov (United States)

Sjonnesen, Kirsten; Bulloch, Andrew G M; Williams, Jeanne; Lavorato, Dina; B Patten, Scott

2016-04-01

The World Health Organization Disability Assessment Schedule 2.0 (WHODAS 2.0) is a disability scale included in Section 3 of the fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5) as a possible replacement for the Global Assessment of Functioning Scale (GAF). To assist Canadian psychiatrists with interpretation of the scale, we have conducted a descriptive analysis using data from the 2012 Canadian Community Health Survey-Mental Health component (CCHS-MH). The 2012 CCHS-MH was a cross-sectional survey of the Canadian community (n = 23,757). The survey included an abbreviated 12-item version of the WHODAS 2.0. Mental disorder diagnoses were assessed for schizophrenia, other psychosis, major depressive episode (MDE), generalized anxiety disorder (GAD), bipolar I disorder, substance abuse/dependence, and alcohol abuse/dependence. Mean scores ranged from 14.2 (95% CI, 14.1 to 14.3) for the overall community population to 23.1 (95% CI, 19.5 to 26.7) for those with schizophrenia, with higher scores indicating greater disability. Furthermore, the difference in scores between those with lifetime and past-month episodes suggests that the scale is sensitive to changes occurring during the course of these disorders; for example, scores varied from 23.6 (95% CI, 22.2 to 25.1) for past-month MDE to 14.4 (95% CI, 14.2 to 14.7) in the lifetime MDE group without a past-year episode. This analysis suggests that the WHODAS 2.0 may be a suitable replacement for the GAF. As a disability measure, even though it is not a mental health-specific instrument, the 12-item WHODAS 2.0 appears to be sensitive to the impact of mental disorders and to changes over the time course of a mental disorder. However, the clinical utility of this measure requires additional assessment. © The Author(s) 2016.
Survey Definitions of Gout for Epidemiologic Studies: Comparison With Crystal Identification as the Gold Standard

NARCIS (Netherlands)

Dalbeth, N.; Schumacher, H.R.; Fransen, J.; Neogi, T.; Jansen, T.L; Brown, M.; Louthrenoo, W.; Vazquez-Mellado, J.; Eliseev, M.; McCarthy, G.; Stamp, L.K.; Perez-Ruiz, F.; Sivera, F.; Ea, H.K.; Gerritsen, M.; Scire, C.A.; Cavagna, L.; Lin, C.; Chou, Y.Y.; Tausche, A.K.; Rocha Castelar-Pinheiro, G. da; Janssen, M; Chen, J.H.; Cimmino, M.A.; Uhlig, T.; Taylor, W.J.

2016-01-01

OBJECTIVE: To identify the best-performing survey definition of gout from items commonly available in epidemiologic studies. METHODS: Survey definitions of gout were identified from 34 epidemiologic studies contributing to the Global Urate Genetics Consortium (GUGC) genome-wide association study.
Health state evaluation of an item: A general framework and graphical representation

International Nuclear Information System (INIS)

Jiang, R.; Jardine, A.K.S.

2008-01-01

This paper presents a general theoretical framework to evaluate the health state of an item based on condition monitoring information. The item's health state is defined in terms of its relative health level and overall health level. The former is evaluated based on the relative magnitude of the composite covariate and the latter is evaluated using a fractile life of the residual life distribution at the decision instant. In addition, a method is developed to graphically represent the degradation model, failure threshold model, and the observation history of the composite covariate. As a result, the health state of the monitored item can be intuitively presented and the evaluated result can be subsequently used in a condition-based maintenance optimization decision model, which is amenable to computer modeling. A numerical example is included to illustrate the proposed approach and its appropriateness
EOQ Model for Delayed Deteriorating Items with Shortages and Trade Credit Policy

Directory of Open Access Journals (Sweden)

R Sundararajan

2015-08-01

Full Text Available This paper deals with a deterministic inventory model for deteriorating items under the condition of permissible delay in payments with constant demand rate is a function of time which differs from before and after deterioration for a single item. Shortages are allowed and completely backlogged which is a function of time. Under these assumptions, this paper develops a retailer's model for obtaining an optimal cycle length and ordering quantity in deteriorating items of an inventory model. Thus, our objective is retailer's cost minimization problem to nd an optimal replenishment policy under various parameters. The convexity of the objective function is derived and the numerical examples are provided to support the proposed model. Sensitivity analysis of the optimal solution with respect to major parameters of the model is included and the implications are discussed.
Programmatic Environmental Scans: A Survey Based on Program Planning and Evaluation Concepts

Directory of Open Access Journals (Sweden)

Donna J. Peterson

2015-10-01

Full Text Available Within Extension, environmental scans are most commonly used to assess community or organizational issues or for strategic planning purposes. However, Extension has expanded the use of environmental scans to systematically identify “what programs exist” on a given topic or focus area. Yet, despite recent attention to the topic of environmental scanning in Extension, survey instruments used to conduct environmental scans have not been published. Given the emphasis on implementation of evidence-based practices and programs, having a ready-made survey that can be used to identify programs on a specific topic and that could subsequently lead to an evaluability assessment of those programs would be a useful resource. To encourage the use of environmental scans to identify existing evidence-based programs, this article describes a survey instrument developed for the purpose of scanning for 4-H Healthy Living programs ready for rigorous outcome evaluation and/or national replication. It focuses on the rationale for survey items, as well as provides a summary and definition of those items. The survey tool can be easily adapted for future programmatic environmental scans both within and outside Extension.
Utilising a multi-item questionnaire to assess household food security in Australia.

Science.gov (United States)

Butcher, Lucy M; O'Sullivan, Therese A; Ryan, Maria M; Lo, Johnny; Devine, Amanda

2018-03-15

Currently, two food sufficiency questions are utilised as a proxy measure of national food security status in Australia. These questions do not capture all dimensions of food security and have been attributed to underreporting of the problem. The purpose of this study was to investigate food security using the short form of the US Household Food Security Survey Module (HFSSM) within an Australian context; and explore the relationship between food security status and multiple socio-demographic variables. Two online surveys were completed by 2334 Australian participants from November 2014 to February 2015. Surveys contained the short form of the HFSSM and twelve socio-demographic questions. Cross-tabulations chi-square tests and a multinomial logistic regression model were employed to analyse the survey data. Food security status of the respondents was classified accordingly: High or Marginal (64%, n = 1495), Low (20%, n = 460) or Very Low (16%, n = 379). Significant independent predictors of food security were age (P important issue across Australia and that certain groups, regardless of income, are particularly vulnerable. Government policy and health promotion interventions that specifically target "at risk" groups may assist to more effectively address the problem. Additionally, the use of a multi-item measure is worth considering as a national indicator of food security in Australia. © 2018 Australian Health Promotion Association.
Shaping Collaboration 2006: action items for the LHC

Energy Technology Data Exchange (ETDEWEB)

Goldfarb, S [CERN-PH, 1211 Geneva 23 (Switzerland); Herr, J; Neal, H A [Assistant Research Scientist, University of Michigan (United States); Research Process Manager, University of Michigan (United States); Professor of Physics, University of Michigan (United States)], E-mail: steven.goldfarb@cern.ch

2008-07-15

Shaping Collaboration 2006 [1] was a workshop held in Geneva, on December 11-13, 2006, to examine the status and future of collaborative tool technology and its usage for large global scientific collaborations, such as those of the CERN LHC [2]. The workshop brought together some of the leading experts in the field of collaborative tools (WACE 2006) [3] with physicists and developers of the LHC collaborations and HENP (High-Energy and Nuclear Physics). We highlight important presentations and key discussions held during the workshop, then focus on a large and aggressive set of goals and specific action items targeted at institutes from all levels of the LHC organization. This list of action items, assembled during a panel discussion at the close of the LHC sessions, includes recommendations for the LHC Users, their Universities, Project Managers, Spokespersons, National Funding Agencies and Host Laboratories. We present this list, along with suggestions for priorities in addressing the immediate and long-term needs of HENP.
Shaping Collaboration 2006: action items for the LHC

International Nuclear Information System (INIS)

Goldfarb, S; Herr, J; Neal, H A

2008-01-01

Shaping Collaboration 2006 [1] was a workshop held in Geneva, on December 11-13, 2006, to examine the status and future of collaborative tool technology and its usage for large global scientific collaborations, such as those of the CERN LHC [2]. The workshop brought together some of the leading experts in the field of collaborative tools (WACE 2006) [3] with physicists and developers of the LHC collaborations and HENP (High-Energy and Nuclear Physics). We highlight important presentations and key discussions held during the workshop, then focus on a large and aggressive set of goals and specific action items targeted at institutes from all levels of the LHC organization. This list of action items, assembled during a panel discussion at the close of the LHC sessions, includes recommendations for the LHC Users, their Universities, Project Managers, Spokespersons, National Funding Agencies and Host Laboratories. We present this list, along with suggestions for priorities in addressing the immediate and long-term needs of HENP
Community Survey Q2: What to emphasize in Q1

Data.gov (United States)

Town of Chapel Hill, North Carolina — This question is from the 2015 Chapel Hill Community Survey.Which THREE of these items do you think should receive the most emphasis from Town leaders over the next...
41 CFR 101-27.204 - Types of shelf-life items.

Science.gov (United States)

2010-07-01

... 41 Public Contracts and Property Management 2 2010-07-01 2010-07-01 true Types of shelf-life items...-Management of Shelf-Life Materials § 101-27.204 Types of shelf-life items. Shelf-life items are classified as nonextendable (Type I) and extendable (Type II). Type I items have a definite storage life after which the item...
Constructing the 32-item Fitness-to-Drive Screening Measure.

Science.gov (United States)

Medhizadah, Shabnam; Classen, Sherrilene; Johnson, Andrew M

2018-04-01

The Fitness-to-Drive Screening Measure © (FTDS) enables proxies to identify at-risk older drivers via 54 driving-related items, but may be too lengthy for widespread uptake. We reduced the number of items in the FTDS and validated the shorter measure, using 200 caregiver responses. Exploratory factor analysis and classical test theory techniques were used to determine the most interpretable factor model and the minimum number of items to be used for predicting fitness to drive. The extent to which the shorter FTDS predicted the results of the 54-item FTDS was evaluated through correlational analysis. A three-factor model best represented the empirical data. Classical test theory techniques lead to the development of the 32-item FTDS. The 32-item FTDS was highly correlated ( r = .99, p = .05) with the FTDS. The 32-item FTDS may provide raters with a faster and more efficient way to identify at-risk older drivers.
Extent of awareness and prevalence of adulteration in selected food items in rural Dehradun

Directory of Open Access Journals (Sweden)

Ashok Kumar Srivastava

2016-09-01

Full Text Available Background: Adulteration of food items is common phenomenon in India. It includes both willful adulteration to improve texture and quality of food items and supply of substandard food items. The usual outcomes is outbreak of food borne illness. Aims & Objectives: i To estimate the prevalence of food adulteration in selected food items ii the awareness of subjects regarding food adulteration act and iii their buying practices. Material and Methods: Samplesize:150 households was sampled, based on prevalence of adulteration to be around 50%, with 95% confidence interval and absolute allowable error of 10%. Sample household were drawn from the selected villages randomly. Pre-designed and pretested questionnaires was administered to fulfill the objectives and food items were tested using NICE food adulteration kit. Data were analyzed by numeral with percentage, Pearson’s correlation test and F test. Results: In 59.3% households, housewives purchased the food items for the house. The prevalence of adulteration ranged from 17.3% to 66.2% in selected food items. Loose product was purchased by 54.3%. The food labels on packed items was not read by 86.3%. Mean percentage of purity was highest among literates (57.3 ±12.3 than illiterates and those having primary education. Statistically significant F ratio was seen for mean percentage of purity and respondent’s literacy status. Conclusion: Adulterant is rampant in poor strata of society due to consumer’s illiteracy and lack of awareness towards food safety rules.
Tailored Cloze: Improved with Classical Item Analysis Techniques.

Science.gov (United States)

Brown, James Dean

1988-01-01

The reliability and validity of a cloze procedure used as an English-as-a-second-language (ESL) test in China were improved by applying traditional item analysis and selection techniques. The 'best' test items were chosen on the basis of item facility and discrimination indices, and were administered as a 'tailored cloze.' 29 references listed.…
Measuring children's self-reported sport participation, risk perception and injury history: development and validation of a survey instrument.

Science.gov (United States)

Siesmaa, Emma J; Blitvich, Jennifer D; White, Peta E; Finch, Caroline F

2011-01-01

Despite the health benefits associated with children's sport participation, the occurrence of injury in this context is common. The extent to which sport injuries impact children's ongoing involvement in sport is largely unknown. Surveys have been shown to be useful for collecting children's injury and sport participation data; however, there are currently no published instruments which investigate the impact of injury on children's sport participation. This study describes the processes undertaken to assess the validity of two survey instruments for collecting self-reported information about child cricket and netball related participation, injury history and injury risk perceptions, as well as the reliability of the cricket-specific version. Face and content validity were assessed through expert feedback from primary and secondary level teachers and from representatives of peak sporting bodies for cricket and netball. Test-retest reliability was measured using a sample of 59 child cricketers who completed the survey on two occasions, 3-4 weeks apart. Based on expert feedback relating to face and content validity, modification and/or deletion of some survey items was undertaken. Survey items with low test-retest reliability (κ≤0.40) were modified or deleted, items with moderate reliability (κ=0.41-0.60) were modified slightly and items with higher reliability (κ≥0.61) were retained, with some undergoing minor modifications. This is the first survey of its kind which has been successfully administered to cricketers aged 10-16 years to collect information about injury risk perceptions and intentions for continued sport participation. Implications for its generalisation to other child sport participants are discussed. Copyright © 2010 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Linking NHS data for pediatric pharmacovigilance: Results of a Delphi survey.

Science.gov (United States)

Hopf, Y M; Francis, J; Helms, P J; Haughney, J; Bond, C

2016-01-01

Adverse drug events are a major cause of patient safety incidents. Current systems of pharmacovigilance under-report adverse drug reactions (ADRs), especially in children, leading to delays in their identification. This is of particular concern, as children especially have an increased vulnerability to ADRs. The objective was to seek consensus among healthcare professionals (HCPs) about barriers and facilitators to the linkage of routinely collected health data for pediatric pharmacovigilance in Scotland. A Delphi survey was conducted with a random sample of HCPs including nurses, pharmacists and doctors, working in primary or secondary care, in Scotland. Participants were identified from sampling frames of the target professionals such as an NHS workforce list for general practitioners and recruited by postal invitation. A total of 819 HCPs were invited to take part. Those agreeing to participate were given the option of completing the questionnaires online or as hard copy. Reminders were sent twice at a fortnightly interval. Questions content included description of professional role as well as testing for the willingness to support the proposed project and was informed by the Theoretical Domains Framework of Behavior Change (TDF) and earlier qualitative work. Three Delphi rounds were administered, including a first round for item generation. 121 of those invited agreed to take part (15%). The first round of the Delphi study included 21 open questions and generated over a 1000 individual statements from 61 participants that returned the questionnaires (50.4%). These were rationalized to 149 items for the second round in which participants rated their views on the importance (or not) of each item on a 9-point Likert scale (strongly disagree - strongly agree). After the third round, there was consensus on items that focused on professional standards, and practical requirements, overall there was support for data linkage and a multi-professional approach. It would
Assessment of the Item Selection and Weighting in the Birmingham Vasculitis Activity Score for Wegener's Granulomatosis

Science.gov (United States)

MAHR, ALFRED D.; NEOGI, TUHINA; LAVALLEY, MICHAEL P.; DAVIS, JOHN C.; HOFFMAN, GARY S.; MCCUNE, W. JOSEPH; SPECKS, ULRICH; SPIERA, ROBERT F.; ST.CLAIR, E. WILLIAM; STONE, JOHN H.; MERKEL, PETER A.

2013-01-01

Objective To assess the Birmingham Vasculitis Activity Score for Wegener's Granulomatosis (BVAS/WG) with respect to its selection and weighting of items. Methods This study used the BVAS/WG data from the Wegener's Granulomatosis Etanercept Trial. The scoring frequencies of the 34 predefined items and any “other” items added by clinicians were calculated. Using linear regression with generalized estimating equations in which the physician global assessment (PGA) of disease activity was the dependent variable, we computed weights for all predefined items. We also created variables for clinical manifestations frequently added as other items, and computed weights for these as well. We searched for the model that included the items and their generated weights yielding an activity score with the highest R2 to predict the PGA. Results We analyzed 2,044 BVAS/WG assessments from 180 patients; 734 assessments were scored during active disease. The highest R2 with the PGA was obtained by scoring WG activity based on the following items: the 25 predefined items rated on ≥5 visits, the 2 newly created fatigue and weight loss variables, the remaining minor other and major other items, and a variable that signified whether new or worse items were present at a specific visit. The weights assigned to the items ranged from 1 to 21. Compared with the original BVAS/WG, this modified score correlated significantly more strongly with the PGA. Conclusion This study suggests possibilities to enhance the item selection and weighting of the BVAS/WG. These changes may increase this instrument's ability to capture the continuum of disease activity in WG. PMID:18512722
Retention of Esperanto Is Affected by Delay-Interval Task and Item Closure: A Partial Resolution of the Delay-Retention Effect

Science.gov (United States)

Brosvic, Gary M.; Epstein, Michael L.; Dihoff, Roberta E.; Cook, Michael L.

2006-01-01

The present studies were undertaken to examine the effects of manipulating delay-interval task (Study 1) and timing of feedback (Study 2) on acquisition and retention. Participants completed a 100-item cumulative final examination, which included 50 items from each laboratory examination, plus 50 entirely new items. Acquisition and retention were…

Some links on this page may take you to non-federal websites. Their policies may differ from this site.