WorldWideScience

Sample records for single item assessing

  1. Assessing the Validity of Single-item Life Satisfaction Measures: Results from Three Large Samples

    Science.gov (United States)

    Cheung, Felix; Lucas, Richard E.

    2014-01-01

    Purpose: The present paper assessed the validity of single-item life satisfaction measures by comparing single-item measures to the Satisfaction with Life Scale (SWLS), a more psychometrically established measure. Methods: Two large samples from Washington (N=13,064) and Oregon (N=2,277) recruited by the Behavioral Risk Factor Surveillance System (BRFSS) and a representative German sample (N=1,312) recruited by the German Socio-Economic Panel (GSOEP) were included in the present analyses. Single-item life satisfaction measures and the SWLS were correlated with theoretically relevant variables, such as demographics, subjective health, domain satisfaction, and affect. The correlations between the two life satisfaction measures and these variables were examined to assess the construct validity of single-item life satisfaction measures. Results: Consistent across three samples, single-item life satisfaction measures demonstrated a substantial degree of criterion validity with the SWLS (zero-order r = 0.62 – 0.64; disattenuated r = 0.78 – 0.80). Patterns of statistical significance for correlations with theoretically relevant variables were the same across single-item measures and the SWLS. Single-item measures did not produce systematically different correlations compared to the SWLS (average difference = 0.001 – 0.005). The average absolute difference in the magnitudes of the correlations produced by single-item measures and the SWLS was very small (average absolute difference = 0.015 – 0.042). Conclusions: Single-item life satisfaction measures performed very similarly to the multiple-item SWLS. Social scientists would get virtually identical answers to substantive questions regardless of which measure they use. PMID:24890827
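The gap between the zero-order and disattenuated correlations reported above can be illustrated with the classical correction for attenuation, which divides the observed correlation by the square root of the product of the two measures' reliabilities. The sketch below is only an illustration: the abstract does not state which reliability estimates were used, so `rel_x` and `rel_y` are hypothetical values chosen for the example.

```python
import math


def disattenuate(r_xy: float, rel_x: float, rel_y: float) -> float:
    """Classical correction for attenuation: r_xy / sqrt(rel_x * rel_y)."""
    if not (0 < rel_x <= 1 and 0 < rel_y <= 1):
        raise ValueError("reliabilities must lie in (0, 1]")
    return r_xy / math.sqrt(rel_x * rel_y)


# Hypothetical reliabilities, chosen only for illustration (not taken from the paper).
corrected = disattenuate(r_xy=0.63, rel_x=0.72, rel_y=0.88)
print(round(corrected, 2))
```

With perfectly reliable measures (both reliabilities equal to 1) the function returns the observed correlation unchanged, which is a quick sanity check on the formula.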

  2. Assessing the validity of single-item life satisfaction measures: results from three large samples.

    Science.gov (United States)

    Cheung, Felix; Lucas, Richard E

    2014-12-01

    The present paper assessed the validity of single-item life satisfaction measures by comparing single-item measures to the Satisfaction with Life Scale (SWLS), a more psychometrically established measure. Two large samples from Washington (N = 13,064) and Oregon (N = 2,277) recruited by the Behavioral Risk Factor Surveillance System and a representative German sample (N = 1,312) recruited by the German Socio-Economic Panel were included in the present analyses. Single-item life satisfaction measures and the SWLS were correlated with theoretically relevant variables, such as demographics, subjective health, domain satisfaction, and affect. The correlations between the two life satisfaction measures and these variables were examined to assess the construct validity of single-item life satisfaction measures. Consistent across three samples, single-item life satisfaction measures demonstrated a substantial degree of criterion validity with the SWLS (zero-order r = 0.62-0.64; disattenuated r = 0.78-0.80). Patterns of statistical significance for correlations with theoretically relevant variables were the same across single-item measures and the SWLS. Single-item measures did not produce systematically different correlations compared to the SWLS (average difference = 0.001-0.005). The average absolute difference in the magnitudes of the correlations produced by single-item measures and the SWLS was very small (average absolute difference = 0.015-0.042). Single-item life satisfaction measures performed very similarly to the multiple-item SWLS. Social scientists would get virtually identical answers to substantive questions regardless of which measure they use.

  3. Single-item measure for assessing quality of life in children with drug-resistant epilepsy.

    Science.gov (United States)

    Conway, Lauryn; Widjaja, Elysa; Smith, Mary Lou

    2018-03-01

    The current study investigated the psychometric properties of a single-item quality of life (QOL) measure, the Global Quality of Life in Childhood Epilepsy question (G-QOLCE), in children with drug-resistant epilepsy. Data came from the Impact of Pediatric Epilepsy Surgery on Health-Related Quality of Life Study (PESQOL), a multicenter prospective cohort study (n = 118) with observations collected at baseline and at 6 months of follow-up on children aged 4-18 years. QOL was measured with the QOLCE-76 and KIDSCREEN-27. The G-QOLCE was an overall QOL question derived from the QOLCE-76. Construct validity and reliability were assessed with Spearman's correlation and intraclass correlation coefficient (ICC). Responsiveness was examined through distribution-based and anchor-based methods. The G-QOLCE showed moderate (r ≥ 0.30) to strong (r ≥ 0.50) correlations with composite scores, and most subscales of the QOLCE-76 and KIDSCREEN-27 at baseline and 6-month follow-up. The G-QOLCE had moderate test-retest reliability (ICC range: 0.49-0.72) and was able to detect clinically important change in patients' QOL (standardized response mean: 0.38; probability of change: 0.65; Guyatt's responsiveness statistics: 0.62 and 0.78). Caregiver anxiety and family functioning contributed most strongly to G-QOLCE scores over time. Results offer promising preliminary evidence regarding the validity, reliability, and responsiveness of the proposed single-item QOL measure. The G-QOLCE is a potentially useful tool that can be feasibly administered in a busy clinical setting to evaluate clinical status and impact of treatment outcomes in pediatric epilepsy.

  4. A psychometric comparison of three scales and a single-item measure to assess sexual satisfaction.

    Science.gov (United States)

    Mark, Kristen P; Herbenick, Debby; Fortenberry, J Dennis; Sanders, Stephanie; Reece, Michael

    2014-01-01

    This study was designed to systematically compare and contrast the psychometric properties of three scales developed to measure sexual satisfaction and a single-item measure of sexual satisfaction. The Index of Sexual Satisfaction (ISS), Global Measure of Sexual Satisfaction (GMSEX), and the New Sexual Satisfaction Scale-Short (NSSS-S) were compared to one another and to a single-item measure of sexual satisfaction. Conceptualization of the constructs, distribution of scores, internal consistency, convergent validity, test-retest reliability, and factor structure were compared between the measures. A total of 211 men and 214 women completed the scales and a measure of relationship satisfaction, with 33% (n = 139) of the sample reassessed two months later. All scales demonstrated appropriate distribution of scores and adequate internal consistency. The GMSEX, NSSS-S, and the single-item measure demonstrated convergent validity. Test-retest reliability was demonstrated by the ISS, GMSEX, and NSSS-S, but not the single-item measure. Taken together, the GMSEX received the strongest psychometric support in this sample for a unidimensional measure of sexual satisfaction and the NSSS-S received the strongest psychometric support in this sample for a bidimensional measure of sexual satisfaction.

  5. Work-related stress assessed by a text message single-item stress question.

    Science.gov (United States)

    Arapovic-Johansson, B; Wåhlin, C; Kwak, L; Björklund, C; Jensen, I

    2017-12-02

    Given the prevalence of work stress-related ill-health in the Western world, it is important to find cost-effective, easy-to-use and valid measures which can be used both in research and in practice. The aim was to examine the validity and reliability of the single-item stress question (SISQ), distributed weekly by short message service (SMS) and used for measurement of work-related stress. The convergent validity was assessed through associations between the SISQ and subscales of the Job Demand-Control-Support model, the Effort-Reward Imbalance model and scales measuring depression, exhaustion and sleep. The predictive validity was assessed using SISQ data collected through SMS. The reliability was analysed by the test-retest procedure. Correlations between the SISQ and all the subscales except for job strain and esteem reward were significant, ranging from -0.186 to 0.627. The SISQ could also predict sick leave, depression and exhaustion at 12-month follow-up. The analysis of reliability revealed a satisfactory stability with a weighted kappa between 0.804 and 0.868. The SISQ, administered through SMS, can be used for the screening of stress levels in a working population. © The Author 2017. Published by Oxford University Press on behalf of the Society of Occupational Medicine. All rights reserved.
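The test-retest stability above is summarized by a weighted kappa, which penalizes disagreements between two ratings in proportion to how far apart the categories are. The following is a minimal sketch of Cohen's weighted kappa for two ordinal ratings of the same respondents. The abstract does not say whether linear or quadratic weights were used, so both are offered here as an assumption, and the ratings are invented illustration data coded 0 to 4.

```python
def weighted_kappa(first, second, n_categories, weights="linear"):
    """Cohen's weighted kappa for two ordinal ratings coded 0..n_categories-1."""
    if len(first) != len(second) or not first:
        raise ValueError("need two non-empty rating lists of equal length")
    if weights not in ("linear", "quadratic"):
        raise ValueError("weights must be 'linear' or 'quadratic'")
    k = n_categories
    n = len(first)
    power = 1 if weights == "linear" else 2

    # Observed joint proportions of (first rating, second rating).
    observed = [[0.0] * k for _ in range(k)]
    for a, b in zip(first, second):
        observed[a][b] += 1.0 / n
    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    # Weighted disagreement actually observed vs. expected under independence.
    observed_disagreement = 0.0
    expected_disagreement = 0.0
    for i in range(k):
        for j in range(k):
            w = (abs(i - j) / (k - 1)) ** power
            observed_disagreement += w * observed[i][j]
            expected_disagreement += w * row[i] * col[j]
    if expected_disagreement == 0:
        raise ValueError("kappa is undefined when all ratings fall in one category")
    return 1.0 - observed_disagreement / expected_disagreement


# Invented ratings for eight respondents on two occasions (illustration only).
week_1 = [0, 1, 2, 3, 4, 2, 1, 3]
week_2 = [0, 1, 2, 4, 4, 2, 2, 3]
kappa = weighted_kappa(week_1, week_2, n_categories=5)
print(round(kappa, 3))
```

Identical ratings on both occasions give a kappa of exactly 1, and two one-step disagreements out of eight, as in the invented data, lower it only modestly because adjacent categories carry a small weight.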

  6. Psychometric properties of a single-item scale to assess sleep quality among individuals with fibromyalgia

    Directory of Open Access Journals (Sweden)

    Sadosky, Alesia B

    2009-06-01

    Background: Sleep disturbances are a common and bothersome symptom of fibromyalgia (FM). This study reports psychometric properties of a single-item scale to assess sleep quality among individuals with FM. Methods: Analyses were based on data from two randomized, double-blind, placebo-controlled trials of pregabalin (studies 1056 and 1077). In a daily diary, patients reported the quality of their sleep on a numeric rating scale ranging from 0 ("best possible sleep") to 10 ("worst possible sleep"). Test re-test reliability of the Sleep Quality Scale was evaluated by computing intraclass correlation coefficients. Pearson correlation coefficients were computed between baseline Sleep Quality scores and baseline pain diary and Medical Outcomes Study (MOS) Sleep scores. Responsiveness to treatment was evaluated by standardized effect sizes computed as the difference between least squares mean changes in Sleep Quality scores in the pregabalin and placebo groups divided by the standard deviation of Sleep Quality scores across all patients at baseline. Results: Studies 1056 and 1077 included 748 and 745 patients, respectively. Most patients were female (study 1056: 94.4%; study 1077: 94.5%) and white (study 1056: 90.2%; study 1077: 91.0%). Mean ages were 48.8 years (study 1056) and 50.1 years (study 1077). Test re-test reliability coefficients of the Sleep Quality Scale were 0.91 and 0.90 in the 1056 and 1077 studies, respectively. Pearson correlation coefficients between baseline Sleep Quality scores and baseline pain diary scores were 0.64 (p ... Conclusion: These results provide evidence of the reproducibility, convergent validity, and responsiveness to treatment of the Sleep Quality Scale and provide a foundation for its further use and evaluation in FM patients.
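The responsiveness statistic described in the Methods above is a simple ratio: the difference between the treatment and placebo least squares mean changes, divided by the baseline standard deviation across all patients. A short sketch of that arithmetic follows. The numeric inputs are hypothetical, since the abstract as reproduced here does not report the mean changes or the baseline standard deviation.

```python
def standardized_effect_size(ls_mean_change_treatment: float,
                             ls_mean_change_placebo: float,
                             baseline_sd: float) -> float:
    """Between-group difference in mean change, scaled by the baseline SD."""
    if baseline_sd <= 0:
        raise ValueError("baseline_sd must be positive")
    return (ls_mean_change_treatment - ls_mean_change_placebo) / baseline_sd


# Hypothetical values for illustration only: on a 0-10 scale where lower is
# better, a larger decrease under treatment yields a negative effect size.
effect = standardized_effect_size(
    ls_mean_change_treatment=-2.0,
    ls_mean_change_placebo=-1.2,
    baseline_sd=2.0,
)
print(round(effect, 2))
```

Because the scale runs from best (0) to worst (10) sleep, a negative value here indicates improvement relative to placebo; reporting the absolute value is a presentation choice.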

  7. Assessing Health Status in Inflammatory Bowel Disease using a Novel Single-Item Numeric Rating Scale

    Science.gov (United States)

    Surti, Bijal; Spiegel, Brennan; Ippoliti, Andrew; Vasiliauskas, Eric; Simpson, Peter; Shih, David; Targan, Stephan; McGovern, Dermot; Melmed, Gil Y.

    2014-01-01

    Background: Current instruments used to measure disease activity and health-related quality of life (HRQOL) in patients with Crohn’s disease (CD) and ulcerative colitis (UC) are often cumbersome, time-consuming, and expensive; although used in clinical trials, they are not convenient for clinical practice. A numeric rating scale (NRS) is a quick, inexpensive, and convenient patient-reported outcome (PRO) that can capture the patient’s overall perception of health. Aims: To assess the validity, reliability, and responsiveness of an NRS and evaluate its use in clinical practice in patients with CD and UC. Methods: We prospectively evaluated patient-reported NRS scores and measured correlations between NRS and a range of severity measures, including physician-reported NRS, Crohn’s disease activity index (CDAI), Harvey-Bradshaw index (HBI), inflammatory bowel disease questionnaire (IBDQ), and C-reactive protein (CRP) in patients with CD. Subsequently, we evaluated the correlation between the NRS and standard measures of health status (HBI or simple colitis clinical activity index [SCCAI]) and laboratory tests (sedimentation rate [ESR], CRP, and fecal calprotectin) in patients with CD and UC. Results: The patient-reported NRS showed excellent correlation with CDAI (R²=0.59, p<0.0001), IBDQ (R²=0.66, p<0.0001), and HBI (R²=0.32, p<0.0001) in patients with CD. The NRS showed poor, but statistically significant correlation with SCCAI (R²=0.25, p<0.0001) in patients with UC. The NRS did not correlate with CRP, ESR, or calprotectin. The NRS was reliable and responsive to change. Conclusions: The NRS is a valid, reliable, and responsive measure that may be useful to evaluate patients with CD and possibly UC. PMID:23250673

  8. MIMIC Methods for Assessing Differential Item Functioning in Polytomous Items

    Science.gov (United States)

    Wang, Wen-Chung; Shih, Ching-Lin

    2010-01-01

    Three multiple indicators-multiple causes (MIMIC) methods, namely, the standard MIMIC method (M-ST), the MIMIC method with scale purification (M-SP), and the MIMIC method with a pure anchor (M-PA), were developed to assess differential item functioning (DIF) in polytomous items. In a series of simulations, it appeared that all three methods…

  9. Diagnostic Value of Subjective Memory Complaints Assessed with a Single Item in Dominantly Inherited Alzheimer’s Disease: Results of the DIAN Study

    Directory of Open Access Journals (Sweden)

    Christoph Laske

    2015-01-01

    Objective. We examined the diagnostic value of subjective memory complaints (SMCs) assessed with a single item in a large cross-sectional cohort consisting of families with autosomal dominant Alzheimer’s disease (ADAD) participating in the Dominantly Inherited Alzheimer Network (DIAN). Methods. The baseline sample of 183 mutation carriers (MCs) and 117 noncarriers (NCs) was divided according to Clinical Dementia Rating (CDR) scale into preclinical (CDR 0; MCs: n=107; NCs: n=109), early symptomatic (CDR 0.5; MCs: n=48; NCs: n=8), and dementia stage (CDR ≥ 1; MCs: n=28; NCs: n=0). These groups were subdivided by the presence or absence of SMCs. Results. At CDR 0, SMCs were present in 12.1% of MCs and 9.2% of NCs (P=0.6). At CDR 0.5, SMCs were present in 66.7% of MCs and 62.5% of NCs (P=1.0). At CDR ≥ 1, SMCs were present in 96.4% of MCs. SMCs in MCs were significantly associated with CDR, logical memory scores, Geriatric Depression Scale, education, and estimated years to onset. Conclusions. The present study shows that SMCs assessed by a single-item scale have no diagnostic value to identify preclinical ADAD in asymptomatic individuals. These results demonstrate the need for further improvement of SMC measures that should be examined in large clinical trials.

  10. Assessing item fit for unidimensional item response theory models using residuals from estimated item response functions.

    Science.gov (United States)

    Haberman, Shelby J; Sinharay, Sandip; Chon, Kyong Hee

    2013-07-01

    Residual analysis (e.g. Hambleton & Swaminathan, Item response theory: principles and applications, Kluwer Academic, Boston, 1985; Hambleton, Swaminathan, & Rogers, Fundamentals of item response theory, Sage, Newbury Park, 1991) is a popular method to assess fit of item response theory (IRT) models. We suggest a form of residual analysis that may be applied to assess item fit for unidimensional IRT models. The residual analysis consists of a comparison of the maximum-likelihood estimate of the item characteristic curve with an alternative ratio estimate of the item characteristic curve. The large sample distribution of the residual is proved to be standardized normal when the IRT model fits the data. We compare the performance of our suggested residual to the standardized residual of Hambleton et al. (Fundamentals of item response theory, Sage, Newbury Park, 1991) in a detailed simulation study. We then calculate our suggested residuals using data from an operational test. The residuals appear to be useful in assessing the item fit for unidimensional IRT models.

  11. Item Response Theory for Peer Assessment

    Science.gov (United States)

    Uto, Masaki; Ueno, Maomi

    2016-01-01

    As an assessment method based on a constructivist approach, peer assessment has become popular in recent years. However, in peer assessment, a problem remains that reliability depends on the rater characteristics. For this reason, some item response models that incorporate rater parameters have been proposed. Those models are expected to improve…

  12. The development of a single-item Food Choice Questionnaire

    NARCIS (Netherlands)

    Onwezen, M.C.; Reinders, M.J.; Verain, M.C.D.; Snoek, H.M.

    2019-01-01

    Based on the multi-item Food Choice Questionnaire (FCQ) originally developed by Steptoe and colleagues (1995), the current study developed a single-item FCQ that provides an acceptable balance between practical needs and psychometric concerns. Studies 1 (N = 1851) and 2 (2a (N = 3290), 2b (N =

  13. Concurrent Validation of the Clinical Opiate Withdrawal Scale (COWS) and Single-Item Indices against the Clinical Institute Narcotic Assessment (CINA) Opioid Withdrawal Instrument

    Science.gov (United States)

    Tompkins, D. Andrew; Bigelow, George E.; Harrison, Joseph A.; Johnson, Rolley E.; Fudala, Paul J.; Strain, Eric C.

    2009-01-01

    Introduction: The Clinical Opiate Withdrawal Scale (COWS) is an 11-item clinician-administered scale assessing opioid withdrawal. Though commonly used in clinical practice, it has not been systematically validated. The present study validated the COWS in comparison to the validated Clinical Institute Narcotic Assessment (CINA) scale. Method: Opioid-dependent volunteers were enrolled in a residential trial and stabilized on morphine 30 mg given subcutaneously four times daily. Subjects then underwent double-blind, randomized challenges of intramuscularly administered placebo and naloxone (0.4 mg) on separate days, during which the COWS, CINA, and visual analog scale (VAS) assessments were concurrently obtained. Subjects completing both challenges were included (N=46). Correlations between mean peak COWS and CINA scores as well as self-report VAS questions were calculated. Results: Mean peak COWS and CINA scores of 7.6 and 24.4, respectively, occurred on average 30 minutes post-injection of naloxone. Mean COWS and CINA scores 30 minutes after placebo injection were 1.3 and 18.9, respectively. The Pearson correlation coefficient for peak COWS and CINA scores during the naloxone challenge session was 0.85 (p<0.001). Peak COWS scores also correlated well with peak VAS self-report scores of bad drug effect (r=0.57, p<0.001) and feeling sick (r=0.57, p<0.001), providing additional evidence of concurrent validity. Placebo was not associated with any significant elevation of COWS, CINA, or VAS scores, indicating discriminant validity. Cronbach’s alpha for the COWS was 0.78, indicating good internal consistency (reliability). Discussion: COWS, CINA, and certain VAS items are all valid measurement tools for acute opiate withdrawal. PMID:19647958
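The internal consistency figure above is Cronbach's alpha, which compares the sum of the individual item variances with the variance of the total score. A minimal sketch of that computation is given below; the score matrix is invented for illustration and has nothing to do with the COWS data.

```python
from statistics import variance


def cronbach_alpha(item_scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    if len(item_scores) < 2:
        raise ValueError("need at least two respondents")
    k = len(item_scores[0])
    if k < 2 or any(len(row) != k for row in item_scores):
        raise ValueError("every respondent needs the same k >= 2 item scores")
    # Sample variance of each item across respondents.
    item_variances = [variance([row[i] for row in item_scores]) for i in range(k)]
    # Sample variance of the summed scale score.
    total_variance = variance([sum(row) for row in item_scores])
    if total_variance == 0:
        raise ValueError("alpha is undefined when total scores do not vary")
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Invented scores: five respondents answering a three-item scale.
scores = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 1],
    [3, 3, 4],
    [5, 4, 5],
]
print(round(cronbach_alpha(scores), 2))
```

When every item gives each respondent the same score, the items are perfectly consistent and the function returns 1, which serves as a simple check of the formula.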

  14. Using Item Response Theory to Describe the Nonverbal Literacy Assessment (NVLA)

    Science.gov (United States)

    Fleming, Danielle; Wilson, Mark; Ahlgrim-Delzell, Lynn

    2018-01-01

    The Nonverbal Literacy Assessment (NVLA) is a literacy assessment designed for students with significant intellectual disabilities. The 218-item test was initially examined using confirmatory factor analysis. This method showed that the test worked as expected, but the items loaded onto a single factor. This article uses item response theory to…

  15. Single-Item Measurement of Suicidal Behaviors: Validity and Consequences of Misclassification.

    Directory of Open Access Journals (Sweden)

    Alexander J Millner

    Suicide is a leading cause of death worldwide. Although research has made strides in better defining suicidal behaviors, there has been less focus on accurate measurement. Currently, the widespread use of self-report, single-item questions to assess suicide ideation, plans and attempts may contribute to measurement problems and misclassification. We examined the validity of single-item measurement and the potential for statistical errors. Over 1,500 participants completed an online survey containing single-item questions regarding a history of suicidal behaviors, followed by questions with more precise language, multiple response options and narrative responses to examine the validity of single-item questions. We also conducted simulations to test whether common statistical tests are robust against the degree of misclassification produced by the use of single items. We found that 11.3% of participants who endorsed a single-item suicide attempt measure engaged in behavior that would not meet the standard definition of a suicide attempt. Similarly, 8.8% of those who endorsed a single-item measure of suicide ideation endorsed thoughts that would not meet standard definitions of suicide ideation. Statistical simulations revealed that this level of misclassification substantially decreases statistical power and increases the likelihood of false conclusions from statistical tests. Providing a wider range of response options for each item reduced the misclassification rate by approximately half. Overall, the use of single-item, self-report questions to assess the presence of suicidal behaviors leads to misclassification, increasing the likelihood of statistical decision errors. Improving the measurement of suicidal behaviors is critical to increase understanding and prevention of suicide.

  16. Modeling Composite Assessment Data Using Item Response Theory

    Science.gov (United States)

    Ueckert, Sebastian

    2018-01-01

    Composite assessments aim to combine different aspects of a disease in a single score and are utilized in a variety of therapeutic areas. The data arising from these evaluations are inherently discrete with distinct statistical properties. This tutorial presents the framework of the item response theory (IRT) for the analysis of this data type in a pharmacometric context. The article considers both conceptual (terms and assumptions) and practical questions (modeling software, data requirements, and model building). PMID:29493119

  17. Single-item memory, associative memory, and the human hippocampus

    OpenAIRE

    Gold, Jeffrey J.; Hopkins, Ramona O.; Squire, Larry R.

    2006-01-01

    We tested recognition memory for items and associations in memory-impaired patients with bilateral lesions thought to be limited to the hippocampal region. In Experiment 1 (Combined memory test), participants studied words and then took a memory test in which studied words, new words, studied word pairs, and recombined word pairs were presented in a mixed order. In Experiment 2 (Separated memory test), participants studied single words and then took a memory test involving studied word and ne...

  18. Assessing difference between classical test theory and item ...

    African Journals Online (AJOL)

    Assessing difference between classical test theory and item response theory methods in scoring primary four multiple choice objective test items. ... All research participants were ranked on the CTT number correct scores and the corresponding IRT item pattern scores from their performance on the PRISMADAT. Wilcoxon ...

  19. Writing, Evaluating and Assessing Data Response Items in Economics.

    Science.gov (United States)

    Trotman-Dickenson, D. I.

    1989-01-01

    Describes some of the problems in writing data response items in economics for use by A Level and General Certificate of Secondary Education (GCSE) students. Examines the experience of two series of workshops on writing items, evaluating them and assessing responses from schools. Offers suggestions for producing packages of data response items as…

  20. Development and validation of the Single Item Narcissism Scale (SINS).

    Science.gov (United States)

    Konrath, Sara; Meier, Brian P; Bushman, Brad J

    2014-01-01

    The narcissistic personality is characterized by grandiosity, entitlement, and low empathy. This paper describes the development and validation of the Single Item Narcissism Scale (SINS). Although the use of longer instruments is superior in most circumstances, we recommend the SINS in some circumstances (e.g. under serious time constraints, online studies). In 11 independent studies (total N = 2,250), we demonstrate the SINS' psychometric properties. The SINS is significantly correlated with longer narcissism scales, but uncorrelated with self-esteem. It also has high test-retest reliability. We validate the SINS in a variety of samples (e.g., undergraduates, nationally representative adults), intrapersonal correlates (e.g., positive affect, depression), and interpersonal correlates (e.g., aggression, relationship quality, prosocial behavior). The SINS taps into the more fragile and less desirable components of narcissism. The SINS can be a useful tool for researchers, especially when it is important to measure narcissism with constraints preventing the use of longer measures.

  1. A Model-Free Diagnostic for Single-Peakedness of Item Responses Using Ordered Conditional Means

    Science.gov (United States)

    Polak, Marike; De Rooij, Mark; Heiser, Willem J.

    2012-01-01

    In this article we propose a model-free diagnostic for single-peakedness (unimodality) of item responses. Presuming a unidimensional unfolding scale and a given item ordering, we approximate item response functions of all items based on ordered conditional means (OCM). The proposed OCM methodology is based on Thurstone & Chave's (1929) "criterion…

  2. The utility of single-item readiness screeners in middle school.

    Science.gov (United States)

    Lewis, Crystal G; Herman, Keith C; Huang, Francis L; Stormont, Melissa; Grossman, Caroline; Eddy, Colleen; Reinke, Wendy M

    2017-10-01

    This study examined the benefit of utilizing one-item academic and one-item behavior readiness teacher-rated screeners at the beginning of the school year to predict end-of-school year outcomes for middle school students. The Middle School Academic and Behavior Readiness (M-ABR) screeners were developed to provide an efficient and effective way to assess readiness in students. Participants included 889 students in 62 middle school classrooms in an urban Missouri school district. Concurrent validity with the M-ABR items and other indicators of readiness in the fall were evaluated using Pearson product-moment correlation coefficients, with the academic readiness item having medium to strong correlations with other baseline academic indicators (r=±0.56 to 0.91) and the behavior readiness item having low to strong correlations with baseline behavior items (r=±0.20 to 0.79). Next, the predictive validity of the M-ABR items was analyzed with hierarchical linear regressions using end-of-year outcomes as the dependent variable. The academic and behavior readiness items demonstrated adequate validity for all outcomes with moderate effects (β=±0.31 to 0.73 for academic outcomes and β=±0.24 to 0.59 for behavioral outcomes) after controlling for baseline demographics. Even after controlling for baseline scores, the M-ABR items predicted unique variance in almost all outcome variables. Four conditional probability indices were calculated to obtain an optimal cut score, to determine ready vs. not ready, for both single-item M-ABR scales. The cut point of "fair" yielded the most acceptable values for the indices. The odds ratios (OR) of experiencing negative outcomes given a "fair" or lower readiness rating (2 or below on the M-ABR screeners) at the beginning of the year were significant and strong for all outcomes (OR=2.29 to OR=14.46), except for internalizing problems. These findings suggest promise for using single readiness items to screen for varying negative end
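The cut-score evaluation described above can be sketched from a 2x2 table that crosses the screener decision (at or below the cut vs. above it) with the end-of-year outcome (negative vs. not). The abstract does not name the four conditional probability indices, so the sketch assumes the commonly used set of sensitivity, specificity, positive predictive value and negative predictive value, and adds the odds ratio. All counts are hypothetical.

```python
def screening_indices(true_pos: int, false_pos: int, false_neg: int, true_neg: int) -> dict:
    """Conditional probability indices and odds ratio from a 2x2 screening table."""
    if min(true_pos, false_pos, false_neg, true_neg) <= 0:
        raise ValueError("all four cells must be positive for these ratios")
    return {
        # P(flagged | negative outcome)
        "sensitivity": true_pos / (true_pos + false_neg),
        # P(not flagged | no negative outcome)
        "specificity": true_neg / (true_neg + false_pos),
        # P(negative outcome | flagged)
        "ppv": true_pos / (true_pos + false_pos),
        # P(no negative outcome | not flagged)
        "npv": true_neg / (true_neg + false_neg),
        # Odds of a negative outcome when flagged vs. when not flagged
        "odds_ratio": (true_pos * true_neg) / (false_pos * false_neg),
    }


# Hypothetical counts for one outcome at one candidate cut point.
indices = screening_indices(true_pos=40, false_pos=60, false_neg=20, true_neg=280)
for name, value in indices.items():
    print(f"{name}: {value:.2f}")
```

Repeating the calculation for each candidate cut point and comparing the resulting indices is one way to arrive at a cut score; how the authors weighed the indices against each other is not stated in the abstract.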

  3. Development and Validation of the Single Item Narcissism Scale (SINS)

    Science.gov (United States)

    Konrath, Sara; Meier, Brian P.; Bushman, Brad J.

    2014-01-01

    Main Objectives: The narcissistic personality is characterized by grandiosity, entitlement, and low empathy. This paper describes the development and validation of the Single Item Narcissism Scale (SINS). Although the use of longer instruments is superior in most circumstances, we recommend the SINS in some circumstances (e.g. under serious time constraints, online studies). Methods: In 11 independent studies (total N = 2,250), we demonstrate the SINS' psychometric properties. Results: The SINS is significantly correlated with longer narcissism scales, but uncorrelated with self-esteem. It also has high test-retest reliability. We validate the SINS in a variety of samples (e.g., undergraduates, nationally representative adults), intrapersonal correlates (e.g., positive affect, depression), and interpersonal correlates (e.g., aggression, relationship quality, prosocial behavior). The SINS taps into the more fragile and less desirable components of narcissism. Significance: The SINS can be a useful tool for researchers, especially when it is important to measure narcissism with constraints preventing the use of longer measures. PMID:25093508

  4. Development and validation of the Single Item Narcissism Scale (SINS).

    Directory of Open Access Journals (Sweden)

    Sara Konrath

    MAIN OBJECTIVES: The narcissistic personality is characterized by grandiosity, entitlement, and low empathy. This paper describes the development and validation of the Single Item Narcissism Scale (SINS). Although the use of longer instruments is superior in most circumstances, we recommend the SINS in some circumstances (e.g. under serious time constraints, online studies). METHODS: In 11 independent studies (total N = 2,250), we demonstrate the SINS' psychometric properties. RESULTS: The SINS is significantly correlated with longer narcissism scales, but uncorrelated with self-esteem. It also has high test-retest reliability. We validate the SINS in a variety of samples (e.g., undergraduates, nationally representative adults), intrapersonal correlates (e.g., positive affect, depression), and interpersonal correlates (e.g., aggression, relationship quality, prosocial behavior). The SINS taps into the more fragile and less desirable components of narcissism. SIGNIFICANCE: The SINS can be a useful tool for researchers, especially when it is important to measure narcissism with constraints preventing the use of longer measures.

  5. Face validity of the single work ability item

    DEFF Research Database (Denmark)

    Gupta, Nidhi; Jensen, Bjørn Søvsø; Søgaard, Karen

    2014-01-01

    PURPOSE: The purpose of this study was to investigate the face validity of the self-reported single item work ability with objectively measured heart rate reserve (%HRR) among blue-collar workers. METHODS: We utilized data from 127 blue-collar workers (Female = 53; Male = 74) aged 18-65 years from ... with a total of 5,810 h, including 2,640 working hours. RESULTS: A significant moderate correlation between work ability and %HRR was observed among males (R = -0.33, P = 0.005), but not among females (R = 0.11, P = 0.431). In a gender-stratified multi-adjusted logistic regression analysis, males with high %HRR were more likely to report a reduced work ability compared to males with low %HRR [OR = 4.75, 95% confidence interval (95% CI) = 1.31 to 17.25]. However, this association was not found among females (OR = 0.26, 95% CI 0.03 to 2.16), and a significant interaction between work ability, %HRR ...

  6. Applying Item Response Theory methods to design a learning progression-based science assessment

    Science.gov (United States)

    Chen, Jing

    Learning progressions are used to describe how students' understanding of a topic progresses over time and to classify the progress of students into steps or levels. This study applies Item Response Theory (IRT) based methods to investigate how to design learning progression-based science assessments. The research questions of this study are: (1) how to use items in different formats to classify students into levels on the learning progression, (2) how to design a test to give good information about students' progress through the learning progression of a particular construct and (3) what characteristics of test items support their use for assessing students' levels. Data used for this study were collected from 1500 elementary and secondary school students during 2009-2010. The written assessment was developed in several formats such as the Constructed Response (CR) items, Ordered Multiple Choice (OMC) and Multiple True or False (MTF) items. The following are the main findings from this study. The OMC, MTF and CR items might measure different components of the construct. A single construct explained most of the variance in students' performances. However, additional dimensions in terms of item format can explain a certain amount of the variance in student performance. So additional dimensions need to be considered when we want to capture the differences in students' performances on different types of items targeting the understanding of the same underlying progression. Items in each item format need to be improved in certain ways to classify students more accurately into the learning progression levels. This study establishes some general steps that can be followed to design other learning progression-based tests as well. For example, first, the boundaries between levels on the IRT scale can be defined by using the means of the item thresholds across a set of good items. Second, items in multiple formats can be selected to achieve the information criterion at all

  7. The Single-Item Math Anxiety Scale: An Alternative Way of Measuring Mathematical Anxiety

    Science.gov (United States)

    Núñez-Peña, M. Isabel; Guilera, Georgina; Suárez-Pellicioni, Macarena

    2014-01-01

    This study examined whether the Single-Item Math Anxiety Scale (SIMA), based on the item suggested by Ashcraft, provided valid and reliable scores of mathematical anxiety. A large sample of university students (n = 279) was administered the SIMA and the 25-item Shortened Math Anxiety Rating Scale (sMARS) to evaluate the relation between the scores…

  8. Work ability as prognostic risk marker of disability pension : Single-item work ability score versus multi-item work ability index

    NARCIS (Netherlands)

    Roelen, C.A.M.; Rhenen, van W.; Groothoff, J.W.; Klink, van der J.J.L.; Twisk, W.R.; Heymans, M.W.

    2014-01-01

    Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP.

  9. Work ability as prognostic risk marker of disability pension: single-item work ability score versus multi-item work ability index

    NARCIS (Netherlands)

    Roelen, C.A.M.; van Rhenen, W.; Groothoff, J.W.; van der Klink, J.J.L.; Twisk, J.W.R.; Heymans, M.W.

    2014-01-01

    Objectives Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP. Methods This

  10. Work ability as prognostic risk marker of disability pension : single-item work ability score versus multi-item work ability index

    NARCIS (Netherlands)

    Roelen, Corne A. M.; van Rhenen, Willem; Groothoff, Johan W.; van der Klink, Jac J. L.; Twisk, Jos W. R.; Heymans, Martijn W.

    Objectives Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP. Methods This

  11. Goodness-of-Fit Assessment of Item Response Theory Models

    Science.gov (United States)

    Maydeu-Olivares, Alberto

    2013-01-01

    The article provides an overview of goodness-of-fit assessment methods for item response theory (IRT) models. It is now possible to obtain accurate "p"-values of the overall fit of the model if bivariate information statistics are used. Several alternative approaches are described. As the validity of inferences drawn on the fitted model…

  12. Advanced Marketing Core Curriculum. Test Items and Assessment Techniques.

    Science.gov (United States)

    Smith, Clifton L.; And Others

    This document contains duties and tasks, multiple-choice test items, and other assessment techniques for Missouri's advanced marketing core curriculum. The core curriculum begins with a list of 13 suggested textbook resources. Next, nine duties with their associated tasks are given. Under each task appears one or more citations to appropriate…

  13. The 12-item World Health Organization Disability Assessment Schedule II (WHO-DAS II): a nonparametric item response analysis

    Directory of Open Access Journals (Sweden)

    Fernandez Ana

    2010-05-01

    Full Text Available Abstract Background Previous studies have analyzed the psychometric properties of the World Health Organization Disability Assessment Schedule II (WHO-DAS II) using classical omnibus measures of scale quality. These analyses are sample dependent and do not model item responses as a function of the underlying trait level. The main objective of this study was to examine the effectiveness of the WHO-DAS II items and their options in discriminating between changes in the underlying disability level by means of item response analyses. We also explored differential item functioning (DIF) in men and women. Methods The participants were 3615 adult general practice patients from 17 regions of Spain, with a first diagnosed major depressive episode. The 12-item WHO-DAS II was administered by the general practitioners during the consultation. We used a non-parametric item response method (Kernel-Smoothing) implemented with the TestGraf software to examine the effectiveness of each item (item characteristic curves) and their options (option characteristic curves) in discriminating between changes in the underlying disability level. We examined composite DIF to know whether women had a higher probability than men of endorsing each item. Results Item response analyses indicated that the twelve items forming the WHO-DAS II perform very well. All items were determined to provide good discrimination across varying standardized levels of the trait. The items also had option characteristic curves that showed good discrimination, given that each increasing option became more likely than the previous as a function of increasing trait level. No gender-related DIF was found on any of the items. Conclusions All WHO-DAS II items were very good at assessing overall disability. Our results supported the appropriateness of the weights assigned to response option categories and showed an absence of gender differences in item functioning.

  14. Assessing Differential Item Functioning on the Test of Relational Reasoning

    Directory of Open Access Journals (Sweden)

    Denis Dumas

    2018-03-01

    Full Text Available The test of relational reasoning (TORR) is designed to assess the ability to identify complex patterns within visuospatial stimuli. The TORR is designed for use in school and university settings, and therefore, its measurement invariance across diverse groups is critical. In this investigation, a large sample, representative of a major university on key demographic variables, was collected, and the resulting data were analyzed using a multi-group, multidimensional item-response theory model-comparison procedure. No significant differential item functioning was found on any of the TORR items across any of the demographic groups of interest. This finding is interpreted as evidence of the cultural fairness of the TORR, and potential test-development choices that may have contributed to that cultural fairness are discussed.

  15. Item reduction and psychometric validation of the Oily Skin Self Assessment Scale (OSSAS) and the Oily Skin Impact Scale (OSIS).

    Science.gov (United States)

    Arbuckle, Robert; Clark, Marci; Harness, Jane; Bonner, Nicola; Scott, Jane; Draelos, Zoe; Rizer, Ronald; Yeh, Yating; Copley-Merriman, Kati

    2009-01-01

    Developed using focus groups, the Oily Skin Self Assessment Scale (OSSAS) and Oily Skin Impact Scale (OSIS) are patient-reported outcome measures of oily facial skin. The aim of this study was to finalize the item-scale structure of the instruments and perform psychometric validation in adults with self-reported oily facial skin. The OSSAS and OSIS were administered to 202 adult subjects with oily facial skin in the United States. A subgroup of 152 subjects returned, 4 to 10 days later, for test–retest reliability evaluation. Of the 202 participants, 72.8% were female; 64.4% had self-reported nonsevere acne. Item reduction resulted in a 14-item OSSAS with Sensation (five items), Tactile (four items) and Visual (four items) domains, a single blotting item, and an overall oiliness item. The OSIS was reduced to two three-item domains assessing Annoyance and Self-Image. Confirmatory factor analysis supported the construct validity of the final item-scale structures. The OSSAS and OSIS scales had acceptable item convergent validity (item-scale correlations >0.40) and floor and ceiling effects (skin severity (P skin (P skin), as assessments of self-reported oily facial skin severity and its emotional impact, respectively.

  16. Item Response Theory with Covariates (IRT-C): Assessing Item Recovery and Differential Item Functioning for the Three-Parameter Logistic Model

    Science.gov (United States)

    Tay, Louis; Huang, Qiming; Vermunt, Jeroen K.

    2016-01-01

    In large-scale testing, the use of multigroup approaches is limited for assessing differential item functioning (DIF) across multiple variables as DIF is examined for each variable separately. In contrast, the item response theory with covariate (IRT-C) procedure can be used to examine DIF across multiple variables (covariates) simultaneously. To…

  17. Cross-National Prevalence of Traditional Bullying, Traditional Victimization, Cyberbullying and Cyber-Victimization: Comparing Single-Item and Multiple-Item Approaches of Measurement

    Science.gov (United States)

    Yanagida, Takuya; Gradinger, Petra; Strohmeier, Dagmar; Solomontos-Kountouri, Olga; Trip, Simona; Bora, Carmen

    2016-01-01

    Many large-scale cross-national studies rely on a single-item measurement when comparing prevalence rates of traditional bullying, traditional victimization, cyberbullying, and cyber-victimization between countries. However, the reliability and validity of single-item measurement approaches are highly problematic and might be biased. Data from…

  18. Developing a Model for Optimizing Inventory of Repairable Items at Single Operating Base

    OpenAIRE

    Le, Tin

    2016-01-01

    The use of the EOQ model in inventory management is popular. However, EOQ models have many disadvantages, especially when the model is applied to manage repairable items. In order to deal with high-cost and repairable items, Craig C. Sherbrooke introduced a model in his book “Optimal Inventory Modeling of Systems: Multi-Echelon Techniques”. The research focus is to implement and develop a program to execute the single-site inventory model for repairable items. The model helps to significantl...

  19. Missouri Assessment Program (MAP), Spring 2000: Secondary Science, Released Items, Grade 10.

    Science.gov (United States)

    Missouri State Dept. of Elementary and Secondary Education, Jefferson City.

    This assessment sample provides information on the Missouri Assessment Program (MAP) for grade 10 science. The sample consists of six items taken from the test booklet and scoring guides for the six items. The items assess ecosystems, mechanics, and data analysis. (MM)

  20. Single-item screening for agoraphobic symptoms : validation of a web-based audiovisual screening instrument

    NARCIS (Netherlands)

    van Ballegooijen, Wouter; Riper, Heleen; Donker, Tara; Martin Abello, Katherina; Marks, Isaac; Cuijpers, Pim

    2012-01-01

    The advent of web-based treatments for anxiety disorders creates a need for quick and valid online screening instruments, suitable for a range of social groups. This study validates a single-item multimedia screening instrument for agoraphobia, part of the Visual Screener for Common Mental Disorders

  1. Methods for Assessing Item, Step, and Threshold Invariance in Polytomous Items Following the Partial Credit Model

    Science.gov (United States)

    Penfield, Randall D.; Myers, Nicholas D.; Wolfe, Edward W.

    2008-01-01

    Measurement invariance in the partial credit model (PCM) can be conceptualized in several different but compatible ways. In this article the authors distinguish between three forms of measurement invariance in the PCM: step invariance, item invariance, and threshold invariance. Approaches for modeling these three forms of invariance are proposed,…

  2. Evaluation of a single-item screening question to detect limited health literacy in peritoneal dialysis patients.

    Science.gov (United States)

    Jain, Deepika; Sheth, Heena; Bender, Filitsa H; Weisbord, Steven D; Green, Jamie A

    2014-01-01

    Studies have shown that a single-item question might be useful in identifying patients with limited health literacy. However, the utility of the approach has not been studied in patients receiving maintenance peritoneal dialysis (PD). We assessed health literacy in a cohort of 31 PD patients by administering the Rapid Estimate of Adult Literacy in Medicine (REALM) and a single-item health literacy (SHL) screening question "How confident are you filling out medical forms by yourself?" (Extremely, Quite a bit, Somewhat, A little bit, or Not at all). To determine the accuracy of the single-item question for detecting limited health literacy, we performed sensitivity and specificity analyses of the SHL and plotted the area under the receiver operating characteristic (AUROC) curve using the REALM as a reference standard. Using a cut-off of "Somewhat" or less confident, the sensitivity of the SHL for detecting limited health literacy was 80%, and the specificity was 88%. The positive likelihood ratio was 6.9. The SHL had an AUROC of 0.79 (95% confidence interval: 0.52 to 1.00). Our results show that the SHL could be effective in detecting limited health literacy in PD patients.
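
    The likelihood ratio quoted in the abstract above follows from sensitivity and specificity by a standard formula, LR+ = sensitivity / (1 - specificity). The sketch below is illustrative only: the function names are hypothetical, and the inputs are the rounded percentages from the abstract, so the result (about 6.7) differs slightly from the reported 6.9, which was presumably computed from unrounded values.

```python
def positive_likelihood_ratio(sensitivity: float, specificity: float) -> float:
    """LR+ : how much a positive screen raises the odds of the condition."""
    return sensitivity / (1.0 - specificity)


def negative_likelihood_ratio(sensitivity: float, specificity: float) -> float:
    """LR- : how much a negative screen lowers the odds of the condition."""
    return (1.0 - sensitivity) / specificity


# Rounded values as reported in the abstract (assumption: no raw counts given).
sensitivity = 0.80
specificity = 0.88

lr_pos = positive_likelihood_ratio(sensitivity, specificity)
lr_neg = negative_likelihood_ratio(sensitivity, specificity)

print(f"LR+ = {lr_pos:.2f}")
print(f"LR- = {lr_neg:.2f}")
```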

  3. Development and evaluation of CAHPS survey items assessing how well healthcare providers address health literacy.

    Science.gov (United States)

    Weidmer, Beverly A; Brach, Cindy; Hays, Ron D

    2012-09-01

    The complexity of health information often exceeds patients' skills to understand and use it. To develop survey items assessing how well healthcare providers communicate health information. Domains and items for the Consumer Assessment of Healthcare Providers and Systems (CAHPS) Item Set for Addressing Health Literacy were identified through an environmental scan and input from stakeholders. The draft item set was translated into Spanish and pretested in both English and Spanish. The revised item set was field tested with a randomly selected sample of adult patients from 2 sites using mail and telephonic data collection. Item-scale correlations, confirmatory factor analysis, and internal consistency reliability estimates were estimated to assess how well the survey items performed and identify composite measures. Finally, we regressed the CAHPS global rating of the provider item on the CAHPS core communication composite and the new health literacy composites. A total of 601 completed surveys were obtained (52% response rate). Two composite measures were identified: (1) Communication to Improve Health Literacy (16 items); and (2) How Well Providers Communicate About Medicines (6 items). These 2 composites were significantly uniquely associated with the global rating of the provider (communication to improve health literacy: PLiteracy composite accounted for 90% of the variance of the original 16-item composite. This study provides support for reliability and validity of the CAHPS Item Set for Addressing Health Literacy. These items can serve to assess whether healthcare providers have communicated effectively with their patients and as a tool for quality improvement.

  4. The work ability index and single-item question: associations with sick leave, symptoms, and health--a prospective study of women on long-term sick leave.

    Science.gov (United States)

    Ahlstrom, Linda; Grimby-Ekman, Anna; Hagberg, Mats; Dellve, Lotta

    2010-09-01

    This study investigated the association between the work ability index (WAI) and the single-item question on work ability among women working in human service organizations (HSO) currently on long-term sick leave. It also examined the association between the WAI and the single-item question in relation to sick leave, symptoms, and health. Predictive values of the WAI, the changed WAI, the single-item question and the changed single-item question were investigated for degree of sick leave, symptoms, and health. This cohort study comprised 324 HSO female workers on long-term (>60 days) sick leave, with follow-ups at 6 and 12 months. Participants responded to questionnaires. Data on work ability, sick leave, health, and symptoms were analyzed with regard to associations and predictability. Spearman correlation and mixed-model analysis were performed for repeated measurements over time. The study showed a very strong association between the WAI and the single-item question among all participants. Both the WAI and the single-item question showed similar patterns of associations with sick leave, health, and symptoms. The predictive value for the degree of sick leave and health-related quality of life (HRQoL) was strong for both the WAI and the single-item question, and slightly less strong for vitality, neck pain, both self-rated general and mental health, and behavioral and current stress. This study suggests that the single-item question on work ability could be used as a simple indicator for assessing the status and progress of work ability among women on long-term sick leave.

  5. The Single Item Literacy Screener: Evaluation of a brief instrument to identify limited reading ability

    Directory of Open Access Journals (Sweden)

    Chew Lisa D

    2006-03-01

    Full Text Available Abstract Background Reading skills are important for accessing health information, using health care services, managing one's health and achieving desirable health outcomes. Our objective was to assess the diagnostic accuracy of the Single Item Literacy Screener (SILS) to identify limited reading ability, one component of health literacy, as measured by the S-TOFHLA. Methods Cross-sectional interview with 999 adults with diabetes residing in Vermont and bordering states. Participants were randomly recruited from Primary Care practices in the Vermont Diabetes Information System June 2003 – December 2004. The main outcome was limited reading ability. The primary predictor was the SILS. Results Of the 999 persons screened, 169 (17%) had limited reading ability. The sensitivity of the SILS in detecting limited reading ability was 54% [95% CI: 47%, 61%] and the specificity was 83% [95% CI: 81%, 86%] with an area under the Receiver Operating Characteristics Curve (ROC) of 0.73 [95% CI: 0.69, 0.78]. Seven hundred seventy (77%) screened negative on the SILS and 692 of these subjects had adequate reading skills (negative predictive value = 0.90 [95% CI: 0.88, 0.92]). Of the 229 who scored positive on the SILS, 92 had limited reading ability (positive predictive value = 0.4 [95% CI: 0.34, 0.47]). Conclusion The SILS is a simple instrument designed to identify patients with limited reading ability who need help reading health-related materials. The SILS performs moderately well at ruling out limited reading ability in adults and allows providers to target additional assessment of health literacy skills to those most in need. Further study of the use of the SILS in clinical settings and with more diverse populations is warranted.
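
    The accuracy figures in the abstract above can be reproduced from the counts it reports. The sketch below rebuilds the 2x2 screening table from those counts (229 screened positive, of whom 92 had limited reading ability; 770 screened negative, of whom 692 had adequate reading skills). The variable names are illustrative. Note that the reconstructed cells imply 170 persons with limited reading ability, one more than the 169 stated in the abstract; that discrepancy is in the source and is left unresolved here.

```python
# Cells of the 2x2 table, derived only from counts given in the abstract.
tp = 92          # screened positive, limited reading ability
fp = 229 - 92    # screened positive, adequate reading skills
tn = 692         # screened negative, adequate reading skills
fn = 770 - 692   # screened negative, limited reading ability

sensitivity = tp / (tp + fn)  # share of limited readers the screener flags
specificity = tn / (tn + fp)  # share of adequate readers the screener clears
ppv = tp / (tp + fp)          # positive predictive value
npv = tn / (tn + fn)          # negative predictive value

print(f"total screened: {tp + fp + tn + fn}")
print(f"sensitivity = {sensitivity:.2f}")
print(f"specificity = {specificity:.2f}")
print(f"PPV = {ppv:.2f}")
print(f"NPV = {npv:.2f}")
```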

  6. Development of the Assessment Items of Debris Flow Using the Delphi Method

    Science.gov (United States)

    Byun, Yosep; Seong, Joohyun; Kim, Mingi; Park, Kyunghan; Yoon, Hyungkoo

    2016-04-01

    In recent years in Korea, typhoons and localized extreme rainfall caused by abnormal climate have increased. Accordingly, debris flow is becoming one of the most dangerous natural disasters. This study aimed to develop assessment items that can be used for conducting damage investigations of debris flow. The Delphi method was applied to classify the realms of assessment items. As a result, 29 assessment items, which can be classified into 6 groups, were determined.

  7. Using personality item characteristics to predict single-item reliability, retest reliability, and self-other agreement

    NARCIS (Netherlands)

    de Vries, Reinout Everhard; Realo, Anu; Allik, Jüri

    2016-01-01

    The use of reliability estimates is increasingly scrutinized as scholars become more aware that test–retest stability and self–other agreement provide a better approximation of the theoretical and practical usefulness of an instrument than its internal reliability. In this study, we investigate item

  8. Matrix Sampling of Items in Large-Scale Assessments

    Directory of Open Access Journals (Sweden)

    Ruth A. Childs

    2003-07-01

    Full Text Available Matrix sampling of items, that is, division of a set of items into different versions of a test form, is used by several large-scale testing programs. Like other test designs, matrixed designs have both advantages and disadvantages. For example, testing time per student is less than if each student received all the items, but the comparability of student scores may decrease. Also, curriculum coverage is maintained, but reporting of scores becomes more complex. In this paper, matrixed designs are compared with more traditional designs in nine categories of costs: development costs, materials costs, administration costs, educational costs, scoring costs, reliability costs, comparability costs, validity costs, and reporting costs. In choosing among test designs, a testing program should examine the costs in light of its mandate(s), the content of the tests, and the financial resources available, among other considerations.

  9. Assessing errors related to characteristics of the items measured

    International Nuclear Information System (INIS)

    Liggett, W.

    1980-01-01

    Errors that are related to some intrinsic property of the items measured are often encountered in nuclear material accounting. An example is the error in nondestructive assay measurements caused by uncorrected matrix effects. Nuclear material accounting requires for each materials type one measurement method for which bounds on these errors can be determined. If such a method is available, a second method might be used to reduce costs or to improve precision. If the measurement error for the first method is longer-tailed than Gaussian, then precision might be improved by measuring all items by both methods. 8 refs

  10. Better assessment of physical function: item improvement is neglected but essential.

    Science.gov (United States)

    Bruce, Bonnie; Fries, James F; Ambrosini, Debbie; Lingala, Bharathi; Gandek, Barbara; Rose, Matthias; Ware, John E

    2009-01-01

    Physical function is a key component of patient-reported outcome (PRO) assessment in rheumatology. Modern psychometric methods, such as Item Response Theory (IRT) and Computerized Adaptive Testing, can materially improve measurement precision at the item level. We present the qualitative and quantitative item-evaluation process for developing the Patient Reported Outcomes Measurement Information System (PROMIS) Physical Function item bank. The process was stepwise: we searched extensively to identify extant Physical Function items and then classified and selectively reduced the item pool. We evaluated retained items for content, clarity, relevance and comprehension, reading level, and translation ease by experts and patient surveys, focus groups, and cognitive interviews. We then assessed items by using classic test theory and IRT, used confirmatory factor analyses to estimate item parameters, and graded response modeling for parameter estimation. We retained the 20 Legacy (original) Health Assessment Questionnaire Disability Index (HAQ-DI) and the 10 SF-36's PF-10 items for comparison. Subjects were from rheumatoid arthritis, osteoarthritis, and healthy aging cohorts (n = 1,100) and a national Internet sample of 21,133 subjects. We identified 1,860 items. After qualitative and quantitative evaluation, 124 newly developed PROMIS items composed the PROMIS item bank, which included revised Legacy items with good fit that met IRT model assumptions. Results showed that the clearest and best-understood items were simple, in the present tense, and straightforward. Basic tasks (like dressing) were more relevant and important than complex ones (like dancing). Revised HAQ-DI and PF-10 items with five response options had higher item-information content than did comparable original Legacy items with fewer response options. IRT analyses showed that the Physical Function domain satisfied general criteria for unidimensionality with one-, two-, three-, and four-factor models

  11. Comparison of Classical Test Theory and Item Response Theory in Individual Change Assessment

    NARCIS (Netherlands)

    Jabrayilov, Ruslan; Emons, Wilco H. M.; Sijtsma, Klaas

    2016-01-01

    Clinical psychologists are advised to assess clinical and statistical significance when assessing change in individual patients. Individual change assessment can be conducted using either the methodologies of classical test theory (CTT) or item response theory (IRT). Researchers have been optimistic

  12. Development and validation of the Single Item Trait Empathy Scale (SITES).

    Science.gov (United States)

    Konrath, Sara; Meier, Brian P; Bushman, Brad J

    2018-04-01

    Empathy involves feeling compassion for others and imagining how they feel. In this article, we develop and validate the Single Item Trait Empathy Scale (SITES), which contains only one item that takes seconds to complete. In seven studies (N=5,724), the SITES was found to be both reliable and valid. It correlated in expected ways with a wide variety of intrapersonal outcomes. For example, it is negatively correlated with narcissism, depression, anxiety, and alexithymia. In contrast, it is positively correlated with other measures of empathy, self-esteem, subjective well-being, and agreeableness. The SITES also correlates with a wide variety of interpersonal outcomes, especially compassion for others and helping others. The SITES is recommended in situations when time or question quantity is constrained.

  13. An Investigation of Item Type in a Standards-Based Assessment.

    Directory of Open Access Journals (Sweden)

    Liz Hollingworth

    2007-12-01

    Full Text Available Large-scale state assessment programs use both multiple-choice and open-ended items on tests for accountability purposes. Certainly, there is an intuitive belief among some educators and policy makers that open-ended items measure something different than multiple-choice items. This study examined two item formats in custom-built, standards-based tests of achievement in Reading and Mathematics at grades 3-8. In this paper, we raise questions about the value of including open-ended items, given scoring costs, time constraints, and the higher probability of missing data from test-takers.

  14. Concurrent Validity and Sensitivity to Change of Direct Behavior Rating Single-Item Scales (DBR-SIS) within an Elementary Sample

    Science.gov (United States)

    Smith, Rhonda L.; Eklund, Katie; Kilgus, Stephen P.

    2018-01-01

    The purpose of this study was to evaluate the concurrent validity, sensitivity to change, and teacher acceptability of Direct Behavior Rating single-item scales (DBR-SIS), a brief progress monitoring measure designed to assess student behavioral change in response to intervention. Twenty-four elementary teacher-student dyads implemented a daily…

  15. The Stanford Leisure-Time Activity Categorical Item (L-Cat): a single categorical item sensitive to physical activity changes in overweight/obese women.

    Science.gov (United States)

    Kiernan, M; Schoffman, D E; Lee, K; Brown, S D; Fair, J M; Perri, M G; Haskell, W L

    2013-12-01

    Physical activity is essential for chronic disease prevention, yet Cat) is a single item comprising six descriptive categories ranging from inactive to very active. This novel methodological approach assesses national activity recommendations as well as multiple clinically relevant categories below and above the recommendations, and incorporates critical methodological principles that enhance psychometrics (reliability, validity and sensitivity to change). We evaluated the L-Cat's psychometrics among 267 overweight/obese women who were asked to meet the national activity recommendations in a randomized behavioral weight-loss trial. The L-Cat had excellent test-retest reliability (κ=0.64, PCat category at 6 months was associated with 1059 more daily pedometer steps (95% CI 712-1407, β=0.38, PCat categories differentiated from each other in a dose-response gradient for steps and weight loss (PsCat was sensitive to change in response to the trial's activity component. Women increased one L-Cat category at 6 months (M=1.0±1.4, PCat categories at 6 months lost more weight than those who did not (M=-4.6%, 95% CI -6.7 to -2.5, PCat has timely potential for clinical use such as tracking activity changes via electronic medical records, especially among overweight/obese populations who are unable or unlikely to reach national recommendations.

  16. Work ability as prognostic risk marker of disability pension: single-item work ability score versus multi-item work ability index.

    Science.gov (United States)

    Roelen, Corné A M; van Rhenen, Willem; Groothoff, Johan W; van der Klink, Jac J L; Twisk, Jos W R; Heymans, Martijn W

    2014-07-01

    Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP. This prospective cohort study comprised 11 537 male construction workers, who completed the WAI at baseline and reported DP after a mean 2.3 years of follow-up. WAS and WAI were calibrated for DP risk predictions with the Hosmer-Lemeshow (H-L) test and their ability to discriminate between high- and low-risk construction workers was investigated with the area under the receiver operating characteristic curve (AUC). At follow-up, 336 (3%) construction workers reported DP. Both WAS [odds ratio (OR) 0.72, 95% confidence interval (95% CI) 0.66-0.78] and WAI (OR 0.57, 95% CI 0.52-0.63) scores were associated with DP at follow-up. The WAS showed miscalibration (H-L model χ² = 10.60; df = 3; P = 0.01) and poorly discriminated between high- and low-risk construction workers (AUC 0.67, 95% CI 0.64-0.70). In contrast, calibration (H-L model χ² = 8.20; df = 8; P = 0.41) and discrimination (AUC 0.78, 95% CI 0.75-0.80) were both adequate for the WAI. Although associated with the risk of future DP, the single-item WAS poorly identified male construction workers at risk of DP. We recommend using the multi-item WAI to screen for risk of DP in occupational health practice.
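
    The Hosmer-Lemeshow P values in the abstract above are upper-tail probabilities of a chi-square distribution with the stated degrees of freedom. As a rough check, the sketch below evaluates that tail probability using only the Python standard library. The helper name chi2_sf is hypothetical and is not taken from the study; it relies on the standard recurrence Q(x; k+2) = Q(x; k) + (x/2)^(k/2) e^(-x/2) / Γ(k/2 + 1), started from the closed forms for 1 and 2 degrees of freedom.

```python
import math


def chi2_sf(x: float, df: int) -> float:
    """Upper-tail probability P(X > x) for a chi-square variable with df >= 1."""
    half = x / 2.0
    if df % 2 == 1:
        q = math.erfc(math.sqrt(half))  # closed form for df = 1
        k = 1
    else:
        q = math.exp(-half)             # closed form for df = 2
        k = 2
    # Step up two degrees of freedom at a time until df is reached.
    while k < df:
        q += half ** (k / 2) * math.exp(-half) / math.gamma(k / 2 + 1)
        k += 2
    return q


# Statistics as reported in the abstract.
p_was = chi2_sf(10.60, 3)  # single-item WAS
p_wai = chi2_sf(8.20, 8)   # multi-item WAI

print(f"WAS: P = {p_was:.2f}")
print(f"WAI: P = {p_wai:.2f}")
```

    Both values round to the P values given in the abstract (0.01 and 0.41), which is consistent with the garbled test statistic being a chi-square.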

  17. Assessment of the Item Selection and Weighting in the Birmingham Vasculitis Activity Score for Wegener's Granulomatosis

    Science.gov (United States)

    MAHR, ALFRED D.; NEOGI, TUHINA; LAVALLEY, MICHAEL P.; DAVIS, JOHN C.; HOFFMAN, GARY S.; MCCUNE, W. JOSEPH; SPECKS, ULRICH; SPIERA, ROBERT F.; ST.CLAIR, E. WILLIAM; STONE, JOHN H.; MERKEL, PETER A.

    2013-01-01

    Objective To assess the Birmingham Vasculitis Activity Score for Wegener's Granulomatosis (BVAS/WG) with respect to its selection and weighting of items. Methods This study used the BVAS/WG data from the Wegener's Granulomatosis Etanercept Trial. The scoring frequencies of the 34 predefined items and any “other” items added by clinicians were calculated. Using linear regression with generalized estimating equations in which the physician global assessment (PGA) of disease activity was the dependent variable, we computed weights for all predefined items. We also created variables for clinical manifestations frequently added as other items, and computed weights for these as well. We searched for the model that included the items and their generated weights yielding an activity score with the highest R² to predict the PGA. Results We analyzed 2,044 BVAS/WG assessments from 180 patients; 734 assessments were scored during active disease. The highest R² with the PGA was obtained by scoring WG activity based on the following items: the 25 predefined items rated on ≥5 visits, the 2 newly created fatigue and weight loss variables, the remaining minor other and major other items, and a variable that signified whether new or worse items were present at a specific visit. The weights assigned to the items ranged from 1 to 21. Compared with the original BVAS/WG, this modified score correlated significantly more strongly with the PGA. Conclusion This study suggests possibilities to enhance the item selection and weighting of the BVAS/WG. These changes may increase this instrument's ability to capture the continuum of disease activity in WG. PMID:18512722

  18. A confirmative clinimetric analysis of the 36-item Family Assessment Device.

    Science.gov (United States)

    Timmerby, Nina; Cosci, Fiammetta; Watson, Maggie; Csillag, Claudio; Schmitt, Florence; Steck, Barbara; Bech, Per; Thastum, Mikael

    2018-02-07

    The Family Assessment Device (FAD) is a 60-item questionnaire widely used to evaluate self-reported family functioning. However, the factor structure as well as the number of items has been questioned. A shorter and more user-friendly version of the original FAD-scale, the 36-item FAD, has therefore previously been proposed, based on findings in a nonclinical population of adults. We aimed in this study to evaluate the brief 36-item version of the FAD in a clinical population. Data from a European multinational study, examining factors associated with levels of family functioning in adult cancer patients' families, were used. Both healthy and ill parents completed the 60-item version FAD. The psychometric analyses conducted were Principal Component Analysis and Mokken-analysis. A total of 564 participants were included. Based on the psychometric analysis we confirmed that the 36-item version of the FAD has robust psychometric properties and can be used in clinical populations. The present analysis confirmed that the 36-item version of the FAD (18 items assessing 'well-being' and 18 items assessing 'dysfunctional' family function) is a brief scale where the summed total score is a valid measure of the dimensions of family functioning. This shorter version of the FAD is, in accordance with the concept of 'measurement-based care', an easy to use scale that could be considered when the aim is to evaluate self-reported family functioning.

  19. Measuring single constructs by single items: Constructing an even shorter version of the "Short Five" personality inventory.

    Directory of Open Access Journals (Sweden)

    Kenn Konstabel

    The aim of this study was to construct a short, 30-item personality questionnaire that would be, in terms of content and meaning of the scores, as comparable as possible with longer, well-established inventories such as NEO PI-R and its clones. To do this, we shortened the formerly constructed 60-item "Short Five" (S5) by half so that each subscale would be represented by a single item. We compared all possibilities of selecting 30 items (preserving balanced keying within each domain of the five-factor model) in terms of correlations with well-established scales, self-peer correlations, and clarity of meaning, and selected an optimal combination for each domain. The resulting shortened questionnaire, XS5, was compared to the original S5 using data from student samples in 6 different countries (Estonia, Finland, UK, Germany, Spain, and China), and a representative Finnish sample. The correlations between XS5 domain scales and their longer counterparts from well-established scales ranged from 0.74 to 0.84; the difference from the equivalent correlations for full version of S5 or from meta-analytic short-term dependability coefficients of NEO PI-R was not large. In terms of prediction of external criteria (emotional experience and self-reported behaviours), there were no important differences between XS5, S5, and the longer well-established scales. Controlling for acquiescence did not improve the prediction of criteria, self-peer correlations, or correlations with longer scales, but it did improve internal reliability and, in some analyses, comparability of the principal component structure. XS5 can be recommended as an economic measure of the five-factor model of personality at the level of domain scales; it has reasonable psychometric properties, fair correlations with longer well-established scales, and it can predict emotional experience and self-reported behaviours no worse than S5. When subscales are essential, we would still recommend using the

  20. Measuring single constructs by single items: Constructing an even shorter version of the “Short Five” personality inventory

    Science.gov (United States)

    Konstabel, Kenn; Lönnqvist, Jan-Erik; Leikas, Sointu; García Velázquez, Regina; Qin, Hiaying; Verkasalo, Markku; Walkowitz, Gari

    2017-01-01

    The aim of this study was to construct a short, 30-item personality questionnaire that would be, in terms of content and meaning of the scores, as comparable as possible with longer, well-established inventories such as NEO PI-R and its clones. To do this, we shortened the formerly constructed 60-item “Short Five” (S5) by half so that each subscale would be represented by a single item. We compared all possibilities of selecting 30 items (preserving balanced keying within each domain of the five-factor model) in terms of correlations with well-established scales, self-peer correlations, and clarity of meaning, and selected an optimal combination for each domain. The resulting shortened questionnaire, XS5, was compared to the original S5 using data from student samples in 6 different countries (Estonia, Finland, UK, Germany, Spain, and China), and a representative Finnish sample. The correlations between XS5 domain scales and their longer counterparts from well-established scales ranged from 0.74 to 0.84; the difference from the equivalent correlations for full version of S5 or from meta-analytic short-term dependability coefficients of NEO PI-R was not large. In terms of prediction of external criteria (emotional experience and self-reported behaviours), there were no important differences between XS5, S5, and the longer well-established scales. Controlling for acquiescence did not improve the prediction of criteria, self-peer correlations, or correlations with longer scales, but it did improve internal reliability and, in some analyses, comparability of the principal component structure. XS5 can be recommended as an economic measure of the five-factor model of personality at the level of domain scales; it has reasonable psychometric properties, fair correlations with longer well-established scales, and it can predict emotional experience and self-reported behaviours no worse than S5. When subscales are essential, we would still recommend using the full version
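
    The exhaustive comparison described above is small enough to enumerate directly. The sketch below counts the candidate item sets for one domain under a loudly stated assumption that the abstract does not confirm: each of the six subscales in a domain offers exactly one positively and one negatively keyed item, and "balanced keying" means three of each are kept. The function name `balanced_selections` is hypothetical.

```python
from itertools import product


def balanced_selections(n_subscales=6):
    """List every way to keep one item per subscale with equal +/- keying.

    Assumption (not stated in the abstract): each subscale has exactly one
    positively keyed ('+') and one negatively keyed ('-') item to choose from.
    """
    half = n_subscales // 2
    return [combo for combo in product("+-", repeat=n_subscales)
            if combo.count("+") == half]


selections = balanced_selections()
n_per_domain = len(selections)  # 20 candidate sets, i.e. "6 choose 3"
```

    Under that assumption each domain has only 20 candidates, so every one of them can be scored against the external criteria and the best retained.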

  1. Factor Structure and Reliability of Test Items for Saudi Teacher Licence Assessment

    Science.gov (United States)

    Alsadaawi, Abdullah Saleh

    2017-01-01

    The Saudi National Assessment Centre administers the Computer Science Teacher Test for teacher certification. The aim of this study is to explore gender differences in candidates' scores, and investigate dimensionality, reliability, and differential item functioning using confirmatory factor analysis and item response theory. The confirmatory…

  2. Assessment of Preference for Edible and Leisure Items in Individuals with Dementia

    Science.gov (United States)

    Ortega, Javier Virues; Iwata, Brian A.; Nogales-Gonzalez, Celia; Frades, Belen

    2012-01-01

    We conducted 2 studies on reinforcer preference in patients with dementia. Results of preference assessments yielded differential selections by 14 participants. Unlike prior studies with individuals with intellectual disabilities, all participants showed a noticeable preference for leisure items over edible items. Results of a subsequent analysis…

  3. Developing an African youth psychosocial assessment: an application of item response theory.

    Science.gov (United States)

    Betancourt, Theresa S; Yang, Frances; Bolton, Paul; Normand, Sharon-Lise

    2014-06-01

    This study aimed to refine a dimensional scale for measuring psychosocial adjustment in African youth using item response theory (IRT). A 60-item scale derived from qualitative data was administered to 667 war-affected adolescents (55% female). Exploratory factor analysis (EFA) determined the dimensionality of items based on goodness-of-fit indices. Items with loadings less than 0.4 were dropped. Confirmatory factor analysis (CFA) was used to confirm the scale's dimensionality found under the EFA. Item discrimination and difficulty were estimated using a graded response model for each subscale using weighted least squares means and variances. Predictive validity was examined through correlations between IRT scores (θ) for each subscale and ratings of functional impairment. All models were assessed using goodness-of-fit and comparative fit indices. Fisher's Information curves examined item precision at different underlying ranges of each trait. Original scale items were optimized and reconfigured into an empirically-robust 41-item scale, the African Youth Psychosocial Assessment (AYPA). Refined subscales assess internalizing and externalizing problems, prosocial attitudes/behaviors and somatic complaints without medical cause. The AYPA is a refined dimensional assessment of emotional and behavioral problems in African youth with good psychometric properties. Validation studies in other cultures are recommended. Copyright © 2014 John Wiley & Sons, Ltd.

  4. International Assessment: A Rasch Model and Teachers' Evaluation of TIMSS Science Achievement Items

    Science.gov (United States)

    Glynn, Shawn M.

    2012-01-01

    The Trends in International Mathematics and Science Study (TIMSS) is a comparative assessment of the achievement of students in many countries. In the present study, a rigorous independent evaluation was conducted of a representative sample of TIMSS science test items because item quality influences the validity of the scores used to inform…

  5. Assessment of Differential Item Functioning in the Experiences of Discrimination Index

    Science.gov (United States)

    Cunningham, Timothy J.; Berkman, Lisa F.; Gortmaker, Steven L.; Kiefe, Catarina I.; Jacobs, David R.; Seeman, Teresa E.; Kawachi, Ichiro

    2011-01-01

    The psychometric properties of instruments used to measure self-reported experiences of discrimination in epidemiologic studies are rarely assessed, especially regarding construct validity. The authors used 2000–2001 data from the Coronary Artery Risk Development in Young Adults (CARDIA) Study to examine differential item functioning (DIF) in 2 versions of the Experiences of Discrimination (EOD) Index, an index measuring self-reported experiences of racial/ethnic and gender discrimination. DIF may confound interpretation of subgroup differences. Large DIF was observed for 2 of 7 racial/ethnic discrimination items: White participants reported more racial/ethnic discrimination for the “at school” item, and black participants reported more racial/ethnic discrimination for the “getting housing” item. The large DIF by race/ethnicity in the index for racial/ethnic discrimination probably reflects item impact and is the result of valid group differences between blacks and whites regarding their respective experiences of discrimination. The authors also observed large DIF by race/ethnicity for 3 of 7 gender discrimination items. This is more likely to have been due to item bias. Users of the EOD Index must consider the advantages and disadvantages of DIF adjustment (omitting items, constructing separate measures, and retaining items). The EOD Index has substantial usefulness as an instrument that can assess self-reported experiences of discrimination. PMID:22038104

  6. Normative data for the 12 item WHO Disability Assessment Schedule 2.0.

    Directory of Open Access Journals (Sweden)

    Gavin Andrews

    BACKGROUND: The World Health Organization Disability Assessment Schedule (WHODAS 2.0) measures disability due to health conditions including diseases, illnesses, injuries, mental or emotional problems, and problems with alcohol or drugs. METHOD: The 12 Item WHODAS 2.0 was used in the second Australian Survey of Mental Health and Well-being. We report the overall factor structure and the distribution of scores and normative data (means and SDs) for people with any physical disorder, any mental disorder and for people with neither. FINDINGS: A single second order factor justifies the use of the scale as a measure of global disability. People with mental disorders had high scores (mean 6.3, SD 7.1), people with physical disorders had lower scores (mean 4.3, SD 6.1). People with no disorder covered by the survey had low scores (mean 1.4, SD 3.6). INTERPRETATION: The provision of normative data from a population sample of adults will facilitate use of the WHODAS 2.0 12 item scale in clinical and epidemiological research.

  7. Forced-Choice Assessment of Work-Related Maladaptive Personality Traits: Preliminary Evidence From an Application of Thurstonian Item Response Modeling.

    Science.gov (United States)

    Guenole, Nigel; Brown, Anna A; Cooper, Andrew J

    2018-06-01

    This article describes an investigation of whether Thurstonian item response modeling is a viable method for assessment of maladaptive traits. Forced-choice responses from 420 working adults to a broad-range personality inventory assessing six maladaptive traits were considered. The Thurstonian item response model's fit to the forced-choice data was adequate, while the fit of a counterpart item response model to responses to the same items but arranged in a single-stimulus design was poor. Monotrait heteromethod correlations indicated corresponding traits in the two formats overlapped substantially, although they did not measure equivalent constructs. A better goodness of fit and higher factor loadings for the Thurstonian item response model, coupled with a clearer conceptual alignment to the theoretical trait definitions, suggested that the single-stimulus item responses were influenced by biases that the independent clusters measurement model did not account for. Researchers may wish to consider forced-choice designs and appropriate item response modeling techniques such as Thurstonian item response modeling for personality questionnaire applications in industrial psychology, especially when assessing maladaptive traits. We recommend further investigation of this approach in actual selection situations and with different assessment instruments.

  8. Maslach Burnout Inventory and a Self-Defined, Single-Item Burnout Measure Produce Different Clinician and Staff Burnout Estimates.

    Science.gov (United States)

    Knox, Margae; Willard-Grace, Rachel; Huang, Beatrice; Grumbach, Kevin

    2018-06-04

    Clinicians and healthcare staff report high levels of burnout. Two common burnout assessments are the Maslach Burnout Inventory (MBI) and a single-item, self-defined burnout measure. Relatively little is known about how the measures compare. To identify the sensitivity, specificity, and concurrent validity of the self-defined burnout measure compared to the more established MBI measure. Cross-sectional survey (November 2016-January 2017). Four hundred forty-four primary care clinicians and 606 staff from three San Francisco Area healthcare systems. The MBI measure, calculated from a high score on either the emotional exhaustion or cynicism subscale, and a single-item measure of self-defined burnout. Concurrent validity was assessed using a validated, 7-item team culture scale as reported by Willard-Grace et al. (J Am Board Fam Med 27(2):229-38, 2014) and a standard question about workplace atmosphere as reported by Rassolian et al. (JAMA Intern Med 177(7):1036-8, 2017) and Linzer et al. (Ann Intern Med 151(1):28-36, 2009). Similar to other nationally representative burnout estimates, 52% of clinicians (95% CI: 47-57%) and 46% of staff (95% CI: 42-50%) reported high MBI emotional exhaustion or high MBI cynicism. In contrast, 29% of clinicians (95% CI: 25-33%) and 31% of staff (95% CI: 28-35%) reported "definitely burning out" or more severe symptoms on the self-defined burnout measure. The self-defined measure's sensitivity to correctly identify MBI-assessed burnout was 50.4% for clinicians and 58.6% for staff; specificity was 94.7% for clinicians and 92.3% for staff. Area under the receiver operator curve was 0.82 for clinicians and 0.81 for staff. Team culture and atmosphere were significantly associated with both self-defined burnout and the MBI, confirming concurrent validity. Point estimates of burnout notably differ between the self-defined and MBI measures. Compared to the MBI, the self-defined burnout measure misses half of high-burnout clinicians and more
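
    Sensitivity and specificity of the kind reported above come from a 2x2 cross-classification in which the MBI is treated as the reference standard. The abstract gives only percentages, so the counts in this minimal sketch are hypothetical, as is the function name `sensitivity_specificity`.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from 2x2 counts.

    tp: burned out on the reference measure and flagged by the single item
    fn: burned out on the reference measure but not flagged
    tn: not burned out on the reference measure and not flagged
    fp: not burned out on the reference measure but flagged
    """
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity, specificity


# Hypothetical counts, for illustration only (not the study's data).
sens, spec = sensitivity_specificity(tp=50, fn=50, tn=90, fp=10)
```

    A sensitivity near 0.5 with high specificity is the pattern the abstract describes: the single item rarely mislabels people without MBI-defined burnout but misses about half of those with it.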

  9. Comparison of Alternate and Original Items on the Montreal Cognitive Assessment.

    Science.gov (United States)

    Lebedeva, Elena; Huang, Mei; Koski, Lisa

    2016-03-01

    The Montreal Cognitive Assessment (MoCA) is a screening tool for mild cognitive impairment (MCI) in elderly individuals. We hypothesized that measurement error when using the new alternate MoCA versions to monitor change over time could be related to the use of items that are not of comparable difficulty to their corresponding originals of similar content. The objective of this study was to compare the difficulty of the alternate MoCA items to the original ones. Five selected items from alternate versions of the MoCA were included with items from the original MoCA administered adaptively to geriatric outpatients (N = 78). Rasch analysis was used to estimate the difficulty level of the items. None of the five items from the alternate versions matched the difficulty level of their corresponding original items. This study demonstrates the potential benefits of a Rasch analysis-based approach for selecting items during the process of development of parallel forms. The results suggest that better match of the items from different MoCA forms by their difficulty would result in higher sensitivity to changes in cognitive function over time.

  10. Item response theory, computerized adaptive testing, and PROMIS: assessment of physical function.

    Science.gov (United States)

    Fries, James F; Witter, James; Rose, Matthias; Cella, David; Khanna, Dinesh; Morgan-DeWitt, Esi

    2014-01-01

    Patient-reported outcome (PRO) questionnaires record health information directly from research participants because observers may not accurately represent the patient perspective. Patient-reported Outcomes Measurement Information System (PROMIS) is a US National Institutes of Health cooperative group charged with bringing PRO to a new level of precision and standardization across diseases by item development and use of item response theory (IRT). With IRT methods, improved items are calibrated on an underlying concept to form an item bank for a "domain" such as physical function (PF). The most informative items can be combined to construct efficient "instruments" such as 10-item or 20-item PF static forms. Each item is calibrated on the basis of the probability that a given person will respond at a given level, and the ability of the item to discriminate people from one another. Tailored forms may cover any desired level of the domain being measured. Computerized adaptive testing (CAT) selects the best items to sharpen the estimate of a person's functional ability, based on prior responses to earlier questions. PROMIS item banks have been improved with experience from several thousand items, and are calibrated on over 21,000 respondents. In areas tested to date, PROMIS PF instruments are superior or equal to Health Assessment Questionnaire and Medical Outcome Study Short Form-36 Survey legacy instruments in clarity, translatability, patient importance, reliability, and sensitivity to change. Precise measures, such as PROMIS, efficiently incorporate patient self-report of health into research, potentially reducing research cost by lowering sample size requirements. The advent of routine IRT applications has the potential to transform PRO measurement.
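
    The item-selection step of CAT described above can be sketched with a deliberately simplified model. The code below uses a dichotomous two-parameter logistic (2PL) item, whereas PROMIS items have multiple response levels, so this is a swapped-in simplification and not the PROMIS algorithm; the item names, parameter values, and function names are hypothetical.

```python
import math


def p_endorse(theta, a, b):
    """2PL probability of a positive response at ability theta.

    a is the item's discrimination, b its difficulty (location).
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def item_information(theta, a, b):
    """Fisher information of a 2PL item at theta: a^2 * P * (1 - P)."""
    p = p_endorse(theta, a, b)
    return a * a * p * (1.0 - p)


def next_item(theta, bank, administered):
    """Pick the not-yet-administered item that is most informative at theta."""
    candidates = [name for name in bank if name not in administered]
    return max(candidates, key=lambda name: item_information(theta, *bank[name]))


# Hypothetical item bank: name -> (discrimination a, difficulty b).
bank = {
    "walk_block": (1.5, -1.0),
    "climb_stairs": (2.0, 0.0),
    "run_mile": (1.2, 1.5),
}
choice = next_item(0.0, bank, administered=set())
```

    With a provisional ability estimate of 0.0, the sketch selects the item whose difficulty sits closest to that estimate and whose discrimination is highest; after each response the estimate would be updated and the selection repeated.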

  11. Explanatory item response modelling of an abstract reasoning assessment: A case for modern test design

    OpenAIRE

    Helland, Fredrik

    2016-01-01

    Assessment is an integral part of society and education, and for this reason it is important to know what you measure. This thesis is about explanatory item response modelling of an abstract reasoning assessment, with the objective to create a modern test design framework for automatic generation of valid and precalibrated items of abstract reasoning. Modern test design aims to strengthen the connections between the different components of a test, with a stress on strong theory, systematic it...

  12. Recommended core items to assess e-cigarette use in population-based surveys.

    Science.gov (United States)

    Pearson, Jennifer L; Hitchman, Sara C; Brose, Leonie S; Bauld, Linda; Glasser, Allison M; Villanti, Andrea C; McNeill, Ann; Abrams, David B; Cohen, Joanna E

    2018-05-01

    A consistent approach using standardised items to assess e-cigarette use in both youth and adult populations will aid cross-survey and cross-national comparisons of the effect of e-cigarette (and tobacco) policies and improve our understanding of the population health impact of e-cigarette use. Focusing on adult behaviour, we propose a set of e-cigarette use items, discuss their utility and potential adaptation, and highlight e-cigarette constructs that researchers should avoid without further item development. Reliable and valid items will strengthen the emerging science and inform knowledge synthesis for policy-making. Building on informal discussions at a series of international meetings of 65 experts from 15 countries, the authors provide recommendations for assessing e-cigarette use behaviour, relative perceived harm, device type, presence of nicotine, flavours and reasons for use. We recommend items assessing eight core constructs: e-cigarette ever use, frequency of use and former daily use; relative perceived harm; device type; primary flavour preference; presence of nicotine; and primary reason for use. These items should be standardised or minimally adapted for the policy context and target population. Researchers should be prepared to update items as e-cigarette device characteristics change. A minimum set of e-cigarette items is proposed to encourage consensus around items to allow for cross-survey and cross-jurisdictional comparisons of e-cigarette use behaviour. These proposed items are a starting point. We recognise room for continued improvement, and welcome input from e-cigarette users and scientific colleagues. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  13. Do people with and without medical conditions respond similarly to the short health anxiety inventory? An assessment of differential item functioning using item response theory.

    Science.gov (United States)

    LeBouthillier, Daniel M; Thibodeau, Michel A; Alberts, Nicole M; Hadjistavropoulos, Heather D; Asmundson, Gordon J G

    2015-04-01

    Individuals with medical conditions are likely to have elevated health anxiety; however, research has not demonstrated how medical status impacts response patterns on health anxiety measures. Measurement bias can undermine the validity of a questionnaire by overestimating or underestimating scores in groups of individuals. We investigated whether the Short Health Anxiety Inventory (SHAI), a widely-used measure of health anxiety, exhibits medical condition-based bias on item and subscale levels, and whether the SHAI subscales adequately assess the health anxiety continuum. Data were from 963 individuals with diabetes, breast cancer, or multiple sclerosis, and 372 healthy individuals. Mantel-Haenszel tests and item characteristic curves were used to classify the severity of item-level differential item functioning in all three medical groups compared to the healthy group. Test characteristic curves were used to assess scale-level differential item functioning and whether the SHAI subscales adequately assess the health anxiety continuum. Nine out of 14 items exhibited differential item functioning. Two items exhibited differential item functioning in all medical groups compared to the healthy group. In both Thought Intrusion and Fear of Illness subscales, differential item functioning was associated with mildly deflated scores in medical groups with very high levels of the latent traits. Fear of Illness items poorly discriminated between individuals with low and very low levels of the latent trait. While individuals with medical conditions may respond differentially to some items, clinicians and researchers can confidently use the SHAI with a variety of medical populations without concern of significant bias. Copyright © 2015 Elsevier Inc. All rights reserved.
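
    The Mantel-Haenszel approach named above compares item responses between groups after matching respondents on overall score. A minimal sketch of the common odds ratio for a dichotomized item follows; SHAI items have more than two response options, so dichotomizing is a simplifying assumption, and the counts and the function name `mantel_haenszel_or` are hypothetical.

```python
def mantel_haenszel_or(strata):
    """Mantel-Haenszel common odds ratio across matched score strata.

    Each stratum is a tuple (a, b, c, d):
      a, b = reference group endorsing / not endorsing the item
      c, d = focal group endorsing / not endorsing the item
    """
    numerator = 0.0
    denominator = 0.0
    for a, b, c, d in strata:
        n = a + b + c + d
        numerator += a * d / n
        denominator += b * c / n
    return numerator / denominator


# Hypothetical counts for two total-score strata (not study data).
tables = [(10, 10, 10, 10), (30, 10, 15, 5)]
common_or = mantel_haenszel_or(tables)
```

    A common odds ratio close to 1 indicates that, at the same overall score, both groups endorse the item at similar odds; values far from 1 flag the item for possible differential functioning.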

  14. An Examination of Differential Item Functioning on the Vanderbilt Assessment of Leadership in Education

    Science.gov (United States)

    Polikoff, Morgan S.; May, Henry; Porter, Andrew C.; Elliott, Stephen N.; Goldring, Ellen; Murphy, Joseph

    2009-01-01

    The Vanderbilt Assessment of Leadership in Education is a 360-degree assessment of the effectiveness of principals' learning-centered leadership behaviors. In this report, we present results from a differential item functioning (DIF) study of the assessment. Using data from a national field trial, we searched for evidence of DIF on school level,…

  15. A single-item self-report medication adherence question predicts hospitalisation and death in patients with heart failure.

    Science.gov (United States)

    Wu, Jia-Rong; DeWalt, Darren A; Baker, David W; Schillinger, Dean; Ruo, Bernice; Bibbins-Domingo, Kristen; Macabasco-O'Connell, Aurelia; Holmes, George M; Broucksou, Kimberly A; Erman, Brian; Hawk, Victoria; Cene, Crystal W; Jones, Christine DeLong; Pignone, Michael

    2014-09-01

    To determine whether a single-item self-report medication adherence question predicts hospitalisation and death in patients with heart failure. Poor medication adherence is associated with increased morbidity and mortality. Having a simple means of identifying suboptimal medication adherence could help identify at-risk patients for interventions. We performed a prospective cohort study in 592 participants with heart failure within a four-site randomised trial. Self-report medication adherence was assessed at baseline using a single-item question: 'Over the past seven days, how many times did you miss a dose of any of your heart medication?' Participants who reported no missing doses were defined as fully adherent, and those missing more than one dose were considered less than fully adherent. The primary outcome was combined all-cause hospitalisation or death over one year and the secondary endpoint was heart failure hospitalisation. Outcomes were assessed with blinded chart reviews, and heart failure outcomes were determined by a blinded adjudication committee. We used negative binomial regression to examine the relationship between medication adherence and outcomes. Fifty-two percent of participants were male, mean age was 61 years, and 31% were of New York Heart Association class III/IV at enrolment; 72% of participants reported full adherence to their heart medicine at baseline. Participants with full medication adherence had a lower rate of all-cause hospitalisation and death (0·71 events/year) compared with those with any nonadherence (0·86 events/year): adjusted-for-site incidence rate ratio was 0·83, fully adjusted incidence rate ratio 0·68. Incidence rate ratios were similar for heart failure hospitalisations. A single medication adherence question at baseline predicts hospitalisation and death over one year in heart failure patients. Medication adherence is associated with all-cause and heart failure-related hospitalisation and death in heart
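
    As a quick arithmetic check on the figures above, dividing the two reported event rates gives a crude (unadjusted) incidence rate ratio. This is only illustrative arithmetic on the abstract's numbers; the adjusted ratios come from the authors' negative binomial models and cannot be reproduced from the abstract. The variable names are hypothetical.

```python
# Event rates as reported in the abstract (events per person-year).
rate_fully_adherent = 0.71
rate_any_nonadherence = 0.86

# Crude incidence rate ratio: adherent rate relative to nonadherent rate.
crude_irr = rate_fully_adherent / rate_any_nonadherence
crude_irr_rounded = round(crude_irr, 2)  # 0.83
```

    The crude ratio rounds to the same value as the site-adjusted ratio quoted in the abstract, while the fully adjusted ratio of 0.68 reflects covariates that this calculation does not account for.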

  16. A single-item global job satisfaction measure is associated with quantitative blood immune indices in white-collar employees.

    Science.gov (United States)

    Nakata, Akinori; Irie, Masahiro; Takahashi, Masaya

    2013-01-01

    Although a single-item job satisfaction measure has been shown to be as reliable and inclusive as multiple-item scales in relation to health, studies including immunological data are few. The purpose of this study was to evaluate the validity of single-item job and family life satisfaction based on its association with immune indices. A total of 189 white-collar employees (70% men) underwent a blood draw for the measurement of natural killer (NK), total T, and B cell counts as well as plasma immunoglobulin (Ig) G concentrations and completed single-item job and family life satisfaction measures, respectively. The response options for satisfaction measures were 'dissatisfied' (coded 1) to 'satisfied' (coded 4). Spearman's partial correlations controlling for cofactors revealed that increased job satisfaction was positively associated with NK cells (rsp=0.201, p=0.007) and IgG (rsp=0.178, p=0.018), while family life satisfaction was unrelated to immune indices. Those who reported a combination of low job/low family life satisfaction had significantly lower NK and higher B cell counts than those with a high job/high family life satisfaction. Our study suggests that the single-item summary measure of job satisfaction, but not family life satisfaction, may be a valid tool to evaluate immune status in healthy white-collar employees.

  17. Psychometric properties of the Global Operative Assessment of Laparoscopic Skills (GOALS) using item response theory.

    Science.gov (United States)

    Watanabe, Yusuke; Madani, Amin; Ito, Yoichi M; Bilgic, Elif; McKendy, Katherine M; Feldman, Liane S; Fried, Gerald M; Vassiliou, Melina C

    2017-02-01

    The extent to which each item assessed using the Global Operative Assessment of Laparoscopic Skills (GOALS) contributes to the total score remains unknown. The purpose of this study was to evaluate the level of difficulty and discriminative ability of each of the 5 GOALS items using item response theory (IRT). A total of 396 GOALS assessments for a variety of laparoscopic procedures over a 12-year time period were included. Threshold parameters of item difficulty and discrimination power were estimated for each item using IRT. The higher slope parameters seen with "bimanual dexterity" and "efficiency" are indicative of greater discriminative ability than "depth perception", "tissue handling", and "autonomy". IRT psychometric analysis indicates that the 5 GOALS items do not demonstrate uniform difficulty and discriminative power, suggesting that they should not be scored equally. "Bimanual dexterity" and "efficiency" seem to have stronger discrimination. Weighted scores based on these findings could improve the accuracy of assessing individual laparoscopic skills. Copyright © 2016 Elsevier Inc. All rights reserved.

  18. Validity and usefulness of a single-item measure of patient-reported bother from side effects of cancer therapy.

    Science.gov (United States)

    Pearman, Timothy P; Beaumont, Jennifer L; Mroczek, Daniel; O'Connor, Mary; Cella, David

    2018-03-01

    The improving efficacy of cancer treatment has resulted in an increasing array of treatment-related symptoms and associated burdens imposed on individuals undergoing aggressive treatment of their disease. Often, clinical trials compare therapies that have different types, and severities, of adverse effects. Whether rated by clinicians or patients themselves, it can be difficult to know which side effect profile is more disruptive or bothersome to patients. A simple summary index of bother can help to adjudicate the variability in adverse effects across treatments being compared with each other. Across 4 studies, a total of 5765 patients enrolled in cooperative group studies and industry-sponsored clinical trials were the subjects of the current study. Patients were diagnosed with a range of primary cancer sites, including bladder, brain, breast, colon/rectum, head/neck, hepatobiliary, kidney, lung, ovary, pancreas, and prostate as well as leukemia and lymphoma. All patients were administered the Functional Assessment of Cancer Therapy-General version (FACT-G). The single item "I am bothered by side effects of treatment" (GP5), rated on a 5-point Likert scale, is part of the FACT-G. To determine its validity as a useful summary measure from the patient perspective, it was correlated with individual and aggregated clinician-rated adverse events and patient reports of their general ability to enjoy life. Analyses of pharmaceutical trials demonstrated that mean GP5 scores ("I am bothered by side effects of treatment") significantly differed by maximum adverse event grade. Effect sizes ranged from 0.13 to 0.46. Analyses of cooperative group trials demonstrated a significant correlation between GP5 and item GF3 ("I am able to enjoy life") in the predicted direction. The single FACT-G item "I am bothered by side effects of treatment" is significantly associated with clinician-reported adverse events and with patients' ability to enjoy their lives. It has promise as an

  19. Measuring everyday functional competence using the Rasch assessment of everyday activity limitations (REAL) item bank.

    Science.gov (United States)

    Oude Voshaar, Martijn A H; Ten Klooster, Peter M; Vonkeman, Harald E; van de Laar, Mart A F J

    2017-11-01

    Traditional patient-reported physical function instruments often poorly differentiate patients with mild-to-moderate disability. We describe the development and psychometric evaluation of a generic item bank for measuring everyday activity limitations in outpatient populations. Seventy-two items generated from patient interviews and mapped to the International Classification of Functioning, Disability and Health (ICF) domestic life chapter were administered to 1128 adults representative of the Dutch population. The partial credit model was fitted to the item responses and evaluated with respect to its assumptions, model fit, and differential item functioning (DIF). Measurement performance of a computerized adaptive testing (CAT) algorithm was compared with the SF-36 physical functioning scale (PF-10). A final bank of 41 items was developed. All items demonstrated acceptable fit to the partial credit model and measurement invariance across age, sex, and educational level. Five- and ten-item CAT simulations were shown to have high measurement precision, which exceeded that of SF-36 physical functioning scale across the physical function continuum. Floor effects were absent for a 10-item empirical CAT simulation, and ceiling effects were low (13.5%) compared with SF-36 physical functioning (38.1%). CAT also discriminated better than SF-36 physical functioning between age groups, number of chronic conditions, and respondents with or without rheumatic conditions. The Rasch assessment of everyday activity limitations (REAL) item bank will hopefully prove a useful instrument for assessing everyday activity limitations. T-scores obtained using derived measures can be used to benchmark physical function outcomes against the general Dutch adult population.

  20. Working memory for sequences of temporal durations reveals a volatile single-item store

    Directory of Open Access Journals (Sweden)

    Sanjay G Manohar

    2016-10-01

    When a sequence is held in working memory, different items are retained with differing fidelity. Here we ask whether a sequence of brief time intervals that must be remembered shows recency effects, similar to those observed in verbal and visuospatial working memory. It has been suggested that prioritising some items over others can be accounted for by a focus of attention, maintaining some items in a privileged state. We therefore also investigated whether such benefits are vulnerable to disruption by attention or expectation. Participants listened to sequences of one to five tones, of varying durations (200 ms to 2 s). Subsequently, the length of one of the tones in the sequence had to be reproduced by holding a key. The discrepancy between the reproduced and actual durations quantified the fidelity of memory for auditory durations. Recall precision decreased with the number of items that had to be remembered, and was better for the first and last items of sequences, in line with set-size and serial position effects seen in other modalities. To test whether attentional filtering demands might impair performance, an irrelevant variation in pitch was introduced in some blocks of trials. In those blocks, memory precision was worse for sequences that consisted of only one item, i.e. the smallest memory set size. Thus, when irrelevant information was present, the benefit of having only one item in memory is attenuated. Finally we examined whether expectation could interfere with memory. On half the trials, the number of items in the upcoming sequence was cued. When the number of items was known in advance, performance was paradoxically worse when the sequence consisted of only one item. Thus the benefit of having only one item to remember is stronger when it is unexpectedly the only item. Our results suggest that similar mechanisms are used to hold auditory time durations in working memory, as for visual or verbal stimuli. Further, solitary items were

  1. Assessing nicotine dependence in adolescent E-cigarette users: The 4-item Patient-Reported Outcomes Measurement Information System (PROMIS) Nicotine Dependence Item Bank for electronic cigarettes.

    Science.gov (United States)

    Morean, Meghan E; Krishnan-Sarin, Suchitra; S O'Malley, Stephanie

    2018-04-26

    Adolescent e-cigarette use (i.e., "vaping") likely confers risk for developing nicotine dependence. However, there have been no studies assessing e-cigarette nicotine dependence in youth. We evaluated the psychometric properties of the 4-item Patient-Reported Outcomes Measurement Information System Nicotine Dependence Item Bank for E-cigarettes (PROMIS-E) for assessing youth e-cigarette nicotine dependence and examined risk factors for experiencing stronger dependence symptoms. In 2017, 520 adolescent past-month e-cigarette users completed the PROMIS-E during a school-based survey (50.5% female, 84.8% White, 16.22[1.19] years old). Adolescents also reported on sex, grade, race, age at e-cigarette use onset, vaping frequency, nicotine e-liquid use, and past-month cigarette smoking. Analyses included conducting confirmatory factor analysis and examining the internal consistency of the PROMIS-E. Bivariate correlations and independent-samples t-tests were used to examine unadjusted relationships between e-cigarette nicotine dependence and the proposed risk factors. Regression models were run in which all potential risk factors were entered as simultaneous predictors of PROMIS-E scores. The single-factor structure of the PROMIS-E was confirmed and evidenced good internal consistency. Across models, larger PROMIS-E scores were associated with being in a higher grade, initiating e-cigarette use at an earlier age, vaping more frequently, using nicotine e-liquid (and higher nicotine concentrations), and smoking cigarettes. Adolescent e-cigarette users reported experiencing nicotine dependence, which was assessed using the psychometrically sound PROMIS-E. Experiencing stronger nicotine dependence symptoms was associated with characteristics that previously have been shown to confer risk for frequent vaping and tobacco cigarette dependence. Copyright © 2018 Elsevier B.V. All rights reserved.

  2. Using Item Analysis to Assess Objectively the Quality of the Calgary-Cambridge OSCE Checklist

    Directory of Open Access Journals (Sweden)

    Tyrone Donnon

    2011-06-01

    Background: The purpose of this study was to investigate the use of item analysis to assess objectively the quality of items on the Calgary-Cambridge Communications OSCE checklist. Methods: A total of 150 first year medical students were provided with extensive teaching on the use of the Calgary-Cambridge Guidelines for interviewing patients and participated in a final year-end 20-minute communication OSCE station. With students grouped into either the upper half (50%) or lower half (50%) of communication skills performance, discrimination, difficulty and point biserial values were calculated for each checklist item. Results: The mean score on the 33-item communication checklist was 24.09 (SD = 4.46) and the internal reliability coefficient was α = 0.77. Although most of the items were found to have moderate (k = 12, 36%) or excellent (k = 10, 30%) discrimination values, there were 6 (18%) identified as ‘fair’ and 3 (9%) as ‘poor’. A post-examination review focused on item analysis findings resulted in an increase in checklist reliability (α = 0.80). Conclusions: Item analysis has been used with MCQ exams extensively. In this study, it was also found to be an objective and practical approach to use in evaluating the quality of a standardized OSCE checklist.
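
    The three item statistics named in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' procedure: checklist items are taken to be scored 0/1, the upper and lower groups are formed by a median split on total score, and the point-biserial is the uncorrected correlation between item score and total score (the item itself is included in the total). The function name `item_analysis` is hypothetical.

```python
from statistics import mean, pstdev


def item_analysis(responses):
    """Return difficulty, discrimination and point-biserial for each 0/1 item.

    responses: one list of 0/1 item scores per student.
    """
    n_items = len(responses[0])
    totals = [sum(r) for r in responses]
    # Rank students by total score and split into lower and upper halves.
    order = sorted(range(len(responses)), key=lambda i: totals[i])
    half = len(order) // 2
    lower, upper = order[:half], order[-half:]
    results = []
    for j in range(n_items):
        scores = [r[j] for r in responses]
        # Difficulty index: proportion of students credited on the item.
        p = mean(scores)
        # Discrimination index: upper-group proportion minus lower-group proportion.
        d = mean([responses[i][j] for i in upper]) - mean([responses[i][j] for i in lower])
        # Point-biserial: Pearson correlation of item score with total score.
        sx, sy = pstdev(scores), pstdev(totals)
        if sx == 0 or sy == 0:
            rpb = 0.0  # undefined when there is no variance; reported as 0 here
        else:
            cov = mean([s * t for s, t in zip(scores, totals)]) - p * mean(totals)
            rpb = cov / (sx * sy)
        results.append({"difficulty": p, "discrimination": d, "point_biserial": rpb})
    return results


# Tiny made-up data set: 4 students, 2 checklist items.
for stats in item_analysis([[1, 1], [1, 0], [0, 0], [1, 1]]):
    print(stats)
```

    Items with low or negative discrimination are the usual candidates for the kind of post-examination review the abstract describes.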

  3. An Anthropologist among the Psychometricians: Assessment Events, Ethnography, and Differential Item Functioning in the Mongolian Gobi

    Science.gov (United States)

    Maddox, Bryan; Zumbo, Bruno D.; Tay-Lim, Brenda; Qu, Demin

    2015-01-01

    This article explores the potential for ethnographic observations to inform the analysis of test item performance. In 2010, a standardized, large-scale adult literacy assessment took place in Mongolia as part of the United Nations Educational, Scientific and Cultural Organization Literacy Assessment and Monitoring Programme (LAMP). In a novel form…

  4. Modeling the World Health Organization Disability Assessment Schedule II using non-parametric item response models.

    Science.gov (United States)

    Galindo-Garre, Francisca; Hidalgo, María Dolores; Guilera, Georgina; Pino, Oscar; Rojo, J Emilio; Gómez-Benito, Juana

    2015-03-01

    The World Health Organization Disability Assessment Schedule II (WHO-DAS II) is a multidimensional instrument developed for measuring disability. It comprises six domains (getting around, self-care, getting along with others, life activities and participation in society). The main purpose of this paper is the evaluation of the psychometric properties for each domain of the WHO-DAS II with parametric and non-parametric Item Response Theory (IRT) models. A secondary objective is to assess whether the WHO-DAS II items within each domain form a hierarchy of invariantly ordered severity indicators of disability. A sample of 352 patients with a schizophrenia spectrum disorder is used in this study. The 36-item WHO-DAS II was administered during the consultation. Partial Credit and Mokken scale models are used to study the psychometric properties of the questionnaire. The psychometric properties of the WHO-DAS II scale are satisfactory for all the domains. However, we identify a few items that do not discriminate satisfactorily between different levels of disability and cannot be invariantly ordered in the scale. In conclusion, the WHO-DAS II can be used to assess overall disability in patients with schizophrenia, but some domains are too general to assess functionality in these patients because they contain items that are not applicable to this pathology. Copyright © 2014 John Wiley & Sons, Ltd.

  5. Hippocampal damage equally impairs memory for single items and memory for conjunctions.

    Science.gov (United States)

    Stark, Craig E L; Squire, Larry R

    2003-01-01

    single-item and associative memory.

  6. Small group learning: effect on item analysis and accuracy of self-assessment of medical students.

    Science.gov (United States)

    Biswas, Shubho Subrata; Jain, Vaishali; Agrawal, Vandana; Bindra, Maninder

    2015-01-01

    Small group sessions are regarded as a more active and student-centered approach to learning. Item analysis provides objective evidence of whether such sessions improve comprehension and make the topic easier for students, in addition to assessing the relative benefit of the sessions to good versus poor performers. Self-assessment makes students aware of their deficiencies. Small group sessions can also help students develop the ability to self-assess. This study was carried out to assess the effect of small group sessions on item analysis and students' self-assessment. A total of 21 female and 29 male first year medical students participated in a small group session on topics covered by didactic lectures two weeks earlier. It was preceded and followed by two multiple choice question (MCQ) tests, in which students were asked to self-assess their likely score. The MCQs used were item analyzed in a previous group and were chosen of matching difficulty and discriminatory indices for the pre- and post-tests. The small group session improved the marks of both genders equally, but female performance was better. The session made the items easier; increasing the difficulty index significantly but there was no significant alteration in the discriminatory index. There was overestimation in the self-assessment of both genders, but male overestimation was greater. The session improved the self-assessment of students in terms of expected marks and expectation of passing. Small group session improved the ability of students to self-assess their knowledge and increased the difficulty index of items reflecting students' better performance.

  7. Robustness of two single-item self-esteem measures: cross-validation with a measure of stigma in a sample of psychiatric patients.

    Science.gov (United States)

    Bagley, Christopher

    2005-08-01

    Robins' Single-item Self-esteem Inventory was compared with a single item from the Coopersmith Self-esteem. Although a new scoring format was used, there was good evidence of cross-validation in 83 current and former psychiatric patients who completed Harvey's adapted measure of stigma felt and experienced by users of mental health services. Scores on the two single-item self-esteem measures correlated .76 (p … self-esteem in users of mental health services.

  8. [Impact of passing items above the ceiling on the assessment results of Peabody developmental motor scales].

    Science.gov (United States)

    Zhao, Gai; Bian, Yang; Li, Ming

    2013-12-18

    To analyze the impact of passing items above the ceiling level in the gross motor subtest of the Peabody Developmental Motor Scales (PDMS-2) on its assessment results. The PDMS-2 subtests were administered to 124 children aged 1.2 to 71 months. In addition to the original scoring method, a new scoring method that includes passing items above the ceiling was developed. The standard scores and quotients of the two scoring methods were compared using the independent-samples t test. Only one child could pass items above the ceiling in the stationary subtest, 19 children in the locomotion subtest, and 17 children in the visual-motor integration subtest. When the scores of these passing items were included in the raw scores, the total raw scores increased by 1-12 points, the standard scores by 0-1 points, and the motor quotients by 0-3 points. The diagnostic classification changed in only two children. There was no significant difference between the two methods in motor quotients or standard scores in the specific subtest (P>0.05). Passing items above the ceiling of the PDMS-2 is not a rare situation. It usually occurs in the locomotion and visual-motor integration subtests. Including these passing items in the scoring system does not make a significant difference in the standard scores of the subtests or the developmental motor quotients (DMQ), which supports the original setting of a ceiling established by failing 3 items in a row. However, adding the passing items above the ceiling to the raw score will improve tracking of children's developmental trajectory and intervention effects.

  9. Combining item response theory with multiple imputation to equate health assessment questionnaires.

    Science.gov (United States)

    Gu, Chenyang; Gutman, Roee

    2017-09-01

    The assessment of patients' functional status across the continuum of care requires a common patient assessment tool. However, assessment tools that are used in various health care settings differ and cannot be easily contrasted. For example, the Functional Independence Measure (FIM) is used to evaluate the functional status of patients who stay in inpatient rehabilitation facilities, the Minimum Data Set (MDS) is collected for all patients who stay in skilled nursing facilities, and the Outcome and Assessment Information Set (OASIS) is collected if they choose home health care provided by home health agencies. All three instruments or questionnaires include functional status items, but the specific items, rating scales, and instructions for scoring different activities vary between the different settings. We consider equating different health assessment questionnaires as a missing data problem, and propose a variant of predictive mean matching method that relies on Item Response Theory (IRT) models to impute unmeasured item responses. Using real data sets, we simulated missing measurements and compared our proposed approach to existing methods for missing data imputation. We show that, for all of the estimands considered, and in most of the experimental conditions that were examined, the proposed approach provides valid inferences, and generally has better coverages, relatively smaller biases, and shorter interval estimates. The proposed method is further illustrated using a real data set. © 2016, The International Biometric Society.

  10. Overview of Classical Test Theory and Item Response Theory for Quantitative Assessment of Items in Developing Patient-Reported Outcome Measures

    Science.gov (United States)

    Cappelleri, Joseph C.; Lundy, J. Jason; Hays, Ron D.

    2014-01-01

    Introduction The U.S. Food and Drug Administration’s patient-reported outcome (PRO) guidance document defines content validity as “the extent to which the instrument measures the concept of interest” (FDA, 2009, p. 12). “Construct validity is now generally viewed as a unifying form of validity for psychological measurements, subsuming both content and criterion validity” (Strauss & Smith, 2009, p. 7). Hence both qualitative and quantitative information are essential in evaluating the validity of measures. Methods We review classical test theory and item response theory approaches to evaluating PRO measures including frequency of responses to each category of the items in a multi-item scale, the distribution of scale scores, floor and ceiling effects, the relationship between item response options and the total score, and the extent to which hypothesized “difficulty” (severity) order of items is represented by observed responses. Conclusion Classical test theory and item response theory can be useful in providing a quantitative assessment of items and scales during the content validity phase of patient-reported outcome measures. Depending on the particular type of measure and the specific circumstances, either one or both approaches should be considered to help maximize the content validity of PRO measures. PMID:24811753

  11. Identifying Promising Items: The Use of Crowdsourcing in the Development of Assessment Instruments

    Science.gov (United States)

    Sadler, Philip M.; Sonnert, Gerhard; Coyle, Harold P.; Miller, Kelly A.

    2016-01-01

    The psychometrically sound development of assessment instruments requires pilot testing of candidate items as a first step in gauging their quality, typically a time-consuming and costly effort. Crowdsourcing offers the opportunity for gathering data much more quickly and inexpensively than from most targeted populations. In a simulation of a…

  12. Sensitivity and specificity of the 3-item memory test in the assessment of post traumatic amnesia.

    NARCIS (Netherlands)

    Andriessen, T.M.J.C.; Jong, B. de; Jacobs, B.; Werf, S.P. van der; Vos, P.E.

    2009-01-01

    PRIMARY OBJECTIVE: To investigate how the type of stimulus (pictures or words) and the method of reproduction (free recall or recognition after a short or a long delay) affect the sensitivity and specificity of a 3-item memory test in the assessment of post traumatic amnesia (PTA). METHODS: Daily

  13. Improving the Memory Sections of the Standardized Assessment of Concussion Using Item Analysis

    Science.gov (United States)

    McElhiney, Danielle; Kang, Minsoo; Starkey, Chad; Ragan, Brian

    2014-01-01

    The purpose of the study was to improve the immediate and delayed memory sections of the Standardized Assessment of Concussion (SAC) by identifying a list of more psychometrically sound items (words). A total of 200 participants with no history of concussion in the previous six months (aged 19.60 ± 2.20 years; N = 93 men, N = 107 women)…

  14. Investigation of Science Inquiry Items for Use on an Alternate Assessment Based on Modified Achievement Standards Using Cognitive Lab Methodology

    Science.gov (United States)

    Dickenson, Tammiee S.; Gilmore, Joanna A.; Price, Karen J.; Bennett, Heather L.

    2013-01-01

    This study evaluated the benefits of item enhancements applied to science-inquiry items for incorporation into an alternate assessment based on modified achievement standards for high school students. Six items were included in the cognitive lab sessions involving both students with and without disabilities. The enhancements (e.g., use of visuals,…

  15. Development and validation of a ten-item questionnaire with explanatory illustrations to assess upper extremity disorders: favorable effect of illustrations in the item reduction process.

    Science.gov (United States)

    Kurimoto, Shigeru; Suzuki, Mikako; Yamamoto, Michiro; Okui, Nobuyuki; Imaeda, Toshihiko; Hirata, Hitoshi

    2011-11-01

    The purpose of this study is to develop a short and valid measure for upper extremity disorders and to assess the effect of attached illustrations in item reduction of a self-administered disability questionnaire while retaining psychometric properties. A validated questionnaire used to assess upper extremity disorders, the Hand20, was reduced to ten items using two item-reduction techniques. The psychometric properties of the abbreviated form, the Hand10, were evaluated on an independent sample that was used for the shortening process. Validity, reliability, and responsiveness of the Hand10 were retained in the item reduction process. It was possible that the use of explanatory illustrations attached to the Hand10 helped with its reproducibility. The illustrations for the Hand10 promoted text comprehension and motivation to answer the items. These changes resulted in high acceptability; more than 99.3% of patients, including 98.5% of elderly patients, could complete the Hand10 properly. The illustrations had favorable effects on the item reduction process and made it possible to retain precision of the instrument. The Hand10 is a reliable and valid instrument for individual-level applications with the advantage of being compact and broadly applicable, even in elderly individuals.

  16. What Form of Mathematics Are Assessments Assessing? The Case of Multiplication and Division in Fourth Grade NAEP Items

    Science.gov (United States)

    Kosko, Karl W.; Singh, Rashmi

    2018-01-01

    Multiplicative reasoning is a key concept in elementary school mathematics. Item statistics reported by the National Assessment of Educational Progress (NAEP) assessment provide the best current indicator for how well elementary students across the U.S. understand this, and other concepts. However, beyond expert reviews and statistical analysis,…

  17. Item difficulty of multiple choice tests dependant on different item response formats – An experiment in fundamental research on psychological assessment

    Directory of Open Access Journals (Sweden)

    KLAUS D. KUBINGER

    2007-12-01

    Multiple choice response formats are problematic because an item is often scored as solved simply because the test-taker is a lucky guesser. Instead of applying pertinent IRT models which take guessing effects into account, a pragmatic approach of re-conceptualizing multiple choice response formats to reduce the chance of lucky guessing is considered. This paper compares the free response format with two different multiple choice formats. A common multiple choice format with a single correct response option and five distractors (“1 of 6”) is used, as well as a multiple choice format with five response options, of which any number may be correct and the item is only scored as mastered if all the correct response options and none of the wrong ones are marked (“x of 5”). An experiment was designed, using pairs of items with exactly the same content but different response formats. 173 test-takers were randomly assigned to two test booklets of 150 items altogether. Rasch model analyses adduced a fitting item pool, after the deletion of 39 items. The resulting item difficulty parameters were used for the comparison of the different formats. The multiple choice format “1 of 6” differs significantly from “x of 5”, with a relative effect of 1.63, while the multiple choice format “x of 5” does not significantly differ from the free response format. Therefore, the lower degree of difficulty of items with the “1 of 6” multiple choice format is an indicator of relevant guessing effects. In contrast the “x of 5” multiple choice format can be seen as an appropriate substitute for free response format.
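
    The difference in guessing risk between the two formats can be made concrete with a short calculation. The sketch assumes purely blind guessing in which every admissible answer pattern is equally likely; that assumption, and the function names, are illustrative and are not taken from the paper. For “x of 5” the result depends on whether an item with no correct option is allowed, so both variants are shown.

```python
from fractions import Fraction


def guess_prob_single_choice(n_options):
    """Chance of a blind guess hitting the one correct option ("1 of n")."""
    return Fraction(1, n_options)


def guess_prob_multiple_select(n_options, allow_none=False):
    """Chance of a blind guess marking exactly the right subset ("x of n").

    Every subset of options is treated as an equally likely guess; the empty
    subset is excluded unless allow_none is True.
    """
    patterns = 2 ** n_options - (0 if allow_none else 1)
    return Fraction(1, patterns)


print("1 of 6:", guess_prob_single_choice(6))
print("x of 5 (at least one correct):", guess_prob_multiple_select(5))
print("x of 5 (none may be correct):", guess_prob_multiple_select(5, allow_none=True))
```

    Under these assumptions a lucky guess is roughly five times less likely in the “x of 5” format than in “1 of 6”, which is consistent with the abstract's finding that “x of 5” items behave more like free response items.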

  18. Recommended core items to assess e-cigarette use in population-based surveys

    OpenAIRE

    Pearson, Jennifer L; Hitchman, Sara C; Brose, Leonie S; Bauld, Linda; Glasser, Allison M; Villanti, Andrea C; McNeill, Ann; Abrams, David B; Cohen, Joanna E

    2017-01-01

    Background: A consistent approach using standardized items to assess e-cigarette use in both youth and adult populations will aid cross-survey and cross-national comparisons of the effect of e-cigarette (and tobacco) policies and improve our understanding of the population health impact of e-cigarette use. Focusing on adult behavior, we propose a set of e-cigarette use items, discuss their utility and potential adaptation, and highlight e-cigarette constructs that researchers should avoid wit...

  19. RT-based memory detection : Item saliency effects in the single-probe and the multiple-probe protocol

    NARCIS (Netherlands)

    Verschuere, B.; Kleinberg, B.; Theocharidou, K.

    RT-based memory detection may provide an efficient means to assess recognition of concealed information. There is, however, considerable heterogeneity in detection rates, and we explored two potential moderators: item saliency and test protocol. Participants tried to conceal low salient (e.g.,

  20. An introduction to Item Response Theory and Rasch Analysis of the Eating Assessment Tool (EAT-10).

    Science.gov (United States)

    Kean, Jacob; Brodke, Darrel S; Biber, Joshua; Gross, Paul

    2018-03-01

    Item response theory has its origins in educational measurement and is now commonly applied in health-related measurement of latent traits, such as function and symptoms. This application is due in large part to gains in the precision of measurement attributable to item response theory and corresponding decreases in response burden, study costs, and study duration. The purpose of this paper is twofold: introduce basic concepts of item response theory and demonstrate this analytic approach in a worked example, a Rasch model (1PL) analysis of the Eating Assessment Tool (EAT-10), a commonly used measure for oropharyngeal dysphagia. The results of the analysis were largely concordant with previous studies of the EAT-10 and illustrate for brain impairment clinicians and researchers how IRT analysis can yield greater precision of measurement.
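
    The link between a Rasch model and precision of measurement can be sketched in a few lines. The example uses the dichotomous Rasch (1PL) form for simplicity; EAT-10 items are rated on ordered categories, so the published analysis presumably used a polytomous extension, and the item difficulties below are hypothetical values chosen only for illustration.

```python
import math


def rasch_probability(theta, b):
    """Probability of endorsing an item of difficulty b at trait level theta."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def scale_information(theta, difficulties):
    """Fisher information of the item set at theta: sum of p * (1 - p)."""
    probs = [rasch_probability(theta, b) for b in difficulties]
    return sum(p * (1.0 - p) for p in probs)


def standard_error(theta, difficulties):
    """Approximate standard error of the trait estimate at theta."""
    return 1.0 / math.sqrt(scale_information(theta, difficulties))


# Hypothetical difficulties for a 10-item scale, spread around zero.
items = [-2.0, -1.5, -1.0, -0.5, 0.0, 0.0, 0.5, 1.0, 1.5, 2.0]
for theta in (-3.0, 0.0, 3.0):
    print(f"theta={theta:+.1f}  SE={standard_error(theta, items):.2f}")
```

    The standard error is smallest where item difficulties match the person's trait level, which is why item-level IRT results can point to where a scale measures precisely and where it does not.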

  1. Item analysis of single-peaked response data : the psychometric evaluation of bipolar measurement scales

    NARCIS (Netherlands)

    Polak, Maaike Geertruida

    2011-01-01

    The thesis explains the fundamental difference between unipolar and bipolar measurement scales for psychological characteristics. We explore the use of correspondence analysis (CA), a technique that is similar to principal component analysis and is available in SAS and SPSS, to select items that

  2. Examining the Psychometric Quality of Multiple-Choice Assessment Items using Mokken Scale Analysis.

    Science.gov (United States)

    Wind, Stefanie A

    The concept of invariant measurement is typically associated with Rasch measurement theory (Engelhard, 2013). Concerned with the appropriateness of the parametric transformation upon which the Rasch model is based, Mokken (1971) proposed a nonparametric procedure for evaluating the quality of social science measurement that is theoretically and empirically related to the Rasch model. Mokken's nonparametric procedure can be used to evaluate the quality of dichotomous and polytomous items in terms of the requirements for invariant measurement. Despite these potential benefits, the use of Mokken scaling to examine the properties of multiple-choice (MC) items in education has not yet been fully explored. A nonparametric approach to evaluating MC items is promising in that this approach facilitates the evaluation of assessments in terms of invariant measurement without imposing potentially inappropriate transformations. Using Rasch-based indices of measurement quality as a frame of reference, data from an eighth-grade physical science assessment are used to illustrate and explore Mokken-based techniques for evaluating the quality of MC items. Implications for research and practice are discussed.

  3. Development of an assessment tool to measure students′ perceptions of respiratory care education programs: Item generation, item reduction, and preliminary validation

    Directory of Open Access Journals (Sweden)

    Ghazi Alotaibi

    2013-01-01

    Objectives: Students who perceive their learning environment positively are more likely to develop effective learning strategies and adopt a deep learning approach. Currently, there is no validated instrument for measuring the educational environment of educational programs on respiratory care (RC). The aim of this study was to develop an instrument to measure students' perception of the RC educational environment. Materials and Methods: Based on the literature review and an assessment of content validity by multiple focus groups of RC educationalists, potential items of the instrument relevant to the RC educational environment construct were generated by the research group. The initial 71-item questionnaire was then field-tested on all students from the 3 RC programs in Saudi Arabia and was subjected to multi-trait scaling analysis. Cronbach's alpha was used to assess internal consistency reliabilities. Results: Two hundred and twelve students (100%) completed the survey. The initial instrument of 71 items was reduced to 65 across 5 scales. Convergent and discriminant validity assessment demonstrated that the majority of items correlated more highly with their intended scale than a competing one. Cronbach's alpha exceeded the standard criterion of >0.70 in all scales except one. There was no floor or ceiling effect for scale or overall score. Conclusions: This instrument is the first assessment tool developed to measure the RC educational environment. There was evidence of its good feasibility, validity, and reliability. This first validation of the instrument supports its use by RC students to evaluate educational environment.
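
    Cronbach's alpha, the internal consistency statistic used here, can be computed directly from an item-score matrix. The sketch below applies the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores); the data are made up, and the function name `cronbach_alpha` is only illustrative.

```python
from statistics import pvariance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    # Variance of each item across respondents.
    item_vars = [pvariance([row[j] for row in scores]) for j in range(k)]
    # Variance of the summed scale score across respondents.
    total_var = pvariance([sum(row) for row in scores])
    return k / (k - 1) * (1.0 - sum(item_vars) / total_var)


# Made-up responses: 5 respondents, 3 items on a 1-5 scale.
data = [
    [4, 5, 4],
    [2, 3, 2],
    [5, 5, 4],
    [3, 3, 3],
    [1, 2, 2],
]
print(f"alpha = {cronbach_alpha(data):.2f}")
```

    A scale would meet the abstract's criterion when the value returned exceeds 0.70.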

  4. Open Single Item of Perceived Risk Factors (OSIPRF) toward Cardiovascular Diseases Is an Appropriate Instrument for Evaluating Psychological Symptoms

    Directory of Open Access Journals (Sweden)

    Mozhgan Saeidi

    2016-12-01

    Psychological symptoms are considered one of the aspects and consequences of cardiovascular diseases (CVDs), management of which can hasten and facilitate the process of recovery. Evaluation of the psychological symptoms can increase the treatment team's awareness of patients' mental health, which can be beneficial for designing treatment programs (1). However, the time-consuming process of interviews and questionnaire assessment leads to fatigue and lack of patient cooperation, which may be problematic for healthcare evaluators. Therefore, brief, practical, and easy-to-implement alternatives are consistently recommended. A practical method for assessing patients' psychological status is examining causal beliefs and attitudes about the disease. The causal beliefs and risk factors perceived by patients, which are significantly related to the actual risk factors for CVDs (2), are not only related to psychological adjustment and mental health but also have an impact on patients' compliance with treatment recommendations (3). Several factors appear to influence the perceived risk factors for CVDs, such as gender (4), age (5), and, most importantly, patients' psychological status (3). Accordingly, evaluating the causal beliefs and risk factors perceived by patients could be a shortcut method for evaluating patients' psychological health. In recent years, Saeidi and Komasi (5) proposed a question and investigated the perceived risk factors with an open single item: "What do you think is the main cause of your illness?". According to the authors, the perceived risk factors are recorded in five categories including biological (age, gender, and family history), environmental (dust, smoke, passive smoking, toxic substances, and effects of war), physiological (diabetes, hypertension, hyperlipidemia, and obesity), behavioral (lack of exercise, nutrition

  5. Checking Equity: Why Differential Item Functioning Analysis Should Be a Routine Part of Developing Conceptual Assessments

    Czech Academy of Sciences Publication Activity Database

    Martinková, Patrícia; Drabinová, Adéla; Liaw, Y.L.; Sanders, E.A.; McFarland, J.L.; Price, R.M.

    2017-01-01

    Vol. 16, No. 2 (2017), article No. rm2. ISSN 1931-7913 R&D Projects: GA ČR GJ15-15856Y Grant - others: NSF(US) DUE-1043443 Institutional support: RVO:67985807 Keywords: differential item functioning * fairness * conceptual assessments * concept inventory * undergraduate education * bias Subject RIV: AM - Education OECD field: Education, special (to gifted persons, those with learning disabilities) Impact factor: 3.930, year: 2016

  6. Improved utilization of ADAS-cog assessment data through item response theory based pharmacometric modeling.

    Science.gov (United States)

    Ueckert, Sebastian; Plan, Elodie L; Ito, Kaori; Karlsson, Mats O; Corrigan, Brian; Hooker, Andrew C

    2014-08-01

    This work investigates improved utilization of ADAS-cog data (the primary outcome in Alzheimer's disease (AD) trials of mild and moderate AD) by combining pharmacometric modeling and item response theory (IRT). A baseline IRT model characterizing the ADAS-cog was built based on data from 2,744 individuals. Pharmacometric methods were used to extend the baseline IRT model to describe longitudinal ADAS-cog scores from an 18-month clinical study with 322 patients. Sensitivity of the ADAS-cog items in different patient populations as well as the power to detect a drug effect in relation to total-score-based methods were assessed with the IRT-based model. IRT analysis was able to describe both total and item-level baseline ADAS-cog data. Longitudinal data were also well described. Differences in the information content of the item-level components could be quantitatively characterized and ranked for mild cognitive impairment and mild AD populations. Based on clinical trial simulations with a theoretical drug effect, the IRT method demonstrated a significantly higher power to detect drug effect compared to the traditional method of analysis. A combined framework of IRT and pharmacometric modeling permits a more effective and precise analysis than total-score-based methods and therefore increases the value of ADAS-cog data.

  7. Development of Rasch-based item banks for the assessment of work performance in patients with musculoskeletal diseases.

    Science.gov (United States)

    Mueller, Evelyn A; Bengel, Juergen; Wirtz, Markus A

    2013-12-01

    This study aimed to develop a self-description assessment instrument to measure work performance in patients with musculoskeletal diseases. In terms of the International Classification of Functioning, Disability and Health (ICF), work performance is defined as the degree of meeting the work demands (activities) at the actual workplace (environment). To account for the fact that work performance depends on the work demands of the job, we strived to develop item banks that allow a flexible use of item subgroups depending on the specific work demands of the patients' jobs. Item development included the collection of work tasks from literature and content validation through expert surveys and patient interviews. The resulting 122 items were answered by 621 patients with musculoskeletal diseases. Exploratory factor analysis to ascertain dimensionality and Rasch analysis (partial credit model) for each of the resulting dimensions were performed. Exploratory factor analysis resulted in four dimensions, and subsequent Rasch analysis led to the following item banks: 'impaired productivity' (15 items), 'impaired cognitive performance' (18), 'impaired coping with stress' (13) and 'impaired physical performance' (low physical workload 20 items, high physical workload 10 items). The item banks exhibited person separation indices (reliability) between 0.89 and 0.96. The assessment of work performance adds the activities component to the more commonly employed participation component of the ICF-model. The four item banks can be adapted to specific jobs where necessary without losing comparability of person measures, as the item banks are based on Rasch analysis.

  8. Overview of classical test theory and item response theory for the quantitative assessment of items in developing patient-reported outcomes measures.

    Science.gov (United States)

    Cappelleri, Joseph C; Jason Lundy, J; Hays, Ron D

    2014-05-01

    The US Food and Drug Administration's guidance for industry document on patient-reported outcomes (PRO) defines content validity as "the extent to which the instrument measures the concept of interest" (FDA, 2009, p. 12). According to Strauss and Smith (2009), construct validity "is now generally viewed as a unifying form of validity for psychological measurements, subsuming both content and criterion validity" (p. 7). Hence, both qualitative and quantitative information are essential in evaluating the validity of measures. We review classical test theory and item response theory (IRT) approaches to evaluating PRO measures, including frequency of responses to each category of the items in a multi-item scale, the distribution of scale scores, floor and ceiling effects, the relationship between item response options and the total score, and the extent to which hypothesized "difficulty" (severity) order of items is represented by observed responses. If a researcher has few qualitative data and wants to get preliminary information about the content validity of the instrument, then descriptive assessments using classical test theory should be the first step. As the sample size grows during subsequent stages of instrument development, confidence in the numerical estimates from Rasch and other IRT models (as well as those of classical test theory) would also grow. Classical test theory and IRT can be useful in providing a quantitative assessment of items and scales during the content-validity phase of PRO-measure development. Depending on the particular type of measure and the specific circumstances, the classical test theory and/or the IRT should be considered to help maximize the content validity of PRO measures. Copyright © 2014 Elsevier HS Journals, Inc. All rights reserved.
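
    Two of the classical test theory descriptives listed in this abstract, response-category frequencies and floor and ceiling effects, can be sketched in a few lines. The function names and the toy scores are illustrative assumptions, not material from the reviewed paper.

```python
from collections import Counter


def category_frequencies(responses):
    """Proportion of respondents choosing each response category of one item."""
    counts = Counter(responses)
    n = len(responses)
    return {category: counts[category] / n for category in sorted(counts)}


def floor_ceiling(scale_scores, min_score, max_score):
    """Share of respondents at the lowest and highest possible scale score."""
    n = len(scale_scores)
    floor = sum(1 for s in scale_scores if s == min_score) / n
    ceiling = sum(1 for s in scale_scores if s == max_score) / n
    return floor, ceiling


# Hypothetical item responses (0-3) and scale scores (0-10), illustrative only.
item_responses = [0, 1, 1, 2, 3, 3, 3, 2]
scale_scores = [0, 0, 5, 10, 7, 3, 10, 10]
print(category_frequencies(item_responses))
print(floor_ceiling(scale_scores, min_score=0, max_score=10))
```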

  9. Identifying the most efficient items from the Mini-Mental State Examination for cognitive function assessment in older Taiwanese patients.

    Science.gov (United States)

    Lou, Meei-Fang; Dai, Yu-Tzu; Huang, Guey-Shiun; Yu, Po-Jui

    2007-03-01

    The purpose of the study was to identify the most efficient items from the Mini-Mental State Examination for assessment of cognitive function. The Mini-Mental State Examination is the most frequently used cognitive screening instrument. However, the Mini-Mental State Examination has been criticized for insensitivity to mild cognitive dysfunction, limited memory assessment and variability in level of difficulty of the individual items. This study used secondary data analysis. Item response theory two-parameter model was used to analyse the data from the admission assessment of mental status by the Mini-Mental State Examination for 801 patients. By using item response analysis, 16 items were selected from the original 30-item Mini-Mental State Examination. The 16 items included mainly the measures of orientation, recall and attention and calculation. The internal consistency of the 16-item Mini-Mental State Examination was 0.84. The proposed new cut-off point for the 16-item Mini-Mental State Examination was 11. The correct classification rate was 0.94, the sensitivity was 100% and the specificity was 97.4%, when compared with the original 30-item Mini-Mental State Examination from the cut-off point of 24. This new cut-off point was determined for the purpose of over-identifying patients at risk so as to ensure early detection of and prevention from the onset of cognitive disturbance. Only a few items are needed to describe the subject's cognitive status. Using item response theory analysis, the study found that the Mini-Mental State Examination could be simplified. Deleting the items with less variation makes this assessment tool not only shorter, easier to administer and less strenuous for respondents, but also enables one to maintain validity as a cognitive function test for clinical setting.
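
    As a rough illustration of the two-parameter IRT model named in this abstract, the sketch below computes the probability of a correct response and the item information at a given ability level, then ranks items by information. The item names and parameter values are hypothetical; the study's estimated parameters are not reported here.

```python
import math


def p_correct(theta, a, b):
    """2PL model: probability of a correct response at ability theta,
    with discrimination a and difficulty b."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def item_information(theta, a, b):
    """Fisher information of a 2PL item at ability theta: a^2 * P * (1 - P)."""
    p = p_correct(theta, a, b)
    return a * a * p * (1.0 - p)


def rank_items(items, theta):
    """Item names sorted from most to least informative at theta."""
    return sorted(items, key=lambda name: item_information(theta, *items[name]),
                  reverse=True)


# Hypothetical (a, b) parameters, for illustration only.
items = {
    "orientation_item": (1.8, -1.0),
    "recall_item": (1.2, 0.0),
    "low_discrimination_item": (0.4, -2.5),
}
print(rank_items(items, theta=-1.0))
```

    Under this model, items with low discrimination contribute little information at any ability level, which is one way a shortened form could be motivated.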

  10. A mathematical model for order splitting in a multiple supplier single-item inventory system

    DEFF Research Database (Denmark)

    Abginehchi, Soheil; Farahani, Reza Zanjirani; Rezapour, Shabnam

    2013-01-01

    systems. The item acquisition lead times of suppliers are random variables. Backorder is allowed and shortage cost is charged based on not only per unit in shortage but also per time unit. Continuous review (s,Q) policy has been assumed. When the inventory level depletes to a reorder level, the total...... order is split among n suppliers. Since the suppliers have different characteristics, the quantity ordered to different suppliers may be different. The problem is to determine the reorder level and quantity ordered to each supplier so that the expected total cost per time unit, including ordering cost......, procurement cost, inventory holding cost, and shortage cost, is minimized. We also conduct extensive numerical experiments to show the advantages of our model compared with the models in the literature. According to our extensive experiments, the model developed in this paper is the best model...

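  11. Inventory Levels of Komatsu-Brand Forklift Spare Parts Using a Single-Item Inventory Model Approach [TINGKAT PERSEDIAAN SPARE PART FORKLIFT MEREK KOMATSU DENGAN PENDEKATAN MODEL PERSEDIAAN SINGLE ITEM]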

    Directory of Open Access Journals (Sweden)

    Wahid Ahmad Jauhari

    2006-04-01

    The control and maintenance of inventories is a problem common to all enterprises in any sector of a given economy. Two fundamental questions that must be answered in controlling the inventory are when to replenish the inventory and how much to order for replenishment. The (Q,r) inventory models attempt to answer these two questions under a variety of circumstances. Studies have shown (1) that a company that ignores lead-time demand variability may suffer great financial damage, (2) that the gamma distribution most often provides the best fit to lead-time demand for a variety of inventory items, and (3) that a fixed lead-time demand assumption or a normal approximation to it will often yield significant errors (Namit and Chen, 1998). This research implemented an efficient and accurate algorithm for solving the (Q,r) inventory model with gamma lead-time demand.
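
    The two questions of a (Q,r) policy, how much to order and when, can be illustrated with a simplified sketch. It is not the algorithm of this paper: here Q is taken from the textbook economic order quantity formula and r is an empirical quantile of simulated gamma lead-time demand for a target cycle service level. All parameter values and function names are assumptions chosen for illustration.

```python
import math
import random


def eoq(annual_demand, order_cost, holding_cost):
    """Economic order quantity: sqrt(2 * D * K / h). A simplification for Q."""
    return math.sqrt(2.0 * annual_demand * order_cost / holding_cost)


def reorder_point_gamma(shape, scale, service_level, n_draws=20000, seed=0):
    """Reorder point r such that simulated gamma lead-time demand stays at or
    below r in roughly `service_level` of the draws (Monte Carlo quantile)."""
    rng = random.Random(seed)
    draws = sorted(rng.gammavariate(shape, scale) for _ in range(n_draws))
    index = min(n_draws - 1, max(0, math.ceil(service_level * n_draws) - 1))
    return draws[index]


# Hypothetical inputs: mean lead-time demand = shape * scale = 100 units.
q = eoq(annual_demand=1000, order_cost=50, holding_cost=5)
r = reorder_point_gamma(shape=4.0, scale=25.0, service_level=0.95)
print(f"order quantity Q ~ {q:.1f}, reorder point r ~ {r:.1f}")
```

    Because the gamma distribution is right-skewed, a reorder point set this way generally differs from one based on a normal approximation with the same mean and variance, which is the kind of error the abstract warns about.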

  12. Enactment versus observation: item-specific and relational processing in goal-directed action sequences (and lists of single actions).

    Directory of Open Access Journals (Sweden)

    Janette Schult

    What are the memory-related consequences of learning actions (such as "apply the patch") by enactment during study, as compared to action observation? Theories converge in postulating that enactment encoding increases item-specific processing, but not the processing of relational information. Typically, in the laboratory enactment encoding is studied for lists of unrelated single actions in which one action execution has no overarching purpose or relation with other actions. In contrast, real-life actions are usually carried out with the intention to achieve such a purpose. When actions are embedded in action sequences, relational information provides efficient retrieval cues. We contrasted memory for single actions with memory for action sequences in three experiments. We found more reliance on relational processing for action sequences than single actions. To what degree can this relational information be used after enactment versus after the observation of an actor? We found indicators of superior relational processing after observation than enactment in ordered pair recall (Experiment 1A) and in emerging subjective organization of repeated recall protocols (recall runs 2-3, Experiment 2). An indicator of superior item-specific processing after enactment compared to observation was recognition (Experiment 1B, Experiment 2). Similar net recall suggests that observation can be as good a learning strategy as enactment. We discuss possible reasons why these findings only partly converge with previous research and theorizing.

  13. Face Validity of the Single Work Ability Item: Comparison with Objectively Measured Heart Rate Reserve over Several Days

    Science.gov (United States)

    Gupta, Nidhi; Jensen, Bjørn Søvsø; Søgaard, Karen; Carneiro, Isabella Gomes; Christiansen, Caroline Stordal; Hanisch, Christiana; Holtermann, Andreas

    2014-01-01

    Purpose: The purpose of this study was to investigate the face validity of the self-reported single item work ability with objectively measured heart rate reserve (%HRR) among blue-collar workers. Methods: We utilized data from 127 blue-collar workers (Female = 53; Male = 74) aged 18–65 years from the cross-sectional “New method for Objective Measurements of physical Activity in Daily living (NOMAD)” study. The workers reported their single item work ability and completed an aerobic capacity cycling test and objective measurements of heart rate reserve monitored with Actiheart for 3–4 days with a total of 5,810 h, including 2,640 working hours. Results: A significant moderate correlation between work ability and %HRR was observed among males (R = −0.33, P = 0.005), but not among females (R = 0.11, P = 0.431). In a gender-stratified multi-adjusted logistic regression analysis, males with high %HRR were more likely to report a reduced work ability compared to males with low %HRR [OR = 4.75, 95% confidence interval (95% CI) = 1.31 to 17.25]. However, this association was not found among females (OR = 0.26, 95% CI 0.03 to 2.16), and a significant interaction between work ability, %HRR and gender was observed (P = 0.03). Conclusions: The observed association between work ability and objectively measured %HRR over several days among male blue-collar workers supports the face validity of the single work ability item. It is a useful and valid measure of the relation between physical work demands and resources among male blue-collar workers. The contrasting association among females needs to be further investigated. PMID:24840350

  14. Face Validity of the Single Work Ability Item: Comparison with Objectively Measured Heart Rate Reserve over Several Days

    Directory of Open Access Journals (Sweden)

    Nidhi Gupta

    2014-05-01

    Purpose: The purpose of this study was to investigate the face validity of the self-reported single item work ability with objectively measured heart rate reserve (%HRR) among blue-collar workers. Methods: We utilized data from 127 blue-collar workers (Female = 53; Male = 74) aged 18–65 years from the cross-sectional “New method for Objective Measurements of physical Activity in Daily living (NOMAD)” study. The workers reported their single item work ability and completed an aerobic capacity cycling test and objective measurements of heart rate reserve monitored with Actiheart for 3–4 days with a total of 5,810 h, including 2,640 working hours. Results: A significant moderate correlation between work ability and %HRR was observed among males (R = −0.33, P = 0.005), but not among females (R = 0.11, P = 0.431). In a gender-stratified multi-adjusted logistic regression analysis, males with high %HRR were more likely to report a reduced work ability compared to males with low %HRR [OR = 4.75, 95% confidence interval (95% CI) = 1.31 to 17.25]. However, this association was not found among females (OR = 0.26, 95% CI 0.03 to 2.16), and a significant interaction between work ability, %HRR and gender was observed (P = 0.03). Conclusions: The observed association between work ability and objectively measured %HRR over several days among male blue-collar workers supports the face validity of the single work ability item. It is a useful and valid measure of the relation between physical work demands and resources among male blue-collar workers. The contrasting association among females needs to be further investigated.

  15. Gender differences in National Assessment of Educational Progress science items: What does "I don't know" really mean?

    Science.gov (United States)

    Linn, Marcia C.; de Benedictis, Tina; Delucchi, Kevin; Harris, Abigail; Stage, Elizabeth

    The National Assessment of Educational Progress Science Assessment has consistently revealed small gender differences on science content items but not on science inquiry items. This assessment differs from others in that respondents can choose I don't know rather than guessing. This paper examines explanations for the gender differences including (a) differential prior instruction, (b) differential response to uncertainty and use of the I don't know response, (c) differential response to figurally presented items, and (d) different attitudes towards science. Of these possible explanations, the first two received support. Females are more likely to use the I don't know response, especially for items with physical science content or masculine themes such as football. To ameliorate this situation we need more effective science instruction and more gender-neutral assessment items.

  16. Development of a questionnaire to assess patient satisfaction with allergen-specific immunotherapy in adults: item generation, item reduction, and preliminary validation

    Directory of Open Access Journals (Sweden)

    Justícia JL

    2011-05-01

    Jose Luis Justícia (1), Eva Baró (2), Victoria Cardona (3), Pedro Guardia (4), Pedro Ojeda (5), José Maria Olaguíbel (6), José Maria Vega (7), Carmen Vidal (8). (1) Medical Department, Stallergenes Ibérica, Barcelona, Spain; (2) Health Outcomes Research Department, 3D Health Research, Barcelona, Spain; (3) Hospital Vall d'Hebron, Barcelona, Spain; (4) Hospital Virgen Macarena, Sevilla, Spain; (5) Clínica de Asma y Alergia Dres. Ojeda, Madrid, Spain; (6) Complejo Hospitalario de Navarra, Pamplona, Spain; (7) Hospital Regional Universitario Carlos Haya Málaga, Spain; (8) Complejo Hospitalario Universitario de Santiago, Santiago de Compostela, Spain. Background: Allergen-specific immunotherapy (SIT) is a treatment capable of modifying the natural course of allergy, so ensuring good adherence to SIT is fundamental. Until now, no instrument has been specifically developed to measure patient satisfaction with SIT, although its assessment could help to better understand and improve treatment adherence and effectiveness. The aim of this study was to develop an instrument to measure adult patient satisfaction with SIT. Methods: Items were generated from a literature review, focus groups with allergic adult patients undergoing SIT, and a meeting with experts. Potential items were administered to allergic patients undergoing SIT in an observational, cross-sectional, multicenter study. Item reduction was based on quantitative and qualitative criteria. A preliminary assessment of feasibility, reliability, and validity of the retained items was performed. Results: An initial pool of 70 items was administered to 257 patients undergoing SIT. Fifty-four items were eliminated, resulting in a provisional instrument with 16 items. Factor analysis yielded four factors that were identified as perceived efficacy, activities and environment, cost-benefit balance, and overall satisfaction, explaining 74.8% of variance. Ceiling and floor effects were negligible for overall score. Overall score was

  17. Randomization and Data-Analysis Items in Quality Standards for Single-Case Experimental Studies

    Science.gov (United States)

    Heyvaert, Mieke; Wendt, Oliver; Van den Noortgate, Wim; Onghena, Patrick

    2015-01-01

    Reporting standards and critical appraisal tools serve as beacons for researchers, reviewers, and research consumers. Parallel to existing guidelines for researchers to report and evaluate group-comparison studies, single-case experimental (SCE) researchers are in need of guidelines for reporting and evaluating SCE studies. A systematic search was…

  18. 48 CFR 245.7101-3 - DD Form 1348-1, DoD Single Line Item Release/Receipt Document.

    Science.gov (United States)

    2010-10-01

    ... 48 Federal Acquisition Regulations System 3 2010-10-01 2010-10-01 false DD Form 1348-1, DoD Single Line Item Release/Receipt Document. 245.7101-3 Section 245.7101-3 Federal Acquisition Regulations... PROPERTY Plant Clearance Forms 245.7101-3 DD Form 1348-1, DoD Single Line Item Release/Receipt Document...

  19. Assessment of the Assessment Tool: Analysis of Items in a Non-MCQ Mathematics Exam

    Science.gov (United States)

    Khoshaim, Heba Bakr; Rashid, Saima

    2016-01-01

    Assessment is one of the vital steps in the teaching and learning process. The reported action research examines the effectiveness of an assessment process and inspects the validity of exam questions used for the assessment purpose. The instructors of a college-level mathematics course studied questions used in the final exams during the academic…

  20. Sensitivity and specificity of the 3-item memory test in the assessment of post traumatic amnesia.

    Science.gov (United States)

    Andriessen, Teuntje M J C; de Jong, Ben; Jacobs, Bram; van der Werf, Sieberen P; Vos, Pieter E

    2009-04-01

    To investigate how the type of stimulus (pictures or words) and the method of reproduction (free recall or recognition after a short or a long delay) affect the sensitivity and specificity of a 3-item memory test in the assessment of post traumatic amnesia (PTA). Daily testing was performed in 64 consecutively admitted traumatic brain injured patients, 22 orthopedically injured patients and 26 healthy controls until criteria for resolution of PTA were reached. Subjects were randomly assigned to a test with visual or verbal stimuli. Short delay reproduction was tested after an interval of 3-5 minutes, long delay reproduction was tested after 24 hours. Sensitivity and specificity were calculated over the first 4 test days. The 3-word test showed higher sensitivity than the 3-picture test, while specificity of the two tests was equally high. Free recall was a more effortful task than recognition for both patients and controls. In patients, a longer delay between registration and recall resulted in a significant decrease in the number of items reproduced. Presence of PTA is best assessed with a memory test that incorporates the free recall of words after a long delay.
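
    Sensitivity and specificity, the two accuracy measures reported in this abstract, can be computed from paired test results and reference classifications as in the following minimal sketch. The function name and the toy data are assumptions for illustration and do not reproduce the study's data.

```python
def sensitivity_specificity(test_positive, condition_present):
    """Return (sensitivity, specificity) from two equal-length boolean lists.

    test_positive[i]     : True if the test flags subject i (e.g. as in PTA)
    condition_present[i] : True if the reference standard says subject i has it
    """
    pairs = list(zip(test_positive, condition_present))
    tp = sum(1 for t, c in pairs if t and c)          # true positives
    fn = sum(1 for t, c in pairs if not t and c)      # false negatives
    tn = sum(1 for t, c in pairs if not t and not c)  # true negatives
    fp = sum(1 for t, c in pairs if t and not c)      # false positives
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one subject with and one without the condition")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical results for 7 subjects (illustrative only).
flagged = [True, True, False, True, False, False, False]
has_condition = [True, True, True, False, False, False, False]
print(sensitivity_specificity(flagged, has_condition))
```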

  1. Alzheimer's Disease Assessment: A Review and Illustrations Focusing on Item Response Theory Techniques.

    Science.gov (United States)

    Balsis, Steve; Choudhury, Tabina K; Geraci, Lisa; Benge, Jared F; Patrick, Christopher J

    2018-04-01

    Alzheimer's disease (AD) affects neurological, cognitive, and behavioral processes. Thus, to accurately assess this disease, researchers and clinicians need to combine and incorporate data across these domains. This presents not only distinct methodological and statistical challenges but also unique opportunities for the development and advancement of psychometric techniques. In this article, we describe relatively recent research using item response theory (IRT) that has been used to make progress in assessing the disease across its various symptomatic and pathological manifestations. We focus on applications of IRT to improve scoring, test development (including cross-validation and adaptation), and linking and calibration. We conclude by describing potential future multidimensional applications of IRT techniques that may improve the precision with which AD is measured.

  2. War Reserve Analysis and Secondary Item Procureability Assessment of the AMCOM Supported Weapon Systems

    National Research Council Canada - National Science Library

    Maddux, Gary

    2000-01-01

    .... IOD evaluates the impacts of nonavailability of secondary items on the life cycle supportability of AMCOM weapon systems and evaluates the producibility of secondary items for war reserve requirements...

  3. Use of differential item functioning analysis to assess the equivalence of translations of a questionnaire

    NARCIS (Netherlands)

    Petersen, Morten Aa; Groenvold, Mogens; Bjorner, Jakob B.; Aaronson, Neil; Conroy, Thierry; Cull, Ann; Fayers, Peter; Hjermstad, Marianne; Sprangers, Mirjam; Sullivan, Marianne

    2003-01-01

    In cross-national comparisons based on questionnaires, accurate translations are necessary to obtain valid results. Differential item functioning (DIF) analysis can be used to test whether translations of items in multi-item scales are equivalent to the original. In data from 10,815 respondents

  4. Communicating Quantitative Literacy: An Examination of Open-Ended Assessment Items in TIMSS, NALS, IALS, and PISA

    Directory of Open Access Journals (Sweden)

    Karl W. Kosko

    2011-07-01

    Quantitative Literacy (QL) has been described as the skill set an individual uses when interacting with the world in a quantitative manner. A necessary component of this interaction is communication. To this end, assessments of QL have included open-ended items as a means of including communicative aspects of QL. The present study sought to examine whether such open-ended items typically measured aspects of quantitative communication, as compared to mathematical communication, or mathematical skills. We focused on publicly released items and rubrics from four of the most widely referenced assessments: the Third International Mathematics and Science Study (TIMSS-95); the National Adult Literacy Survey (NALS; now the National Assessment of Adult Literacy, NAAL) in 1985 and 1992; the International Adult Literacy Skills (IALS) beginning in 1994; and the Program for International Student Assessment (PISA) beginning in 2000. We found that open-ended item rubrics in these QL assessments showed a strong tendency to assess answer-only responses. Therefore, while some open-ended items may have required certain levels of quantitative reasoning to find a solution, it is the solution rather than the reasoning that was often assessed.

  5. Item and test analysis to identify quality multiple choice questions (MCQs) from an assessment of medical students of Ahmedabad, Gujarat

    Directory of Open Access Journals (Sweden)

    Sanju Gajjar

    2014-01-01

    Background: Multiple choice questions (MCQs) are frequently used to assess students in different educational streams because of their objectivity and wide coverage in less time. However, the MCQs used must be of good quality, which depends upon their difficulty index (DIF I), discrimination index (DI) and distractor efficiency (DE). Objective: To evaluate MCQs or items and develop a pool of valid items by assessing DIF I, DI and DE, and to revise, store or discard items based on the results obtained. Settings: The study was conducted in a medical school of Ahmedabad. Materials and Methods: An internal examination in Community Medicine was conducted after 40 hours of teaching during 1st MBBS and was attended by 148 out of 150 students. A total of 50 MCQs or items and 150 distractors were analyzed. Statistical Analysis: Data were entered and analyzed in MS Excel 2007; simple proportions, means, standard deviations and coefficients of variation were calculated, and the unpaired t test was applied. Results: Out of 50 items, 24 had "good to excellent" DIF I (31 - 60%) and 15 had "good to excellent" DI (> 0.25). Mean DE was 88.6%, considered ideal/acceptable, and non-functional distractors (NFDs) were only 11.4%. Mean DI was 0.14. Poor DI (< 0.15), with negative DI in 10 items, indicates poor preparedness of students and some issues with the framing of at least some of the MCQs. An increased proportion of NFDs (incorrect alternatives selected by < 5% of students) in an item decreases DE and makes the item easier. There were 15 items with 17 NFDs, while the remaining items did not have any NFD and had a mean DE of 100%. Conclusion: The study emphasizes the selection of quality MCQs that truly assess knowledge and are able to differentiate students of different abilities correctly.
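
    The three item statistics named in this abstract can be sketched as follows. The formulas for DIF I and DI are a commonly used upper/lower-group convention and are an assumption here, since the abstract does not state how the indices were computed; the 5% threshold for a non-functional distractor is taken from the abstract. All counts are hypothetical.

```python
def difficulty_index(high_correct, low_correct, n_both_groups):
    """DIF I (%) = (H + L) / N * 100, where H and L are the numbers answering
    correctly in the high and low scoring groups and N is their combined size."""
    return 100.0 * (high_correct + low_correct) / n_both_groups


def discrimination_index(high_correct, low_correct, n_both_groups):
    """DI = 2 * (H - L) / N; negative when the low group outperforms the high group."""
    return 2.0 * (high_correct - low_correct) / n_both_groups


def distractor_efficiency(distractor_counts, n_students, nfd_threshold=0.05):
    """Return (number of non-functional distractors, DE in %).

    A distractor is non-functional if chosen by fewer than nfd_threshold
    of all students; DE is the share of distractors that are functional."""
    nfd = sum(1 for c in distractor_counts if c / n_students < nfd_threshold)
    functional = len(distractor_counts) - nfd
    return nfd, 100.0 * functional / len(distractor_counts)


# Hypothetical item: 25 students in each of the high and low groups.
print(difficulty_index(20, 10, 50))      # percentage answering correctly
print(discrimination_index(20, 10, 50))  # higher means better discrimination
print(distractor_efficiency([30, 5, 2], n_students=148))
```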

  6. Assessment of the psychometrics of a PROMIS item bank: self-efficacy for managing daily activities.

    Science.gov (United States)

    Hong, Ickpyo; Velozo, Craig A; Li, Chih-Ying; Romero, Sergio; Gruber-Baldini, Ann L; Shulman, Lisa M

    2016-09-01

    The aim of this study is to investigate the psychometrics of the Patient-Reported Outcomes Measurement Information System self-efficacy for managing daily activities item bank. The item pool was field tested on a sample of 1087 participants via internet (n = 250) and in-clinic (n = 837) surveys. All participants reported having at least one chronic health condition. The 35-item pool was investigated for dimensionality (confirmatory factor analyses, CFA and exploratory factor analysis, EFA), item-total correlations, local independence, precision, and differential item functioning (DIF) across gender, race, ethnicity, age groups, data collection modes, and neurological chronic conditions (McFadden pseudo R² less than 10%). The item pool met two of the four CFA fit criteria (CFI = 0.952 and SRMR = 0.07). EFA analysis found a dominant first factor (eigenvalue = 24.34) and the ratio of first to second eigenvalue was 12.4. The item pool demonstrated good item-total correlations (0.59-0.85) and acceptable internal consistency (Cronbach's alpha = 0.97). The item pool maintained its precision (reliability over 0.90) across a wide range of theta (3.70), and there was no significant DIF. The findings indicated the item pool has sound psychometric properties and the test items are eligible for development of computerized adaptive testing and short forms.

  7. Development of a self-report physical function instrument for disability assessment: item pool construction and factor analysis.

    Science.gov (United States)

    McDonough, Christine M; Jette, Alan M; Ni, Pengsheng; Bogusz, Kara; Marfeo, Elizabeth E; Brandt, Diane E; Chan, Leighton; Meterko, Mark; Haley, Stephen M; Rasch, Elizabeth K

    2013-09-01

    To build a comprehensive item pool representing work-relevant physical functioning and to test the factor structure of the item pool. These developmental steps represent initial outcomes of a broader project to develop instruments for the assessment of function within the context of Social Security Administration (SSA) disability programs. Comprehensive literature review; gap analysis; item generation with expert panel input; stakeholder interviews; cognitive interviews; cross-sectional survey administration; and exploratory and confirmatory factor analyses to assess item pool structure. In-person and semistructured interviews and Internet and telephone surveys. Sample of SSA claimants (n=1017) and a normative sample of adults from the U.S. general population (n=999). Not applicable. Model fit statistics. The final item pool consisted of 139 items. Within the claimant sample, 58.7% were white; 31.8% were black; 46.6% were women; and the mean age was 49.7 years. Initial factor analyses revealed a 4-factor solution, which included more items and allowed separate characterization of: (1) changing and maintaining body position, (2) whole body mobility, (3) upper body function, and (4) upper extremity fine motor. The final 4-factor model included 91 items. Confirmatory factor analyses for the 4-factor models for the claimant and the normative samples demonstrated very good fit. Fit statistics for claimant and normative samples, respectively, were: Comparative Fit Index=.93 and .98; Tucker-Lewis Index=.92 and .98; and root mean square error approximation=.05 and .04. The factor structure of the physical function item pool closely resembled the hypothesized content model. The 4 scales relevant to work activities offer promise for providing reliable information about claimant physical functioning relevant to work disability. Copyright © 2013 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.

  8. Assessing Impact, DIF, and DFF in Accommodated Item Scores: A Comparison of Multilevel Measurement Model Parameterizations

    Science.gov (United States)

    Beretvas, S. Natasha; Cawthon, Stephanie W.; Lockhart, L. Leland; Kaye, Alyssa D.

    2012-01-01

    This pedagogical article is intended to explain the similarities and differences between the parameterizations of two multilevel measurement model (MMM) frameworks. The conventional two-level MMM that includes item indicators and models item scores (Level 1) clustered within examinees (Level 2) and the two-level cross-classified MMM (in which item…

  9. An Application of Cognitive Diagnostic Assessment on TIMMS-2007 8th Grade Mathematics Items

    Science.gov (United States)

    Toker, Turker; Green, Kathy

    2012-01-01

    The least squares distance method (LSDM) was used in a cognitive diagnostic analysis of TIMSS (Trends in International Mathematics and Science Study) items administered to 4,498 8th-grade students from seven geographical regions of Turkey, extending analysis of attributes from content to process and skill attributes. Logit item positions were…

  10. Assessment of Differential Item Functioning in Health-Related Outcomes: A Simulation and Empirical Analysis with Hierarchical Polytomous Data

    Directory of Open Access Journals (Sweden)

    Zahra Sharafi

    2017-01-01

    Background. The purpose of this study was to evaluate the effectiveness of two methods of detecting differential item functioning (DIF) in the presence of multilevel data and polytomously scored items. The assessment of DIF with multilevel data (e.g., patients nested within hospitals, hospitals nested within districts) from large-scale assessment programs has received considerable attention, but very few studies have evaluated the effect of the hierarchical structure of data on DIF detection for polytomously scored items. Methods. Ordinal logistic regression (OLR) and hierarchical ordinal logistic regression (HOLR) were utilized to assess DIF in simulated and real multilevel polytomous data. Six factors (DIF magnitude, grouping variable, intraclass correlation coefficient, number of clusters, number of participants per cluster, and item discrimination parameter) with a fully crossed design were considered in the simulation study. Furthermore, data of the Pediatric Quality of Life Inventory™ (PedsQL™) 4.0 collected from 576 healthy school children were analyzed. Results. Overall, results indicate that both methods performed equivalently in terms of controlling Type I error and detection power rates. Conclusions. The current study showed negligible difference between OLR and HOLR in detecting DIF with polytomously scored items in a hierarchical structure. Implications and considerations while analyzing real data were also discussed.

  11. Measuring everyday functional competence using the Rasch assessment of everyday activity limitations (REAL) item bank

    NARCIS (Netherlands)

    Oude Voshaar, Martijn A.H.; Ten Klooster, Peter M.; Vonkeman, Harald E.; van de Laar, Mart A.F.J.

    2017-01-01

    Objective: Traditional patient-reported physical function instruments often poorly differentiate patients with mild-to-moderate disability. We describe the development and psychometric evaluation of a generic item bank for measuring everyday activity limitations in outpatient populations. Study

  12. Assessing the Straightforwardly-Worded Brief Fear of Negative Evaluation Scale for Differential Item Functioning Across Gender and Ethnicity.

    Science.gov (United States)

    Harpole, Jared K; Levinson, Cheri A; Woods, Carol M; Rodebaugh, Thomas L; Weeks, Justin W; Brown, Patrick J; Heimberg, Richard G; Menatti, Andrew R; Blanco, Carlos; Schneier, Franklin; Liebowitz, Michael

    2015-06-01

    The Brief Fear of Negative Evaluation Scale (BFNE; Leary, Personality and Social Psychology Bulletin, 9, 371-375, 1983) assesses fear and worry about receiving negative evaluation from others. Rodebaugh et al. (Psychological Assessment, 16, 169-181, 2004) found that the BFNE is composed of a reverse-worded factor (BFNE-R) and a straightforwardly-worded factor (BFNE-S). Further, they found the BFNE-S to have better psychometric properties and provide more information than the BFNE-R. Currently there is a lack of research regarding the measurement invariance of the BFNE-S across gender and ethnicity with respect to item thresholds. The present study uses item response theory (IRT) to test the BFNE-S for differential item functioning (DIF) related to gender and ethnicity (White, Asian, and Black). Six data sets consisting of clinical, community, and undergraduate participants were utilized (N = 2,109). The factor structure of the BFNE-S was confirmed using categorical confirmatory factor analysis, IRT model assumptions were tested, and the BFNE-S was evaluated for DIF. Item nine demonstrated significant non-uniform DIF between White and Black participants. No other items showed significant uniform or non-uniform DIF across gender or ethnicity. Results suggest the BFNE-S can be used reliably with men and women and Asian and White participants. More research is needed to understand the implications of using the BFNE-S with Black participants.

  13. Concurrent validity and sensitivity to change of Direct Behavior Rating Single-Item Scales (DBR-SIS) within an elementary sample.

    Science.gov (United States)

    Smith, Rhonda L; Eklund, Katie; Kilgus, Stephen P

    2018-03-01

    The purpose of this study was to evaluate the concurrent validity, sensitivity to change, and teacher acceptability of Direct Behavior Rating single-item scales (DBR-SIS), a brief progress monitoring measure designed to assess student behavioral change in response to intervention. Twenty-four elementary teacher-student dyads implemented a daily report card intervention to promote positive student behavior during prespecified classroom activities. During both baseline and intervention, teachers completed DBR-SIS ratings of 2 target behaviors (i.e., Academic Engagement, Disruptive Behavior) whereas research assistants collected systematic direct observation (SDO) data in relation to the same behaviors. Five change metrics (i.e., absolute change, percent of change from baseline, improvement rate difference, Tau-U, and standardized mean difference; Gresham, 2005) were calculated for both DBR-SIS and SDO data, yielding estimates of the change in student behavior in response to intervention. Mean DBR-SIS scores were predominantly moderately to highly correlated with SDO data within both baseline and intervention, demonstrating evidence of the former's concurrent validity. DBR-SIS change metrics were also significantly correlated with SDO change metrics for both Disruptive Behavior and Academic Engagement, yielding evidence of the former's sensitivity to change. In addition, teacher Usage Rating Profile-Assessment (URP-A) ratings indicated they found DBR-SIS to be acceptable and usable. Implications for practice, study limitations, and areas of future research are discussed. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  14. The validity of the Satisfaction with Life Scale in adolescents and a comparison with single-item life satisfaction measures: a preliminary study.

    Science.gov (United States)

    Jovanović, Veljko

    2016-12-01

    The validity of the life satisfaction measures commonly used among adults has been rarely examined in adolescent samples. The present research had two main goals: (1) to evaluate the structural validity of the Satisfaction with Life Scale (SWLS) among adolescents and to test measurement invariance across gender; (2) to compare the criterion and convergent validity of the SWLS and single-item life satisfaction measures among adolescents. Three samples of Serbian adolescents were recruited for the present research. Study 1 (N = 481, M age  = 17.01 years) examined the structure of the SWLS via confirmatory factor analysis (CFA) and evaluated measurement invariance of the SWLS across gender by a multi-group CFA. Study 2 (N = 283, M age  = 17.34 years) and Study 3 (N = 220, M age  = 16.73 years) compared the convergent validity of the SWLS and single-item life satisfaction measures. The results of Study 1 supported the original one-factor model of the SWLS among adolescents and provided evidence for strong measurement invariance of the SWLS across gender. The findings of Study 2 and Study 3 showed that the SWLS and single-item measures were equally valid and strongly associated (r = .734 in Study 2 and r = .668 in Study 3). No substantial differences in correlations with school success and well-being indicators were found between the SWLS and single-item measures. Our findings support the use of the SWLS among adolescents and indicate that single-item life satisfaction measures perform as well as the SWLS in adolescent samples.

  15. Assessment of chromium(VI) release from 848 jewellery items by use of a diphenylcarbazide spot test

    DEFF Research Database (Denmark)

    Bregnbak, David; Johansen, Jeanne D.; Hamann, Dathan

    2016-01-01

    We recently evaluated and validated a diphenylcarbazide(DPC)-based screening spot test that can detect the release of chromium(VI) ions (≥0.5 ppm) from various metallic items and leather goods (1). We then screened a selection of metal screws, leather shoes, and gloves, as well as 50 earrings......, and identified chromium(VI) release from one earring. In the present study, we used the DPC spot test to assess chromium(VI) release in a much larger sample of jewellery items (n=848), 160 (19%) of which had previously been shown to contain chromium when analysed with X-ray fluorescence spectroscopy (2)....

  16. 'Do you think you suffer from depression?' Reevaluating the use of a single item question for the screening of depression in older primary care patients

    DEFF Research Database (Denmark)

    Ayalon, Liat; Goldfracht, Margalit; Bech, Per

    2010-01-01

    evaluated against a depression diagnosis made by the Structured Clinical Interview for DSM-IV. RESULTS: Overall, 3.9% of the sample was diagnosed with depression. The most notable finding was that the single-item question, 'do you think you suffer from depression?' had as good or better sensitivity (83......%) than all other screens. Nonetheless, its specificity of 83% suggested that it has to be followed up by a thorough diagnostic interview. Additional sensitivity analyses concerning the use of a single depression item taken directly from the depression screening measures supported this finding. CONCLUSIONS......: An easy way to detect depression in older primary care patients would be asking the single question, 'do you think you suffer from depression?'...
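
    The recommendation to follow a positive screen with a diagnostic interview can be illustrated with the usual definitions of sensitivity, specificity and positive predictive value. The sketch below is not taken from the study: the function name `screening_accuracy` is hypothetical, and the counts in the usage note are invented purely so that the rates land near the reported 83%.

```python
def screening_accuracy(tp, fn, tn, fp):
    """Return (sensitivity, specificity, positive predictive value).

    tp/fn: diagnosed cases the screen caught / missed.
    tn/fp: non-cases the screen cleared / wrongly flagged.
    """
    sensitivity = tp / (tp + fn)  # share of true cases flagged
    specificity = tn / (tn + fp)  # share of non-cases cleared
    ppv = tp / (tp + fp)          # share of flagged patients who are true cases
    return sensitivity, specificity, ppv
```

    With hypothetical counts such as `screening_accuracy(5, 1, 122, 25)`, sensitivity and specificity both come out near 0.83, yet only about one flagged patient in six is a true case. At low prevalence even a fairly specific screen produces mostly false positives, which is why a confirmatory interview is needed.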

  17. 'Do you think you suffer from depression?' Reevaluating the use of a single item question for the screening of depression in older primary care patients

    DEFF Research Database (Denmark)

    Ayalon, Liat; Goldfracht, Margalit; Bech, Per

    2010-01-01

    OBJECTIVES: The majority of older adults seek depression treatment in primary care. Despite impressive efforts to integrate depression treatment into primary care, depression often remains undetected. The overall goal of the present study was to compare a single item screening for depression...... to existing depression screening tools. METHODS: A cross sectional sample of 153 older primary care patients. Participants completed several depression-screening measures (e.g. a single depression screen, Patient Health Questionnaire-9, Major Depression Inventory, Visual Analogue Scale). Measures were......: An easy way to detect depression in older primary care patients would be asking the single question, 'do you think you suffer from depression?'...

  18. Quantitative Literacy on the Web of Science, 2 – Mining the Health Numeracy Literature for Assessment Items

    Directory of Open Access Journals (Sweden)

    H.L. Vacher

    2009-01-01

    A topic search of the Web of Science (WoS) database using the term “numeracy” produced a bibliography of 293 articles, reviews and editorial commentaries (Oct 2008). The citation graph of the bibliography clearly identifies five benchmark papers (1995-2001), four of which developed numeracy assessment instruments. Starting with the 80 papers that cite these benchmarks, we identified a set of 25 papers (1995-2008) in which the medical research community reports the development and/or application of health-numeracy assessments. In all we found 10 assessment instruments from which we have compiled a total of 48 assessment items. There are both general and context-specific tests, with the wide range in the latter illustrated by names such as the Diabetes Numeracy Test and the Asthma Numeracy Questionnaire. There is also a Medical Data Interpretation Test and a Subjective Numeracy Scale. Much of this literature discusses the validity and reliability of the test, and many papers include item-by-item results of the tests from when they were applied in the research reported in the papers. The research that used the tests was directed at exploring such subjects as the patients’ ability to evaluate risks and benefits in order to make informed decisions; to understand and carry out instructions in order to self-manage their medical conditions; and, in research settings, to understand what the researchers were asking in their assessments (e.g., quantified quality of life) that require comparison of numerical information. We present the collection of items as a potential resource for educators interested in numeracy assessments in context.

  19. Negative affectivity in cardiovascular disease: Evaluating Type D personality assessment using item response theory

    NARCIS (Netherlands)

    Emons, Wilco H.M.; Meijer, R.R.; Denollet, Johan

    2007-01-01

    Objective: Individuals with increased levels of both negative affectivity (NA) and social inhibition (SI)—referred to as type-D personality—are at increased risk of adverse cardiac events. We used item response theory (IRT) to evaluate NA, SI, and type-D personality as measured by the DS14. The

  20. Calibration of context-specific survey items to assess youth physical activity behaviour.

    Science.gov (United States)

    Saint-Maurice, Pedro F; Welk, Gregory J; Bartee, R Todd; Heelan, Kate

    2017-05-01

    This study tests calibration models to re-scale context-specific physical activity (PA) items to accelerometer-derived PA. A total of 195 children in 4th-12th grade wore an Actigraph monitor and completed the Physical Activity Questionnaire (PAQ) one week later. The relative time spent in moderate-to-vigorous PA (MVPA %) obtained from the Actigraph at recess, PE, lunch, after-school, evening and weekend was matched with the respective item score obtained from the PAQ. Item scores from 145 participants were calibrated against objective MVPA % using multiple linear regression with age and sex as additional predictors. Predicted minutes of MVPA for school, out-of-school and total week were tested in the remaining sample (n = 50) using equivalence testing. The results showed that PAQ β-weights ranged from 0.06 (lunch) to 4.94 (PE) MVPA % (P PAQ and accelerometer MVPA at school and out-of-school ranged from -15.6 to +3.8 min and the PAQ was within 10-15% of accelerometer measured activity. This study demonstrated that context-specific items can be calibrated to predict minutes of MVPA in groups of youth during in- and out-of-school periods.

  1. Re-Examining Test Item Issues in the TIMSS Mathematics and Science Assessments

    Science.gov (United States)

    Wang, Jianjun

    2011-01-01

    As the largest international study ever undertaken, the Trends in International Mathematics and Science Study (TIMSS) has been held as a benchmark to measure U.S. student performance in the global context. In-depth analyses of the TIMSS project are conducted in this study to examine key issues of the comparative investigation: (1) item flaws in mathematics…

  2. Assessing the specificity of posttraumatic stress disorder's dysphoric items within the dysphoria model.

    Science.gov (United States)

    Armour, Cherie; Shevlin, Mark

    2013-10-01

    The factor structure of posttraumatic stress disorder (PTSD) currently used by the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV), has received limited support. A four-factor dysphoria model is widely supported. However, the dysphoria factor of this model has been hailed as a nonspecific factor of PTSD. The present study investigated the specificity of the dysphoria factor within the dysphoria model by conducting a confirmatory factor analysis while statistically controlling for the variance attributable to depression. The sample consisted of 429 individuals who met the diagnostic criteria for PTSD in the National Comorbidity Survey. The results concluded that there was no significant attenuation in any of the PTSD items. This finding is pertinent given several proposals for the removal of dysphoric items from the diagnostic criteria set of PTSD in the upcoming DSM-5.

  3. Evolution of a Test Item

    Science.gov (United States)

    Spaan, Mary

    2007-01-01

    This article follows the development of test items (see "Language Assessment Quarterly", Volume 3 Issue 1, pp. 71-79 for the article "Test and Item Specifications Development"), beginning with a review of test and item specifications, then proceeding to writing and editing of items, pretesting and analysis, and finally selection of an item for a…

  4. Negative affectivity and social inhibition in cardiovascular disease: evaluating type-D personality and its assessment using item response theory.

    Science.gov (United States)

    Emons, Wilco H M; Meijer, Rob R; Denollet, Johan

    2007-07-01

    Individuals with increased levels of both negative affectivity (NA) and social inhibition (SI), referred to as type-D personality, are at increased risk of adverse cardiac events. We used item response theory (IRT) to evaluate NA, SI, and type-D personality as measured by the DS14. The objectives of this study were (a) to evaluate the relative contribution of individual items to the measurement precision at the cutoff to distinguish type-D from non-type-D personality and (b) to investigate the comparability of NA, SI, and type-D constructs across the general population and clinical populations. Data from representative samples including 1316 respondents from the general population, 427 respondents diagnosed with coronary heart disease, and 732 persons suffering from hypertension were analyzed using the graded response IRT model. In Study 1, the information functions obtained in the IRT analysis showed that (a) all items had highest measurement precision around the cutoff and (b) items are most informative at the higher end of the scale. In Study 2, the IRT analysis showed that measurements were fairly comparable across the general population and clinical populations. The DS14 adequately measures NA and SI, with highest reliability in the trait range around the cutoff. The DS14 is a valid instrument to assess and compare type-D personality across clinical groups.

  5. Standard Errors for National Trends in International Large-Scale Assessments in the Case of Cross-National Differential Item Functioning

    Science.gov (United States)

    Sachse, Karoline A.; Haag, Nicole

    2017-01-01

    Standard errors computed according to the operational practices of international large-scale assessment studies such as the Programme for International Student Assessment (PISA) or the Trends in International Mathematics and Science Study (TIMSS) may be biased when cross-national differential item functioning (DIF) and item parameter drift are…

  6. Single-item measures for depression and anxiety: Validation of the Screening Tool for Psychological Distress in an inpatient cardiology setting.

    Science.gov (United States)

    Young, Quincy-Robyn; Nguyen, Michelle; Roth, Susan; Broadberry, Ann; Mackay, Martha H

    2015-12-01

    Depression and anxiety are common among patients with cardiovascular disease (CVD) and confer significant cardiac risk, contributing to CVD morbidity and mortality. Unfortunately, due to the lack of screening tools that address the specific needs of hospitalized patients, few cardiac inpatient programs offer routine screening for these forms of psychological distress, despite recommendations to do so. The purpose of this study was to validate single-item measures for depression and anxiety among cardiac inpatients. Consecutive inpatients were recruited from the cardiology and cardiac surgery step-down units at a university-affiliated, quaternary-care hospital. Subjects completed a questionnaire that included: (a) demographics, (b) single-item-measures for depression and anxiety (from the Screening Tool for Psychological Distress (STOP-D)), and (c) Hospital Anxiety and Depression Scale (HADS). One hundred and five participants were recruited with a wide variety of cardiac diagnoses, having a mean age of 66 years, and 28% were women. Both STOP-D items were highly correlated with their corresponding validated measures and demonstrated robust receiver-operator characteristic curves. Severity scores on both items correlated well with established severity cut-off scores on the corresponding subscales of the HADS. The STOP-D is a self-administered, self-report measure using two independent items that provide severity scores for depression and anxiety. The tool performs very well compared with other previously validated measures. Requiring no additional scoring and being free, STOP-D offers a simple and valid method for identifying hospitalized cardiac patients who are experiencing psychological distress. This crucial first step triggers initiation of appropriate monitoring and intervention, thus reducing the likelihood of the adverse cardiac outcomes associated with psychological distress. © The European Society of Cardiology 2014.

  7. Barriers and benefits to desired behaviors for single use plastic items in northeast Ohio's Lake Erie basin.

    Science.gov (United States)

    Bartolotta, Jill F; Hardy, Scott D

    2018-02-01

    Given the growing saliency of plastic marine debris, and the impact of plastics on beaches and aquatic environments in the Laurentian Great Lakes, applied research is needed to support municipal and nongovernmental campaigns to prevent debris from reaching the water's edge. This study addresses this need by examining the barriers and benefits to positive behavior for two plastic debris items in northeast Ohio's Lake Erie basin: plastic bags and plastic water bottles. An online survey is employed to gather data on the use and disposal of these plastic items and to solicit recommendations on how to positively change behavior to reduce improper disposal. Results support a ban on plastic bags and plastic water bottles, with more enthusiasm for a bag ban. Financial incentives are also seen as an effective way to influence behavior change, as are location-specific solutions focused on education and outreach. Copyright © 2017 Elsevier Ltd. All rights reserved.

  8. The influence of item order on intentional response distortion in the assessment of high potentials: assessing pilot applicants.

    Science.gov (United States)

    Khorramdel, Lale; Kubinger, Klaus D; Uitz, Alexander

    2014-04-01

    An experiment was conducted to investigate the effects of item order and questionnaire content on faking good or intentional response distortion. It was hypothesized that intentional response distortion would either increase towards the end of a long questionnaire, as learning effects might make it easier to adjust responses to a faking good schema, or decrease because applicants' will to distort responses is reduced if the questionnaire lasts long enough. Furthermore, it was hypothesized that certain types of questionnaire content are especially vulnerable to response distortion. Eighty-four pre-selected pilot applicants filled out a questionnaire consisting of 516 items including items from the NEO five factor inventory (NEO FFI), NEO personality inventory revised (NEO PI-R) and business-focused inventory of personality (BIP). The positions of the items were varied within the applicant sample to test if responses are affected by item order, and applicants' response behaviour was additionally compared to that of volunteers. Applicants reported significantly higher mean scores than volunteers, and results provide some evidence of decreased faking tendencies towards the end of the questionnaire. Furthermore, it could be demonstrated that lower variances or standard deviations in combination with appropriate (often higher) mean scores can serve as an indicator for faking tendencies in group comparisons, even if effects are not significant. © 2013 International Union of Psychological Science.

  9. A Multiple-Item Scale for Assessing E-Government Service Quality

    Science.gov (United States)

    Papadomichelaki, Xenia; Mentzas, Gregoris

    A critical element in the evolution of e-governmental services is the development of sites that better serve the citizens’ needs. To deliver superior service quality, we must first understand how citizens perceive and evaluate online citizen service. This involves defining what e-government service quality is, identifying its underlying dimensions, and determining how it can be conceptualized and measured. In this article we conceptualise an e-government service quality model (e-GovQual) and then we develop, refine, validate, confirm and test a multiple-item scale for measuring e-government service quality for public administration sites where citizens seek either information or services.

  10. Assessing the discriminating power of item and test scores in the linear factor-analysis model

    Directory of Open Access Journals (Sweden)

    Pere J. Ferrando

    2012-01-01

    Rigorous, psychometric-model-based proposals for studying the imprecise concept of "discriminating power" are scarce and generally limited to nonlinear models for binary items. This article proposes a general framework for assessing the discriminating power of item and test scores that are calibrated with the one-common-factor model. The proposal is organized around three criteria: (a) type of score, (b) range of discrimination, and (c) the specific aspect that is assessed. Within the proposed framework, (a) the relations among 16 measures, 6 of which appear to be new, are discussed, and (b) the relations among them are studied. The usefulness of the proposal in psychometric applications that use the factor model is illustrated with an empirical example.

  11. Passive ultra high frequency radio frequency identification systems for single-item identification in food supply chains

    Directory of Open Access Journals (Sweden)

    Paolo Barge

    2017-02-01

    In the food industry, the composition, size, and shape of items are much less regular than in other commodity sectors. In addition, a wide variety of packaging, composed of different materials, is employed. As the material, size and shape of the items to which the tag should be attached strongly influence the minimum power required for the tag to function, performance improvements can be achieved only by selecting suitable radio frequency (RF) identifiers for the specific combination of food product and packaging. When dealing with logistics units, the dynamic reading of a vast number of tags could give rise to simultaneous broadcasting of signals (tag-to-tag collisions) that could affect reading rates and the overall reliability of the identification procedure. This paper reports the results of an analysis of the reading performance of ultra high frequency radio frequency identification systems for multiple static and dynamic electronic identification of packed food products in controlled conditions. Products were considered when arranged on a logistics pallet. The effects on reading rate of different factors, including the product type, the gate configuration, the field polarisation, the power output of the RF reader, the interrogation protocol configuration, the transit speed, the number of tags, and their interactions, were statistically analysed and compared.

  12. TWO-PARAMETER IRT MODEL APPLICATION TO ASSESS PROBABILISTIC CHARACTERISTICS OF PROHIBITED ITEMS DETECTION BY AVIATION SECURITY SCREENERS

    Directory of Open Access Journals (Sweden)

    Alexander K. Volkov

    2017-01-01

    The modern approaches to the aviation security screeners’ efficiency have been analyzed, and certain drawbacks have been considered. The main drawback is the complexity of implementing ICAO recommendations on taking shadow x-ray image complexity factors into account during preparation and evaluation of prohibited items detection efficiency by aviation security screeners. X-ray image based factors are the specific properties of the x-ray image that influence the ability of aviation security screeners to detect prohibited items. The most important complexity factors are: geometric characteristics of a prohibited item; view difficulty of prohibited items; superposition of prohibited items by other objects in the bag; bag content complexity; the color similarity of prohibited and usual items in the luggage. The one-dimensional two-parameter IRT model and the related criterion of aviation security screeners’ qualification have been suggested. Within the suggested model the probabilistic detection characteristics of aviation security screeners are considered as functions of such parameters as the difference between level of qualification and level of x-ray image complexity, and also between the aviation security screeners’ responsibility and structure of their professional knowledge. On the basis of the given model it is possible to consider two characteristic functions: first, the characteristic function of qualification level, which describes the multi-complexity level of x-ray image interpretation competency of the aviation security screener; second, the characteristic function of the x-ray image complexity, which describes the range of x-ray image interpretation competency of aviation security screeners having various training levels to interpret the x-ray image of a certain level of complexity. The suggested complex criterion to assess the level of the aviation security screener qualification allows to evaluate his or
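
    As background for the model named in this record, the generic two-parameter logistic (2PL) IRT curve can be sketched as below. This is the textbook form, P = 1 / (1 + exp(-a(theta - b))), and not necessarily the exact parameterization the author uses; mapping `theta` to screener qualification, `b` to x-ray image complexity and `a` to a discrimination (slope) parameter is an assumption made here for illustration, as is the function name `detection_probability`.

```python
import math

def detection_probability(theta, b, a=1.0):
    """Generic 2PL item response function.

    theta: ability of the respondent (assumed here: screener qualification).
    b:     difficulty of the item (assumed here: x-ray image complexity).
    a:     discrimination, i.e. how sharply probability rises with theta - b.
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))
```

    The probability depends only on the difference `theta - b`, which matches the abstract's description of detection characteristics as a function of the gap between qualification level and image complexity; when the two are equal the curve returns 0.5.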

  13. Development of a simple 12-item theory-based instrument to assess the impact of continuing professional development on clinical behavioral intentions.

    Directory of Open Access Journals (Sweden)

    France Légaré

    Full Text Available Decision-makers in organizations providing continuing professional development (CPD) have identified the need for routine assessment of its impact on practice. We sought to develop a theory-based instrument for evaluating the impact of CPD activities on health professionals' clinical behavioral intentions. Our multipronged study had four phases. (1) We systematically reviewed the literature for instruments that used socio-cognitive theories to assess healthcare professionals' clinically-oriented behavioral intentions and/or behaviors; we extracted items relating to the theoretical constructs of an integrated model of healthcare professionals' behaviors and removed duplicates. (2) A committee of researchers and CPD decision-makers selected a pool of items relevant to CPD. (3) An international group of experts (n = 70) reached consensus on the most relevant items using electronic Delphi surveys. (4) We created a preliminary instrument with the items found most relevant and assessed its factorial validity, internal consistency and reliability (weighted kappa) over a two-week period among 138 physicians attending a CPD activity. Out of 72 potentially relevant instruments, 47 were analyzed. Of the 1218 items extracted from these, 16% were discarded as improperly phrased and 70% discarded as duplicates. Mapping the remaining items onto the constructs of the integrated model of healthcare professionals' behaviors yielded a minimum of 18 and a maximum of 275 items per construct. The partnership committee retained 61 items covering all seven constructs. Two iterations of the Delphi process produced consensus on a provisional 40-item questionnaire. Exploratory factorial analysis following test-retest resulted in a 12-item questionnaire. Cronbach's coefficients for the constructs varied from 0.77 to 0.85. A 12-item theory-based instrument for assessing the impact of CPD activities on health professionals' clinical behavioral intentions showed adequate validity and

  14. Guideline appraisal with AGREE II: online survey of the potential influence of AGREE II items on overall assessment of guideline quality and recommendation for use.

    Science.gov (United States)

    Hoffmann-Eßer, Wiebke; Siering, Ulrich; Neugebauer, Edmund A M; Brockhaus, Anne Catharina; McGauran, Natalie; Eikermann, Michaela

    2018-02-27

    The AGREE II instrument is the most commonly used guideline appraisal tool. It includes 23 appraisal criteria (items) organized within six domains. AGREE II also includes two overall assessments (overall guideline quality, recommendation for use). Our aim was to investigate how strongly the 23 AGREE II items influence the two overall assessments. An online survey of authors of publications on guideline appraisals with AGREE II and guideline users from a German scientific network was conducted between 10th February 2015 and 30th March 2015. Participants were asked to rate the influence of the AGREE II items on a Likert scale (0 = no influence to 5 = very strong influence). The frequencies of responses and their dispersion were presented descriptively. Fifty-eight of the 376 persons contacted (15.4%) participated in the survey and the data of the 51 respondents with prior knowledge of AGREE II were analysed. Items 7-12 of Domain 3 (rigour of development) and both items of Domain 6 (editorial independence) had the strongest influence on the two overall assessments. In addition, Items 15-17 (clarity of presentation) had a strong influence on the recommendation for use. Great variations were shown for the other items. The main limitation of the survey is the low response rate. In guideline appraisals using AGREE II, items representing rigour of guideline development and editorial independence seem to have the strongest influence on the two overall assessments. In order to ensure a transparent approach to reaching the overall assessments, we suggest the inclusion of a recommendation in the AGREE II user manual on how to consider item and domain scores. For instance, the manual could include an a-priori weighting of those items and domains that should have the strongest influence on the two overall assessments. The relevance of these assessments within AGREE II could thereby be further specified.

  15. Assessing Model Characterization of Single Source ...

    Science.gov (United States)

    Aircraft measurements made downwind from specific coal fired power plants during the 2013 Southeast Nexus field campaign provide a unique opportunity to evaluate single source photochemical model predictions of both O3 and secondary PM2.5 species. The model did well at predicting downwind plume placement. The model shows similar patterns of an increasing fraction of PM2.5 sulfate ion to the sum of SO2 and PM2.5 sulfate ion by distance from the source compared with ambient based estimates. The model was less consistent in capturing downwind ambient based trends in conversion of NOX to NOY from these sources. Source sensitivity approaches capture near-source O3 titration by fresh NO emissions, in particular subgrid plume treatment. However, capturing this near-source chemical feature did not translate into better downwind peak estimates of single source O3 impacts. The model estimated O3 production from these sources but often was lower than ambient based source production. The downwind transect ambient measurements, in particular secondary PM2.5 and O3, have some level of contribution from other sources which makes direct comparison with model source contribution challenging. Model source attribution results suggest contribution to secondary pollutants from multiple sources even where primary pollutants indicate the presence of a single source. The National Exposure Research Laboratory (NERL) Computational Exposure Division (CED) develops and evaluates data, deci

  16. Item response modeling: a psychometric assessment of the children's fruit, vegetable, water, and physical activity self-efficacy scales among Chinese children.

    Science.gov (United States)

    Wang, Jing-Jing; Chen, Tzu-An; Baranowski, Tom; Lau, Patrick W C

    2017-09-16

    This study aimed to evaluate the psychometric properties of four self-efficacy scales (i.e., self-efficacy for fruit (FSE), vegetable (VSE), and water (WSE) intakes, and physical activity (PASE)) and to investigate their differences in item functioning across sex, age, and body weight status groups using item response modeling (IRM) and differential item functioning (DIF). Four self-efficacy scales were administered to 763 Hong Kong Chinese children (55.2% boys) aged 8-13 years. Classical test theory (CTT) was used to examine the reliability and factorial validity of scales. IRM was conducted and DIF analyses were performed to assess the characteristics of item parameter estimates on the basis of children's sex, age and body weight status. All self-efficacy scales demonstrated adequate to excellent internal consistency reliability (Cronbach's α: 0.79-0.91). One FSE misfit item and one PASE misfit item were detected. Small DIF was found for all the scale items across children's age groups. Items with medium to large DIF were detected in different sex and body weight status groups, which will require modification. A Wright map revealed that items covered the range of the distribution of participants' self-efficacy for each scale except VSE. Several self-efficacy scales' items functioned differently by children's sex and body weight status. Additional research is required to modify the four self-efficacy scales to minimize these moderating influences for application.
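
    The internal consistency figure reported above (Cronbach's α) has a standard textbook definition that can be sketched in a few lines. The function name and the toy response matrix below are illustrative assumptions; the study's own data and software are not reproduced here.

```python
from statistics import variance


def cronbach_alpha(scores: list[list[float]]) -> float:
    """Cronbach's alpha for a respondents-by-items score matrix.

    scores: one inner list per respondent, one value per item.
    Uses sample variances (n - 1 denominator) throughout.
    """
    k = len(scores[0])  # number of items
    item_variances = [variance([row[i] for row in scores]) for i in range(k)]
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Toy data (hypothetical): three respondents answering two items.
toy_scores = [[1, 2], [2, 1], [3, 3]]
alpha = cronbach_alpha(toy_scores)
```

    Alpha rises as items covary more strongly relative to their individual variances, which is why it is read as a measure of internal consistency.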

  17. Utilising a multi-item questionnaire to assess household food security in Australia.

    Science.gov (United States)

    Butcher, Lucy M; O'Sullivan, Therese A; Ryan, Maria M; Lo, Johnny; Devine, Amanda

    2018-03-15

    Currently, two food sufficiency questions are utilised as a proxy measure of national food security status in Australia. These questions do not capture all dimensions of food security and have been attributed to underreporting of the problem. The purpose of this study was to investigate food security using the short form of the US Household Food Security Survey Module (HFSSM) within an Australian context; and explore the relationship between food security status and multiple socio-demographic variables. Two online surveys were completed by 2334 Australian participants from November 2014 to February 2015. Surveys contained the short form of the HFSSM and twelve socio-demographic questions. Cross-tabulations chi-square tests and a multinomial logistic regression model were employed to analyse the survey data. Food security status of the respondents was classified accordingly: High or Marginal (64%, n = 1495), Low (20%, n = 460) or Very Low (16%, n = 379). Significant independent predictors of food security were age (P important issue across Australia and that certain groups, regardless of income, are particularly vulnerable. Government policy and health promotion interventions that specifically target "at risk" groups may assist to more effectively address the problem. Additionally, the use of a multi-item measure is worth considering as a national indicator of food security in Australia. © 2018 Australian Health Promotion Association.

  18. Attitudes and evaluative practices: category vs. item and subjective vs. objective constructions in everyday food assessments.

    Science.gov (United States)

    Wiggins, Sally; Potter, Jonathan

    2003-12-01

    In social psychology, evaluative expressions have traditionally been understood in terms of their relationship to, and as the expression of, underlying 'attitudes'. In contrast, discursive approaches have started to study evaluative expressions as part of varied social practices, considering what such expressions are doing rather than their relationship to attitudinal objects or other putative mental entities. In this study the latter approach will be used to examine the construction of food and drink evaluations in conversation. The data are taken from a corpus of family mealtimes recorded over a period of months. The aim of this study is to highlight two distinctions that are typically obscured in traditional attitude work ('subjective' vs. 'objective' expressions, category vs. item evaluations). A set of extracts is examined to document the presence of these distinctions in talk that evaluates food and the way they are used and rhetorically developed to perform particular activities (accepting/refusing food, complimenting the food provider, persuading someone to eat). The analysis suggests that researchers (a) should be aware of the potential significance of these distinctions; (b) should be cautious when treating evaluative terms as broadly equivalent and (c) should be cautious when blurring categories and instances. This analysis raises the broader question of how far evaluative practices may be specific to particular domains, and what this specificity might consist in. It is concluded that research in this area could benefit from starting to focus on the role of evaluations in practices and charting their association with specific topics and objects.

  19. Exploring Plausible Causes of Differential Item Functioning in the PISA Science Assessment: Language, Curriculum or Culture

    Science.gov (United States)

    Huang, Xiaoting; Wilson, Mark; Wang, Lei

    2016-01-01

    In recent years, large-scale international assessments have been increasingly used to evaluate and compare the quality of education across regions and countries. However, measurement variance between different versions of these assessments often poses threats to the validity of such cross-cultural comparisons. In this study, we investigated the…

  20. Assessing cross-cultural item bias in questionnaires: Acculturation and the Measurement of Social Support and Family Cohesion for Adolescents

    OpenAIRE

    Hemert, Dianne A. van; Baerveldt, Chris; Vermande, Marjolijn

    2001-01-01

    A method is presented for evaluating the presence and size of cross-cultural item biases. The examined items concern parental support and family cohesion in a Likert-type questionnaire for adolescents in The Netherlands. Each evaluated item has two versions, a collectivist and an individualistic one, that measure the same theoretical construct. The standardized difference between the score means of the item versions, called the ?e score, gives an indication of the cultural bias of the item. As...

  1. Development of coordination system model on single-supplier multi-buyer for multi-item supply chain with probabilistic demand

    Science.gov (United States)

    Olivia, G.; Santoso, A.; Prayogo, D. N.

    2017-11-01

    Competition between supply chains is intensifying, and a good coordination system among supply chain members is crucial in addressing it. This paper develops a model of a coordination system between a single supplier and multiple buyers in a supply chain. The proposed optimization model was designed to determine the optimal number of deliveries from the supplier to the buyers in order to minimize the total cost over a planning horizon. The total supply chain cost consists of transportation costs, handling costs of the supplier and buyers, and stock-out costs. In the proposed optimization model, the supplier can supply various types of items to retailers whose item demand patterns are probabilistic. Sensitivity analysis of the proposed model was conducted to test the effect of changes in transport costs, handling costs and production capacities of the supplier. The results of the sensitivity analysis showed that changes in transportation cost, handling costs and production capacity significantly influence the optimal number of deliveries of each item to the buyers.

  2. Symptoms of anxiety in depression: assessment of item performance of the Hamilton Anxiety Rating Scale in patients with depression.

    Science.gov (United States)

    Vaccarino, Anthony L; Evans, Kenneth R; Sills, Terrence L; Kalali, Amir H

    2008-01-01

    Although diagnostically dissociable, anxiety is strongly co-morbid with depression. To examine further the clinical symptoms of anxiety in major depressive disorder (MDD), a non-parametric item response analysis on "blinded" data from four pharmaceutical company clinical trials was performed on the Hamilton Anxiety Rating Scale (HAMA) across levels of depressive severity. The severity of depressive symptoms was assessed using the 17-item Hamilton Depression Rating Scale (HAMD). HAMA and HAMD measures were supplied for each patient on each of two post-screen visits (n=1,668 observations). Option characteristic curves were generated for all 14 HAMA items to determine the probability of scoring a particular option on the HAMA in relation to the total HAMD score. Additional analyses were conducted using Pearson's product-moment correlations. Results showed that anxiety-related symptomatology generally increased as a function of overall depressive severity, though there were clear differences between individual anxiety symptoms in their relationship with depressive severity. In particular, anxious mood, tension, insomnia, difficulties in concentration and memory, and depressed mood were found to discriminate over the full range of HAMD scores, increasing continuously with increases in depressive severity. By contrast, many somatic-related symptoms, including muscular, sensory, cardiovascular, respiratory, gastro-intestinal, and genito-urinary were manifested primarily at higher levels of depression and did not discriminate well at lower HAMD scores. These results demonstrate anxiety as a core feature of depression, and the relationship between anxiety-related symptoms and depression should be considered in the assessment of depression and evaluation of treatment strategies and outcome.

  3. The Dimensional Assessment of Personality Psychopathology Basic Questionnaire: shortened versions item analysis.

    Science.gov (United States)

    Aluja, Anton; Blanch, Àngel; Blanco, Eduardo; Martí-Guiu, Maite; Balada, Ferran

    2015-01-13

    This study has been designed to evaluate and replicate the psychometric properties of the Dimensional Assessment of Personality Psychopathology-Basic Questionnaire (DAPP-BQ) and the DAPP-BQ short form (DAPP-SF) in a large Spanish general population sample. Additionally, we have generated a reduced form called DAPP-90, using a strategy based on a structural equation modeling (SEM) methodology in two independent samples, a calibration and a validation sample. The DAPP-90 scales obtained a more satisfactory fit on SEM adjustment values (average: TLI > .97 and RMSEA assessment of patients in hospital consultation or in brief psychological assessments.

  4. An approach for estimating item sensitivity to within-person change over time: An illustration using the Alzheimer's Disease Assessment Scale-Cognitive subscale (ADAS-Cog).

    Science.gov (United States)

    Dowling, N Maritza; Bolt, Daniel M; Deng, Sien

    2016-12-01

    When assessments are primarily used to measure change over time, it is important to evaluate items according to their sensitivity to change, specifically. Items that demonstrate good sensitivity to between-person differences at baseline may not show good sensitivity to change over time, and vice versa. In this study, we applied a longitudinal factor model of change to a widely used cognitive test designed to assess global cognitive status in dementia, and contrasted the relative sensitivity of items to change. Statistically nested models were estimated introducing distinct latent factors related to initial status differences between test-takers and within-person latent change across successive time points of measurement. Models were estimated using all available longitudinal item-level data from the Alzheimer's Disease Assessment Scale-Cognitive subscale, including participants representing the full-spectrum of disease status who were enrolled in the multisite Alzheimer's Disease Neuroimaging Initiative. Five of the 13 Alzheimer's Disease Assessment Scale-Cognitive items demonstrated noticeably higher loadings with respect to sensitivity to change. Attending to performance change on only these 5 items yielded a clearer picture of cognitive decline more consistent with theoretical expectations in comparison to the full 13-item scale. Items that show good psychometric properties in cross-sectional studies are not necessarily the best items at measuring change over time, such as cognitive decline. Applications of the methodological approach described and illustrated in this study can advance our understanding regarding the types of items that best detect fine-grained early pathological changes in cognition. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  5. Single lump breast surface stress assessment study

    Science.gov (United States)

    Vairavan, R.; Ong, N. R.; Sauli, Z.; Kirtsaeng, S.; Sakuntasathien, S.; Paitong, P.; Alcain, J. B.; Lai, S. L.; Retnasamy, V.

    2017-09-01

    Breast cancer is one of the most common cancers diagnosed among women around the world. Simulation approaches have been used to study, characterize and improve detection methods for breast cancer. However, minimal simulation work has been done to evaluate the surface stress of the breast with lumps. Thus, in this work, simulation analysis was used to evaluate and assess the breast surface stress due to the presence of a lump within the internal structure of the breast. The simulation was conducted using the Elmer software. Simulation results have confirmed that the presence of a lump within the breast causes stress on the skin surface of the breast.

  6. Examination of validity of fall risk assessment items for screening high fall risk elderly among the healthy community-dwelling Japanese population

    OpenAIRE

    DEMURA, Shinichi; SATO, Susumu; YAMAJI, Shunsuke; KASUGA, Kosho; NAGASAWA, Yoshinori

    2010-01-01

    We aimed to examine the validity of fall risk assessment items for the healthy community-dwelling elderly Japanese population. Participants were 1122 healthy elderly individuals aged 60 years and over (380 males and 742 females). The percentage who had experienced a fall was 15.8%. This study used fall experience and 50 fall risk assessment items representing the five risk factors (symptoms of falling, physical function, disease and physical symptom, environment, and behavior and character), ...

  7. Psychometrical Assessment and Item Analysis of the General Health Questionnaire in Victims of Terrorism

    Science.gov (United States)

    Delgado-Gomez, David; Lopez-Castroman, Jorge; de Leon-Martinez, Victoria; Baca-Garcia, Enrique; Cabanas-Arrate, Maria Luisa; Sanchez-Gonzalez, Antonio; Aguado, David

    2013-01-01

    There is a need to assess the psychiatric morbidity that appears as a consequence of terrorist attacks. The General Health Questionnaire (GHQ) has been used to this end, but its psychometric properties have never been evaluated in a population affected by terrorism. A sample of 891 participants included 162 direct victims of terrorist attacks and…

  8. e-GovQual: A Multiple-Item Scale for Assessing e-Government Service Quality

    Science.gov (United States)

    Papadomichelaki, Xenia; Mentzas, Gregoris

    2012-01-01

    A critical element in the evolution of governmental services through the internet is the development of sites that better serve the citizens' needs. To deliver superior service quality, we must first understand how citizens perceive and evaluate online. Citizen assessment is built on defining quality, identifying underlying dimensions, and…

  9. A multidimensional assessment of the validity and utility of alcohol use disorder severity as determined by item response theory models.

    Science.gov (United States)

    Dawson, Deborah A; Saha, Tulshi D; Grant, Bridget F

    2010-02-01

    The relative severity of the 11 DSM-IV alcohol use disorder (AUD) criteria is represented by their severity threshold scores, an item response theory (IRT) model parameter inversely proportional to their prevalence. These scores can be used to create a continuous severity measure comprising the total number of criteria endorsed, each weighted by its relative severity. This paper assesses the validity of the severity ranking of the 11 criteria and the overall severity score with respect to known AUD correlates, including alcohol consumption, psychological functioning, family history, antisociality, and early initiation of drinking, in a representative population sample of U.S. past-year drinkers (n=26,946). The unadjusted mean values for all validating measures increased steadily with the severity threshold score, except that legal problems, the criterion with the highest score, was associated with lower values than expected. After adjusting for the total number of criteria endorsed, this direct relationship was no longer evident. The overall severity score was no more highly correlated with the validating measures than a simple count of criteria endorsed, nor did the two measures yield different risk curves. This reflects both within-criterion variation in severity and the fact that the number of criteria endorsed and their severity are so highly correlated that severity is essentially redundant. Attempts to formulate a scalar measure of AUD will do as well by relying on simple counts of criteria or symptom items as by using scales weighted by IRT measures of severity. Published by Elsevier Ireland Ltd.
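
    The two measures compared in the abstract above, a simple count of endorsed criteria and a count weighted by each criterion's severity threshold, can be sketched as follows. The function names and the threshold values are hypothetical placeholders chosen for illustration; they are not the paper's estimates.

```python
def criteria_count(endorsed: list[bool]) -> int:
    """Simple count of endorsed criteria."""
    return sum(endorsed)


def weighted_severity(endorsed: list[bool], thresholds: list[float]) -> float:
    """Sum of severity thresholds over the endorsed criteria only."""
    return sum(t for e, t in zip(endorsed, thresholds) if e)


# Hypothetical severity thresholds for three criteria (not from the paper).
example_thresholds = [0.5, 1.0, 2.0]
example_endorsed = [True, False, True]

simple_score = criteria_count(example_endorsed)
severity_score = weighted_severity(example_endorsed, example_thresholds)
```

    When respondents who endorse more criteria also tend to endorse the more severe ones, the two scores rank people almost identically, which is consistent with the redundancy the abstract reports.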

  10. Development and Standardization of the Diagnostic Adaptive Behavior Scale: Application of Item Response Theory to the Assessment of Adaptive Behavior

    Science.gov (United States)

    Tassé, Marc J.; Schalock, Robert L.; Thissen, David; Balboni, Giulia; Bersani, Henry, Jr.; Borthwick-Duffy, Sharon A.; Spreat, Scott; Widaman, Keith F.; Zhang, Dalun; Navas, Patricia

    2016-01-01

    The Diagnostic Adaptive Behavior Scale (DABS) was developed using item response theory (IRT) methods and was constructed to provide the most precise and valid adaptive behavior information at or near the cutoff point of making a decision regarding a diagnosis of intellectual disability. The DABS initial item pool consisted of 260 items. Using IRT…

  11. Assessing cross-cultural item bias in questionnaires : Acculturation and the Measurement of Social Support and Family Cohesion for Adolescents

    NARCIS (Netherlands)

    Hemert, Dianne A. van; Baerveldt, Chris; Vermande, Marjolijn

    2001-01-01

    A method is presented for evaluating the presence and size of cross-cultural item biases. The examined items concern parental support and family cohesion in a Likert-type questionnaire for adolescents in The Netherlands. Each evaluated item has two versions, a collectivist and an individualistic one,

  12. Varying the item format improved the range of measurement in patient-reported outcome measures assessing physical function

    DEFF Research Database (Denmark)

    Liegl, Gregor; Gandek, Barbara; Fischer, H. Felix

    2017-01-01

    precision between the short forms using different item formats. Results: Sufficient unidimensionality of all short-form items and the original PF item bank was supported. Compared to formats A and B, format C increased the range of reliable measurement by about 0.5 standard deviations on the positive side...

  13. OPTIONS FOR THE ASSESSMENT OF ITEMS OF FINANCIAL STATEMENTS AT NATIONAL, EUROPEAN AND INTERNATIONAL LEVEL

    Directory of Open Access Journals (Sweden)

    SILVIA SAMARA

    2010-01-01

    Full Text Available The main purpose of evaluation is to determine the financial position and the outcome of the entity’s activity. With the intensifying globalization of economies and financial markets and the emergence of phenomena such as inflation, assessment based on current value and, in particular, on fair value came to be used more often. The users of the financial statements must always be taken into account when selecting a basis of evaluation. Internationally, we can observe a tendency to choose bases of evaluation that respond favourably to the needs of a wide range of users; a balance must be assured between the relevance of the information (its usefulness in decision-making) and its reliability (its objectivity).

  14. The 4-Item Negative Symptom Assessment (NSA-4) Instrument: A Simple Tool for Evaluating Negative Symptoms in Schizophrenia Following Brief Training.

    Science.gov (United States)

    Alphs, Larry; Morlock, Robert; Coon, Cheryl; van Willigenburg, Arjen; Panagides, John

    2010-07-01

    Objective. To assess the ability of mental health professionals to use the 4-item Negative Symptom Assessment instrument, derived from the Negative Symptom Assessment-16, to rapidly determine the severity of negative symptoms of schizophrenia. Design. Open participation. Setting. Medical education conferences. Participants. Attendees at two international psychiatry conferences. Measurements. Participants read a brief set of the 4-item Negative Symptom Assessment instructions and viewed a videotape of a patient with schizophrenia. Using the 1 to 6 4-item Negative Symptom Assessment severity rating scale, they rated four negative symptom items and the overall global negative symptoms. These ratings were compared with a consensus rating determination using frequency distributions and Chi-square tests for the proportion of participant ratings that were within one point of the expert rating. Results. More than 400 medical professionals (293 physicians, 50% with a European practice, and 55% who reported past utilization of schizophrenia ratings scales) participated. Between 82.1 and 91.1 percent of the 4-items and the global rating determinations by the participants were within one rating point of the consensus expert ratings. The differences between the percentage of participant rating scores that were within one point versus the percentage that were greater than one point different from those by the consensus experts was significant (pnegative symptoms using the 4-item Negative Symptom Assessment did not generally differ among the geographic regions of practice, the professional credentialing, or their familiarity with the use of schizophrenia symptom rating instruments. Conclusion. These findings suggest that clinicians from a variety of geographic practices can, after brief training, use the 4-item Negative Symptom Assessment effectively to rapidly assess negative symptoms in patients with schizophrenia.
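
    The agreement statistic used in the abstract above, the share of participant ratings that fall within one point of the consensus expert rating, is simple to compute. The sketch below assumes integer ratings on the 1 to 6 scale; the function name and the sample ratings are hypothetical and are not the study's data.

```python
def share_within_one_point(ratings: list[int], expert_rating: int) -> float:
    """Fraction of ratings that differ from the expert rating by at most 1."""
    within = sum(1 for r in ratings if abs(r - expert_rating) <= 1)
    return within / len(ratings)


# Hypothetical participant ratings for one item, with an expert rating of 4.
sample_ratings = [3, 4, 5, 6, 1]
agreement = share_within_one_point(sample_ratings, expert_rating=4)
```

    Computing this share per item and for the global rating would give one percentage for each, which is the kind of range the abstract reports.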

  15. Complement or Contamination: A Study of the Validity of Multiple-Choice Items when Assessing Reasoning Skills in Physics

    OpenAIRE

    Anders Jönsson; David Rosenlund; Fredrik Alvén

    2017-01-01

    The purpose of this study is to investigate the validity of using multiple-choice (MC) items as a complement to constructed-response (CR) items when making decisions about student performance on reasoning tasks. CR items from a national test in physics have been reformulated into MC items and students’ reasoning skills have been analyzed in two substudies. In the first study, 12 students answered the MC items and were asked to explain their answers orally. In the second study, 102 students fr...

  16. Proposta de um instrumento de medida para avaliar a satisfação de clientes de bancos utilizando a Teoria da Resposta ao Item / Proposal of tool to assess the satisfaction of bank customers using the Item Response Theory

    Directory of Open Access Journals (Sweden)

    Alceu Balbim Junior

    2011-01-01

    Full Text Available This paper presents a model for assessing the satisfaction of bank customers using the Item Response Theory (IRT). Organizations are constantly making efforts to satisfy customers in order to remain competitive. Several studies have reported on the relationship between perceived quality, satisfaction, and loyalty. The assessment of satisfaction can be accomplished through the perceived quality, and the development of assessment tools should address specific features of the activity in question. Based on articles that assess the satisfaction of bank customers, this study proposes an assessment tool consisting of 29 items. The items were applied to 240 clients to assess their satisfaction with the bank with which they have the closest relationship. Using Item Response Theory, the item parameters and the information curve were identified. Analysis of the items' degree of discrimination indicated that all items are appropriate. The information curve showed the interval in which the instrument gives the best estimates of satisfaction levels. The study reported the sample's mean level of satisfaction and the concentration of customers at the different satisfaction levels of the scale.

  17. A Third-Order Item Response Theory Model for Modeling the Effects of Domains and Subdomains in Large-Scale Educational Assessment Surveys

    Science.gov (United States)

    Rijmen, Frank; Jeon, Minjeong; von Davier, Matthias; Rabe-Hesketh, Sophia

    2014-01-01

    Second-order item response theory models have been used for assessments consisting of several domains, such as content areas. We extend the second-order model to a third-order model for assessments that include subdomains nested in domains. Using a graphical model framework, it is shown how the model does not suffer from the curse of…

  18. Short Scales for the Assessment of Personality Traits: Development and Validation of the Portuguese Ten-Item Personality Inventory (TIPI).

    Science.gov (United States)

    Nunes, Andreia; Limpo, Teresa; Lima, César F; Castro, São Luís

    2018-01-01

    The importance of quickly assessing personality traits in many studies prompted the development of brief scales such as the Ten-Item Personality Inventory (TIPI), a measure of five personality traits (extraversion, agreeableness, conscientiousness, emotional stability, and openness). In the current study, we present the Portuguese version of TIPI and examine its psychometric properties, based on a sample of 333 Portuguese adults aged 18 to 65 years. The results revealed reliability coefficients similar to the original version (α = 0.39-0.72), very good 4-week test-retest reliability (n = 81, rs > 0.71), expected factorial structure, high convergent validity with the Big-Five Inventory (rs > 0.60), and correlations with self-esteem, affect, and aggressiveness similar to those found with standard measures of personality traits. Overall, our findings suggest that the Portuguese TIPI is a reliable and valid alternative to longer measures: it offers a promising tool for research contexts in which the available time for personality assessment is highly limited.
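    The reliability coefficients (α) reported above are Cronbach's alpha values. The following is a minimal sketch of the standard formula; the function name and the input layout (one list of item scores per respondent) are assumptions made for illustration, and no data from the study are used.

```python
from statistics import variance

def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    alpha = k / (k - 1) * (1 - sum of item variances / variance of total score)
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("need at least two items")
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of the summed scale score.
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1.0 - sum(item_vars) / total_var)
```

    With ten items spread over five traits, each TIPI scale has only two items, and alpha tends to be low for very short scales even when test-retest reliability is good.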

  19. Short Scales for the Assessment of Personality Traits: Development and Validation of the Portuguese Ten-Item Personality Inventory (TIPI)

    Science.gov (United States)

    Nunes, Andreia; Limpo, Teresa; Lima, César F.; Castro, São Luís

    2018-01-01

    The importance of quickly assessing personality traits in many studies prompted the development of brief scales such as the Ten-Item Personality Inventory (TIPI), a measure of five personality traits (extraversion, agreeableness, conscientiousness, emotional stability, and openness). In the current study, we present the Portuguese version of TIPI and examine its psychometric properties, based on a sample of 333 Portuguese adults aged 18 to 65 years. The results revealed reliability coefficients similar to the original version (α = 0.39–0.72), very good 4-week test–retest reliability (n = 81, rs > 0.71), expected factorial structure, high convergent validity with the Big-Five Inventory (rs > 0.60), and correlations with self-esteem, affect, and aggressiveness similar to those found with standard measures of personality traits. Overall, our findings suggest that the Portuguese TIPI is a reliable and valid alternative to longer measures: it offers a promising tool for research contexts in which the available time for personality assessment is highly limited. PMID:29674989

  20. Using Procedure Based on Item Response Theory to Evaluate Classification Consistency Indices in the Practice of Large-Scale Assessment

    Directory of Open Access Journals (Sweden)

    Shanshan Zhang

    2017-09-01

    In spite of the growing interest in methods for evaluating classification consistency (CC) indices, few studies have applied these methods in the practice of large-scale educational assessment. In addition, few studies have considered the influence of practical factors, such as the examinee ability distribution, the cut score location, and the score scale, on the performance of CC indices. Using the newly developed Lee's procedure based on item response theory (IRT), the main purpose of this study is to investigate the performance of CC indices when practical factors are taken into consideration. A simulation study and an empirical study were conducted under comprehensive conditions. Results suggested that with a negatively skewed distribution, the CC indices were larger than with other distributions. Interactions occurred among ability distribution, cut score location, and score scale. Consequently, Lee's IRT procedure is reliable for use in large-scale educational assessment, but the reported indices should be treated with caution because testing conditions may vary considerably.

  1. Analysis of Item-Level Bias in the Bayley-III Language Subscales: The Validity and Utility of Standardized Language Assessment in a Multilingual Setting.

    Science.gov (United States)

    Goh, Shaun K Y; Tham, Elaine K H; Magiati, Iliana; Sim, Litwee; Sanmugam, Shamini; Qiu, Anqi; Daniel, Mary L; Broekman, Birit F P; Rifkin-Graboi, Anne

    2017-09-18

    The purpose of this study was to improve standardized language assessments among bilingual toddlers by investigating and removing the effects of bias due to unfamiliarity with cultural norms or a distributed language system. The Expressive and Receptive Bayley-III language scales were adapted for use in a multilingual country (Singapore). Differential item functioning (DIF) was applied to data from 459 two-year-olds without atypical language development. This involved investigating if the probability of success on each item varied according to language exposure while holding latent language ability, gender, and socioeconomic status constant. Associations with language, behavioral, and emotional problems were also examined. Five of 16 items showed DIF, 1 of which may be attributed to cultural bias and another to a distributed language system. The remaining 3 items favored toddlers with higher bilingual exposure. Removal of DIF items reduced associations between language scales and emotional and language problems, but improved the validity of the expressive scale from poor to good. Our findings indicate the importance of considering cultural and distributed language bias in standardized language assessments. We discuss possible mechanisms influencing performance on items favoring bilingual exposure, including the potential role of inhibitory processing.
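    Differential item functioning asks whether two groups with the same ability level have different odds of success on an item. The abstract does not name the estimation method used, so the sketch below substitutes the classical Mantel-Haenszel common odds ratio, computed over ability strata, purely as an illustration. The function name, the table layout, and the idea of stratifying on a total score are all assumptions.

```python
def mantel_haenszel_odds_ratio(strata):
    """Mantel-Haenszel common odds ratio across ability strata.

    strata: iterable of (a, b, c, d) counts, one 2x2 table per stratum, where
      a = reference group correct,  b = reference group incorrect,
      c = focal group correct,      d = focal group incorrect.
    A value near 1 suggests no DIF; values far from 1 suggest the item
    favors one group among examinees of comparable ability.
    """
    num = 0.0
    den = 0.0
    for a, b, c, d in strata:
        n = a + b + c + d
        if n == 0:
            continue  # skip empty strata
        num += a * d / n
        den += b * c / n
    if den == 0:
        raise ValueError("odds ratio undefined: no discordant cells")
    return num / den
```

    In a setting like the one described, the reference and focal groups would correspond to toddlers with lower and higher bilingual exposure, but that mapping is an illustration rather than the study's procedure.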

  2. Assessing Psycho-social Barriers to Rehabilitation in Injured Workers with Chronic Musculoskeletal Pain: Development and Item Properties of the Yellow Flag Questionnaire (YFQ).

    Science.gov (United States)

    Salathé, Cornelia Rolli; Trippolini, Maurizio Alen; Terribilini, Livio Claudio; Oliveri, Michael; Elfering, Achim

    2018-06-01

    Purpose To develop a multidimensional scale to assess psychosocial beliefs, the Yellow Flag Questionnaire (YFQ), aimed at guiding interventions for workers with chronic musculoskeletal (MSK) pain. Methods Phase 1 consisted of item selection based on literature search, item development and expert consensus rounds. In phase 2, items were reduced by calculating a quality score per item, using structural equation modeling and confirmatory factor analysis on data from 666 workers. In phase 3, Cronbach's α and Pearson correlation coefficients were computed to compare the YFQ score with disability, anxiety, depression and self-efficacy, based on data from 253 injured workers. Regressions of YFQ total score on disability, anxiety, depression and self-efficacy were calculated. Results After phase 1, the YFQ included 116 items and 15 domains. Further reductions of items in phase 2 by applying the item quality criteria reduced the total to 48 items. Factor analysis with structural equation modeling confirmed 32 items in seven domains: activity, work, emotions, harm & blame, diagnosis beliefs, co-morbidity and control. Cronbach's α was 0.91 for the total score and between 0.49 and 0.81 for the seven domain scores. Correlations of the YFQ total score with disability, anxiety, depression and self-efficacy were .58, .66, .73 and -.51, respectively. After controlling for age and gender, the YFQ total score explained between 27% and 53% of the variance (R²) in disability, anxiety, depression and self-efficacy. Conclusions The YFQ, a multidimensional screening scale, is recommended for assessing the psychosocial beliefs of workers with chronic MSK pain. Further evaluation of measurement properties such as test-retest reliability, responsiveness and prognostic validity is warranted.

  3. Comparison of single questions and brief questionnaire with longer validated food frequency questionnaire to assess adequate fruit and vegetable intake.

    Science.gov (United States)

    Cook, Amelia; Roberts, Kia; O'Leary, Fiona; Allman-Farinelli, Margaret Anne

    2015-01-01

    The aim of this study was to determine if a single question (SQ) for fruit and a SQ or five-item questionnaire for vegetable consumption (VFQ) could replace a longer food frequency questionnaire (FFQ) to screen for inadequate versus adequate intakes in populations. Participants (109) completed three test screeners: fruit SQ, vegetable SQ, and a five-item VFQ followed by the reference 74-item FFQ (version 2 of the Dietary Questionnaire for Epidemiological Studies [DQESv2]) including 13 fruit and 25 vegetable items. The five-item VFQ asked about intake of salad vegetables, cooked vegetables, white potatoes, legumes, and vegetable juice. The screeners were compared with the reference (DQESv2 FFQ) for sensitivity, specificity, and positive and negative predictive powers (PPV, NPV) to detect intakes of two or more servings of fruit and three or more servings of vegetables. Relative validity was examined using Bland-Altman statistics. The fruit SQ showed a PPV of 56% and an NPV of 83%. The PPV for the vegetable SQ was 30% and the NPV was 89%. For the five-item VFQ, the PPV was 39% and the NPV was 85%. Bland-Altman plots and linear regression equations showed that although the screener showed good agreement for fruit (unstandardized b1 coefficient = 0.04) for vegetable intake the difference between methods increased at higher intake levels (unstandardized b1 coefficients = -0.3 for the SQ, b1 = -0.6 for five-item VFQ). The fruit SQ and the five-item VFQ are suitable replacements for longer FFQs to detect inadequate intake and assess population mean but not individual intakes. Copyright © 2015 Elsevier Inc. All rights reserved.
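    The screening statistics named in this abstract (sensitivity, specificity, PPV, NPV) and the Bland-Altman bias with 95% limits of agreement can be computed as in the sketch below. The function names are hypothetical, and any counts or intake values passed in would be the user's own; none of the study's data are reproduced here.

```python
from statistics import mean, stdev

def screening_metrics(tp, fp, fn, tn):
    """Screener accuracy against a reference method, from 2x2 counts.

    tp/fn: reference-positive cases the screener flagged / missed,
    fp/tn: reference-negative cases the screener flagged / cleared.
    """
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }

def bland_altman(screener, reference):
    """Mean difference (bias) and 95% limits of agreement between two methods."""
    diffs = [s - r for s, r in zip(screener, reference)]
    bias = mean(diffs)
    spread = 1.96 * stdev(diffs)
    return bias, bias - spread, bias + spread
```

    A low PPV combined with a high NPV, the pattern reported for the vegetable screeners, means a positive screen is often wrong while a negative screen is usually right.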

  4. Gender-Based Differential Item Performance in Mathematics Achievement Items.

    Science.gov (United States)

    Doolittle, Allen E.; Cleary, T. Anne

    1987-01-01

    Eight randomly equivalent samples of high school seniors were each given a unique form of the ACT Assessment Mathematics Usage Test (ACTM). Signed measures of differential item performance (DIP) were obtained for each item in the eight ACTM forms. DIP estimates were analyzed and a significant item category effect was found. (Author/LMO)

  5. Assessing Psychopathy Among Justice Involved Adolescents with the PCL: YV: An Item Response Theory Examination Across Gender

    Science.gov (United States)

    Tsang, Siny; Schmidt, Karen M.; Vincent, Gina M.; Salekin, Randall T.; Moretti, Marlene M.; Odgers, Candice L.

    2014-01-01

    This study used an item response theory (IRT) model and a large adolescent sample of justice involved youth (N = 1,007, 38% female) to examine the item functioning of the Psychopathy Checklist – Youth Version (PCL: YV). Items that were most discriminating (or most sensitive to changes) of the latent trait (thought to be psychopathy) among adolescents included “Glibness/superficial charm”, “Lack of remorse”, and “Need for stimulation”, whereas items that were least discriminating included “Pathological lying”, “Failure to accept responsibility”, and “Lacks goals.” The items “Impulsivity” and “Irresponsibility” were the most likely to be rated high among adolescents, whereas “Parasitic lifestyle”, and “Glibness/superficial charm” were the most likely to be rated low. Evidence of differential item functioning (DIF) on four of the 13 items was found between boys and girls. “Failure to accept responsibility” and “Impulsivity” were endorsed more frequently to describe adolescent girls than boys at similar levels of the latent trait, and vice versa for “Grandiose sense of self-worth” and “Lacks goals.” The DIF findings suggest that four PCL: YV items function differently between boys and girls. PMID:25580672

  6. A Single Conjunction Risk Assessment Metric: the F-Value

    Science.gov (United States)

    Frigm, Ryan Clayton; Newman, Lauri K.

    2009-01-01

    The Conjunction Assessment Team at NASA Goddard Space Flight Center provides conjunction risk assessment for many NASA robotic missions. These risk assessments are based on several figures of merit, such as miss distance, probability of collision, and orbit determination solution quality. However, these individual metrics do not singly capture the overall risk associated with a conjunction, making it difficult for someone without this complete understanding to take action, such as an avoidance maneuver. The goal of this analysis is to introduce a single risk index metric that can easily convey the level of risk without all of the technical details. The proposed index is called the conjunction "F-value." This paper presents the concept of the F-value and the tuning of the metric for use in routine Conjunction Assessment operations.

  7. A study of the psychometric properties of 12-item World Health Organization Disability Assessment Schedule 2.0 in a large population of people with chronic musculoskeletal pain.

    Science.gov (United States)

    Saltychev, Mikhail; Bärlund, Esa; Mattie, Ryan; McCormick, Zachary; Paltamaa, Jaana; Laimi, Katri

    2017-02-01

    To assess the validity of the Finnish translation of the 12-item World Health Organization Disability Assessment Schedule (WHODAS 2.0). Cross-sectional cohort survey study. Physical and Rehabilitation Medicine outpatient university clinic. The 501 consecutive patients with chronic musculoskeletal pain. Exploratory factor analysis and a graded response model using item response theory analysis were used to assess the constructs and discrimination ability of WHODAS 2.0. The exploratory factor analysis revealed two retained factors with eigenvalues 5.15 and 1.04. Discrimination ability of all items was high or perfect, varying from 1.2 to 2.5. The difficulty levels of seven out of 12 items were shifted towards the elevated disability level. As a result, the entire test characteristic curve showed a shift towards higher levels of disability, placing it at the point of disability level of +1 (where 0 indicates the average level of disability within the sample). The present data indicate that the Finnish translation of the 12-item WHODAS 2.0 is a valid instrument for measuring restrictions of activity and participation among patients with chronic musculoskeletal pain.
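    The graded response model used in this analysis assigns each ordered response category a probability that depends on the respondent's trait level, the item's discrimination, and a set of ordered thresholds (difficulties). A minimal sketch of Samejima's formulation follows; the function name and the example parameter values are hypothetical and do not come from the study.

```python
import math

def grm_category_probs(theta, a, thresholds):
    """Category probabilities for one item under the graded response model.

    theta: latent trait level (e.g. disability), a: item discrimination,
    thresholds: ascending difficulties b_1 < ... < b_(m-1) for m categories.
    The cumulative curves P(X >= k) are logistic; each category probability
    is the difference between adjacent cumulative curves.
    """
    if list(thresholds) != sorted(thresholds):
        raise ValueError("thresholds must be ascending")
    cumulative = [1.0]
    cumulative += [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
    cumulative.append(0.0)
    return [cumulative[k] - cumulative[k + 1] for k in range(len(cumulative) - 1)]

if __name__ == "__main__":
    # Hypothetical item with thresholds shifted toward higher trait levels.
    print(grm_category_probs(theta=0.0, a=1.8, thresholds=[0.2, 1.0, 1.9, 2.6]))
```

    An item whose thresholds sit mostly above zero, as in the hypothetical example, is endorsed mainly by respondents with above-average disability, which is the kind of shift the abstract describes for seven of the 12 items.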

  8. Item level diagnostics and model - data fit in item response theory ...

    African Journals Online (AJOL)

    Item response theory (IRT) is a framework for modeling and analyzing item response data. Item-level modeling gives IRT advantages over classical test theory. The fit of an item score pattern to an IRT model is a necessary condition that must be assessed for further use of item and models that best fit ...

  9. Varying the item format improved the range of measurement in patient-reported outcome measures assessing physical function.

    Science.gov (United States)

    Liegl, Gregor; Gandek, Barbara; Fischer, H Felix; Bjorner, Jakob B; Ware, John E; Rose, Matthias; Fries, James F; Nolte, Sandra

    2017-03-21

    Physical function (PF) is a core patient-reported outcome domain in clinical trials in rheumatic diseases. Frequently used PF measures have ceiling effects, leading to large sample size requirements and low sensitivity to change. In most of these instruments, the response category that indicates the highest PF level is the statement that one is able to perform a given physical activity without any limitations or difficulty. This study investigates whether using an item format with an extended response scale, allowing respondents to state that the performance of an activity is easy or very easy, increases the range of precise measurement of self-reported PF. Three five-item PF short forms were constructed from the Patient-Reported Outcomes Measurement Information System (PROMIS®) wave 1 data. All forms included the same physical activities but varied in item stem and response scale: format A ("Are you able to …"; "without any difficulty"/"unable to do"); format B ("Does your health now limit you …"; "not at all"/"cannot do"); format C ("How difficult is it for you to …"; "very easy"/"impossible"). Each short-form item was answered by 2217-2835 subjects. We evaluated unidimensionality and estimated a graded response model for the 15 short-form items and remaining 119 items of the PROMIS PF bank to compare item and test information for the short forms along the PF continuum. We then used simulated data for five groups with different PF levels to illustrate differences in scoring precision between the short forms using different item formats. Sufficient unidimensionality of all short-form items and the original PF item bank was supported. Compared to formats A and B, format C increased the range of reliable measurement by about 0.5 standard deviations on the positive side of the PF continuum of the sample, provided more item information, and was more useful in distinguishing known groups with above-average functioning. Using an item format with an extended

  10. Effects of memantine on cognition in patients with moderate to severe Alzheimer's disease: post-hoc analyses of ADAS-cog and SIB total and single-item scores from six randomized, double-blind, placebo-controlled studies.

    Science.gov (United States)

    Mecocci, Patrizia; Bladström, Anna; Stender, Karina

    2009-05-01

    The post-hoc analyses reported here evaluate the specific effects of memantine treatment on ADAS-cog single-items or SIB subscales for patients with moderate to severe AD. Data from six multicentre, randomised, placebo-controlled, parallel-group, double-blind, 6-month studies were used as the basis for these post-hoc analyses. All patients with a Mini-Mental State Examination (MMSE) score of less than 20 were included. Analyses of patients with moderate AD (MMSE: 10-19), evaluated with the Alzheimer's Disease Assessment Scale (ADAS-cog), and analyses of patients with moderate to severe AD (MMSE: 3-14), evaluated using the Severe Impairment Battery (SIB), were performed separately. The mean change from baseline showed a significant benefit of memantine treatment on both the ADAS-cog (p […] ADAS-cog single-item analyses showed significant benefits of memantine treatment, compared to placebo, for mean change from baseline for commands (p < 0.001), ideational praxis (p < 0.05), orientation (p < 0.01), comprehension (p < 0.05), and remembering test instructions (p < 0.05) for observed cases (OC). The SIB subscale analyses showed significant benefits of memantine, compared to placebo, for mean change from baseline for language (p < 0.05), memory (p < 0.05), orientation (p < 0.01), praxis (p < 0.001), and visuospatial ability (p < 0.01) for OC. Memantine shows significant benefits on overall cognitive abilities as well as on specific key cognitive domains for patients with moderate to severe AD. (c) 2009 John Wiley & Sons, Ltd.

  11. Validation of a 4-item Negative Symptom Assessment (NSA-4): a short, practical clinical tool for the assessment of negative symptoms in schizophrenia.

    Science.gov (United States)

    Alphs, Larry; Morlock, Robert; Coon, Cheryl; Cazorla, Pilar; Szegedi, Armin; Panagides, John

    2011-06-01

    The 16-item Negative Symptom Assessment (NSA-16) scale is a validated tool for evaluating negative symptoms of schizophrenia. The psychometric properties and predictive power of a four-item version (NSA-4) were compared with the NSA-16. Baseline data from 561 patients with predominant negative symptoms of schizophrenia who participated in two identically designed clinical trials were evaluated. Ordered logistic regression analysis of ratings using NSA-4 and NSA-16 were compared with ratings using several other standard tools to determine predictive validity and construct validity. Internal consistency and test-retest reliability were also analyzed. NSA-16 and NSA-4 scores were both predictive of scores on the NSA global rating (odds ratio = 0.83-0.86) and the Clinical Global Impressions-Severity scale (odds ratio = 0.91-0.93). NSA-16 and NSA-4 showed high correlation with each other (Pearson r = 0.85), similar high correlation with other measures of negative symptoms (demonstrating convergent validity), and lesser correlations with measures of other forms of psychopathology (demonstrating divergent validity). NSA-16 and NSA-4 both showed acceptable internal consistency (Cronbach α, 0.85 and 0.64, respectively) and test-retest reliability (intraclass correlation coefficient, 0.87 and 0.82). This study demonstrates that NSA-4 offers accuracy comparable to the NSA-16 in rating negative symptoms in patients with schizophrenia. Copyright © 2011 John Wiley & Sons, Ltd.

  12. The development and discussion of computerized visual perception assessment tool for Chinese characters structures - Concurrent estimation of the overall ability and the domain ability in item response theory approach.

    Science.gov (United States)

    Wu, Huey-Min; Lin, Chin-Kai; Yang, Yu-Mao; Kuo, Bor-Chen

    2014-11-12

    Visual perception is the fundamental skill required for a child to recognize words, and to read and write. No visual perception assessment tool based on Chinese characters had been developed for preschool children in Taiwan. The purposes of this study were to develop a computerized visual perception assessment tool for Chinese character structures and to explore the psychometric characteristics of the tool. This study adopted purposive sampling. The study evaluated 551 kindergarten-age children (293 boys, 258 girls) ranging from 46 to 81 months of age. The test instrument consisted of three subtests and 58 items, including tests of basic strokes, single-component characters, and compound characters. Based on the results of model fit analysis, a higher-order item response theory model was used to estimate performance in visual perception, basic strokes, single-component characters, and compound characters simultaneously. Analyses of variance were used to detect significant differences among age groups and between gender groups. The difficulty of identifying items in a visual perception test ranged from -2 to 1. The visual perception ability of 4- to 6-year-old children ranged from -1.66 to 2.19. Gender did not have significant effects on performance. However, there were significant differences among the different age groups. The performance of 6-year-olds was better than that of 5-year-olds, which was better than that of 4-year-olds. This study obtained detailed diagnostic scores by using a higher-order item response theory model to understand the visual perception of basic strokes, single-component characters, and compound characters. Further statistical analysis showed that, for basic strokes and compound characters, girls performed better than did boys; there also were differences within each age group. For single-component characters, there was no difference in performance between boys and girls. However, again the performance of 6-year-olds was better than

  13. Assessing the Equivalence of Paper, Mobile Phone, and Tablet Survey Responses at a Community Mental Health Center Using Equivalent Halves of a 'Gold-Standard' Depression Item Bank.

    Science.gov (United States)

    Brodey, Benjamin B; Gonzalez, Nicole L; Elkin, Kathryn Ann; Sasiela, W Jordan; Brodey, Inger S

    2017-09-06

    The computerized administration of self-report psychiatric diagnostic and outcomes assessments has risen in popularity. If results are similar enough across different administration modalities, then new administration technologies can be used interchangeably and the choice of technology can be based on other factors, such as convenience in the study design. An assessment based on item response theory (IRT), such as the Patient-Reported Outcomes Measurement Information System (PROMIS) depression item bank, offers new possibilities for assessing the effect of technology choice upon results. To create equivalent halves of the PROMIS depression item bank and to use these halves to compare survey responses and user satisfaction among administration modalities (paper, mobile phone, or tablet) with a community mental health care population. The 28 PROMIS depression items were divided into 2 halves based on content and simulations with an established PROMIS response data set. A total of 129 participants were recruited from an outpatient public sector mental health clinic based in Memphis. All participants took both nonoverlapping halves of the PROMIS IRT-based depression items (Part A and Part B): once using paper and pencil, and once using either a mobile phone or tablet. An 8-cell randomization was done on technology used, order of technologies used, and order of PROMIS Parts A and B. Both Parts A and B were administered as fixed-length assessments and both were scored using published PROMIS IRT parameters and algorithms. All 129 participants received either Part A or B via paper assessment. Participants were also administered the opposite assessment, 63 using a mobile phone and 66 using a tablet. There was no significant difference in item response scores for Part A versus B. All 3 of the technologies yielded essentially identical assessment results and equivalent satisfaction levels. Our findings show that the PROMIS depression assessment can be divided into 2 equivalent

  14. Is a single item stress measure independently associated with subsequent severe injury: a prospective cohort study of 16,385 forest industry employees.

    Science.gov (United States)

    Salminen, Simo; Kouvonen, Anne; Koskinen, Aki; Joensuu, Matti; Väänänen, Ari

    2014-06-02

    A previous review showed that high stress increases the risk of occupational injury by three- to five-fold. However, most of the prior studies have relied on short follow-ups. In this prospective cohort study we examined the effect of stress on recorded hospitalised injuries in an 8-year follow-up. A total of 16,385 employees of a Finnish forest company responded to the questionnaire. Perceived stress was measured with a validated single-item measure, and analysed in relation to recorded hospitalised injuries from 1986 to 2008. We used Cox proportional hazard regression models to examine the prospective associations between work stress, injuries and confounding factors. Highly stressed participants were approximately 40% more likely to be hospitalised due to injury over the follow-up period than participants with low stress. This association remained significant after adjustment for age, gender, marital status, occupational status, educational level, and physical work environment. High stress is associated with an increased risk of severe injury.

  15. Psychometric Validation of the World Health Organization Disability Assessment Schedule 2.0-Twelve-Item Version in Persons with Spinal Cord Injuries

    Science.gov (United States)

    Smedema, Susan Miller; Ruiz, Derek; Mohr, Michael J.

    2017-01-01

    Purpose: To evaluate the factorial and concurrent validity and internal consistency reliability of the World Health Organization Disability Assessment Schedule 2.0 (WHODAS 2.0) 12-item version in persons with spinal cord injuries. Method: Two hundred forty-seven adults with spinal cord injuries completed an online survey consisting of the WHODAS…

  16. Improving the Reliability of Student Scores from Speeded Assessments: An Illustration of Conditional Item Response Theory Using a Computer-Administered Measure of Vocabulary

    Science.gov (United States)

    Petscher, Yaacov; Mitchell, Alison M.; Foorman, Barbara R.

    2015-01-01

    A growing body of literature suggests that response latency, the amount of time it takes an individual to respond to an item, may be an important factor to consider when using assessment data to estimate the ability of an individual. Considering that tests of passage and list fluency are being adapted to a computer administration format, it is…

  17. Electronic assessment of clinical reasoning in clerkships: A mixed-methods comparison of long-menu key-feature problems with context-rich single best answer questions

    NARCIS (Netherlands)

    Huwendiek, S.; Reichert, F.; Duncker, C.; Leng, B.A. De; Vleuten, C.P.M. van der; Muijtjens, A.M.; Bosse, H.M.; Haag, M.; Hoffmann, G.F.; Tonshoff, B.; Dolmans, D.

    2017-01-01

    BACKGROUND: It remains unclear which item format would best suit the assessment of clinical reasoning: context-rich single best answer questions (crSBAs) or key-feature problems (KFPs). This study compared KFPs and crSBAs with respect to students' acceptance, their educational impact, and

  18. Chip based single cell analysis for nanotoxicity assessment.

    Science.gov (United States)

    Shah, Pratikkumar; Kaushik, Ajeet; Zhu, Xuena; Zhang, Chengxiao; Li, Chen-Zhong

    2014-05-07

    Nanomaterials, because of their tunable properties and performances, have been utilized extensively in everyday life related consumable products and technology. On exposure, beyond the physiological range, nanomaterials cause health risks via affecting the function of organisms, genomic systems, and even the central nervous system. Thus, new analytical approaches for nanotoxicity assessment to verify the feasibility of nanomaterials for future use are in demand. The conventional analytical techniques, such as spectrophotometric assay-based techniques, usually require a lengthy and time-consuming process and often produce false positives, and often cannot be implemented at a single cell level measurement for studying cell behavior without interference from its surrounding environment. Hence, there is a demand for a precise, accurate, sensitive assessment for toxicity using single cells. Recently, due to the advantages of automation of fluids and minimization of human errors, the integration of a cell-on-a-chip (CoC) with a microfluidic system is in practice for nanotoxicity assessments. This review explains nanotoxicity and its assessment approaches with advantages/limitations and new approaches to overcome the confines of traditional techniques. Recent advances in nanotoxicity assessment using a CoC integrated with a microfluidic system are also discussed in this review, which may be of use for nanotoxicity assessment and diagnostics.

  19. North Star Ambulatory Assessment, 6-minute walk test and timed items in ambulant boys with Duchenne muscular dystrophy.

    Science.gov (United States)

    Mazzone, Elena; Martinelli, Diego; Berardinelli, Angela; Messina, Sonia; D'Amico, Adele; Vasco, Gessica; Main, Marion; Doglio, Luca; Politano, Luisa; Cavallaro, Filippo; Frosini, Silvia; Bello, Luca; Carlesi, Adelina; Bonetti, Anna Maria; Zucchini, Elisabetta; De Sanctis, Roberto; Scutifero, Marianna; Bianco, Flaviana; Rossi, Francesca; Motta, Maria Chiara; Sacco, Annalisa; Donati, Maria Alice; Mongini, Tiziana; Pini, Antonella; Battini, Roberta; Pegoraro, Elena; Pane, Marika; Pasquini, Elisabetta; Bruno, Claudio; Vita, Giuseppe; de Waure, Chiara; Bertini, Enrico; Mercuri, Eugenio

    2010-11-01

    The North Star Ambulatory Assessment is a functional scale specifically designed for ambulant boys affected by Duchenne muscular dystrophy (DMD). Recently the 6-minute walk test has also been used as an outcome measure in trials in DMD. The aim of our study was to assess a large cohort of ambulant boys affected by DMD using both the North Star Assessment and the 6-minute walk test. More specifically, we wished to establish the spectrum of findings for each measure and their correlation. This is a prospective multicentric study involving 10 centers. The cohort included 112 ambulant DMD boys of age ranging between 4.10 and 17 years (mean 8.18±2.3 SD). Ninety-one of the 112 were on steroids: 37/91 on intermittent and 54/91 on daily regimen. The scores on the North Star assessment ranged from 6/34 to 34/34. The distance on the 6-minute walk test ranged from 127 to 560.6 m. The time to walk 10 m was between 3 and 15 s. The time to rise from the floor ranged from 1 to 27.5 s. Some patients were unable to rise from the floor. As expected the results changed with age and were overall better in children treated with daily steroids. The North Star assessment had a moderate to good correlation with 6-minute walk test and with timed rising from floor but less with 10 m timed walk/run test. The 6-minute walk test in contrast had better correlation with 10 m timed walk/run test than with timed rising from floor. These findings suggest that a combination of these outcome measures can be effectively used in ambulant DMD boys and will provide information on different aspects of motor function that may not be captured using a single measure. Copyright © 2010. Published by Elsevier B.V.

  20. Translation and cross-cultural adaptation of the Detailed Assessment of Speed of Handwriting 17+ to Brazilian Portuguese: conceptual, item and semantic equivalence.

    Science.gov (United States)

    Cardoso, Monique Herrera; Capellini, Simone Aparecida

    2018-02-19

    Perform a cross-cultural adaptation of the Detailed Assessment of Speed of Handwriting 17+ (DASH 17+) for Brazilians. Evaluation of (1) conceptual, item and (2) semantic equivalence, with assistance of four translators and application of a pilot study to 36 students. (1) The concepts and items are equivalent in the British and Brazilian cultures. (2) Adaptations were made concerning the English language pangram used in copying tasks and selection of the lower-case, cursive handwriting in the alphabet-writing task. Application of the pilot study verified acceptability and understanding of the proposed tasks by the students. The Brazilian Portuguese version of the DASH 17+ was presented after finalization of the conceptual, item and semantic equivalence of the instrument. Further studies on psychometric properties should be conducted with the purpose of measuring the speed of handwriting in youngsters and adults with greater reliability and validity to the procedure.

  1. Recipient ineligibility after liver transplantation assessment: a single centre experience.

    Science.gov (United States)

    Arya, Aman; Hernandez-Alejandro, Roberto; Marotta, Paul; Uhanova, Julia; Chandok, Natasha

    2013-06-01

    Candidacy for liver transplantation is determined through standardized evaluation. There are limited data on the frequency and reasons for denial of transplantation after assessment; analysis may shed light on the short-term utility of the assessment. We sought to describe the frequency and reasons for ineligibility for liver transplantation among referred adults. We studied all prospectively followed recipient candidates at a single centre who were deemed unsuitable for liver transplantation after assessment. Inclusion criteria were age 18 years and older and completion of a standard liver transplantation evaluation over a 3-year period. Patients were excluded if they had a history of prior assessment or liver transplantation within the study period. Demographic and baseline clinical data and reasons for recipient ineligibility were recorded. In all, 337 patients underwent their first liver transplantation evaluation during the study period; 166 (49.3%) fulfilled inclusion criteria. The mean age was 55.4 years, and 106 (63.9%) were men. The 3 most common reasons for denial of listing were patient too well (n = 82, 49.4%), medical comorbidities and/or need for medical optimization (n = 43, 25.9%) and need for addiction rehabilitation (n = 28, 16.9%). Ineligibility for transplantation after assessment was common, occurring in nearly half of the cohort. Most denied candidates could be identified with more discriminate screening before the resource-intensive assessment; however, the assessment likely provides unforeseen positive impacts on patient care.
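
    The percentages quoted in this abstract can be reproduced from the reported counts. The short sketch below does that arithmetic; the helper name `percent` is introduced here for illustration only, and the counts are taken directly from the abstract.

```python
def percent(count, total):
    """Return count/total as a percentage rounded to one decimal place."""
    return round(100.0 * count / total, 1)


evaluated = 337  # first liver transplantation evaluations in the study period
cohort = 166     # patients in the analysed cohort, as reported

print(percent(cohort, evaluated))  # share of evaluated patients in the cohort
print(percent(106, cohort))        # men
print(percent(82, cohort))         # patient too well
print(percent(43, cohort))         # comorbidities / medical optimization
print(percent(28, cohort))         # addiction rehabilitation
```

    The three listed reasons account for 153 of the 166 patients, so the remaining 13 fall under reasons the abstract does not enumerate.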

  2. Using existing questionnaires in latent class analysis: should we use summary scores or single items as input? A methodological study using a cohort of patients with low back pain

    Directory of Open Access Journals (Sweden)

    Nielsen AM

    2016-04-01

    Anne Molgaard Nielsen,1 Werner Vach,2 Peter Kent,1,3 Lise Hestbaek,1,4 Alice Kongsted1,4 1Department of Sports Science and Clinical Biomechanics, University of Southern Denmark, Odense, Denmark; 2Center for Medical Biometry and Medical Informatics, Medical Center, University of Freiburg, Freiburg, Germany; 3School of Physiotherapy and Exercise Science, Curtin University, Perth, Australia; 4Nordic Institute of Chiropractic and Clinical Biomechanics, University of Southern Denmark, Odense, Denmark Background: Latent class analysis (LCA) is increasingly being used in health research, but optimal approaches to handling complex clinical data are unclear. One issue is that commonly used questionnaires are multidimensional, but expressed as summary scores. Using the example of low back pain (LBP), the aim of this study was to explore and descriptively compare the application of LCA when using questionnaire summary scores and when using single items to subgrouping of patients based on multidimensional data. Materials and methods: Baseline data from 928 LBP patients in an observational study were classified into four health domains (psychology, pain, activity, and participation) using the World Health Organization’s International Classification of Functioning, Disability, and Health framework. LCA was performed within each health domain using the strategies of summary-score and single-item analyses. The resulting subgroups were descriptively compared using statistical measures and clinical interpretability. Results: For each health domain, the preferred model solution ranged from five to seven subgroups for the summary-score strategy and seven to eight subgroups for the single-item strategy. There was considerable overlap between the results of the two strategies, indicating that they were reflecting the same underlying data structure. However, in three of the four health domains, the single-item strategy resulted in a more nuanced description, in terms

  3. Does the Assessment of Recovery Capital scale reflect a single or multiple domains?

    Science.gov (United States)

    Arndt, Stephan; Sahker, Ethan; Hedden, Suzy

    2017-01-01

    The goal of this study was to determine whether the 50-item Assessment of Recovery Capital scale represents a single general measure or whether multiple domains might be psychometrically useful for research or clinical applications. Data are from a cross-sectional de-identified existing program evaluation information data set with 1,138 clients entering substance use disorder treatment. Principal components and iterated factor analysis were used on the domain scores. Multiple group factor analysis provided a quasi-confirmatory factor analysis. The solution accounted for 75.24% of the total variance, suggesting that 10 factors provide a reasonably good fit. However, Tucker's congruence coefficients between the factor structure and defining weights (0.41-0.52) suggested a poor fit to the hypothesized 10-domain structure. Principal components of the 10-domain scores yielded one factor whose eigenvalue was greater than one (5.93), accounting for 75.8% of the common variance. A few domains had perceptible but small unique variance components suggesting that a few of the domains may warrant enrichment. Our findings suggest that there is one general factor, with a caveat. Using the 10 measures inflates the chance for Type I errors. Using one general measure avoids this issue, is simple to interpret, and could reduce the number of items. However, those seeking to maximally predict later recovery success may need to use the full instrument and all 10 domains.
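
    As a reading aid for the eigenvalue reported above, the sketch below applies the common "eigenvalue greater than one" retention rule and converts an eigenvalue into a share of total variance. It assumes the principal components were extracted from a correlation matrix of the 10 domain scores, which the abstract does not state; the function names are hypothetical.

```python
def passes_kaiser(eigenvalue, threshold=1.0):
    """True if a component would be retained under the eigenvalue > 1 rule."""
    return eigenvalue > threshold


def share_of_total_variance(eigenvalue, n_variables):
    """Proportion of total variance for a component of a correlation matrix.

    Assumes standardized variables, so total variance equals n_variables.
    """
    return eigenvalue / n_variables


first_eigenvalue = 5.93  # reported in the abstract
n_domains = 10           # number of domain scores analysed

print(passes_kaiser(first_eigenvalue))
print(round(100 * share_of_total_variance(first_eigenvalue, n_domains), 1))
```

    Under that assumption the first component would carry 59.3% of the total variance. The abstract's 75.8% is stated relative to the common variance, a different denominator, so the two figures are not in conflict.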

  4. Creating a brief rating scale for the assessment of learning disabilities using reliability and true score estimates of the scale's items based on the Rasch model.

    Science.gov (United States)

    Sideridis, Georgios; Padeliadu, Susana

    2013-01-01

    The purpose of the present studies was to provide the means to create brief versions of instruments that can aid the diagnosis and classification of students with learning disabilities and comorbid disorders (e.g., attention-deficit/hyperactivity disorder). A sample of 1,108 students with and without a diagnosis of learning disabilities took part in study 1. Using information from modern theory methods (i.e., the Rasch model), a scale was created that included fewer than one third of the original battery items designed to assess reading skills. This best item synthesis was then evaluated for its predictive and criterion validity with a valid external reading battery (study 2). Using a sample of 232 students with and without learning disabilities, results indicated that the brief version of the scale was equally effective as the original scale in predicting reading achievement. Analysis of the content of the brief scale indicated that the best item synthesis involved items from cognition, motivation, strategy use, and advanced reading skills. It is suggested that multiple psychometric criteria be employed in evaluating the psychometric adequacy of scales used for the assessment and identification of learning disabilities and comorbid disorders.

  5. A Model of Batch Scheduling for a Single Batch Processor with Additional Setups to Minimize Total Inventory Holding Cost of Parts of a Single Item Requested at Multi-due-date

    Science.gov (United States)

    Hakim Halim, Abdul; Ernawati; Hidayat, Nita P. A.

    2018-03-01

    This paper deals with a model of batch scheduling for a single batch processor on which a number of parts of a single item are to be processed. The process needs two kinds of setups, i.e., main setups required before processing any batches, and additional setups required repeatedly after the batch processor completes a certain number of batches. The parts to be processed arrive at the shop floor at the times coinciding with their respective starting times of processing, and the completed parts are to be delivered at multiple due dates. The objective adopted for the model is that of minimizing total inventory holding cost consisting of holding cost per unit time for a part in completed batches, and that in in-process batches. The formulation of total inventory holding cost is derived from the so-called actual flow time defined as the interval between arrival times of parts at the production line and delivery times of the completed parts. The actual flow time satisfies not only minimum inventory but also just-in-time arrival and delivery. An algorithm to solve the model is proposed and a numerical example is shown.

  6. Reliability of a single objective measure in assessing sleepiness.

    Science.gov (United States)

    Sunwoo, Bernie Y; Jackson, Nicholas; Maislin, Greg; Gurubhagavatula, Indira; George, Charles F; Pack, Allan I

    2012-01-01

    To evaluate reliability of single objective tests in assessing sleepiness. Subjects who completed polysomnography underwent a 4-nap multiple sleep latency test (MSLT) the following day. Prior to each nap opportunity on MSLT, subjects performed the psychomotor vigilance test (PVT) and divided attention driving task (DADT). Results of single versus multiple test administrations were compared using the intraclass correlation coefficient (ICC) and adjusted for test administration order effects to explore time of day effects. Measures were explored as continuous and binary (i.e., impaired or not impaired). Community-based sample evaluated at a tertiary, university-based sleep center. 372 adult commercial vehicle operators oversampled for increased obstructive sleep apnea risk. N/A. As continuous measures, ICC were as follows: MSLT 0.45, PVT median response time 0.69, PVT number of lapses 0.51, 10-min DADT tracking error 0.87, 20-min DADT tracking error 0.90. Based on binary outcomes, ICC were: MSLT 0.63, PVT number of lapses 0.85, 10-min DADT 0.95, 20-min DADT 0.96. Statistically significant time of day effects were seen in both the MSLT and PVT but not the DADT. Correlation between ESS and different objective tests was strongest for MSLT, range [-0.270 to -0.195] and persisted across all time points. Single DADT and PVT administrations are reliable measures of sleepiness. A single MSLT administration can reasonably discriminate individuals with MSL < 8 minutes. These results support the use of a single administration of some objective tests of sleepiness when performed under controlled conditions in routine clinical care.

  7. Validity and reliability of the TED-QOL: a new three-item questionnaire to assess quality of life in thyroid eye disease.

    Science.gov (United States)

    Fayers, Tessa; Dolman, Peter J

    2011-12-01

    To develop and test a user-friendly questionnaire for rapidly assessing quality of life (QOL) in thyroid eye disease (TED). A three-item questionnaire, the TED-QOL, was designed and compared to the 16-item Graves Ophthalmopathy (GO)-QOL and the nine-item GO-Quality of Life Scale (QLS). 100 patients with TED were administered all three questionnaires on two occasions. Results were compared to clinical severity scores (Vision, Inflammation, Strabismus, Appearance (VISA) classification). Main outcomes were construct and criterion validity, test-retest reliability, duration, comprehension and completion rates. TED-QOL correlated strongly with the other questionnaires for corresponding items (Pearson correlation: appearance 0.71, 0.62; functioning 0.69, 0.66; overall QOL 0.53). Test-retest analysis demonstrated good reliability for all three questionnaires (intraclass correlations: TED-QOL 0.81, 0.74, 0.87; GO-QOL 0.81, 0.82; GO-QLS 0.74, 0.86, 0.67). TED-QOL was significantly faster to complete (1.6 min vs GO-QOL 3.1 min, GO-QLS 2.7 min, p<0.0001) and had a higher completion rate (100% vs GO-QOL 78%, GO-QLS 94%). There was only moderate correlation between items on all three questionnaires and VISA scores. The TED-QOL is rapid and easy to complete and analyse and has similar validity and reliability to longer questionnaires. All questionnaires showed only moderate correlation with disease severity, emphasising the discrepancy between objective and subjective assessments and the importance of measuring both.

  8. Exploratory factor analysis of the 12-item Functional Assessment of Chronic Illness Therapy-Spiritual Well-Being Scale in people newly diagnosed with advanced cancer.

    Science.gov (United States)

    Bai, Mei; Dixon, Jane K

    2014-01-01

    The purpose of this study was to reexamine the factor pattern of the 12-item Functional Assessment of Chronic Illness Therapy-Spiritual Well-Being Scale (FACIT-Sp-12) using exploratory factor analysis in people newly diagnosed with advanced cancer. Principal components analysis (PCA) and 3 common factor analysis methods were used to explore the factor pattern of the FACIT-Sp-12. Factorial validity was assessed in association with quality of life (QOL). Principal factor analysis (PFA), iterative PFA, and maximum likelihood suggested retrieving 3 factors: Peace, Meaning, and Faith. Both Peace and Meaning positively related to QOL, whereas only Peace uniquely contributed to QOL. This study supported the 3-factor model of the FACIT-Sp-12. Suggestions for revision of items and further validation of the identified factor pattern were provided.

  9. Generalizability theory and item response theory

    OpenAIRE

    Glas, Cornelis A.W.; Eggen, T.J.H.M.; Veldkamp, B.P.

    2012-01-01

    Item response theory is usually applied to items with a selected-response format, such as multiple choice items, whereas generalizability theory is usually applied to constructed-response tasks assessed by raters. However, in many situations, raters may use rating scales consisting of items with a selected-response format. This chapter presents a short overview of how item response theory and generalizability theory were integrated to model such assessments. Further, the precision of the esti...

  10. A study on the establishment of safety assessment guidelines of commercial grade item dedication in digitalized safety systems

    International Nuclear Information System (INIS)

    Hwang, H. S.; Kim, B. R.; Oh, S. H.

    1999-01-01

    Because components used in safety-related systems of nuclear power plants are becoming obsolete, the number of suppliers qualified under the nuclear QA program is decreasing, and maintenance costs are increasing, utilities have been considering the use of commercial grade digital computers as an alternative for resolving such issues. However, commercial digital computers use embedded pre-existing software, including operating system software, which is not developed under a nuclear grade QA program. Thus, it is necessary for utilities to establish processes for dedicating digital commercial grade items. A regulatory body also needs guidance to evaluate the digital commercial products properly. This paper surveyed the regulations and their regulatory guides, which establish the requirements for commercial grade item dedication, as well as industry standards and guidance applicable to safety-related systems. This paper provides some guidelines to be applied in evaluating the safety of digital upgrades and new digital plant protection systems in Korea.

  11. Comparing the Effects of Different Smoothing Algorithms on the Assessment of Dimensionality of Ordered Categorical Items with Parallel Analysis.

    Science.gov (United States)

    Debelak, Rudolf; Tran, Ulrich S

    2016-01-01

    The analysis of polychoric correlations via principal component analysis and exploratory factor analysis are well-known approaches to determine the dimensionality of ordered categorical items. However, the application of these approaches has been considered as critical due to the possible indefiniteness of the polychoric correlation matrix. A possible solution to this problem is the application of smoothing algorithms. This study compared the effects of three smoothing algorithms, based on the Frobenius norm, the adaption of the eigenvalues and eigenvectors, and on minimum-trace factor analysis, on the accuracy of various variations of parallel analysis by the means of a simulation study. We simulated different datasets which varied with respect to the size of the respondent sample, the size of the item set, the underlying factor model, the skewness of the response distributions and the number of response categories in each item. We found that a parallel analysis and principal component analysis of smoothed polychoric and Pearson correlations led to the most accurate results in detecting the number of major factors in simulated datasets when compared to the other methods we investigated. Of the methods used for smoothing polychoric correlation matrices, we recommend the algorithm based on minimum trace factor analysis.

  12. Connecting single-stock assessment models through correlated survival

    DEFF Research Database (Denmark)

    Albertsen, Christoffer Moesgaard; Nielsen, Anders; Thygesen, Uffe Høgsbro

    2017-01-01

    times. We propose a simple alternative. In three case studies each with two stocks, we improve the single-stock models, as measured by Akaike information criterion, by adding correlation in the cohort survival. To limit the number of parameters, the correlations are parameterized through...... the corresponding partial correlations. We consider six models where the partial correlation matrix between stocks follows a band structure ranging from independent assessments to complex correlation structures. Further, a simulation study illustrates the importance of handling correlated data sufficiently...... by investigating the coverage of confidence intervals for estimated fishing mortality. The results presented will allow managers to evaluate stock statuses based on a more accurate evaluation of model output uncertainty. The methods are directly implementable for stocks with an analytical assessment and do...

  13. Generalizability theory and item response theory

    NARCIS (Netherlands)

    Glas, Cornelis A.W.; Eggen, T.J.H.M.; Veldkamp, B.P.

    2012-01-01

    Item response theory is usually applied to items with a selected-response format, such as multiple choice items, whereas generalizability theory is usually applied to constructed-response tasks assessed by raters. However, in many situations, raters may use rating scales consisting of items with a

  14. ITEM LEVEL DIAGNOSTICS AND MODEL - DATA FIT IN ITEM ...

    African Journals Online (AJOL)

    Global Journal

    Item response theory (IRT) is a framework for modeling and analyzing item response ... data. Though, there is an argument that the evaluation of fit in IRT modeling has been ... National Council on Measurement in Education ... model data fit should be based on three types of ... prediction should be assessed through the.

  15. Item Response Data Analysis Using Stata Item Response Theory Package

    Science.gov (United States)

    Yang, Ji Seung; Zheng, Xiaying

    2018-01-01

    The purpose of this article is to introduce and review the capability and performance of the Stata item response theory (IRT) package that is available from Stata v.14, 2015. Using a simulated data set and a publicly available item response data set extracted from Programme of International Student Assessment, we review the IRT package from…

  16. Spare Items validation

    International Nuclear Information System (INIS)

    Fernandez Carratala, L.

    1998-01-01

    It is increasingly difficult to purchase safety-related spare items with manufacturer certifications that maintain the original qualifications of the equipment of destination. The main reasons are, on top of the logical evolution of technology applied to newly manufactured components, the closing of nuclear-specific production lines and the evolution of manufacturers' quality systems, originally based on nuclear codes and standards, toward conventional industry standards. To face this problem, for many years different Dedication processes have been implemented to verify whether a commercial grade element is acceptable to be used in safety-related applications. In the same way, due to our particular position regarding the spare part supplies, mainly from markets other than the American, C.N. Trillo has developed a methodology called Spare Items Validation. This methodology, which is originally based on dedication processes, is not a single process but a group of coordinated processes involving engineering, quality and management activities. These are to be performed on the spare item itself, its design control, its fabrication and its supply to allow its use in destinations with specific requirements. The scope of application is not limited to safety-related items, but also covers complex design, high cost or plant reliability related components. The implementation in C.N. Trillo has been mainly carried out by merging, modifying and making the most of processes and activities which were already being performed in the company. (Author)

  17. The Role of Content and Context in PISA Interest Scales: A study of the embedded interest items in the PISA 2006 science assessment

    Science.gov (United States)

    Drechsel, Barbara; Carstensen, Claus; Prenzel, Manfred

    2011-01-01

    This paper focuses on interest in science as one of the attitudinal aspects of scientific literacy. Large-scale data from the Programme for International Student Assessment (PISA) 2006 are analysed in order to describe student interest more precisely. So far the analyses have provided a general indicator of interest, aggregated over all contexts and contents in the science test. With its innovative approach PISA embeds interest items within the cognitive test unit and its contents and contexts. The main difference from conventional interest measures is that in most questionnaires, a relatively small number of interest items cover broad fields of contents and contexts. The science units represent a number of systematically differentiated scientific contexts and contents. The units' stimulus texts allow for concrete descriptions of relevant content aspects, applications, and contexts. In the analyses, multidimensional item response models are applied in order to disentangle student interest. The results indicate that multidimensional models fit the data. A two-dimensional model separating interest into two different knowledge of science dimensions described in the PISA science framework is further analysed with respect to gender, performance differences, and country. The findings give a comprehensive description of students' interest in science. The paper deals with methodological problems and describes requirements of the test construction for further assessments. The results are discussed with regard to their significance for science education.

  18. An Arrangement of the Items Influencing Assessment of the Electrotechnical Technology Course / PROEJA, campuses Campos Centro and Itaperuna: The Learners’ View

    Directory of Open Access Journals (Sweden)

    Jorge Luíz Clemente Gomes

    2016-04-01

    This work aims to organize pre-defined items that affect the students’ answers when assessing the Electrotechnical Technology Course / PROEJA. The research was carried out from October 2011 to December 2012 with questionnaires administered to 1st to 6th period students. At campus Campos Centro, “Technical Visits” and “Internship” presented high levels of importance and low satisfaction, while “Personal Realization” and “Professional Achievement” presented high levels of relevance and satisfaction. At campus Itaperuna, “Job opportunities” and “Professional Achievement” presented high levels of relevance and satisfaction. The items “Faculty” and “New Technologies” presented high importance but low satisfaction. The research aims at improving the quality of the course.

  19. Assessing the factor structures of the 55- and 22-item versions of the conformity to masculine norms inventory.

    Science.gov (United States)

    Owen, Jesse

    2011-03-01

    The current study examined the psychometric properties of the abbreviated versions, 55- and 22-items, of the Conformity to Masculine Norms Inventory (CMNI). The authors tested the factor structure for the 11 subscales of the CMNI-55 and the global masculinity factor for the CMNI-55 and the CMNI-22. In a clinical sample of men and women (n=522), the results supported the 11-factor model. Furthermore, the factor structure was invariant for men and women. The higher order model, which tested the utility of the global masculine score, demonstrated marginal fit. The factor structures for the global masculinity score for the CMNI-22 demonstrated poor fit. Collectively, the results suggest that the CMNI-55 is better represented in a multidimensional construct. The subscales' alpha levels and factor loadings were, generally, within acceptable limits. Gender and ethnic mean level differences are also reported. © The Author(s) 2011

  20. Why sample selection matters in exploratory factor analysis: implications for the 12-item World Health Organization Disability Assessment Schedule 2.0.

    Science.gov (United States)

    Gaskin, Cadeyrn J; Lambert, Sylvie D; Bowe, Steven J; Orellana, Liliana

    2017-03-11

    Sample selection can substantially affect the solutions generated using exploratory factor analysis. Validation studies of the 12-item World Health Organization (WHO) Disability Assessment Schedule 2.0 (WHODAS 2.0) have generally involved samples in which substantial proportions of people had no, or minimal, disability. With the WHODAS 2.0 oriented towards measuring disability across six life domains (cognition, mobility, self-care, getting along, life activities, and participation in society), performing factor analysis with samples of people with disability may be more appropriate. We determined the influence of the sampling strategy on (a) the number of factors extracted and (b) the factor structure of the WHODAS 2.0. Using data from adults aged 50+ from the six countries in Wave 1 of the WHO's longitudinal Study on global AGEing and adult health (SAGE), we repeatedly selected samples (n = 750) using two strategies: (1) simple random sampling that reproduced nationally representative distributions of WHODAS 2.0 summary scores for each country (i.e., positively skewed distributions with many zero scores indicating the absence of disability), and (2) stratified random sampling with weights designed to obtain approximately symmetric distributions of summary scores for each country (i.e., predominantly including people with varying degrees of disability). Samples with skewed distributions typically produced one-factor solutions, except for the two countries with the lowest percentages of zero scores, in which the majority of samples produced two factors. Samples with approximately symmetric distributions generally produced two- or three-factor solutions. In the two-factor solutions, the getting along domain items loaded on one factor (commonly with a cognition domain item), with remaining items loading on a second factor. In the three-factor solutions, the getting along and self-care domain items loaded separately on two factors and three other domains

  1. Does the Assessment of Recovery Capital scale reflect a single or multiple domains?

    Directory of Open Access Journals (Sweden)

    Arndt S

    2017-07-01

    Stephan Arndt,1–3 Ethan Sahker,1,4 Suzy Hedden1 1Iowa Consortium for Substance Abuse Research and Evaluation, 2Department of Psychiatry, Carver College of Medicine, 3Department of Biostatistics, College of Public Health, 4Department of Psychological and Quantitative Foundations, Counseling Psychology Program College of Education, University of Iowa, Iowa City, IA, USA Objective: The goal of this study was to determine whether the 50-item Assessment of Recovery Capital scale represents a single general measure or whether multiple domains might be psychometrically useful for research or clinical applications. Methods: Data are from a cross-sectional de-identified existing program evaluation information data set with 1,138 clients entering substance use disorder treatment. Principal components and iterated factor analysis were used on the domain scores. Multiple group factor analysis provided a quasi-confirmatory factor analysis. Results: The solution accounted for 75.24% of the total variance, suggesting that 10 factors provide a reasonably good fit. However, Tucker’s congruence coefficients between the factor structure and defining weights (0.41–0.52) suggested a poor fit to the hypothesized 10-domain structure. Principal components of the 10-domain scores yielded one factor whose eigenvalue was greater than one (5.93), accounting for 75.8% of the common variance. A few domains had perceptible but small unique variance components suggesting that a few of the domains may warrant enrichment. Conclusion: Our findings suggest that there is one general factor, with a caveat. Using the 10 measures inflates the chance for Type I errors. Using one general measure avoids this issue, is simple to interpret, and could reduce the number of items. However, those seeking to maximally predict later recovery success may need to use the full instrument and all 10 domains. Keywords: social support, psychometrics, quality of life

  2. Using Direct Behavior Rating--Single Item Scales to Assess Student Behavior within Multi-Tiered Systems of Support

    Science.gov (United States)

    Miller, Faith G.; Patwa, Shamim S.; Chafouleas, Sandra M.

    2014-01-01

    An increased emphasis on collecting and using data in schools has occurred, in part, because of the implementation of multi-tiered systems of support (MTSS). Commonly referred to as response to intervention in the academic domain and school-wide positive behavioral interventions and supports in the behavioral domain, these initiatives have a…

  3. Connecting Lines of Research on Task Model Variables, Automatic Item Generation, and Learning Progressions in Game-Based Assessment

    Science.gov (United States)

    Graf, Edith Aurora

    2014-01-01

    In "How Task Features Impact Evidence from Assessments Embedded in Simulations and Games," Almond, Kim, Velasquez, and Shute have prepared a thought-provoking piece contrasting the roles of task model variables in a traditional assessment of mathematics word problems to their roles in "Newton's Playground," a game designed…

  4. Instructional Topics in Educational Measurement (ITEMS) Module: Using Automated Processes to Generate Test Items

    Science.gov (United States)

    Gierl, Mark J.; Lai, Hollis

    2013-01-01

    Changes to the design and development of our educational assessments are resulting in the unprecedented demand for a large and continuous supply of content-specific test items. One way to address this growing demand is with automatic item generation (AIG). AIG is the process of using item models to generate test items with the aid of computer…

  5. Reliability and construct validity of the Spanish version of the 6-item CTS symptoms scale for outcomes assessment in carpal tunnel syndrome.

    Science.gov (United States)

    Rosales, Roberto S; Martin-Hidalgo, Yolanda; Reboso-Morales, Luis; Atroshi, Isam

    2016-03-03

    The purpose of this study was to assess the reliability and construct validity of the Spanish version of the 6-item carpal tunnel syndrome (CTS) symptoms scale (CTS-6). In this cross-sectional study, 40 patients diagnosed with CTS based on clinical and neurophysiologic criteria completed the standard Spanish versions of the CTS-6 and the disabilities of the arm, shoulder and hand (QuickDASH) scales on two occasions with a 1-week interval. Internal-consistency reliability was assessed with the Cronbach alpha coefficient and test-retest reliability with the intraclass correlation coefficient, two-way random effects model and absolute agreement definition (ICC2,1). Cross-sectional precision was analyzed with the Standard Error of the Measurement (SEM). Longitudinal precision for the test-retest reliability coefficient was assessed with the Standard Error of the Measurement difference (SEMdiff) and the Minimal Detectable Change at 95 % confidence level (MDC95). For assessing construct validity it was hypothesized that the CTS-6 would have a strong positive correlation with the QuickDASH, analyzed with the Pearson correlation coefficient (r). The standard Spanish version of the CTS-6 presented a Cronbach alpha of 0.81 with a SEM of 0.3. Test-retest reliability showed an ICC of 0.85 with a SEMdiff of 0.36 and a MDC95 of 0.7. The correlation between CTS-6 and the QuickDASH was concordant with the a priori formulated construct hypothesis (r = 0.69). Conclusions: The standard Spanish version of the 6-item CTS symptoms scale showed good internal consistency, test-retest reliability and construct validity for outcomes assessment in CTS. The CTS-6 will be useful to clinicians and researchers in Spanish speaking parts of the world. The use of standardized outcome measures across countries also will facilitate comparison of research results in carpal tunnel syndrome.
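
    The precision statistics named in the abstract above are usually computed with a few short formulas. The sketch below uses the conventional definitions (SEM = SD × sqrt(1 − reliability), SEMdiff = sqrt(2) × SEM, MDC95 = 1.96 × SEMdiff); the abstract does not state which formulas the authors applied, so treating these as their method is an assumption, and the function names are illustrative.

```python
import math


def sem_from_reliability(sd, reliability):
    """Standard error of measurement from a score SD and a reliability coefficient."""
    return sd * math.sqrt(1.0 - reliability)


def sem_diff(sem):
    """Standard error of the difference between two measurements."""
    return math.sqrt(2.0) * sem


def mdc95(sem_difference):
    """Minimal detectable change at the 95% confidence level."""
    return 1.96 * sem_difference


# Using the SEMdiff reported in the abstract (0.36):
reported_mdc = mdc95(0.36)  # 0.7056, which rounds to the reported 0.7
```

    Under these conventional definitions the reported SEMdiff of 0.36 reproduces the reported MDC95 of 0.7 after rounding.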

  6. Addressing challenges in single species assessments via a simple state-space assessment model

    DEFF Research Database (Denmark)

    Nielsen, Anders

    Single-species and age-structured fish stock assessments still remain the main tool for managing fish stocks. A simple state-space assessment model is presented as an alternative to (semi) deterministic procedures and the full parametric statistical catch at age models. It offers a solution...... to some of the key challenges of these models. Compared to the deterministic procedures it solves a list of problems originating from falsely assuming that age classified catches are known without errors and allows quantification of uncertainties of estimated quantities of interest. Compared to full...

  7. Validation of the Single-Factor Model of the Relationship Assessment Scale among Married and Cohabiting Persons from Monterrey, Mexico

    Directory of Open Access Journals (Sweden)

    José Moral de la Rubia

    2015-07-01

    The study of intimate partner relationships is particularly important because this union is the foundation of the family. Satisfaction with the relationship can be defined as the overall attitude to the relationship and the partner. Hendrick's Relationship Assessment Scale (RAS) is an instrument commonly used to assess the construct. Previous research papers have shown that this scale has high internal consistency and a single-factor structure. Although there are validation studies of the RAS, these studies used inappropriate statistical techniques to analyze its Likert-type items and to determine the number of factors; likewise, its factor invariance across sex has not been previously tested. Therefore, this study posed the following research questions: Does the RAS have consistent and discriminating items? Basing the analysis on a polychoric correlation matrix, what is its level of internal consistency? How many factors emerge using rigorous empirical methods? Is the single-factor model invariant across sex? In order to answer these research questions, we used random route probability sampling in this instrument validation study of the RAS. The sample was drawn from the population of married couples or those living in consensual union in Monterrey, Mexico. There were 431 female and 376 male participants in the study. The RAS items were consistent and discriminative. The internal consistency of the scale was excellent in the whole sample (ordinal α = .93), as well as among female (ordinal α = .94) and male participants (ordinal α = .92). Horn's parallel analysis and Velicer's minimum average partial test suggested a one-factor solution. Moreover, the single-factor model (with one correlation between the residuals of the two negatively worded items) had a close fit to the data, and its properties of invariance across sex were very acceptable by the Unweighted Least Squares method. We conclude that the scale shows internal

  8. A comparative study on assessment procedures and metric properties of two scoring systems of the Coma Recovery Scale-Revised items: standard and modified scores.

    Science.gov (United States)

    Sattin, Davide; Lovaglio, Piergiorgio; Brenna, Greta; Covelli, Venusia; Rossi Sebastiano, Davide; Duran, Dunja; Minati, Ludovico; Giovannetti, Ambra Mara; Rosazza, Cristina; Bersano, Anna; Nigri, Anna; Ferraro, Stefania; Leonardi, Matilde

    2017-09-01

    The study compared the metric characteristics (discriminant capacity and factorial structure) of two different methods for scoring the items of the Coma Recovery Scale-Revised and it analysed scale scores collected using the standard assessment procedure and a new proposed method. Cross sectional design/methodological study. Inpatient, neurological unit. A total of 153 patients with disorders of consciousness were consecutively enrolled between 2011 and 2013. All patients were assessed with the Coma Recovery Scale-Revised using standard (rater 1) and inverted (rater 2) procedures. Coma Recovery Scale-Revised score, number of cognitive and reflex behaviours and diagnosis. Regarding patient assessment, rater 1 using standard and rater 2 using inverted procedures obtained the same best scores for each subscale of the Coma Recovery Scale-Revised for all patients, so no clinical (and statistical) difference was found between the two procedures. In 11 patients (7.7%), rater 2 noted that some Coma Recovery Scale-Revised codified behavioural responses were not found during assessment, although higher response categories were present. A total of 51 (36%) patients presented the same Coma Recovery Scale-Revised scores of 7 or 8 using a standard score, whereas no overlap was found using the modified score. Unidimensionality was confirmed for both score systems. The Coma Recovery Scale Modified Score showed a higher discriminant capacity than the standard score and a monofactorial structure was also supported. The inverted assessment procedure could be a useful evaluation method for the assessment of patients with disorder of consciousness diagnosis.

  9. Exploring Different Types of Assessment Items to Measure Linguistically Diverse Students' Understanding of Energy and Matter in Chemistry

    Science.gov (United States)

    Ryoo, Kihyun; Toutkoushian, Emily; Bedell, Kristin

    2018-01-01

    Energy and matter are fundamental, yet challenging concepts in middle school chemistry due to their abstract, unobservable nature. Although it is important for science teachers to elicit a range of students' ideas to design and revise their instruction, capturing such varied ideas using traditional assessments consisting of multiple-choice items…

  10. The Meaning of Goodness-of-Fit Tests: Commentary on "Goodness-of-Fit Assessment of Item Response Theory Models"

    Science.gov (United States)

    Thissen, David

    2013-01-01

    In this commentary, David Thissen states that "Goodness-of-fit assessment for IRT models is maturing; it has come a long way from zero." Thissen then references prior works on "goodness of fit" in the index of Lord and Novick's (1968) classic text; Yen (1984); Drasgow, Levine, Tsien, Williams, and Mead (1995); Chen and…

  11. Using automatic item generation to create multiple-choice test items.

    Science.gov (United States)

    Gierl, Mark J; Lai, Hollis; Turner, Simon R

    2012-08-01

    Many tests of medical knowledge, from the undergraduate level to the level of certification and licensure, contain multiple-choice items. Although these are efficient in measuring examinees' knowledge and skills across diverse content areas, multiple-choice items are time-consuming and expensive to create. Changes in student assessment brought about by new forms of computer-based testing have created the demand for large numbers of multiple-choice items. Our current approaches to item development cannot meet this demand. We present a methodology for developing multiple-choice items based on automatic item generation (AIG) concepts and procedures. We describe a three-stage approach to AIG and we illustrate this approach by generating multiple-choice items for a medical licensure test in the content area of surgery. To generate multiple-choice items, our method requires a three-stage process. Firstly, a cognitive model is created by content specialists. Secondly, item models are developed using the content from the cognitive model. Thirdly, items are generated from the item models using computer software. Using this methodology, we generated 1248 multiple-choice items from one item model. Automatic item generation is a process that involves using models to generate items using computer technology. With our method, content specialists identify and structure the content for the test items, and computer technology systematically combines the content to generate new test items. By combining these outcomes, items can be generated automatically. © Blackwell Publishing Ltd 2012.
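
    The third stage described above, where software systematically combines content from an item model, can be sketched as a template filled with every combination of its variable values. This is an illustrative sketch only: the template wording, variable names, and placeholder values are hypothetical and are not the authors' item model or software.

```python
from itertools import product


def generate_items(stem_template, variables):
    """Fill a stem template with every combination of the variable values."""
    names = list(variables)
    items = []
    for combination in product(*(variables[name] for name in names)):
        items.append(stem_template.format(**dict(zip(names, combination))))
    return items


# Hypothetical item model: a stem with three variable slots.
item_model = (
    "A {age}-year-old patient presents with {finding} after {event}. "
    "What is the best next step?"
)
# Hypothetical content that specialists would supply in a real workflow.
content = {
    "age": [25, 60],
    "finding": ["finding A", "finding B", "finding C"],
    "event": ["event X", "event Y"],
}

generated = generate_items(item_model, content)
# 2 ages x 3 findings x 2 events = 12 generated stems
```

    The number of generated items is the product of the number of values per variable, which is how a single item model can yield a large item pool. A practical generator would also need constraints to exclude implausible combinations and logic for answer options, both omitted here.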

  12. The comparability of English, French and Dutch scores on the Functional Assessment of Chronic Illness Therapy-Fatigue (FACIT-F): an assessment of differential item functioning in patients with systemic sclerosis.

    Directory of Open Access Journals (Sweden)

    Linda Kwakkenbos

    The Functional Assessment of Chronic Illness Therapy-Fatigue (FACIT-F) is commonly used to assess fatigue in rheumatic diseases, and has been shown to discriminate better across levels of the fatigue spectrum than other commonly used measures. The aim of this study was to assess the cross-language measurement equivalence of the English, French, and Dutch versions of the FACIT-F in systemic sclerosis (SSc) patients. The FACIT-F was completed by 871 English-speaking Canadian, 238 French-speaking Canadian and 230 Dutch SSc patients. Confirmatory factor analysis was used to assess the factor structure in the three samples. The Multiple-Indicator Multiple-Cause (MIMIC) model was utilized to assess differential item functioning (DIF), comparing English versus French and versus Dutch patient responses separately. A unidimensional factor model showed good fit in all samples. Comparing French versus English patients, statistically significant, but small-magnitude DIF was found for 3 of 13 items. French patients had 0.04 of a standard deviation (SD) lower latent fatigue scores than English patients and there was an increase of only 0.03 SD after accounting for DIF. For the Dutch versus English comparison, 4 items showed small, but statistically significant, DIF. Dutch patients had 0.20 SD lower latent fatigue scores than English patients. After correcting for DIF, there was a reduction of 0.16 SD in this difference. There was statistically significant DIF in several items, but the overall effect on fatigue scores was minimal. English, French and Dutch versions of the FACIT-F can be reasonably treated as having equivalent scoring metrics.

  13. The Comparability of English, French and Dutch Scores on the Functional Assessment of Chronic Illness Therapy-Fatigue (FACIT-F): An Assessment of Differential Item Functioning in Patients with Systemic Sclerosis

    Science.gov (United States)

    Kwakkenbos, Linda; Willems, Linda M.; Baron, Murray; Hudson, Marie; Cella, David; van den Ende, Cornelia H. M.; Thombs, Brett D.

    2014-01-01

    Objective The Functional Assessment of Chronic Illness Therapy-Fatigue (FACIT-F) is commonly used to assess fatigue in rheumatic diseases, and has been shown to discriminate better across levels of the fatigue spectrum than other commonly used measures. The aim of this study was to assess the cross-language measurement equivalence of the English, French, and Dutch versions of the FACIT-F in systemic sclerosis (SSc) patients. Methods The FACIT-F was completed by 871 English-speaking Canadian, 238 French-speaking Canadian and 230 Dutch SSc patients. Confirmatory factor analysis was used to assess the factor structure in the three samples. The Multiple-Indicator Multiple-Cause (MIMIC) model was utilized to assess differential item functioning (DIF), comparing English versus French and versus Dutch patient responses separately. Results A unidimensional factor model showed good fit in all samples. Comparing French versus English patients, statistically significant, but small-magnitude DIF was found for 3 of 13 items. French patients had 0.04 of a standard deviation (SD) lower latent fatigue scores than English patients and there was an increase of only 0.03 SD after accounting for DIF. For the Dutch versus English comparison, 4 items showed small, but statistically significant, DIF. Dutch patients had 0.20 SD lower latent fatigue scores than English patients. After correcting for DIF, there was a reduction of 0.16 SD in this difference. Conclusions There was statistically significant DIF in several items, but the overall effect on fatigue scores was minimal. English, French and Dutch versions of the FACIT-F can be reasonably treated as having equivalent scoring metrics. PMID:24638101

  14. Can cancer patients assess the influence of pain on functions? A randomised, controlled study of the pain interference items in the Brief Pain Inventory

    Directory of Open Access Journals (Sweden)

    Kaasa Stein

    2007-03-01

    Background The Brief Pain Inventory (BPI) is recommended as a pain measurement tool by the Expert Working Group of the European Association of Palliative Care. The BPI is designed to assess both pain severity and interference with functions caused by pain. The purpose of this study was to investigate whether pain interference items are influenced by factors other than pain. Methods We asked adult cancer patients to complete the original and a revised BPI on two study days. In the original version of the BPI the patients were asked how, during the last 24 hours, pain has interfered with functions. In the revised BPI this question was changed to how, during the last 24 hours, these functions are affected in general. Health-related quality of life was assessed on both study days using the European Organization for Research and Treatment of Cancer quality of life questionnaire. Results Forty-eight of the 55 included patients completed both assessments. The BPI pain intensity scores and the health-related quality of life scores were similar on the two study days. Except for mood, this study observed no significant differences between the patients' BPI interference item scores in the original (pain influence on function) and the revised BPI (function in general). Seventeen patients reported higher influence from pain on functions than the total influence on function from all causes. Conclusion We observed similar scores in the original BPI interference scores (pain influence on function) compared with the revised BPI interference scores (decreased function in general). This finding might imply that the BPI interference scale measures are partly responded to as more of a global interference measure.

  15. Assessment of free and cued recall in Alzheimer's disease and vascular and frontotemporal dementia with 24-item Grober and Buschke test.

    Science.gov (United States)

    Cerciello, Milena; Isella, Valeria; Proserpi, Alice; Papagno, Costanza

    2017-01-01

    Alzheimer's disease (AD), vascular dementia (VaD) and frontotemporal dementia (FTD) are the most common forms of dementia. It is well known that memory deficits in AD are different from those in VaD and FTD, especially with respect to cued recall. The aim of this clinical study was to compare the memory performance in 15 AD, 10 VaD and 9 FTD patients and 20 normal controls by means of a 24-item Grober-Buschke test [8]. The patients' groups were comparable in terms of severity of dementia. We considered free and total recall (free plus cued) both in immediate and delayed recall and computed an Index of Sensitivity to Cueing (ISC) [8] for immediate and delayed trials. We assessed whether cued recall predicted the subsequent free recall across our patients' groups. We found that AD patients recalled fewer items from the beginning and were less sensitive to cueing supporting the hypothesis that memory disorders in AD depend on encoding and storage deficit. In immediate recall VaD and FTD showed a similar memory performance and a stronger sensitivity to cueing than AD, suggesting that memory disorders in these patients are due to a difficulty in spontaneously implementing efficient retrieval strategies. However, we found a lower ISC in the delayed recall compared to the immediate trials in VaD than FTD due to a higher forgetting in VaD.

  16. Psychometric assessment of the Adult-Adolescent Parenting Inventory in a sample of low-income single mothers.

    Science.gov (United States)

    Lutenbacher, M

    2001-01-01

    The Adult-Adolescent Parenting Inventory (AAPI) is a 32-item inventory widely used to identify adolescents and adults at risk for inadequate parenting behaviors. It includes four subscales representing the most frequent patterns associated with abusive parenting: (a) Inappropriate Expectations; (b) Lack of Empathy; (c) Parental Value of Corporal Punishment; and (d) Parent-Child Role Reversal. Although it has been used in a variety of samples, the psychometric properties of the AAPI have not been examined in low-income single mothers. The purposes of this study were to: (a) examine the reliability and validity of the Adult-Adolescent Parenting Inventory (AAPI) in a sample of 206 low-income single mothers; (b) assess the mother's risk for inadequate parenting by comparing their AAPI subscale scores with normative subscale scores on the AAPI; (c) assess the construct validity of the AAPI by testing the hypothesis that mothers with lower AAPI scores have a higher level of depressive symptoms and lower self-esteem in comparison to mothers with higher AAPI scores; and (d) determine whether the 4-factor structure proposed by Bavolek (1984) could be replicated. AAPI scores indicated these mothers were at high risk for child abuse when compared with normative data for parents with no known history of abuse. Higher risk for abusive parenting was associated with a higher level of depressive symptoms, less education, and unemployment. The subscales, Inappropriate Expectations and Parental Value of Corporal Punishment demonstrated poor internal consistency with Cronbach's alphas of .40 and .54, respectively. Hypothesis testing supported the construct validity of the AAPI. Bavolek's 4-factor structure was not supported. A 19-item modified version of the AAPI with three dimensions was identified. This modified version of the AAPI may provide a more efficacious tool for use with low-income single mothers.
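
    The internal-consistency figures quoted above are Cronbach's alpha values. As a minimal sketch of the conventional formula, alpha = k/(k − 1) × (1 − sum of item variances / variance of total scores), where k is the number of items. The function name and the toy response matrix are illustrative assumptions, not data from the study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    n_items = len(scores[0])
    if n_items < 2:
        raise ValueError("alpha needs at least two items")
    # Sample variance of each item across respondents.
    item_variances = [
        variance([row[i] for row in scores]) for i in range(n_items)
    ]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")
    return n_items / (n_items - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical responses: 4 respondents x 3 items (not study data).
toy_scores = [
    [1, 2, 1],
    [2, 2, 3],
    [3, 4, 3],
    [4, 5, 5],
]
alpha = cronbach_alpha(toy_scores)
```

    Alpha rises when items covary strongly relative to their individual variances, so low values such as the .40 and .54 reported for two AAPI subscales suggest that the items in those subscales do not move together.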

  17. TEDS-M 2008 User Guide for the International Database. Supplement 4: TEDS-M Released Mathematics and Mathematics Pedagogy Knowledge Assessment Items

    Science.gov (United States)

    Brese, Falk, Ed.

    2012-01-01

    The goal for selecting the released set of test items was to have approximately 25% of each of the full item sets for mathematics content knowledge (MCK) and mathematics pedagogical content knowledge (MPCK) that would represent the full range of difficulty, content, and item format used in the TEDS-M study. The initial step in the selection was to…

  18. Coeducational or Single-Sex Schools? A Review of the Literature. New Zealand Council for Educational Research, Set 76, Number 1 Item 9.

    Science.gov (United States)

    Irving, James

    This article is part of an informational kit for teachers published by the New Zealand Council for Educational Research. The focus of this article is on the advantages and disadvantages of co-educational and single-sex secondary schools as discussed in research efforts from England and New Zealand. (JLL)

  19. Correlation between the pain numeric rating scale and the 12-item WHO Disability Assessment Schedule 2.0 in patients with musculoskeletal pain.

    Science.gov (United States)

    Saltychev, Mikhail; Bärlund, Esa; Laimi, Katri

    2018-03-01

    The aim of this study was to assess the correlation between pain severity measured on a numeric rating scale and restrictions of functioning measured with the WHO Disability Assessment Schedule (WHODAS 2.0). This was a cross-sectional study of 1207 patients with musculoskeletal pain conditions. Correlation was assessed using Spearman's and Pearson tests. Although all the Spearman's rank correlations between WHODAS 2.0 items and pain severity were statistically significant, they were mostly weak, with only a few moderate associations for 'S2 household responsibilities', 'S8 washing', 'S9 dressing', and 'S12 day-to-day work'. The correlation between the WHODAS 2.0 total score and pain severity was also moderate: 0.41 [95% confidence interval (CI): 0.36-0.45] for average pain and 0.42 (95% CI: 0.37-0.46) for worst pain. The correlation between the WHODAS 2.0 total score and pain level was also assessed using Pearson's product-moment correlation, yielding figures that were similar to Spearman's correlation: 0.42 (Pcorrelation between pain severity measured by numeric rating scale and functioning level measured by WHODAS 2.0 was weak to moderate, with slightly stronger associations in physical domains of functioning.

  20. Item response theory analyses of the Delis-Kaplan Executive Function System card sorting subtest.

    Science.gov (United States)

    Spencer, Mercedes; Cho, Sun-Joo; Cutting, Laurie E

    2018-02-02

    In the current study, we examined the dimensionality of the 16-item Card Sorting subtest of the Delis-Kaplan Executive Functioning System assessment in a sample of 264 native English-speaking children between the ages of 9 and 15 years. We also tested for measurement invariance for these items across age and gender groups using item response theory (IRT). Results of the exploratory factor analysis indicated that a two-factor model that distinguished between verbal and perceptual items provided the best fit to the data. Although the items demonstrated measurement invariance across age groups, measurement invariance was violated for gender groups, with two items demonstrating differential item functioning for males and females. Multigroup analysis using all 16 items indicated that the items were more effective for individuals whose IRT scale scores were relatively high. A single-group explanatory IRT model using 14 non-differential item functioning items showed that for perceptual ability, females scored higher than males and that scores increased with age for both males and females; for verbal ability, the observed increase in scores across age differed for males and females. The implications of these findings are discussed.

  1. An NCME Instructional Module on Polytomous Item Response Theory Models

    Science.gov (United States)

    Penfield, Randall David

    2014-01-01

    A polytomous item is one for which the responses are scored according to three or more categories. Given the increasing use of polytomous items in assessment practices, item response theory (IRT) models specialized for polytomous items are becoming increasingly common. The purpose of this ITEMS module is to provide an accessible overview of…

  2. What Do You Think You Are Measuring? A Mixed-Methods Procedure for Assessing the Content Validity of Test Items and Theory-Based Scaling

    Science.gov (United States)

    Koller, Ingrid; Levenson, Michael R.; Glück, Judith

    2017-01-01

    The valid measurement of latent constructs is crucial for psychological research. Here, we present a mixed-methods procedure for improving the precision of construct definitions, determining the content validity of items, evaluating the representativeness of items for the target construct, generating test items, and analyzing items on a theoretical basis. To illustrate the mixed-methods content-scaling-structure (CSS) procedure, we analyze the Adult Self-Transcendence Inventory, a self-report measure of wisdom (ASTI, Levenson et al., 2005). A content-validity analysis of the ASTI items was used as the basis of psychometric analyses using multidimensional item response models (N = 1215). We found that the new procedure produced important suggestions concerning five subdimensions of the ASTI that were not identifiable using exploratory methods. The study shows that the application of the suggested procedure leads to a deeper understanding of latent constructs. It also demonstrates the advantages of theory-based item analysis. PMID:28270777

  3. Methodological quality of diagnostic accuracy studies on non-invasive coronary CT angiography: influence of QUADAS (Quality Assessment of Diagnostic Accuracy Studies included in systematic reviews) items on sensitivity and specificity

    Energy Technology Data Exchange (ETDEWEB)

    Schueler, Sabine; Walther, Stefan; Schuetz, Georg M. [Humboldt-Universitaet zu Berlin, Freie Universitaet Berlin, Charite Medical School, Department of Radiology, Berlin (Germany); Schlattmann, Peter [University Hospital of Friedrich Schiller University Jena, Department of Medical Statistics, Informatics, and Documentation, Jena (Germany); Dewey, Marc [Humboldt-Universitaet zu Berlin, Freie Universitaet Berlin, Charite Medical School, Department of Radiology, Berlin (Germany); Charite, Institut fuer Radiologie, Berlin (Germany)

    2013-06-15

    To evaluate the methodological quality of diagnostic accuracy studies on coronary computed tomography (CT) angiography using the QUADAS (Quality Assessment of Diagnostic Accuracy Studies included in systematic reviews) tool. Each QUADAS item was individually defined to adapt it to the special requirements of studies on coronary CT angiography. Two independent investigators analysed 118 studies using 12 QUADAS items. Meta-regression and pooled analyses were performed to identify possible effects of methodological quality items on estimates of diagnostic accuracy. The overall methodological quality of coronary CT studies was merely moderate. They fulfilled a median of 7.5 out of 12 items. Only 9 of the 118 studies fulfilled more than 75 % of possible QUADAS items. One QUADAS item ("Uninterpretable Results") showed a significant influence (P = 0.02) on estimates of diagnostic accuracy with "no fulfilment" increasing specificity from 86 to 90 %. Furthermore, pooled analysis revealed that each QUADAS item that is not fulfilled has the potential to change estimates of diagnostic accuracy. The methodological quality of studies investigating the diagnostic accuracy of non-invasive coronary CT is only moderate and was found to affect the sensitivity and specificity. An improvement is highly desirable because good methodology is crucial for adequately assessing imaging technologies. (orig.)

  4. Methodological quality of diagnostic accuracy studies on non-invasive coronary CT angiography: influence of QUADAS (Quality Assessment of Diagnostic Accuracy Studies included in systematic reviews) items on sensitivity and specificity

    International Nuclear Information System (INIS)

    Schueler, Sabine; Walther, Stefan; Schuetz, Georg M.; Schlattmann, Peter; Dewey, Marc

    2013-01-01

    To evaluate the methodological quality of diagnostic accuracy studies on coronary computed tomography (CT) angiography using the QUADAS (Quality Assessment of Diagnostic Accuracy Studies included in systematic reviews) tool. Each QUADAS item was individually defined to adapt it to the special requirements of studies on coronary CT angiography. Two independent investigators analysed 118 studies using 12 QUADAS items. Meta-regression and pooled analyses were performed to identify possible effects of methodological quality items on estimates of diagnostic accuracy. The overall methodological quality of coronary CT studies was merely moderate. They fulfilled a median of 7.5 out of 12 items. Only 9 of the 118 studies fulfilled more than 75 % of possible QUADAS items. One QUADAS item ("Uninterpretable Results") showed a significant influence (P = 0.02) on estimates of diagnostic accuracy with "no fulfilment" increasing specificity from 86 to 90 %. Furthermore, pooled analysis revealed that each QUADAS item that is not fulfilled has the potential to change estimates of diagnostic accuracy. The methodological quality of studies investigating the diagnostic accuracy of non-invasive coronary CT is only moderate and was found to affect the sensitivity and specificity. An improvement is highly desirable because good methodology is crucial for adequately assessing imaging technologies. (orig.)

  5. Right ventricular function assessment in single LAD lesion patients ...

    African Journals Online (AJOL)

    Rania Gaber

    2015-10-09

    Oct 9, 2015 ... Doppler method in patients with single LAD lesion. Methods: The patient group was ... Results: The right ventricular tissue Doppler parameters (Sm, E, A, E/A ratio, IVA, E/E00) of the patients group were significantly .... cardial interaction effect of tethered LV anterior myocardium. Mittal et al. reported that Left ...

  6. Dynamics Assessment of Advanced Single-Phase PLL Structures

    DEFF Research Database (Denmark)

    Golestan, Saeed; Monfarad, Mohammad; Freijedo, Francisco D.

    2013-01-01

    Recently, several advanced phase locked loop (PLL) techniques have been proposed for single-phase applications. Among these, the Park-PLL, and the second order generalized integrator (SOGI) based PLL are very attractive, owing to their simple digital implementation, low computational burden...

  7. Assessing T cell differentiation at the single-cell level

    NARCIS (Netherlands)

    Gerlach, Carmen

    2012-01-01

    This thesis describes the development and use of a novel technology for single-cell fate mapping, called cellular barcoding. With this technology, unique and heritable genetic tags (barcodes) are introduced into naïve T cells. Using cellular barcoding, we investigated I) how different

  8. Cost assessment of instruments for single-incision laparoscopic cholecystectomy

    DEFF Research Database (Denmark)

    Henriksen, Nadia A; Al-Tayar, Haytham; Rosenberg, Jacob

    2012-01-01

    Specially designed surgical instruments have been developed for single-incision laparoscopic surgery, but high instrument costs may impede the implementation of these procedures. The aim of this study was to compare the cost of operative implements used for elective cholecystectomy performed...

  9. Item-saving assessment of self-care performance in children with developmental disabilities: A prospective caregiver-report computerized adaptive test

    Science.gov (United States)

    Chen, Cheng-Te; Chen, Yu-Lan; Lin, Yu-Ching; Hsieh, Ching-Lin; Tzeng, Jeng-Yi

    2018-01-01

    Objective The purpose of this study was to construct a computerized adaptive test (CAT) for measuring self-care performance (the CAT-SC) in children with developmental disabilities (DD) aged from 6 months to 12 years in a content-inclusive, precise, and efficient fashion. Methods The study was divided into 3 phases: (1) item bank development, (2) item testing, and (3) a simulation study to determine the stopping rules for the administration of the CAT-SC. A total of 215 caregivers of children with DD were interviewed with the 73-item CAT-SC item bank. An item response theory model was adopted for examining the construct validity to estimate item parameters after investigation of the unidimensionality, equality of slope parameters, item fitness, and differential item functioning (DIF). In the last phase, the reliability and concurrent validity of the CAT-SC were evaluated. Results The final CAT-SC item bank contained 56 items. The stopping rules suggested were (a) reliability coefficient greater than 0.9 or (b) 14 items administered. The results of simulation also showed that 85% of the estimated self-care performance scores would reach a reliability higher than 0.9 with a mean test length of 8.5 items, and the mean reliability for the rest was 0.86. Administering the CAT-SC could reduce the number of items administered by 75% to 84%. In addition, self-care performances estimated by the CAT-SC and the full item bank were very similar to each other (Pearson r = 0.98). Conclusion The newly developed CAT-SC can efficiently measure self-care performance in children with DD whose performances are comparable to those of TD children aged from 6 months to 12 years as precisely as the whole item bank. The item bank of the CAT-SC has good reliability and a unidimensional self-care construct, and the CAT can estimate self-care performance with less than 25% of the items in the item bank. Therefore, the CAT-SC could be useful for measuring self-care performance in children with
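
    The two stopping rules reported for the CAT-SC (reliability greater than 0.9, or 14 items administered) can be sketched as a simple loop condition. This is a minimal illustration, not the authors' implementation: the function names and the idea of passing in a precomputed sequence of reliability estimates are assumptions made for the example.

```python
def should_stop(reliability: float, items_administered: int,
                min_reliability: float = 0.9, max_items: int = 14) -> bool:
    """Return True when either stopping rule quoted in the abstract is met."""
    return reliability > min_reliability or items_administered >= max_items


def run_cat(reliability_after_each_item) -> int:
    """Administer items until a stopping rule fires and return the test length.

    `reliability_after_each_item` is a hypothetical sequence holding the
    reliability estimate available after item 1, 2, 3, ... A real CAT would
    re-estimate the trait and its precision after every response instead.
    """
    administered = 0
    for administered, reliability in enumerate(reliability_after_each_item, start=1):
        if should_stop(reliability, administered):
            break
    return administered
```

    With estimates that cross 0.9 at the fourth item the sketch stops after four items; with estimates that never cross 0.9 it stops at the 14-item cap.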

  10. Rats Remember Items in Context Using Episodic Memory.

    Science.gov (United States)

    Panoz-Brown, Danielle; Corbin, Hannah E; Dalecki, Stefan J; Gentry, Meredith; Brotheridge, Sydney; Sluka, Christina M; Wu, Jie-En; Crystal, Jonathon D

    2016-10-24

    Vivid episodic memories in people have been characterized as the replay of unique events in sequential order [1-3]. Animal models of episodic memory have successfully documented episodic memory of a single event (e.g., [4-8]). However, a fundamental feature of episodic memory in people is that it involves multiple events, and notably, episodic memory impairments in human diseases are not limited to a single event. Critically, it is not known whether animals remember many unique events using episodic memory. Here, we show that rats remember many unique events and the contexts in which the events occurred using episodic memory. We used an olfactory memory assessment in which new (but not old) odors were rewarded using 32 items. Rats were presented with 16 odors in one context and the same odors in a second context. To attain high accuracy, the rats needed to remember item in context because each odor was rewarded as a new item in each context. The demands on item-in-context memory were varied by assessing memory with 2, 3, 5, or 15 unpredictable transitions between contexts, and item-in-context memory survived a 45 min retention interval challenge. When the memory of item in context was put in conflict with non-episodic familiarity cues, rats relied on item in context using episodic memory. Our findings suggest that rats remember multiple unique events and the contexts in which these events occurred using episodic memory and support the view that rats may be used to model fundamental aspects of human cognition. Copyright © 2016 Elsevier Ltd. All rights reserved.

  11. Assessment of the Tensile Properties for Single Fibers

    Science.gov (United States)

    2018-02-01

    7. Conclusions: A method for accurately characterizing the tensile material properties of single fibers...

  12. Differential items functioning to assess aggressiveness in college students / Funcionamento diferencial de itens para avaliar a agressividade de universitários

    Directory of Open Access Journals (Sweden)

    Fermino Fernandes Sisto

    2008-01-01

    In this research, evidence of construct validity was sought by analyzing differential item functioning related to aggressiveness. The participants were 445 college students of both genders attending courses in Engineering, Computing, and Psychology. The aggressiveness scale, composed of 81 items, was administered collectively in the classroom to the students who consented to participate in the study. The items of the instrument were studied by means of the Rasch model. Twenty-eight items showed differential item functioning: 15 were characterized as typical for females and 13 for males. The reliability coefficients were 0.99 for the items and 0.86 for the persons. It was concluded that aggressiveness can be measured separately on the basis of gender.

  13. Surveying Assessment in Experiential Learning: A Single Campus Study

    Directory of Open Access Journals (Sweden)

    Thomas Yates

    2015-12-01

    The purpose of this study was to determine the methods of experiential assessment in use at a Canadian university and the extent to which they are used. Exploring experiential assessment will allow identification of commonly used methods and facilitate the development of best practices of assessment in the context of experiential learning (EL) at an institutional level. The origins of EL are found in the work of Dewey (1938), later modified by Kolb and Fry (1975). Experiential methods include experiential education, service learning, problem-based learning, and others such as action learning, enquiry-based learning, and case studies. Faculty currently involved in EL at the participating university were invited to complete an online survey about their teaching and assessment methods. This paper will share the results and analysis of the EL inventory survey.

  14. Reducing the item number to obtain the same-length self-assessment scales: a systematic approach using result of graphical loglinear rasch models

    DEFF Research Database (Denmark)

    Nielsen, Tine; Kreiner, Svend

    2011-01-01

    The Revised Danish Learning Styles Inventory (R-D-LSI) (Nielsen 2005), which is an adaptation of the Sternberg-Wagner Thinking Styles Inventory (Sternberg, 1997), comprises 14 subscales, each measuring a separate learning style. Of these 14 subscales, 9 are eight items long and 5 are seven items long...... Inventory (D-SA-LSI) comprising 14 subscales each with an item length of seven. The systematic approach to item reduction based on results of GLLRM will be presented and exemplified by its application to the R-D-LSI....

  15. Calibration of Automatically Generated Items Using Bayesian Hierarchical Modeling.

    Science.gov (United States)

    Johnson, Matthew S.; Sinharay, Sandip

    For complex educational assessments, there is an increasing use of "item families," which are groups of related items. However, calibration or scoring for such an assessment requires fitting models that take into account the dependence structure inherent among the items that belong to the same item family. C. Glas and W. van der Linden…

  16. Assessing the test-retest repeatability of the Vietnamese version of the National Eye Institute 25-item Visual Function Questionnaire among bilateral cataract patients for a Vietnamese population.

    Science.gov (United States)

    To, Kien Gia; Meuleners, Lynn; Chen, Huei-Yang; Lee, Andy; Do, Dung Van; Duong, Dat Van; Phi, Tien Duy; Tran, Hoang Huy; Nguyen, Nguyen Do

    2014-06-01

    To determine the test-retest repeatability of the National Eye Institute 25-item Visual Function Questionnaire (NEI VFQ-25) for use with older Vietnamese adults with bilateral cataract. The questionnaire was translated into Vietnamese and back-translated into English by two independent translators. Patients with bilateral cataract aged 50 and older completed the questionnaire on two separate occasions, one to two weeks after first administration of the questionnaire. Test-retest repeatability was assessed using the Cronbach's α and intraclass correlation coefficients. The average age of participants was 67 ± 8 years and most participants were female (73%). Internal consistency was acceptable with the α coefficient above 0.7 for all subscales and intraclass correlation coefficients were 0.6 or greater in all subscales. The Vietnamese NEI VFQ-25 is reliable for use in studies assessing vision-related quality of life in older adults with bilateral cataract in Vietnam. We propose some modifications to the NEI-VFQ questions to reflect activities of older people in Vietnam. © 2013 ACOTA.
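
    The abstract reports Cronbach's α per subscale with 0.7 as the acceptability threshold. As a reminder of what that coefficient measures, the sketch below computes α from a respondents-by-items score table using the standard formula α = k/(k-1) · (1 - Σ item variances / variance of total scores). The function name and the toy data layout are assumptions for illustration; the study's own software and data are not described in the record.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Uses sample variances (n - 1 denominator) for both the items and the totals.
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Variance of each respondent's summed score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)
```

    A subscale would count as acceptable under the abstract's criterion when `cronbach_alpha(subscale_scores) > 0.7`.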

  17. Use of UV-C radiation to disinfect non-critical patient care items: a laboratory assessment of the Nanoclave Cabinet

    Directory of Open Access Journals (Sweden)

    Moore Ginny

    2012-08-01

    Background The near-patient environment is often heavily contaminated, yet the decontamination of near-patient surfaces and equipment is often poor. The Nanoclave Cabinet produces large amounts of ultraviolet-C (UV-C) radiation (53 W/m^2) and is designed to rapidly disinfect individual items of clinical equipment. Controlled laboratory studies were conducted to assess its ability to eradicate a range of potential pathogens including Clostridium difficile spores and Adenovirus from different types of surface. Methods Each test surface was inoculated with known levels of vegetative bacteria (10^6 cfu/cm^2), C. difficile spores (10^2-10^6 cfu/cm^2) or Adenovirus (10^9 viral genomes), placed in the Nanoclave Cabinet and exposed for up to 6 minutes to the UV-C light source. Survival of bacterial contaminants was determined via conventional cultivation techniques. Degradation of viral DNA was determined via PCR. Results were compared to the number of colonies or level of DNA recovered from non-exposed control surfaces. Experiments were repeated to incorporate organic soils and to compare the efficacy of the Nanoclave Cabinet to that of antimicrobial wipes. Results After exposing 8 common non-critical patient care items to two 30-second UV-C irradiation cycles, bacterial numbers on 40 of 51 target sites were consistently reduced to below detectable levels (≥ 4.7 log10 reduction). Bacterial load was reduced but still persisted on other sites. Objects that proved difficult to disinfect using the Nanoclave Cabinet (e.g. blood pressure cuff) were also difficult to disinfect using antimicrobial wipes. The efficacy of the Nanoclave Cabinet was not affected by the presence of organic soils. Clostridium difficile spores were more resistant to UV-C irradiation than vegetative bacteria. However, two 60-second irradiation cycles were sufficient to reduce the number of surface-associated spores from 10^3 cfu/cm^2 to below detectable levels. A 3 log10 reduction in

  18. Adaptive screening for depression--recalibration of an item bank for the assessment of depression in persons with mental and somatic diseases and evaluation in a simulated computer-adaptive test environment.

    Science.gov (United States)

    Forkmann, Thomas; Kroehne, Ulf; Wirtz, Markus; Norra, Christine; Baumeister, Harald; Gauggel, Siegfried; Elhan, Atilla Halil; Tennant, Alan; Boecker, Maren

    2013-11-01

    This study conducted a simulation study for computer-adaptive testing based on the Aachen Depression Item Bank (ADIB), which was developed for the assessment of depression in persons with somatic diseases. Prior to computer-adaptive test simulation, the ADIB was newly calibrated. Recalibration was performed in a sample of 161 patients treated for a depressive syndrome, 103 patients from cardiology, and 103 patients from otorhinolaryngology (mean age 44.1, SD=14.0; 44.7% female) and was cross-validated in a sample of 117 patients undergoing rehabilitation for cardiac diseases (mean age 58.4, SD=10.5; 24.8% women). Unidimensionality of the itembank was checked and a Rasch analysis was performed that evaluated local dependency (LD), differential item functioning (DIF), item fit and reliability. CAT-simulation was conducted with the total sample and additional simulated data. Recalibration resulted in a strictly unidimensional item bank with 36 items, showing good Rasch model fit (item fit residualsLD. CAT simulation revealed that 13 items on average were necessary to estimate depression in the range of -2 and +2 logits when terminating at SE≤0.32 and 4 items if using SE≤0.50. Receiver Operating Characteristics analysis showed that θ estimates based on the CAT algorithm have good criterion validity with regard to depression diagnoses (Area Under the Curve≥.78 for all cut-off criteria). The recalibration of the ADIB succeeded and the simulation studies conducted suggest that it has good screening performance in the samples investigated and that it may reasonably add to the improvement of depression assessment. © 2013.

  19. Assessing Goodness of Fit in Item Response Theory with Nonparametric Models: A Comparison of Posterior Probabilities and Kernel-Smoothing Approaches

    Science.gov (United States)

    Sueiro, Manuel J.; Abad, Francisco J.

    2011-01-01

    The distance between nonparametric and parametric item characteristic curves has been proposed as an index of goodness of fit in item response theory in the form of a root integrated squared error index. This article proposes to use the posterior distribution of the latent trait as the nonparametric model and compares the performance of an index…

  20. The role of attention in item-item binding in visual working memory.

    Science.gov (United States)

    Peterson, Dwight J; Naveh-Benjamin, Moshe

    2017-09-01

    An important yet unresolved question regarding visual working memory (VWM) relates to whether or not binding processes within VWM require additional attentional resources compared with processing solely the individual components comprising these bindings. Previous findings indicate that binding of surface features (e.g., colored shapes) within VWM is not demanding of resources beyond what is required for single features. However, it is possible that other types of binding, such as the binding of complex, distinct items (e.g., faces and scenes), in VWM may require additional resources. In 3 experiments, we examined VWM item-item binding performance under no load, articulatory suppression, and backward counting using a modified change detection task. Binding performance declined to a greater extent than single-item performance under higher compared with lower levels of concurrent load. The findings from each of these experiments indicate that processing item-item bindings within VWM requires a greater amount of attentional resources compared with single items. These findings also highlight an important distinction between the role of attention in item-item binding within VWM and previous studies of long-term memory (LTM) where declines in single-item and binding test performance are similar under divided attention. The current findings provide novel evidence that the specific type of binding is an important determining factor regarding whether or not VWM binding processes require attention. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  1. Item Modeling Concept Based on Multimedia Authoring

    Directory of Open Access Journals (Sweden)

    Janez Stergar

    2008-09-01

    This paper introduces a modern item design framework for computer-based assessment built on the Flash authoring environment. Question design is discussed, with emphasis on the multimedia authoring environment used for item modeling. Item type templates are a structured means of collecting and storing item information that can be used to improve the efficiency and security of the innovative item design process. Templates can modernize item design and enhance and speed up the development process. Along with content creation, multimedia has vast potential for use in innovative testing. The introduced item design template is based on a taxonomy of innovative items, which have great potential for expanding the content areas and construct coverage of an assessment. The presented item design approach is based on two GUIs: one for question design based on the implemented item design templates and one for user interaction tracking/retrieval. The concept of user interfaces based on Flash technology is discussed, as well as the implementation of the item design forms with multimedia authoring. An innovative method for user interaction storage/retrieval, based on PHP extending Flash capabilities in the proposed framework, is also introduced.

  2. Single-molecule protein sequencing through fingerprinting: computational assessment

    Science.gov (United States)

    Yao, Yao; Docter, Margreet; van Ginkel, Jetty; de Ridder, Dick; Joo, Chirlmin

    2015-10-01

    Proteins are vital in all biological systems as they constitute the main structural and functional components of cells. Recent advances in mass spectrometry have brought the promise of complete proteomics by helping draft the human proteome. Yet, this commonly used protein sequencing technique has fundamental limitations in sensitivity. Here we propose a method for single-molecule (SM) protein sequencing. A major challenge lies in the fact that proteins are composed of 20 different amino acids, which demands 20 molecular reporters. We computationally demonstrate that it suffices to measure only two types of amino acids to identify proteins and suggest an experimental scheme using SM fluorescence. When achieved, this highly sensitive approach will result in a paradigm shift in proteomics, with major impact in the biological and medical sciences.
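
    The core idea of the abstract, that reading only two types of amino acids can be enough to tell proteins apart, can be illustrated with a toy lookup. This is a sketch under stated assumptions: the abstract does not say which two residues are labeled, so the choice of cysteine (C) and lysine (K) here, the function names, and the three short sequences are all made up for illustration and are not the authors' data or algorithm.

```python
from collections import Counter


def fingerprint(sequence: str, labeled: str = "CK") -> str:
    """Keep only the labeled residue types, preserving their order."""
    return "".join(residue for residue in sequence if residue in labeled)


def uniquely_identified(proteins: dict, labeled: str = "CK") -> set:
    """Return the names of proteins whose fingerprint no other protein shares."""
    prints = {name: fingerprint(seq, labeled) for name, seq in proteins.items()}
    counts = Counter(prints.values())
    return {name for name, fp in prints.items() if counts[fp] == 1}


# Hypothetical toy database; real sequences are hundreds of residues long.
toy_db = {"a": "MKCAK", "b": "MCKAK", "c": "KCGGK"}
identified = uniquely_identified(toy_db)
```

    In this toy database only protein "b" has a unique two-residue fingerprint; "a" and "c" collapse to the same pattern, which is the kind of ambiguity a computational assessment over a full proteome would need to quantify.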

  3. Single-molecule protein sequencing through fingerprinting: computational assessment

    International Nuclear Information System (INIS)

    Yao, Yao; Docter, Margreet; Van Ginkel, Jetty; Joo, Chirlmin; De Ridder, Dick

    2015-01-01

    Proteins are vital in all biological systems as they constitute the main structural and functional components of cells. Recent advances in mass spectrometry have brought the promise of complete proteomics by helping draft the human proteome. Yet, this commonly used protein sequencing technique has fundamental limitations in sensitivity. Here we propose a method for single-molecule (SM) protein sequencing. A major challenge lies in the fact that proteins are composed of 20 different amino acids, which demands 20 molecular reporters. We computationally demonstrate that it suffices to measure only two types of amino acids to identify proteins and suggest an experimental scheme using SM fluorescence. When achieved, this highly sensitive approach will result in a paradigm shift in proteomics, with major impact in the biological and medical sciences. (paper)

  4. Learning environment assessments of a single curriculum being taught at two medical schools 10,000 miles apart.

    Science.gov (United States)

    Tackett, Sean; Shochet, Robert; Shilkofski, Nicole A; Colbert-Getz, Jorie; Rampal, Krishna; Abu Bakar, Hamidah; Wright, Scott

    2015-06-17

    Perdana University Graduate School of Medicine (PUGSOM), the first graduate-entry medical school in Malaysia, was established in 2011 in collaboration with Johns Hopkins University School of Medicine (JHUSOM), an American medical school. This study compared learning environments (LE) at these two schools, which shared the same overarching curriculum, along with a comparator Malaysian medical school, Cyberjaya University College of Medical Sciences (CUCMS). As a secondary aim, we compared 2 LE assessment tools - the widely-used Dundee Ready Educational Environment Measure (DREEM) and the newer Johns Hopkins Learning Environment Scale (JHLES). Students responded anonymously at the end of their first year of medical school to surveys which included DREEM, JHLES, single-item global LE assessment variables, and demographics questions. Respondents included 24/24 (100 %) students at PUGSOM, 100/120 (83 %) at JHUSOM, and 79/83 (95 %) at CUCMS. PUGSOM had the highest overall LE ratings (p safety" domains. JHLES detected significant differences across schools in 5/7 domains and had stronger correlations than DREEM to each global LE assessment variable. The inaugural class of medical students at PUGSOM rated their LE exceptionally highly, providing evidence that transporting a medical school curriculum may be successful. The JHLES showed promise as a LE assessment tool for use in international settings.

  5. Assessing the scientific relevance of a single publication over time

    Directory of Open Access Journals (Sweden)

    Philipp A. Bloching

    2013-09-01

    Quantitatively assessing the scientific relevance of a research paper is challenging for two reasons. Firstly, scientific relevance may change over time, and secondly, it is unclear how to evaluate a recently published paper. The temporally averaged paper-specific impact factor is defined as the yearly average of citations to the paper until now, including bonus citations equal to the journal impact factor in the publication year. This new measure allows relevance rankings and annual updates of all (i.e. both recent and older) scientific papers of a department, or even a whole scientific field, on a more objective basis. It can also be used to assess both the average and overall time-dependent scientific relevance of researchers in a specific department or scientific field.
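
    Read literally, the definition above amounts to adding the journal impact factor of the publication year to the paper's citation count and dividing by the number of years elapsed. The sketch below follows that reading. It is an interpretation of the abstract, not the published formula: in particular, counting the publication year itself as the first year is an assumption, and the function and parameter names are hypothetical.

```python
def temporally_averaged_impact(citations: int,
                               journal_impact_factor: float,
                               publication_year: int,
                               current_year: int) -> float:
    """Yearly average of citations plus 'bonus citations' equal to the journal IF.

    Assumption: the publication year counts as year 1, so a paper published
    this year starts with a score equal to its journal's impact factor.
    """
    years = current_year - publication_year + 1
    if years < 1:
        raise ValueError("current_year must not precede publication_year")
    return (citations + journal_impact_factor) / years
```

    Under these assumptions, a paper from 2009 with 30 citations in a journal with impact factor 2.5 scores (30 + 2.5) / 5 = 6.5 in 2013, while an uncited paper published in the current year scores exactly its journal's impact factor, which is how the bonus makes recent papers rankable.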

  6. Location Indices for Ordinal Polytomous Items Based on Item Response Theory. Research Report. ETS RR-15-20

    Science.gov (United States)

    Ali, Usama S.; Chang, Hua-Hua; Anderson, Carolyn J.

    2015-01-01

    Polytomous items are typically described by multiple category-related parameters; situations, however, arise in which a single index is needed to describe an item's location along a latent trait continuum. Situations in which a single index would be needed include item selection in computerized adaptive testing or test assembly. Therefore single…

  7. Weighting and Aggregation in Life Cycle Assessment: Do Present Aggregated Single Scores Provide Correct Decision Support?

    DEFF Research Database (Denmark)

    Kalbar, Pradip; Birkved, Morten; Nygaard, Simon Elsborg

    2016-01-01

    This study investigates the prevailing practice of obtaining single scores in life cycle assessment (LCA) and identifies potential lacunas in impact assessment methodology related to the results of aggregation into endpoints and single scores. In order to conduct this investigation, a detailed...... approach was adopted to facilitate identification of three main problems related to the single-score calculation approach. The prevailing ReCiPe single-score calculation method does not account for either the effect of so-called dominating alternatives (i.e., alternatives having high values across all...

  8. Assessment of coverage levels of single dose measles vaccine

    International Nuclear Information System (INIS)

    Tariq, P.

    2003-01-01

    Objective: To study the consequences of low coverage levels of a single dose of measles vaccine. Results: The mean age observed in measles cases was 2 years and 8 months, with a range from 3 months to 8 years. The largest number of cases was reported in children <1 year of age (n=22, 32%). Fifty percent of cases were seen among vaccinated children. Seventy-five percent (n=51) had a history of contact with a measles case. Pneumonia was the commonest complication, followed by acute gastroenteritis, encephalitis, febrile convulsions, oral ulcers, oral thrush, eye changes of vitamin A deficiency and pulmonary tuberculosis (T.B.) in descending order of frequency. Fifty-four cases were successfully treated for complications of measles and discharged. Nine cases left against medical advice. Five patients died; all of them had encephalitis, either alone (n=1) or in combination with pneumonia and acute gastroenteritis (n=4). Conclusion: There is a dire need to increase immunization coverage to reduce the rate of vaccine failure and achieve effective control of measles. (author)

  9. Economic assessment of single-walled carbon nanotube processes

    Science.gov (United States)

    Isaacs, J. A.; Tanwani, A.; Healy, M. L.; Dahlben, L. J.

    2010-02-01

    The carbon nanotube market is steadily growing and projected to reach $1.9 billion by 2010. This study examines the economics of manufacturing single-walled carbon nanotubes (SWNT) using process-based cost models developed for arc, CVD, and HiPco processes. Using assumed input parameters, manufacturing costs are calculated for 1 g SWNT for arc, CVD, and HiPco, totaling $1,906, $1,706, and $485, respectively. For each SWNT process, the synthesis and filtration steps showed the highest costs, with direct labor as a primary cost driver. Reductions in production costs are calculated for increased working hours per day and for increased synthesis reaction yield (SRY) in each process. The process-based cost models offer a means for exploring opportunities for cost reductions, and provide a structured system for comparisons among alternative SWNT manufacturing processes. Further, the models can be used to comprehensively evaluate additional scenarios on the economics of environmental, health, and safety best manufacturing practices.

  10. Economic assessment of single-walled carbon nanotube processes

    Energy Technology Data Exchange (ETDEWEB)

    Isaacs, J. A., E-mail: jaisaacs@coe.neu.ed [Northeastern University, NSF Center for High-rate Nanomanufacturing (United States); Tanwani, A. [Infojini Solutions Inc. (United States); Healy, M. L. [Babcock Power Inc. (United States); Dahlben, L. J. [Northeastern University, NSF Center for High-rate Nanomanufacturing (United States)

    2010-02-15

    The carbon nanotube market is steadily growing and projected to reach $1.9 billion by 2010. This study examines the economics of manufacturing single-walled carbon nanotubes (SWNT) using process-based cost models developed for arc, CVD, and HiPco processes. Using assumed input parameters, manufacturing costs are calculated for 1 g SWNT for arc, CVD, and HiPco, totaling $1,906, $1,706, and $485, respectively. For each SWNT process, the synthesis and filtration steps showed the highest costs, with direct labor as a primary cost driver. Reductions in production costs are calculated for increased working hours per day and for increased synthesis reaction yield (SRY) in each process. The process-based cost models offer a means for exploring opportunities for cost reductions, and provide a structured system for comparisons among alternative SWNT manufacturing processes. Further, the models can be used to comprehensively evaluate additional scenarios on the economics of environmental, health, and safety best manufacturing practices.

  11. Quality assessment of observational studies in a drug-safety systematic review, comparison of two tools: the Newcastle–Ottawa Scale and the RTI item bank

    Directory of Open Access Journals (Sweden)

    Margulis AV

    2014-10-01

    Andrea V Margulis,1 Manel Pladevall,1 Nuria Riera-Guardia,1 Cristina Varas-Lorenzo,1 Lorna Hazell,2,3 Nancy D Berkman,4 Meera Viswanathan,4 Susana Perez-Gutthann1; 1RTI Health Solutions, Barcelona, Spain; 2Drug Safety Research Unit, Southampton, UK; 3Associate Department of the School of Pharmacy and Biomedical Sciences, University of Portsmouth, Portsmouth, UK; 4RTI International, Research Triangle Park, NC, USA. Background: The study objective was to compare the Newcastle–Ottawa Scale (NOS) and the RTI item bank (RTI-IB) and estimate interrater agreement using the RTI-IB within a systematic review on the cardiovascular safety of glucose-lowering drugs. Methods: We tailored both tools and added four questions to the RTI-IB. Two reviewers assessed the quality of the 44 included studies with both tools (independently for the RTI-IB) and agreed on which responses conveyed low, unclear, or high risk of bias. For each question in the RTI-IB (n=31), the observed interrater agreement was calculated as the percentage of studies given the same bias assessment by both reviewers; chance-adjusted interrater agreement was estimated with the first-order agreement coefficient (AC1) statistic. Results: The NOS required less tailoring and was easier to use than the RTI-IB, but the RTI-IB produced a more thorough assessment. The RTI-IB includes most of the domains measured in the NOS. Median observed interrater agreement for the RTI-IB was 75% (25th percentile [p25] = 61%; p75 = 89%); median AC1 statistic was 0.64 (p25 = 0.51; p75 = 0.86). Conclusion: The RTI-IB facilitates a more complete quality assessment than the NOS but is more burdensome. The observed agreement and AC1 statistic in this study were higher than those reported by the RTI-IB's developers. Keywords: systematic review, meta-analysis, quality assessment, AC1
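
    The two agreement measures named in the Methods can be computed per question from the two reviewers' ratings. The sketch below shows observed agreement (share of studies rated identically) and the first-order agreement coefficient for two raters, AC1 = (pa - pe) / (1 - pe), where pe = Σ π_k (1 - π_k) / (Q - 1) and π_k is the mean of the two raters' marginal proportions for category k. The function names and the four example ratings are hypothetical; they are not taken from the review's data.

```python
from collections import Counter


def observed_agreement(rater1, rater2) -> float:
    """Share of studies given the same rating by both reviewers."""
    return sum(a == b for a, b in zip(rater1, rater2)) / len(rater1)


def gwet_ac1(rater1, rater2, categories) -> float:
    """First-order agreement coefficient (AC1) for two raters."""
    n = len(rater1)
    q = len(categories)
    pa = observed_agreement(rater1, rater2)
    c1, c2 = Counter(rater1), Counter(rater2)
    # pi_k: average of the two raters' marginal proportions for category k.
    pi = [(c1[k] + c2[k]) / (2 * n) for k in categories]
    # Chance agreement under the AC1 model.
    pe = sum(p * (1 - p) for p in pi) / (q - 1)
    return (pa - pe) / (1 - pe)


# Hypothetical risk-of-bias ratings for four studies on one question.
levels = ["low", "unclear", "high"]
reviewer_a = ["low", "low", "high", "unclear"]
reviewer_b = ["low", "high", "high", "unclear"]
pa = observed_agreement(reviewer_a, reviewer_b)
ac1 = gwet_ac1(reviewer_a, reviewer_b, levels)
```

    Repeating this for each of the 31 questions and taking the median and quartiles of the resulting values would give summary figures of the kind reported in the Results.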

  12. Item response theory analysis of the life orientation test-revised: age and gender differential item functioning analyses.

    Science.gov (United States)

    Steca, Patrizia; Monzani, Dario; Greco, Andrea; Chiesi, Francesca; Primi, Caterina

    2015-06-01

    This study is aimed at testing the measurement properties of the Life Orientation Test-Revised (LOT-R) for the assessment of dispositional optimism by employing item response theory (IRT) analyses. The LOT-R was administered to a large sample of 2,862 Italian adults. First, confirmatory factor analyses demonstrated the theoretical conceptualization of the construct measured by the LOT-R as a single bipolar dimension. Subsequently, IRT analyses for polytomous, ordered response category data were applied to investigate the items' properties. The equivalence of the items across gender and age was assessed by analyzing differential item functioning. Discrimination and severity parameters indicated that all items were able to distinguish people with different levels of optimism and adequately covered the spectrum of the latent trait. Additionally, the LOT-R appears to be gender invariant and, with minor exceptions, age invariant. Results provided evidence that the LOT-R is a reliable and valid measure of dispositional optimism. © The Author(s) 2014.

  13. Análise de Teoria de Resposta ao Item de um instrumento breve de avaliação de comportamentos antissociais = Item Response Theory Analysis of a brief instrument for assessing antisocial behaviors

    Directory of Open Access Journals (Sweden)

    Hauck Filho, Nelson

    2014-01-01

    Antisocial behaviors are common to several psychopathological conditions, including personality disorders (e.g., antisocial and narcissistic) and mood disorders (e.g., bipolar disorder). Until now, however, there has been an important gap in the Brazilian context regarding the brief assessment of antisocial behaviors in adults from non-prison settings. The present study therefore aimed to construct, and to analyze by means of Item Response Theory, a brief instrument for use in research and screening with the general adult population. Analyses of the responses of 204 university students (mean age = 23.56 years; SD = 7.70; 60.6% women) to a set of items made it possible to retain 13 items with excellent psychometric properties. These items proved to assess a general factor of antisociality, interpretable as a propensity toward antagonism, non-cooperation, and aggression across a variety of social contexts. Limitations of the study are discussed at the end.

  14. Screening for depression and assessing change in severity of depression. Is the Geriatric Depression Scale (30-, 15- and 8-item versions) useful for both purposes in nursing home patients?

    NARCIS (Netherlands)

    Smalbrugge, M.; Jongenelis, L.; Pot, A.M.; Eefsting, J.A.; Beekman, A.T.F.

    2008-01-01

    The objectives of this study were to determine the ability of the 30-, 15- and 8-item versions of the GDS for screening and assessing change in severity of depression in nursing home patients. The GDS and the MADRS were administered to 350 elderly NH-patients by trained interviewers. The presence of

  15. Development of a Short Version of MSQOL-54 Using Factor Analysis and Item Response Theory.

    Directory of Open Access Journals (Sweden)

    Rosalba Rosato

    The Multiple Sclerosis Quality of Life-54 (MSQOL-54, 52 items grouped in 12 subscales plus two single items) is the most used MS specific health related quality of life inventory. To develop a shortened version of the MSQOL-54. MSQOL-54 dimensionality and metric properties were investigated by confirmatory factor analysis (CFA) and Rasch modelling (Partial Credit Model, PCM) on MSQOL-54s completed by 473 MS patients. Their mean age was 41 years, 65% were women, and median Expanded Disability Status Scale (EDSS) score was 2.0 (range 0-9.5). Differential item functioning (DIF) was evaluated for gender, age and EDSS. Dimensionality of the resulting short version was assessed by exploratory factor analysis (EFA) and CFA. Cognitive debriefing of the short instrument (vs. the original) was then performed on 12 MS patients. CFA of MSQOL-54 subscales showed that the data fitted the overall model well. Two subscales (Role Limitations--Physical, Role Limitations--Emotional) did not fit the PCM, and were removed; two other subscales (Health Perceptions, Social Function) did not fit the model, but were retained as single items. Sexual Satisfaction (single-item subscale) was also removed. The resulting MSQOL-29 consisted of 25 items grouped in 7 subscales, plus 4 single items. PCM fit statistics were within the acceptability range for all MSQOL-29 items except one which had significant DIF by age. EFA and CFA indicated adequate fit to the original two-factor (Physical and Mental Health Composites) hypothesis. Cognitive debriefing confirmed that MSQOL-29 was acceptable and had lost no key items. The proposed MSQOL-29 is 50% shorter than MSQOL-54, yet preserves key quality of life dimensions. Prospective validation on a large, independent MS patient sample is ongoing.
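
    For readers unfamiliar with the Partial Credit Model mentioned above, the following minimal sketch shows how the model turns a person location and a set of item step difficulties into response-category probabilities. It uses the textbook PCM formula; the function name and the example values are illustrative assumptions, not parameters estimated in the study.

```python
import math


def pcm_probabilities(theta, step_difficulties):
    """Category probabilities under the Partial Credit Model.

    theta: person location on the latent trait (logits).
    step_difficulties: difficulty of each step between adjacent
        categories, so an item with m steps has m + 1 categories.
    """
    # Cumulative sums of (theta - delta_j); the empty sum for category 0 is 0.
    cumulative = [0.0]
    for delta in step_difficulties:
        cumulative.append(cumulative[-1] + (theta - delta))
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(cumulative)
    weights = [math.exp(value - peak) for value in cumulative]
    total = sum(weights)
    return [weight / total for weight in weights]


# Hypothetical three-category item with steps at -1 and +1 logits.
probs = pcm_probabilities(theta=0.0, step_difficulties=[-1.0, 1.0])
print(probs)
```

    Fit statistics such as those reported in the abstract compare observed responses with the probabilities this kind of function predicts.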

  16. Assessing the measurement of aerosol single scattering albedo by Cavity Attenuated Phase-Shift Single Scattering Monitor (CAPS PMssa)

    Science.gov (United States)

    Perim de Faria, Julia; Bundke, Ulrich; Onasch, Timothy B.; Freedman, Andrew; Petzold, Andreas

    2016-04-01

    The necessity to quantify the direct impact of aerosol particles on climate forcing is already well known; assessing this impact requires continuous and systematic measurements of the aerosol optical properties. Two of the main parameters that need to be accurately measured are the aerosol optical depth and single scattering albedo (SSA, defined as the ratio of particulate scattering to extinction). The measurement of single scattering albedo commonly involves the measurement of two optical parameters, the scattering and the absorption coefficients. Although there are well established technologies to measure both of these parameters, the use of two separate instruments with different principles and uncertainties represents potential sources of significant errors and biases. Based on the recently developed cavity attenuated phase shift particle extinction monitor (CAPS PMex) instrument, the CAPS PMssa instrument combines the CAPS technology to measure particle extinction with an integrating sphere capable of simultaneously measuring the scattering coefficient of the same sample. The scattering channel is calibrated to the extinction channel, such that the accuracy of the single scattering albedo measurement is only a function of the accuracy of the extinction measurement and the nephelometer truncation losses. This gives the instrument an accurate and direct measurement of the single scattering albedo. In this study, we assess the measurements of both the extinction and scattering channels of the CAPS PMssa through intercomparisons with Mie theory, as a fundamental comparison, and with proven technologies, such as integrating nephelometers and filter-based absorption monitors. For comparison, we use two nephelometers, a TSI 3563 and an Aurora 4000, and two measurements of the absorption coefficient, using a Particulate Soot Absorption Photometer (PSAP) and a Multi Angle Absorption Photometer (MAAP). We also assess the indirect absorption coefficient
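
    The definition of SSA given in this abstract reduces to a one-line ratio, sketched below. The sketch also derives absorption as extinction minus scattering, which is one plausible reading of the "indirect" absorption coefficient mentioned in the truncated last sentence; that reading, the function names, and the numbers are assumptions for illustration only.

```python
def single_scattering_albedo(scattering, extinction):
    """SSA as the ratio of particulate scattering to extinction."""
    if extinction <= 0:
        raise ValueError("extinction coefficient must be positive")
    if not 0 <= scattering <= extinction:
        raise ValueError("scattering must lie between 0 and extinction")
    return scattering / extinction


def absorption_by_difference(scattering, extinction):
    """Absorption inferred from extinction = scattering + absorption."""
    return extinction - scattering


# Hypothetical coefficients in inverse megametres (Mm^-1).
sigma_scat, sigma_ext = 90.0, 100.0
print(single_scattering_albedo(sigma_scat, sigma_ext))
print(absorption_by_difference(sigma_scat, sigma_ext))
```

    Because both coefficients come from the same sample in a combined instrument, errors common to the two channels partly cancel in the ratio, which is the advantage the abstract describes.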

  17. Assessing normative cut points through differential item functioning analysis: An example from the adaptation of the Middlesex Elderly Assessment of Mental State (MEAMS) for use as a cognitive screening test in Turkey

    Directory of Open Access Journals (Sweden)

    Kutlay Sehim

    2006-03-01

    Background: The Middlesex Elderly Assessment of Mental State (MEAMS) was developed as a screening test to detect cognitive impairment in the elderly. It includes 12 subtests, each having a 'pass score'. A series of tasks were undertaken to adapt the measure for use in the adult population in Turkey and to determine the validity of existing cut points for passing subtests, given the wide range of educational level in the Turkish population. This study focuses on identifying and validating the scoring system of the MEAMS for Turkish adult population. Methods: After the translation procedure, 350 normal subjects and 158 acquired brain injury patients were assessed by the Turkish version of MEAMS. Initially, appropriate pass scores for the normal population were determined through ANOVA post-hoc tests according to age, gender and education. Rasch analysis was then used to test the internal construct validity of the scale and the validity of the cut points for pass scores on the pooled data by using Differential Item Functioning (DIF) analysis within the framework of the Rasch model. Results: Data with the initially modified pass scores were analyzed. DIF was found for certain subtests by age and education, but not for gender. Following this, pass scores were further adjusted and data re-fitted to the model. All subtests were found to fit the Rasch model (mean item fit 0.184, SD 0.319; person fit -0.224, SD 0.557) and DIF was then found to be absent. Thus the final pass scores for all subtests were determined. Conclusion: The MEAMS offers a valid assessment of cognitive state for the adult Turkish population, and the revised cut points accommodate for age and education. Further studies are required to ascertain the validity in different diagnostic groups.

  18. Assessing normative cut points through differential item functioning analysis: an example from the adaptation of the Middlesex Elderly Assessment of Mental State (MEAMS) for use as a cognitive screening test in Turkey.

    Science.gov (United States)

    Tennant, Alan; Küçükdeveci, Ayse A; Kutlay, Sehim; Elhan, Atilla H

    2006-03-23

    The Middlesex Elderly Assessment of Mental State (MEAMS) was developed as a screening test to detect cognitive impairment in the elderly. It includes 12 subtests, each having a 'pass score'. A series of tasks were undertaken to adapt the measure for use in the adult population in Turkey and to determine the validity of existing cut points for passing subtests, given the wide range of educational level in the Turkish population. This study focuses on identifying and validating the scoring system of the MEAMS for Turkish adult population. After the translation procedure, 350 normal subjects and 158 acquired brain injury patients were assessed by the Turkish version of MEAMS. Initially, appropriate pass scores for the normal population were determined through ANOVA post-hoc tests according to age, gender and education. Rasch analysis was then used to test the internal construct validity of the scale and the validity of the cut points for pass scores on the pooled data by using Differential Item Functioning (DIF) analysis within the framework of the Rasch model. Data with the initially modified pass scores were analyzed. DIF was found for certain subtests by age and education, but not for gender. Following this, pass scores were further adjusted and data re-fitted to the model. All subtests were found to fit the Rasch model (mean item fit 0.184, SD 0.319; person fit -0.224, SD 0.557) and DIF was then found to be absent. Thus the final pass scores for all subtests were determined. The MEAMS offers a valid assessment of cognitive state for the adult Turkish population, and the revised cut points accommodate for age and education. Further studies are required to ascertain the validity in different diagnostic groups.

  19. Funcionamento diferencial de itens para avaliar a agressividade de universitários = Differential items functioning to assess aggressiveness in college students

    Directory of Open Access Journals (Sweden)

    Fermino Fernandes Sisto

    2008-01-01

    This study sought evidence of construct validity for an aggressiveness instrument by examining whether its items function differentially by sex. Participants were 445 college students of both sexes enrolled in Engineering, Computing and Psychology courses. The 81-item aggressiveness scale was administered collectively, in the classroom, to the students who consented to participate in the study. The items of the instrument were analyzed by means of the Rasch model. Twenty-eight items showed differential functioning: 15 described behaviors more characteristic of females and 13 behaviors more characteristic of males. The reliability coefficients were 0.99 for the items and 0.86 for the persons. It was concluded that aggressiveness can be measured separately on the basis of sex.

  20. The impact of item order on ratings of cancer risk perception.

    Science.gov (United States)

    Taylor, Kathryn L; Shelby, Rebecca A; Schwartz, Marc D; Ackerman, Josh; LaSalle, V Holland; Gelmann, Edward P; McGuire, Colleen

    2002-07-01

    Although perceived risk is central to most theories of health behavior, there is little consensus on its measurement with regard to item wording, response set, or the number of items to include. In a methodological assessment of perceived risk, we assessed the impact of changing the order of three commonly used perceived risk items: quantitative personal risk, quantitative population risk, and comparative risk. Participants were 432 men and women enrolled in an ancillary study of the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial. Three groups of consecutively enrolled participants responded to the three items in one of three question orders. Results indicated that item order was related to the perceived risk ratings of both ovarian (P Perceptions of risk were significantly lower when the comparative rating was made first. The findings suggest that compelling participants to consider their own risk relative to the risk of others results in lower ratings of perceived risk. Although the use of multiple items may provide more information than when only a single method is used, different conclusions may be reached depending on the context in which an item is assessed.

  1. Risk assessment of PCDD/Fs levels in human tissues related to major food items based on chemical analyses and micro-EROD assay.

    Science.gov (United States)

    Tsang, H L; Wu, S C; Wong, C K C; Leung, C K M; Tao, S; Wong, M H

    2009-10-01

    Nine groups of food items (freshwater fish, marine fish, pork, chicken, chicken eggs, leafy, non-leafy vegetables, rice and flour) and three types of human samples (human milk, maternal serum and cord serum) were collected for the analysis of PCDD/Fs. Results of chemical analysis revealed PCDD/Fs concentrations (pg g(-1) fat) in the following ascending order: pork (0.289 pg g(-1) fat), grass carp (Ctenopharyngodon idellus) (freshwater fish) (0.407), golden thread (Nemipterus virgatus) (marine fish) (0.511), chicken (0.529), mandarin fish (Siniperca kneri) (marine fish) (0.535), chicken egg (0.552), and snubnose pompano (Trachinotus blochii) (marine fish) (1.219). The results of micro-EROD assay showed relatively higher PCDD/Fs levels in fish (2.65 pg g(-1) fat) when compared with pork (0.47), eggs (0.33), chicken (0.13), flour (0.07), vegetables (0.05 pg g(-1) wet wt) and rice (0.05). The estimated average daily intake of PCDD/Fs of 3.51 pg EROD-TEQ/kg bw/day was within the range of WHO Tolerable Daily Intake (1-4 pg WHO-TEQ/kg bw/day) and was higher than the Provisional Tolerable Daily Intake (PMTL) (70 pg for dioxins and dioxin-like PCBs) recommended by the Joint FAO/WHO Expert Committee on Food Additives (JECFA) [Joint FAO/WHO Expert Committee on Food Additives (JECFA), Summary and conclusions of the fifty-seventh meeting, JECFA, 2001.]. Nevertheless, the current findings were significantly lower than the TDI (14 pg WHO-TEQ/kg/bw/day) recommended by the Scientific Committee on Food of the Europe Commission [European Scientific Committee on Food (EU SCF), Opinions on the SCF on the risk assessment of dioxins and dioxin-like PCBs in food, 2000.]. However, it should be noted that micro-EROD assay overestimates the PCDD/Fs levels by 2- to 7-fold, which may also amplify the PCDD/Fs levels accordingly. Although the levels of PCDD/Fs obtained from micro-EROD assay were 2- to 7-fold higher than those obtained by chemical analysis, it provides a cost-effective and

  2. Analysis of Nonequivalent Assessments across Different Linguistic Groups Using a Mixed Methods Approach: Understanding the Causes of Differential Item Functioning by Cognitive Interviewing

    Science.gov (United States)

    Benítez, Isabel; Padilla, José-Luis

    2014-01-01

    Differential item functioning (DIF) can undermine the validity of cross-lingual comparisons. While a lot of efficient statistics for detecting DIF are available, few general findings have been found to explain DIF results. The objective of the article was to study DIF sources by using a mixed method design. The design involves a quantitative phase…

  3. Item response modeling: A psychometric assessment of the children's fruit, vegetable, water, and physical activity self-efficacy scales among Chinese children

    Science.gov (United States)

    This study aimed to evaluate the psychometric properties of four self-efficacy scales (i.e., self-efficacy for fruit (FSE), vegetable (VSE), and water (WSE) intakes, and physical activity (PASE)) and to investigate their differences in item functioning across sex, age, and body weight status groups ...

  4. Quality of life assessed with the medical outcomes study short form 36-item health survey of patients on renal replacement therapy: A systematic review and meta-analysis

    NARCIS (Netherlands)

    Y.S. Liem (Ylian Serina); J.L. Bosch (Johanna); L.R. Arends (Lidia); M.H. Heijenbrok-Kal (Majanka); M.G.M. Hunink (Myriam)

    2007-01-01

    Objectives: The Medical Outcomes Study Short Form 36-Item Health Survey (SF-36) is the most widely used generic instrument to estimate quality of life of patients on renal replacement therapy. The purpose of this study was to summarize and compare the published literature on quality of

  5. Statistical approaches to assessing single and multiple outcome measures in dry eye therapy and diagnosis.

    Science.gov (United States)

    Tomlinson, Alan; Hair, Mario; McFadyen, Angus

    2013-10-01

    Dry eye is a multifactorial disease which would require a broad spectrum of test measures in the monitoring of its treatment and diagnosis. However, studies have typically reported improvements in individual measures with treatment. Alternative approaches involve multiple, combined outcomes being assessed by different statistical analyses. In order to assess the effect of various statistical approaches to the use of single and combined test measures in dry eye, this review reanalyzed measures from two previous studies (osmolarity, evaporation, tear turnover rate, and lipid film quality). These analyses assessed the measures as single variables within groups, pre- and post-intervention with a lubricant supplement, by creating combinations of these variables and by validating these combinations with the combined sample of data from all groups of dry eye subjects. The effectiveness of single measures and combinations in diagnosis of dry eye was also considered. Copyright © 2013. Published by Elsevier Inc.

  6. A 67-Item Stress Resilience item bank showing high content validity was developed in a psychosomatic sample.

    Science.gov (United States)

    Obbarius, Nina; Fischer, Felix; Obbarius, Alexander; Nolte, Sandra; Liegl, Gregor; Rose, Matthias

    2018-04-10

    To develop the first item bank to measure Stress Resilience (SR) in clinical populations. Qualitative item development resulted in an initial pool of 131 items covering a broad theoretical SR concept. These items were tested in n=521 patients at a psychosomatic outpatient clinic. Exploratory and Confirmatory Factor Analysis (CFA), as well as other state-of-the-art item analyses and IRT were used for item evaluation and calibration of the final item bank. Out of the initial item pool of 131 items, we excluded 64 items (54 factor loading .3, 2 non-discriminative Item Response Curves, 4 Differential Item Functioning). The final set of 67 items indicated sufficient model fit in CFA and IRT analyses. Additionally, a 10-item short form with high measurement precision (SE≤.32 in a theta range between -1.8 and +1.5) was derived. Both the SR item bank and the SR short form were highly correlated with an existing static legacy tool (Connor-Davidson Resilience Scale). The final SR item bank and 10-item short form showed good psychometric properties. When further validated, they will be ready to be used within a framework of Computer-Adaptive Tests for a comprehensive assessment of the Stress-Construct. Copyright © 2018. Published by Elsevier Inc.

  7. Negative affect impairs associative memory but not item memory.

    OpenAIRE

    Bisby, J. A.; Burgess, N.

    2014-01-01

    The formation of associations between items and their context has been proposed to rely on mechanisms distinct from those supporting memory for a single item. Although emotional experiences can profoundly affect memory, our understanding of how it interacts with different aspects of memory remains unclear. We performed three experiments to examine the effects of emotion on memory for items and their associations. By presenting neutral and negative items with background contexts, Experiment 1 ...

  8. Does the Order of Item Difficulty of the Addenbrooke's Cognitive Examination Add Anything to Subdomain Scores in the Clinical Assessment of Dementia?

    Science.gov (United States)

    McGrory, Sarah; Starr, John M; Shenkin, Susan D; Austin, Elizabeth J; Hodges, John R

    2015-01-01

    The Addenbrooke's Cognitive Examination (ACE) is used to measure cognition across a range of domains in dementia. Identifying the order in which cognitive decline occurs across items, and whether this varies between dementia aetiologies could add more information to subdomain scores. ACE-Revised data from 350 patients were split into three groups: Alzheimer's type (n = 131), predominantly frontal (n = 119) and other frontotemporal lobe degenerative disorders (n = 100). Results of factor analysis and Mokken scaling analysis were compared. Principal component analysis revealed one factor for each group. Confirmatory factor analysis found that the one-factor model fit two samples poorly. Mokken analyses revealed different item ordering in terms of difficulty for each group. The different patterns for each diagnostic group could aid in the separation of these different types of dementia.

  9. Does the Order of Item Difficulty of the Addenbrooke's Cognitive Examination Add Anything to Subdomain Scores in the Clinical Assessment of Dementia

    Directory of Open Access Journals (Sweden)

    Sarah McGrory

    2015-04-01

    Background: The Addenbrooke's Cognitive Examination (ACE) is used to measure cognition across a range of domains in dementia. Identifying the order in which cognitive decline occurs across items, and whether this varies between dementia aetiologies could add more information to subdomain scores. Method: ACE-Revised data from 350 patients were split into three groups: Alzheimer's type (n = 131), predominantly frontal (n = 119) and other frontotemporal lobe degenerative disorders (n = 100). Results of factor analysis and Mokken scaling analysis were compared. Results: Principal component analysis revealed one factor for each group. Confirmatory factor analysis found that the one-factor model fit two samples poorly. Mokken analyses revealed different item ordering in terms of difficulty for each group. Conclusion: The different patterns for each diagnostic group could aid in the separation of these different types of dementia.

  10. Assessment of endogenous dopamine release by methylphenidate challenge using iodine-123 iodobenzamide single-photon emission tomography

    International Nuclear Information System (INIS)

    Booij, J.; Korn, P.; Linszen, D.H.; Royen, E.A. van

    1997-01-01

    This double-blind, placebo-controlled study assessed pharmacologically induced endogenous dopamine (DA) release in healthy male volunteers (n=12). Changes in endogenous DA release after injection of the psychostimulant drug methylphenidate were evaluated by single-photon emission tomography (SPET) and constant infusion of iodine-123 iodobenzamide ([123I]IBZM), a D2 receptor radioligand that is sensitive to endogenous DA release. Methylphenidate induced displacement of striatal [123I]IBZM binding, resulting in a significant decrease in the specific to non-specific [123I]IBZM uptake ratio (average: 8.6%) in comparison with placebo (average: -1.9%). Moreover, injection of methylphenidate induced significant behavioural responses on the following items: excitement, anxiety, tension, and mannerisms and posturing. The results of this study demonstrate the feasibility of using constant infusion of [123I]IBZM and SPET imaging to measure endogenous DA release after methylphenidate challenge and to investigate neurochemical aspects of behaviour. (orig.). With 2 figs., 1 tab

  11. Validating the 11-Item Revised University of California Los Angeles Scale to Assess Loneliness Among Older Adults: An Evaluation of Factor Structure and Other Measurement Properties.

    Science.gov (United States)

    Lee, Joonyup; Cagle, John G

    2017-11-01

    To examine the measurement properties and factor structure of the short version of the Revised University of California Los Angeles (R-UCLA) loneliness scale from the Health and Retirement Study (HRS). Based on data from 3,706 HRS participants aged 65+ who completed the 2012 wave of the HRS and its Psychosocial Supplement, the measurement properties and factorability of the R-UCLA were examined by conducting an exploratory factor analysis (EFA) and a confirmatory factor analysis (CFA) on randomly split halves. The average score for the 11-item loneliness scale was 16.4 (standard deviation: 4.5). An evaluation of the internal consistency produced a Cronbach's α of 0.87. Results from the EFA showed that two- and three-factor models were appropriate. However, based on the results of the CFA, only a two-factor model was determined to be suitable because there was a very high correlation between two factors identified in the three-factor model, available social connections and sense of belonging. This study provides important data on the properties of the 11-item R-UCLA scale by identifying a two-factor model of loneliness: feeling isolated and available social connections. Our findings suggest the 11-item R-UCLA has good factorability and internal reliability. Copyright © 2017 American Association for Geriatric Psychiatry. Published by Elsevier Inc. All rights reserved.
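
    The internal-consistency figure quoted above is a Cronbach's α. A minimal sketch of the usual formula, α = k/(k-1) × (1 - Σ item variances / variance of total scores), is given below; the function name and the tiny response matrix are hypothetical and unrelated to the HRS data.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a respondents-by-items score matrix.

    responses: list of rows, one per respondent, each holding the
        scores on the same k items (k >= 2, at least 2 respondents).
    """
    k = len(responses[0])
    if k < 2 or any(len(row) != k for row in responses):
        raise ValueError("need at least two items and rows of equal length")
    item_columns = list(zip(*responses))
    # Sample variance of each item, and of each respondent's total score.
    item_variance_sum = sum(variance(column) for column in item_columns)
    total_variance = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical answers from three respondents to a two-item scale.
toy_responses = [[1, 2], [2, 1], [3, 3]]
print(cronbach_alpha(toy_responses))
```

    With an 11-item scale such as the one in this record, each row would simply hold 11 scores instead of two.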

  12. A comparison of three methods of assessing differential item functioning (DIF) in the Hospital Anxiety Depression Scale: ordinal logistic regression, Rasch analysis and the Mantel chi-square procedure.

    Science.gov (United States)

    Cameron, Isobel M; Scott, Neil W; Adler, Mats; Reid, Ian C

    2014-12-01

    It is important for clinical practice and research that measurement scales of well-being and quality of life exhibit only minimal differential item functioning (DIF). DIF occurs where different groups of people endorse items in a scale to different extents after being matched by the intended scale attribute. We investigate the equivalence or otherwise of common methods of assessing DIF. Three methods of measuring age- and sex-related DIF (ordinal logistic regression, Rasch analysis and Mantel χ² procedure) were applied to Hospital Anxiety Depression Scale (HADS) data pertaining to a sample of 1,068 patients consulting primary care practitioners. Three items were flagged by all three approaches as having either age- or sex-related DIF with a consistent direction of effect; a further three items identified did not meet stricter criteria for important DIF using at least one method. When applying strict criteria for significant DIF, ordinal logistic regression was slightly less sensitive. Ordinal logistic regression, Rasch analysis and contingency table methods yielded consistent results when identifying DIF in the HADS depression and HADS anxiety scales. Regardless of methods applied, investigators should use a combination of statistical significance, magnitude of the DIF effect and investigator judgement when interpreting the results.

  13. Protein single-model quality assessment by feature-based probability density functions.

    Science.gov (United States)

    Cao, Renzhi; Cheng, Jianlin

    2016-04-04

    Protein quality assessment (QA) has played an important role in protein structure prediction. We developed a novel single-model quality assessment method, Qprob. Qprob calculates the absolute error for each protein feature value against the true quality scores (i.e. GDT-TS scores) of protein structural models, and uses them to estimate its probability density distribution for quality assessment. Qprob has been blindly tested on the 11th Critical Assessment of Techniques for Protein Structure Prediction (CASP11) as MULTICOM-NOVEL server. The official CASP result shows that Qprob ranks as one of the top single-model QA methods. In addition, Qprob makes contributions to our protein tertiary structure predictor MULTICOM, which is officially ranked 3rd out of 143 predictors. The good performance shows that Qprob is good at assessing the quality of models of hard targets. These results demonstrate that this new probability density distribution based method is effective for protein single-model quality assessment and is useful for protein structure prediction. The webserver of Qprob is available at: http://calla.rnet.missouri.edu/qprob/. The software is now freely available in the web server of Qprob.

  14. Evaluation of psychometric properties and differential item functioning of 8-item Child Perceptions Questionnaires using item response theory.

    Science.gov (United States)

    Yau, David T W; Wong, May C M; Lam, K F; McGrath, Colman

    2015-08-19

    Four-factor structure of the two 8-item short forms of Child Perceptions Questionnaire CPQ11-14 (RSF:8 and ISF:8) has been confirmed. However, the sum scores are typically reported in practice as a proxy of Oral health-related Quality of Life (OHRQoL), which implied a unidimensional structure. This study first assessed the unidimensionality of 8-item short forms of CPQ11-14. Item response theory (IRT) was employed to offer an alternative and complementary approach of validation and to overcome the limitations of classical test theory assumptions. A random sample of 649 12-year-old school children in Hong Kong was analyzed. Unidimensionality of the scale was tested by confirmatory factor analysis (CFA), principal component analysis (PCA) and local dependency (LD) statistic. Graded response model was fitted to the data. Contribution of each item to the scale was assessed by item information function (IIF). Reliability of the scale was assessed by test information function (TIF). Differential item functioning (DIF) across gender was identified by Wald test and expected score functions. Both CPQ11-14 RSF:8 and ISF:8 did not deviate much from the unidimensionality assumption. Results from CFA indicated acceptable fit of the one-factor model. PCA indicated that the first principal component explained >30% of the total variation with high factor loadings for both RSF:8 and ISF:8. Almost all LD statistic items suggesting little contribution of information to the scale and item removal caused little practical impact. Comparing the TIFs, RSF:8 showed slightly better information than ISF:8. In addition to oral symptoms items, the item "Concerned with what other people think" demonstrated a uniform DIF (p Items related to oral symptoms were not informative to OHRQoL and deletion of these items is suggested. The impact of DIF across gender on the overall score was minimal. CPQ11-14 RSF:8 performed slightly better than ISF:8 in measurement precision. The 6-item short forms
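
    The graded response model named in this abstract can be sketched as follows. The code uses Samejima's standard formulation, in which the probability of responding in category k or higher is a logistic function of the trait level; the function name and parameter values are illustrative assumptions, not estimates from the CPQ11-14 data.

```python
import math


def grm_probabilities(theta, discrimination, thresholds):
    """Category probabilities under the graded response model.

    theta: person location on the latent trait.
    discrimination: item slope (a parameter), assumed positive.
    thresholds: increasing between-category thresholds (b parameters);
        m thresholds define m + 1 ordered response categories.
    """
    if list(thresholds) != sorted(thresholds):
        raise ValueError("thresholds must be in increasing order")
    # P(X >= k) for k = 0..m+1, with the boundary values 1 and 0.
    at_or_above = [1.0]
    for b in thresholds:
        at_or_above.append(1 / (1 + math.exp(-discrimination * (theta - b))))
    at_or_above.append(0.0)
    # Each category probability is the drop between adjacent boundaries.
    return [at_or_above[k] - at_or_above[k + 1]
            for k in range(len(thresholds) + 1)]


# Hypothetical three-category item with thresholds at -1 and +1.
grm_probs = grm_probabilities(theta=0.0, discrimination=1.0,
                              thresholds=[-1.0, 1.0])
print(grm_probs)
```

    Item and test information functions such as those in the abstract are derived from these category probabilities and their slopes with respect to theta.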

  15. Evaluation of item candidates for a diabetic retinopathy quality of life item bank.

    Science.gov (United States)

    Fenwick, Eva K; Pesudovs, Konrad; Khadka, Jyoti; Rees, Gwyn; Wong, Tien Y; Lamoureux, Ecosse L

    2013-09-01

    We are developing an item bank assessing the impact of diabetic retinopathy (DR) on quality of life (QoL) using a rigorous multi-staged process combining qualitative and quantitative methods. We describe here the first two qualitative phases: content development and item evaluation. After a comprehensive literature review, items were generated from four sources: (1) 34 previously validated patient-reported outcome measures; (2) five published qualitative articles; (3) eight focus groups and 18 semi-structured interviews with 57 DR patients; and (4) seven semi-structured interviews with diabetes or ophthalmic experts. Items were then evaluated during 3 stages, namely binning (grouping) and winnowing (reduction) based on key criteria and panel consensus; development of item stems and response options; and pre-testing of items via cognitive interviews with patients. The content development phase yielded 1,165 unique items across 7 QoL domains. After 3 sessions of binning and winnowing, items were reduced to a minimally representative set (n = 312) across 9 domains of QoL: visual symptoms; ocular surface symptoms; activity limitation; mobility; emotional; health concerns; social; convenience; and economic. After 8 cognitive interviews, 42 items were amended resulting in a final set of 314 items. We have employed a systematic approach to develop items for a DR-specific QoL item bank. The psychometric properties of the nine QoL subscales will be assessed using Rasch analysis. The resulting validated item bank will allow clinicians and researchers to better understand the QoL impact of DR and DR therapies from the patient's perspective.

  16. Single mothers' self-assessment of health: a systematic exploration of the literature.

    Science.gov (United States)

    Rousou, E; Kouta, C; Middleton, N; Karanikola, M

    2013-12-01

    This study aimed to explore single mothers' self-assessed level of health status compared to partnered mothers and the relevant factors associated with it. The number of single-mother families is increasing worldwide. A large body of international research reveals that single mothers experience poorer physical and mental health than their married counterparts. An important contributory factor for this health disparity appears to be socio-economic disadvantage. A systematic search of the literature was conducted using the keywords 'lone' or 'single' and 'mother*' or 'parent*' or 'family structure' in combination with 'health'. EMBASE, CINAHL, COCHRANE and PUBMED databases were searched for quantitative research studies published in the past decade. Eleven quantitative research articles with self-assessment of health status in single mothers were identified. Single mothers report lower levels of health status compared to partnered mothers. These inequalities appear to be associated with financial hardship and lack of social support. Both these factors increase single mothers' susceptibility to stress and illness. Despite the study limitations (e.g. results based mainly on secondary data from household surveys), it provides evidence that single motherhood places women in an adverse social position that is associated with prolonged stress mainly due to unemployment, economic hardship and social exclusion, which affects negatively their health status. These findings can be seen as a challenge for health professionals, especially those working in the community sector and policy makers too, to establish supportive measures for this vulnerable group focused on socio-economic factors. © 2013 International Council of Nurses.

  17. SHIPPING OF RADIOACTIVE ITEMS

    CERN Multimedia

    TIS/RP Group

    2001-01-01

    The TIS-RP group informs users that shipping of small radioactive items is normally guaranteed within 24 hours from the time the material is handed in at the TIS-RP service. This time is imposed by the necessary procedures (identification of the radionuclides, determination of dose rate, preparation of the package and related paperwork). Large and massive objects require a longer procedure and will therefore take longer.

  18. Selecting Lower Priced Items.

    Science.gov (United States)

    Kleinert, Harold L.; And Others

    1988-01-01

    A program used to teach moderately to severely mentally handicapped students to select the lower priced items in actual shopping activities is described. Through a five-phase process, students are taught to compare prices themselves as well as take into consideration variations in the sizes of containers and varying product weights. (VW)

  19. The Role of Item Models in Automatic Item Generation

    Science.gov (United States)

    Gierl, Mark J.; Lai, Hollis

    2012-01-01

    Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…

  20. Item information and discrimination functions for trinary PCM items

    NARCIS (Netherlands)

    Akkermans, Wies; Muraki, Eiji

    1997-01-01

    For trinary partial credit items the shape of the item information and the item discrimination function is examined in relation to the item parameters. In particular, it is shown that these functions are unimodal if δ2 – δ1 < 4 ln 2 and bimodal otherwise. The locations and values of the maxima are
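    The unimodality condition quoted above can be checked numerically. The following is a minimal sketch, assuming the standard partial credit model with unit discrimination and scores 0, 1 and 2, for which the item information equals the conditional variance of the item score. The function names, the grid and the example thresholds are illustrative choices, not taken from the record.

```python
import math

def pcm_probs(theta, d1, d2):
    """Category probabilities (scores 0, 1, 2) of a trinary partial credit item."""
    # Log-weights are cumulative sums of (theta - delta_j); score 0 has weight 1.
    logits = [0.0, theta - d1, 2.0 * theta - d1 - d2]
    m = max(logits)  # subtract the maximum for numerical stability
    weights = [math.exp(z - m) for z in logits]
    total = sum(weights)
    return [w / total for w in weights]

def pcm_information(theta, d1, d2):
    """Item information, computed as the variance of the item score at theta."""
    p = pcm_probs(theta, d1, d2)
    mean = p[1] + 2.0 * p[2]
    return p[1] + 4.0 * p[2] - mean ** 2

def count_modes(d1, d2, lo=-10.0, hi=10.0, n=4001):
    """Count local maxima of the information function on an evenly spaced grid."""
    step = (hi - lo) / (n - 1)
    info = [pcm_information(lo + i * step, d1, d2) for i in range(n)]
    return sum(1 for i in range(1, n - 1) if info[i - 1] < info[i] >= info[i + 1])

threshold = 4.0 * math.log(2.0)   # about 2.77
print(count_modes(-1.0, 1.0))     # delta2 - delta1 = 2, below the threshold
print(count_modes(-2.0, 2.0))     # delta2 - delta1 = 4, above the threshold
```

    With the two example items, the first threshold gap lies below 4 ln 2 and the second above it, so the grid search should report one and two maxima respectively.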

  1. Validation of the 36-item version of the WHO Disability Assessment Schedule 2.0 (WHODAS 2.0) for assessing women's disability and functioning associated with maternal morbidity.

    Science.gov (United States)

    Silveira, Carla; Parpinelli, Mary Angela; Pacagnella, Rodolfo Carvalho; Andreucci, Carla Betina; Angelini, Carina Robles; Ferreira, Elton Carlos; Cecatti, José Guilherme

    2017-02-01

    Objective: To validate the translation and adaptation to Brazilian Portuguese of 36 items from the World Health Organization Disability Assessment Schedule 2.0 (WHODAS 2.0), regarding their content and structure (construct), in a female population after pregnancy. Methods: This is a validation of an instrument for the evaluation of disability and functioning and an assessment of its psychometric properties, performed in a tertiary maternity and a referral center specialized in high-risk pregnancies in Brazil. A sample of 638 women in different postpartum periods who had either a normal or a complicated pregnancy was included. The structure was evaluated by exploratory factor analysis (EFA) and confirmatory factor analysis (CFA), while the content and relationships among the domains were assessed through Pearson's correlation coefficient. The sociodemographic characteristics were identified, and the mean scores with their standard deviations for the 36 questions of the WHODAS 2.0 were calculated. The internal consistency was evaluated by Cronbach's α. Results: Cronbach's α was higher than 0.79 for both sets of questions of the questionnaire. The EFA and CFA for the main 32 questions exhibited a total variance of 54.7% (Kaiser-Meyer-Olkin [KMO] measure of sampling adequacy = 0.934; p < 0.001) and 53.47% (KMO = 0.934; p < 0.001) respectively. There was a significant correlation among the 6 domains (r = 0.571-0.876), and a moderate correlation among all domains (r = 0.476-0.694). Conclusion: The version of the WHODAS 2.0 instrument adapted to Brazilian Portuguese showed good psychometric properties in this sample, and therefore could be applied to populations of women regarding their reproductive history. Thieme-Revinter Publicações Ltda Rio de Janeiro, Brazil.
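    For readers unfamiliar with the internal-consistency statistic mentioned above, here is a minimal sketch of Cronbach's α using its textbook formula, α = k/(k-1) · (1 - Σ item variances / variance of the total score). The function name and the toy response matrix are hypothetical and are not data from the study.

```python
from statistics import variance

def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Sample variance of the respondents' total scores.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1.0 - sum(item_vars) / total_var)

# Hypothetical responses of five people to three items (illustration only).
toy = [[3, 4, 3], [2, 2, 3], [5, 4, 4], [1, 2, 2], [4, 5, 5]]
print(round(cronbach_alpha(toy), 3))
```

    Values close to 1 indicate that the items vary together; the threshold that counts as acceptable depends on the application.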

  2. Modeling and Stability Assessment of Single-Phase Grid Synchronization Techniques

    DEFF Research Database (Denmark)

    Golestan, Saeed; Guerrero, Josep M.; Vasquez, Juan

    2018-01-01

    (GSTs) is of vital importance. This task is most often based on obtaining a linear time-invariant (LTI) model for the GST and applying standard stability tests to it. Another option is modeling and dynamics/stability assessment of GSTs in the linear time-periodic (LTP) framework, which has received...... a very little attention. In this letter, the procedure of deriving the LTP model for single-phase GSTs is first demonstrated. The accuracy of the LTP model in predicting the GST dynamic behavior and stability is then evaluated and compared with that of the LTI one. Two well-known single-phase GSTs, i...

  3. Differential item functioning analysis of the Vanderbilt Expertise Test for cars.

    Science.gov (United States)

    Lee, Woo-Yeol; Cho, Sun-Joo; McGugin, Rankin W; Van Gulick, Ana Beth; Gauthier, Isabel

    2015-01-01

    The Vanderbilt Expertise Test for cars (VETcar) is a test of visual learning for contemporary car models. We used item response theory to assess the VETcar and in particular used differential item functioning (DIF) analysis to ask if the test functions the same way in laboratory versus online settings and for different groups based on age and gender. An exploratory factor analysis found evidence of multidimensionality in the VETcar, although a single dimension was deemed sufficient to capture the recognition ability measured by the test. We selected a unidimensional three-parameter logistic item response model to examine item characteristics and subject abilities. The VETcar had satisfactory internal consistency. A substantial number of items showed DIF at a medium effect size for test setting and for age group, whereas gender DIF was negligible. Because online subjects were on average older than those tested in the lab, we focused on the age groups to conduct a multigroup item response theory analysis. This revealed that most items on the test favored the younger group. DIF could be more the rule than the exception when measuring performance with familiar object categories, therefore posing a challenge for the measurement of either domain-general visual abilities or category-specific knowledge.

  4. The Long-Term Conditions Questionnaire: conceptual framework and item development.

    Science.gov (United States)

    Peters, Michele; Potter, Caroline M; Kelly, Laura; Hunter, Cheryl; Gibbons, Elizabeth; Jenkinson, Crispin; Coulter, Angela; Forder, Julien; Towers, Ann-Marie; A'Court, Christine; Fitzpatrick, Ray

    2016-01-01

    To identify the main issues of importance when living with long-term conditions to refine a conceptual framework for informing the item development of a patient-reported outcome measure for long-term conditions. Semi-structured qualitative interviews (n=48) were conducted with people living with at least one long-term condition. Participants were recruited through primary care. The interviews were transcribed verbatim and analyzed by thematic analysis. The analysis served to refine the conceptual framework, based on reviews of the literature and stakeholder consultations, for developing candidate items for a new measure for long-term conditions. Three main organizing concepts were identified: impact of long-term conditions, experience of services and support, and self-care. The findings helped to refine a conceptual framework, leading to the development of 23 items that represent issues of importance in long-term conditions. The 23 candidate items formed the first draft of the measure, currently named the Long-Term Conditions Questionnaire. The aim of this study was to refine the conceptual framework and develop items for a patient-reported outcome measure for long-term conditions, including single and multiple morbidities and physical and mental health conditions. Qualitative interviews identified the key themes for assessing outcomes in long-term conditions, and these underpinned the development of the initial draft of the measure. These initial items will undergo cognitive testing to refine the items prior to further validation in a survey.

  5. Item Banking with Embedded Standards

    Science.gov (United States)

    MacCann, Robert G.; Stanley, Gordon

    2009-01-01

    An item banking method that does not use Item Response Theory (IRT) is described. This method provides a comparable grading system across schools that would be suitable for low-stakes testing. It uses the Angoff standard-setting method to obtain item ratings that are stored with each item. An example of such a grading system is given, showing how…

  6. Vegetable parenting practices scale: Item response modeling analyses

    Science.gov (United States)

    Our objective was to evaluate the psychometric properties of a vegetable parenting practices scale using multidimensional polytomous item response modeling which enables assessing item fit to latent variables and the distributional characteristics of the items in comparison to the respondents. We al...

  7. Item response theory - A first approach

    Science.gov (United States)

    Nunes, Sandra; Oliveira, Teresa; Oliveira, Amílcar

    2017-07-01

    The Item Response Theory (IRT) has become one of the most popular scoring frameworks for measurement data, frequently used in computerized adaptive testing, cognitively diagnostic assessment and test equating. According to Andrade et al. (2000), IRT can be defined as a set of mathematical models (Item Response Models - IRM) constructed to represent the probability of an individual giving the right answer to an item of a particular test. The number of Item Response Models available to measurement analysis has increased considerably in the last fifteen years due to increasing computer power and due to a demand for accuracy and more meaningful inferences grounded in complex data. The developments in modeling with Item Response Theory were related to developments in estimation theory, most remarkably Bayesian estimation with Markov chain Monte Carlo algorithms (Patz & Junker, 1999). The popularity of Item Response Theory has also implied numerous overviews in books and journals, and many connections between IRT and other statistical estimation procedures, such as factor analysis and structural equation modeling, have been made repeatedly (van der Linden & Hambleton, 1997). As stated before, the Item Response Theory covers a variety of measurement models, ranging from basic one-dimensional models for dichotomously and polytomously scored items and their multidimensional analogues to models that incorporate information about cognitive sub-processes which influence the overall item response process. The aim of this work is to introduce the main concepts associated with one-dimensional models of Item Response Theory, to specify the logistic models with one, two and three parameters, to discuss some properties of these models and to present the main estimation procedures.
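    The logistic models with one, two and three parameters named above share one functional form. The sketch below uses the common textbook parameterisation (difficulty b, discrimination a, pseudo-guessing c), which is an assumption here and not necessarily the notation of the paper. The function name and example values are illustrative.

```python
import math

def irf(theta, b, a=1.0, c=0.0):
    """Probability of a correct response under the three-parameter logistic model.

    theta: person ability, b: item difficulty,
    a: item discrimination (left at a common value, c = 0, this is the 1PL model),
    c: lower asymptote or pseudo-guessing (c = 0 gives the 2PL model).
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

# When ability equals difficulty, the 1PL and 2PL models give probability 0.5,
# while the 3PL model gives c + (1 - c) / 2.
print(irf(0.5, b=0.5))                 # 1PL
print(irf(0.5, b=0.5, a=2.0))          # 2PL
print(irf(0.5, b=0.5, a=2.0, c=0.2))   # 3PL
```

    Estimating a, b and c from response data is a separate step (for example by marginal maximum likelihood or MCMC) and is not shown here.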

  8. SHIPPING OF RADIOACTIVE ITEMS

    CERN Multimedia

    TIS/RP Group

    2001-01-01

    The TIS-RP group informs users that shipping of small radioactive items is normally guaranteed within 24 hours from the time the material is handed in at the TIS-RP service. This time is imposed by the necessary procedures (identification of the radionuclides, determination of dose rate, preparation of the package and related paperwork). Large and massive objects require a longer procedure and will therefore take longer.

  9. The MCRA model for probabilistic single-compound and cumulative risk assessment of pesticides.

    Science.gov (United States)

    van der Voet, Hilko; de Boer, Waldo J; Kruisselbrink, Johannes W; Goedhart, Paul W; van der Heijden, Gerie W A M; Kennedy, Marc C; Boon, Polly E; van Klaveren, Jacob D

    2015-05-01

    Pesticide risk assessment is hampered by worst-case assumptions leading to overly pessimistic assessments. On the other hand, cumulative health effects of similar pesticides are often not taken into account. This paper describes models and a web-based software system developed in the European research project ACROPOLIS. The models are appropriate for both acute and chronic exposure assessments of single compounds and of multiple compounds in cumulative assessment groups. The software system MCRA (Monte Carlo Risk Assessment) is available for stakeholders in pesticide risk assessment at mcra.rivm.nl. We describe the MCRA implementation of the methods as advised in the 2012 EFSA Guidance on probabilistic modelling, as well as more refined methods developed in the ACROPOLIS project. The emphasis is on cumulative assessments. Two approaches, sample-based and compound-based, are contrasted. It is shown that additional data on agricultural use of pesticides may give more realistic risk assessments. Examples are given of model and software validation of acute and chronic assessments, using both simulated data and comparisons against the previous release of MCRA and against the standard software DEEM-FCID used by the Environmental Protection Agency in the USA. It is shown that the EFSA Guidance pessimistic model may not always give an appropriate modelling of exposure. Crown Copyright © 2014. Published by Elsevier Ltd. All rights reserved.

  10. Science Library of Test Items. Volume Twenty-Two. A Collection of Multiple Choice Test Items Relating Mainly to Skills.

    Science.gov (United States)

    New South Wales Dept. of Education, Sydney (Australia).

    As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…

  11. Science Library of Test Items. Volume Eighteen. A Collection of Multiple Choice Test Items Relating Mainly to Chemistry.

    Science.gov (United States)

    New South Wales Dept. of Education, Sydney (Australia).

    As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…

  12. Science Library of Test Items. Volume Twenty. A Collection of Multiple Choice Test Items Relating Mainly to Physics, 1.

    Science.gov (United States)

    New South Wales Dept. of Education, Sydney (Australia).

    As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…

  13. Science Library of Test Items. Volume Seventeen. A Collection of Multiple Choice Test Items Relating Mainly to Biology.

    Science.gov (United States)

    New South Wales Dept. of Education, Sydney (Australia).

    As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…

  14. Science Library of Test Items. Volume Nineteen. A Collection of Multiple Choice Test Items Relating Mainly to Geology.

    Science.gov (United States)

    New South Wales Dept. of Education, Sydney (Australia).

    As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…

  15. APOLLO: a quality assessment service for single and multiple protein models.

    Science.gov (United States)

    Wang, Zheng; Eickholt, Jesse; Cheng, Jianlin

    2011-06-15

    We built a web server named APOLLO, which can evaluate the absolute global and local qualities of a single protein model using machine learning methods or the global and local qualities of a pool of models using a pair-wise comparison approach. Based on our evaluations on 107 CASP9 (Critical Assessment of Techniques for Protein Structure Prediction) targets, the predicted quality scores generated from our machine learning and pair-wise methods have an average per-target correlation of 0.671 and 0.917, respectively, with the true model quality scores. Based on our test on 92 CASP9 targets, our predicted absolute local qualities have an average difference of 2.60 Å with the actual distances to native structure. http://sysbio.rnet.missouri.edu/apollo/. Single and pair-wise global quality assessment software is also available at the site.

  16. Assessment of single-shell tank residual-liquid issues at Hanford Site, Washington

    International Nuclear Information System (INIS)

    Murthy, K.S.; Stout, L.A.; Napier, B.A.; Reisenauer, A.E.; Landstrom, D.K.

    1983-06-01

    This report provides an assessment of the overall effectiveness and implications of jet pumping the interstitial liquids (IL) from single-shell tanks at Hanford. The jet-pumping program, currently in progress at Hanford, involves the planned removal of IL contained in 89 of the 149 single-shell tanks and its transfer to double-shell tanks after volume reduction by evaporation. The purpose of this report is to estimate the public and worker doses associated with (1) terminating pumping immediately, (2) pumping to a 100,000-gal limit per tank, (3) pumping to a 50,000-gal limit per tank, and (4) pumping to the maximum practical liquid removal level of 30,000 gal. Assessment of the cost-effectiveness of these various levels of pumping in minimizing any undue health and safety risks to the public or worker is also presented

  17. Qualitative and quantitative assessment of single fingerprints in forensic DNA analysis.

    Science.gov (United States)

    Ostojic, Lana; Klempner, Stacey A; Patel, Rosni A; Mitchell, Adele A; Axler-DiPerte, Grace L; Wurmbach, Elisa

    2014-11-01

    Fingerprints and touched items are important sources of DNA for STR profiling, since this evidence can be recovered in a wide variety of criminal offenses. However, there are some fundamental difficulties in working with these samples, including variability in quantity and quality of extracted DNA. In this study, we collected and analyzed over 700 fingerprints. We compared a commercially available extraction protocol (Zygem) to two methods developed in our laboratory, a simple one-tube protocol and a high sensitivity protocol (HighSens) that includes additional steps to concentrate and purify the DNA. The amplification protocols tested were AmpFLSTR® Identifiler® using either 28 or 31 amplification cycles, and Identifiler® Plus using 32 amplification cycles. We found that the HighSens and Zygem extraction methods were significantly better in their DNA yields than the one-tube method. Identifiler® Plus increased the quality of the STR profiles for the one-tube extraction significantly. However, this effect could not be verified for the other extraction methods. Furthermore, microscopic analysis of single fingerprints revealed that some individuals tended to shed more material than others onto glass slides. However, a dense deposition of skin flakes did not strongly correlate with a high quality STR profile. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  18. Development of a lack of appetite item bank for computer-adaptive testing (CAT)

    DEFF Research Database (Denmark)

    Thamsborg, Lise Laurberg Holst; Petersen, Morten Aa; Aaronson, Neil K

    2015-01-01

    to 12 lack of appetite items. CONCLUSIONS: Phases 1-3 resulted in 12 lack of appetite candidate items. Based on a field testing (phase 4), the psychometric characteristics of the items will be assessed and the final item bank will be generated. This CAT item bank is expected to provide precise...

  19. Automated Item Generation with Recurrent Neural Networks.

    Science.gov (United States)

    von Davier, Matthias

    2018-03-12

    Utilizing technology for automated item generation is not a new idea. However, test items used in commercial testing programs or in research are still predominantly written by humans, in most cases by content experts or professional item writers. Human experts are a limited resource and testing agencies incur high costs in the process of continuous renewal of item banks to sustain testing programs. Using algorithms instead holds the promise of providing unlimited resources for this crucial part of assessment development. The approach presented here deviates in several ways from previous attempts to solve this problem. In the past, automatic item generation relied either on generating clones of narrowly defined item types such as those found in language-free intelligence tests (e.g., Raven's progressive matrices) or on an extensive analysis of task components and derivation of schemata to produce items with pre-specified variability that are hoped to have predictable levels of difficulty. It is somewhat unlikely that researchers utilizing these previous approaches would look at the proposed approach with favor; however, recent applications of machine learning show success in solving tasks that seemed impossible for machines not too long ago. The proposed approach uses deep learning to implement probabilistic language models, not unlike what Google Brain and Amazon Alexa use for language processing and generation.

  20. Prospective randomized assessment of single versus double-gloving for general surgical procedures.

    Science.gov (United States)

    Na'aya, H U; Madziga, A G; Eni, U E

    2009-01-01

    There is an increased tendency towards double-gloving by general surgeons in our practice, due probably to awareness of the risk of contamination with blood or other body fluids during surgery. The aim of the study was to compare the relative frequency of glove puncture in single-glove versus double-glove sets in general surgical procedures, and to determine if duration of surgery affects perforation rate. Surgeons at random don single or double gloves at their discretion, for general surgical procedures. All the gloves used by the surgeons were assessed immediately after surgery for perforation. A total of 1120 gloves were tested, of which 880 were double-glove sets and 240 single-glove sets. There was no significant difference in the overall perforation rate between single and double glove sets (18.3% versus 20%). However, only 2.3% had perforations in both the outer and inner gloves in the double glove group. Therefore, there was significantly greater risk for blood-skin exposure in the single glove sets (p < 0.01). The perforation rate was also significantly greater during procedures lasting an hour or more compared to those lasting less than an hour (p < 0.01). Double-gloving reduces the risk of blood-skin contamination in all general surgical procedures, and especially so in procedures lasting an hour or more.

  1. Deterministic and Probabilistic Serviceability Assessment of Footbridge Vibrations due to a Single Walker Crossing

    Directory of Open Access Journals (Sweden)

    Cristoforo Demartino

    2018-01-01

    This paper presents a numerical study on the deterministic and probabilistic serviceability assessment of footbridge vibrations due to a single walker crossing. The dynamic response of the footbridge is analyzed by means of modal analysis, considering only the first lateral and vertical modes. Single span footbridges with uniform mass distribution are considered, with different values of the span length, natural frequencies, mass, and structural damping and with different support conditions. The load induced by a single walker crossing the footbridge is modeled as a moving sinusoidal force either in the lateral or in the vertical direction. The variability of the characteristics of the load induced by walkers is modeled using probability distributions taken from the literature defining a Standard Population of walkers. Deterministic and probabilistic approaches were adopted to assess the peak response. Based on the results of the simulations, deterministic and probabilistic vibration serviceability assessment methods are proposed, not requiring numerical analyses. Finally, an example of the application of the proposed method to a truss steel footbridge is presented. The results highlight the advantages of the probabilistic procedure in terms of reliability quantification.
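    To illustrate the kind of deterministic calculation described above, the sketch below integrates the first vertical mode of a simply supported, uniform-mass span under a moving sinusoidal point force. It is a simplified stand-in, not the authors' model: the mode shape sin(πx/L), the semi-implicit Euler integrator, the function name and every numerical value (span, mass, damping, walking speed, force amplitude) are assumptions chosen only for illustration.

```python
import math

def peak_midspan_acceleration(span, mass_per_m, f_n, zeta, f_step, speed,
                              amplitude, dt=0.001):
    """Peak mid-span acceleration (m/s^2) of the first vertical mode while a
    force amplitude*sin(2*pi*f_step*t) crosses the span at constant speed."""
    omega = 2.0 * math.pi * f_n
    modal_mass = mass_per_m * span / 2.0      # for mode shape sin(pi*x/L)
    q, v, peak = 0.0, 0.0, 0.0                # modal displacement and velocity
    steps = int(span / speed / dt)            # simulate the crossing only
    for i in range(steps):
        t = i * dt
        x = speed * t                         # walker position on the span
        force = amplitude * math.sin(2.0 * math.pi * f_step * t)
        modal_force = force * math.sin(math.pi * x / span)
        acc = modal_force / modal_mass - 2.0 * zeta * omega * v - omega ** 2 * q
        v += acc * dt                         # semi-implicit Euler update
        q += v * dt
        peak = max(peak, abs(acc))            # mode shape equals 1 at mid-span
    return peak

# Hypothetical 30 m span, 1000 kg/m, 2.0 Hz mode, 0.5 % damping, 280 N force.
resonant = peak_midspan_acceleration(30.0, 1000.0, 2.0, 0.005, 2.0, 1.5, 280.0)
detuned = peak_midspan_acceleration(30.0, 1000.0, 2.0, 0.005, 1.6, 1.5, 280.0)
print(resonant, detuned)
```

    A probabilistic variant would draw the step frequency, speed and force amplitude from distributions describing a population of walkers and repeat the calculation many times; that step is omitted here.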

  2. Diet Quality of Items Advertised in Supermarket Sales Circulars Compared to Diets of the US Population, as Assessed by the Healthy Eating Index-2010.

    Science.gov (United States)

    Jahns, Lisa; Scheett, Angela J; Johnson, LuAnn K; Krebs-Smith, Susan M; Payne, Collin R; Whigham, Leah D; Hoverson, Bonita S; Kranz, Sibylle

    2016-01-01

    Supermarkets use sales circulars to highlight specific foods, usually at reduced prices. Resulting purchases help form the set of available foods within households from which individuals and families make choices about what to eat. The purposes of this study were to determine how closely foods featured in weekly supermarket sales circulars conform to dietary guidance and how diet quality compares with that of the US population's intakes. Food and beverage items (n=9,149) in 52 weekly sales circulars from a small Midwestern grocery chain in 2009 were coded to obtain food group and nutrient and energy content. Healthy Eating Index-2010 (HEI-2010) total and component scores were calculated using algorithms developed by the National Cancer Institute. HEI-2010 scores for the US population aged 2+ years were estimated using data from the 2009-2010 National Health and Nutrition Examination Survey. HEI-2010 scores of circulars and population intakes were compared using Student's t tests. Mean total (42.8 of 100) HEI-2010 scores of circulars were lower than that of the US population (55.4; Pdiet quality. Supermarkets could support improvements in consumer diets by weekly featuring foods that are more in concordance with food and nutrient recommendations. Copyright © 2016 Academy of Nutrition and Dietetics. Published by Elsevier Inc. All rights reserved.

  3. The Development and Evaluation of an Online Formative Assessment upon Single-Player Game in E-Learning Environment

    Science.gov (United States)

    Tsai, Fu-Hsing

    2013-01-01

    This study developed a game-based formative assessment, called tic-tac-toe quiz for single-player version (TRIS-Q-SP), in an energy education e-learning system. This assessment game combined tic-tac-toe with online assessment, and revised the rule of tic-tac-toe for stimulating students to use online formative assessment actively. Additionally, to…

  4. How employees perceive organizational learning: construct validation of the 25-item short form of the strategic learning assessment map (SF-SLAM)

    NARCIS (Netherlands)

    Mainert, Jakob; Niepel, Christoph; Lans, T.; Greiff, Samuel

    2018-01-01

    Purpose: The Strategic Learning Assessment Map (SLAM) originally assessed organizational learning (OL) at the level of the firm by addressing managers, who rated OL in the SLAM on five dimensions of individual learning, group learning, organizational learning, feed-forward learning, and feedback

  5. Item-focussed Trees for the Identification of Items in Differential Item Functioning.

    Science.gov (United States)

    Tutz, Gerhard; Berger, Moritz

    2016-09-01

    A novel method for the identification of differential item functioning (DIF) by means of recursive partitioning techniques is proposed. We assume an extension of the Rasch model that allows for DIF being induced by an arbitrary number of covariates for each item. Recursive partitioning on the item level results in one tree for each item and leads to simultaneous selection of items and variables that induce DIF. For each item, it is possible to detect groups of subjects with different item difficulties, defined by combinations of characteristics that are not pre-specified. The way a DIF item is determined by covariates is visualized in a small tree and therefore easily accessible. An algorithm is proposed that is based on permutation tests. Various simulation studies, including the comparison with traditional approaches to identify items with DIF, show the applicability and the competitive performance of the method. Two applications illustrate the usefulness and the advantages of the new method.

  6. Therapeutic Assessment of Complex Trauma: A Single-Case Time-Series Study.

    Science.gov (United States)

    Tarocchi, Anna; Aschieri, Filippo; Fantini, Francesca; Smith, Justin D

    2013-06-01

    The cumulative effect of repeated traumatic experiences in early childhood incrementally increases the risk of adjustment problems later in life. Surviving traumatic environments can lead to the development of an interrelated constellation of emotional and interpersonal symptoms termed complex posttraumatic stress disorder (CPTSD). Effective treatment of trauma begins with a multimethod psychological assessment and requires the use of several evidence-based therapeutic processes, including establishing a safe therapeutic environment, reprocessing the trauma, constructing a new narrative, and managing emotional dysregulation. Therapeutic Assessment (TA) is a semistructured, brief intervention that uses psychological testing to promote positive change. The case study of Kelly, a middle-aged woman with a history of repeated interpersonal trauma, illustrates delivery of the TA model for CPTSD. Results of this single-case time-series experiment indicate statistically significant symptom improvement as a result of participating in TA. We discuss the implications of these findings for assessing and treating trauma-related concerns, such as CPTSD.

  7. Examination of the PROMIS upper extremity item bank.

    Science.gov (United States)

    Hung, Man; Voss, Maren W; Bounsanga, Jerry; Crum, Anthony B; Tyser, Andrew R

    Clinical measurement. The psychometric properties of the PROMIS v1.2 UE item bank were tested on various samples prior to its release, but have not been fully evaluated among the orthopaedic population. This study assesses the performance of the UE item bank within the UE orthopaedic patient population. The UE item bank was administered to 1197 adult patients presenting to a tertiary orthopaedic clinic specializing in hand and UE conditions and was examined using traditional statistics and Rasch analysis. The UE item bank fits a unidimensional model (outfit MNSQ range from 0.64 to 1.70) and has adequate reliabilities (person = 0.84; item = 0.82) and local independence (item residual correlations range from -0.37 to 0.34). Only one item exhibits gender differential item functioning. Most items target low levels of function. The UE item bank is a useful clinical assessment tool. Additional items covering higher functions are needed to enhance validity. Supplemental testing is recommended for patients at higher levels of function until more high function UE items are developed. 2c. Copyright © 2016 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.

  8. Feed mechanism and method for feeding minute items

    Science.gov (United States)

    Stringer, Timothy Kent [Bucyrus, KS]; Yerganian, Simon Scott [Lee's Summit, MO]

    2009-10-20

    A feeding mechanism and method for feeding minute items, such as capacitors, resistors, or solder preforms. The mechanism is adapted to receive a plurality of the randomly-positioned and randomly-oriented extremely small or minute items, and to isolate, orient, and position one or more of the items in a specific repeatable pickup location wherefrom they may be removed for use by, for example, a computer-controlled automated assembly machine. The mechanism comprises a sliding shelf adapted to receive and support the items; a wiper arm adapted to achieve a single even layer of the items; and a pushing arm adapted to push the items into the pickup location. The mechanism can be adapted for providing the items with a more exact orientation, and can also be adapted for use in a liquid environment.

  9. Item validity vs. item discrimination index: a redundancy?

    Science.gov (United States)

    Panjaitan, R. L.; Irawati, R.; Sujana, A.; Hanifah, N.; Djuanda, D.

    2018-03-01

    In the literature on evaluation and test analysis, it is common to find calculations of both item validity and the item discrimination index (D), each with a different formula. Meanwhile, other sources say that the item discrimination index can be obtained by calculating the correlation between examinees' scores on a particular item and their scores on the overall test, which is actually the same concept as item validity. Some research reports, especially undergraduate theses, tend to include both item validity and the item discrimination index in the instrument analysis. These concepts may overlap, since both reflect the quality of the test in measuring examinees' ability. In this paper, example results of data processing for item validity and the item discrimination index are compared. The paper discusses whether item validity and the item discrimination index can be represented by just one of them, or whether it is better to present both calculations for simple test analysis, especially in undergraduate theses that include test analyses.
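
    The two quantities contrasted in this abstract can be sketched side by side. The example below is a minimal illustration, not the authors' procedure: the data are hypothetical, the 27% upper/lower grouping is one common convention assumed here, and the function names are invented for this sketch.

```python
from statistics import mean, pstdev

def discrimination_index(item, total, fraction=0.27):
    """Upper-lower index D: proportion correct in the top-scoring group
    minus proportion correct in the bottom-scoring group.
    The 27% group size is an assumed convention, not taken from the abstract."""
    order = sorted(range(len(total)), key=lambda i: total[i])
    k = max(1, round(fraction * len(total)))
    lower = [item[i] for i in order[:k]]
    upper = [item[i] for i in order[-k:]]
    return mean(upper) - mean(lower)

def item_total_correlation(item, total):
    """Pearson correlation between a 0/1 item score and the total test score
    (the 'item validity' reading described in the abstract)."""
    mi, mt = mean(item), mean(total)
    cov = mean((x - mi) * (y - mt) for x, y in zip(item, total))
    return cov / (pstdev(item) * pstdev(total))

# Hypothetical data: 10 examinees, one dichotomous item, total test scores.
item = [0, 0, 0, 1, 0, 1, 1, 1, 1, 1]
total = [3, 4, 5, 6, 7, 8, 9, 10, 11, 12]

d = discrimination_index(item, total)
r = item_total_correlation(item, total)
print(f"D = {d:.2f}, item-total r = {r:.2f}")
```

    Both statistics reward items that high scorers answer correctly more often than low scorers, which is the overlap the abstract questions; D uses only the extreme groups, while the correlation uses every examinee.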

  10. Language-related differential item functioning between English and German PROMIS Depression items is negligible.

    Science.gov (United States)

    Fischer, H Felix; Wahl, Inka; Nolte, Sandra; Liegl, Gregor; Brähler, Elmar; Löwe, Bernd; Rose, Matthias

    2017-12-01

    To investigate differential item functioning (DIF) of PROMIS Depression items between US and German samples we compared data from the US PROMIS calibration sample (n = 780), a German general population survey (n = 2,500) and a German clinical sample (n = 621). DIF was assessed in an ordinal logistic regression framework, with 0.02 as criterion for R²-change and 0.096 for Raju's non-compensatory DIF. Item parameters were initially fixed to the PROMIS Depression metric; we used plausible values to account for uncertainty in depression estimates. Only four items showed DIF. Accounting for DIF led to negligible effects for the full item bank as well as a post hoc simulated computer-adaptive test (German general population sample was considerably lower compared to the US reference value of 50. Overall, we found little evidence for language DIF between US and German samples, which could be addressed by either replacing the DIF items by items not showing DIF or by scoring the short form in German samples with the corrected item parameters reported. Copyright © 2016 John Wiley & Sons, Ltd.

  11. Item selection via Bayesian IRT models.

    Science.gov (United States)

    Arima, Serena

    2015-02-10

    With reference to a questionnaire that aimed to assess the quality of life for dysarthric speakers, we investigate the usefulness of a model-based procedure for reducing the number of items. We propose a mixed cumulative logit model, which is known in the psychometrics literature as the graded response model: responses to different items are modelled as a function of individual latent traits and as a function of item characteristics, such as their difficulty and their discrimination power. We jointly model the discrimination and the difficulty parameters by using a k-component mixture of normal distributions. Mixture components correspond to disjoint groups of items. Items that belong to the same groups can be considered equivalent in terms of both difficulty and discrimination power. According to decision criteria, we select a subset of items such that the reduced questionnaire is able to provide the same information that the complete questionnaire provides. The model is estimated by using a Bayesian approach, and the choice of the number of mixture components is justified according to information criteria. We illustrate the proposed approach on the basis of data that are collected for 104 dysarthric patients by local health authorities in Lecce and in Milan. Copyright © 2014 John Wiley & Sons, Ltd.
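
    The graded response model named in this abstract can be illustrated with its standard category-probability form. The sketch below is only the textbook cumulative-logit formulation under assumed parameter values; it does not reproduce the mixture prior or the Bayesian estimation the authors describe, and the function name is hypothetical.

```python
import math

def grm_category_probs(theta, a, thresholds):
    """Category probabilities for one item under the graded response model.

    theta      -- latent trait of the respondent
    a          -- item discrimination
    thresholds -- increasing difficulty thresholds b_1 < ... < b_{K-1}

    P(Y >= k) is a logistic function of a * (theta - b_k); each category
    probability is the difference of two adjacent cumulative probabilities.
    """
    cumulative = [1.0]
    cumulative += [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
    cumulative.append(0.0)
    return [cumulative[k] - cumulative[k + 1] for k in range(len(thresholds) + 1)]

# Hypothetical item with four ordered response categories.
probs = grm_category_probs(theta=0.0, a=1.5, thresholds=[-1.0, 0.0, 1.2])
print([round(p, 3) for p in probs])
```

    Items whose discrimination and thresholds are close produce nearly identical probability profiles, which is the sense in which the abstract treats items in the same mixture component as equivalent.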

  12. Criteria for eliminating items of a Test of Figural Analogies

    Directory of Open Access Journals (Sweden)

    Diego Blum

    2013-12-01

    This paper describes the steps taken to eliminate two of the items in a Test of Figural Analogies (TFA). The main guidelines of psychometric analysis concerning Classical Test Theory (CTT) and Item Response Theory (IRT) are explained. The item elimination process was based on both the study of the CTT difficulty and discrimination index, and the unidimensionality analysis. The a, b, and c parameters of the Three Parameter Logistic Model of IRT were also considered for this purpose, as well as the assessment of each item fitting this model. The unfavourable characteristics of a group of TFA items are detailed, and decisions leading to their possible elimination are discussed.
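
    For readers unfamiliar with the a, b, and c parameters mentioned above, the following sketch shows the standard three-parameter logistic item response function. The parameter values are hypothetical, and the optional scaling constant (often written D = 1.7) is omitted here as a simplifying assumption.

```python
import math

def p_correct_3pl(theta, a, b, c):
    """Probability of a correct response under the 3PL model.

    a -- discrimination (slope of the curve at theta = b)
    b -- difficulty (location on the ability scale)
    c -- pseudo-guessing parameter (lower asymptote)
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

# Hypothetical item: moderate discrimination, slightly hard, 20% guessing floor.
a, b, c = 1.2, 0.5, 0.2
for theta in (-3.0, 0.5, 3.0):
    print(theta, round(p_correct_3pl(theta, a, b, c), 3))
```

    At theta equal to b the probability is halfway between c and 1, and for very low ability it approaches c. An item with a low a or a high c separates examinees poorly, which is the kind of unfavourable characteristic that could motivate elimination.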

  13. RCRA Assessment Plan for Single-Shell Tank Waste Management Area TX-TY

    Energy Technology Data Exchange (ETDEWEB)

    Horton, Duane G.

    2007-03-26

    WMA TX-TY contains underground, single-shell tanks that were used to store liquid waste that contained chemicals and radionuclides. Most of the liquid has been removed, and the remaining waste is regulated under the RCRA as modified in 40 CFR Part 265, Subpart F and Washington State’s Hazardous Waste Management Act. WMA TX-TY was placed in assessment monitoring in 1993 because of elevated specific conductance. A groundwater quality assessment plan was written in 1993 describing the monitoring activities to be used in deciding whether WMA TX-TY had affected groundwater. That plan was updated in 2001 for continued RCRA groundwater quality assessment as required by 40 CFR 265.93 (d)(7). This document further updates the assessment plan for WMA TX-TY by including (1) information obtained from ten new wells installed at the WMA after 1999 and (2) information from routine quarterly groundwater monitoring during the last five years. Also, this plan describes activities for continuing the groundwater assessment at WMA TX-TY.

  14. Negative Affect Impairs Associative Memory but Not Item Memory

    Science.gov (United States)

    Bisby, James A.; Burgess, Neil

    2014-01-01

    The formation of associations between items and their context has been proposed to rely on mechanisms distinct from those supporting memory for a single item. Although emotional experiences can profoundly affect memory, our understanding of how it interacts with different aspects of memory remains unclear. We performed three experiments to examine…

  15. What We Don't Test: What an Analysis of Unreleased ACS Exam Items Reveals about Content Coverage in General Chemistry Assessments

    Science.gov (United States)

    Reed, Jessica J.; Villafañe, Sachel M.; Raker, Jeffrey R.; Holme, Thomas A.; Murphy, Kristen L.

    2017-01-01

    General chemistry courses are often the foundation for the study of other science disciplines and upper-level chemistry concepts. Students who take introductory chemistry courses are more often from health and science-related fields than chemistry. As such, the content taught and assessed in general chemistry courses is envisioned as building…

  16. Sources of interference in item and associative recognition memory.

    Science.gov (United States)

    Osth, Adam F; Dennis, Simon

    2015-04-01

    A powerful theoretical framework for exploring recognition memory is the global matching framework, in which a cue's memory strength reflects the similarity of the retrieval cues being matched against the contents of memory simultaneously. Contributions at retrieval can be categorized as matches and mismatches to the item and context cues, including the self match (match on item and context), item noise (match on context, mismatch on item), context noise (match on item, mismatch on context), and background noise (mismatch on item and context). We present a model that directly parameterizes the matches and mismatches to the item and context cues, which enables estimation of the magnitude of each interference contribution (item noise, context noise, and background noise). The model was fit within a hierarchical Bayesian framework to 10 recognition memory datasets that use manipulations of strength, list length, list strength, word frequency, study-test delay, and stimulus class in item and associative recognition. Estimates of the model parameters revealed at most a small contribution of item noise that varies by stimulus class, with virtually no item noise for single words and scenes. Despite the unpopularity of background noise in recognition memory models, background noise estimates dominated at retrieval across nearly all stimulus classes with the exception of high frequency words, which exhibited equivalent levels of context noise and background noise. These parameter estimates suggest that the majority of interference in recognition memory stems from experiences acquired before the learning episode. (c) 2015 APA, all rights reserved.

  17. Single-cell-based evaluation of sperm progressive motility via fluorescent assessment of mitochondria membrane potential.

    Science.gov (United States)

    Moscatelli, Natalina; Spagnolo, Barbara; Pisanello, Marco; Lemma, Enrico Domenico; De Vittorio, Massimo; Zara, Vincenzo; Pisanello, Ferruccio; Ferramosca, Alessandra

    2017-12-20

    Sperm cells progressive motility is the most important parameter involved in the fertilization process. Sperm middle piece contains mitochondria, which play a critical role in energy production and whose proper operation ensures the reproductive success. Notably, sperm progressive motility is strictly related to mitochondrial membrane potential (MMP) and consequently to mitochondrial functionality. Although previous studies presented an evaluation of mitochondrial function through MMP assessment in entire sperm cells samples, a quantitative approach at single-cell level could provide more insights in the analysis of semen quality. Here we combine laser scanning confocal microscopy and functional fluorescent staining of mitochondrial membrane to assess MMP distribution among isolated spermatozoa. We found that the sperm fluorescence value increases as a function of growing progressive motility and that such fluorescence is influenced by MMP disruptors, potentially allowing for the discrimination of different quality classes of sperm cells in heterogeneous populations.

  18. Item response theory at subject- and group-level

    NARCIS (Netherlands)

    Tobi, Hilde

    1990-01-01

    This paper reviews the literature about item response models for the subject level and aggregated level (group level). Group-level item response models (IRMs) are used in the United States in large-scale assessment programs such as the National Assessment of Educational Progress and the California

  19. Esthetic outcome for maxillary anterior single implants assessed by different dental specialists.

    Science.gov (United States)

    Al-Dosari, Abdullah; Al-Rowis, Ra'ed; Moslem, Feras; Alshehri, Fahad; Ballo, Ahmed M

    2016-10-01

    The aim of this study was to assess the esthetic outcome of maxillary anterior single implants by comparing the esthetic perception of dental professionals and patients. Twenty-three patients with single implants in the esthetic zone were enrolled in this study. Dentists of four different dental specialties (three orthodontists, three oral surgeons, three prosthodontists, and three periodontists) evaluated the pink esthetic score (PES)/white esthetic score (WES) for 23 implant-supported single restorations. Patient satisfaction with the esthetic outcome of the treatment was evaluated using a visual analog scale (VAS). The mean total PES/WES was 12.26 ± 4.76. The mean PES was 6.45 ± 2.78 and mean WES was 5.80 ± 2.82. There was a statistically significant difference among the different specialties for WES ( P esthetic perception, thereby providing rationales for involving patients in the treatment plan to achieve higher levels of patient satisfaction.

  20. Therapeutic Assessment for Preadolescent Boys with Oppositional Defiant Disorder: A Replicated Single-Case Time-Series Design

    Science.gov (United States)

    Smith, Justin D.; Handler, Leonard; Nash, Michael R.

    2010-01-01

    The Therapeutic Assessment (TA) model is a relatively new treatment approach that fuses assessment and psychotherapy. The study examines the efficacy of this model with preadolescent boys with oppositional defiant disorder and their families. A replicated single-case time-series design with daily measures is used to assess the effects of TA and to…

  1. Diagnostic Efficacy of a Single Progesterone Determination to Assess Full-Term Pregnancy in the Bitch.

    Science.gov (United States)

    Rota, A; Charles, C; Starvaggi Cucuzza, A; Pregel, P

    2015-12-01

    In clinical settings, when the reproductive history of a near-term bitch is limited to mating dates, the ability to accurately assess whether pregnancy is at term could be very useful for planning the correct management of parturition or for safely performing an elective Caesarean section. The aim of this study was to assess the diagnostic efficacy of a single progesterone determination, measured by chemiluminescent immunoassay (CLIA), in predicting the occurrence of parturition on the following day. At least one blood sample was collected from 51 pre-partum bitches during the 3 days before parturition and on the day of parturition. The efficacy of progesterone as a marker of the end of pregnancy was tested using a receiver operating characteristic (ROC) analysis. Youden's index was calculated to select the optimal cut-off value (with 95% confidence interval), aiming at maximizing the correct identification of negative events, so as not to risk diagnosing as full-term a bitch that is not. Progesterone concentration lower than 3.4 ng/ml correctly identified the bitches whelping the following day; however, because of the necessarily prudent approach, sensitivity was low (46.88%), and 17 of 32 full-term bitches were missed. Due to a very large individual variation, a single progesterone determination has low diagnostic efficacy, although it can represent a useful first screening. © 2015 Blackwell Verlag GmbH.
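
    The reported sensitivity follows directly from the counts in the abstract, and Youden's index (J = sensitivity + specificity - 1) can be scanned over candidate cut-offs. In the sketch below only the 17-of-32 figure comes from the abstract; the progesterone values, labels, and candidate cut-offs are hypothetical, and plain maximisation of J is shown rather than the specificity-weighted choice the authors describe.

```python
def sensitivity(tp, fn):
    """Share of true positives among all actual positives."""
    return tp / (tp + fn)

def specificity(tn, fp):
    """Share of true negatives among all actual negatives."""
    return tn / (tn + fp)

def youden_j(samples, cutoff):
    """Youden's J for the rule 'value below cutoff => predicted positive'."""
    tp = sum(1 for value, positive in samples if positive and value < cutoff)
    fn = sum(1 for value, positive in samples if positive and value >= cutoff)
    tn = sum(1 for value, positive in samples if not positive and value >= cutoff)
    fp = sum(1 for value, positive in samples if not positive and value < cutoff)
    return sensitivity(tp, fn) + specificity(tn, fp) - 1.0

# From the abstract: 17 of 32 full-term bitches were missed, so 15 were detected.
reported_sensitivity = sensitivity(tp=32 - 17, fn=17)
print(f"sensitivity = {reported_sensitivity:.2%}")

# Hypothetical (progesterone ng/ml, whelps next day) pairs for illustration only.
samples = [(1.5, True), (2.8, True), (3.9, True), (5.2, True),
           (4.5, False), (5.5, False), (7.5, False), (9.0, False)]
candidate_cutoffs = [3.0, 4.0, 5.0, 6.0]
best_cutoff = max(candidate_cutoffs, key=lambda c: youden_j(samples, c))
print("cut-off with highest J:", best_cutoff)
```

    Lowering the cut-off trades sensitivity for specificity, which matches the abstract's explanation of why a prudent threshold misses many full-term bitches.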

  2. Random Item Generation Is Affected by Age

    Science.gov (United States)

    Multani, Namita; Rudzicz, Frank; Wong, Wing Yiu Stephanie; Namasivayam, Aravind Kumar; van Lieshout, Pascal

    2016-01-01

    Purpose: Random item generation (RIG) involves central executive functioning. Measuring aspects of random sequences can therefore provide a simple method to complement other tools for cognitive assessment. We examine the extent to which RIG relates to specific measures of cognitive function, and whether those measures can be estimated using RIG…

  3. Automated Assessment of Dynamic Knee Valgus and Risk of Knee Injury During the Single Leg Squat

    Science.gov (United States)

    Lee, Alexander; Raina, Sachin; Kulić, Dana

    2017-01-01

    Many clinical assessment protocols of the lower limb rely on the evaluation of functional movement tests such as the single leg squat (SLS), which are often assessed visually. Visual assessment is subjective and depends on the experience of the clinician. In this paper, an inertial measurement unit (IMU)-based method for automated assessment of squat quality is proposed to provide clinicians with a quantitative measure of SLS performance. A set of three IMUs was used to estimate the joint angles, velocities, and accelerations of the squatting leg. Statistical time domain features were generated from these measurements. The most informative features were used for classifier training. A data set of SLS performed by healthy participants was collected and labeled by three expert clinical raters using two different labeling criteria: “observed amount of knee valgus” and “overall risk of injury”. The results showed that both flexion at the hip and knee, as well as hip and ankle internal rotation are discriminative features, and that participants with “poor” squats bend the hip and knee less than those with better squat performance. Furthermore, improved classification performance is achieved for females by training separate classifiers stratified by gender. Classification results showed excellent accuracy, 95.7% for classifying squat quality as “poor” or “good” and 94.6% for differentiating between high and no risk of injury. PMID:29204327

  4. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes

    Science.gov (United States)

    Parks, Donovan H.; Imelfort, Michael; Skennerton, Connor T.; Hugenholtz, Philip; Tyson, Gene W.

    2015-01-01

    Large-scale recovery of genomes from isolates, single cells, and metagenomic data has been made possible by advances in computational methods and substantial reductions in sequencing costs. Although this increasing breadth of draft genomes is providing key information regarding the evolutionary and functional diversity of microbial life, it has become impractical to finish all available reference genomes. Making robust biological inferences from draft genomes requires accurate estimates of their completeness and contamination. Current methods for assessing genome quality are ad hoc and generally make use of a limited number of “marker” genes conserved across all bacterial or archaeal genomes. Here we introduce CheckM, an automated method for assessing the quality of a genome using a broader set of marker genes specific to the position of a genome within a reference genome tree and information about the collocation of these genes. We demonstrate the effectiveness of CheckM using synthetic data and a wide range of isolate-, single-cell-, and metagenome-derived genomes. CheckM is shown to provide accurate estimates of genome completeness and contamination and to outperform existing approaches. Using CheckM, we identify a diverse range of errors currently impacting publicly available isolate genomes and demonstrate that genomes obtained from single cells and metagenomic data vary substantially in quality. In order to facilitate the use of draft genomes, we propose an objective measure of genome quality that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities. PMID:25977477

  5. Single use disposable digital flexible ureteroscopes: an ex-vivo assessment and cost analysis.

    Science.gov (United States)

    Hennessey, D B; Fojecki, G; Papa, N; Lawrentschuk, N; Bolton, D

    2018-04-15

    The single use flexible ureteroscope (fURS), the LithoVue, is an important recent development. We aim to measure the capability of this instrument and to assess if there is a benefit to switching to single use instruments. The LithoVue was compared to Olympus URF-V and Storz Flex-Xc ex-vivo. An analysis of reusable fURS usage was performed to evaluate damage, durability and maintenance costs. This was then compared to the projected costs of using single use instruments. Flexion, deflection and irrigation flow of the LithoVue were equivalent to, if not better than, those of reusable instruments. An analysis of 234 procedures with 7 new Olympus URF-V scopes revealed 15 scope damages. Staghorn stones and lower pole/midzone stones were significant risk factors for damage, p=0.014. Once damage occurred, it was likely to occur again. Total repair costs were $162,628 (£92,411); the mean cost per case was $695 (£395). Factoring in the purchase, cleaning and repair costs, the cumulative cost of 28 reusable fURS cases is approximately $50,000 (£28,412). If the LithoVue was priced at $1200 AUD, switching to a single use scope would cost approximately $35,000 (£19,888). The LithoVue is analogous to reusable fURS scopes in regard to standard technical metrics. Depending on its purchase cost it may also represent a cost saving for hospitals when compared to the cumulative costs of maintaining reusable fURS. Additionally, urologists may consider using the scope in cases in which reusable scope damage is anticipated. This article is protected by copyright. All rights reserved.
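
    The cost comparison in this abstract reduces to a few lines of arithmetic. The sketch below uses only figures quoted above and assumes, for illustration, that all dollar amounts are in the same currency; the break-even price is a derived quantity that the abstract does not report.

```python
# Figures quoted in the abstract.
total_repair_cost = 162_628        # repair costs over the study period
procedures = 234                   # procedures performed with reusable scopes
reusable_cost_28_cases = 50_000    # approximate cumulative cost of 28 reusable cases
single_use_price = 1_200           # assumed price per single-use scope
cases = 28

# Mean repair cost per procedure (the abstract reports about $695).
mean_repair_cost_per_case = total_repair_cost / procedures

# Scope purchases alone for 28 single-use cases. The abstract quotes
# approximately $35,000, so it presumably includes costs not itemised there.
single_use_total = single_use_price * cases

# Derived for illustration: the scope price at which 28 single-use cases
# would match the reusable cumulative cost.
break_even_price = reusable_cost_28_cases / cases

print(round(mean_repair_cost_per_case), single_use_total, round(break_even_price, 2))
```

    Under these assumptions the single-use option remains cheaper for any scope price below the break-even value, which is consistent with the abstract's remark that savings depend on purchase cost.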

  6. Criterion validity of the Short Mood and Feelings Questionnaire and one- and two-item depression screens in young adolescents

    Directory of Open Access Journals (Sweden)

    McCauley Elizabeth

    2010-02-01

    Background: The use of short screening questionnaires may be a promising option for identifying children at risk for depression in a community setting. The objective of this study was to assess the validity of the Short Mood and Feelings Questionnaire (SMFQ) and one- and two-item screening instruments for depressive disorders in a school-based sample of young adolescents. Methods: Participants were 521 sixth-grade students attending public middle schools. Child and parent versions of the SMFQ were administered to evaluate the child's depressive symptoms. The presence of any depressive disorder during the previous month was assessed using the Diagnostic Interview Schedule for Children (DISC) as the criterion standard. First, we assessed the diagnostic accuracy of child, parent, and combined scores of the full 13-item SMFQ by calculating the area under the receiver operating characteristic curve (AUC), sensitivity and specificity. The same approach was then used to evaluate the accuracy of a two-item scale consisting of only depressed mood and anhedonia items, and a single depressed mood item. Results: The combined child + parent SMFQ score showed the highest accuracy (AUC = 0.86). Diagnostic accuracy was lower for child (AUC = 0.73) and parent (AUC = 0.74) SMFQ versions. Corresponding versions of one- and two-item screens had lower AUC estimates, but the combined versions of the brief screens each still showed moderate accuracy. Furthermore, child and combined versions of the two-item screen demonstrated higher sensitivity (although lower specificity) than either the one-item screen or the full SMFQ. Conclusions: Under conditions where parents accompany children to screening settings (e.g. primary care), use of a child + parent version of the SMFQ is recommended. However, when parents are not available, and the cost of a false positive result is minimal, then a one- or two-item screen may be useful for initial identification of at-risk youth.

  7. Problems with the factor analysis of items: Solutions based on item response theory and item parcelling

    Directory of Open Access Journals (Sweden)

    Gideon P. De Bruin

    2004-10-01

    The factor analysis of items often produces spurious results in the sense that unidimensional scales appear multidimensional. This may be ascribed to failure in meeting the assumptions of linearity and normality on which factor analysis is based. Item response theory is explicitly designed for the modelling of the non-linear relations between ordinal variables and provides a strong alternative to the factor analysis of items. Items may also be combined in parcels that are more likely to satisfy the assumptions of factor analysis than do the items. The use of the Rasch rating scale model and the factor analysis of parcels is illustrated with data obtained with the Locus of Control Inventory. The results of these analyses are compared with the results obtained through the factor analysis of items. It is shown that the Rasch rating scale model and the factoring of parcels produce superior results to the factor analysis of items. Recommendations for the analysis of scales are made.

  8. Negative affect impairs associative memory but not item memory.

    Science.gov (United States)

    Bisby, James A; Burgess, Neil

    2013-12-17

    The formation of associations between items and their context has been proposed to rely on mechanisms distinct from those supporting memory for a single item. Although emotional experiences can profoundly affect memory, our understanding of how it interacts with different aspects of memory remains unclear. We performed three experiments to examine the effects of emotion on memory for items and their associations. By presenting neutral and negative items with background contexts, Experiment 1 demonstrated that item memory was facilitated by emotional affect, whereas memory for an associated context was reduced. In Experiment 2, arousal was manipulated independently of the memoranda, by a threat of shock, whereby encoding trials occurred under conditions of threat or safety. Memory for context was equally impaired by the presence of negative affect, whether induced by threat of shock or a negative item, relative to retrieval of the context of a neutral item in safety. In Experiment 3, participants were presented with neutral and negative items as paired associates, including all combinations of neutral and negative items. The results showed both above effects: compared to a neutral item, memory for the associate of a negative item (a second item here, context in Experiments 1 and 2) is impaired, whereas retrieval of the item itself is enhanced. Our findings suggest that negative affect impairs associative memory while recognition of a negative item is enhanced. They support dual-processing models in which negative affect or stress impairs hippocampal-dependent associative memory while the storage of negative sensory/perceptual representations is spared or even strengthened.

  9. Validation of the 24-item recovery assessment scale-revised (RAS-R) in the Norwegian language and context: a multi-centre study.

    Science.gov (United States)

    Biringer, Eva; Tjoflåt, Marit

    2018-01-25

    The Recovery Assessment Scale-revised (RAS-R) is a self-report instrument measuring mental health recovery. The purpose of the present study was to translate and adapt the RAS-R into the Norwegian language and to investigate its psychometric properties in terms of factor structure, convergent and discriminant validity and reliability in the Norwegian context. The present study is a cross-sectional multi-centre study. After a pilot test, the Norwegian version of the RAS-R was distributed to 231 service users in mental health specialist and community services. The factor structure of the instrument was investigated by a confirmatory factor analysis (CFA), and internal consistency was assessed by Cronbach's alpha. The RAS-R was found to be acceptable and feasible for service users. The original five-factor structure was confirmed. All model fit indices, including the standardised root mean square residual (SRMR), which is independent of the χ²-test, met the criteria for an acceptable model fit. Internal consistencies within sub-scales as measured by Cronbach's alpha ranged from 0.65 to 0.85. Cronbach's alpha for the total scale was 0.90. As expected, some redundancy between factors existed (in particular among the factors Personal confidence and hope, Goal and success orientation and Not dominated by symptoms). The Norwegian RAS-R showed acceptable psychometric properties in terms of convergent validity and reliability, and fit indices from the CFA confirmed the original factor structure. We recommend the Norwegian RAS-R as a tool in service users' and health professionals' collaborative work towards the service users' recovery goals and as an outcome measure in larger evaluations.
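
    Cronbach's alpha, the internal-consistency statistic reported above, has a compact closed form: alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The sketch below applies it to a small hypothetical response matrix; the data and function name are illustrative and unrelated to the RAS-R sample.

```python
from statistics import pvariance

def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(rows[0])
    # Variance of each item across respondents.
    item_variances = [pvariance([row[j] for row in rows]) for j in range(k)]
    # Variance of the summed scale score across respondents.
    total_variance = pvariance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)

# Hypothetical data: 5 respondents answering 3 items on a 1-5 scale.
responses = [
    [1, 2, 2],
    [2, 2, 3],
    [3, 3, 3],
    [4, 4, 5],
    [5, 4, 5],
]
alpha = cronbach_alpha(responses)
print(f"alpha = {alpha:.3f}")
```

    Because alpha rises with the number of items as well as with their intercorrelation, a total scale can reach 0.90 while shorter sub-scales sit between 0.65 and 0.85, as reported in the abstract.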

  10. Which Statistic Should Be Used to Detect Item Preknowledge When the Set of Compromised Items Is Known?

    Science.gov (United States)

    Sinharay, Sandip

    2017-09-01

    Benefiting from item preknowledge is a major type of fraudulent behavior during educational assessments. Belov suggested the posterior shift statistic for detection of item preknowledge and showed its performance to be better on average than that of seven other statistics for detection of item preknowledge for a known set of compromised items. Sinharay suggested a statistic based on the likelihood ratio test for detection of item preknowledge; the advantage of the statistic is that its null distribution is known. Results from simulated and real data and adaptive and nonadaptive tests are used to demonstrate that the Type I error rate and power of the statistic based on the likelihood ratio test are very similar to those of the posterior shift statistic. Thus, the statistic based on the likelihood ratio test appears promising in detecting item preknowledge when the set of compromised items is known.

  11. Dissociating the neural correlates of intra-item and inter-item working-memory binding.

    Directory of Open Access Journals (Sweden)

    Carinne Piekema

    BACKGROUND: Integration of information streams into a unitary representation is an important task of our cognitive system. Within working memory, the medial temporal lobe (MTL) has been conceptually linked to the maintenance of bound representations. In a previous fMRI study, we have shown that the MTL is indeed more active during working-memory maintenance of spatial associations as compared to non-spatial associations or single items. There are two explanations for this result: either the mere presence of the spatial component activates the MTL, or the MTL is recruited to bind associations between neurally non-overlapping representations. METHODOLOGY/PRINCIPAL FINDINGS: The current fMRI study investigates this issue further by directly comparing intrinsic intra-item binding (object/colour), extrinsic intra-item binding (object/location), and inter-item binding (object/object). The three binding conditions resulted in differential activation of brain regions. Specifically, we show that the MTL is important for establishing extrinsic intra-item associations and inter-item associations, in line with the notion that binding of information processed in different brain regions depends on the MTL. CONCLUSIONS/SIGNIFICANCE: Our findings indicate that different forms of working-memory binding rely on specific neural structures. In addition, these results extend previous reports indicating that the MTL is implicated in working-memory maintenance, challenging the classic distinction between short-term and long-term memory systems.

  12. Determination of the caffeine contents of various food items within the Austrian market and validation of a caffeine assessment tool (CAT).

    Science.gov (United States)

    Rudolph, E; Färbinger, A; König, J

    2012-01-01

    The caffeine content of 124 products, including coffee, coffee-based beverages, energy drinks, tea, colas, yoghurt and chocolate, was determined using RP-HPLC with UV detection after solid-phase extraction. The highest concentrations of caffeine were found for coffee prepared from pads (755 mg l⁻¹) and regular filtered coffee (659 mg l⁻¹). The total caffeine content of coffee and chocolate-based beverages was between 15 mg l⁻¹ in chocolate milk and 448 mg l⁻¹ in canned ice coffee. For energy drinks the caffeine content varied in a range from 266 to 340 mg l⁻¹. Caffeine concentrations in tea and ice teas were between 13 and 183 mg l⁻¹. Coffee-flavoured yoghurts ranged from 33 to 48 mg kg⁻¹. The caffeine concentration in chocolate and chocolate bars was between 17 mg kg⁻¹ in whole milk chocolate and 551 mg kg⁻¹ in a chocolate with coffee filling. A caffeine assessment tool was developed and validated by a 3-day dietary record (r² = 0.817, p < 0.01) using these analytical data and caffeine saliva concentrations (r² = 0.427, p < 0.01).

  13. A New Kind of Single-Well Tracer Test for Assessing Subsurface Heterogeneity

    Science.gov (United States)

    Hansen, S. K.; Vesselinov, V. V.; Lu, Z.; Reimus, P. W.; Katzman, D.

    2017-12-01

    Single-well injection-withdrawal (SWIW) tracer tests have historically been interpreted using the idealized assumption of tracer path reversibility (i.e., negligible background flow), with background flow due to natural hydraulic gradient being an un-modeled confounding factor. However, we have recently discovered that it is possible to use background flow to our advantage to extract additional information about the subsurface. To wit: we have developed a new kind of single-well tracer test that exploits flow due to natural gradient to estimate the variance of the log hydraulic conductivity field of a heterogeneous aquifer. The test methodology involves injection under forced gradient and withdrawal under natural gradient, and makes use of a relationship, discovered using a large-scale Monte Carlo study and machine learning techniques, between power law breakthrough curve tail exponent and log-hydraulic conductivity variance. We will discuss how we performed the computational study and derived this relationship and then show an application example in which our new single-well tracer test interpretation scheme was applied to estimation of heterogeneity of a formation at the chromium contamination site at Los Alamos National Laboratory. Detailed core hole records exist at the same site, from which it was possible to estimate the log hydraulic conductivity variance using a Kozeny-Carman relation. The variances estimated using our new tracer test methodology and estimated by direct inspection of core were nearly identical, corroborating the new methodology. Assessment of aquifer heterogeneity is of critical importance to deployment of amendments associated with in-situ remediation strategies, since permeability contrasts potentially reduce the interaction between amendment and contaminant. Our new tracer test provides an easy way to obtain this information.

  14. Assessing arsenic and selenium in a single nail clipping using portable X-ray fluorescence

    International Nuclear Information System (INIS)

    Fleming, David E.B.; Nader, Michel N.; Foran, Kelly A.; Groskopf, Craig; Reno, Michael C.; Ware, Chris S.; Tehrani, Mina; Guimarães, Diana; Parsons, Patrick J.

    2017-01-01

    The feasibility of measuring arsenic and selenium contents in a single nail clipping was investigated using a small-focus portable X-ray fluorescence (XRF) instrument with monochromatic excitation beams. Nail clipping phantoms supplemented with arsenic and selenium to produce materials with 0, 5, 10, 15, and 20 µg/g were used for calibration purposes. In total, 10 different clippings were analyzed at two different measurement positions. Energy spectra were fit with detection peaks for arsenic K_α, selenium K_α, arsenic K_β, selenium K_β, and bromine K_α characteristic X-rays. Data analysis was performed under two distinct conditions of fitting constraint. Calibration lines were established from the amplitude of each of the arsenic and selenium peaks as a function of the elemental contents in the clippings. The slopes of the four calibration lines were consistent between the two conditions of analysis. The calculated minimum detection limit (MDL) of the method, when considering the K_α peak only, ranged from 0.210±0.002 µg/g selenium under one condition of analysis to 0.777±0.009 µg/g selenium under another. Compared with previous portable XRF nail clipping studies, MDLs were substantially improved for both arsenic and selenium. The new measurement technique had the additional benefits of being short in duration (~3 min) and requiring only a single nail clipping. The mass of the individual clipping used did not appear to play a major role in signal strength, but positioning of the clipping is important. - Highlights: • Portable X-ray fluorescence was used to assess As and Se in nail clipping phantoms. • Calibration lines were consistent between two different conditions of data analysis. • This new XRF approach was sensitive and required only a single nail clipping.

  15. Subjective and Objective Quality Assessment of Single-Channel Speech Separation Algorithms

    DEFF Research Database (Denmark)

    Mowlaee, Pejman; Saeidi, Rahim; Christensen, Mads Græsbøll

    2012-01-01

    Previous studies on performance evaluation of single-channel speech separation (SCSS) algorithms mostly focused on automatic speech recognition (ASR) accuracy as their performance measure. Assessing the separated signals by metrics other than this has the benefit that the results...... are expected to carry over to other applications beyond ASR. In this paper, in addition to conventional speech quality metrics (PESQ and SNRloss), we also evaluate the separation systems' output using different source separation metrics: blind source separation evaluation (BSS EVAL) and perceptual evaluation...... that PESQ and PEASS quality metrics predict well the subjective quality of separated signals obtained by the separation systems. From the results it is observed that the short-time objective intelligibility (STOI) measure predicts the speech intelligibility results....

  16. A fire risk assessment model for residential high-rises with a single stairwell

    DEFF Research Database (Denmark)

    Hansen, N. D.; Steffensen, F.B.; Valkvist, M.B.

    2018-01-01

    As few or no prescriptive guidelines for fire risk assessment of residential high-rise buildings exist, it has been unclear which fire safety design features constitute an acceptable (adequate) safety level. In order to fill this gap, a simplified risk-based decision-support tool, the Fire Risk...... Model (FRM), was developed. The FRM evaluates both the risk level to the occupants and the property risk level as a function of the building characteristics, height and fire safety features for single stairwell residential high-rise buildings. The acceptability of a high-rise design is then defined......, and the associated performance of the FRM evaluated. It was found that compartmentation and the door configurations in the egress path play an important role, along with sprinklers, in order for the design to successfully keep the stairwell free from smoke. Specifically, modern curtain wall facades were found...

  17. Assessment of illumination conditions in a single-pixel imaging configuration

    Science.gov (United States)

    Garoi, Florin; Udrea, Cristian; Damian, Cristian; Logofǎtu, Petre C.; Colţuc, Daniela

    2016-12-01

    Single-pixel imaging based on multiplexing is a promising technique, especially in applications where 2D detectors or raster scanning imaging are not readily applicable. With this method, Hadamard masks are projected on a spatial light modulator to encode an incident scene and a signal is recorded at the photodiode detector for each of these masks. Ultimately, the image is reconstructed on the computer by applying the inverse transform matrix. Thus, various algorithms were optimized and several spatial light modulators already characterized for such a task. This work analyses the imaging quality of such a single-pixel arrangement, when various illumination conditions are used. More precisely, the main comparison is made between coherent and incoherent ("white light") illumination and between two multiplexing methods, namely Hadamard and Scanning. The quality of the images is assessed by calculating their SNR, using two relations. The results show better images are obtained with "white light" illumination for the first method and coherent one for the second.
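
    The multiplexing scheme described in this record (one detector reading per Hadamard mask, followed by inversion of the transform) can be sketched in a few lines of Python. This is an idealised, noise-free illustration that assumes ±1 masks and a hypothetical flattened 8-pixel scene; it is not the authors' setup, and a real modulator would typically need two 0/1 patterns per mask to realise the negative entries.

```python
def hadamard(n):
    """Sylvester construction of an n x n Hadamard matrix; n must be a power of two."""
    if n < 1 or n & (n - 1):
        raise ValueError("n must be a power of two")
    h = [[1]]
    while len(h) < n:
        h = [row + row for row in h] + [row + [-x for x in row] for row in h]
    return h


def measure(h, scene):
    """One detector reading per mask: the inner product of the mask and the scene."""
    return [sum(m * s for m, s in zip(mask, scene)) for mask in h]


def reconstruct(h, readings):
    """Invert the transform using H^-1 = H^T / n, which holds for Hadamard matrices."""
    n = len(h)
    return [sum(h[i][j] * readings[i] for i in range(n)) / n for j in range(n)]


scene = [0, 3, 1, 0, 2, 5, 0, 1]  # hypothetical pixel intensities
h = hadamard(len(scene))
readings = measure(h, scene)
image = reconstruct(h, readings)
print(image)
```

    In this noise-free setting the reconstruction returns the scene exactly; the illumination comparison in the record concerns how measurement noise changes that outcome.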

  18. Assessment of DNA damage in radiation workers by using single cell gel electrophoresis

    International Nuclear Information System (INIS)

    Jia Lili; Zhang Tao; Yang Yonghua; Wang Yan; Du Liqing; Cao Jia; Wang Hong; Liu Qiang; Fan Feiyue

    2010-01-01

    Objective: To assess the DNA damage of radiation workers in hospitals of different grades, and to explore the correlation between the types of work or work time and the levels of DNA damage. Methods: DNA single-strand breaks were detected by using alkaline single cell gel electrophoresis (SCGE), and the comet was analyzed with CASP (Comet Assay Software Project). TDNA%, TL, TM and OTM were calculated. Results: The parameters of SCGE in the radiation group were higher than those of the control group (F=3.93, P<0.01). A significant difference was found not only among the different types of work or different work time, but also among the hospitals of different grades (F=1.83, 1.91, P<0.05). Conclusions: Various levels of DNA damage could be detected in the radiation workers of the two hospitals. DNA damage of radiation workers is less serious in the higher-grade hospital than in the lower-grade one. Different types of work or work time might affect the DNA damage level. (authors)

  19. Single-photon emission computed tomography for the assessment of ventricular perfusion and function

    International Nuclear Information System (INIS)

    Gonzalez, Patricio; Dussaillant, Gaston; Gutierrez, Daniela; Berrocal, Isabel; Alay, Rita; Otarola, Sonia

    2013-01-01

    Background: Single-photon emission computed tomography (SPECT) can be used as a non-invasive tool for the assessment of coronary perfusion. Aim: To assess ventricular perfusion and function by SPECT in patients with single vessel coronary artery disease. Material and Methods: Among patients with indications for a coronary artery angiography, those with significant lesions in one vessel were selected for the study. Within 24 hours, cardiac SPECT examinations under basal conditions and after high doses of dipyridamole were performed. SPECT data from 38 patients with a low probability of coronary artery disease was used for comparisons. Results: Ten patients aged 61 ± 8 years (seven men) were studied. Visual analysis of SPECT revealed signs suggestive of ischemia in eight patients. The remaining two patients did not have perfusion disturbances. SPECT detected eight of ten abnormal vessels reported in the coronary artery angiography. There were two false negative results. Summed stress, summed rest and summed difference scores were 9.78 ± 6.51, 3.22 ± 5.07 and 6.33 ± 4.97, respectively. The ejection fractions under stress and at rest were 53 ± 11.7% and 61 ± 15.7% respectively (p ≤ 0.01). The figures for the control group were 69.1 ± 13.5% and 75.2 ± 12.04% respectively (significantly different from patients). Two patients had a summed motion score above 14.9. Likewise, two patients had a summed thickening score above 10.9. Conclusions: SPECT detected 80% of coronary lesions found during coronary artery angiography. Visual analysis of perfusion is highly reliable for diagnosis. Quantitative parameters must be considered only as reference parameters.

  20. The PROMIS fatigue item bank has good measurement properties in patients with fibromyalgia and severe fatigue.

    Science.gov (United States)

    Yost, Kathleen J; Waller, Niels G; Lee, Minji K; Vincent, Ann

    2017-06-01

    Efficient management of fibromyalgia (FM) requires precise measurement of FM-specific symptoms. Our objective was to assess the measurement properties of the Patient-Reported Outcome Measurement Information System (PROMIS) fatigue item bank (FIB) in people with FM. We applied classical psychometric and item response theory methods to cross-sectional PROMIS-FIB data from two samples. Data on the clinical FM sample were obtained at a tertiary medical center. Data for the U.S. general population sample were obtained from the PROMIS network. The full 95-item bank was administered to both samples. We investigated dimensionality of the item bank in both samples by separately fitting a bifactor model with two group factors; experience and impact. We assessed measurement invariance between samples, and we explored an alternate factor structure with the normative sample and subsequently confirmed that structure in the clinical sample. Finally, we assessed whether reporting FM subdomain scores added value over reporting a single total score. The item bank was dominated by a general fatigue factor. The fit of the initial bifactor model and evidence of measurement invariance indicated that the same constructs were measured across the samples. An alternative bifactor model with three group factors demonstrated slightly improved fit. Subdomain scores add value over a total score. We demonstrated that the PROMIS-FIB is appropriate for measuring fatigue in clinical samples of FM patients. The construct can be presented by a single score; however, subdomain scores for the three group factors identified in the alternative model may also be reported.

  1. Characterization of Disability in Canadians with Mental Disorders Using an Abbreviated Version of a DSM-5 Emerging Measure: The 12-Item WHO Disability Assessment Schedule (WHODAS) 2.0.

    Science.gov (United States)

    Sjonnesen, Kirsten; Bulloch, Andrew G M; Williams, Jeanne; Lavorato, Dina; B Patten, Scott

    2016-04-01

    The World Health Organization Disability Assessment Schedule 2.0 (WHODAS 2.0) is a disability scale included in Section 3 of the fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5) as a possible replacement for the Global Assessment of Functioning Scale (GAF). To assist Canadian psychiatrists with interpretation of the scale, we have conducted a descriptive analysis using data from the 2012 Canadian Community Health Survey-Mental Health component (CCHS-MH). The 2012 CCHS-MH was a cross-sectional survey of the Canadian community (n = 23,757). The survey included an abbreviated 12-item version of the WHODAS 2.0. Mental disorder diagnoses were assessed for schizophrenia, other psychosis, major depressive episode (MDE), generalized anxiety disorder (GAD), bipolar I disorder, substance abuse/dependence, and alcohol abuse/dependence. Mean scores ranged from 14.2 (95% CI, 14.1 to 14.3) for the overall community population to 23.1 (95% CI, 19.5 to 26.7) for those with schizophrenia, with higher scores indicating greater disability. Furthermore, the difference in scores between those with lifetime and past-month episodes suggests that the scale is sensitive to changes occurring during the course of these disorders; for example, scores varied from 23.6 (95% CI, 22.2 to 25.1) for past-month MDE to 14.4 (95% CI, 14.2 to 14.7) in the lifetime MDE group without a past-year episode. This analysis suggests that the WHODAS 2.0 may be a suitable replacement for the GAF. As a disability measure, even though it is not a mental health-specific instrument, the 12-item WHODAS 2.0 appears to be sensitive to the impact of mental disorders and to changes over the time course of a mental disorder. However, the clinical utility of this measure requires additional assessment. © The Author(s) 2016.

  2. The Effects of Multiple-Step and Single-Step Directions on Fourth and Fifth Grade Students' Grammar Assessment Performance

    Science.gov (United States)

    Mazerik, Matthew B.

    2006-01-01

    The mean scores of English Language Learners (ELL) and English Only (EO) students in 4th and 5th grade (N = 110), across the teacher-administered Grammar Skills Test, were examined for differences in participants' scores on assessments containing single-step directions and assessments containing multiple-step directions. The results indicated no…

  3. Treatment response of airway clearance assessed by single-breath washout in children with cystic fibrosis.

    Science.gov (United States)

    Abbas, Chiara; Singer, Florian; Yammine, Sophie; Casaulta, Carmen; Latzin, Philipp

    2013-12-01

    We studied the ability of 4 single-breath gas washout (SBW) tests to measure immediate effects of airway clearance in children with CF. 25 children aged 4-16 years with CF performed pulmonary function tests to assess short-term variability at baseline and response to routine airway clearance. Tidal helium and sulfur hexafluoride (double-tracer gas: DTG) SBW, tidal capnography, tidal and vital capacity nitrogen (N2) SBW and spirometry were applied. We analyzed the gasses' phase III slope (SnIII--normalized for tidal volume) and FEV1 from spirometry. SnIII from tidal DTG-SBW, SnIII from vital capacity N2-SBW, and FEV1 improved significantly after airway clearance. From these tests, individual change of SnIII from tidal DTG-SBW and FEV1 exceeded short-term variability in 10 and 6 children. With the tidal DTG-SBW, an easy and promising test for peripheral gas mixing efficiency, immediate pulmonary function response to airway clearance can be assessed in CF children. Copyright © 2013 European Cystic Fibrosis Society. Published by Elsevier B.V. All rights reserved.

  4. Assessment of metals bioavailability to vegetables under field conditions using DGT, single extractions and multivariate statistics

    Science.gov (United States)

    2012-01-01

    Background: The bioavailability of metals in soils is commonly assessed by chemical extractions; however, a generally accepted method is not yet established. In this study, the effectiveness of the Diffusive Gradients in Thin-films (DGT) technique and single extractions in the assessment of metals bioaccumulation in vegetables, and the influence of soil parameters on phytoavailability, were evaluated using multivariate statistics. Soil and plants grown in vegetable gardens from mining-affected rural areas, NW Romania, were collected and analysed. Results: Pseudo-total metal content of Cu, Zn and Cd in soil ranged between 17.3–146 mg kg⁻¹, 141–833 mg kg⁻¹ and 0.15–2.05 mg kg⁻¹, respectively, showing enriched contents of these elements. High degrees of metals extractability in 1M HCl and even in 1M NH4Cl were observed. Despite the relatively high total metal concentrations in soil, those found in vegetables were comparable to values typically reported for agricultural crops, probably due to the low concentrations of metals in soil solution (Csoln) and low effective concentrations (CE), assessed by the DGT technique. Among the analysed vegetables, the highest metal concentrations were found in carrot roots. By applying multivariate statistics, it was found that CE, Csoln and extraction in 1M NH4Cl were better predictors for metals bioavailability than the acid extractions applied in this study. Copper transfer to vegetables was strongly influenced by soil organic carbon (OC) and cation exchange capacity (CEC), while pH had a higher influence on Cd transfer from soil to plants. Conclusions: The results showed that DGT can be used for general evaluation of the risks associated with soil contamination with Cu, Zn and Cd in field conditions, although quantitative information on metals transfer from soil to vegetables was not observed. PMID:23079133

  5. Selecting Items for Criterion-Referenced Tests.

    Science.gov (United States)

    Mellenbergh, Gideon J.; van der Linden, Wim J.

    1982-01-01

    Three item selection methods for criterion-referenced tests are examined: the classical theory of item difficulty and item-test correlation; the latent trait theory of item characteristic curves; and a decision-theoretic approach for optimal item selection. Item contribution to the standardized expected utility of mastery testing is discussed. (CM)
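
    The first of the three methods, the classical approach based on item difficulty and item-test correlation, can be illustrated with a short Python sketch. Item difficulty is taken here as the proportion of examinees answering correctly, and the item-test correlation as the uncorrected Pearson correlation between the item score and the total score; these are the common textbook definitions and are an assumption, since the record does not spell them out. The response matrix is hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences with non-zero variance."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def item_statistics(responses):
    """Return (difficulty, item-test correlation) per item for rows of 0/1 scores."""
    totals = [sum(row) for row in responses]
    stats = []
    for j in range(len(responses[0])):
        item = [row[j] for row in responses]
        difficulty = sum(item) / len(item)   # proportion answering correctly
        item_test_r = pearson(item, totals)  # uncorrected item-total correlation
        stats.append((difficulty, item_test_r))
    return stats


# Hypothetical data: 4 examinees x 3 dichotomous items
responses = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
]
stats = item_statistics(responses)
print(stats)
```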

  6. Semiparametric Item Response Functions in the Context of Guessing

    Science.gov (United States)

    Falk, Carl F.; Cai, Li

    2016-01-01

    We present a logistic function of a monotonic polynomial with a lower asymptote, allowing additional flexibility beyond the three-parameter logistic model. We develop a maximum marginal likelihood-based approach to estimate the item parameters. The new item response model is demonstrated on math assessment data from a state, and a computationally…
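
    For orientation, the standard three-parameter logistic (3PL) model and one way to replace its linear predictor with a monotonic polynomial can be sketched in Python. The cubic form and the non-negativity constraint below are illustrative assumptions chosen to keep the curve non-decreasing; they are not the authors' parameterisation, and all function names are hypothetical.

```python
import math


def logistic(x):
    return 1.0 / (1.0 + math.exp(-x))


def p_3pl(theta, a, b, c):
    """Standard 3PL: lower asymptote c, discrimination a, difficulty b."""
    return c + (1.0 - c) * logistic(a * (theta - b))


def p_poly(theta, coeffs, c):
    """Lower asymptote c plus a logistic function of b0 + b1*theta + b3*theta**3.

    Requiring b1 >= 0 and b3 >= 0 keeps the polynomial non-decreasing in theta
    (an illustrative constraint, not the one used in the cited work).
    """
    b0, b1, b3 = coeffs
    if b1 < 0 or b3 < 0:
        raise ValueError("b1 and b3 must be non-negative for monotonicity")
    return c + (1.0 - c) * logistic(b0 + b1 * theta + b3 * theta ** 3)


# With b3 = 0 the polynomial model reduces to the 3PL with a = b1, b = -b0 / b1
p_linear = p_poly(0.7, (-0.5, 1.2, 0.0), 0.2)
p_reference = p_3pl(0.7, 1.2, 0.5 / 1.2, 0.2)
print(p_linear, p_reference)
```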

  7. INITIAL SINGLE-SHELL TANK (SST) SYSTEM PERFORMANCE ASSESSMENT OF THE HANFORD SITE

    International Nuclear Information System (INIS)

    JARAYSI, M.N.

    2007-01-01

    The "Initial Single-Shell Tank System Performance Assessment for the Hanford Site" [1] (SST PA) presents the analysis of the long-term impacts of residual wastes assumed to remain after retrieval of tank waste and closure of the SST farms at the US Department of Energy (DOE) Hanford Site. The SST PA supports key elements of the closure process agreed upon in 2004 by DOE, the Washington State Department of Ecology (Ecology), and the US Environmental Protection Agency (EPA). The SST PA element is defined in Appendix I of the "Hanford Federal Facility Agreement and Consent Order" (HFFACO) (Ecology et al. 1989) [2], the document that establishes the overall closure process for the SST and double-shell tank (DST) systems. The approach incorporated in the SST PA integrates substantive features of both hazardous and radioactive waste management regulations into a single analysis. The defense-in-depth approach used in this analysis defined two major engineering barriers (a surface barrier and the grouted tank structure) and one natural barrier (the vadose zone) that will be relied on to control waste release into the accessible environment and attain expected performance metrics. The analysis evaluates specific barrier characteristics and other site features that influence contaminant migration by the various pathways. A "reference" case and a suite of sensitivity/uncertainty cases are considered. The "reference" case evaluates environmental impacts assuming central tendency estimates of site conditions. "Reference" case analysis results show residual tank waste impacts on nearby groundwater, air resources, or inadvertent intruders to be well below most important performance objectives. Conversely, past releases to the soil, from previous tank farm operations, are shown to have groundwater impacts that are significantly above most performance objectives. Sensitivity/uncertainty cases examine single and multiple parameter variability along with plausible alternatives

  8. Recent advances in magnesium assessment: From single selective sensors to multisensory approach.

    Science.gov (United States)

    Lvova, Larisa; Gonçalves, Carla Guanais; Di Natale, Corrado; Legin, Andrey; Kirsanov, Dmitry; Paolesse, Roberto

    2018-03-01

    The development of efficient analytical procedures for the selective detection of magnesium is an important analytical task, since this element is one of the most abundant metals in cells and plays an essential role in many cellular processes. Magnesium imbalance has been related to several pathologies and diseases in plants and animals, as well as in humans, but suitable methods for magnesium detection, especially in living samples and biological environments, are scarce. Chemical sensors, owing to their high reliability, simplicity of handling and instrumentation, and fast, real-time, in situ and on-site analysis, are promising candidates for magnesium analysis and represent an attractive alternative to the standard instrumental methods. Here the achievements of the last decade in the development of chemical sensors for magnesium ion detection are reviewed. The working principles and the main types of sensors applied are described. Focus is placed on the application of optical sensors and multisensory systems for magnesium assessment in different media. Further, a critical outlook on the use of the multisensory approach in comparison with single selective sensors in biological samples is presented. Copyright © 2017 Elsevier B.V. All rights reserved.

  9. Assessing Motor Fluctuations in Parkinson's Disease Patients Based on a Single Inertial Sensor.

    Science.gov (United States)

    Pérez-López, Carlos; Samà, Albert; Rodríguez-Martín, Daniel; Català, Andreu; Cabestany, Joan; Moreno-Arostegui, Juan Manuel; de Mingo, Eva; Rodríguez-Molinero, Alejandro

    2016-12-15

    Altered movement control is typically the first noticeable symptom manifested by Parkinson's disease (PD) patients. Once under treatment, the effect of the medication is very patent and patients often recover correct movement control over several hours. Nonetheless, as the disease advances, patients present motor complications. Obtaining precise information on the long-term evolution of these motor complications and their short-term fluctuations is crucial to provide optimal therapy to PD patients and to properly measure the outcome of clinical trials. This paper presents an algorithm based on the accelerometer signals provided by a waist sensor that has been validated in the automatic assessment of patients' motor fluctuations (ON and OFF motor states) during their activities of daily living. A total of 15 patients participated in the experiments in ambulatory conditions for 1 to 3 days. The state recognised by the algorithm and the motor state annotated by patients in standard diaries are contrasted. Results show that the average specificity and sensitivity are higher than 90%, while their values are higher than 80% for all patients, thereby showing that PD motor status can be monitored through a single sensor during the daily life of patients in a precise and objective way.
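
    The comparison between algorithm output and patient diaries reduces to a two-class confusion count, sketched below in Python. Treating OFF as the positive class is an assumption made for illustration, and the label sequences are hypothetical rather than data from the study.

```python
def sensitivity_specificity(predicted, reference, positive="OFF"):
    """Sensitivity and specificity of predicted labels against reference labels."""
    tp = fn = tn = fp = 0
    for p, r in zip(predicted, reference):
        if r == positive:
            if p == positive:
                tp += 1  # positive state correctly detected
            else:
                fn += 1  # positive state missed
        else:
            if p == positive:
                fp += 1  # positive state reported when the diary says otherwise
            else:
                tn += 1  # negative state correctly detected
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical per-interval labels: algorithm output versus diary annotation
algorithm = ["ON", "ON", "OFF", "OFF", "ON", "OFF", "ON", "OFF"]
diary = ["ON", "ON", "OFF", "ON", "ON", "OFF", "ON", "OFF"]
sens, spec = sensitivity_specificity(algorithm, diary)
print(sens, spec)
```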

  10. RELIABILITY OF KINEMATICS AND KINETICS ASSOCIATED WITH HORIZONTAL SINGLE LEG DROP JUMP ASSESSMENT. A BRIEF REPORT

    Directory of Open Access Journals (Sweden)

    Markus Stålbom

    2007-06-01

    Determining the reliability of a unilateral horizontal drop jump for displacement provided the focus for this research. Eighteen male subjects were required to step off a 20cm box and land on a force plate with one leg and thereafter jump for maximal horizontal displacement on two different days. Dependent variables from the jump assessment included mean and peak vertical (V) and horizontal (H) ground reaction forces (GRF) and impulses, horizontal displacement and contact time. The between-trial variability of all kinematic and kinetic measures was less than 7%. The most consistent measure over both trials was the horizontal displacement jumped (1.2 to 1.4%) and the most variable were the contact time on the first day (6.5%) and peak HGRF on the second day (4.3%). In all cases there was less variation associated with the second rather than the first day. In terms of test-retest variability the percent changes in the means and coefficients of variation (CVs) were all under 10%. The smallest changes in the mean (0.43%), least variation (< 2.26%) and second highest intraclass correlation coefficient (ICC = 0.95) were found for horizontal displacement jumped. The highest ICC (0.96) was found for horizontal impulse. Given the reliability of the single leg drop jump, it may offer better prognostic and diagnostic information than that obtained with bilateral vertical jumps.
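
    Two of the reliability statistics named in this record, the coefficient of variation and the percent change in the mean between days, can be computed as in the following Python sketch. The plain definition CV = SD / mean × 100 is assumed here; the study may have used a different estimator (for example one based on typical error), and the jump distances are hypothetical.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Sample standard deviation as a percentage of the mean."""
    return stdev(values) / mean(values) * 100.0


def percent_change_in_mean(day1, day2):
    """Change in the mean from day 1 to day 2, as a percentage of the day 1 mean."""
    return (mean(day2) - mean(day1)) / mean(day1) * 100.0


# Hypothetical horizontal jump distances in metres for one subject
day1 = [1.52, 1.55, 1.50]
day2 = [1.54, 1.56, 1.53]
cv_day1 = coefficient_of_variation(day1)
cv_day2 = coefficient_of_variation(day2)
change = percent_change_in_mean(day1, day2)
print(round(cv_day1, 2), round(cv_day2, 2), round(change, 2))
```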

  11. Online Angiography Image-Based FFR Assessment During Coronary Catheterization: A Single-Center Study.

    Science.gov (United States)

    Kornowski, Ran; Vaknin-Assa, Hana; Assali, Abid; Greenberg, Gabriel; Valtzer, Orna; Lavi, Ifat

    2018-03-15

    To assess the diagnostic performance of angiography-derived fractional flow reserve (FFRangio) measurements in patients with stable coronary artery disease when used online in the catheterization laboratory during routine coronary angiography. FFR, an index of the hemodynamic severity of coronary stenosis, is derived from invasive measurements using a pressure-monitoring guidewire and hyperemic stimulus. While FFR is the gold standard, it remains under-utilized. FFRangio may have several advantages owing to the reduced operator time, no wire-related or procedural complications, and no need for administration of vasodilators. FFRangio is a novel technology that uses a patient's hemodynamic data and routine angiograms to generate FFR values at each point along the coronary tree. We present the online application of the system where FFRangio was successfully used in the catheterization laboratory during routine coronary angiography and compared to invasive FFR. Fifty-three patients (79% men) and 60 coronary lesions were analyzed. Values derived using FFRangio ranged from 0.58-0.96 and correlated closely (Pearson's correlation coefficient, r=0.91; Psystem. In this single-center experience, FFRangio values showed high correlation rates to invasive FFR.

  12. Retrieval of very large numbers of items in the Web of Science: an exercise to develop accurate search strategies

    NARCIS (Netherlands)

    Arencibia-Jorge, R.; Leydesdorff, L.; Chinchilla-Rodríguez, Z.; Rousseau, R.; Paris, S.W.

    2009-01-01

    The Web of Science interface counts at most 100,000 retrieved items from a single query. If the query results in a dataset containing more than 100,000 items the number of retrieved items is indicated as >100,000. The problem studied here is how to find the exact number of items in a query that

  13. 47 CFR 76.985 - Subscriber bill itemization.

    Science.gov (United States)

    2010-10-01

    ...) The amount of the total bill assessed as a franchise fee and the identity of the franchising authority... fees and costs itemized pursuant to this section. (c) Local franchising authorities may adopt...

  14. Dimensionality of the UWES-17: An item response modelling analysis

    Directory of Open Access Journals (Sweden)

    Deon P. de Bruin

    2013-10-01

    Research purpose: The main focus of this study was to use the Rasch model to provide insight into the dimensionality of the UWES-17, and to assess whether work engagement should be interpreted as one single overall score, three separate scores, or a combination. Motivation for the study: It is unclear whether a summative score is more representative of work engagement or whether scores are more meaningful when interpreted for each dimension separately. Previous work relied on confirmatory factor analysis; the potential of item response models has not been tapped. Research design: A quantitative cross-sectional survey design approach was used. Participants, 2429 employees of a South African Information and Communication Technology (ICT) company, completed the UWES-17. Main findings: Findings indicate that work engagement should be treated as a unidimensional construct: individual scores should be interpreted in a summative manner, giving a single global score. Practical/managerial implications: Users of the UWES-17 may interpret a single, summative score for work engagement. Findings of this study should also contribute towards standardising UWES-17 scores, allowing meaningful comparisons to be made. Contribution/value-add: The findings will benefit researchers, organisational consultants and managers. Clarity on dimensionality and interpretation of work engagement will assist researchers in future studies. Managers and consultants will be able to make better-informed decisions when using work engagement data.

  15. Science Library of Test Items. Volume Twenty-One. A Collection of Multiple Choice Test Items Relating Mainly to Physics, 2.

    Science.gov (United States)

    New South Wales Dept. of Education, Sydney (Australia).

    As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…

  16. Using a Process Dissociation Approach to Assess Verbal Short-Term Memory for Item and Order Information in a Sample of Individuals with a Self-Reported Diagnosis of Dyslexia.

    Science.gov (United States)

    Wang, Xiaoli; Xuan, Yifu; Jarrold, Christopher

    2016-01-01

    Previous studies have examined whether difficulties in short-term memory for verbal information, that might be associated with dyslexia, are driven by problems in retaining either information about to-be-remembered items or the order in which these items were presented. However, such studies have not used process-pure measures of short-term memory for item or order information. In this work we adapt a process dissociation procedure to properly distinguish the contributions of item and order processes to verbal short-term memory in a group of 28 adults with a self-reported diagnosis of dyslexia and a comparison sample of 29 adults without a dyslexia diagnosis. In contrast to previous work that has suggested that individuals with dyslexia experience item deficits resulting from inefficient phonological representation and language-independent order memory deficits, the results showed no evidence of specific problems in short-term retention of either item or order information among the individuals with a self-reported diagnosis of dyslexia, despite this group showing expected difficulties on separate measures of word and non-word reading. However, there was some suggestive evidence of a link between order memory for verbal material and individual differences in non-word reading, consistent with other claims for a role of order memory in phonologically mediated reading. The data from the current study therefore provide empirical evidence to question the extent to which item and order short-term memory are necessarily impaired in dyslexia.

  17. Telemetric assessment of social and single housing: Evaluation of electrocardiographic intervals in jacketed cynomolgus monkeys.

    Science.gov (United States)

    Kaiser, Robert A; Tichenor, Stephen D; Regalia, Douglas E; York, Kristina; Holzgrefe, Henry H

    2015-01-01

    Proactive efforts to socially house laboratory animals are a contemporary, important focus for enhancing animal welfare. Jacketing cynomolgus monkeys has been traditionally considered an exclusionary criterion for social housing based on unsubstantiated concerns that study conduct or telemetry equipment might be compromised. Our objective was to evaluate the effects of jacketing naïve, adolescent cynomolgus monkeys in different single and social housing types based on parallel comparisons of heart rate. Eight naïve cynomolgus monkeys were randomized into pairs and ECG data were collected for 24h from each animal in each housing condition using a crossover design. Caging paradigms consisted of standard individual, standard pair, quaternary pair (4 linked cages), and European-style pair housing in non-sequential order varied by pair to control for possible time bias. Dosing and blood collection procedures were performed to characterize any effects of housing on ECG data during study conduct. There was no increase in the incidence of equipment damage in pair vs. individually housed animals. Further, animals in all 4 housing paradigms showed similar acclimation assessed as heart rate (mean 139-154 beats per minute), and maintained similar diurnal rhythms, with an expected slowing of the heart rate at night (aggregate lights out HR 110±4bpm compared to daytime 146±7bpm). This study demonstrates the effects of different social access and housing types on the study-naïve cynomolgus monkeys during jacketed cardiovascular telemetry data collection in a repeat-dose toxicology study design. There were no discernible effects of social housing on baseline ECG parameters collected via jacketed telemetry, and all animals maintained expected diurnal rhythms in all housing settings tested. These data demonstrate that cynomolgus monkeys can be socially housed during data collection as a standard practice, consistent with global efforts to improve study animal welfare.

  18. Single nucleotide polymorphisms for assessing genetic diversity in castor bean (Ricinus communis)

    Directory of Open Access Journals (Sweden)

    Rabinowicz Pablo D

    2010-01-01

    Abstract Background Castor bean (Ricinus communis) is an agricultural crop and garden ornamental that is widely cultivated and has been introduced worldwide. Understanding population structure and the distribution of castor bean cultivars has been challenging because of limited genetic variability. We analyzed the population genetics of R. communis in a worldwide collection of plants from germplasm and from naturalized populations in Florida, U.S. To assess genetic diversity we conducted survey sequencing of the genomes of seven diverse cultivars and compared the data to a reference genome assembly of a widespread cultivar (Hale). We determined the population genetic structure of 676 samples using single nucleotide polymorphisms (SNPs) at 48 loci. Results Bayesian clustering indicated five main groups worldwide and a repeated pattern of mixed genotypes in most countries. High levels of population differentiation occurred between most populations but this structure was not geographically based. Most molecular variance occurred within populations (74%) followed by 22% among populations, and 4% among continents. Samples from naturalized populations in Florida indicated significant population structuring consistent with local demes. There was significant population differentiation for 56 of 78 comparisons in Florida (pairwise population ϕPT values, p Conclusion Low levels of genetic diversity and mixing of genotypes have led to minimal geographic structuring of castor bean populations worldwide. Relatively few lineages occur and these are widely distributed. Our approach of determining population genetic structure using SNPs from genome-wide comparisons constitutes a framework for high-throughput analyses of genetic diversity in plants, particularly in species with limited genetic diversity.

  19. Evaluating the quality of medical multiple-choice items created with automated processes.

    Science.gov (United States)

    Gierl, Mark J; Lai, Hollis

    2013-07-01

    Computerised assessment raises formidable challenges because it requires large numbers of test items. Automatic item generation (AIG) can help address this test development problem because it yields large numbers of new items both quickly and efficiently. To date, however, the quality of the items produced using a generative approach has not been evaluated. The purpose of this study was to determine whether automatic processes yield items that meet standards of quality that are appropriate for medical testing. Quality was evaluated firstly by subjecting items created using both AIG and traditional processes to rating by a four-member expert medical panel using indicators of multiple-choice item quality, and secondly by asking the panellists to identify which items were developed using AIG in a blind review. Fifteen items from the domain of therapeutics were created in three different experimental test development conditions. The first 15 items were created by content specialists using traditional test development methods (Group 1 Traditional). The second 15 items were created by the same content specialists using AIG methods (Group 1 AIG). The third 15 items were created by a new group of content specialists using traditional methods (Group 2 Traditional). These 45 items were then evaluated for quality by a four-member panel of medical experts and were subsequently categorised as either Traditional or AIG items. Three outcomes were reported: (i) the items produced using traditional and AIG processes were comparable on seven of eight indicators of multiple-choice item quality; (ii) AIG items can be differentiated from Traditional items by the quality of their distractors, and (iii) the overall predictive accuracy of the four expert medical panellists was 42%. Items generated by AIG methods are, for the most part, equivalent to traditionally developed items from the perspective of expert medical reviewers. While the AIG method produced comparatively fewer plausible

  20. Effects of Reducing the Cognitive Load of Mathematics Test Items on Student Performance

    Directory of Open Access Journals (Sweden)

    Susan C. Gillmor

    2015-01-01

    This study explores a new item-writing framework for improving the validity of math assessment items. The authors transfer insights from Cognitive Load Theory (CLT), traditionally used in instructional design, to educational measurement. Fifteen multiple-choice math assessment items were modified using research-based strategies for reducing extraneous cognitive load. An experimental design with 222 middle-school students tested the effects of the reduced cognitive load items on student performance and anxiety. Significant findings confirm the main research hypothesis that reducing the cognitive load of math assessment items improves student performance. Three load-reducing item modifications are identified as particularly effective for reducing item difficulty: signalling important information, aesthetic item organization, and removing extraneous content. Load reduction was not shown to impact student anxiety. Implications for classroom assessment and future research are discussed.

  1. Improved Approximation Algorithms for Item Pricing with Bounded Degree and Valuation

    Science.gov (United States)

    Hamane, Ryoso; Itoh, Toshiya

    When a store sells items to customers, the store wishes to decide the prices of the items to maximize its profit. If the store sells the items with low (resp. high) prices, the customers buy more (resp. fewer) items, which provides less profit to the store. It would be hard for the store to decide the prices of items. Assume that a store has a set V of n items and there is a set C of m customers who wish to buy those items. The goal of the store is to decide the price of each item to maximize its profit. We refer to this maximization problem as an item pricing problem. We classify the item pricing problems according to how many items the store can sell or how the customers value the items. If the store can sell every item i in unlimited (resp. limited) quantity, we refer to this as unlimited supply (resp. limited supply). We say that the item pricing problem is single-minded if each customer j ∈ C wishes to buy a set e_j ⊆ V of items and assigns valuation w(e_j) ≥ 0. For the single-minded item pricing problems (in unlimited supply), Balcan and Blum regarded them as weighted k-hypergraphs and gave several approximation algorithms. In this paper, we focus on the (pseudo) degree of k-hypergraphs and the valuation ratio, i.e., the ratio between the smallest and the largest valuations. Then for the single-minded item pricing problems (in unlimited supply), we show improved approximation algorithms (for k-hypergraphs, general graphs, bipartite graphs, etc.) with respect to the maximum (pseudo) degree and the valuation ratio.
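
    The objective described in this abstract can be made concrete with a small sketch. It assumes the standard reading of the single-minded, unlimited-supply setting: customer j buys the whole set e_j exactly when the summed item prices do not exceed w(e_j), and then pays that sum. The uniform-price search below is a simple baseline added for illustration only, not the approximation algorithm of the paper, and all names and numbers are hypothetical.

```python
from typing import Dict, FrozenSet, Iterable, List, Tuple

# A customer is (wanted bundle e_j, valuation w(e_j)).
Customer = Tuple[FrozenSet[str], float]


def profit(prices: Dict[str, float], customers: List[Customer]) -> float:
    """Store profit under unlimited supply for single-minded customers."""
    total = 0.0
    for bundle, valuation in customers:
        cost = sum(prices[item] for item in bundle)
        # The customer buys the whole bundle only if it is affordable.
        if cost <= valuation + 1e-9:  # small tolerance for float rounding
            total += cost
    return total


def best_uniform_price(items: Iterable[str],
                       customers: List[Customer]) -> Tuple[float, float]:
    """Baseline: give every item the same price and keep the best such price.

    With one shared price p, profit only drops when p crosses some
    threshold w(e_j) / |e_j|, so it suffices to try those thresholds.
    """
    items = list(items)
    candidates = {valuation / len(bundle)
                  for bundle, valuation in customers if bundle}
    best_price, best_profit = 0.0, 0.0
    for price in candidates:
        value = profit({item: price for item in items}, customers)
        if value > best_profit:
            best_price, best_profit = price, value
    return best_price, best_profit


if __name__ == "__main__":
    # Hypothetical instance: three items, three single-minded customers.
    customers = [
        (frozenset({"a", "b"}), 10.0),
        (frozenset({"b", "c"}), 6.0),
        (frozenset({"a"}), 4.0),
    ]
    print(profit({"a": 4.0, "b": 2.0, "c": 4.0}, customers))
    print(best_uniform_price(["a", "b", "c"], customers))
```

    In this toy instance, item-specific prices earn more than any single shared price, which is the gap that per-item pricing algorithms try to exploit.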

  2. Design of a box trainer for objective assessment of technical skills in single-port surgery

    NARCIS (Netherlands)

    Horeman, Tim; Sun, Siyu; Tuijthof, Gabrielle J. M.; Jansen, Frank William; Meijerink, Jeroen W. J. H. J.; Dankelman, Jenny

    2015-01-01

    Laparoscopic single-port (SP) surgery uses only a single entry point for all instruments. The approach of SP has been applied in multiple laparoscopic disciplines owing to its improved cosmetic result. However, in SP surgery, instrument movements are further restricted, resulting in increased

  3. Optimal sampling strategies to assess inulin clearance in children by the inulin single-injection method

    NARCIS (Netherlands)

    van Rossum, Lyonne K.; Mathot, Ron A. A.; Cransberg, Karlien; Vulto, Arnold G.

    2003-01-01

    Glomerular filtration rate in patients can be determined by estimating the plasma clearance of inulin with the single-injection method. In this method, a single bolus injection of inulin is administered and several blood samples are collected. For practical and convenient application of this method

  4. Assessment of changes in regional cerebral blood flow in patients with major depression using the 99mTc-HMPAO single photon emission tomography method

    International Nuclear Information System (INIS)

    Yazici, K.; Kapucu, Oe.; Erbas, B.; Varoglu, E.; Guelec, C.; Bekdik, C.F.; Hacettepe Univ., Ankara

    1992-01-01

    Regional cerebral blood flow was investigated in 14 patients with major depression diagnosed according to the DSM-III-R criteria (six patients with single and eight patients with recurrent episodes) and in ten healthy volunteers. The mean ages of the patients and the controls were 33.5±2.7 and 31.6±2.6 years, respectively. The severity of the depression was assessed using the 17-item Hamilton Depression Scale (mean: 23.2±1.5). None of the patients was under medication. After administration of 500 MBq technetium-99m hexamethylpropylene amine oxime, a single photon emission tomography study was performed and then transaxial, sagittal and coronal slices were obtained. For the semiquantitative analysis of the data, the ratios of the mean counts/pixel to the whole slice were calculated for 24 regions on three consecutive transaxial slices in the orbitomeatal plane. Additionally, left/right and frontal/occipital ratios were calculated. Both sides of the temporal region had a significantly decreased cerebral blood flow (CBF) when compared to the controls. The left/right ratio of the prefrontal region was also significantly lower in the patients than in the controls. The Hamilton score had a negative correlation with blood flow in the anterofrontal and left prefrontal regions. According to our results, regional CBF seems to be decreased in the left prefrontal and in both temporal regions in major depression. The severity of depression is correlated with the reduction in CBF in the regions of the anterofrontal and left prefrontal cortex. (orig.)

  5. Assessment of left ventricular function by electrocardiogram-gated myocardial single photon emission computed tomography using quantitative gated single photon emission computed tomography software

    International Nuclear Information System (INIS)

    Morita, Koichi; Adachi, Itaru; Konno, Masanori

    1999-01-01

    Electrocardiogram (ECG)-gated myocardial single photon emission computed tomography (SPECT) can assess left ventricular (LV) perfusion and function easily using quantitative gated SPECT (QGS) software. ECG-gated SPECT was performed in 44 patients with coronary artery disease under post-stress and resting conditions to assess the values of LV functional parameters, by comparison to LV ejection fraction derived from gated blood pool scan and myocardial characteristics. A good correlation was obtained between ejection fraction using QGS and that using cardiac blood pool scan (r=0.812). Some patients with myocardial ischemia had lower ejection fraction under post-stress compared to resting conditions, indicating post-stress LV dysfunction. LV wall motion and wall thickening were significantly impaired in ischemic and infarcted myocardium, and the degree of abnormality in the infarcted areas was greater than in the ischemic areas. LV functional parameters derived using QGS were useful to assess post-stress LV dysfunction and myocardial viability. In conclusion, ECG-gated myocardial SPECT permits simultaneous quantitative assessment of myocardial perfusion and function. (author)

  6. 48 CFR 852.214-72 - Alternate item(s).

    Science.gov (United States)

    2010-10-01

    ... AND FORMS SOLICITATION PROVISIONS AND CONTRACT CLAUSES Texts of Provisions and Clauses 852.214-72... 2008) Bids on []* will be given equal consideration along with bids on []** and any such bids received... [].** * Contracting officer will insert an alternate item that is considered acceptable. ** Contracting officer will...

  7. Can Item Keyword Feedback Help Remediate Knowledge Gaps?

    Science.gov (United States)

    Feinberg, Richard A; Clauser, Amanda L

    2016-10-01

    In graduate medical education, assessment results can effectively guide professional development when both assessment and feedback support a formative model. When individuals cannot directly access the test questions and responses, a way of using assessment results formatively is to provide item keyword feedback. The purpose of the following study was to investigate whether exposure to item keyword feedback aids in learner remediation. Participants included 319 trainees who completed a medical subspecialty in-training examination (ITE) in 2012 as first-year fellows, and then 1 year later in 2013 as second-year fellows. Performance on 2013 ITE items in which keywords were, or were not, exposed as part of the 2012 ITE score feedback was compared across groups based on the amount of time studying (preparation). For the same items common to both 2012 and 2013 ITEs, response patterns were analyzed to investigate changes in answer selection. Test takers who indicated greater amounts of preparation on the 2013 ITE did not perform better on the items in which keywords were exposed compared to those who were not exposed. The response pattern analysis substantiated overall growth in performance from the 2012 ITE. For items with incorrect responses on both attempts, examinees selected the same option 58% of the time. Results from the current study were unsuccessful in supporting the use of item keywords in aiding remediation. Unfortunately, the results did provide evidence of examinees retaining misinformation.

  8. Identifying predictors of physics item difficulty: A linear regression approach

    Science.gov (United States)

    Mesic, Vanes; Muratovic, Hasnija

    2011-06-01

    Large-scale assessments of student achievement in physics are often approached with an intention to discriminate students based on the attained level of their physics competencies. Therefore, for purposes of test design, it is important that items display an acceptable discriminatory behavior. To that end, it is recommended to avoid extraordinarily difficult and very easy items. Knowing the factors that influence physics item difficulty makes it possible to model the item difficulty even before the first pilot study is conducted. Thus, by identifying predictors of physics item difficulty, we can improve the test-design process. Furthermore, we get additional qualitative feedback regarding the basic aspects of student cognitive achievement in physics that are directly responsible for the obtained, quantitative test results. In this study, we conducted a secondary analysis of data that came from two large-scale assessments of student physics achievement at the end of compulsory education in Bosnia and Herzegovina. Foremost, we explored the concept of “physics competence” and performed a content analysis of 123 physics items that were included within the above-mentioned assessments. Thereafter, an item database was created. Items were described by variables which reflect some basic cognitive aspects of physics competence. For each of the assessments, Rasch item difficulties were calculated in separate analyses. In order to make the item difficulties from different assessments comparable, a virtual test equating procedure had to be implemented. Finally, a regression model of physics item difficulty was created. It has been shown that 61.2% of item difficulty variance can be explained by factors which reflect the automaticity, complexity, and modality of the knowledge structure that is relevant for generating the most probable correct solution, as well as by the divergence of required thinking and interference effects between intuitive and formal physics knowledge

  9. Identifying predictors of physics item difficulty: A linear regression approach

    Directory of Open Access Journals (Sweden)

    Hasnija Muratovic

    2011-06-01

    Large-scale assessments of student achievement in physics are often approached with an intention to discriminate students based on the attained level of their physics competencies. Therefore, for purposes of test design, it is important that items display an acceptable discriminatory behavior. To that end, it is recommended to avoid extraordinarily difficult and very easy items. Knowing the factors that influence physics item difficulty makes it possible to model the item difficulty even before the first pilot study is conducted. Thus, by identifying predictors of physics item difficulty, we can improve the test-design process. Furthermore, we get additional qualitative feedback regarding the basic aspects of student cognitive achievement in physics that are directly responsible for the obtained, quantitative test results. In this study, we conducted a secondary analysis of data that came from two large-scale assessments of student physics achievement at the end of compulsory education in Bosnia and Herzegovina. Foremost, we explored the concept of “physics competence” and performed a content analysis of 123 physics items that were included within the above-mentioned assessments. Thereafter, an item database was created. Items were described by variables which reflect some basic cognitive aspects of physics competence. For each of the assessments, Rasch item difficulties were calculated in separate analyses. In order to make the item difficulties from different assessments comparable, a virtual test equating procedure had to be implemented. Finally, a regression model of physics item difficulty was created. It has been shown that 61.2% of item difficulty variance can be explained by factors which reflect the automaticity, complexity, and modality of the knowledge structure that is relevant for generating the most probable correct solution, as well as by the divergence of required thinking and interference effects between intuitive and formal

  10. Brief Sensation Seeking Scale: Latent structure of 8-item and 4-item versions in Peruvian adolescents.

    Science.gov (United States)

    Merino-Soto, Cesar; Salas Blas, Edwin

    2018-01-01

    This research intended to validate two brief scales of sensation seeking with Peruvian adolescents: the eight-item scale (BSSS8; Hoyle, Stephenson, Palmgreen, Lorch, & Donohew, 2002) and the four-item scale (BSSS4; Stephenson, Hoyle, Slater, & Palmgreen, 2003). Questionnaires were administered to 618 voluntary participants, with an average age of 13.6 years, from different levels of high school, in state and private schools in a district in the south of Lima. The internal structure of both short versions was analyzed using three models: a) unidimensional (M1), b) oblique or related dimensions (M2), and c) the bifactor model (M3). Results show that both instruments have a single dimension which best represents the variability of the items; a fact that can be explained both by the complexity of the concept and by the small number of items representing each factor, which is more noticeable in the BSSS4. Reliability is within levels found by previous studies: alpha was .745 for the BSSS8 and .643 for the BSSS4; the omega coefficient was .747 for the BSSS8 and .651 for the BSSS4. These are considered suitable for the type of instruments studied. Based on the correlation between the two instruments, it was found that there are satisfactory levels of equivalence between the BSSS8 and BSSS4. However, it is recommended that the BSSS4 be used mainly for research and for the purpose of describing populations.
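
    The alpha values reported in this abstract are Cronbach's alpha coefficients. A minimal sketch of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), is given below. It assumes complete data with one row per respondent; the response matrix is hypothetical and unrelated to the study's data, and the omega coefficient is not covered because it requires a fitted factor model.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a complete respondents-by-items score matrix.

    scores: list of rows, each row holding one respondent's k item scores.
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Sample variance of each item (column) across respondents.
    item_variances = [variance([row[j] for row in scores]) for j in range(k)]
    # Sample variance of each respondent's summed score.
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


if __name__ == "__main__":
    # Hypothetical 4-item responses on a 1-5 scale from five respondents.
    responses = [
        [4, 5, 4, 3],
        [2, 2, 3, 2],
        [5, 4, 5, 5],
        [3, 3, 2, 3],
        [1, 2, 1, 2],
    ]
    print(f"alpha = {cronbach_alpha(responses):.3f}")
```

    Because alpha depends on the number of items k, a four-item scale will tend to show a lower value than an eight-item scale with similar inter-item correlations, which is consistent with the BSSS4 figure being the lower of the two.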

  11. Single Amplified Genomes as Source for Novel Extremozymes: Annotation, Expression and Functional Assessment

    KAUST Repository

    Grötzinger, Stefan

    2017-12-01

    Enzymes, as nature’s catalysts, show remarkable abilities that can revolutionize the chemical, biotechnological, bioremediation, agricultural and pharmaceutical industries. However, the narrow range of stability of the majority of described biocatalysts limits their use for many applications. To overcome these restrictions, extremozymes derived from microorganisms thriving under harsh conditions can be used. Extremophiles living in high salinity are especially interesting as they operate at low water activity, which is similar to conditions used in standard chemical applications. Because only about 0.1 % of all microorganisms can be cultured, the traditional way of culture-based enzyme function determination needs to be overcome. The rise of high-throughput next-generation-sequencing technologies allows for deep insight into nature’s variety. Single amplified genomes (SAGs) specifically allow for whole genome assemblies from small sample volumes with low cell yields, as are typical for extreme environments. Although these technologies have been available for years, the expected boost in biotechnology has held off. One of the main reasons is the lack of reliable functional annotation of the genomic data, which is caused by the low amount (0.15 %) of experimentally described genes. Here, we present a novel annotation algorithm, designed to annotate the enzymatic function of genomes from microorganisms with low homologies to described microorganisms. The algorithm was established on SAGs from the extreme environment of selected hypersaline Red Sea brine pools with 4.3 M salinity and temperatures up to 68°C. Additionally, a novel consensus pattern for the identification of γ-carbonic anhydrases was created and applied in the algorithm. To verify the annotation, selected genes were expressed in the hypersaline expression system Halobacterium salinarum. This expression system was established and optimized in a continuously stirred tank reactor, leading to

  12. Administration of follitropin alfa and lutropin alfa combined in a single injection: a feasibility assessment

    Directory of Open Access Journals (Sweden)

    Agostinetto Rita

    2009-05-01

    Abstract Background Gonadotrophins are routinely administered in assisted reproductive technology (ART) treatment protocols. Recombinant human follicle-stimulating hormone (r-hFSH; follitropin alfa) and recombinant human luteinizing hormone (r-hLH; lutropin alfa) can be administered individually or in a fixed combination. The ability to vary the FSH to LH dose ratio in a single injection without compromising the bioactivity of either gonadotrophin or generating losses of active principle is important for physicians and patients alike. Methods This study investigated whether follitropin alfa (GONAL-f®), as lyophilized powder for reconstitution or solution from the GONAL-f® (filled-by-mass [FbM]) Prefilled Pen, could be used to reconstitute Pergoveris™ (follitropin alfa/lutropin alfa 150 IU/75 IU) lyophilized powder. In Ratio Groups 1 and 2, the r-hFSH:r-hLH ratio was 3:1; in Ratio Groups 3 and 4, the ratios of r-hFSH:r-hLH were 5:1 and 8:1, respectively. The protein content and bioactivity of each mixed solution were evaluated. The r-hFSH and r-hLH content was determined using reverse-phase high performance liquid chromatography. The biological activity of r-hFSH and r-hLH was assessed using the Steelman-Pohley and Van Hell in vivo bioassays in rats, respectively. Results Follitropin alfa/lutropin alfa 150 IU/75 IU lyophilized powder could be successfully mixed with follitropin alfa 75 IU FbM solution that was either reconstituted from lyophilized powder or injected directly from the prefilled pen to create solutions with ratios of r-hFSH and r-hLH from 3:1 to 8:1. The measured content of r-hFSH and r-hLH corresponded favourably with the target protein content in Ratio Groups 1–4. The in vivo target and measured bioactivity of r-hFSH and r-hLH were also closely matched in all Ratio Groups. Conclusion Follitropin alfa lyophilized powder or solution can be accurately mixed with follitropin alfa/lutropin alfa 150 IU/75 IU lyophilized powder to

  13. Strapdown Airborne Gravimetry Quality Assessment Method Based on Single Survey Line Data: A Study by SGA-WZ02 Gravimeter

    Science.gov (United States)

    Wu, Meiping; Cao, Juliang; Zhang, Kaidong; Cai, Shaokun; Yu, Ruihang

    2018-01-01

    Quality assessment is an important part of strapdown airborne gravimetry. Root mean square error (RMSE) evaluation is the classical way to assess gravimetry quality, but it requires extra flights or reference data. This paper therefore develops a method that largely removes those preconditions and can be applied to a single survey line. Based on theoretical analysis, the method uses the stability of the two horizontal attitude angles, the horizontal specific force and the vertical specific force as the determinants of quality. Actual data collected by the SGA-WZ02 on 21 lines from 13 flights in one survey were used to build the model and elaborate the method. To substantiate its performance, the quality assessment model was applied to additional repeat-line flights from two surveys. Compared with the internal RMSE, the standard deviations of the assessment residuals are 0.23 mGal and 0.16 mGal in the two surveys, which shows that the quality assessment method is reliable and stricter. Extra flights with specially arranged routes are therefore not necessary. The method, derived from SGA-WZ02 data, is a feasible approach to assessing gravimetry quality using single-line data and is also suitable for other strapdown gravimeters. PMID:29373535

  14. Colorado Student Assessment Program: 2001 Released Passages, Items, and Prompts. Grade 4 Reading and Writing, Grade 4 Lectura y Escritura, Grade 5 Mathematics and Reading, Grade 6 Reading, Grade 7 Reading and Writing, Grade 8 Mathematics, Reading and Science, Grade 9 Reading, and Grade 10 Mathematics and Reading and Writing.

    Science.gov (United States)

    Colorado State Dept. of Education, Denver.

    This document contains released reading comprehension passages, test items, and writing prompts from the Colorado Student Assessment Program for 2001. The sample questions and prompts are included without answers or examples of student responses. Test materials are included for: (1) Grade 4 Reading and Writing; (2) Grade 4 Lectura y Escritura…

  15. A more general model for testing measurement invariance and differential item functioning.

    Science.gov (United States)

    Bauer, Daniel J

    2017-09-01

    The evaluation of measurement invariance is an important step in establishing the validity and comparability of measurements across individuals. Most commonly, measurement invariance has been examined using one of two primary latent variable modeling approaches: the multiple groups model or the multiple-indicator multiple-cause (MIMIC) model. Both approaches offer opportunities to detect differential item functioning within multi-item scales, and thereby to test measurement invariance, but both approaches also have significant limitations. The multiple groups model allows one to examine the invariance of all model parameters but only across levels of a single categorical individual difference variable (e.g., ethnicity). In contrast, the MIMIC model permits both categorical and continuous individual difference variables (e.g., sex and age) but permits only a subset of the model parameters to vary as a function of these characteristics. The current article argues that moderated nonlinear factor analysis (MNLFA) constitutes an alternative, more flexible model for evaluating measurement invariance and differential item functioning. We show that the MNLFA subsumes and combines the strengths of the multiple group and MIMIC models, allowing for a full and simultaneous assessment of measurement invariance and differential item functioning across multiple categorical and/or continuous individual difference variables. The relationships between the MNLFA model and the multiple groups and MIMIC models are shown mathematically and via an empirical demonstration. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  16. A New Functional Health Literacy Scale for Japanese Young Adults Based on Item Response Theory.

    Science.gov (United States)

    Tsubakita, Takashi; Kawazoe, Nobuo; Kasano, Eri

    2017-03-01

    Health literacy predicts health outcomes. Despite concerns surrounding the health of Japanese young adults, to date there has been no objective assessment of health literacy in this population. This study aimed to develop a Functional Health Literacy Scale for Young Adults (funHLS-YA) based on item response theory. Each item in the scale requires participants to choose the most relevant term from 3 choices in relation to a target item, thus assessing objective rather than perceived health literacy. The 20-item scale was administered to 1816 university students and 1751 responded. Cronbach's α coefficient was .73. Difficulty and discrimination parameters of each item were estimated, resulting in the exclusion of 1 item. Some items showed different difficulty parameters for male and female participants, reflecting that some aspects of health literacy may differ by gender. The current 19-item version of funHLS-YA can reliably assess the objective health literacy of Japanese young adults.
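
    The reliability figure quoted above (Cronbach's α) can be illustrated with a minimal sketch. The function name `cronbach_alpha` and the toy response table are assumptions made for illustration; they are not the study's data or code.

```python
from statistics import variance

def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score table.

    scores: list of rows, one per respondent, each a list of k item scores.
    """
    k = len(scores[0])                                   # number of items
    item_vars = [variance(col) for col in zip(*scores)]  # sample variance of each item
    total_var = variance([sum(row) for row in scores])   # sample variance of total scores
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Hypothetical toy data: 5 respondents, 4 dichotomous items (1 = correct).
toy = [
    [1, 1, 1, 0],
    [1, 0, 1, 1],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
    [0, 1, 0, 0],
]
print(round(cronbach_alpha(toy), 3))
```

    Alpha rises toward 1 as the items covary more strongly relative to their individual variances, which is why it is read as an internal-consistency index.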

  17. Modelling sequentially scored item responses

    NARCIS (Netherlands)

    Akkermans, W.

    2000-01-01

    The sequential model can be used to describe the variable resulting from a sequential scoring process. In this paper two more item response models are investigated with respect to their suitability for sequential scoring: the partial credit model and the graded response model. The investigation is

  18. Analyzing force concept inventory with item response theory

    Science.gov (United States)

    Wang, Jing; Bao, Lei

    2010-10-01

    Item response theory is a popular assessment method used in education. It rests on the assumption of a probability framework that relates students' innate ability and their performance on test questions. Item response theory transforms students' raw test scores into a scaled proficiency score, which can be used to compare results obtained with different test questions. The scaled score also addresses the issues of ceiling effects and guessing, which commonly exist in quantitative assessment. We used item response theory to analyze the force concept inventory (FCI). Our results show that item response theory can be useful for analyzing physics concept surveys such as the FCI and produces results about the individual questions and student performance that are beyond the capability of classical statistics. The theory yields detailed measurement parameters regarding the difficulty, discrimination features, and probability of correct guess for each of the FCI questions.
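
    The three parameters named above (difficulty, discrimination, and probability of a correct guess) are those of the standard three-parameter logistic (3PL) model. The sketch below is illustrative only; the function name and the parameter values are assumptions, not estimates from the FCI analysis.

```python
import math

def p_correct_3pl(theta, a, b, c):
    """Probability of a correct response under the 3PL model.

    theta: examinee proficiency
    a: item discrimination, b: item difficulty, c: guessing (lower asymptote)
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

# Hypothetical item: moderately discriminating, average difficulty,
# five answer options (so a guessing floor of 0.2 is assumed).
for theta in (-2.0, 0.0, 2.0):
    print(theta, round(p_correct_3pl(theta, a=1.2, b=0.0, c=0.2), 3))
```

    At theta equal to the difficulty b, the probability is halfway between the guessing floor c and 1, which is how b is usually interpreted.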

  19. Psychometric Consequences of Subpopulation Item Parameter Drift

    Science.gov (United States)

    Huggins-Manley, Anne Corinne

    2017-01-01

    This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…

  20. Dynamics Assessment of Grid-Synchronization Algorithms for Single-Phase Grid-Connected Converters

    DEFF Research Database (Denmark)

    Han, Yang; Luo, Mingyu; Guerrero, Josep M.

    2015-01-01

    Several advanced phase-lock-loop (PLL) algorithms have been proposed for single-phase power electronic systems. Among these algorithms, the orthogonal signal generators (OSGs) are widely utilized to generate a set of in-quadrature signals, owing to their benefit of simple digital implementation and...

  1. Nondestructive assessment of single-span timber bridges using a vibration-based method

    Science.gov (United States)

    Xiping Wang; James P. Wacker; Angus M. Morison; John W. Forsman; John R. Erickson; Robert J. Ross

    2005-01-01

    This paper describes an effort to develop a global dynamic testing technique for evaluating the overall stiffness of timber bridge superstructures. A forced vibration method was used to measure the natural frequency of single-span timber bridges in the laboratory and field. An analytical model based on simple beam theory was proposed to represent the relationship...

  2. Assessment of vadose zone radionuclide contamination around Single Shell Tank 241-C-103

    International Nuclear Information System (INIS)

    Kos, S.E.

    1995-12-01

    Five drywells surrounding single shell tank 241-C-103 were logged with the high-purity germanium logging system to investigate possible leakage of radioactive contamination from the tank. The investigation included integration of the drywell survey results with several other data sources. There is no conclusive evidence showing indications that the 241-C-103 tank has leaked

  3. Comparison of PASS Assessment Scores in Single-Gender and Heterogeneous Middle Schools in South Carolina

    Science.gov (United States)

    Canada, Patricia Oxendine

    2012-01-01

    In response to the mandates of No Child Left Behind (NCLB), educators across the country struggle to close the gaps between males and females. Some of the physiological differences existing between the male and female brain suggest support for single-gender instruction, which is on the rise within this country as well as other parts of the world.…

  4. Diverse Food Items Are Similarly Categorized by 8- to 13-Year-Old Children

    Science.gov (United States)

    Beltran, Alicia; Knight Sepulveda, Karina; Watson, Kathy; Baranowski, Tom; Baranowski, Janice; Islam, Noemi; Missaghian, Mariam

    2008-01-01

    Objective: Assess how 8- to 13-year-old children categorized and labeled food items for possible use as part of a food search strategy in a computerized 24-hour dietary recall. Design: A set of 62 cards with pictures and names of food items from 18 professionally defined food groups was sorted by each child into piles of similar food items.…

  5. Impact of covariate models on the assessment of the air pollution-mortality association in a single- and multipollutant context.

    Science.gov (United States)

    Sacks, Jason D; Ito, Kazuhiko; Wilson, William E; Neas, Lucas M

    2012-10-01

    With the advent of multicity studies, uniform statistical approaches have been developed to examine air pollution-mortality associations across cities. To assess the sensitivity of the air pollution-mortality association to different model specifications in a single and multipollutant context, the authors applied various regression models developed in previous multicity time-series studies of air pollution and mortality to data from Philadelphia, Pennsylvania (May 1992-September 1995). Single-pollutant analyses used daily cardiovascular mortality, fine particulate matter (particles with an aerodynamic diameter ≤2.5 µm; PM(2.5)), speciated PM(2.5), and gaseous pollutant data, while multipollutant analyses used source factors identified through principal component analysis. In single-pollutant analyses, risk estimates were relatively consistent across models for most PM(2.5) components and gaseous pollutants. However, risk estimates were inconsistent for ozone in all-year and warm-season analyses. Principal component analysis yielded factors with species associated with traffic, crustal material, residual oil, and coal. Risk estimates for these factors exhibited less sensitivity to alternative regression models compared with single-pollutant models. Factors associated with traffic and crustal material showed consistently positive associations in the warm season, while the coal combustion factor showed consistently positive associations in the cold season. Overall, mortality risk estimates examined using a source-oriented approach yielded more stable and precise risk estimates, compared with single-pollutant analyses.

  6. Non-ignorable missingness item response theory models for choice effects in examinee-selected items.

    Science.gov (United States)

    Liu, Chen-Wei; Wang, Wen-Chung

    2017-11-01

    Examinee-selected item (ESI) design, in which examinees are required to respond to a fixed number of items in a given set, always yields incomplete data (i.e., when only the selected items are answered, data are missing for the others) that are likely non-ignorable in likelihood inference. Standard item response theory (IRT) models become infeasible when ESI data are missing not at random (MNAR). To solve this problem, the authors propose a two-dimensional IRT model that posits one unidimensional IRT model for observed data and another for nominal selection patterns. The two latent variables are assumed to follow a bivariate normal distribution. In this study, the mirt freeware package was adopted to estimate parameters. The authors conduct an experiment to demonstrate that ESI data are often non-ignorable and to determine how to apply the new model to the data collected. Two follow-up simulation studies are conducted to assess the parameter recovery of the new model and the consequences for parameter estimation of ignoring MNAR data. The results of the two simulation studies indicate good parameter recovery of the new model and poor parameter recovery when non-ignorable missing data were mistakenly treated as ignorable. © 2017 The British Psychological Society.

  7. Inventory control in multi-item production systems

    NARCIS (Netherlands)

    Bruin, J.

    2010-01-01

    This thesis focusses on the analysis and construction of control policies in multi-item production systems. In such systems, multiple items can be made to stock, but they have to share the finite capacity of a single machine. This machine can only produce one unit at a time and if it is set-up for

  8. Optimisation and validation of methods to assess single nucleotide polymorphisms (SNPs) in archival histological material

    DEFF Research Database (Denmark)

    Andreassen, C N; Sørensen, Flemming Brandt; Overgaard

    2004-01-01

    only archival specimens are available. This study was conducted to validate protocols optimised for assessment of SNPs based on paraffin embedded, formalin fixed tissue samples. PATIENTS AND METHODS: In 137 breast cancer patients, three TGFB1 SNPs were assessed based on archival histological specimens...... precipitation). RESULTS: Assessment of SNPs based on archival histological material is encumbered by a number of obstacles and pitfalls. However, these can be widely overcome by careful optimisation of the methods used for sample selection, DNA extraction and PCR. Within 130 samples that fulfil the criteria...

  9. An assessment of memristor intrinsic fluctuations: a measurement of single atomic motion

    Science.gov (United States)

    Borghetti, Julien; Yang, J. Joshua; Medeiros-Ribeiro, Gilberto; Williams, R. Stanley

    2010-03-01

    Memristors provide electrically tunable resistance for upcoming non-volatile memory and future neuromorphic computing. One of the key benefits of such a device is its scalability, which can be demonstrated from an architectural perspective as well as from a fundamental physics limit. 4D addressing schemes utilizing crossbar structures that can be stacked several layers high above the chip embody unlimited addressing space. On the other limit, the basic operating principles of memristive devices allow one to reach storage of information in a single atom. In this report of nanoscale (sub-50 nm) devices, we detect single atom fluctuations, which would then represent the ultimate limit for noise sources, thus delineating the boundary conditions for circuit design. We show that electrically induced individual atom migrations do not affect the overall device atomic configuration until a critical bias where a single local fluctuation triggers a general atomic reconfiguration. This instability illustrates the robustness of the device non-volatility upon small electrical stress.

  10. Teoria da Resposta ao Item Teoria de la respuesta al item Item response theory

    Directory of Open Access Journals (Sweden)

    Eutalia Aparecida Candido de Araujo

    2009-12-01

    Full Text Available The concern with measuring psychological traits is long-standing, and many studies and proposed methods have been developed to achieve this goal. Prominent among these is Item Response Theory (IRT), which initially arose to address limitations of Classical Test Theory, still widely used today in the measurement of psychological traits. The main point of IRT is that it takes each item into account individually rather than relying on total scores; the conclusions therefore depend not only on the test or questionnaire but on each item that composes it. This article aims to present this theory, which revolutionized measurement theory.

  11. Developing an item bank to measure the coping strategies of people with hereditary retinal diseases.

    Science.gov (United States)

    Prem Senthil, Mallika; Khadka, Jyoti; De Roach, John; Lamey, Tina; McLaren, Terri; Campbell, Isabella; Fenwick, Eva K; Lamoureux, Ecosse L; Pesudovs, Konrad

    2018-05-05

    Our understanding of the coping strategies used by people with visual impairment to manage stress related to visual loss is limited. This study aims to develop a sophisticated coping instrument in the form of an item bank implemented via Computerised adaptive testing (CAT) for hereditary retinal diseases. Items on coping were extracted from qualitative interviews with patients which were supplemented by items from a literature review. A systematic multi-stage process of item refinement was carried out followed by expert panel discussion and cognitive interviews. The final coping item bank had 30 items. Rasch analysis was used to assess the psychometric properties. A CAT simulation was carried out to estimate an average number of items required to gain precise measurement of hereditary retinal disease-related coping. One hundred eighty-nine participants answered the coping item bank (median age = 58 years). The coping scale demonstrated good precision and targeting. The standardised residual loadings for items revealed six items grouped together. Removal of the six items reduced the precision of the main coping scale and worsened the variance explained by the measure. Therefore, the six items were retained within the main scale. Our CAT simulation indicated that, on average, less than 10 items are required to gain a precise measurement of coping. This is the first study to develop a psychometrically robust coping instrument for hereditary retinal diseases. CAT simulation indicated that on an average, only four and nine items were required to gain measurement at moderate and high precision, respectively.
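
    Computerised adaptive testing typically picks, at each step, the unadministered item that is most informative at the current trait estimate. The sketch below shows that selection rule under a dichotomous Rasch model; this is a simplifying assumption (the coping items may well be polytomous), and the function names and difficulty values are hypothetical.

```python
import math

def rasch_p(theta, b):
    """Probability of endorsing an item of difficulty b at trait level theta."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def item_information(theta, b):
    """Fisher information of a dichotomous Rasch item: p * (1 - p)."""
    p = rasch_p(theta, b)
    return p * (1.0 - p)

def next_item(theta, difficulties, administered):
    """Index of the not-yet-administered item with maximum information at theta."""
    candidates = [i for i in range(len(difficulties)) if i not in administered]
    return max(candidates, key=lambda i: item_information(theta, difficulties[i]))

# Hypothetical three-item bank and a current estimate of theta = 0.
bank = [-1.0, 0.2, 1.5]
first = next_item(0.0, bank, administered=set())
second = next_item(0.0, bank, administered={first})
print(first, second)
```

    Under the Rasch model, information peaks where difficulty equals theta, so the rule amounts to choosing the remaining item whose difficulty is closest to the current estimate.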

  12. A single-item replacement decision model for repairable spare ...

    African Journals Online (AJOL)

    In this paper, we present an analytical method for determining spare parts replacement over an infinite planning horizon. The objective is to minimize the total system cost. We develop an exact and simple method for determining the time for equipment replacement, or for deciding when to replace equipment, ...

  13. Development of the PROMIS positive emotional and sensory expectancies of smoking item banks.

    Science.gov (United States)

    Tucker, Joan S; Shadel, William G; Edelen, Maria Orlando; Stucky, Brian D; Li, Zhen; Hansen, Mark; Cai, Li

    2014-09-01

    The positive emotional and sensory expectancies of cigarette smoking include improved cognitive abilities, positive affective states, and pleasurable sensorimotor sensations. This paper describes development of Positive Emotional and Sensory Expectancies of Smoking item banks that will serve to standardize the assessment of this construct among daily and nondaily cigarette smokers. Data came from daily (N = 4,201) and nondaily (N =1,183) smokers who completed an online survey. To identify a unidimensional set of items, we conducted item factor analyses, item response theory analyses, and differential item functioning analyses. Additionally, we evaluated the performance of fixed-item short forms (SFs) and computer adaptive tests (CATs) to efficiently assess the construct. Eighteen items were included in the item banks (15 common across daily and nondaily smokers, 1 unique to daily, 2 unique to nondaily). The item banks are strongly unidimensional, highly reliable (reliability = 0.95 for both), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.86). Results from simulated CATs indicated that, on average, less than 8 items are needed to assess the construct with adequate precision using the item banks. These analyses identified a new set of items that can assess the positive emotional and sensory expectancies of smoking in a reliable and standardized manner. Considerable efficiency in assessing this construct can be achieved by using the item bank SF, employing computer adaptive tests, or selecting subsets of items tailored to specific research or clinical purposes. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  14. The relationship between early changes in the HAMD-17 anxiety/somatization factor items and treatment outcome among depressed outpatients.

    Science.gov (United States)

    Farabaugh, Amy; Mischoulon, David; Fava, Maurizio; Wu, Shirley L; Mascarini, Alessandra; Tossani, Eliana; Alpert, Jonathan E

    2005-03-01

    The 17-item Hamilton Rating Scale for Depression (HAMD-17) Anxiety/Somatization factor includes six items: Anxiety (psychic), Anxiety (somatic), Somatic Symptoms (gastrointestinal), Somatic Symptoms (general), Hypochondriasis and Insight. This study examines the relationship between early changes (defined as those observed between baseline and week 1) in these HAMD-17 Anxiety/Somatization Factor items and treatment outcome among major depressive disorder (MDD) patients who participated in a study comparing the antidepressant efficacy of a standardized extract of hypericum with both placebo and fluoxetine. Following a 1-week, single-blind washout, patients with MDD diagnosed by the Structured Clinical Interview for DSM-IV (SCID) were randomized to 12 weeks of double-blind treatment with hypericum extract (900 mg/day), fluoxetine (20 mg/day) or placebo. The relationship between early changes in HAMD-17 anxiety/somatization factor items and treatment outcome was assessed separately for patients who received study treatment (hypericum or fluoxetine) versus placebo with a logistic regression method. One hundred and thirty-five patients (female 57%, mean age=37.3+/-11.0 years; mean baseline HAMD-17=19.7+/-3.2 years) were randomized to double-blind treatment and were included in the intent-to-treat (ITT) analyses. After adjusting for baseline HAMD-17 scores and for multiple comparisons with the Bonferroni correction, patients who remitted (HAMD-17 score Somatic Symptoms (General) scores than non-remitters. No other significant differences in early changes were noted for the remaining items between remitters versus non-remitters who received active treatment. For patients treated with placebo, early change was not predictive of remission for any of the items after Bonferroni correction. In conclusion, the presence of early improvement on the HAMD-17 item concerning fatigue and general somatic symptoms is significantly predictive of achieving remission at endpoint with
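
    The Bonferroni correction mentioned above divides the significance level by the number of comparisons, here the six Anxiety/Somatization factor items. The sketch below illustrates the rule only; the p-values are hypothetical and are not results from the study.

```python
def bonferroni_significant(p_values, alpha=0.05):
    """Flag each p-value that is below alpha divided by the number of tests."""
    threshold = alpha / len(p_values)
    return [p < threshold for p in p_values]

# Hypothetical p-values, one per factor item (six comparisons).
p_values = [0.004, 0.03, 0.2, 0.5, 0.01, 0.009]
print(bonferroni_significant(p_values))
```

    With six comparisons and alpha = 0.05, each individual test must reach p < 0.05/6 (about 0.0083), so results that would pass an uncorrected 0.05 threshold can fail after correction.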

  15. Editorial Commentary: Single-Image Slice Magnetic Resonance Imaging Assessments Do Not Predict 3-Dimensional Muscle Volume.

    Science.gov (United States)

    Brand, Jefferson C

    2016-01-01

    No single-image magnetic resonance imaging (MRI) assessment (Goutallier classification, Fuchs classification, or cross-sectional area) is predictive of whole-muscle volume or fatty atrophy of the supraspinatus or infraspinatus. Rather, 3-dimensional MRI measurement of whole-muscle volume and fat-free muscle volume is required and is associated with shoulder strength, which is clinically relevant. Three-dimensional MRI may represent a new gold standard for assessment of the rotator cuff musculature using imaging and may help to predict the feasibility of repair of a rotator cuff tear as well as the postoperative outcome. Unfortunately, 3-dimensional MRI assessment of muscle volume is labor intensive and is not widely available for clinical use. Copyright © 2016 Arthroscopy Association of North America. Published by Elsevier Inc. All rights reserved.

  16. Numerosity estimates for attended and unattended items in visual search.

    Science.gov (United States)

    Kelley, Troy D; Cassenti, Daniel N; Marusich, Laura R; Ghirardelli, Thomas G

    2017-07-01

    The goal of this research was to examine memories created for the number of items during a visual search task. Participants performed a visual search task for a target defined by a single feature (Experiment 1A), by a conjunction of features (Experiment 1B), or by a specific spatial configuration of features (Experiment 1C). On some trials following the search task, subjects were asked to recall the total number of items in the previous display. In all search types, participants underestimated the total number of items, but the severity of the underestimation varied depending on the efficiency of the search. In three follow-up studies (Experiments 2A, 2B, and 2C) using the same visual stimuli, the participants' only task was to estimate the number of items on each screen. Participants still underestimated the numerosity of the items, although the degree of underestimation was smaller than in the search tasks and did not depend on the type of visual stimuli. In Experiment 3, participants were asked to recall the number of items in a display only once. Subjects still displayed a tendency to underestimate, indicating that the underestimation effects seen in Experiments 1A-1C were not attributable to knowledge of the estimation task. The degree of underestimation depends on the efficiency of the search task, with more severe underestimation in efficient search tasks. This suggests that the lower attentional demands of very efficient searches lead to less encoding of the numerosity of the distractor set.

  17. Geriatric Anxiety Scale: item response theory analysis, differential item functioning, and creation of a ten-item short form (GAS-10).

    Science.gov (United States)

    Mueller, Anne E; Segal, Daniel L; Gavett, Brandon; Marty, Meghan A; Yochim, Brian; June, Andrea; Coolidge, Frederick L

    2015-07-01

    The Geriatric Anxiety Scale (GAS; Segal, D. L., June, A., Payne, M., Coolidge, F. L. and Yochim, B. (2010). Journal of Anxiety Disorders, 24, 709-714. doi:10.1016/j.janxdis.2010.05.002) is a self-report measure of anxiety that was designed to address unique issues associated with anxiety assessment in older adults. This study is the first to use item response theory (IRT) to examine the psychometric properties of a measure of anxiety in older adults. A large sample of older adults (n = 581; mean age = 72.32 years, SD = 7.64 years, range = 60 to 96 years; 64% women; 88% European American) completed the GAS. IRT properties were examined. The presence of differential item functioning (DIF) or measurement bias by age and sex was assessed, and a ten-item short form of the GAS (called the GAS-10) was created. All GAS items had discrimination parameters of 1.07 or greater. Items from the somatic subscale tended to have lower discrimination parameters than items on the cognitive or affective subscales. Two items were flagged for DIF, but the impact of the DIF was negligible. Women scored significantly higher than men on the GAS and its subscales. Participants in the young-old group (60 to 79 years old) scored significantly higher on the cognitive subscale than participants in the old-old group (80 years old and older). Results from the IRT analyses indicated that the GAS and GAS-10 have strong psychometric properties among older adults. We conclude by discussing implications and future research directions.

  18. Assessment of Single European Sky Implementation in the Functional Airspace Block Central Europe

    Directory of Open Access Journals (Sweden)

    Tomislav Mihetec

    2017-12-01

    implementation is performed through sub-regional grouping of Air Navigation Service Providers in a form of Functional Airspace Blocks. This paper analyses the level of implementation of ATM-related projects in the Functional Airspace Block Central Europe and their relation to other Functional Airspace Blocks defined in Europe. From this paper it is obvious that even though the planning of Single European Sky projects is based on the collaborative implementation of Functional Airspace Block level, the real implementation is fragmented and based on national levels.

  19. Single-photon emission CT in the assessment of low back pain in young athletes

    International Nuclear Information System (INIS)

    Johnson, G.T.; Lagatutta, F.P.; Lazarus, M.L.; Faulkner, T.J.; Nolan, J.P.

    1991-01-01

    Fifty-two teenage and young adult athletes (ages 12-24 years) with low back pain (LBP) underwent routine lumbar radiography and bone scintigraphy including planar and single-photon CT and SPECT imaging. This paper illustrates the significant limitations of routine radiography and the importance of SPECT bone scintigraphy in evaluating young athletes with LBP and suspected spondylolysis; the increased sensitivity and specificity of SPECT compared to planar scintigraphy in the diagnosis of spondylolysis; and the potential utility of follow-up SPECT studies in evaluating success of therapy in athletes with initially positive diagnostic indicators for spondylolysis or impending spondylolysis

  20. Single cell adhesion strength assessed with variable-angle total internal reflection fluorescence microscopy

    Directory of Open Access Journals (Sweden)

    Marcelina Cardoso Dos Santos

    2017-06-01

    Full Text Available We propose a new strategy to evaluate adhesion strength at the single cell level. This approach involves variable-angle total internal reflection fluorescence microscopy to monitor in real time the topography of cell membranes, i.e. a map of the membrane/substrate separation distance. According to the Boltzmann distribution, both potential energy profile and dissociation energy related to the interactions between the cell membrane and the substrate were determined from the membrane topography. We have highlighted on glass substrates coated with poly-L-lysine and fibronectin, that the dissociation energy is a reliable parameter to quantify the adhesion strength of MDA-MB-231 motile cells.
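
    The Boltzmann relation invoked above links how often a membrane/substrate separation distance is observed to the potential energy at that distance: p(h) is proportional to exp(-U(h)/kT), so U(h) = -kT ln p(h) up to a constant. The sketch below applies that inversion to a histogram of distances. The function name, the use of a histogram as input, and the counts are assumptions for illustration, not the authors' processing pipeline.

```python
import math

def potential_profile_kT(counts):
    """Potential energy per distance bin, in units of kT, relative to the minimum.

    counts: number of observations of the membrane/substrate distance per bin.
    Empty bins get infinite energy because no state was observed there.
    """
    peak = max(counts)  # the most populated bin defines the energy minimum (U = 0)
    return [-math.log(n / peak) if n > 0 else math.inf for n in counts]

# Hypothetical histogram of separation distances (arbitrary bins).
counts = [10, 100, 10]
print([round(u, 3) for u in potential_profile_kT(counts)])
```

    Referencing the energies to the most populated bin is a convention chosen here; only energy differences between bins carry meaning in this inversion.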

  1. Comparing the Use of 3D Photogrammetry and Computed Tomography in Assessing the Severity of Single-Suture Nonsyndromic Craniosynostosis.

    Science.gov (United States)

    Ho, Olivia A; Saber, Nikoo; Stephens, Derek; Clausen, April; Drake, James; Forrest, Christopher; Phillips, John

    2017-05-01

    Single-suture nonsyndromic craniosynostosis is diagnosed using clinical assessment and computed tomography (CT). With increasing awareness of the associated risks of radiation exposure, the use of CT is particularly concerning in patients with craniosynostosis since they are exposed at a younger age and more frequently than the average child. Three-dimensional (3D) photogrammetry is advantageous: it involves no radiation, is conveniently obtainable within clinic, and does not require general anaesthesia. This study aims to assess how 3D photogrammetry compares to CT in the assessment of craniosynostosis severity, to quantify surgical outcomes, and to analyze the validity of 3D photogrammetry in craniosynostosis. Computed tomography images and 3D photographs of patients who underwent craniosynostosis surgery were assessed and aligned to best fit. The intervening area between the CT and 3D photogrammetry curves at the supraorbital bar (bandeau) level in axial view was calculated. Statistical analysis was performed using Student t test. Ninety-five percent confidence intervals were determined and equivalence margins were applied. In total, 41 pairs of CTs and 3D photographs were analyzed. The 95% confidence interval was 198.16 to 264.18 mm² and the mean was 231.17 mm². When comparisons were made in the same bandeau region omitting the temporalis muscle, the 95% confidence interval was 108.94 to 147.38 mm², and the mean was 128.16 mm². Although a statistically significant difference between the modalities was found, it can be attributed to the dampening effect of soft tissue. Within certain error margins, 3D photogrammetry is comparable to CT in assessing the severity of single-suture nonsyndromic craniosynostosis. However, soft tissue can have a dampening effect. Three-dimensional photogrammetry may be more applicable for severe cases of craniosynostosis but not milder deformity. It may also be beneficial for assessing the overall appearance and

  2. Item Response Theory at Subject- and Group-Level. Research Report 90-1.

    Science.gov (United States)

    Tobi, Hilde

    This paper reviews the literature about item response models for the subject level and aggregated level (group level). Group-level item response models (IRMs) are used in the United States in large-scale assessment programs such as the National Assessment of Educational Progress and the California Assessment Program. In the Netherlands, these…

  3. Sharing the cost of redundant items

    DEFF Research Database (Denmark)

    Hougaard, Jens Leth; Moulin, Hervé

    2014-01-01

    We ask how to share the cost of finitely many public goods (items) among users with different needs: some smaller subsets of items are enough to serve the needs of each user, yet the cost of all items must be covered, even if this entails inefficiently paying for redundant items. Typical examples are network connectivity problems when an existing (possibly inefficient) network must be maintained. We axiomatize a family of cost ratios based on simple liability indices, one for each agent and for each item, measuring the relative worth of this item across agents, and generating cost allocation rules additive in costs.

  4. Epilogue: Reading Comprehension Is Not a Single Ability--Implications for Assessment and Instruction

    Science.gov (United States)

    Kamhi, Alan G.; Catts, Hugh W.

    2017-01-01

    Purpose: In this epilogue, we review the 4 response articles and highlight the implications of a multidimensional view of reading for the assessment and instruction of reading comprehension. Method: We reiterate the problems with standardized tests of reading comprehension and discuss the advantages and disadvantages of recently developed…

  5. Assessing single and joint effects of chemicals on the survival and reproduction of Folsomia candida (Collembola) in soil

    International Nuclear Information System (INIS)

    Amorim, M.J.B.; Pereira, C.; Menezes-Oliveira, V.B.; Campos, B.; Soares, A.M.V.M.; Loureiro, S.

    2012-01-01

    Chemicals are often found in the environment as complex mixtures. There has been a large effort in the last decade to assess the combined effect of chemicals, using the conceptual models of Concentration Addition and Independent Action, but also including synergistic, antagonistic, dose-level and dose–ratio dependent deviations from these models. In the present study, single and mixture toxicity of atrazine, dimethoate, lindane, zinc and cadmium were studied in Folsomia candida, assessing survival and reproduction. Different response patterns were observed for the different endpoints and synergistic patterns were observed when pesticides were present. Compared with the previously tested Enchytraeus albidus and Porcellionides pruinosus, the mixture toxicity pattern for F. candida was species specific. The present study highlights the importance of studying toxicity of chemicals mixtures due to the observed potentiation of effects and confirms that for an adequate ecologically relevant risk assessment different organisms and endpoints should be included. - Highlights: ► Folsomia candida (Collembola) were exposed to binary mixtures of atrazine, dimethoate, lindane, zinc and cadmium. ► Synergistic response patterns were often observed when pesticides were present in the mixtures. ► Response patterns upon mixture exposure differed within endpoints (survival vs. reproduction) in some cases. ► As to single chemical toxicity, response patterns for mixture exposures seem to be also species specific. - Exposure to chemical mixtures in Folsomia candida showed potentiation of effects. Mixture toxicity patterns differ among species and endpoint measured.

  6. The reliability and criterion validity of 2D video assessment of single leg squat and hop landing.

    Science.gov (United States)

    Herrington, Lee; Alenezi, Faisal; Alzhrani, Msaad; Alrayani, Hasan; Jones, Richard

    2017-06-01

    The objective was to assess the intra-tester, within and between day reliability of measurement of hip adduction (HADD) and frontal plane projection angles (FPPA) during single leg squat (SLS) and single leg landing (SLL) using 2D video and the validity of these measurements against those found during 3D motion capture. 15 healthy subjects had their SLS and SLL assessed using 3D motion capture and video analysis. Inter-tester reliability for both SLS and SLL when measuring FPPA and HADD showed excellent correlations (ICC 2,1 0.97-0.99). Within and between day assessment of SLS and SLL showed good to excellent correlations for both variables (ICC 3,1 0.72-0.91). 2D FPPA measures were found to have good correlation with knee abduction angle in 3D (r=0.79, p=0.008) during SLS, and also with knee abduction moment (r=0.65, p=0.009). 2D HADD showed very good correlation with 3D HADD during SLS (r=0.81, p=0.001), and a good correlation during SLL (r=0.62, p=0.013). All other associations were weak (r<0.4). This study suggests that 2D video kinematics have a reasonable association to what is being measured with 3D motion capture. Copyright © 2017 Elsevier Ltd. All rights reserved.

  7. The REFANI-S study protocol: a non-randomised cluster controlled trial to assess the role of an unconditional cash transfer, a non-food item kit, and free piped water in reducing the risk of acute malnutrition among children aged 6-59 months living in camps for internally displaced persons in the Afgooye corridor, Somalia.

    Science.gov (United States)

    Jelle, Mohamed; Grijalva-Eternod, Carlos S; Haghparast-Bidgoli, Hassan; King, Sarah; Cox, Cassy L; Skordis-Worrall, Jolene; Morrison, Joanna; Colbourn, Timothy; Fottrell, Edward; Seal, Andrew J

    2017-07-06

    The prevalence of acute malnutrition is often high in emergency-affected populations and is associated with elevated mortality risk and long-term health consequences. Increasingly, cash transfer programmes (CTP) are used instead of direct food aid as a nutritional intervention, but there is sparse evidence on their nutritional impact. We aim to understand whether CTP reduces acute malnutrition and its known risk factors. A non-randomised, cluster-controlled trial will assess the impact of an unconditional cash transfer of US$84 per month for 5 months, a single non-food items kit, and free piped water on the risk of acute malnutrition in children, aged 6-59 months. The study will take place in camps for internally displaced persons (IDP) in peri-urban Mogadishu, Somalia. A cluster will consist of one IDP camp and 10 camps will be allocated to receive the intervention based on vulnerability targeting criteria. The control camps will then be selected from the same geographical area. Needs assessment data indicates small differences in vulnerability between camps. In each trial arm, 120 households will be randomly sampled and two detailed household surveys will be implemented at baseline and 3 months after the initiation of the cash transfer. The survey questionnaire will cover risk factors for malnutrition including household expenditure, assets, food security, diet diversity, coping strategies, morbidity, WASH, and access to health care. A community surveillance system will collect monthly mid-upper arm circumference measurements from all children aged 6-59 months in the study clusters to assess the incidence of acute malnutrition over the duration of the intervention. Process evaluation data will be compiled from routine quantitative programme data and primary qualitative data collected using key informant interviews and focus group discussions. The UK Department for International Development will provide funding for this study. The European Civil Protection and

  8. The REFANI-S study protocol: a non-randomised cluster controlled trial to assess the role of an unconditional cash transfer, a non-food item kit, and free piped water in reducing the risk of acute malnutrition among children aged 6–59 months living in camps for internally displaced persons in the Afgooye corridor, Somalia

    Directory of Open Access Journals (Sweden)

    Mohamed Jelle

    2017-07-01

    Background: The prevalence of acute malnutrition is often high in emergency-affected populations and is associated with elevated mortality risk and long-term health consequences. Increasingly, cash transfer programmes (CTP) are used instead of direct food aid as a nutritional intervention, but there is sparse evidence on their nutritional impact. We aim to understand whether CTP reduces acute malnutrition and its known risk factors. Methods/design: A non-randomised, cluster-controlled trial will assess the impact of an unconditional cash transfer of US$84 per month for 5 months, a single non-food items kit, and free piped water on the risk of acute malnutrition in children, aged 6–59 months. The study will take place in camps for internally displaced persons (IDP) in peri-urban Mogadishu, Somalia. A cluster will consist of one IDP camp and 10 camps will be allocated to receive the intervention based on vulnerability targeting criteria. The control camps will then be selected from the same geographical area. Needs assessment data indicates small differences in vulnerability between camps. In each trial arm, 120 households will be randomly sampled and two detailed household surveys will be implemented at baseline and 3 months after the initiation of the cash transfer. The survey questionnaire will cover risk factors for malnutrition including household expenditure, assets, food security, diet diversity, coping strategies, morbidity, WASH, and access to health care. A community surveillance system will collect monthly mid-upper arm circumference measurements from all children aged 6–59 months in the study clusters to assess the incidence of acute malnutrition over the duration of the intervention. Process evaluation data will be compiled from routine quantitative programme data and primary qualitative data collected using key informant interviews and focus group discussions. The UK Department for International Development will provide

  9. Reliability assessment of single-phase grid-connected PV microinverters considering mission profile and uncertainties

    DEFF Research Database (Denmark)

    Zare, Mohammad Hadi; Mohamadian, Mustafa; Wang, Huai

    2017-01-01

    Microinverters usually connect a PV panel to a single-phase power grid. In such a system, the input power is constant while the output power oscillates at twice the line frequency. Thus, the difference between input and output power must be buffered in a storage component, which is typically an electrolytic capacitor. However, electrolytic capacitors are usually blamed for their short lifetime. Recently, active power decoupling methods have been introduced in the literature that can take advantage of highly reliable film capacitors. However, some extra switches and diodes are added to the microinverter, which can influence the microinverter lifetime. This paper investigates microinverter reliability according to the mission profile of the site where it is installed. To get more accurate results, uncertainties in both the lifetime model and the manufacturing process are considered. The effect of ambient temperature and solar irradiation at two different places on the microinverter lifetime is studied.

  10. Siting analysis and risk assessment for small single-purpose heating reactors

    International Nuclear Information System (INIS)

    Tarjanne, R.

    1979-04-01

    Two alternative sites, both 10 km from the centre of Helsinki, are considered for reactor unit sizes of 400 MW and 800 MW. The risks associated with a small single-purpose heating reactor are evaluated for normal operation and accident conditions. The evaluation for accident conditions is performed for three characteristic accidents. Three pathways are considered in the calculation of the radiation exposure: direct external gamma dose from the release plume, direct gamma radiation from deposited activity on the ground, and dose due to inhalation. The risks are compared with the risks from alternative conventional fossil-fuelled district heat production methods. The results show that the heating reactor alternative causes an insignificant risk, which is far less than the risk caused by the fossil-fuelled alternatives.

  11. Assessment of advanced technologies for high performance single-engine business airplanes

    Science.gov (United States)

    Kohlman, D. L.; Holmes, B. J.

    1982-01-01

    The prospects for significantly increasing the fuel efficiency and mission capability of single engine business aircraft through the incorporation of advanced propulsion, aerodynamics and materials technologies are explored. It is found that turbine engines cannot match the fuel economy of the heavier rotary, diesel and advanced spark reciprocating engines. The rotary engine yields the lightest and smallest aircraft for a given mission requirement, and also offers greater simplicity and a multifuel capability. Great promise is also seen in the use of composite material primary structures in conjunction with laminar flow wing surfaces, a pusher propeller and conventional wing-tail configuration. This study was conducted with the General Aviation Synthesis Program, which can furnish the most accurate mission performance calculations yet obtained.

  12. On assessing surrogacy in a single trial setting using a semi-competing risks paradigm

    Science.gov (United States)

    Ghosh, Debashis

    2009-01-01

    Summary There has been a recent emphasis on the identification of biomarkers and other biologic measures that may be potentially used as surrogate endpoints in clinical trials. We focus on the setting of data from a single clinical trial. In this paper, we consider a framework in which the surrogate must occur before the true endpoint. This suggests viewing the surrogate and true endpoints as semi-competing risks data; this approach is new to the literature on surrogate endpoints and leads to an asymmetrical treatment of the surrogate and true endpoints. However, such a data structure also conceptually complicates many of the previously considered measures of surrogacy in the literature. We propose novel estimation and inferential procedures for the relative effect and adjusted association quantities proposed by Buyse and Molenberghs (1998, Biometrics, 1014 – 1029). The proposed methodology is illustrated with application to simulated data, as well as to data from a leukemia study. PMID:18759839

  13. The contribution of single photon emission computed tomography in the clinical assessment of Alzheimer type dementia

    International Nuclear Information System (INIS)

    Boudousq, V.; Collombier, L.; Kotzki, P.O.

    1999-01-01

    The value of brain single-photon emission computed tomography in supporting the clinical diagnosis of Alzheimer-type dementia is now established. Numerous studies have reported decreased perfusion in the association cortex of the parietal lobe and the posterior temporal regions. In patients with mild cognitive complaints, the presence of focal hypoperfusion may substantially increase the probability of the disease. In addition, emission tomography emerges as a helpful tool in situations in which there is diagnostic doubt. In this case, the presence of a temporo-parietal perfusion deficit associated with hippocampal atrophy on MRI or X-ray computed tomography contributes to diagnostic accuracy. However, some studies suggest that emission tomography may be useful for preclinical prediction of Alzheimer's disease and to predict cognitive decline. (author)

  14. Assessing patterns of hybridization between North Atlantic eels using diagnostic single-nucleotide polymorphisms

    DEFF Research Database (Denmark)

    Pujolar, José Martin; Jacobsen, M.W.; Als, Thomas Damm

    2014-01-01

    The two North Atlantic eel species, the European eel (Anguilla anguilla) and the American eel (Anguilla rostrata), spawn in partial sympatry in the Sargasso Sea, providing ample opportunity to interbreed. In this study, we used a RAD (Restriction site Associated DNA) sequencing approach to identify...... species-specific diagnostic single-nucleotide polymorphisms (SNPs) and design a low-density array that combined with screening of a diagnostic mitochondrial DNA marker. Eels from Iceland (N=159) and from the neighboring Faroe Islands (N=29) were genotyped, along with 94 larvae (49 European and 45 American...... eel male crosses, backcrosses were also detected, including a first-generation backcross (F1 hybrid × pure European eel) and three individuals identified as second-generation backcrosses originating from American eel × F1 hybrid backcrosses interbreeding with pure European eels. In comparison...

  15. Preliminary performance assessment strategy for single-shell tank waste disposal

    International Nuclear Information System (INIS)

    Sonnichsen, J.C. Jr.

    1991-10-01

    The disposal of the waste stored in single-shell tanks at the Hanford Site is recognized as a major environmental concern. A comprehensive program has been initiated to evaluate the various alternatives available for disposal of these wastes. These wastes will be disposed of in a manner consistent with applicable laws and regulations. Long-term waste isolation is one measure of performance that will be used for purposes of selection. The performance of each disposal alternative will be simulated using numerical models. Contained herein is a discussion of the strategy that has evolved, and continues to evolve, to establish a general analytical framework to evaluate this performance. This general framework will be used to construct individual models of each waste disposal alternative selected for purposes of evaluation. 30 refs., 3 figs

  16. Graft function assessment in mouse models of single- and dual- kidney transplantation.

    Science.gov (United States)

    Wang, Lei; Wang, Ximing; Jiang, Shan; Wei, Jin; Buggs, Jacentha; Fu, Liying; Zhang, Jie; Liu, Ruisheng

    2018-05-23

    Animal models of kidney transplantation (KTX) are widely used in studying immune response of hosts to implanted grafts. Additionally, KTX can be used in generating kidney-specific knockout animal models by transplantation of kidneys from donors with global knockout of a gene to wild type recipients or vice versa. Dual kidney transplantation (DKT) provides a more physiological environment for recipients than single kidney transplantation (SKT). However, DKT in mice is rare due to technical challenges. In this study, we successfully performed DKT in mice and compared the hemodynamic response and graft function with SKT. The surgical time, complications and survival rate of DKT were not significantly different from SKT, where survival rates were above 85%. Mice with DKT showed less injury and quicker recovery with lower plasma creatinine (Pcr) and higher GFR than SKT mice (Pcr = 0.34 and 0.17 mg/dl in DKT vs. 0.50 and 0.36 mg/dl in SKT at 1 and 3 days, respectively; GFR = 215 and 131 µl/min for DKT and SKT, respectively). In addition, the DKT exhibited better renal functional reserve and long-term outcome of renal graft function than SKT based on the response to acute volume expansion. In conclusion, we have successfully generated a mouse DKT model. The hemodynamic responses of DKT better mimic physiological situations with less kidney injury and better recovery than SKT because of reduced confounding factors such as single nephron hyperfiltration. We anticipate DKT in mice will provide an additional tool for evaluation of renal significance in physiology and disease.

  17. Assessing the Nonequilibrium Thermodynamics in a Quenched Quantum Many-Body System via Single Projective Measurements

    Directory of Open Access Journals (Sweden)

    L. Fusco

    2014-08-01

    We analyze the nature of the statistics of the work done on or by a quantum many-body system brought out of equilibrium. We show that, for the sudden quench and for an initial state that commutes with the initial Hamiltonian, it is possible to retrieve the whole nonequilibrium thermodynamics via single projective measurements of observables. We highlight, in a physically clear way, the qualitative implications for the statistics of work coming from considering processes described by operators that either commute or do not commute with the unperturbed Hamiltonian of a given system. We consider a quantum many-body system and derive an expression that allows us to give a physical interpretation, for a thermal initial state, to all of the cumulants of the work in the case of quenched operators commuting with the unperturbed Hamiltonian. In the commuting case, the observables that we need to measure have an intuitive physical meaning. Conversely, in the noncommuting case, we show that, although it is possible to operate fully within the single-measurement framework irrespective of the size of the quench, some difficulties are faced in providing a clear-cut physical interpretation to the cumulants. This circumstance makes the study of the physics of the system nontrivial and highlights the nonintuitive phenomenology of the emergence of thermodynamics from the fully quantum microscopic description. We illustrate our ideas with the example of the Ising model in a transverse field showing the interesting behavior of the high-order statistical moments of the work distribution for a generic thermal state and linking them to the critical nature of the model itself.

  18. A Bifactor Multidimensional Item Response Theory Model for Differential Item Functioning Analysis on Testlet-Based Items

    Science.gov (United States)

    Fukuhara, Hirotaka; Kamata, Akihito

    2011-01-01

    A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into…

  19. Emergency Power For Critical Items

    Science.gov (United States)

    Young, William R.

    2009-07-01

    Natural disasters, such as hurricanes, floods, tornados, and tsunami, are becoming a greater problem as climate change impacts our environment. Disasters, whether natural or man-made, destroy lives, homes, businesses and the natural environment. Such disasters can happen with little or no warning, leaving hundreds or even thousands of people without medical services, potable water, sanitation, communications and electrical services for up to several weeks. In our modern world, electricity has become a necessity. Modern building codes and new disaster-resistant building practices are reducing the damage to homes and businesses. Emergency gasoline and diesel generators are becoming commonplace for power outages. Generators need fuel, which may not be available after a disaster, but photovoltaic (solar-electric) systems supply electricity without petroleum fuel as they are powered by the sun. Photovoltaic (PV) systems can provide electrical power for a home or business. PV systems can operate as utility-interactive or stand-alone with battery backup. Determining your critical load items and sizing the photovoltaic system for those critical items guarantees their operation in a disaster.
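
    The sizing step mentioned at the end of this abstract can be sketched with simple arithmetic. The following is only an illustrative calculation, not the author's method: the load list, the peak sun hours, and the derate factor are all hypothetical values chosen for the example.

```python
# Hypothetical critical loads: (name, power in watts, hours of use per day).
# These figures are assumptions for illustration, not data from the abstract.
critical_loads = [
    ("refrigerator", 150, 8),
    ("lights", 60, 5),
    ("radio", 20, 4),
]


def daily_energy_wh(loads):
    """Sum the daily energy demand of all loads, in watt-hours."""
    return sum(watts * hours for _, watts, hours in loads)


def array_size_w(daily_wh, peak_sun_hours, derate):
    """Estimate the PV array rating (watts) needed to supply daily_wh.

    peak_sun_hours: assumed equivalent full-sun hours per day at the site.
    derate: assumed overall system efficiency factor (0 < derate <= 1).
    """
    return daily_wh / (peak_sun_hours * derate)


energy = daily_energy_wh(critical_loads)
size = array_size_w(energy, peak_sun_hours=5.0, derate=0.8)
print(f"Daily demand: {energy} Wh, array rating: {size:.0f} W")
```

    A real design would also size battery storage for the desired days of autonomy, which this sketch leaves out.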

  20. Therapeutic Assessment of Complex Trauma: A Single-Case Time-Series Study

    OpenAIRE

    Tarocchi, Anna; Aschieri, Filippo; Fantini, Francesca; Smith, Justin D.

    2013-01-01

    The cumulative effect of repeated traumatic experiences in early childhood incrementally increases the risk of adjustment problems later in life. Surviving traumatic environments can lead to the development of an interrelated constellation of emotional and interpersonal symptoms termed complex posttraumatic stress disorder (CPTSD). Effective treatment of trauma begins with a multimethod psychological assessment and requires the use of several evidence-based therapeutic processes, including es...

  1. Stability Assessment of a System Comprising a Single Machine and Inverter with Scalable Ratings

    Energy Technology Data Exchange (ETDEWEB)

    Johnson, Brian B [National Renewable Energy Laboratory (NREL), Golden, CO (United States); Lin, Yashen [National Renewable Energy Laboratory (NREL), Golden, CO (United States); Gevorgian, Vahan [National Renewable Energy Laboratory (NREL), Golden, CO (United States); Purba, Victor [University of Minnesota; Dhople, Sairaj [University of Minnesota

    2017-09-28

    From the inception of power systems, synchronous machines have acted as the foundation of large-scale electrical infrastructures and their physical properties have formed the cornerstone of system operations. However, power electronics interfaces are playing a growing role as they are the primary interface for several types of renewable energy sources and storage technologies. As the role of power electronics in systems continues to grow, it is crucial to investigate the properties of bulk power systems in low inertia settings. In this paper, we assess the properties of coupled machine-inverter systems by studying an elementary system comprised of a synchronous generator, three-phase inverter, and a load. Furthermore, the inverter model is formulated such that its power rating can be scaled continuously across power levels while preserving its closed-loop response. Accordingly, the properties of the machine-inverter system can be assessed for varying ratios of machine-to-inverter power ratings and, hence, differing levels of inertia. After linearizing the model and assessing its eigenvalues, we show that system stability is highly dependent on the interaction between the inverter current controller and machine exciter, thus uncovering a key concern with mixed machine-inverter systems and motivating the need for next-generation grid-stabilizing inverter controls.
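
    The eigenvalue test described in this abstract can be illustrated on a toy problem. The sketch below is not the authors' machine-inverter model: it assumes a generic linearized system x' = A x with a hypothetical 2x2 state matrix, and checks that every eigenvalue has a negative real part.

```python
import cmath


def eigenvalues_2x2(a):
    """Eigenvalues of a 2x2 matrix from its trace and determinant."""
    trace = a[0][0] + a[1][1]
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    root = cmath.sqrt(trace * trace - 4 * det)
    return ((trace + root) / 2, (trace - root) / 2)


def is_stable(a):
    """A linearized system x' = A x is asymptotically stable when all
    eigenvalues of A have strictly negative real parts."""
    return all(ev.real < 0 for ev in eigenvalues_2x2(a))


# Hypothetical state matrices, chosen only to show both outcomes.
damped = [[0.0, 1.0], [-2.0, -3.0]]    # eigenvalues -1 and -2
unstable = [[0.0, 1.0], [2.0, -1.0]]   # eigenvalues 1 and -2

print(is_stable(damped), is_stable(unstable))
```

    For larger models the same check is normally done with a numerical eigenvalue routine; the closed form here only keeps the example dependency-free.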

  2. Instemmingsgeneigdheid en verskillende item- en responsformate in 'n gesommeerde selfbeoordelingskaal

    Directory of Open Access Journals (Sweden)

    Nadene Hanekom

    1998-06-01

    This study examines the degree of acquiescence present when the item and response formats of a summated rating scale are varied. It is often recommended that acquiescence response bias in rating scales be controlled by using both positively and negatively worded items. Such items are generally worded in the Likert-type format of statements. The purpose of the study was to establish whether items in question format would result in a smaller degree of acquiescence than items worded as statements. The response format was also varied (five- and seven-point options) to determine whether this would influence the reliability and degree of acquiescence in the scales. A twenty-item Locus of Control (LC) questionnaire was used, but each item was complemented by its opposite, resulting in 40 items. The subjects, divided randomly into two groups, were second-year students who had to complete four versions of the questionnaire, plus a shortened version of Bass's scale for measuring acquiescence. The LC versions were questions or statements, each combined with a five- or seven-point response format. Partial counterbalancing was introduced by testing on two separate occasions, presenting the tests to the two groups in the opposite order. The degree of acquiescence was assessed by correlating the items with their opposites, and by correlating scores on each version with scores on the acquiescence questionnaire. No major differences were found between the various item and response formats in relation to acquiescence. Summary (translated from the Afrikaans "Opsomming"): This study was conducted to determine whether the degree of acquiescence is influenced by the item and response format of a summated self-rating scale. It is often recommended that the use of positively as well as negatively worded items in a questionnaire limits acquiescence. Such items are usually formulated as statements in the traditional Likert format.
The purpose of the study was to determine whether items

  3. Differential item functioning of the UWES-17 in South Africa

    Directory of Open Access Journals (Sweden)

    Leanne Goliath-Yarde

    2011-11-01

    Research purpose: This study assesses the Differential Item Functioning (DIF) of the Utrecht Work Engagement Scale (UWES-17) for different South African cultural groups in a South African company. Motivation for the study: Organisations are using the UWES-17 more and more in South Africa to assess work engagement. Therefore, research evidence from psychologists or assessment practitioners on its DIF across different cultural groups is necessary. Research design, approach and method: The researchers conducted a Secondary Data Analysis (SDA) on the UWES-17 sample (n = 2429) that they obtained from a cross-sectional survey undertaken in a South African Information and Communication Technology (ICT) sector company (n = 24 134). Quantitative item data on the UWES-17 scale enabled the authors to address the research question. Main findings: The researchers found uniform and/or non-uniform DIF on five of the vigour items, four of the dedication items and two of the absorption items. This also showed possible Differential Test Functioning (DTF) on the vigour and dedication dimensions. Practical/managerial implications: Based on the DIF, the researchers suggested that organisations should not use the UWES-17 comparatively for different cultural groups or employment decisions in South Africa. Contribution/value add: The study provides evidence on DIF and possible DTF for the UWES-17. However, it also raises questions about possible interaction effects that need further investigation.

  4. Elastographic assessment of liver fibrosis in children: A prospective single center experience

    International Nuclear Information System (INIS)

    Marginean, Cristina Oana; Marginean, Claudiu

    2012-01-01

    Background: The assessment of liver damage in various disease states relies on a combination of clinical findings, biochemical parameters and invasive tests such as liver biopsy. Ultrasound elastography has emerged as a potential alternative test, providing quantifiable information on the elasticity/stiffness of the examined tissues. We assessed the performance of ultrasound elastography using real-time Acoustic Radiation Force Imaging (ARFI) technology in evaluating the degree of liver fibrosis in children with and without liver disease. Methods: Children aged 0–18 years, hospitalized in the Emergency Clinical County Hospital Tg. Mures, Romania, between September 15, 2010 and January 15, 2011, were eligible for the study. Four groups were recruited as follows: patients with liver disease in the setting of various malignant disorders, children with non-malignant liver disease, overweight and obese children, and healthy controls. The liver tissue elasticity was assessed in each individual using Shear Wave Velocity (SWV). Biochemical tests included transaminase levels. 19 children with chronic liver disease underwent biopsies. SWV was measured globally and separately for liver segments 1 and 8. Correlations between the SWV and laboratory tests were established using the non-parametric Spearman correlation test. Results: A total of 103 children underwent liver ultrasound elastographic assessments. Of these, 39 had malignancies, 19 had various chronic liver diseases, 13 had nonalcoholic fatty liver disease (NAFLD), and 32 were healthy controls. The transaminase values differed significantly between children with liver diseases and controls. In normal controls, SWV values in the 1st segment were significantly lower compared to those in the 8th segment of the liver (p = 0.0216). In the group with hepatic steatosis, the SWV values were statistically higher compared to those in healthy controls. Positive statistical correlations have been established between AST and

  5. Elastographic assessment of liver fibrosis in children: A prospective single center experience

    Energy Technology Data Exchange (ETDEWEB)

    Marginean, Cristina Oana, E-mail: marginean.oana@gmail.com [Department of Paediatrics, University of Medicine and Pharmacy of Tg. Mures (Romania)]; Marginean, Claudiu, E-mail: marginean.claudiu@gmail.com [Department of Obstetrics and Gynecology, University of Medicine and Pharmacy of Tg. Mures (Romania)]

    2012-08-15

    Background: The assessment of liver damage in various disease states relies on a combination of clinical findings, biochemical parameters and invasive tests such as liver biopsy. Ultrasound elastography has emerged as a potential alternative test, providing quantifiable information on the elasticity/stiffness of the examined tissues. We assessed the performance of ultrasound elastography using real-time Acoustic Radiation Force Imaging (ARFI) technology in evaluating the degree of liver fibrosis in children with and without liver disease. Methods: Children aged 0-18 years, hospitalized in the Emergency Clinical County Hospital Tg. Mures, Romania, between September 15, 2010 and January 15, 2011, were eligible for the study. Four groups were recruited as follows: patients with liver disease in the setting of various malignant disorders, children with non-malignant liver disease, overweight and obese children and healthy controls. The liver tissue elasticity was assessed in each individual using Shear Wave Velocity (SWV). Biochemical tests included transaminase levels. 19 children with chronic liver disease underwent biopsies. SWV was measured globally and separately for liver segments 1 and 8. Correlations between the SWV and laboratory tests were established using the non-parametric Spearman correlation test. Results: A total of 103 children underwent liver ultrasound elastographic assessments. Of these, 39 had malignancies, 19 had various chronic liver diseases, 13 had nonalcoholic fatty liver disease (NAFLD), and 32 were healthy controls. The transaminase values differed significantly between children with liver diseases and controls. In normal controls SWV values in the 1st segment were significantly lower compared to those in the 8th segment of the liver (p = 0.0216). In the group with hepatic steatosis, the SWV values were statistically higher compared to those in healthy controls. Positive statistical correlations have been established between AST and SWV

  6. Assessment of Genetic Diversity in Faba Bean Based on Single Nucleotide Polymorphism

    Directory of Open Access Journals (Sweden)

    Sukhjiwan Kaur

    2014-01-01

    Detection of genetic diversity is important for characterisation of crop plant collections in order to detect the presence of valuable trait variation for use in breeding programs. A collection of faba bean (Vicia faba L.) genotypes was evaluated for intra- and inter-population diversity using a set of 768 genome-wide distributed single nucleotide polymorphism (SNP) markers, of which 657 obtained successful amplification and detected polymorphisms. Gene diversity and polymorphism information content (PIC) values varied between 0.022–0.500 and 0.023–1.00, with averages of 0.363 and 0.287, respectively. The genetic structure of the germplasm collection was analysed and a neighbour-joining (NJ) dendrogram was constructed. The faba bean accessions grouped into two major groups, with several additional smaller sub-groups, predominantly on the basis of geographical origin. These results were further supported by principal co-ordinate analysis (PCoA), deriving two major groupings which were differentiated on the basis of site of origin and pedigree relationships. In general, high levels of heterozygosity were observed, presumably due to the partially allogamous nature of the species. The results will facilitate targeted crossing strategies in future faba bean breeding programs in order to achieve genetic gain.
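
    The two per-marker statistics named in the record above can be sketched as follows. This is a minimal illustration using the commonly cited definitions (gene diversity as expected heterozygosity, and PIC with the pairwise correction term); the abstract does not say which estimators the authors used, so the formulas, function names and allele frequencies here are assumptions.

```python
from itertools import combinations


def gene_diversity(freqs):
    """Expected heterozygosity: 1 minus the sum of squared allele frequencies."""
    return 1.0 - sum(p * p for p in freqs)


def pic(freqs):
    """Polymorphism information content: gene diversity minus the
    pairwise term sum over i<j of 2 * p_i^2 * p_j^2."""
    cross = sum(2.0 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return gene_diversity(freqs) - cross


if __name__ == "__main__":
    # Hypothetical biallelic SNP with allele frequencies 0.7 and 0.3.
    snp = [0.7, 0.3]
    print("gene diversity:", gene_diversity(snp))
    print("PIC:", pic(snp))
```

    Under these definitions a biallelic marker reaches its largest gene diversity when both alleles are equally frequent.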

  7. Performance and risk assessment of subsurface barriers for single-shell tank waste retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Bazinet, G.D.; Cruse, J.M.; Hampsten, K.L. [Westinghouse Hanford Co., Richland, WA (United States)]; Treat, R.L.

    1995-02-01

    Subsurface barriers are among various alternatives under evaluation to mitigate the threat of leakage from the Hanford Site's 149 single-shell high-level radioactive waste tanks. The Tank Waste Remediation System (TWRS) division of Westinghouse Hanford Company is conducting this evaluation of subsurface barriers and other alternatives, focusing on risk and cost as performance measures. A number of alternative retrieval/closure approaches were evaluated in terms of risks (carcinogenic and toxicological) to a postulated maximally exposed individual. In addition, worker and accident risks were evaluated and factors developed for each alternative on a relative basis. The work performed to date indicates the use of subsurface barriers may potentially reduce public risk by limiting contamination of groundwater below the Hanford Site; however, the cost in terms of actual funding and in elevated worker risk is significant. The analyses also assume certain performance levels for technologies that have not been demonstrated in field conditions similar to Hanford Site tank farms. The evaluations summarized herein are being used to support a decision by representatives of the US Department of Energy, Richland Operations Office, the Washington State Department of Ecology (Ecology), and the US Environmental Protection Agency (EPA) regarding potential further development of subsurface barrier technology.

  8. Performance and risk assessment of subsurface barriers for single-shell tank waste retrieval

    International Nuclear Information System (INIS)

    Bazinet, G.D.; Cruse, J.M.; Hampsten, K.L.; Treat, R.L.

    1995-02-01

    Subsurface barriers are among various alternatives under evaluation to mitigate the threat of leakage from the Hanford Site's 149 single-shell high-level radioactive waste tanks. The Tank Waste Remediation System (TWRS) division of Westinghouse Hanford Company is conducting this evaluation of subsurface barriers and other alternatives, focusing on risk and cost as performance measures. A number of alternative retrieval/closure approaches were evaluated in terms of risks (carcinogenic and toxicological) to a postulated maximally exposed individual. In addition, worker and accident risks were evaluated and factors developed for each alternative on a relative basis. The work performed to date indicates the use of subsurface barriers may potentially reduce public risk by limiting contamination of groundwater below the Hanford Site; however, the cost in terms of actual funding and in elevated worker risk is significant. The analyses also assume certain performance levels for technologies that have not been demonstrated in field conditions similar to Hanford Site tank farms. The evaluations summarized herein are being used to support a decision by representatives of the US Department of Energy, Richland Operations Office, the Washington State Department of Ecology (Ecology), and the US Environmental Protection Agency (EPA) regarding potential further development of subsurface barrier technology

  9. Bone mineral content in the senescent rat femur: an assessment using single photon absorptiometry

    International Nuclear Information System (INIS)

    Kiebzak, G.M.; Smith, R.; Howe, J.C.; Sacktor, B.

    1988-01-01

    The single photon absorptiometry technique was evaluated for measuring bone mineral content (BMC) of the excised femurs of the rat, and the system was used to examine the changes in cortical and trabecular bone from young adult (6 mo), mature adult (12 mo), and senescent (24 mo) male and female animals. BMC of the femur midshaft, representing cortical bone, apparently increased progressively with advancing age. The width of the femur at the scan site also increased with age. Normalizing the midshaft BMC by width partially compensated for the age-associated increase. However, when bone mineral values were normalized by the cortical area at the scan site, to take into account the geometric differences in the femurs of different aged animals, maximum bone densities were found in the mature adult and these values decreased slightly in the femurs from senescent rats. In contrast, the BMC of the femur distal metaphysis, representing trabecular bone, decreased markedly in the aged rat. The loss of trabecular bone was also evident from morphological examination of the distal metaphysis. These findings indicated that bone mineral loss with age was site specific in the rat femur. These studies provided additional evidence that the rat might serve as a useful animal model for specific experiments related to the pathogenesis of age-associated osteopenia

  10. A Balance Sheet for Educational Item Banking.

    Science.gov (United States)

    Hiscox, Michael D.

    Educational item banking presents observers with a considerable paradox. The development of test items from scratch is viewed as wasteful, a luxury in times of declining resources. On the other hand, item banking has failed to become a mature technology despite large amounts of money and the efforts of talented professionals. The question of which…

  11. 76 FR 60474 - Commercial Item Handbook

    Science.gov (United States)

    2011-09-29

    ... DEPARTMENT OF DEFENSE Defense Acquisition Regulations System Commercial Item Handbook AGENCY.... SUMMARY: DoD has updated its Commercial Item Handbook. The purpose of the Handbook is to help acquisition personnel develop sound business strategies for procuring commercial items. DoD is seeking industry input on...

  12. Towards an authoring system for item construction

    NARCIS (Netherlands)

    Rikers, Jos H.A.N.

    1988-01-01

    The process of writing test items is analyzed, and a blueprint is presented for an authoring system for test item writing to reduce invalidity and to structure the process of item writing. The developmental methodology is introduced, and the first steps in the process are reported. A historical

  13. Obtaining a Proportional Allocation by Deleting Items

    NARCIS (Netherlands)

    Dorn, B.; de Haan, R.; Schlotter, I.; Röthe, J.

    2017-01-01

    We consider the following control problem on fair allocation of indivisible goods. Given a set I of items and a set of agents, each having strict linear preference over the items, we ask for a minimum subset of the items whose deletion guarantees the existence of a proportional allocation in the

  14. Item Analysis in Introductory Economics Testing.

    Science.gov (United States)

    Tinari, Frank D.

    1979-01-01

    Computerized analysis of multiple choice test items is explained. Examples of item analysis applications in the introductory economics course are discussed with respect to three objectives: to evaluate learning; to improve test items; and to help improve classroom instruction. Problems, costs and benefits of the procedures are identified. (JMD)

  15. Re-evaluating a vision-related quality of life questionnaire with item response theory (IRT) and differential item functioning (DIF) analyses

    Directory of Open Access Journals (Sweden)

    Knol Dirk L

    2011-09-01

    Background For the Low Vision Quality Of Life questionnaire (LVQOL) it is unknown whether the psychometric properties are satisfactory when an item response theory (IRT) perspective is considered. This study evaluates some essential psychometric properties of the LVQOL questionnaire in an IRT model, and investigates differential item functioning (DIF). Methods Cross-sectional data were used from an observational study among visually-impaired patients (n = 296). Calibration was performed for every dimension of the LVQOL in the graded response model. Item goodness-of-fit was assessed with the S-X² test. DIF was assessed on relevant background variables (i.e. age, gender, visual acuity, eye condition, rehabilitation type and administration type) with likelihood-ratio tests for DIF. The magnitude of DIF was interpreted by assessing the largest difference in expected scores between subgroups. Measurement precision was assessed by presenting test information curves; reliability with the index of subject separation. Results All items of the LVQOL dimensions fitted the model. There was significant DIF on several items. For two items the maximum difference between expected scores exceeded one point, and DIF was found on multiple relevant background variables. Item 1 'Vision in general' from the "Adjustment" dimension and item 24 'Using tools' from the "Reading and fine work" dimension were removed. Test information was highest for the "Reading and fine work" dimension. Indices for subject separation ranged from 0.83 to 0.94. Conclusions The items of the LVQOL showed satisfactory item fit to the graded response model; however, two items were removed because of DIF. The adapted LVQOL with 21 items is DIF-free and therefore seems highly appropriate for use in heterogeneous populations of visually impaired patients.

  16. Microchip screening platform for single cell assessment of NK cell cytotoxicity

    Directory of Open Access Journals (Sweden)

    Karolin Guldevall

    2016-04-01

    Here we report a screening platform for assessment of the cytotoxic potential of individual natural killer (NK) cells within larger populations. Human primary NK cells were distributed across a silicon-glass microchip containing 32,400 individual microwells loaded with target cells. Through fluorescence screening and automated image analysis the numbers of NK and live or dead target cells in each well could be assessed at different time points after initial mixing. Cytotoxicity was also studied by time-lapse live-cell imaging in microwells quantifying the killing potential of individual NK cells. Although most resting NK cells (≈75%) were non-cytotoxic against the leukemia cell line K562, some NK cells were able to kill several (≥3) target cells within the 12 hours long experiment. In addition, the screening approach was adapted to increase the chance to find and evaluate serial killing NK cells. Even if the cytotoxic potential varied between donors it was evident that a small fraction of highly cytotoxic NK cells were responsible for a substantial portion of the killing. We demonstrate multiple assays where our platform can be used to enumerate and characterize cytotoxic cells, such as NK or T cells. This approach could find use in clinical applications, e.g. in the selection of donors for stem cell transplantation or generation of highly specific and cytotoxic cells for adoptive immunotherapy.

  17. Microchip Screening Platform for Single Cell Assessment of NK Cell Cytotoxicity

    Science.gov (United States)

    Guldevall, Karolin; Brandt, Ludwig; Forslund, Elin; Olofsson, Karl; Frisk, Thomas W.; Olofsson, Per E.; Gustafsson, Karin; Manneberg, Otto; Vanherberghen, Bruno; Brismar, Hjalmar; Kärre, Klas; Uhlin, Michael; Önfelt, Björn

    2016-01-01

    Here, we report a screening platform for assessment of the cytotoxic potential of individual natural killer (NK) cells within larger populations. Human primary NK cells were distributed across a silicon–glass microchip containing 32,400 individual microwells loaded with target cells. Through fluorescence screening and automated image analysis, the numbers of NK and live or dead target cells in each well could be assessed at different time points after initial mixing. Cytotoxicity was also studied by time-lapse live-cell imaging in microwells quantifying the killing potential of individual NK cells. Although most resting NK cells (≈75%) were non-cytotoxic against the leukemia cell line K562, some NK cells were able to kill several (≥3) target cells within the 12-h long experiment. In addition, the screening approach was adapted to increase the chance to find and evaluate serial killing NK cells. Even if the cytotoxic potential varied between donors, it was evident that a small fraction of highly cytotoxic NK cells were responsible for a substantial portion of the killing. We demonstrate multiple assays where our platform can be used to enumerate and characterize cytotoxic cells, such as NK or T cells. This approach could find use in clinical applications, e.g., in the selection of donors for stem cell transplantation or generation of highly specific and cytotoxic cells for adoptive immunotherapy. PMID:27092139

  18. Quality of life in infants and children with atopic dermatitis: Addressing issues of differential item functioning across countries in multinational clinical trials

    Directory of Open Access Journals (Sweden)

    Tennant Alan

    2007-07-01

    Background A previous study had identified 45 items assessing the impact of atopic dermatitis (AD) on the whole family. From these it was intended to develop two separate scales, one assessing impact on carers and the other determining the effect on the child. Methods The 45 items were included in three clinical trials designed to test the efficacy of a new topical treatment (pimecrolimus, Elidel cream 1%) in the treatment of AD in infants and children and in validation studies in the UK, US, Germany, France and the Netherlands. Rasch analyses were undertaken to determine whether an internationally valid, unidimensional scale could be developed that would inform on the direct impact of AD on the child. Results Rasch analyses applied to the data from the trials indicated that the draft measure consisted of two scales, one assessing the QoL of the carer and the other (consisting of 12 items) measuring the impact of AD on the child. Three of the 12 potential items failed to fit the measurement model in Europe and five in the US. In addition, four items exhibiting differential item functioning (DIF) by country were identified. After removing the misfitting items and controlling for DIF it was possible to derive a scale, the Childhood Impact of Atopic Dermatitis (CIAD), with good item fit for each trial analysis. Analysis of the validation data from each of the different countries confirmed that the CIAD had adequate internal consistency, reproducibility and construct validity. The CIAD demonstrated the benefits of treatment with Elidel over placebo in the European trial. A similar (non-significant) trend was found for the US trials. Conclusion The study represents a novel method of dealing with the problem of DIF associated with different cultures. Such problems are likely to arise in any multinational study involving patient-reported outcome measures, as items in the scales are likely to be valued differently in different cultures. However, where

  19. New technologies for item monitoring

    International Nuclear Information System (INIS)

    Abbott, J.A.; Waddoups, I.G.

    1993-12-01

    This report responds to the Department of Energy's request that Sandia National Laboratories compare existing technologies against several advanced technologies as they apply to DOE needs to monitor the movement of material, weapons, or personnel for safety and security programs. The authors describe several material control systems, discuss their technologies, suggest possible applications, discuss assets and limitations, and project costs for each system. The following systems are described: WATCH system (Wireless Alarm Transmission of Container Handling); Tag system (an electrostatic proximity sensor); PANTRAK system (Personnel And Material Tracking); VRIS (Vault Remote Inventory System); VSIS (Vault Safety and Inventory System); AIMS (Authenticated Item Monitoring System); EIVS (Experimental Inventory Verification System); Metrox system (canister monitoring system); TCATS (Target Cueing And Tracking System); LGVSS (Light Grid Vault Surveillance System); CSS (Container Safeguards System); SAMMS (Security Alarm and Material Monitoring System); FOIDS (Fiber Optic Intelligence & Detection System); GRADS (Graded Radiation Detection System); and PINPAL (Physical Inventory Pallet)

  20. New technologies for item monitoring

    Energy Technology Data Exchange (ETDEWEB)

    Abbott, J.A. [EG & G Energy Measurements, Albuquerque, NM (United States)]; Waddoups, I.G. [Sandia National Labs., Albuquerque, NM (United States)]

    1993-12-01

    This report responds to the Department of Energy's request that Sandia National Laboratories compare existing technologies against several advanced technologies as they apply to DOE needs to monitor the movement of material, weapons, or personnel for safety and security programs. The authors describe several material control systems, discuss their technologies, suggest possible applications, discuss assets and limitations, and project costs for each system. The following systems are described: WATCH system (Wireless Alarm Transmission of Container Handling); Tag system (an electrostatic proximity sensor); PANTRAK system (Personnel And Material Tracking); VRIS (Vault Remote Inventory System); VSIS (Vault Safety and Inventory System); AIMS (Authenticated Item Monitoring System); EIVS (Experimental Inventory Verification System); Metrox system (canister monitoring system); TCATS (Target Cueing And Tracking System); LGVSS (Light Grid Vault Surveillance System); CSS (Container Safeguards System); SAMMS (Security Alarm and Material Monitoring System); FOIDS (Fiber Optic Intelligence & Detection System); GRADS (Graded Radiation Detection System); and PINPAL (Physical Inventory Pallet).

  1. The Body Appreciation Scale-2: item refinement and psychometric evaluation.

    Science.gov (United States)

    Tylka, Tracy L; Wood-Barcalow, Nichole L

    2015-01-01

    Considered a positive body image measure, the 13-item Body Appreciation Scale (BAS; Avalos, Tylka, & Wood-Barcalow, 2005) assesses individuals' acceptance of, favorable opinions toward, and respect for their bodies. While the BAS has accrued psychometric support, we improved it by rewording certain BAS items (to eliminate sex-specific versions and body dissatisfaction-based language) and developing additional items based on positive body image research. In three studies, we examined the reworded, newly developed, and retained items to determine their psychometric properties among college and online community (Amazon Mechanical Turk) samples of 820 women and 767 men. After exploratory factor analysis, we retained 10 items (five original BAS items). Confirmatory factor analysis upheld the BAS-2's unidimensionality and invariance across sex and sample type. Its internal consistency, test-retest reliability, and construct (convergent, incremental, and discriminant) validity were supported. The BAS-2 is a psychometrically sound positive body image measure applicable for research and clinical settings. Copyright © 2014 Elsevier Ltd. All rights reserved.

  2. Item response theory analysis of the mechanics baseline test

    Science.gov (United States)

    Cardamone, Caroline N.; Abbott, Jonathan E.; Rayyan, Saif; Seaton, Daniel T.; Pawl, Andrew; Pritchard, David E.

    2012-02-01

    Item response theory is useful in both the development and evaluation of assessments and in computing standardized measures of student performance. In item response theory, individual parameters (difficulty, discrimination) for each item or question are fit by item response models. These parameters provide a means for evaluating a test and offer a better measure of student skill than a raw test score, because each skill calculation considers not only the number of questions answered correctly, but the individual properties of all questions answered. Here, we present the results from an analysis of the Mechanics Baseline Test given at MIT during 2005-2010. Using the item parameters, we identify questions on the Mechanics Baseline Test that are not effective in discriminating between MIT students of different abilities. We show that a limited subset of the highest quality questions on the Mechanics Baseline Test returns accurate measures of student skill. We compare student skills as determined by item response theory to the more traditional measurement of the raw score and show that a comparable measure of learning gain can be computed.
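
    As a rough illustration of the item parameters mentioned in the record above, the sketch below uses a two-parameter logistic (2PL) item response function and a simple grid-search ability estimate. The 2PL form is an assumption (the abstract does not state which item response model was fitted), and the item values, responses and function names are hypothetical.

```python
import math


def p_correct(theta, discrimination, difficulty):
    """2PL item response function: probability of a correct answer."""
    return 1.0 / (1.0 + math.exp(-discrimination * (theta - difficulty)))


def log_likelihood(theta, items, responses):
    """Log-likelihood of a 0/1 response pattern at ability theta."""
    total = 0.0
    for (a, b), x in zip(items, responses):
        p = p_correct(theta, a, b)
        total += math.log(p) if x == 1 else math.log(1.0 - p)
    return total


def estimate_skill(items, responses):
    """Maximum-likelihood ability on a coarse grid from -4 to 4."""
    grid = [i / 100.0 for i in range(-400, 401)]
    return max(grid, key=lambda t: log_likelihood(t, items, responses))


if __name__ == "__main__":
    # Hypothetical (discrimination, difficulty) pairs for three questions.
    items = [(0.5, -1.0), (2.0, 0.0), (1.0, 1.0)]
    # Two students with the same raw score (2 of 3) but different patterns.
    student_a = [1, 0, 1]
    student_b = [0, 1, 1]
    print("student A skill:", estimate_skill(items, student_a))
    print("student B skill:", estimate_skill(items, student_b))
```

    In this sketch the two students share a raw score yet receive different skill estimates, because the estimate weighs which questions were answered correctly and not only how many.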

  3. Improving Measurement Efficiency of the Inner EAR Scale with Item Response Theory.

    Science.gov (United States)

    Jessen, Annika; Ho, Andrew D; Corrales, C Eduardo; Yueh, Bevan; Shin, Jennifer J

    2018-02-01

    Objectives (1) To assess the 11-item Inner Effectiveness of Auditory Rehabilitation (Inner EAR) instrument with item response theory (IRT). (2) To determine whether the underlying latent ability could also be accurately represented by a subset of the items for use in high-volume clinical scenarios. (3) To determine whether the Inner EAR instrument correlates with pure tone thresholds and word recognition scores. Design IRT evaluation of prospective cohort data. Setting Tertiary care academic ambulatory otolaryngology clinic. Subjects and Methods Modern psychometric methods, including factor analysis and IRT, were used to assess unidimensionality and item properties. Regression methods were used to assess prediction of word recognition and pure tone audiometry scores. Results The Inner EAR scale is unidimensional, and items varied in their location and information. Information parameter estimates ranged from 1.63 to 4.52, with higher values indicating more useful items. The IRT model provided a basis for identifying 2 sets of items with relatively lower information parameters. Item information functions demonstrated which items added insubstantial value over and above other items and were removed in stages, creating an 8- and 3-item Inner EAR scale for more efficient assessment. The 8-item version accurately reflected the underlying construct. All versions correlated moderately with word recognition scores and pure tone averages. Conclusion The 11-, 8-, and 3-item versions of the Inner EAR scale have strong psychometric properties, and there is correlational validity evidence for the observed scores. Modern psychometric methods can help streamline care delivery by maximizing relevant information per item administered.

  4. Comparative study of single and multislice computed tomography for assessment of the mandibular canal

    Directory of Open Access Journals (Sweden)

    Adriana da Silva Ferreira Paes

    2007-06-01

    OBJECTIVE: The purpose of this study was to evaluate the accuracy of relative measurements from the roof of the mandibular canal to the alveolar crest in multislice (multidetector) computed tomography (MDCT) and single-slice computed tomography (SSCT). MATERIAL AND METHODS: The sample consisted of 26 printed CT films (7 SSCT and 19 MDCT) from the files of the LABI-3D (3D Imaging Laboratory) of the School of Dentistry of the University of São Paulo (FOUSP), which had been acquired using different protocols. Two observers analyzed in a randomized and independent order a series of 22 oblique CT reconstructions of each patient. Each observer analyzed the CT scans twice. The length of the mandibular canal and the distance between the mandibular canal roof and the crest of the alveolar ridge were obtained. The Dahlberg test was used for statistical analysis. RESULTS: The mean error found for the mandibular canal length measurements obtained from SSCT was 0.53 mm in the interobserver analysis, and 0.38 mm for both observers. On MDCT images, the mean error was 0.0 mm in the interobserver analysis, and 0.0 and 0.23 mm in the intraobserver analysis. Regarding the distance between the mandibular canal roof and the alveolar bone crest, the SSCT images showed a mean error of 1.16 mm in the interobserver analysis and 0.66 and 0.59 mm in the intraobserver analysis. In the MDCT images, the mean error was 0.72 mm in the interobserver analysis and 0.50 and 0.54 mm in the intraobserver analysis. CONCLUSION: Multislice CT proved to be a more accurate method and demonstrated high reproducibility in the analysis of important anatomical landmarks for planning of mandibular dental implants, namely the mandibular canal pathway and alveolar crest height.
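
    The Dahlberg error named in the record above is commonly computed from paired repeat measurements as the square root of the summed squared differences divided by twice the number of pairs. The sketch below assumes that standard formula; the record does not spell out the exact computation, and the sample measurements are invented for illustration, not taken from the study.

```python
import math


def dahlberg_error(first, second):
    """Dahlberg's method error for paired repeat measurements:
    sqrt(sum(d_i^2) / (2 * n)), where d_i is the difference within pair i."""
    if len(first) != len(second) or not first:
        raise ValueError("need two non-empty series of equal length")
    squared = sum((x - y) ** 2 for x, y in zip(first, second))
    return math.sqrt(squared / (2 * len(first)))


if __name__ == "__main__":
    # Hypothetical canal-to-crest distances (mm) read twice by one observer.
    reading_1 = [14.2, 12.8, 15.1, 13.4]
    reading_2 = [14.0, 13.1, 15.0, 13.9]
    print("Dahlberg error (mm):", dahlberg_error(reading_1, reading_2))
```

    The same function covers the interobserver case when the two series come from different observers reading the same reconstructions.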

  5. Aquatic toxicity assessment of single-walled carbon nanotubes using zebrafish embryos

    Energy Technology Data Exchange (ETDEWEB)

    Pan Huichin; Lin Yujun; Li Mengwei [Department of Biomedical Sciences, Chung Shan Medical University, Taichung 40201, Taiwan (China)]; Chuang Hanni; Chou Chengchung, E-mail: bioccc@ccu.edu.tw, E-mail: hp29@csmu.edu.tw [Department of Life Science, National Chung Cheng University, Min-Hsiung, 62102 Taiwan (China)]

    2011-07-06

    Zebrafish embryos selected at the 64-cell stage were exposed to various concentrations of amide functionalized single-walled carbon nanotubes (SWCNTs) ranging from 1 to 10 μg/ml dissolved in 1% Pluronic F-68 (a cell culture grade surfactant), and the development of embryos was examined from 24 to 120 hours post fertilization (hpf). Incubation of embryos in 1% F-68 did not induce overt abnormal phenotype as compared to the wild-type; neither did it cause significant mortality during the exposure period. Generally, there was a slight developmental delay in larvae treated with SWCNTs of 5 μg/ml or above. Only larvae exposed to ≥ 5 μg/ml SWCNTs showed significantly reduced survival rates. About 50% of the embryos exposed to 5 μg/ml showed abnormal phenotypes at 24 hpf as compared to the control group. As development proceeds to 120 hpf, more embryos displayed defective morphology. A slight hatching delay was observed in embryos exposed to concentrations above 5 μg/ml. There was a general reduction of body axes, including narrowed somite and shortened yolk stalk. In addition, pigmentation in the ventral trunk area was less than that observed in control group. The body lengths of the exposed embryos were decreased significantly at 48 hpf (3.11 mm in control vs. 3.00 mm in SWCNTs-exposed embryos). However, exposure to SWCNTs did not affect the number of somites. Other features that were noticed in the SWCNTs-exposed embryos included edema and shrinkage and blebbing of the epidermal lining. Most of these observed phenotypes persisted from 48 hpf through 120 hpf. Overall, the aforementioned results indicate that soluble amide-functionalized SWCNTs are toxic to zebrafish embryos at a minimum concentration of 5 μg/ml.

  6. A note on monotonicity of item response functions for ordered polytomous item response theory models.

    Science.gov (United States)

    Kang, Hyeon-Ah; Su, Ya-Hui; Chang, Hua-Hua

    2018-03-08

    A monotone relationship between a true score (τ) and a latent trait level (θ) has been a key assumption for many psychometric applications. The monotonicity property in dichotomous response models is evident as a result of a transformation via a test characteristic curve. Monotonicity in polytomous models, in contrast, is not immediately obvious because item response functions are determined by a set of response category curves, which are conceivably non-monotonic in θ. The purpose of the present note is to demonstrate strict monotonicity in ordered polytomous item response models. Five models that are widely used in operational assessments are considered for proof: the generalized partial credit model (Muraki, 1992, Applied Psychological Measurement, 16, 159), the nominal model (Bock, 1972, Psychometrika, 37, 29), the partial credit model (Masters, 1982, Psychometrika, 47, 147), the rating scale model (Andrich, 1978, Psychometrika, 43, 561), and the graded response model (Samejima, 1972, A general model for free-response data (Psychometric Monograph no. 18). Psychometric Society, Richmond). The study asserts that the item response functions in these models strictly increase in θ and thus there exists strict monotonicity between τ and θ under certain specified conditions. This conclusion validates the practice of customarily using τ in place of θ in applied settings and provides theoretical grounds for one-to-one transformations between the two scales. © 2018 The British Psychological Society.
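
    The monotonicity claim in the record above can be checked numerically for one of the listed models. The sketch below evaluates the expected item score under the graded response model on a grid of θ values; it is a numerical illustration and not the proof given in the paper, and the discrimination, thresholds and function names are hypothetical.

```python
import math


def cumulative_probs(theta, a, thresholds):
    """P(X >= k) for k = 0..m+1 under the graded response model,
    with the boundary values 1 (k = 0) and 0 (k = m + 1)."""
    inner = [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
    return [1.0] + inner + [0.0]


def category_probs(theta, a, thresholds):
    """P(X = k) as the difference of adjacent cumulative probabilities."""
    cum = cumulative_probs(theta, a, thresholds)
    return [cum[k] - cum[k + 1] for k in range(len(cum) - 1)]


def expected_score(theta, a, thresholds):
    """Expected item score: sum over categories of k * P(X = k)."""
    return sum(k * p for k, p in enumerate(category_probs(theta, a, thresholds)))


if __name__ == "__main__":
    a, thresholds = 1.2, [-1.0, 0.0, 1.5]  # hypothetical, ordered thresholds
    grid = [i / 2.0 for i in range(-8, 9)]
    scores = [expected_score(t, a, thresholds) for t in grid]
    increasing = all(s1 < s2 for s1, s2 in zip(scores, scores[1:]))
    print("expected score strictly increasing on grid:", increasing)
```

    Individual category curves in this sketch rise and then fall as θ grows, while the expected score built from them keeps increasing, which is the contrast the abstract draws.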

  7. Bile Gastritis Following Laparoscopic Single Anastomosis Gastric Bypass: Pilot Study to Assess Significance of Bilirubin Level in Gastric Aspirate.

    Science.gov (United States)

    Shenouda, Michael M; Harb, Shady ElGhazaly; Mikhail, Sameh A A; Mokhtar, Sherif M; Osman, Ayman M A; Wassef, Arsany T S; Rizkallah, Nayer N H; Milad, Nader M; Anis, Shady E; Nabil, Tamer Mohamed; Zaki, Nader Sh; Halepian, Antoine

    2018-02-01

    Laparoscopic single anastomosis gastric bypass (SAGB) is increasingly performed for morbidly obese patients. This pilot study aims primarily at evaluating the incidence of bile gastritis after SAGB. The occurrence of reflux oesophagitis and reflux symptoms was also assessed. This study included 20 patients having no reflux symptoms. All patients underwent a SAGB as a primary bariatric procedure by a single surgeon. The included patients consented to have an upper GI endoscopy done at 6 months postoperatively. Gastric aspirate was sent for bilirubin level assessment. Gastric and esophageal biopsies were submitted for histopathology and campylobacter-like organism (CLO) test. In our study, the rate of bile gastritis was 30%. In 18 patients, the level of bilirubin in gastric aspirate seemed to be related to the degree of mucosal inflammation. The remaining two patients had microscopic moderate to severe gastritis with a normal aspirate bilirubin level. Two patients with an aspirate bilirubin level of more than 20 mg/dl had severe oesophagitis, gastritis with erosions, and metaplasia. The relationship between bilirubin level and histopathological findings of gastric biopsy examination was statistically significant with a P value of 0.001. The incidence of bile gastritis in this cohort is higher than reported in the literature, and this may be worrying. The correlation between endoscopic findings and patients' symptoms is poor. Bilirubin level and pH in aspirate might be useful tools to confirm alkaline reflux. The bilirubin level might help to choose candidates for revision surgery after SAGB. This needs further validation with a larger sample size.

  8. Epilogue: Reading Comprehension Is Not a Single Ability-Implications for Assessment and Instruction.

    Science.gov (United States)

    Kamhi, Alan G; Catts, Hugh W

    2017-04-20

    In this epilogue, we review the 4 response articles and highlight the implications of a multidimensional view of reading for the assessment and instruction of reading comprehension. We reiterate the problems with standardized tests of reading comprehension and discuss the advantages and disadvantages of recently developed authentic tests of reading comprehension. In the "Instruction" section, we review the benefits and limitations of strategy instruction and highlight suggestions from the response articles to improve content and language knowledge. We argue that the only compelling reason to administer a standardized test of reading comprehension is when these tests are necessary to qualify students for special education services. Instruction should be focused on content knowledge, language knowledge, and specific task and learning requirements. This instruction may entail the use of comprehension strategies, particularly those that are specific to the task and focus on integrating new knowledge with prior knowledge.

  9. Approximation Preserving Reductions among Item Pricing Problems

    Science.gov (United States)

    Hamane, Ryoso; Itoh, Toshiya; Tomita, Kouhei

    When a store sells items to customers, the store wishes to determine the prices of the items to maximize its profit. Intuitively, if the store sells the items at low (resp. high) prices, the customers buy more (resp. fewer) items, which provides less profit to the store. So it would be hard for the store to decide the prices of items. Assume that the store has a set V of n items and there is a set E of m customers who wish to buy those items, and also assume that each item i ∈ V has the production cost d_i and each customer e_j ∈ E has the valuation v_j on the bundle e_j ⊆ V of items. When the store sells an item i ∈ V at the price r_i, the profit for the item i is p_i = r_i - d_i. The goal of the store is to decide the price of each item to maximize its total profit. We refer to this maximization problem as the item pricing problem. In most of the previous works, the item pricing problem was considered under the assumption that p_i ≥ 0 for each i ∈ V; however, Balcan et al. [In Proc. of WINE, LNCS 4858, 2007] introduced the notion of “loss-leader,” and showed that the seller can get more total profit in the case that p_i < 0 is allowed than in the case that p_i < 0 is not allowed. In this paper, we derive approximation preserving reductions among several item pricing problems and show that all of them have algorithms with good approximation ratios.
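
    The objective described above can be made concrete with a small sketch. It assumes a purchase rule the abstract does not spell out: each customer buys the whole bundle exactly when the sum of its item prices does not exceed the customer's valuation. The function name, the two-item instance, and both price vectors are hypothetical and chosen only to illustrate the loss-leader effect; they are not taken from the paper.

```python
def total_profit(prices, costs, customers):
    """Total profit of a price assignment.

    prices, costs: dicts mapping item -> price r_i and production cost d_i.
    customers: list of (bundle, valuation) pairs, where bundle is a set of items.
    Assumed rule: a customer buys the whole bundle iff its total price <= valuation.
    """
    profit = 0
    for bundle, valuation in customers:
        bundle_price = sum(prices[i] for i in bundle)
        if bundle_price <= valuation:
            # Each sold item contributes its per-item profit p_i = r_i - d_i.
            profit += sum(prices[i] - costs[i] for i in bundle)
    return profit


# Hypothetical instance: item x costs 2 to produce, item y costs nothing.
costs = {"x": 2, "y": 0}
customers = [({"x", "y"}, 10), ({"y"}, 9)]

# Every item priced at or above cost (p_i >= 0 for all i).
no_loss = total_profit({"x": 2, "y": 8}, costs, customers)

# Item x priced below cost (p_x < 0), acting as a loss leader.
loss_leader = total_profit({"x": 1, "y": 9}, costs, customers)

print(no_loss, loss_leader)
```

    In this toy instance the below-cost price on x lets the store raise the price of y without losing either customer, which is the kind of gain the loss-leader notion refers to.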

  10. Physical assessment of the GE/CGR Neurocam and comparison with a single rotating gamma-camera

    International Nuclear Information System (INIS)

    Kouris, K.; Jarritt, P.H.; Costa, D.C.; Ell, P.J.

    1992-01-01

    The GE/CGR Neurocam is a triple-headed single photon emission tomography (SPET) system dedicated to multi-slice brain tomography. We have assessed its physical performance in terms of sensitivity and resolution, and its clinical efficacy in comparison with a modern, single, rotating gamma-camera (GE 400XCT). Using a water-filled cylinder containing Tc-99m, the tomographic volume sensitivity of the Neurocam was 30.0 and 50.7 kcps/MBq.ml.cm for the high-resolution and general-purpose collimators, respectively; the corresponding values for the single rotating camera were 7.6 and 12.8 kcps/MBq.ml.cm. Tomographic resolution was measured in air and in water. In air, the Neurocam resolution at the centre of the field-of-view is 9.0 and 10.7 mm full width at half-maximum (FWHM) with the high-resolution and general-purpose collimators, respectively, and is isotropic in the three orthogonal planes; the resolution of the GE 400XCT with its 13-cm radius of rotation is 10.3 and 11.7 mm, respectively. For the Neurocam with the HR collimator, the transaxial FWHM values in water were 9.7 mm at the centre and 9.5 mm radial (6.6 mm tangential) at 8 cm from the centre. The physical characteristics of the Neurocam enable the routine acquisition of brain perfusion data with Tc-99m hexamethyl-propylene amine oxime in about 14 min, yielding better image quality than with a single rotating camera in 40 min.

  11. A Generalized Logistic Regression Procedure to Detect Differential Item Functioning among Multiple Groups

    Science.gov (United States)

    Magis, David; Raiche, Gilles; Beland, Sebastien; Gerard, Paul

    2011-01-01

    We present an extension of the logistic regression procedure to identify dichotomous differential item functioning (DIF) in the presence of more than two groups of respondents. Starting from the usual framework of a single focal group, we propose a general approach to estimate the item response functions in each group and to test for the presence…

  12. Prevalence of single nucleotide polymorphism among 27 diverse alfalfa genotypes as assessed by transcriptome sequencing

    Directory of Open Access Journals (Sweden)

    Li Xuehui

    2012-10-01

    Background Alfalfa, a perennial, outcrossing species, is a widely planted forage legume producing highly nutritious biomass. Currently, improvement of cultivated alfalfa mainly relies on recurrent phenotypic selection. Marker assisted breeding strategies can enhance alfalfa improvement efforts, particularly if many genome-wide markers are available. Transcriptome sequencing enables efficient high-throughput discovery of single nucleotide polymorphism (SNP) markers for a complex polyploid species. Result The transcriptomes of 27 alfalfa genotypes, including elite breeding genotypes, parents of mapping populations, and unimproved wild genotypes, were sequenced using an Illumina Genome Analyzer IIx. De novo assembly of quality-filtered 72-bp reads generated 25,183 contigs with a total length of 26.8 Mbp and an average length of 1,065 bp, with an average read depth of 55.9-fold for each genotype. Overall, 21,954 (87.2%) of the 25,183 contigs represented 14,878 unique protein accessions. Gene ontology (GO) analysis suggested that a broad diversity of genes was represented in the resulting sequences. The realignment of individual reads to the contigs enabled the detection of 872,384 SNPs and 31,760 InDels. High resolution melting (HRM) analysis was used to validate 91% of 192 putative SNPs identified by sequencing. Both allelic variants at about 95% of SNP sites identified among five wild, unimproved genotypes are still present in cultivated alfalfa, and all four US breeding programs also contain a high proportion of these SNPs. Thus, little evidence exists among this dataset for loss of significant DNA sequence diversity from either domestication or breeding of alfalfa. Structure analysis indicated that individuals from the subspecies falcata, the diploid subspecies caerulea, and the tetraploid subspecies sativa (cultivated tetraploid alfalfa) were clearly separated. Conclusion We used transcriptome sequencing to discover large numbers of SNPs

  13. Development of six PROMIS pediatrics proxy-report item banks.

    Science.gov (United States)

    Irwin, Debra E; Gross, Heather E; Stucky, Brian D; Thissen, David; DeWitt, Esi Morgan; Lai, Jin Shei; Amtmann, Dagmar; Khastou, Leyla; Varni, James W; DeWalt, Darren A

    2012-02-22

    Pediatric self-report should be considered the standard for measuring patient reported outcomes (PRO) among children. However, circumstances exist when the child is too young, cognitively impaired, or too ill to complete a PRO instrument and a proxy-report is needed. This paper describes the development process including the proxy cognitive interviews and large-field-test survey methods and sample characteristics employed to produce item parameters for the Patient Reported Outcomes Measurement Information System (PROMIS) pediatric proxy-report item banks. The PROMIS pediatric self-report items were converted into proxy-report items before undergoing cognitive interviews. These items covered six domains (physical function, emotional distress, social peer relationships, fatigue, pain interference, and asthma impact). Caregivers (n = 25) of children between the ages of 5 and 17 years provided qualitative feedback on proxy-report items to assess any major issues with these items. From May 2008 to March 2009, the large-scale survey enrolled children ages 8-17 years to complete the self-report version and caregivers to complete the proxy-report version of the survey (n = 1548 dyads). Caregivers of children ages 5 to 7 years completed the proxy report survey (n = 432). In addition, caregivers completed other proxy instruments, PedsQL™ 4.0 Generic Core Scales Parent Proxy-Report version, PedsQL™ Asthma Module Parent Proxy-Report version, and KIDSCREEN Parent-Proxy-52. Item content was well understood by proxies and did not require item revisions, but some proxies clearly noted that determining an answer on behalf of their child was difficult for some items. Dyads and caregivers of children ages 5-17 years old were enrolled in the large-scale testing. The majority were female (85%), married (70%), Caucasian (64%) and had at least a high school education (94%). Approximately 50% had children with a chronic health condition, primarily asthma, which was diagnosed or treated within 6

  14. Development of six PROMIS pediatrics proxy-report item banks

    Directory of Open Access Journals (Sweden)

    Irwin Debra E

    2012-02-01

    Background Pediatric self-report should be considered the standard for measuring patient reported outcomes (PRO) among children. However, circumstances exist when the child is too young, cognitively impaired, or too ill to complete a PRO instrument and a proxy-report is needed. This paper describes the development process including the proxy cognitive interviews and large-field-test survey methods and sample characteristics employed to produce item parameters for the Patient Reported Outcomes Measurement Information System (PROMIS) pediatric proxy-report item banks. Methods The PROMIS pediatric self-report items were converted into proxy-report items before undergoing cognitive interviews. These items covered six domains (physical function, emotional distress, social peer relationships, fatigue, pain interference, and asthma impact). Caregivers (n = 25) of children between the ages of 5 and 17 years provided qualitative feedback on proxy-report items to assess any major issues with these items. From May 2008 to March 2009, the large-scale survey enrolled children ages 8-17 years to complete the self-report version and caregivers to complete the proxy-report version of the survey (n = 1548 dyads). Caregivers of children ages 5 to 7 years completed the proxy report survey (n = 432). In addition, caregivers completed other proxy instruments, PedsQL™ 4.0 Generic Core Scales Parent Proxy-Report version, PedsQL™ Asthma Module Parent Proxy-Report version, and KIDSCREEN Parent-Proxy-52. Results Item content was well understood by proxies and did not require item revisions, but some proxies clearly noted that determining an answer on behalf of their child was difficult for some items. Dyads and caregivers of children ages 5-17 years old were enrolled in the large-scale testing. The majority were female (85%), married (70%), Caucasian (64%) and had at least a high school education (94%). Approximately 50% had children with a chronic health condition, primarily

  15. Losing Items in the Psychogeriatric Nursing Home

    Directory of Open Access Journals (Sweden)

    J. van Hoof PhD

    2016-09-01

    Introduction: Losing items is a time-consuming occurrence in nursing homes that is poorly described. An explorative study was conducted to investigate which items were lost by nursing home residents, and how this affects the residents and family caregivers. Method: Semi-structured interviews and card sorting tasks were conducted with 12 residents with early-stage dementia and 12 family caregivers. Thematic analysis was applied to the outcomes of the sessions. Results: The participants stated that numerous personal items and assistive devices get lost in the nursing home environment, which had various emotional, practical, and financial implications. Significant amounts of time are spent on trying to find items, varying from 1 hr up to a couple of weeks. Numerous potential solutions were identified by the interviewees. Discussion: Losing items often goes together with limitations to the participation of residents. Many family caregivers are reluctant to replace lost items, as these items may get lost again.

  16. Development and psychometric evaluation of the PROMIS Pediatric Life Satisfaction item banks, child-report, and parent-proxy editions.

    Science.gov (United States)

    Forrest, Christopher B; Devine, Janine; Bevans, Katherine B; Becker, Brandon D; Carle, Adam C; Teneralli, Rachel E; Moon, JeanHee; Tucker, Carole A; Ravens-Sieberer, Ulrike

    2018-01-01

    To describe the psychometric evaluation and item response theory calibration of the PROMIS Pediatric Life Satisfaction item banks, child-report, and parent-proxy editions. A pool of 55 life satisfaction items was administered to 1992 children 8-17 years old and 964 parents of children 5-17 years old. Analyses included descriptive statistics, reliability, factor analysis, differential item functioning, and assessment of construct validity. Thirteen items were deleted because of poor psychometric performance. An 8-item short form was administered to a national sample of 996 children 8-17 years old, and 1294 parents of children 5-17 years old. The combined sample (2988 children and 2258 parents) was used in item response theory (IRT) calibration analyses. The final item banks were unidimensional, the items were locally independent, and the items were free from impactful differential item functioning. The 8-item and 4-item short form scales showed excellent reliability, convergent validity, and discriminant validity. Life satisfaction decreased with declining socio-economic status, presence of a special health care need, and increasing age for girls, but not boys. After IRT calibration, we found that 4- and 8-item short forms had a high degree of precision (reliability) across a wide range (>4 SD units) of the latent variable. The PROMIS Pediatric Life Satisfaction item banks and their short forms provide efficient, precise, and valid assessments of life satisfaction in children and youth.

  17. Quantitative Analysis of Complex Multiple-Choice Items in Science Technology and Society: Item Scaling

    Directory of Open Access Journals (Sweden)

    Ángel Vázquez Alonso

    2005-05-01

    The scant attention to assessment and evaluation in science education research has been especially harmful for Science-Technology-Society (STS) education, due to the dialectic, tentative, value-laden, and controversial nature of most STS topics. To overcome the methodological pitfalls of the STS assessment instruments used in the past, an empirically developed instrument (VOSTS, Views on Science-Technology-Society) has been suggested. Some methodological proposals, namely the multiple response models and the computing of a global attitudinal index, were suggested to improve the item implementation. The final step of these methodological proposals requires the categorization of STS statements. This paper describes the process of categorization through a scaling procedure ruled by a panel of experts, acting as judges, according to the body of knowledge from history, epistemology, and sociology of science. The statement categorization allows for the sound foundation of STS items, which is useful in educational assessment and science education research, and may also increase teachers’ self-confidence in the development of the STS curriculum for science classrooms.

  18. Australian Biology Test Item Bank, Years 11 and 12. Volume II: Year 12.

    Science.gov (United States)

    Brown, David W., Ed.; Sewell, Jeffrey J., Ed.

    This document consists of test items which are applicable to biology courses throughout Australia (irrespective of course materials used); assess key concepts within course statement (for both core and optional studies); assess a wide range of cognitive processes; and are relevant to current biological concepts. These items are arranged under…

  19. Australian Biology Test Item Bank, Years 11 and 12. Volume I: Year 11.

    Science.gov (United States)

    Brown, David W., Ed.; Sewell, Jeffrey J., Ed.

    This document consists of test items which are applicable to biology courses throughout Australia (irrespective of course materials used); assess key concepts within course statement (for both core and optional studies); assess a wide range of cognitive processes; and are relevant to current biological concepts. These items are arranged under…

  20. What Does a Verbal Test Measure? A New Approach to Understanding Sources of Item Difficulty.

    Science.gov (United States)

    Berk, Eric J. Vanden; Lohman, David F.; Cassata, Jennifer Coyne

    Assessing the construct relevance of mental test results continues to present many challenges, and it has proven to be particularly difficult to assess the construct relevance of verbal items. This study was conducted to gain a better understanding of the conceptual sources of verbal item difficulty using a unique approach that integrates…

  1. Should we assess clinical performance in single patient encounters or consistent behaviors of clinical performance over a series of encounters? A qualitative exploration of narrative trainee profiles

    NARCIS (Netherlands)

    Oerlemans, M.; Dielissen, P.W.; Timmerman, A.; Ram, P.; Maiburg, B.; Muris, J.; Vleuten, C. van der

    2017-01-01

    BACKGROUND: A variety of tools have been developed to assess performance which typically use a single clinical encounter as a source for making competency inferences. This strategy may miss consistent behaviors. We therefore explored experienced clinical supervisors' perceptions of behavioral

  2. Assessment of Intervertebral Disc Degeneration With Magnetic Resonance Single-Voxel Spectroscopy

    Science.gov (United States)

    Zuo, Jin; Saadat, Ehsan; Romero, Adan; Loo, Kimberly; Li, Xiaojuan; Link, Thomas M.; Kurhanewicz, John; Majumdar, Sharmila

    2014-01-01

    This study examined the feasibility of using short-echo water-suppressed point-resolved spectroscopy (PRESS) on a clinical 3T magnetic resonance (MR) scanner for evaluating biochemical changes in degenerated bovine and cadaveric human intervertebral discs. In bovine discs (N = 17), degeneration was induced with papain injections. Degeneration of human cadaveric discs (N = 27) was assessed using the Pfirrmann grading on T2-weighted images. Chemicals in the carbohydrate region (Carb), the choline head group (Cho), the N-acetyl region (N-acetyl), and the lipid and lactate region (Lac+Lip) were quantified using 1H PRESS, and were compared between specimens with different degrees of degeneration. The correlation between the spectroscopic findings and glycosaminoglycan (GAG) quantification using biochemical assays was determined. Significant differences were found between the ratios (N-acetyl/Cho, N-acetyl/Lac+Lip) acquired before and after papain injection in bovine discs. For human cadaveric discs, significant differences in the ratios (N-acetyl/Carb, N-acetyl/Lac+Lip) were found between discs having high and low Pfirrmann scores. Significant correlations were found between N-acetyl/Lac+Lip and GAG content in bovine discs (R = 0.77, P = 0.0007) and cadaveric discs (R = 0.83, P < 0.0001). A significant correlation between N-acetyl/Cho and GAG content was also found in cadaveric discs (R = 0.64, P = 0.0039). This study demonstrates for the first time that short-echo PRESS on a clinical 3T MR scanner can be used to noninvasively and reproducibly quantify metabolic changes associated with degeneration of intervertebral discs. PMID:19780173

  3. Assessing nitrogen fixation in mixed- and single-species plantations of Eucalyptus globulus and Acacia mearnsii.

    Science.gov (United States)

    Forrester, David I; Schortemeyer, Marcus; Stock, William D; Bauhus, Jürgen; Khanna, Partap K; Cowie, Annette L

    2007-09-01

    Mixtures of Eucalyptus globulus Labill. and Acacia mearnsii de Wildeman are twice as productive as E. globulus monocultures growing on the same site in East Gippsland, Victoria, Australia, possibly because of increased nitrogen (N) availability owing to N₂ fixation by A. mearnsii. To investigate whether N₂ fixation by A. mearnsii could account for the mixed-species growth responses, we assessed N₂ fixation by the accretion method and the ¹⁵N natural abundance method. Nitrogen gained by E. globulus and A. mearnsii mixtures and monocultures was calculated by the accretion method with plant and soil samples collected 10 years after plantation establishment. Nitrogen in biomass and soil confirmed that A. mearnsii influenced N dynamics. Assuming that the differences in soil, forest floor litter and biomass N of plots containing A. mearnsii compared with E. globulus monocultures were due to N₂ fixation, the 10-year annual mean rates of N₂ fixation were 38 and 86 kg ha⁻¹ year⁻¹ in 1:1 mixtures and A. mearnsii monocultures, respectively. Nitrogen fixation by A. mearnsii could not be quantified on the basis of the natural abundance of ¹⁵N because such factors as mycorrhization type and fractionation of N isotopes during N cycling within the plant confounded the effect of the N source on the N isotopic signature of plants. This study shows that A. mearnsii fixed significant quantities of N₂ when mixed with E. globulus. A decline in δ¹⁵N values of E. globulus and A. mearnsii with time, from 2 to 10 years, is further evidence that N₂ was fixed and cycled through the stands. The increased aboveground biomass production of E. globulus trees in mixtures when compared with monocultures can be attributed to increases in N availability.

  4. Perioperative risk assessment in robotic general surgery: lessons learned from 884 cases at a single institution.

    Science.gov (United States)

    Buchs, Nicolas C; Addeo, Pietro; Bianco, Francesco M; Gorodner, Veronica; Ayloo, Subhashini M; Elli, Enrique F; Oberholzer, José; Benedetti, Enrico; Giulianotti, Pier C

    2012-08-01

    To assess factors associated with morbidity and mortality following the use of robotics in general surgery. Case series. University of Illinois at Chicago. Eight hundred eighty-four consecutive patients who underwent a robotic procedure in our institution between April 2007 and July 2010. Perioperative morbidity and mortality. During the study period, 884 patients underwent a robotic procedure. The conversion rate was 2%, the mortality rate was 0.5%, and the overall postoperative morbidity rate was 16.7%. The reoperation rate was 2.4%. Mean length of stay was 4.5 days (range, 0.2-113 days). In univariate analysis, several factors were associated with increased morbidity and included either patient-related (cardiovascular and renal comorbidities, American Society of Anesthesiologists score ≥ 3, body mass index [calculated as weight in kilograms divided by height in meters squared] surgery, malignant disease, body mass index of less than 30, hypertension, and transfusion were factors significantly associated with a higher risk for complications. American Society of Anesthesiologists score of 3 or greater, age 70 years or older, cardiovascular comorbidity, and blood loss of 500 mL or more were also associated with increased risk for mortality. Use of the robotic approach for general surgery can be achieved safely with low morbidity and mortality. Several risk factors have been identified as independent causes for higher morbidity and mortality. These can be used to identify patients at risk before and during the surgery and, in the future, to develop a scoring system for the use of robotic general surgery

  5. The Piper Fatigue Scale-12 (PFS-12): psychometric findings and item reduction in a cohort of breast cancer survivors.

    Science.gov (United States)

    Reeve, Bryce B; Stover, Angela M; Alfano, Catherine M; Smith, Ashley Wilder; Ballard-Barbash, Rachel; Bernstein, Leslie; McTiernan, Anne; Baumgartner, Kathy B; Piper, Barbara F

    2012-11-01

    Brief, valid measures of fatigue, a prevalent and distressing cancer symptom, are needed for use in research. This study's primary aim was to create a shortened version of the revised Piper Fatigue Scale (PFS-R) based on data from a diverse cohort of breast cancer survivors. A secondary aim was to determine whether the PFS captured multiple distinct aspects of fatigue (a multidimensional model) or a single overall fatigue factor (a unidimensional model). Breast cancer survivors (n = 799; stages in situ through IIIa; ages 29-86 years) were recruited through three SEER registries (New Mexico, Western Washington, and Los Angeles, CA) as part of the Health, Eating, Activity, and Lifestyle (HEAL) study. Fatigue was measured approximately 3 years post-diagnosis using the 22-item PFS-R that has four subscales (Behavior, Affect, Sensory, and Cognition). Confirmatory factor analysis was used to compare unidimensional and multidimensional models. Six criteria were used to make item selections to shorten the PFS-R: scale's content validity, items' relationship with fatigue, content redundancy, differential item functioning by race and/or education, scale reliability, and literacy demand. Factor analyses supported the original 4-factor structure. There was also evidence from the bi-factor model for a dominant underlying fatigue factor. Six items tested positive for differential item functioning between African-American and Caucasian survivors. Four additional items either showed poor association, local dependence, or content validity concerns. After removing these 10 items, the reliability of the PFS-12 subscales ranged from 0.87 to 0.89, compared to 0.90-0.94 prior to item removal. The newly developed PFS-12 can be used to assess fatigue in African-American and Caucasian breast cancer survivors and reduces response burden without compromising reliability or validity. This is the first study to determine PFS literacy demand and to compare PFS-R responses in African

  6. Using Differential Item Functioning Procedures to Explore Sources of Item Difficulty and Group Performance Characteristics.

    Science.gov (United States)

    Scheuneman, Janice Dowd; Gerritz, Kalle

    1990-01-01

    Differential item functioning (DIF) methodology for revealing sources of item difficulty and performance characteristics of different groups was explored. A total of 150 Scholastic Aptitude Test items and 132 Graduate Record Examination general test items were analyzed. DIF was evaluated for males and females and Blacks and Whites. (SLD)

  7. Assessment of mixed-layer height estimation from single-wavelength ceilometer profiles

    Directory of Open Access Journals (Sweden)

    T. N. Knepp

    2017-10-01

    Differing boundary/mixed-layer height measurement methods were assessed in moderately polluted and clean environments, with a focus on the Vaisala CL51 ceilometer. This intercomparison was performed as part of ongoing measurements at the Chemistry And Physics of the Atmospheric Boundary Layer Experiment (CAPABLE) site in Hampton, Virginia and during the 2014 Deriving Information on Surface Conditions from Column and Vertically Resolved Observations Relevant to Air Quality (DISCOVER-AQ) field campaign that took place in and around Denver, Colorado. We analyzed CL51 data that were collected via two different methods (BLView software, which applied correction factors, and simple terminal emulation logging) to determine the impact of data collection methodology. Further, we evaluated the STRucture of the ATmosphere (STRAT) algorithm as an open-source alternative to BLView (note that the current work presents an evaluation of the BLView and STRAT algorithms and does not intend to act as a validation of either). Filtering criteria were defined according to the change in mixed-layer height (MLH) distributions for each instrument and algorithm and were applied throughout the analysis to remove high-frequency fluctuations from the MLH retrievals. Of primary interest was determining how the different data-collection methodologies and algorithms compare to each other and to radiosonde-derived boundary-layer heights when deployed as part of a larger instrument network. We determined that data-collection methodology is not as important as the processing algorithm and that much of the algorithm differences might be driven by impacts of local meteorology and precipitation events that pose algorithm difficulties. The results of this study show that a common processing algorithm is necessary for light detection and ranging (lidar)-based MLH intercomparisons and ceilometer-network operation, and that sonde-derived boundary layer heights are higher (10–15 % at

  8. Benefits Assessment for Single-Airport Tactical Runway Configuration Management Tool (TRCM)

    Science.gov (United States)

    Oseguera-Lohr, Rosa; Phojanamonogkolkij, Nipa; Lohr, Gary W.

    2015-01-01

    -level or generic in nature (not focusing on specific airports), and benefits were aggregated for the entire NAS, with relatively low fidelity simulation of SORM functions and aircraft trajectories. For SORM research, a more detailed benefits assessment of RCM and CADRS for specific airports or metroplexes is needed.

  9. A Comparison of Item Fit Statistics for Mixed IRT Models

    Science.gov (United States)

    Chon, Kyong Hee; Lee, Won-Chan; Dunbar, Stephen B.

    2010-01-01

    In this study we examined procedures for assessing model-data fit of item response theory (IRT) models for mixed format data. The model fit indices used in this study include PARSCALE's G², Orlando and Thissen's S-X² and S-G², and Stone's χ²* and G²*. To investigate the…

  10. The 10-item Remembered Relationship with Parents (RRP10) scale

    DEFF Research Database (Denmark)

    Denollet, Johan; Smolderen, Kim G E; van den Broek, Krista C

    2007-01-01

    Dysfunctional parenting styles are associated with poor mental and physical health. The 10-item Remembered Relationship with Parents (RRP(10)) scale retrospectively assesses Alienation (dysfunctional communication and intimacy) and Control (overprotection by parents), with an emphasis on deficiencies in empathic parenting. We examined the 2-factor structure of the RRP(10) and its relationship with adult depression.

  11. Bad Questions: An Essay Involving Item Response Theory

    Science.gov (United States)

    Thissen, David

    2016-01-01

    David Thissen, a professor in the Department of Psychology and Neuroscience, Quantitative Program at the University of North Carolina, has consulted and served on technical advisory committees for assessment programs that use item response theory (IRT) over the past couple decades. He has come to the conclusion that there are usually two purposes…

  12. Multiplanar lumbopelvic control in patients with low back pain: is multiplanar assessment better than single plane assessment in discriminating between patients and healthy controls?

    Science.gov (United States)

    Nelson-Wong, E; Gallant, P; Alexander, S; Dehmer, K; Ingvalson, S; McClenahan, B; Piatte, A; Poupore, K; Davis, A M

    2016-02-01

    Patients with low back pain (LBP) commonly have lumbopelvic control deficits. Lumbopelvic assessment during sagittal motion is incorporated into commonly used clinical examination algorithms for Treatment Based Classification. The purpose of this study was to investigate whether combined assessment of lumbopelvic control during sagittal and frontal plane motion discriminates between people with and without LBP better than single plane assessment alone. Nineteen patients with LBP and 18 healthy control participants volunteered for this study. The active straight leg raise (ASLR) and active hip abduction (AHAbd) tests were used to assess lumbopelvic control during sagittal and frontal plane motion, respectively. The tests were scored as positive or negative using published scoring criteria. Contingency tables were created for each test alone and for the combined tests (both positive/both negative) with presence/absence of LBP as the reference standard to calculate accuracy statistics of sensitivity (sn), specificity (sp), likelihood ratios (+LR and -LR), and diagnostic odds ratios (OR). The ASLR and AHAbd tests alone had sn of 0·63 and 0·74, respectively, sp of 0·61 and 0·50, respectively, and OR of 2·7 and 2·8, respectively. The combined tests had sn = 0·89, sp = 0·60, and OR = 12·0. Forty percent of patients with LBP had control deficits in both planes of motion. The AHAbd and ASLR tests appear to have greater diagnostic discrimination when used in combination than when used independently. A percentage of patients with LBP had control deficits in both planes, while others demonstrated uniplanar deficits only. These findings highlight the importance of multiplanar assessment in patients with LBP.
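
    The accuracy statistics named in this abstract all follow from the four cells of a 2x2 contingency table. The sketch below is illustrative only: the function and class names are hypothetical, and the cell counts are an assumption, back-calculated so that they are consistent with the rounded values reported for the single tests (19 patients, 18 controls). The abstract itself does not give the counts.

```python
from dataclasses import dataclass


@dataclass
class DiagnosticStats:
    sensitivity: float
    specificity: float
    lr_positive: float
    lr_negative: float
    odds_ratio: float


def diagnostic_stats(tp: int, fn: int, fp: int, tn: int) -> DiagnosticStats:
    """Accuracy statistics from a 2x2 table (reference standard: LBP present/absent)."""
    sn = tp / (tp + fn)  # proportion of patients who test positive
    sp = tn / (tn + fp)  # proportion of controls who test negative
    return DiagnosticStats(
        sensitivity=sn,
        specificity=sp,
        lr_positive=sn / (1 - sp),
        lr_negative=(1 - sn) / sp,
        odds_ratio=(tp * tn) / (fp * fn),
    )


# Assumed counts, chosen to match the reported rounded values; not from the source.
aslr = diagnostic_stats(tp=12, fn=7, fp=7, tn=11)
ahabd = diagnostic_stats(tp=14, fn=5, fp=9, tn=9)

print(f"ASLR:  sn={aslr.sensitivity:.2f} sp={aslr.specificity:.2f} OR={aslr.odds_ratio:.1f}")
print(f"AHAbd: sn={ahabd.sensitivity:.2f} sp={ahabd.specificity:.2f} OR={ahabd.odds_ratio:.1f}")
```

    The diagnostic odds ratio equals +LR divided by -LR, which is why a combined test with both higher sensitivity and comparable specificity can show a much larger OR than either test alone.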

  13. Single-compound and cumulative risk assessment of mycotoxins present in breakfast cereals consumed by children from Lisbon region, Portugal.

    Science.gov (United States)

    Assunção, Ricardo; Vasco, Elsa; Nunes, Baltazar; Loureiro, Susana; Martins, Carla; Alvito, Paula

    2015-12-01

    Humans can be exposed to multiple chemicals, but current risk assessment is usually carried out on one chemical at a time. Mycotoxins are commonly found in a variety of foods, including those intended for consumption by children, namely breakfast cereals. The present study aims to perform the risk assessment of single and multiple mycotoxins present in breakfast cereals consumed by children (1-3 years old) from the Lisbon region, Portugal. Daily exposure of children to ochratoxin A, fumonisins and trichothecenes showed no health risks to the child population considering individual mycotoxins, while exposure to aflatoxin B1 (AFB1) suggested a potential health concern for the high percentiles of intake (P90, P95 and P99). The combined exposure to fumonisins and trichothecenes is not expected to be of health concern. The combined margin of exposure (MoET) for the aflatoxins group could constitute a potential health concern, and AFB1 was the main contributor to MoET. Legal limits and control strategies regarding the presence of multiple mycotoxins in foodstuffs are urgently needed. To the best of our knowledge, this is the first time a cumulative risk assessment was performed on multiple mycotoxins present in breakfast cereals consumed by children. Copyright © 2015 Elsevier Ltd. All rights reserved.
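
    A margin of exposure (MoE) is the ratio of a toxicological reference point to the estimated exposure, and one common way to combine several compounds is the reciprocal of the summed reciprocals of the individual MoEs. The sketch below assumes that formulation; the abstract does not state which formula the authors used, and the function names and all numbers are hypothetical.

```python
def margin_of_exposure(reference_point: float, exposure: float) -> float:
    """MoE = reference point / exposure (both in the same units, e.g. ng/kg bw/day)."""
    if exposure <= 0:
        raise ValueError("exposure must be positive")
    return reference_point / exposure


def combined_margin_of_exposure(moes: list[float]) -> float:
    """Combined MoE (MoET), assumed here to be 1 / sum(1 / MoE_i)."""
    return 1.0 / sum(1.0 / m for m in moes)


# Hypothetical individual MoEs for three compounds in one group.
moes = {"compound_a": 20_000.0, "compound_b": 50_000.0, "compound_c": 100_000.0}
moet = combined_margin_of_exposure(list(moes.values()))

# The compound with the smallest MoE contributes most to the combined value.
main_contributor = min(moes, key=moes.get)
print(f"MoET = {moet:.0f}; main contributor: {main_contributor}")
```

    Under this formulation the combined value is always smaller than the smallest individual MoE, so a group can raise concern even when no single compound does.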

  14. Assessment of consistency of the whole tumor and single section perfusion imaging with 256-slice spiral CT: a preliminary study

    International Nuclear Information System (INIS)

    Sun Hongliang; Xu Yanyan; Hu Yingying; Tian Yuanjiang; Wang Wu

    2014-01-01

    Objective: To determine the consistency between quantitative CT perfusion measurements of colorectal cancer obtained from single section with maximal tumor dimension and from average of whole tumor, and compare intra- and inter-observer consistency of the two analysis methods. Methods: Twenty-two patients with histologically proven colorectal cancer were examined prospectively with 256-slice CT and the whole tumor perfusion images were obtained. Perfusion parameters were obtained from region of interest (ROI) inserted in single section showing maximal tumor dimension, then from ROI inserted in all tumor-containing sections by two radiologists. Consistency between values of blood flow (BF), blood volume (BV) and time to peak (TTP) calculated by two methods was assessed. Intra-observer consistency was evaluated by comparing repeated measurements done by the same radiologist using both methods after 3 months. Perfusion measurements were done by another radiologist independently to assess inter-observer consistency of both methods. The results from different methods were compared using paired t test and Bland-Altman plot. Results: Twenty-two patients were examined successfully. The perfusion parameters BF, BV and TTP obtained by whole tumor perfusion and single-section analysis were (35.59 ± 14.59) ml·min⁻¹·100 g⁻¹, (17.55 ± 4.21) ml·100 g⁻¹, (21.30 ± 7.57) s and (34.64 ± 13.29) ml·min⁻¹·100 g⁻¹, (17.61 ± 6.39) ml·100 g⁻¹, (19.82 ± 9.01) s, respectively. No significant differences were observed between the means of the perfusion parameters (BF, BV, TTP) calculated by the two methods (t=0.218, -0.033, -0.668, P>0.05, respectively). The intra-observer 95% limits of consistency of perfusion parameters were BF -5.3% to 10.0%, BV -13.8% to 10.8%, TTP -15.0% to 12.6% with whole tumor analysis, respectively; BF -14.3% to 16.5%, BV -24.2% to 22.2%, TTP -19.0% to 16.1% with single section analysis, respectively. The inter-observer 95% limits of
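
    The two comparisons named in the Methods (paired t test and Bland-Altman analysis) can be sketched with the standard library alone. This is a minimal illustration under stated assumptions: the function names are hypothetical, the measurements are made-up values rather than study data, and the percentage option assumes differences are expressed relative to the mean of each pair, which the abstract does not confirm.

```python
import math
import statistics


def paired_differences(a, b, percent=False):
    """Differences a_i - b_i, optionally as a percentage of each pair's mean."""
    if len(a) != len(b):
        raise ValueError("both methods must measure the same cases")
    diffs = []
    for x, y in zip(a, b):
        d = x - y
        if percent:
            d = 100.0 * d / ((x + y) / 2.0)
        diffs.append(d)
    return diffs


def paired_t_statistic(a, b):
    """t = mean(d) / (sd(d) / sqrt(n)) for the paired differences d."""
    d = paired_differences(a, b)
    return statistics.mean(d) / (statistics.stdev(d) / math.sqrt(len(d)))


def bland_altman_limits(a, b, percent=False):
    """Return (lower limit, bias, upper limit) with limits at bias +/- 1.96 SD."""
    d = paired_differences(a, b, percent)
    bias = statistics.mean(d)
    spread = 1.96 * statistics.stdev(d)
    return bias - spread, bias, bias + spread


# Hypothetical blood-flow values from the two analysis methods.
whole_tumor = [30.0, 40.0, 50.0, 35.0]
single_section = [28.0, 42.0, 49.0, 36.0]

t_stat = paired_t_statistic(whole_tumor, single_section)
lower, bias, upper = bland_altman_limits(whole_tumor, single_section)
print(f"t = {t_stat:.3f}; bias = {bias:.2f}; limits = {lower:.2f} to {upper:.2f}")
```

    A non-significant t test only says the mean difference is near zero; the width of the Bland-Altman limits is what shows how far individual measurements from the two methods can disagree.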

  15. Developing a short version of the Toronto Structured Interview for Alexithymia using item response theory.

    Science.gov (United States)

    Sekely, Angela; Taylor, Graeme J; Bagby, R Michael

    2018-03-17

    The Toronto Structured Interview for Alexithymia (TSIA) was developed to provide a structured interview method for assessing alexithymia. One drawback of this instrument is the amount of time it takes to administer and score. The current study used item response theory (IRT) methods to analyze data from a large heterogeneous multi-language sample (N = 842) to investigate whether a subset of items could be selected to create a short version of the instrument. Samejima's (1969) graded response model was used to fit the item responses. Items providing maximum information were retained in the short model, resulting in the elimination of 12 items from the original 24 items. Despite the 50% reduction in the number of items, 65.22% of the information was retained. Further studies are needed to validate the short version. A short version of the TSIA is potentially of practical value to clinicians and researchers with time constraints. Copyright © 2018. Published by Elsevier B.V.
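
    To make the selection step concrete, the sketch below computes item information under the graded response model and keeps the most informative items. It is an assumption-laden illustration, not the authors' procedure: the item parameters are invented, the function names are hypothetical, and summing information over an evenly spaced grid of trait values is only one possible selection criterion.

```python
import math


def boundary_probs(theta, a, thresholds):
    """Cumulative probabilities of responding in category k or higher.

    thresholds must be in increasing order; the list is padded with 1 and 0.
    """
    inner = [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
    return [1.0] + inner + [0.0]


def grm_item_information(theta, a, thresholds):
    """Item information: sum over categories of (dP_k/dtheta)^2 / P_k."""
    star = boundary_probs(theta, a, thresholds)
    info = 0.0
    for k in range(len(star) - 1):
        p_k = star[k] - star[k + 1]  # probability of category k
        slope = a * (star[k] * (1 - star[k]) - star[k + 1] * (1 - star[k + 1]))
        if p_k > 0:
            info += slope ** 2 / p_k
    return info


def select_short_form(items, thetas, keep):
    """Keep the `keep` items with the largest information summed over thetas."""
    totals = {
        name: sum(grm_item_information(t, a, bs) for t in thetas)
        for name, (a, bs) in items.items()
    }
    kept = sorted(totals, key=totals.get, reverse=True)[:keep]
    retained = sum(totals[n] for n in kept) / sum(totals.values())
    return kept, retained


# Hypothetical items: name -> (discrimination, ordered thresholds).
items = {
    "q1": (2.0, [-1.0, 0.0, 1.0]),
    "q2": (1.0, [-1.0, 0.0, 1.0]),
    "q3": (0.5, [-1.0, 0.0, 1.0]),
    "q4": (1.5, [-1.0, 0.0, 1.0]),
}
thetas = [x / 2 for x in range(-6, 7)]  # grid from -3 to 3
kept, retained = select_short_form(items, thetas, keep=2)
print(kept, f"{100 * retained:.1f}% of information retained")
```

    Because the retained items are the most informative ones, halving the item count retains more than half of the total information, which is the pattern the abstract reports.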

  16. Psychometric aspects of item mapping for criterion-referenced interpretation and bookmark standard setting.

    Science.gov (United States)

    Huynh, Huynh

    2010-01-01

    Locating an item on an achievement continuum (item mapping) is well-established in technical work for educational/psychological assessment. Applications of item mapping may be found in criterion-referenced (CR) testing (or scale anchoring, Beaton and Allen, 1992; Huynh, 1994, 1998a, 2000a, 2000b, 2006), computer-assisted testing, test form assembly, and in standard setting methods based on ordered test booklets. These methods include the bookmark standard setting originally used for the CTB/TerraNova tests (Lewis, Mitzel, Green, and Patz, 1999), the item descriptor process (Ferrara, Perie, and Johnson, 2002) and a similar process described by Wang (2003) for multiple-choice licensure and certification examinations. While item response theory (IRT) models such as the Rasch and two-parameter logistic (2PL) models traditionally place a binary item at its location, Huynh has argued in the cited papers that such mapping may not be appropriate in selecting items for CR interpretation and scale anchoring.
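
    The point about item location can be illustrated with the two-parameter logistic model. An item's location b is the ability at which the probability of a correct response is 0.5; mapping the item at a different response probability (RP) shifts it along the scale. The sketch below is a generic illustration with hypothetical function names and parameter values. The RP of 0.67 is shown only as an example value and is not attributed to the cited papers.

```python
import math


def response_probability(theta: float, a: float, b: float) -> float:
    """2PL probability of a correct response; the Rasch model is the case a = 1."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def mapped_location(a: float, b: float, rp: float = 0.5) -> float:
    """Ability at which the probability of success equals rp.

    Solving rp = 1 / (1 + exp(-a * (theta - b))) for theta gives
    theta = b + ln(rp / (1 - rp)) / a.
    """
    if not 0.0 < rp < 1.0:
        raise ValueError("rp must lie strictly between 0 and 1")
    return b + math.log(rp / (1.0 - rp)) / a


# Hypothetical item parameters.
a, b = 1.2, 0.4
at_location = mapped_location(a, b, rp=0.5)   # equals b
at_rp67 = mapped_location(a, b, rp=0.67)      # shifted upward

print(f"RP 0.50 -> {at_location:.3f}; RP 0.67 -> {at_rp67:.3f}")
```

    The shift depends on the discrimination a, so items with the same location can be ordered differently once a response probability other than 0.5 is used.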

  17. Comparative assessment of single-stage and two-stage anaerobic digestion for the treatment of thin stillage.

    Science.gov (United States)

    Nasr, Noha; Elbeshbishy, Elsayed; Hafez, Hisham; Nakhla, George; El Naggar, M Hesham

    2012-05-01

    A comparative evaluation of single-stage and two-stage anaerobic digestion processes for biomethane and biohydrogen production using thin stillage was performed to assess the impact of separating the acidogenic and methanogenic stages on anaerobic digestion. Thin stillage, the main by-product from ethanol production, was characterized by high total chemical oxygen demand (TCOD) of 122 g/L and total volatile fatty acids (TVFAs) of 12 g/L. A maximum methane yield of 0.33 L CH4/g COD added (STP) was achieved in the two-stage process, while the single-stage process achieved a maximum yield of only 0.26 L CH4/g COD added (STP). Separating the acidification stage increased the TVFAs-to-TCOD ratio from 10% in the raw thin stillage to 54% owing to the conversion of carbohydrates into hydrogen and VFAs. Comparison of the two processes based on energy outcome revealed that an increase of 18.5% in the total energy yield was achieved using two-stage anaerobic digestion. Copyright © 2012 Elsevier Ltd. All rights reserved.

  18. Automated single-trial assessment of laser-evoked potentials as an objective functional diagnostic tool for the nociceptive system.

    Science.gov (United States)

    Hatem, S M; Hu, L; Ragé, M; Gierasimowicz, A; Plaghki, L; Bouhassira, D; Attal, N; Iannetti, G D; Mouraux, A

    2012-12-01

    To assess the clinical usefulness of an automated analysis of event-related potentials (ERPs). Nociceptive laser-evoked potentials (LEPs) and non-nociceptive somatosensory electrically-evoked potentials (SEPs) were recorded in 37 patients with syringomyelia and 21 controls. LEP and SEP peak amplitudes and latencies were estimated using a single-trial automated approach based on time-frequency wavelet filtering and multiple linear regression, as well as a conventional approach based on visual inspection. The amplitudes and latencies of normal and abnormal LEP and SEP peaks were identified reliably using both approaches, with similar sensitivity and specificity. Because the automated approach provided an unbiased solution to account for average waveforms where no ERP could be identified visually, it revealed significant differences between patients and controls that were not revealed using the visual approach. The automated analysis of ERPs reliably and objectively characterized LEP and SEP waveforms in patients. The automated single-trial analysis can be used to characterize normal and abnormal ERPs with sensitivity and specificity similar to those of visual inspection. While this does not justify its use in a routine clinical setting, the technique could be useful to avoid observer-dependent biases in clinical research. Copyright © 2012 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.

  19. Applying automatic item generation to create cohesive physics testlets

    Science.gov (United States)

    Mindyarto, B. N.; Nugroho, S. E.; Linuwih, S.

    2018-03-01

    Computer-based testing has created the demand for large numbers of items. This paper discusses the production of cohesive physics testlets using automatic item generation concepts and procedures. The testlets were composed by restructuring physics problems to reveal deeper understanding of the underlying physical concepts by inserting a qualitative question and its scientific reasoning question. A template-based testlet generator was used to generate the testlet variants. Using this methodology, 1248 testlet variants were effectively generated from 25 testlet templates. Some issues related to the effective application of the generated physics testlets in practical assessments were discussed.
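
    Template-based generation of this kind can be sketched as filling the slots of a testlet template with every combination of allowed values. The example below is a minimal illustration, not the generator used in the paper: the template text, slot names, and values are all hypothetical.

```python
from itertools import product
from string import Template


def generate_variants(template: str, slots: dict[str, list[str]]) -> list[str]:
    """Fill the template once for every combination of slot values."""
    names = list(slots)
    variants = []
    for combo in product(*(slots[name] for name in names)):
        variants.append(Template(template).substitute(dict(zip(names, combo))))
    return variants


# Hypothetical testlet: a qualitative question, a reasoning question,
# and the original quantitative problem share one stem.
testlet_template = (
    "A $mass kg cart on a frictionless track is pushed with a net force of $force N. "
    "(a) Does its speed increase, decrease, or stay the same? "
    "(b) Which physical principle supports your answer to (a)? "
    "(c) Calculate the acceleration of the cart."
)
slots = {"mass": ["2", "4", "5"], "force": ["10", "20"]}

variants = generate_variants(testlet_template, slots)
print(len(variants))
print(variants[0])
```

    The number of variants per template is the product of the slot sizes, so a modest set of templates can yield a large item pool, as in the 1248 variants from 25 templates reported above.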

  20. International Semiotics: Item Difficulty and the Complexity of Science Item Illustrations in the PISA-2009 International Test Comparison

    Science.gov (United States)

    Solano-Flores, Guillermo; Wang, Chao; Shade, Chelsey

    2016-01-01

    We examined multimodality (the representation of information in multiple semiotic modes) in the context of international test comparisons. Using Program of International Student Assessment (PISA)-2009 data, we examined the correlation of the difficulty of science items and the complexity of their illustrations. We observed statistically…