WorldWideScience

Sample records for lesions interrater agreement

  1. Impact of educational intervention on the inter-rater agreement of nasal endoscopy interpretation

    Science.gov (United States)

    Colley, Patrick; Mace, Jess C.; Schaberg, Madeleine R.; Smith, Timothy L.; Tabaee, Abtin

    2015-01-01

    OBJECTIVE Nasal endoscopy is integral to the evaluation of sinonasal disorders. However, prior studies have shown significant variability in the inter-rater agreement of nasal endoscopy interpretation amongst practicing rhinologists. The objective of the current study is to evaluate the inter-rater agreement of nasal endoscopy amongst otolaryngology residents from a single training program at baseline and following an educational intervention. METHODS Eleven otolaryngology residents completed nasal endoscopy grading forms for 8 digitally recorded nasal endoscopic examinations. An instructional lecture reviewing nasal endoscopy interpretation was subsequently provided. The residents then completed grading forms for 8 different nasal endoscopic examinations. Inter-rater agreement amongst residents for the pre- and post-lecture videos was calculated using the unweighted Fleiss’ kappa statistic (Kf) and intra-class correlation agreement (ICC). RESULTS Inter-rater agreement improved from a baseline level of fair (Kf range 0.268–0.383) to a post-educational level of moderate (Kf range 0.401–0.547) for nasal endoscopy findings of middle meatus mucosa, middle turbinate mucosa, middle meatus discharge, sphenoethmoid recess mucosa, sphenoethmoid recess discharge and atypical lesions (ICC, p... endoscopy interpretation amongst otolaryngology residents. The inter-rater agreement for the majority of the characteristics that were evaluated improved after educational intervention. Further study is needed to improve nasal endoscopy interpretation. PMID:25781864
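
The unweighted Fleiss' kappa named in this abstract can be sketched as follows. This is a minimal illustration, not the authors' analysis code: the function names, the input layout (one list of labels per examination) and the Landis and Koch cut-offs used for the verbal labels are assumptions, although the abstract's "fair" and "moderate" ranges are consistent with those cut-offs.

```python
from collections import Counter


def fleiss_kappa(ratings):
    """Unweighted Fleiss' kappa.

    ratings: list of subjects, each a list with one category label per rater.
    Assumes every subject was rated by the same number of raters (>= 2).
    """
    n_subjects = len(ratings)
    n_raters = len(ratings[0])
    if any(len(row) != n_raters for row in ratings):
        raise ValueError("every subject needs the same number of ratings")

    categories = sorted({label for row in ratings for label in row})
    counts = [Counter(row) for row in ratings]

    # Observed agreement: share of agreeing rater pairs, averaged over subjects.
    p_bar = sum(
        (sum(c[k] ** 2 for k in categories) - n_raters)
        / (n_raters * (n_raters - 1))
        for c in counts
    ) / n_subjects

    # Chance agreement from the overall category proportions.
    p_e = sum(
        (sum(c[k] for c in counts) / (n_subjects * n_raters)) ** 2
        for k in categories
    )
    if p_e == 1.0:
        return float("nan")  # only one category was ever used; kappa undefined
    return (p_bar - p_e) / (1.0 - p_e)


def landis_koch(kappa):
    """Verbal label under the commonly used Landis and Koch bands (assumed)."""
    if kappa < 0:
        return "poor"
    for upper, label in [(0.20, "slight"), (0.40, "fair"),
                         (0.60, "moderate"), (0.80, "substantial")]:
        if kappa <= upper:
            return label
    return "almost perfect"
```

With 11 residents and 8 videos, `ratings` would hold 8 inner lists of 11 labels each for a given endoscopic finding.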

  2. International inter-rater agreement in scoring acne severity utilizing cloud-based image sharing of mobile phone photographs.

    Science.gov (United States)

    Foolad, Negar; Ornelas, Jennifer N; Clark, Ashley K; Ali, Ifrah; Sharon, Victoria R; Al Mubarak, Luluah; Lopez, Andrés; Alikhan, Ali; Al Dabagh, Bishr; Firooz, Alireza; Awasthi, Smita; Liu, Yu; Li, Chin-Shang; Sivamani, Raja K

    2017-09-01

    Cloud-based image sharing technology facilitates the sharing of images. Cloud-based image sharing technology has not been well-studied for acne assessments or treatment preferences among international evaluators. We evaluated inter-rater variability of acne grading and treatment recommendations among an international group of dermatologists who assessed photographs. This is a prospective, single visit photographic study to assess inter-rater agreement of acne photographs shared through an integrated mobile device, cloud-based, and HIPAA-compliant platform. Inter-rater agreements for global acne assessment and acne lesion counts were evaluated by Kendall's coefficient of concordance, while correlations between treatment recommendations and acne severity were calculated by Spearman's rank correlation coefficient. There was good agreement for the evaluation of inflammatory lesions (KCC = 0.62, P ... cloud-based image sharing for acne assessment. Cloud-based sharing may facilitate acne care and research among international collaborators. © 2017 The International Society of Dermatology.
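
Kendall's coefficient of concordance (KCC, often written W) can be computed from per-rater rankings of the same items. The sketch below is illustrative only: the function names and the `scores[rater][item]` layout are assumptions, and it uses the standard tie correction because lesion counts are likely to contain ties.

```python
def average_ranks(values):
    """Return (ranks, tie_term): 1-based ranks with ties averaged, and the
    sum of t**3 - t over groups of t tied values."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    tie_term = 0
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        t = j - i + 1
        tie_term += t ** 3 - t
        i = j + 1
    return ranks, tie_term


def kendalls_w(scores):
    """Kendall's W for scores[rater][item], with correction for ties."""
    m = len(scores)      # number of raters
    n = len(scores[0])   # number of items (e.g. photographs)
    ranked = [average_ranks(row) for row in scores]
    totals = [sum(r[0][i] for r in ranked) for i in range(n)]
    mean_total = m * (n + 1) / 2
    s = sum((t - mean_total) ** 2 for t in totals)
    ties = sum(r[1] for r in ranked)
    denom = m ** 2 * (n ** 3 - n) - m * ties
    if denom == 0:
        return float("nan")  # a single item, or every rater tied all items
    return 12 * s / denom
```

W ranges from 0 (no concordance among raters) to 1 (identical rankings).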

  3. Unfolding the phenomenon of inter-rater agreement

    DEFF Research Database (Denmark)

    Slaug, Bjørn; Schilling, Oliver; Helle, Tina

    2011-01-01

    Objective: The overall objective was to unfold the phenomenon of inter-rater agreement: to identify potential sources of variation in agreement data and to explore how they can be statistically accounted for. The ultimate aim was to propose recommendations for in-depth examination of agreement, i...

  4. EasyDIAg: A tool for easy determination of interrater agreement.

    Science.gov (United States)

    Holle, Henning; Rein, Robert

    2015-09-01

    Reliable measurements are fundamental for the empirical sciences. In observational research, measurements often consist of observers categorizing behavior into nominal-scaled units. Since the categorization is the outcome of a complex judgment process, it is important to evaluate the extent to which these judgments are reproducible, by having multiple observers independently rate the same behavior. A challenge in determining interrater agreement for timed-event sequential data is to develop clear objective criteria to determine whether two raters' judgments relate to the same event (the linking problem). Furthermore, many studies presently report only raw agreement indices, without considering the degree to which agreement can occur by chance alone. Here, we present a novel, free, and open-source toolbox (EasyDIAg) designed to assist researchers with the linking problem, while also providing chance-corrected estimates of interrater agreement. Additional tools are included to facilitate the development of coding schemes and rater training.

  5. Inter-Rater Agreement of Auscultation, Palpable Fremitus, and Ventilator Waveform Sawtooth Patterns Between Clinicians.

    Science.gov (United States)

    Berry, Marc P; Martí, Joan-Daniel; Ntoumenopoulos, George

    2016-10-01

    Clinicians often use numerous bedside assessments for secretion retention in participants who are receiving invasive mechanical ventilation. This study aimed to evaluate inter-rater agreement between clinicians when using standard clinical assessments of secretion retention and whether differences in clinician experience influenced inter-rater agreement. Seventy-one mechanically ventilated participants were assessed by a research clinician and by one of 13 ICU clinicians. Each clinician conducted a standardized assessment of lung auscultation, palpation for chest-wall (rhonchal) fremitus, and ventilator inspiratory/expiratory flow-time waveforms for the sawtooth pattern. On the presence of breath sounds, agreement ranged from absolute to moderate in the upper zones and the lower zones, respectively. Kappa values for abnormal and adventitious lung sounds achieved moderate agreement in the upper zones, less than chance agreement to substantial agreement in the middle zones, and moderate agreement to almost perfect agreement in the lower zones. Moderate to almost perfect agreement was established for palpable fremitus in the upper zones, moderate to substantial agreement in the middle zones, and less than chance to moderate agreement in the lower zones. Inter-rater agreement on the presence of expiratory sawtooth pattern identification showed moderate agreement. The level of percentage agreement between the research and ICU clinicians for each respiratory assessment studied did not relate directly to level of clinical experience. Inter-rater agreement for all assessments showed variability between lung regions but maintained reasonable percentage agreement in mechanically ventilated participants. The level of percentage agreement achieved between clinicians did not directly relate to clinical experience for all respiratory assessments. Therefore, these respiratory assessments should not necessarily be viewed in isolation but interpreted within the context of a full

  6. Interrater and intrarater agreements of magnetic resonance imaging findings in the lumbar spine: significant variability across degenerative conditions.

    Science.gov (United States)

    Fu, Michael C; Buerba, Rafael A; Long, William D; Blizzard, Daniel J; Lischuk, Andrew W; Haims, Andrew H; Grauer, Jonathan N

    2014-10-01

    Magnetic resonance imaging (MRI) is frequently used in the evaluation of degenerative conditions in the lumbar spine. The relative interrater and intrarater agreements of MRI findings across different pathologic conditions are underexplored, as most studies are focused on specific findings. The purpose of this study was to characterize the interrater and intrarater agreements of MRI findings used to assess the degenerative lumbar spine. A retrospective diagnostic study at a large academic medical center was undertaken with a panel of orthopedic surgeons and musculoskeletal radiologists to assess lumbar MRIs using standardized criteria. Seventy-five subjects who underwent routine lumbar spine MRI at our institution were included. Each MRI study was assessed for 10 lumbar degenerative findings using standardized criteria. Lumbar vertebral levels were assessed independently, where applicable, for a total of 52 data points collected per study. T2-weighted axial and sagittal MRI sequences were presented in random order to the four reviewers (two orthopedic spine surgeons and two musculoskeletal radiologists) independently to determine interrater agreement. The first 10 studies were reevaluated at the end to determine intrarater agreement. Images were assessed using standardized and pilot-tested criteria to assess disc degeneration, stenosis, and other degenerative changes. Interrater and intrarater absolute percent agreements were calculated. To highlight the most clinically important MRI disagreements, a modified agreement analysis was also performed (in which disagreements between the lowest two severity grades for applicable conditions were ignored). Fleiss kappa coefficients for interrater agreement were determined. The overall absolute and modified interrater agreements were 76.9% and 93.5%, respectively. The absolute and modified intrarater agreements were 81.3% and 92.7%, respectively. Average Fleiss kappa coefficient was 0.431, suggesting moderate overall

  7. Inter-rater agreement on PIVC-associated phlebitis signs, symptoms and scales.

    Science.gov (United States)

    Marsh, Nicole; Mihala, Gabor; Ray-Barruel, Gillian; Webster, Joan; Wallis, Marianne C; Rickard, Claire M

    2015-10-01

    Many peripheral intravenous catheter (PIVC) infusion phlebitis scales and definitions are used internationally, although no existing scale has demonstrated comprehensive reliability and validity. We examined inter-rater agreement between registered nurses on signs, symptoms and scales commonly used in phlebitis assessment. Seven PIVC-associated phlebitis signs/symptoms (pain, tenderness, swelling, erythema, palpable venous cord, purulent discharge and warmth) were observed daily by two raters (a research nurse and registered nurse). These data were modelled into phlebitis scores using 10 different tools. Proportions of agreement (e.g. positive, negative), observed and expected agreements, Cohen's kappa, the maximum achievable kappa, prevalence- and bias-adjusted kappa were calculated. Two hundred ten patients were recruited across three hospitals, with 247 sets of paired observations undertaken. The second rater was blinded to the first's findings. The Catney and Rittenberg scales were the most sensitive (phlebitis in >20% of observations), whereas the Curran, Lanbeck and Rickard scales were the most restrictive (≤2% phlebitis). Only tenderness and the Catney (one of pain, tenderness, erythema or palpable cord) and Rittenberg scales (one of erythema, swelling, tenderness or pain) had acceptable (more than two-thirds, 66.7%) levels of inter-rater agreement. Inter-rater agreement for phlebitis assessment signs/symptoms and scales is low. This likely contributes to the high degree of variability in phlebitis rates in literature. We recommend further research into assessment of infrequent signs/symptoms and the Catney or Rittenberg scales. New approaches to evaluating vein irritation that are valid, reliable and based on their ability to predict complications need exploration. © 2015 John Wiley & Sons, Ltd.
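
The agreement statistics listed in this abstract for two raters and a present/absent sign can all be derived from one 2x2 table. The following is a hedged sketch: the function name, the cell naming (a, b, c, d) and the example counts are hypothetical and are not taken from the study.

```python
def two_rater_agreement(a, b, c, d):
    """Agreement statistics for a 2x2 table of two raters.

    a: both raters say "present"        b: rater 1 present, rater 2 absent
    c: rater 1 absent, rater 2 present  d: both raters say "absent"
    Assumes both categories occur, so the denominators are non-zero.
    """
    n = a + b + c + d
    observed = (a + d) / n
    # Chance agreement from each rater's marginal proportions.
    expected = ((a + b) * (a + c) + (c + d) * (b + d)) / n ** 2
    kappa = (observed - expected) / (1 - expected)
    # Largest observed agreement the marginal totals would allow.
    observed_max = (min(a + b, a + c) + min(c + d, b + d)) / n
    return {
        "observed": observed,
        "expected": expected,
        "positive_agreement": 2 * a / (2 * a + b + c),
        "negative_agreement": 2 * d / (2 * d + b + c),
        "kappa": kappa,
        "kappa_max": (observed_max - expected) / (1 - expected),
        "pabak": 2 * observed - 1,  # prevalence- and bias-adjusted kappa
    }


# Hypothetical counts for a rare sign: 90% raw agreement, yet a much lower kappa.
stats = two_rater_agreement(a=5, b=5, c=5, d=85)
```

When a sign is rare, expected agreement is high, so kappa can be low even with high raw agreement; reporting PABAK and the positive and negative agreement alongside kappa makes that visible.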

  8. Chest Radiographs for Pediatric TB Diagnosis: Interrater Agreement and Utility

    Directory of Open Access Journals (Sweden)

    G. Kaguthi

    2014-01-01

    The chest radiograph (CXR) is considered a key diagnostic tool for pediatric tuberculosis (TB) in clinical management and endpoint determination in TB vaccine trials. We set out to compare interrater agreement for TB diagnosis in western Kenya. A pediatric pulmonologist and radiologist (experts), a medical officer (M.O), and four clinical officers (C.Os) with basic training in pediatric CXR reading blindly assessed CXRs of infants who were TB suspects in a cohort study. C.Os had access to clinical findings for patient management. Weighted kappa scores summarized interrater agreement on lymphadenopathy and abnormalities consistent with TB. Sensitivity and specificity of raters were determined using microbiologically confirmed TB as the gold standard (n=8). A total of 691 radiographs were reviewed. Agreement on abnormalities consistent with TB was poor, k=0.14 (95% CI: 0.10–0.18), and on lymphadenopathy moderate, k=0.26 (95% CI: 0.18–0.36). M.O [75% (95% CI: 34.9%–96.8%)] and C.Os [63% (95% CI: 24.5%–91.5%)] had high sensitivity for culture confirmed TB. TB vaccine trials utilizing expert agreement on CXR as a nonmicrobiologically confirmed endpoint will have reduced specificity and will underestimate vaccine efficacy. C.Os detected many of the bacteriologically confirmed cases; however, this must be interpreted cautiously as they were unblinded to clinical features.

  9. Ocular Motor Score (OMS): a clinical tool to evaluating ocular motor functions in children. Intrarater and inter-rater agreement.

    Science.gov (United States)

    Olsson, Monica; Teär Fahnehjelm, Kristina; Rydberg, Agneta; Ygge, Jan

    2015-08-01

    Ocular motor score (OMS) is a new clinical test protocol for evaluating ocular motor functions in children and young adults. OMS is a set of 15 important and relevant non-invasive ocular motor function parameters derived from clinical practice. The aim of the study was to evaluate OMS according to intrarater and inter-rater agreement. Forty children aged 4-10 years, 23 girls median age 6.5 (range 4.3-9.3) and 17 boys median age 5.8 (range 4.1-9.8) were included. The ocular motor functions were assessed and scored according to the OMS protocol. The examinations were videotaped. To obtain the intrarater agreement, the first author examined and scored the children twice, first in the clinic and 2 weeks later by watching the videotape. To obtain the inter-rater agreement, three other raters independently scored the ocular motor function of the children by watching the videotapes. The overall observed intrarater agreement was 88%, and the observed inter-rater agreement between the three raters was 80%. For none of the subtests was there an observed intrarater agreement lower than 65%. Three of the subtests had an observed inter-rater agreement of 65% or below. Overall there was high observed intra- and inter-rater agreement for the OMS test protocol. Subtests such as saccades and smooth pursuit were more difficult for raters to score similarly according to the clinical OMS test protocol. © 2015 Acta Ophthalmologica Scandinavica Foundation. Published by John Wiley & Sons Ltd.

  10. Inter-rater agreement among orthodontists in a blocked experiment.

    Science.gov (United States)

    Korn, E L; Baumrind, S

    1985-01-01

    Five orthodontists were asked to predict for 64 patients a particular dichotomous outcome of treatment based on pre-treatment X-ray films. The orthodontists rated the cases in blocks of size 4-6 with the knowledge of the number of positive outcomes in each block. We discuss the reasons why this blocked design is appropriate whenever clinicians are asked to rate cases which have not been randomly selected from a clinical practice similar to their own. We give a simple description of the inter-rater agreement for this type of blocked experiment as well as a procedure to test that the agreement is no better than that expected by random independent assignment.

  11. Interrater agreement for the schedule for affective disorders and schizophrenia epidemiological version for school-age children (K-SADS-E)

    Directory of Open Access Journals (Sweden)

    Polanczyk Guilherme V

    2003-01-01

    OBJECTIVE: The main objective of this study was to assess the interrater agreement for the Schedule for Affective Disorders and Schizophrenia Epidemiological version for School-Age Children (K-SADS-E). METHODS: Four interviewers being trained with the K-SADS-E scored independently 29 videotaped interviews performed with psychiatric outpatients in the ADHD Outpatient Clinic at Hospital de Clínicas de Porto Alegre. Interrater agreement analysis was performed using the kappa coefficient (k). RESULTS: Kappa coefficients were .93 (p<.001) for affective disorders, .9 (p<.001) for anxiety disorders, .94 (p<.001) for attention-deficit/hyperactivity disorders and disruptive behavior disorders. CONCLUSION: These findings suggest an excellent interrater agreement for the diagnosis of several mental disorders in childhood and adolescence by the Brazilian Portuguese version of the K-SADS-E.

  12. Inter-rater agreement in visual assessment of footpad dermatitis in Danish broiler chickens

    DEFF Research Database (Denmark)

    Oliveira, A.R.S.; Lund, Vibe Pedersen; Christensen, Jens Peter

    2017-01-01

    1. The performance of the scoring in the Danish footpad dermatitis (FPD) surveillance system was evaluated by determining inter-rater agreement in visual inspection of FPD in broilers between two independent raters (R1 and R2) and the official scoring at a Danish slaughterhouse. 2. FPD scores were...

  13. Intra- and interrater reliability and agreement of the Danish version of the Dynamic Gait Index in older people with balance impairments

    DEFF Research Database (Denmark)

    Jønsson, Line R; Kristensen, Morten; Tibaek, Sigrid

    2011-01-01

    To examine the intrarater and interrater reliability and agreement of the Danish version of the Dynamic Gait Index (DGI) in hospitalized and community-dwelling older people with balance impairments....

  14. The Smile Esthetic Index (SEI): A method to measure the esthetics of the smile. An intra-rater and inter-rater agreement study.

    Science.gov (United States)

    Rotundo, Roberto; Nieri, Michele; Bonaccini, Daniele; Mori, Massimiliano; Lamberti, Elena; Massironi, Domenico; Giachetti, Luca; Franchi, Lorenzo; Venezia, Piero; Cavalcanti, Raffaele; Bondi, Elena; Farneti, Mauro; Pinchi, Vilma; Buti, Jacopo

    2015-01-01

    To propose a method to measure the esthetics of the smile and to report its validation by means of an intra-rater and inter-rater agreement analysis. Ten variables were chosen as determinants for the esthetics of a smile: smile line and facial midline, tooth alignment, tooth deformity, tooth dischromy, gingival dischromy, gingival recession, gingival excess, gingival scars and diastema/missing papillae. One examiner consecutively selected seventy smile pictures, which were in the frontal view. Ten examiners, with different levels of clinical experience and specialties, applied the proposed assessment method twice on the selected pictures, independently and blindly. Intraclass correlation coefficient (ICC) and Fleiss' kappa statistics were performed to analyse the intra-rater and inter-rater agreement. Considering the cumulative assessment of the Smile Esthetic Index (SEI), the ICC value for the inter-rater agreement of the 10 examiners was 0.62 (95% CI: 0.51 to 0.72), representing a substantial agreement. Intra-rater agreement ranged from 0.86 to 0.99. Inter-rater agreement (Fleiss' kappa statistics) calculated for each variable ranged from 0.17 to 0.75. The SEI was a reproducible method to assess the esthetic component of the smile, useful for the diagnostic phase and for setting appropriate treatment plans.

  15. High inter-rater reliability, agreement, and convergent validity of Constant score in patients with clavicle fractures

    DEFF Research Database (Denmark)

    Ban, Ilija; Troelsen, Anders; Kristensen, Morten Tange

    2016-01-01

    BACKGROUND: The Constant score (CS) has been the primary endpoint in most studies on clavicle fractures. However, the CS was not developed to assess patients with clavicle fractures. Our aim was to examine inter-rater reliability and agreement of the CS in patients with clavicle fractures...... standardized CS assessment at a mean of 6.8 weeks (SD, 1.0 weeks) after injury. Reliability and agreement of the CS were determined by 2 raters. The intraclass correlation coefficient (ICC2,1), standard error of measurement, minimal detectable change, Cronbach α coefficient, and Pearson correlation coefficient...... were estimated. RESULTS: Inter-rater reliability of the total CS was excellent (intraclass correlation coefficient, 0.94; 95% confidence interval, 0.88-0.97), with no systematic difference between the 2 raters (P = .75). The standard error of measurement (measurement error at the group level) was 4...

  16. Systematic reviews need to consider applicability to disadvantaged populations: inter-rater agreement for a health equity plausibility algorithm.

    Science.gov (United States)

    Welch, Vivian; Brand, Kevin; Kristjansson, Elizabeth; Smylie, Janet; Wells, George; Tugwell, Peter

    2012-12-19

    Systematic reviews have been challenged to consider effects on disadvantaged groups. A priori specification of subgroup analyses is recommended to increase the credibility of these analyses. This study aimed to develop and assess inter-rater agreement for an algorithm for systematic review authors to predict whether differences in effect measures are likely for disadvantaged populations relative to advantaged populations (only relative effect measures were addressed). A health equity plausibility algorithm was developed using clinimetric methods with three items based on literature review, key informant interviews and methodology studies. The three items dealt with the plausibility of differences in relative effects across sex or socioeconomic status (SES) due to: 1) patient characteristics; 2) intervention delivery (i.e., implementation); and 3) comparators. Thirty-five respondents (consisting of clinicians, methodologists and research users) assessed the likelihood of differences across sex and SES for ten systematic reviews with these questions. We assessed inter-rater reliability using Fleiss multi-rater kappa. The proportion agreement was 66% for patient characteristics (95% confidence interval: 61%-71%), 67% for intervention delivery (95% confidence interval: 62% to 72%) and 55% for the comparator (95% confidence interval: 50% to 60%). Inter-rater kappa, assessed with Fleiss kappa, ranged from 0 to 0.199, representing very low agreement beyond chance. Users of systematic reviews rated that important differences in relative effects across sex and socioeconomic status were plausible for a range of individual and population-level interventions. However, there was very low inter-rater agreement for these assessments. There is an unmet need for discussion of plausibility of differential effects in systematic reviews. 
    Increased consideration of external validity and applicability to different populations and settings is warranted in systematic reviews to meet this

  17. Intra- and Inter-rater Agreement of Superior Vena Cava Flow and Right Ventricular Outflow Measurements in Late Preterm and Term Neonates.

    Science.gov (United States)

    Mahoney, Liam; Fernandez-Alvarez, Jose R; Rojas-Anaya, Hector; Aiton, Neil; Wertheim, David; Seddon, Paul; Rabe, Heike

    2018-02-24

    To explore the intra- and inter-rater agreement of superior vena cava (SVC) flow and right ventricular (RV) outflow in healthy and unwell late preterm neonates (33-37 weeks' gestational age), term neonates (≥37 weeks' gestational age), and neonates receiving total-body cooling. The intra- and inter-rater agreement (n = 25 and 41 neonates, respectively) rates for SVC flow and RV outflow were determined by echocardiography in healthy and unwell late preterm and term neonates with the use of Bland-Altman plots, the repeatability coefficient, the repeatability index, and intraclass correlation coefficients. The intra-rater repeatability index values were 41% for SVC flow and 31% for RV outflow, with intraclass correlation coefficients indicating good agreement for both measures. The inter-rater repeatability index values for SVC flow and RV outflow were 63% and 51%, respectively, with intraclass correlation coefficients indicating moderate agreement for both measures. If SVC flow or RV outflow is used in the hemodynamic treatment of neonates, sequential measurements should ideally be performed by the same clinician to reduce potential variability. © 2018 by the American Institute of Ultrasound in Medicine.
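
The Bland-Altman quantities mentioned in this abstract can be sketched for paired measurements as below. The definitions used here are assumptions, since the abstract does not give formulas: the repeatability coefficient is taken as 1.96 times the standard deviation of the paired differences, and the repeatability index as that coefficient expressed as a percentage of the overall mean measurement. The function name and return keys are illustrative.

```python
from statistics import mean, stdev


def bland_altman(first, second):
    """Bias, 95% limits of agreement and repeatability for paired readings.

    first, second: equal-length sequences of measurements of the same
    subjects (two raters, or two sessions of one rater).
    """
    diffs = [a - b for a, b in zip(first, second)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation of the differences
    repeatability_coefficient = 1.96 * sd           # assumed definition
    overall_mean = mean(list(first) + list(second))
    return {
        "bias": bias,
        "sd_diff": sd,
        "limits_of_agreement": (bias - 1.96 * sd, bias + 1.96 * sd),
        "repeatability_coefficient": repeatability_coefficient,
        # assumed definition: coefficient as a percentage of the mean value
        "repeatability_index_pct": 100 * repeatability_coefficient / overall_mean,
    }
```

Under these assumptions, a larger repeatability index means that repeated measurements differ by a larger fraction of the typical flow value.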

  18. Inter-rater agreement of the PEWS tools used in Central Denmark Region

    DEFF Research Database (Denmark)

    Jensen, Claus Sixtus; Aagaard, Hanne; Olesen, Hanne Vebert

    2017-01-01

    BACKGROUND: Paediatric early warning score (PEWS) assessment tools can assist healthcare providers in the timely detection and recognition of subtle patient condition changes signalling clinical deterioration. However, PEWS tools instrument data are only as reliable and accurate as the caregivers...... agreement. The nurses assigned the exact same aggregated score for both PEWS models in 76% of the cases. In 98% of the PEWS assessments, the aggregated PEWS scores assigned by the nurses were equal to or below 1 point in both models. CONCLUSION: The study showed good to very good inter-rater reliability...

  19. Inter-rater agreement of comorbid DSM-IV personality disorders in substance abusers

    Directory of Open Access Journals (Sweden)

    Thylstrup Birgitte

    2008-05-01

    Background: Little is known about the inter-rater agreement of personality disorders in clinical settings. Methods: Clinicians rated 75 patients with substance use disorders on the DSM-IV criteria of personality disorders in random order, and on rating scales representing the severity of each. Results: Convergent validity agreement was moderate (range for r = 0.55, 0.67) for cluster B disorders rated with DSM-IV criteria, and discriminant validity was moderate for eight of the ten personality disorders. Convergent validity of the rating scales was only moderate for antisocial and narcissistic personality disorder. Discussion: Dimensional ratings may be used in research studies and clinical practice with some caution, and may be collected as one of several sources of information to describe the personality of a patient.

  20. Interrater agreement of an observational tool to code knockouts and technical knockouts in mixed martial arts.

    Science.gov (United States)

    Lawrence, David W; Hutchison, Michael G; Cusimano, Michael D; Singh, Tanveer; Li, Luke

    2014-09-01

    Interrater agreement evaluation of a tool to document and code the situational factors and mechanisms of knockouts (KOs) and technical knockouts (TKOs) in mixed martial arts (MMA). Retrospective case series. Professional MMA matches from the Ultimate Fighting Championship-2006-2012. Two nonmedically trained independent raters. The MMA Knockout Tool (MMA-KT) consists of 20 factors and captures and codes information on match characteristics, situational context preceding KOs and TKOs, as well as describing competitor states during these outcomes. The MMA-KT also evaluates the mechanism of action and subsequent events surrounding a KO. The 2 raters coded 125 unique events for a total of 250 events. The 8 factors of Part A had an average κ of 0.87 (SD = 0.10; range = 0.65-0.98); 7 were considered "substantial" agreement and 1 "moderate." Part B consists of 12 factors with an average κ of 0.84 (SD = 0.16; range = 0.59-1.0); 7 classified as "substantial" agreement, 4 "moderate," and 1 "fair." The majority of the factors in the MMA-KT demonstrated substantial interrater agreement, with an average κ of 0.86 (SD = 0.13; range = 0.59-1.0). The MMA-KT is a reliable tool to extract and code relevant information to investigate the situational factors and mechanism of KOs and TKOs in MMA competitions.

  1. Inter-rater and intra-rater agreement of confocal microscopy imaging in diagnosing and subtyping basal cell carcinoma

    NARCIS (Netherlands)

    Kadouch, D. J.; van Haersma de With, A.; Elshot, Y. S.; Peppelman, M.; Bekkenk, M. W.; Wolkerstorfer, A.; Eekhout, I.; Prinsen, C. A. C.; de Rie, M. A.

    2017-01-01

    Reflectance confocal microscopy (RCM) imaging can be used to diagnose and subtype basal cell carcinoma (BCC) but relies on individual morphologic pattern recognition that might vary among users. We assessed the inter-rater and intra-rater agreement of RCM in correctly diagnosing and subtyping BCC.

  2. Inter-rater Agreement of Clinicians' Treatment Recommendations Based on Modified Barium Swallow Study Reports.

    Science.gov (United States)

    Slovarp, Laurie; Danielson, Jennifer; Liss, Julie

    2018-06-07

    The modified barium swallow study (MBSS) is a commonly used radiographic procedure for diagnosis and treatment of swallowing disorders. Despite attempts by dysphagia specialists to standardize the MBSS, most institutions have not adopted such standardized procedures. High variability of assessment patterns arguably contribute to variability of treatment recommendations made from diagnostic information derived from the MBSS report. An online survey was distributed to speech-language pathologists (SLPs) participating in American Speech Language Hearing Association (ASHA) listservs. Sixty-three SLPs who treat swallowing disorders participated. Participating SLPs reviewed two MBSS reports and chose physiologic treatment targets (e.g., tongue base retraction) based on each report. One report primarily contained symptomatology (e.g., aspiration, pharyngeal residue) with minimal information on impaired physiology (e.g., laryngeal incompetence, reduced hyolaryngeal elevation/excursion). In contrast, the second report contained a clear description of impaired physiology to explain the dysphagia symptoms. Fleiss kappa coefficients were used to analyze inter-rater agreement across the high and low physiology report types. Results revealed significantly higher inter-rater agreement across clinicians when reviewing reports with clear explanation(s) of physiologic impairment relative to reports that primarily focused on symptomatology. Clinicians also reported significantly greater satisfaction and treatment confidence following review of reports with clear description(s) of impaired physiology.

  3. Inter-rater and intra-rater agreement on the Nordic Orofacial Test--Screening examination in children, adolescents and young adults with cerebral palsy.

    Science.gov (United States)

    Edvinsson, Siv Elisabet; Lundqvist, Lars-Olov

    2014-02-01

    To evaluate inter-rater and intra-rater agreement on the Nordic Orofacial Test-Screening (NOT-S) examination applied to children, adolescents and young adults with cerebral palsy (CP). Using the NOT-S examination, two speech and language pathologists independently assessed video recordings of 48 subjects with CP aged 5-22 years and representing all CP sub-diagnoses and levels of gross motor function and manual ability. Thirty-one subjects were reassessed. Fifteen out of 17 items in the NOT-S examination domains (1) Face at rest, (2) Nose breathing, (3) Facial expression, (4) Masticatory muscle and jaw function, (5) Oral motor function and (6) Speech were rated using a 'yes' (dysfunction observed)/'no' format, generating an overall score of 0-6. Inter-rater agreement: Twelve out of 15 items and five out of six domains showed acceptable unweighted kappa values (κ = 0.46-1.00). The lowest kappa value was found for domain 4 (κ = -0.04), although it had high inter-rater agreement (92%). The linear weighted kappa value for the overall NOT-S examination score was 0.65 (95% CI = 0.49-0.82). Intra-rater agreement: All items and domains showed acceptable unweighted kappa values (items 0.58-1.00 and 0.59-1.00, domains 0.81-1.00 and 0.62-0.89) for both raters. The linear weighted kappa value for the overall NOT-S examination score was 0.81 (95% CI = 0.63-0.99) for rater A and 0.54 (95% CI = 0.25-0.82) for rater B. The NOT-S examination has acceptable inter-rater and intra-rater agreement when used in young individuals with CP.
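
The linear weighted kappa reported for the overall NOT-S score (an ordinal score from 0 to 6) gives partial credit when two raters differ by only a few points. The sketch below is an illustration under stated assumptions: the function name and arguments are hypothetical, and scores are assumed to be integers from 0 to `n_categories - 1`.

```python
def linear_weighted_kappa(rater_a, rater_b, n_categories):
    """Cohen's kappa with linear weights for two raters on an ordinal scale.

    rater_a, rater_b: equal-length lists of integer scores in
    0 .. n_categories - 1, one pair per subject (n_categories >= 2).
    """
    n = len(rater_a)
    k = n_categories
    observed = [[0] * k for _ in range(k)]
    for a, b in zip(rater_a, rater_b):
        observed[a][b] += 1

    row = [sum(observed[i]) / n for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) / n for j in range(k)]

    def weight(i, j):
        # 1 for identical scores, falling linearly to 0 at maximal distance.
        return 1 - abs(i - j) / (k - 1)

    p_obs = sum(weight(i, j) * observed[i][j] / n
                for i in range(k) for j in range(k))
    p_exp = sum(weight(i, j) * row[i] * col[j]
                for i in range(k) for j in range(k))
    if p_exp == 1.0:
        return float("nan")  # both raters used one identical score throughout
    return (p_obs - p_exp) / (1 - p_exp)
```

For the overall NOT-S score, `n_categories` would be 7. With two categories the weights reduce to 0 and 1, so the function returns the unweighted Cohen's kappa used for the individual yes/no items.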

  4. DWI-ASPECTS (Diffusion-Weighted Imaging-Alberta Stroke Program Early Computed Tomography Scores) and DWI-FLAIR (Diffusion-Weighted Imaging-Fluid Attenuated Inversion Recovery) Mismatch in Thrombectomy Candidates: An Intrarater and Interrater Agreement Study.

    Science.gov (United States)

    Fahed, Robert; Lecler, Augustin; Sabben, Candice; Khoury, Naim; Ducroux, Célina; Chalumeau, Vanessa; Botta, Daniele; Kalsoum, Erwah; Boisseau, William; Duron, Loïc; Cabral, Dominique; Koskas, Patricia; Benaïssa, Azzedine; Koulakian, Hasmik; Obadia, Michael; Maïer, Benjamin; Weisenburger-Lile, David; Lapergue, Bertrand; Wang, Adrien; Redjem, Hocine; Ciccio, Gabriele; Smajda, Stanislas; Desilles, Jean-Philippe; Mazighi, Mikaël; Ben Maacha, Malek; Akkari, Inès; Zuber, Kevin; Blanc, Raphaël; Raymond, Jean; Piotin, Michel

    2018-01-01

    We aimed to study the intrarater and interrater agreement of clinicians attributing DWI-ASPECTS (Diffusion-Weighted Imaging-Alberta Stroke Program Early Computed Tomography Scores) and DWI-FLAIR (Diffusion-Weighted Imaging-Fluid Attenuated Inversion Recovery) mismatch in patients with acute ischemic stroke referred for mechanical thrombectomy. Eighteen raters independently scored anonymized magnetic resonance imaging scans of 30 participants from a multicentre thrombectomy trial, in 2 different reading sessions. Agreement was measured using Fleiss κ and Cohen κ statistics. Interrater agreement for DWI-ASPECTS was slight (κ=0.17 [0.14-0.21]). Four raters (22.2%) had a substantial (or higher) intrarater agreement. Dichotomization of the DWI-ASPECTS (0-5 versus 6-10 or 0-6 versus 7-10) increased the interrater agreement to a substantial level (κ=0.62 [0.48-0.75] and 0.68 [0.55-0.79], respectively) and more raters reached a substantial (or higher) intrarater agreement (17/18 raters [94.4%]). Interrater agreement for DWI-FLAIR mismatch was moderate (κ=0.43 [0.33-0.57]); 11 raters (61.1%) reached a substantial (or higher) intrarater agreement. Agreement between clinicians assessing DWI-ASPECTS and DWI-FLAIR mismatch may not be sufficient to make repeatable clinical decisions in mechanical thrombectomy. The dichotomization of the DWI-ASPECTS (0-5 versus 6-10 or 0-6 versus 7-10) improved interrater and intrarater agreement; however, its relevance for patient selection for mechanical thrombectomy needs to be validated in a randomized trial. © 2017 American Heart Association, Inc.
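
    Why collapsing an 11-point score into two classes can raise Fleiss' κ may be sketched as follows. The ratings are invented (4 scans, 3 raters) and the helper names are hypothetical; only the 0-6 versus 7-10 cutoff is taken from the abstract. The function assumes every subject is rated by the same number of raters.

```python
from collections import Counter

def fleiss_kappa(ratings):
    """ratings: one list per subject, holding one label per rater (equal lengths)."""
    n_subjects = len(ratings)
    n_raters = len(ratings[0])
    totals = Counter()
    p_bar = 0.0
    for row in ratings:
        counts = Counter(row)
        totals.update(counts)
        # Proportion of agreeing rater pairs for this subject.
        p_bar += (sum(c * c for c in counts.values()) - n_raters) / (n_raters * (n_raters - 1))
    p_bar /= n_subjects
    # Chance agreement from the pooled category proportions.
    p_e = sum((c / (n_subjects * n_raters)) ** 2 for c in totals.values())
    return (p_bar - p_e) / (1 - p_e)

# Hypothetical 0-10 scores: raters are usually within one or two points of each other.
scores = [[2, 3, 4], [8, 9, 9], [6, 6, 7], [7, 8, 10]]
dichotomized = [["0-6" if s <= 6 else "7-10" for s in row] for row in scores]

kappa_raw = fleiss_kappa(scores)
kappa_binary = fleiss_kappa(dichotomized)
print(f"raw: {kappa_raw:.2f}, dichotomized: {kappa_binary:.2f}")
```

    In this toy example near-misses on the full scale count as complete disagreement, so κ is close to zero, while most of them fall on the same side of the cutoff once the score is dichotomized.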

  5. An Overview of Interrater Agreement on Likert Scales for Researchers and Practitioners

    Directory of Open Access Journals (Sweden)

    Thomas A. O'Neill

    2017-05-01

    Applications of interrater agreement (IRA) statistics for Likert scales are plentiful in research and practice. IRA may be implicated in job analysis, performance appraisal, panel interviews, and any other approach to gathering systematic observations. Any rating system involving subject-matter experts can also benefit from IRA as a measure of consensus. Further, IRA is fundamental to aggregation in multilevel research, which is becoming increasingly common in order to address nesting. Although several technical descriptions of a few specific IRA statistics exist, this paper aims to provide a tractable orientation to common IRA indices to support application. The introductory overview is written with the intent of facilitating contrasts among IRA statistics by critically reviewing equations, interpretations, strengths, and weaknesses. Statistics considered include rwg, rwg*, r′wg, rwg(p), average deviation (AD), awg, standard deviation (Swg), and the coefficient of variation (CVwg). Equations support quick calculation and contrasting of different agreement indices. The article also includes a “quick reference” table and three figures in order to help readers identify how IRA statistics differ and how interpretations of IRA will depend strongly on the statistic employed. A brief consideration of recommended practices involving statistical and practical cutoff standards is presented, and conclusions are offered in light of the current literature.
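
    Two of the indices named above can be computed in a few lines. The sketch below uses the commonly cited single-item definitions, which are assumptions here rather than quotations from the article: rwg = 1 - S²/σ²E, where S² is the observed variance of the ratings and σ²E = (A² - 1)/12 is the variance of a uniform "no agreement" distribution over A response options, and AD is the mean absolute deviation of the ratings from their mean. The ratings are invented.

```python
from statistics import mean, variance

def rwg_single_item(ratings, n_options):
    """r_wg for one item, assuming a uniform null distribution over n_options."""
    expected_variance = (n_options ** 2 - 1) / 12  # variance of a discrete uniform 1..A
    return 1 - variance(ratings) / expected_variance  # variance() is the sample variance

def average_deviation(ratings):
    """AD index: mean absolute deviation of the ratings around their mean."""
    centre = mean(ratings)
    return mean(abs(r - centre) for r in ratings)

# Hypothetical ratings of one target by five judges on a 5-point Likert scale.
ratings = [4, 4, 5, 4, 3]
rwg = rwg_single_item(ratings, n_options=5)
ad = average_deviation(ratings)
print(f"rwg = {rwg:.2f}, AD = {ad:.2f}")
```

    The two indices run in opposite directions: higher rwg and lower AD both indicate stronger consensus, which is one reason interpretation depends on the statistic chosen.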

  6. Relationships between Contextual and Task Performance and Interrater Agreement: Are There Any?

    Science.gov (United States)

    Díaz-Vilela, Luis F; Delgado Rodríguez, Naira; Isla-Díaz, Rosa; Díaz-Cabrera, Dolores; Hernández-Fernaud, Estefanía; Rosales-Sánchez, Christian

    2015-01-01

    Work performance is one of the most important dependent variables in Work and Organizational Psychology. The main objective of this paper was to explore the relationships between citizenship performance and task performance measures obtained from different appraisers and their consistency through a seldom-used methodology, intraclass correlation coefficients. Participants were 135 public employees, the total staff in a local government department. Jobs were clustered into job families through a work analysis based on standard questionnaires. A task description technique was used to develop a performance appraisal questionnaire for each job family, with three versions: self-, supervisor-, and peer-evaluation, in addition to a measure of citizenship performance. Only when the self-appraisal bias is controlled, significant correlations appeared between task performance rates. However, intraclass correlations analyses show that only self- (contextual and task) performance measures are consistent, while interrater agreement disappears. These results provide some interesting clues about the procedure of appraisal instrument development, the role of appraisers, and the importance of choosing adequate consistency analysis methods.

  7. Inter-Rater Agreement of Pressure Ulcer Risk and Prevention Measures in the National Database of Nursing Quality Indicators(®) (NDNQI).

    Science.gov (United States)

    Waugh, Shirley Moore; Bergquist-Beringer, Sandra

    2016-06-01

    In this descriptive multi-site study, we examined inter-rater agreement on 11 National Database of Nursing Quality Indicators(®) (NDNQI(®)) pressure ulcer (PrU) risk and prevention measures. One hundred twenty raters at 36 hospitals captured data from 1,637 patient records. At each hospital, agreement between the most experienced rater and each other team rater was calculated for each measure. In the ratings studied, 528 patients were rated as "at risk" for PrU and, therefore, were included in calculations of agreement for the prevention measures. Prevalence-adjusted kappa (PAK) was used to interpret inter-rater agreement because prevalence of single responses was high. The PAK values for eight measures indicated "substantial" to "near perfect" agreement between most experienced and other team raters: Skin assessment on admission (.977, 95% CI [.966-.989]), PrU risk assessment on admission (.978, 95% CI [.964-.993]), Time since last risk assessment (.790, 95% CI [.729-.852]), Risk assessment method (.997, 95% CI [.991-1.0]), Risk status (.877, 95% CI [.838-.917]), Any prevention (.856, 95% CI [.76-.943]), Skin assessment (.956, 95% CI [.904-1.0]), and Pressure-redistribution surface use (.839, 95% CI [.763-.916]). For three intervention measures, PAK values fell below the recommended value of ≥.610: Routine repositioning (.577, 95% CI [.494-.661]), Nutritional support (.500, 95% CI [.418-.581]), and Moisture management (.556, 95% CI [.469-.643]). Areas of disagreement were identified. Findings provide support for the reliability of 8 of the 11 measures. Further clarification of data collection procedures is needed to improve reliability for the less reliable measures. © 2016 Wiley Periodicals, Inc.
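
    The abstract does not state how PAK was computed. One widely used prevalence-adjusted form is PABAK (prevalence-adjusted, bias-adjusted kappa), which replaces the data-dependent chance term with 1/k for k categories, giving (k·Po - 1)/(k - 1), or 2·Po - 1 for yes/no items. The sketch below assumes that form; the data and the function name are invented for illustration.

```python
def pabak(rater_a, rater_b, n_categories=2):
    """PABAK: kappa with chance agreement fixed at 1 / n_categories."""
    n = len(rater_a)
    p_o = sum(x == y for x, y in zip(rater_a, rater_b)) / n
    return (n_categories * p_o - 1) / (n_categories - 1)

# Hypothetical record reviews: 20 patients, almost all rated "yes" by both raters.
experienced = ["yes"] * 18 + ["no"] + ["yes"]
other_rater = ["yes"] * 18 + ["no"] + ["no"]

value = pabak(experienced, other_rater)
print(f"PABAK = {value:.2f}")
```

    Because the chance term no longer depends on how rare one response is, the statistic stays interpretable when a single answer dominates, which is the situation the authors describe.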

  8. Interrater variability of EEG interpretation in comatose cardiac arrest patients

    DEFF Research Database (Denmark)

    Westhall, Erik; Rosén, Ingmar; Rossetti, Andrea O

    2015-01-01

    OBJECTIVE: EEG is widely used to predict outcome in comatose cardiac arrest patients, but its value has been limited by lack of a uniform classification. We used the EEG terminology proposed by the American Clinical Neurophysiology Society (ACNS) to assess interrater variability in a cohort...... who were blinded for patient outcome. Percent agreement and kappa (κ) for the categories in the ACNS EEG terminology and for prespecified malignant EEG-patterns were calculated. RESULTS: There was substantial interrater agreement (κ 0.71) for highly malignant patterns and moderate agreement (κ 0.......42) for malignant patterns. Substantial agreement was found for malignant periodic or rhythmic patterns (κ 0.72) while agreement for identifying an unreactive EEG was fair (κ 0.26). CONCLUSIONS: The ACNS EEG terminology can be used to identify highly malignant EEG-patterns in post cardiac arrest patients...

  9. Relationships between Contextual and Task Performance and Interrater Agreement: Are There Any?

    Directory of Open Access Journals (Sweden)

    Luis F Díaz-Vilela

    Work performance is one of the most important dependent variables in Work and Organizational Psychology. The main objective of this paper was to explore the relationships between citizenship performance and task performance measures obtained from different appraisers and their consistency through a seldom-used methodology, intraclass correlation coefficients. Participants were 135 public employees, the total staff in a local government department. Jobs were clustered into job families through a work analysis based on standard questionnaires. A task description technique was used to develop a performance appraisal questionnaire for each job family, with three versions: self-, supervisor-, and peer-evaluation, in addition to a measure of citizenship performance. Only when the self-appraisal bias is controlled, significant correlations appeared between task performance rates. However, intraclass correlations analyses show that only self- (contextual and task) performance measures are consistent, while interrater agreement disappears. These results provide some interesting clues about the procedure of appraisal instrument development, the role of appraisers, and the importance of choosing adequate consistency analysis methods.

  10. Diagnosing central lesions of the triangular fibrocartilage as traumatic or degenerative: a review of clinical accuracy.

    Science.gov (United States)

    Löw, S; Erne, H; Pillukat, T; Mühldorfer-Fodor, M; Unglaub, F; Spies, C K

    2017-05-01

    This study examined the reliability of surgeons' estimations as to whether central lesions of the triangular fibrocartilage complex were traumatic or degenerative. A total of 50 consecutive central triangular fibrocartilage complex lesions were independently rated by ten experienced wrist surgeons viewing high-quality arthroscopy videos. The videos were reassessed after intervals of 3 months; at the second assessment surgeons were given the patient's history, radiographs and both, each in a randomized order. Finally, the surgeons assessed the histories and radiographs without the videos. Kappa statistics revealed fair interrater agreement when the histories were added to the videos. The other four modalities demonstrated moderate agreement, with lower Kappa values for the assessment without videos. Intra-rater reliability showed fair agreement for three surgeons, moderate agreement for two surgeons and substantial agreement for five surgeons. It appears that classification of central triangular fibrocartilage complex lesions depends on the information provided upon viewing the triangular fibrocartilage complex at arthroscopy. Level of evidence: II.

  11. How to assess and compare inter-rater reliability, agreement and correlation of ratings: an exemplary analysis of mother-father and parent-teacher expressive vocabulary rating pairs.

    Science.gov (United States)

    Stolarova, Margarita; Wolf, Corinna; Rinker, Tanja; Brielmann, Aenne

    2014-01-01

    This report has two main purposes. First, we combine well-known analytical approaches to conduct a comprehensive assessment of agreement and correlation of rating-pairs and to disentangle these often confused concepts, providing a best-practice example on concrete data and a tutorial for future reference. Second, we explore whether a screening questionnaire developed for use with parents can be reliably employed with daycare teachers when assessing early expressive vocabulary. A total of 53 vocabulary rating pairs (34 parent-teacher and 19 mother-father pairs) collected for two-year-old children (12 bilingual) are evaluated. First, inter-rater reliability both within and across subgroups is assessed using the intra-class correlation coefficient (ICC). Next, based on this analysis of reliability and on the test-retest reliability of the employed tool, inter-rater agreement is analyzed, and the magnitude and direction of rating differences are considered. Finally, Pearson correlation coefficients of standardized vocabulary scores are calculated and compared across subgroups. The results underline the necessity to distinguish between reliability measures, agreement and correlation. They also demonstrate the impact of the employed reliability on agreement evaluations. This study provides evidence that parent-teacher ratings of children's early vocabulary can achieve agreement and correlation comparable to those of mother-father ratings on the assessed vocabulary scale. Bilingualism of the evaluated child decreased the likelihood of raters' agreement. We conclude that future reports of agreement, correlation and reliability of ratings will benefit from better definition of terms and stricter methodological approaches. The methodological tutorial provided here holds the potential to increase comparability across empirical reports and can help improve research practices and knowledge transfer to educational and therapeutic settings.
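
    The distinction between correlation and agreement can be made concrete with a small sketch. The scores are invented, and the ICC form is an assumption: the abstract does not say which variant was used, so the two-way random-effects, absolute-agreement, single-rating form, ICC(2,1), is shown for two raters.

```python
from statistics import mean

def pearson_r(x, y):
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

def icc_2_1(x, y):
    """ICC(2,1) for two raters: two-way random effects, absolute agreement."""
    n, k = len(x), 2
    grand = mean(x + y)
    row_means = [(a + b) / 2 for a, b in zip(x, y)]
    col_means = [mean(x), mean(y)]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((v - grand) ** 2 for v in x + y)
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_err = (ss_total - ss_rows - ss_cols) / ((n - 1) * (k - 1))
    return (ms_rows - ms_err) / (ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n)

# Hypothetical vocabulary scores: the second rater is always exactly 10 points higher.
parent = [40, 50, 60, 70]
teacher = [50, 60, 70, 80]

r = pearson_r(parent, teacher)
icc = icc_2_1(parent, teacher)
mean_difference = mean(t - p for p, t in zip(parent, teacher))
print(f"Pearson r = {r:.2f}, ICC(2,1) = {icc:.2f}, mean difference = {mean_difference}")
```

    The ratings are perfectly correlated, yet no pair of ratings matches; the absolute-agreement ICC is penalised by the constant offset, and the mean difference exposes it directly.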

  12. How to assess and compare inter-rater reliability, agreement and correlation of ratings: an exemplary analysis of mother-father and parent-teacher expressive vocabulary rating pairs

    Science.gov (United States)

    Stolarova, Margarita; Wolf, Corinna; Rinker, Tanja; Brielmann, Aenne

    2014-01-01

    This report has two main purposes. First, we combine well-known analytical approaches to conduct a comprehensive assessment of agreement and correlation of rating-pairs and to disentangle these often confused concepts, providing a best-practice example on concrete data and a tutorial for future reference. Second, we explore whether a screening questionnaire developed for use with parents can be reliably employed with daycare teachers when assessing early expressive vocabulary. A total of 53 vocabulary rating pairs (34 parent–teacher and 19 mother–father pairs) collected for two-year-old children (12 bilingual) are evaluated. First, inter-rater reliability both within and across subgroups is assessed using the intra-class correlation coefficient (ICC). Next, based on this analysis of reliability and on the test-retest reliability of the employed tool, inter-rater agreement is analyzed, and the magnitude and direction of rating differences are considered. Finally, Pearson correlation coefficients of standardized vocabulary scores are calculated and compared across subgroups. The results underline the necessity to distinguish between reliability measures, agreement and correlation. They also demonstrate the impact of the employed reliability on agreement evaluations. This study provides evidence that parent–teacher ratings of children's early vocabulary can achieve agreement and correlation comparable to those of mother–father ratings on the assessed vocabulary scale. Bilingualism of the evaluated child decreased the likelihood of raters' agreement. We conclude that future reports of agreement, correlation and reliability of ratings will benefit from better definition of terms and stricter methodological approaches. The methodological tutorial provided here holds the potential to increase comparability across empirical reports and can help improve research practices and knowledge transfer to educational and therapeutic settings. PMID:24994985

  13. A Surgery Oral Examination: Interrater Agreement and the Influence of Rater Characteristics.

    Science.gov (United States)

    Burchard, Kenneth W.; And Others

    1995-01-01

    A study measured interrater reliability among 140 United States and Canadian surgery exam raters and the influences of age, years in practice, and experience as an examiner on individual scores. Results indicate three aspects of examinee performance influenced scores: verbal style, dress, and content of answers. No rater characteristic…

  14. How to assess and compare inter-rater reliability, agreement and correlation of ratings: an exemplary analysis of mother-father and parent-teacher expressive vocabulary rating pairs

    Directory of Open Access Journals (Sweden)

    Margarita Stolarova

    2014-06-01

    This report has two main purposes. First, we combine well-known analytical approaches to conduct a comprehensive assessment of agreement and correlation of rating-pairs and to disentangle these often confused concepts, providing a best-practice example on concrete data and a tutorial for future reference. Second, we explore whether a screening questionnaire developed for use with parents can be reliably employed with daycare teachers when assessing early expressive vocabulary. A total of 53 vocabulary rating pairs (34 parent-teacher and 19 mother-father pairs) collected for two-year-old children (12 bilingual) are evaluated. First, inter-rater reliability both within and across subgroups is assessed using the intra-class correlation coefficient (ICC). Next, based on this analysis of reliability and on the test-retest reliability of the employed tool, inter-rater agreement is analyzed, and the magnitude and direction of rating differences are considered. Finally, Pearson correlation coefficients of standardized vocabulary scores are calculated and compared across subgroups. The results underline the necessity to distinguish between reliability measures, agreement and correlation. They also demonstrate the impact of the employed reliability on agreement evaluations. This study provides evidence that parent-teacher ratings of children’s early vocabulary can achieve agreement and correlation comparable to those of mother-father ratings on the assessed vocabulary scale. Bilingualism of the evaluated child decreased the likelihood of raters’ agreement. We conclude that future reports of agreement, correlation and reliability of ratings will benefit from better definition of terms and stricter methodological approaches. The methodological tutorial provided here holds the potential to increase comparability across empirical reports and can help improve research practices and knowledge transfer to educational and therapeutic settings.

  15. Interrater reliability of a Pilates movement-based classification system.

    Science.gov (United States)

    Yu, Kwan Kenny; Tulloch, Evelyn; Hendrick, Paul

    2015-01-01

    To determine the interrater reliability for identification of a specific movement pattern using a Pilates Classification system. Videos of 5 subjects performing specific movement tasks were sent to raters trained in the DMA-CP classification system. Ninety-six raters completed the survey. Interrater reliability for the detection of a directional bias was excellent (Pi = 0.92 and K(free) = 0.89). Interrater reliability for classifying an individual into a specific subgroup was moderate (Pi = 0.64 and K(free) = 0.55); however, raters who had completed levels 1-4 of the DMA-CP training and reported using the assessment daily demonstrated excellent reliability (Pi = 0.89 and K(free) = 0.87). The reliability of the classification system demonstrated almost perfect agreement in determining the existence of a specific movement pattern and classifying into a subgroup for experienced raters. There was a trend for greater reliability associated with increased levels of training and experience of the raters. Copyright © 2014 Elsevier Ltd. All rights reserved.

  16. Laryngeal precursor lesions

    DEFF Research Database (Denmark)

    Mehlum, Camilla Slot; Larsen, Stine Rosenkilde; Kiss, Katalin

    2018-01-01

    ; estimates of interrater and intrarater agreements were calculated with kappa statistics. Results: The crude intrarater agreements using the WHOC 2005 were 0.93 for rater 1 and 0.62 for rater 2. The corresponding unweighted kappa values were 0.90 (95% confidence interval [CI], 0.86-0.95) for rater 1 and 0.......43 (95% CI, 0.35-0.54) for rater 2, whereas the standard linear weighted kappa values were 0.93 (95% CI, 0.90-0.97) for rater 1 and 0.60 (95% CI, 0.53-0.69) for rater 2. The crude interrater agreement for the WHOC 2005 was 0.57, with a corresponding unweighted kappa value 0.38 (95% CI, 0...

  17. Lung, liver and lymph node metastases in follow-up MSCT. Comprehensive volumetric assessment of lesion size changes

    International Nuclear Information System (INIS)

    Wulff, A.M.; Fischer, S.; Biederer, J.; Heller, M.; Fabel, M.; Bolte, H.; Freitag-Wolf, S.; Soza, G.; Tietjen, C.

    2012-01-01

    Purpose: To investigate measurement accuracy in terms of precision and inter-rater variability in the simultaneous volumetric assessment of lung, liver and lymph node metastasis size change over time in comparison to RECIST 1.1. Materials and Methods: Three independent readers evaluated multislice CT data from clinical follow-up studies (chest/abdomen) in 50 patients with metastases. A total of 117 lung, 77 liver and 97 lymph node metastases were assessed manually (RECIST 1.1) and by volumetry with semi-automated software. The quality of segmentation and need for manual adjustments were recorded. Volumes were converted to effective diameters to allow comparison to RECIST. For statistical assessment of precision and interobserver agreement, the Wilcoxon signed-rank test and Bland-Altman plots were utilized. Results: The quality of segmentation after manual correction was acceptable to excellent in 95 % of lesions and manual corrections were applied in 21 - 36 % of all lesions, most predominantly in lymph nodes. Mean precision was 2.6 - 6.3 % (manual) with 0.2 - 1.5 % (effective) relative measurement deviation (p <.001). Inter-reader median variation coefficients ranged from 9.4 - 12.8 % (manual) and 2.9 - 8.2 % (volumetric) for different lesion types (p <.001). The limits of agreement were ± 9.8 to ± 11.2 % for volumetric assessment. Conclusion: Superior precision and inter-rater variability of volumetric over manual measurement of lesion change over time was demonstrated in a whole body setting. (orig.)
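
    Bland-Altman limits of agreement, as used above, are conventionally the mean difference between two readers plus or minus 1.96 standard deviations of the differences. The sketch below assumes that conventional definition and uses invented effective diameters in millimetres; the study itself reports limits as relative (percentage) deviations, which would require dividing each difference by the pair's mean first.

```python
from statistics import mean, stdev

def limits_of_agreement(reader_1, reader_2, z=1.96):
    """Return (bias, lower limit, upper limit) of the paired differences."""
    diffs = [a - b for a, b in zip(reader_1, reader_2)]
    bias = mean(diffs)
    spread = z * stdev(diffs)  # sample standard deviation of the differences
    return bias, bias - spread, bias + spread

# Hypothetical effective diameters (mm) of five lesions measured by two readers.
reader_1 = [10.0, 12.0, 15.0, 20.0, 25.0]
reader_2 = [11.0, 11.0, 16.0, 19.0, 26.0]

bias, lower, upper = limits_of_agreement(reader_1, reader_2)
print(f"bias = {bias:.2f} mm, limits of agreement = [{lower:.2f}, {upper:.2f}] mm")
```

    Narrower limits mean that two readers' measurements of the same lesion can be expected to lie closer together, which is the sense in which the volumetric method is reported as superior.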

  18. Lung, liver and lymph node metastases in follow-up MSCT. Comprehensive volumetric assessment of lesion size changes

    Energy Technology Data Exchange (ETDEWEB)

    Wulff, A.M.; Fischer, S.; Biederer, J.; Heller, M.; Fabel, M. [Universitaetsklinikum Schleswig-Holstein, Kiel (Germany). Klinik fuer Diagnostische Radiologie; Bolte, H. [Universitaetsklinikum Muenster (Germany). Klinik und Poliklinik fuer Nuklearmedizin; Freitag-Wolf, S. [Universitaetsklinikum Schleswig-Holstein, Kiel (Germany). Inst. fuer Medizinische Informatik und Statistik; Soza, G.; Tietjen, C. [Siemens AG (Germany). Imaging and IT Div. Computed Tomography

    2012-09-15

    Purpose: To investigate measurement accuracy in terms of precision and inter-rater variability in the simultaneous volumetric assessment of lung, liver and lymph node metastasis size change over time in comparison to RECIST 1.1. Materials and Methods: Three independent readers evaluated multislice CT data from clinical follow-up studies (chest/abdomen) in 50 patients with metastases. A total of 117 lung, 77 liver and 97 lymph node metastases were assessed manually (RECIST 1.1) and by volumetry with semi-automated software. The quality of segmentation and need for manual adjustments were recorded. Volumes were converted to effective diameters to allow comparison to RECIST. For statistical assessment of precision and interobserver agreement, the Wilcoxon signed-rank test and Bland-Altman plots were utilized. Results: The quality of segmentation after manual correction was acceptable to excellent in 95 % of lesions and manual corrections were applied in 21 - 36 % of all lesions, most predominantly in lymph nodes. Mean precision was 2.6 - 6.3 % (manual) with 0.2 - 1.5 % (effective) relative measurement deviation (p <.001). Inter-reader median variation coefficients ranged from 9.4 - 12.8 % (manual) and 2.9 - 8.2 % (volumetric) for different lesion types (p <.001). The limits of agreement were ± 9.8 to ± 11.2 % for volumetric assessment. Conclusion: Superior precision and inter-rater variability of volumetric over manual measurement of lesion change over time was demonstrated in a whole body setting. (orig.)

  19. Inter-rater and intra-rater reliability of a movement control test in shoulder.

    Science.gov (United States)

    Rajasekar, S; Bangera, Rakshith K; Sekaran, Padmanaban

    2017-07-01

    Movement faults are commonly observed in patients with musculoskeletal pain. The Kinetic Medial Rotation Test (KMRT) is a movement control test used to identify movement faults of the scapula and gleno-humeral joints during arm movement. Objective tests such as the KMRT need to be reliable and valid for the results to be applied across different clinical settings and patient populations. The primary objective of the present study was to determine the intra-rater and inter-rater reliability of KMRT in subjects with and without shoulder pain. Sixty subjects were included in this study based on specific inclusion and exclusion criteria. Two musculoskeletal physiotherapists with different levels of clinical experience performed the tests. The intra-rater reliability was tested in twenty asymptomatic subjects by a single assessor at two-week intervals. An equal number of subjects with and without shoulder pain were tested by both the assessors to determine the inter-rater reliability. Both components of the KMRT, the Gleno-Humeral Anterior Translation (GHAT) and the Scapular Forward Tilt (SCFT), were tested. The Kappa values for inter-rater reliability of the GHAT and SCFT were K = 0.68 and K = 0.65, respectively, in subjects with shoulder pain. In asymptomatic subjects, the inter-rater reliability of GHAT was K = 0.61 and SCFT was K = 0.85. Intra-rater reliability ranged from K = 0.66 for GHAT to K = 0.87 for SCFT. Our study found substantial agreement in inter-rater reliability of KMRT in subjects with shoulder pain, whereas substantial to near perfect agreement was found in intra-rater and inter-rater reliability of KMRT in subjects without shoulder pain. Copyright © 2017 Elsevier Ltd. All rights reserved.

  20. Inter-rater reliability of diagnostic criteria for sacroiliac joint-, disc- and facet joint pain.

    Science.gov (United States)

    van Tilburg, Cornelis W J; Groeneweg, Johannes G; Stronks, Dirk L; Huygen, Frank J P M

    2017-01-01

    Several diagnostic criteria sets are described in the literature to identify low back pain subtypes, but very little is known about the inter-rater reliability of these criteria. We conducted a study to determine the reliability of diagnostic tests that point towards SI joint-, disc- or facet joint pain. Inter-rater reliability study alongside three randomized clinical trials. Multidisciplinary pain center of general hospital. Patients aged 18 or more with medical history and physical examination suggestive of sacroiliac joint-, disc- and facet joint pain on lumbar level. Using the diagnostic criteria most commonly used at present, a physical examination was performed independently by three physicians (two pain physicians and one orthopedic surgeon). Inter-rater reliability (Kappa (κ) measure of agreement) and significance (p) between raters are presented. Strengths of agreement, indicated with κ values above 0.20, are presented in order of agreement. One hundred patients were included. None of the parameters from the physical investigation had κ values of more than 0.21 (fair) in all pairs of raters. Between two raters (C and D), there was an almost perfect agreement on three parameters, more specifically "Abnormal sensory and motor examination, hyperactive or diminished reflexes", "Sitting exam shows no reflex, motor or sensory signs in the legs" and "Straight leg raising (Lasègue) negative between 30 and 70 degrees of flexion". The "Drop test positive" parameters had moderate strength of agreement between raters A and D and fair strength between raters A and B. The "Digital interspinous pressure test positive" had moderate strength of agreement between raters C and D and fair strength of agreement between raters A and B as well as raters B and C. Three other parameters had a fair strength of agreement between two raters, all other parameters had a slight or poor strength of agreement. Inter-rater reliability, confidence intervals and significance of

  1. Inter-rater and intra-rater reliability of the Bahasa Melayu version of Rose Angina Questionnaire.

    Science.gov (United States)

    Hassan, N B; Choudhury, S R; Naing, L; Conroy, R M; Rahman, A R A

    2007-01-01

    The objective of the study is to translate the Rose Questionnaire (RQ) into a Bahasa Melayu version and adapt it cross-culturally, and to measure its inter-rater and intra-rater reliability. This cross-sectional study was conducted in the respondents' homes or workplaces in Kelantan, Malaysia. One hundred respondents aged 30 and above with different socio-demographic status were interviewed for face validity. For each inter-rater and intra-rater reliability, a sample of 150 respondents was interviewed. Inter-rater and intra-rater reliabilities were assessed by Cohen's kappa. The overall inter-rater agreement by the five pairs of interviewers at point one and two was 0.86, and intra-rater reliability by the five interviewers on the seven-item questionnaire at point one and two was 0.88, as measured by kappa coefficient. The translated Malay version of RQ demonstrated an almost perfect inter-rater and intra-rater reliability, and further validation such as sensitivity and specificity analysis of this translated questionnaire is highly recommended.

  2. Intra- and interrater reliability of three different MRI grading and classification systems after acute hamstring injuries.

    Science.gov (United States)

    Wangensteen, Arnlaug; Tol, Johannes L; Roemer, Frank W; Bahr, Roald; Dijkstra, H Paul; Crema, Michel D; Farooq, Abdulaziz; Guermazi, Ali

    2017-04-01

    To assess and compare the intra- and interrater reliability of three different MRI grading and classification systems after acute hamstring injury. Male athletes (n=40) with clinical diagnosis of acute hamstring injury and MRI ≤5 days were selected from a prospective cohort. Two radiologists independently evaluated the MRIs using a standardised scoring form including the modified Peetrons grading system, the Chan acute muscle strain injury classification and the British Athletics Muscle Injury Classification. Intra- and interrater reliability was assessed with linear weighted kappa (κ) or unweighted Cohen's κ and percentage agreement was calculated. We observed 'substantial' to 'almost perfect' intra- (κ range 0.65-1.00) and interrater reliability (κ range 0.77-1.00) with percentage agreement 83-100% and 88-100%, respectively, for severity gradings, overall anatomical sites and overall classifications for the three MRI systems. We observed substantial variability (κ range -0.05 to 1.00) for subcategories within the Chan classification and the British Athletics Muscle Injury Classification; however, the prevalence of positive scorings was low for some subcategories. The modified Peetrons grading system, overall Chan classification and overall British Athletics Muscle Injury Classification demonstrated 'substantial' to 'almost perfect' intra- and interrater reliability when scored by experienced radiologists. The intra- and interrater reliability for the anatomical subcategories within the classifications remains unclear. Copyright © 2017 Elsevier B.V. All rights reserved.
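
    The difference between linear weighted and unweighted kappa for an ordinal grade can be sketched as follows. Linear weights give partial credit of 1 - |i - j|/(k - 1) when two raters choose grades i and j out of k ordered grades. The grades below are invented (a four-level 0-3 scale is assumed for illustration) and the function name is hypothetical.

```python
def kappa(rater_a, rater_b, n_categories, linear_weights=False):
    """Cohen's kappa for integer grades 0..n_categories-1, optionally linear weighted."""
    n, k = len(rater_a), n_categories

    def weight(i, j):
        if linear_weights:
            return 1 - abs(i - j) / (k - 1)  # partial credit for near-misses
        return 1.0 if i == j else 0.0        # unweighted: exact matches only

    prop_a = [rater_a.count(c) / n for c in range(k)]
    prop_b = [rater_b.count(c) / n for c in range(k)]
    p_o = sum(weight(x, y) for x, y in zip(rater_a, rater_b)) / n
    p_e = sum(weight(i, j) * prop_a[i] * prop_b[j] for i in range(k) for j in range(k))
    return (p_o - p_e) / (1 - p_e)

# Hypothetical severity grades (0-3) given by two radiologists to eight injuries.
radiologist_1 = [0, 1, 1, 2, 2, 3, 3, 0]
radiologist_2 = [0, 1, 2, 2, 3, 3, 3, 1]

unweighted = kappa(radiologist_1, radiologist_2, n_categories=4)
weighted = kappa(radiologist_1, radiologist_2, n_categories=4, linear_weights=True)
print(f"unweighted = {unweighted:.2f}, linear weighted = {weighted:.2f}")
```

    All three disagreements in this toy data are one grade apart, so the weighted statistic is higher than the unweighted one; for nominal subcategories without an ordering, only the unweighted form is meaningful.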

  3. Unfolding the phenomenon of interrater agreement

    DEFF Research Database (Denmark)

    Slaug, Björn; Schilling, Oliver; Helle, Tina

    2012-01-01

    accounted for 6-11%, the items for 32-33%, and the residual for 57-60% of the variation. Multilevel regression analysis showed barrier prevalence and raters' familiarity with using standardized instruments to have the strongest impact on agreement. CONCLUSION: Supported by a conceptual analysis, we propose...

  4. Interrater reliability assessment using the Test of Gross Motor Development-2.

    Science.gov (United States)

    Barnett, Lisa M; Minto, Christine; Lander, Natalie; Hardy, Louise L

    2014-11-01

    The aim was to examine interrater reliability of the object control subtest from the Test of Gross Motor Development-2 by live observation in a school field setting. Reliability Study--cross sectional. Raters were rated on their ability to agree on (1) the raw total for the six object control skills; (2) each skill performance and (3) the skill components. Agreement for the object control subtest and the individual skills was assessed by an intraclass correlation (ICC) and a kappa statistic assessed for skill component agreement. A total of 37 children (65% girls) aged 4-8 years (M = 6.2, SD = 0.8) were assessed in six skills by two raters; equating to 222 skill tests. Interrater reliability was excellent for the object control subset (ICC = 0.93), and for individual skills, highest for the dribble (ICC = 0.94) followed by strike (ICC = 0.85), overhand throw (ICC = 0.84), underhand roll (ICC = 0.82), kick (ICC = 0.80) and the catch (ICC = 0.71). The strike and the throw had more components with less agreement. Even though the overall subtest score and individual skill agreement was good, some skill components had lower agreement, suggesting these may be more problematic to assess. This may mean some skill components need to be specified differently in order to improve component reliability. Crown Copyright © 2013. Published by Elsevier Ltd. All rights reserved.

  5. The Post-Ureteroscopic Lesion Scale (PULS)

    DEFF Research Database (Denmark)

    Schoenthaler, Martin; Buchholz, Noor; Farin, Erik

    2014-01-01

    The Post-Ureteroscopic Lesion Scale (PULS) offers a simple grading system for the description of ureteral lesions after ureteroscopy. In this article, we present the results of a video-based multicenter evaluation of the inter-rater reliability of clinically important PULS grades 0-3....

  6. Interobserver agreement for sonograms of breast lesions obtained by an automated breast volume scanner

    International Nuclear Information System (INIS)

    Zhang, Jing; Lai, Xing-Jian; Zhu, Qing-Li; Wang, Hong-Yan; Jiang, Yu-Xin; Liu, He; Dai, Qing; You, Shan-Shan; Xiao, Meng-Su

    2012-01-01

    Objective: To evaluate the interobserver agreement of radiologists in the description and final assessment of breast sonograms obtained using an automated breast volume scanner (ABVS) using a unique descriptor of three-dimensional ultrasound (3D US) and the Breast Imaging Reporting and Data System (BI-RADS) US lexicon. Methods: From October to December 2010, 208 patients were subjected to an ABVS examination in the supine position, and data were automatically sent to the ABVS workstation. Two radiologists independently evaluated 234 breast masses (148 benign and 86 malignant masses) using a unique descriptor from the 3D US and the BI-RADS US lexicon. The reviewers were blinded to the patient's mammographic images, medical history, and pathologic findings. The interobserver agreement was measured using kappa statistics. Results: Substantial agreement was obtained for lesion shape, orientation, margin, echo pattern, posterior acoustic features, calcification and final assessment (κ = 0.79, 0.74, 0.76, 0.69, 0.68, 0.71 and 0.70, respectively). Fair agreement was obtained for retraction phenomenon and lesion boundary (κ = 0.54 and 0.42, respectively). Conclusions: The interobserver agreement for breast sonograms obtained by ABVS is good, especially for lesion shape and margin; however, the interobserver agreement for the retraction phenomenon, which is a unique descriptor of coronal-plane 3D US, needs to be improved.

  7. Evaluating the agreement between tumour volumetry and the estimated volumes of tumour lesions using an algorithm

    Energy Technology Data Exchange (ETDEWEB)

    Laubender, Ruediger P. [German Cancer Consortium (DKTK), Heidelberg (Germany); University Hospital Munich - Campus Grosshadern, Institute of Medical Informatics, Biometry, and Epidemiology (IBE), Munich (Germany); German Cancer Research Center (DKFZ), Heidelberg (Germany); Lynghjem, Julia; D' Anastasi, Melvin; Graser, Anno [University Hospital Munich - Campus Grosshadern, Institute for Clinical Radiology, Munich (Germany); Heinemann, Volker; Modest, Dominik P. [University Hospital Munich - Campus Grosshadern, Department of Medical Oncology, Munich (Germany); Mansmann, Ulrich R. [University Hospital Munich - Campus Grosshadern, Institute of Medical Informatics, Biometry, and Epidemiology (IBE), Munich (Germany); Sartorius, Ute; Schlichting, Michael [Merck KGaA, Darmstadt (Germany)

    2014-07-15

    To evaluate the agreement between tumour volume derived from semiautomated volumetry (SaV) and tumor volume defined by spherical volume using longest lesion diameter (LD) according to Response Evaluation Criteria In Solid Tumors (RECIST) or ellipsoid volume using LD and longest orthogonal diameter (LOD) according to World Health Organization (WHO) criteria. Twenty patients with metastatic colorectal cancer from the CIOX trial were included. A total of 151 target lesions were defined by baseline computed tomography and followed until disease progression. All assessments were performed by a single reader. A variance component model was used to compare the three volume versions. There was a significant difference between the SaV and RECIST-based tumour volumes. The same model showed no significant difference between the SaV and WHO-based volumes. Scatter plots showed that the RECIST-based volumes overestimate lesion volume. The agreement between the SaV and WHO-based relative changes in tumour volume, evaluated by intraclass correlation, showed nearly perfect agreement. Estimating the volume of metastatic lesions using both the LD and LOD (WHO) is more accurate than those based on LD only (RECIST), which overestimates lesion volume. The good agreement between the SaV and WHO-based relative changes in tumour volume enables a reasonable approximation of three-dimensional tumour burden. (orig.)
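
    A short sketch of the two diameter-based volume estimates compared above may help show why the LD-only estimate runs high. The abstract does not give the formulas, so the code assumes the usual sphere volume from a single diameter and, for the LD/LOD case, an ellipsoid whose third axis equals the LOD; both function names and the example diameters are hypothetical.

```python
import math


def sphere_volume_from_ld(ld_mm):
    """Sphere volume (mm^3) with the longest diameter LD as its diameter."""
    return math.pi / 6.0 * ld_mm ** 3


def ellipsoid_volume_from_ld_lod(ld_mm, lod_mm):
    """Ellipsoid volume (mm^3) from LD and LOD.

    Assumption: the unmeasured third axis is set equal to LOD.
    """
    return math.pi / 6.0 * ld_mm * lod_mm ** 2


# Hypothetical lesion: longest diameter 30 mm, longest orthogonal diameter 20 mm.
ld, lod = 30.0, 20.0
v_sphere = sphere_volume_from_ld(ld)
v_ellipsoid = ellipsoid_volume_from_ld_lod(ld, lod)
print(f"sphere (LD only):   {v_sphere:.0f} mm^3")
print(f"ellipsoid (LD+LOD): {v_ellipsoid:.0f} mm^3")
print(f"ratio:              {v_sphere / v_ellipsoid:.2f}")
```

    Under these assumptions the ratio of the two estimates is (LD/LOD)², so the sphere estimate exceeds the ellipsoid estimate whenever the lesion is not round, which is in line with the overestimation reported for the LD-only volumes.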

  8. Inter-rater reliability of measures to characterize the tobacco retail environment in Mexico

    Directory of Open Access Journals (Sweden)

    Marissa G Hall

    2015-11-01

    Objective. To evaluate the inter-rater reliability of a data collection instrument to assess the tobacco retail environment in Mexico, after major marketing regulations were implemented. Materials and methods. In 2013, two data collectors independently evaluated 21 stores in two census tracts, through a data collection instrument that assessed the presence of price promotions, whether single cigarettes were sold, the number of visible advertisements, the presence of signage prohibiting the sale of cigarettes to minors, and characteristics of cigarette pack displays. We evaluated the inter-rater reliability of the collected data, through the calculation of metrics such as intraclass correlation coefficient, percent agreement, Cohen’s kappa and Krippendorff’s alpha. Results. Most measures demonstrated substantial or perfect inter-rater reliability. Conclusions. Our results indicate the potential utility of the data collection instrument for future point-of-sale research.

  9. Diagnostic sensitivity and interobserver agreement of radiography and ultrasonography for detecting trochlear ridge osteochondrosis lesions in the equine stifle.

    Science.gov (United States)

    Beccati, Francesca; Chalmers, Heather J; Dante, Sara; Lotto, Eleonora; Pepe, Marco

    2013-01-01

    Osteochondrosis lesions commonly occur on the femoral trochlear ridges in horses and radiography and ultrasonography are routinely used to diagnose these lesions. However, poor correlation has been found between radiographic and arthroscopic findings of affected trochlear ridges. Interobserver agreement for ultrasonographic diagnoses and correlation between ultrasonographic and arthroscopic findings have not been previously described. Objectives of this study were to describe diagnostic sensitivity and interobserver agreement of radiography and ultrasonography for detecting and grading osteochondrosis lesions of the equine trochlear ridges, using arthroscopy as the reference standard. Twenty-two horses were sampled. Two observers independently recorded radiographic and ultrasonographic findings without knowledge of arthroscopic findings. Imaging findings were compared between observers and with arthroscopic findings. Agreement between observers was moderate to excellent (κ 0.48-0.86) for detecting lesions using radiography and good to excellent (κ 0.74-0.87) for grading lesions using radiography. Agreement between observers was good to excellent (κ 0.78-0.94) for detecting lesions using ultrasonography and very good to excellent (κ 0.86-0.93) for grading lesions using ultrasonography. Diagnostic sensitivity was 84-88% for radiography and 100% for ultrasonography. Diagnostic specificity was 89-100% for radiography and 60-82% for ultrasonography. Agreement between radiography and arthroscopy was good (κ 0.64-0.78). Agreement between ultrasonography and arthroscopy was very good to excellent (κ 0.81-0.87). Findings from this study support ultrasound as a preferred method for predicting presence and severity of osteochondrosis lesions involving the femoral trochlear ridges in horses. © 2012 Veterinary Radiology & Ultrasound.

  10. Intra- and interrater reliability of three different MRI grading and classification systems after acute hamstring injuries

    Energy Technology Data Exchange (ETDEWEB)

    Wangensteen, Arnlaug, E-mail: arnlaug.wangensteen@nih.no [Aspetar, Orthopaedic and Sports Medicine Hospital, Doha (Qatar); Oslo Sports Trauma Research Center, Department of Sports Medicine, Norwegian School of Sport Sciences, Oslo (Norway); Tol, Johannes L., E-mail: johannes.tol@aspetar.com [Aspetar, Orthopaedic and Sports Medicine Hospital, Doha (Qatar); Amsterdam Center for Evidence Sports Medicine, Academic Medical Center (Netherlands); The Sports Physician Group, OLVG, Amsterdam (Netherlands); Roemer, Frank W. [Quantitative Imaging Center, Department of Radiology, Boston University School of Medicine, Boston, MA (United States); Department of Radiology, University of Erlangen-Nuremberg, Erlangen (Germany); Bahr, Roald [Aspetar, Orthopaedic and Sports Medicine Hospital, Doha (Qatar); Oslo Sports Trauma Research Center, Department of Sports Medicine, Norwegian School of Sport Sciences, Oslo (Norway); Dijkstra, H. Paul [Aspetar, Orthopaedic and Sports Medicine Hospital, Doha (Qatar); Crema, Michel D. [Quantitative Imaging Center, Department of Radiology, Boston University School of Medicine, Boston, MA (United States); Department of Radiology, Saint-Antoine Hospital, University Paris VI, Paris (France); Farooq, Abdulaziz [Aspetar, Orthopaedic and Sports Medicine Hospital, Doha (Qatar); Guermazi, Ali [Quantitative Imaging Center, Department of Radiology, Boston University School of Medicine, Boston, MA (United States)

    2017-04-15

    Highlights: • Three different MRI grading and classification systems for acute hamstring injuries are overall reliable. • Reliability for the subcategories within these MRI grading and classification systems remains, however, unclear. - Abstract: Objective: To assess and compare the intra- and interrater reliability of three different MRI grading and classification systems after acute hamstring injury. Methods: Male athletes (n = 40) with clinical diagnosis of acute hamstring injury and MRI ≤5 days were selected from a prospective cohort. Two radiologists independently evaluated the MRIs using a standardised scoring form including the modified Peetrons grading system, the Chan acute muscle strain injury classification and the British Athletics Muscle Injury Classification. Intra- and interrater reliability was assessed with linear weighted kappa (κ) or unweighted Cohen's κ and percentage agreement was calculated. Results: We observed ‘substantial’ to ‘almost perfect’ intra- (κ range 0.65–1.00) and interrater reliability (κ range 0.77–1.00) with percentage agreement 83–100% and 88–100%, respectively, for severity gradings, overall anatomical sites and overall classifications for the three MRI systems. We observed substantial variability (κ range −0.05 to 1.00) for subcategories within the Chan classification and the British Athletics Muscle Injury Classification; however, the prevalence of positive scorings was low for some subcategories. Conclusions: The modified Peetrons grading system, overall Chan classification and overall British Athletics Muscle Injury Classification demonstrated ‘substantial’ to ‘almost perfect’ intra- and interrater reliability when scored by experienced radiologists. The intra- and interrater reliability for the anatomical subcategories within the classifications remains unclear.

  11. Intra- and interrater reliability of three different MRI grading and classification systems after acute hamstring injuries

    International Nuclear Information System (INIS)

    Wangensteen, Arnlaug; Tol, Johannes L.; Roemer, Frank W.; Bahr, Roald; Dijkstra, H. Paul; Crema, Michel D.; Farooq, Abdulaziz; Guermazi, Ali

    2017-01-01

    Highlights: • Three different MRI grading and classification systems for acute hamstring injuries are overall reliable. • Reliability for the subcategories within these MRI grading and classification systems remains, however, unclear. - Abstract: Objective: To assess and compare the intra- and interrater reliability of three different MRI grading and classification systems after acute hamstring injury. Methods: Male athletes (n = 40) with clinical diagnosis of acute hamstring injury and MRI ≤5 days were selected from a prospective cohort. Two radiologists independently evaluated the MRIs using a standardised scoring form including the modified Peetrons grading system, the Chan acute muscle strain injury classification and the British Athletics Muscle Injury Classification. Intra- and interrater reliability was assessed with linear weighted kappa (κ) or unweighted Cohen's κ and percentage agreement was calculated. Results: We observed ‘substantial’ to ‘almost perfect’ intra- (κ range 0.65–1.00) and interrater reliability (κ range 0.77–1.00) with percentage agreement 83–100% and 88–100%, respectively, for severity gradings, overall anatomical sites and overall classifications for the three MRI systems. We observed substantial variability (κ range −0.05 to 1.00) for subcategories within the Chan classification and the British Athletics Muscle Injury Classification; however, the prevalence of positive scorings was low for some subcategories. Conclusions: The modified Peetrons grading system, overall Chan classification and overall British Athletics Muscle Injury Classification demonstrated ‘substantial’ to ‘almost perfect’ intra- and interrater reliability when scored by experienced radiologists. The intra- and interrater reliability for the anatomical subcategories within the classifications remains unclear.

  12. Inter-rater reliability and agreement of the 6-minute walk test in females with hip fractures

    DEFF Research Database (Denmark)

    Overgaard, Jan; Larsen, Camilla Marie; Tange Kristensen, Morten

    physiotherapy students independently examined (randomized order) a convenient sample of 20 participants; their assessments were separated by two days, and testing followed instructions from the American Thoracic Society. Hip pain was assessed with the Verbal Ranking Scale. Participants (all women) with a mean...... (SD) age of 78.1 ± 5.9 years performed the test within a mean of 31.5 ± 5.8 days post-surgery; 10 had a cervical and 10 a trochanteric fracture. Excellent inter-rater reliability; ICC2.1 = 0.92 (95% CI, 0.81 - 0.97) was found, and the standard error of measurement (SEM) and smallest real difference.......6 meters longer, at the second trial (P = 0.002). Participants with moderate hip fracture-related pain walked a shorter distance than those with no or light pain during the first test (P = 0.04), while this was not the case during the second (P = 0.25). Excellent inter-rater reliability was found...

  13. Diagnostic Accuracy and Inter-Observer Agreement of Shoulder Magnetic Resonance Arthrography in the Detection of Labral Lesion and Assessment of Lesion Location

    Energy Technology Data Exchange (ETDEWEB)

    Hwang, Ji Young; Song, Sook Yun; Choi, Jin Ha; Shin, Sang Jin [School of Medicine, Ewha Womans University, Mokdong Hospital, Seoul (Korea, Republic of)

    2012-11-15

    To evaluate the diagnostic accuracy and inter-observer agreement of magnetic resonance (MR) arthrography in the detection of labral lesions by location and to describe useful MR imaging findings of labral tears. Sixty-eight patients who underwent both pre-operative MR arthrography and arthroscopy were included. The location of the labrum was classified into anterior (2-6 o'clock), superior (12-2 o'clock), and posterior (6-12 o'clock). Sensitivity, specificity, accuracy, and inter-observer agreement of MR arthrography for the diagnosis of labral lesions by location were calculated. Frequency of MR imaging findings such as detachment, high signal intensity cleft, contour change, absence, and signal change of the labrum by location were analyzed. 35 anterior, 44 superior and 15 posterior labral lesions were detected by arthroscopy. The corresponding sensitivities were 91.4%, 79.5%, and 40.0%, specificities were 90.9%, 20.8%, and 86.8%, accuracies were 91.2%, 58.8%, and 76.5%, and kappa values were 0.823, 0.252, and 0.394, for anterior, superior, posterior lesions, respectively. The most common MR imaging findings were detachment in 60.0% of anterior labrums, high signal intensity cleft in 52.3% of superior labrums, and normal in 60.0% of posterior labrums. Diagnostic accuracy and inter-observer agreement of MR arthrography in the diagnosis of labral lesions are high in anterior labrums and low in superior or posterior labrums. The useful MR imaging findings of labral tears were different according to labral location.
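
    The sensitivity, specificity and accuracy figures above follow from a 2×2 table of MR arthrography against arthroscopy. The sketch below shows the arithmetic; the cell counts are not reported in the abstract and were back-calculated here as an assumption, chosen so that they match the 68 patients, the arthroscopic lesion counts (35, 44, 15) and the reported percentages. The function name is hypothetical.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Sensitivity, specificity and accuracy from 2x2 counts against a reference standard."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return sensitivity, specificity, accuracy


# Assumed counts (tp, fn, tn, fp), back-calculated from the reported percentages.
# They are illustrative only and are not taken from the paper.
assumed_counts = {
    "anterior": (32, 3, 30, 3),
    "superior": (35, 9, 5, 19),
    "posterior": (6, 9, 46, 7),
}

for location, counts in assumed_counts.items():
    sens, spec, acc = diagnostic_metrics(*counts)
    print(f"{location:9s} sensitivity {100 * sens:.1f}%  "
          f"specificity {100 * spec:.1f}%  accuracy {100 * acc:.1f}%")
```

    Read this way, the low superior-labrum specificity corresponds to few true negatives among the 24 patients without a superior lesion at arthroscopy.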

  14. Interrater Reliability of the Categorization of Late Radiographic Changes After Lung Stereotactic Body Radiation Therapy

    Energy Technology Data Exchange (ETDEWEB)

    Faruqi, Salman [Department of Radiation Oncology, Princess Margaret Cancer Centre, Toronto, ON (Canada); Giuliani, Meredith E., E-mail: meredith.giuliani@rmp.uhn.on.ca [Department of Radiation Oncology, Princess Margaret Cancer Centre, Toronto, ON (Canada); Raziee, Hamid; Yap, Mei Ling [Department of Radiation Oncology, Princess Margaret Cancer Centre, Toronto, ON (Canada); Roberts, Heidi [Department of Radiology, University Health Network, Toronto, Ontario (Canada); Le, Lisa W. [Department of Biostatistics, Princess Margaret Cancer Centre, Toronto, Ontario (Canada); Brade, Anthony; Cho, John; Sun, Alexander; Bezjak, Andrea; Hope, Andrew J. [Department of Radiation Oncology, Princess Margaret Cancer Centre, Toronto, ON (Canada)

    2014-08-01

    Purpose: Radiographic changes after lung stereotactic body radiation therapy (SBRT) have been categorized into 4 groups: modified conventional pattern (A), mass-like fibrosis (B), scar-like fibrosis (C), and no evidence of increased density (D). The purpose of this study was to assess the interrater reliability of this categorization system in patients with early-stage non-small cell lung cancer (NSCLC). Methods and Materials: Seventy-seven patients were included in this study, all treated with SBRT for early-stage (T1/2) NSCLC at a single institution, with a minimum follow-up of 6 months. Six experienced clinicians familiar with post-SBRT radiographic changes scored the serial posttreatment CT images independently in a blinded fashion. The proportion of patients categorized as A, B, C, or D at each interval was determined. Krippendorff's alpha (KA), Multirater kappa (M-kappa), and Gwet's AC1 (AC1) scores were used to establish interrater reliability. A leave-one-out analysis was performed to demonstrate the variability among raters. Interrater agreement of the first and last 20 patients scored was calculated to explore whether a training effect existed. Results: The number of ratings ranged from 450 at 6 months to 84 at 48 months of follow-up. The proportion of patients in each category was as follows: A, 45%; B, 16%; C, 13%; and D, 26%. KA and M-kappa ranged from 0.17 to 0.34. AC1 measure range was 0.22 to 0.48. KA increased from 0.24 to 0.36 at 12 months with training. The percent agreement for pattern A peaked at 12 months with a 54% chance of having >50% raters in agreement and decreased over time, whereas that for patterns B and C increased over time to a maximum of 20% and 22%, respectively. Conclusion: This post-SBRT radiographic change categorization system has modest interrater agreement, and there is a suggestion of a training effect. Patterns of fibrosis evolve after SBRT and alternative categorization systems should be evaluated.

  15. Intra-rater and inter-rater reliability of a medical record abstraction study on transition of care after childhood cancer.

    Directory of Open Access Journals (Sweden)

    Micòl E Gianinazzi

    The abstraction of data from medical records is a widespread practice in epidemiological research. However, studies using this means of data collection rarely report reliability. Within the Transition after Childhood Cancer Study (TaCC), which is based on a medical record abstraction, we conducted a second independent abstraction of data with the aim to assess (a) intra-rater reliability of one rater at two time points; (b) the possible learning effects between these two time points compared to a gold-standard; and (c) inter-rater reliability. Within the TaCC study we conducted a systematic medical record abstraction in the 9 Swiss clinics with pediatric oncology wards. In a second phase we selected a subsample of medical records in 3 clinics to conduct a second independent abstraction. We then assessed intra-rater reliability at two time points, the learning effect over time (comparing each rater at two time-points with a gold-standard) and the inter-rater reliability of a selected number of variables. We calculated percentage agreement and Cohen's kappa. For the assessment of the intra-rater reliability we included 154 records (80 for rater 1; 74 for rater 2). For the inter-rater reliability we could include 70 records. Intra-rater reliability was substantial to excellent (Cohen's kappa 0.6-0.8) with an observed percentage agreement of 75%-95%. In all variables learning effects were observed. Inter-rater reliability was substantial to excellent (Cohen's kappa 0.70-0.83) with high agreement ranging from 86% to 100%. Our study showed that data abstracted from medical records are reliable. Investigating intra-rater and inter-rater reliability can give confidence to draw conclusions from the abstracted data and increase data quality by minimizing systematic errors.

  16. Approaches to describing inter-rater reliability of the overall clinical appearance of febrile infants and toddlers in the emergency department

    Directory of Open Access Journals (Sweden)

    Paul Walsh

    2014-11-01

    Objectives. To measure inter-rater agreement of overall clinical appearance of febrile children aged less than 24 months and to compare methods for doing so. Study Design and Setting. We performed an observational study of inter-rater reliability of the assessment of febrile children in a county hospital emergency department serving a mixed urban and rural population. Two emergency medicine healthcare providers independently evaluated the overall clinical appearance of children less than 24 months of age who had presented for fever. They recorded the initial ‘gestalt’ assessment of whether or not the child was ill appearing or if they were unsure. They then repeated this assessment after examining the child. Each rater was blinded to the other’s assessment. Our primary analysis was graphical. We also calculated Cohen’s κ, Gwet’s agreement coefficient and other measures of agreement and weighted variants of these. We examined the effect of time between exams and patient and provider characteristics on inter-rater agreement. Results. We analyzed 159 of the 173 patients enrolled. Median age was 9.5 months (lower and upper quartiles 4.9–14.6), 99/159 (62%) were boys and 22/159 (14%) were admitted. Overall 118/159 (74%) and 119/159 (75%) were classified as well appearing on initial ‘gestalt’ impression by both examiners. Summary statistics varied from 0.223 for weighted κ to 0.635 for Gwet’s AC2. Inter-rater agreement was affected by the time interval between the evaluations and the age of the child but not by the experience levels of the rater pairs. Classifications of ‘not ill appearing’ were more reliable than others. Conclusion. The inter-rater reliability of emergency providers’ assessment of overall clinical appearance was adequate when described graphically and by Gwet’s AC. Different summary statistics yield different results for the same dataset.
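
    The closing remark, that different summary statistics can give different results for the same data, can be illustrated with a small sketch comparing Cohen's κ with the unweighted two-rater form of Gwet's agreement coefficient (AC1). The ratings below are hypothetical and binary, whereas the study also allowed an 'unsure' rating; the function name is an assumption, and the number of categories is taken from the labels actually observed.

```python
from collections import Counter


def kappa_and_ac1(rater_a, rater_b):
    """Return (Cohen's kappa, Gwet's AC1) for two raters and nominal labels."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must score the same non-empty set of items")
    n = len(rater_a)
    categories = sorted(set(rater_a) | set(rater_b))
    q = len(categories)
    if q < 2:
        raise ValueError("at least two categories must be observed")
    p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    # Kappa: chance agreement is the product of the two raters' marginals.
    pe_kappa = sum((freq_a[c] / n) * (freq_b[c] / n) for c in categories)
    # AC1: chance agreement is built from the mean prevalence of each category.
    prevalence = {c: (freq_a[c] + freq_b[c]) / (2 * n) for c in categories}
    pe_ac1 = sum(prevalence[c] * (1 - prevalence[c]) for c in categories) / (q - 1)
    kappa = (p_o - pe_kappa) / (1 - pe_kappa)
    ac1 = (p_o - pe_ac1) / (1 - pe_ac1)
    return kappa, ac1


# Hypothetical ratings of 100 children: most are rated 'well' by both examiners.
examiner_1 = ["well"] * 80 + ["well"] * 10 + ["ill"] * 10
examiner_2 = ["well"] * 80 + ["ill"] * 10 + ["well"] * 10

kappa, ac1 = kappa_and_ac1(examiner_1, examiner_2)
print(f"percent agreement = 80%, kappa = {kappa:.2f}, AC1 = {ac1:.2f}")
```

    In this skewed example the two coefficients disagree sharply despite 80% raw agreement, because κ's chance correction grows large when one category dominates.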

  17. Rater Agreement Indexes for Performance Assessment.

    Science.gov (United States)

    Burry-Stock, Judith A.; And Others

    1996-01-01

    It is argued that interrater agreement is a psychometric property which is theoretically different from classic reliability. Formulas are presented to illustrate a set of algebraically equivalent rater agreement indices that are intended to provide educational and psychological researchers with a practical way to establish a measure of rater…

  18. Reproducibility of tender point examination in chronic low back pain patients as measured by intrarater and inter-rater reliability and agreement

    DEFF Research Database (Denmark)

    Jensen, Ole Kudsk; Callesen, Jacob; Nielsen, Merete Graakjaer

    2013-01-01

    back examination and return-to-work intervention, 43 and 39 patients, respectively (18 women, 46%) entered and completed the study. MAIN OUTCOME MEASURES: The reliability was estimated by the intraclass correlation coefficient (ICC), and agreement was calculated for up to ±3 TPs. Furthermore......, the smallest detectable difference was calculated. RESULTS: TP examination was performed twice by two consultants in rheumatology and rehabilitation at 20 min intervals and repeated 1 week later. Intrarater reliability in the more and less experienced rater was ICC 0.84 (95% CI 0.69 to 0.98) and 0.72 (95% CI 0.......49 to 0.95), respectively. The figures for inter-rater reliability were intermediate between these figures. In more than 70% of the cases, the raters agreed within ±3 TPs in both men and women and between test days. The smallest detectable difference between raters was 5, and for the more and less...

  19. Improved assessment of multiple sclerosis lesion segmentation agreement via detection and outline error estimates

    Directory of Open Access Journals (Sweden)

    Wack David S

    2012-07-01

    Background: Presented is the method “Detection and Outline Error Estimates” (DOEE) for assessing rater agreement in the delineation of multiple sclerosis (MS) lesions. The DOEE method divides operator or rater assessment into two parts: (1) Detection Error (DE), rater agreement in detecting the same regions to mark, and (2) Outline Error (OE), agreement of the raters in outlining the same lesion. Methods: DE, OE and Similarity Index (SI) values were calculated for two raters tested on a set of 17 fluid-attenuated inversion-recovery (FLAIR) images of patients with MS. DE, OE, and SI values were tested for dependence with mean total area (MTA) of the raters' Regions of Interest (ROIs). Results: When correlated with MTA, neither DE (ρ = .056, p = .83) nor the ratio of OE to MTA (ρ = .23, p = .37), referred to as Outline Error Rate (OER), exhibited significant correlation. In contrast, SI is found to be strongly correlated with MTA (ρ = .75, p ...). Conclusions: The DE and OER indices are proposed as a better method than SI for comparing rater agreement of ROIs, which also provide specific information for raters to improve their agreement.

  20. The Use of Bayesian Networks to Assess the Quality of Evidence from Research Synthesis: 2. Inter-Rater Reliability and Comparison with Standard GRADE Assessment.

    Directory of Open Access Journals (Sweden)

    Alexis Llewellyn

    The grades of recommendation, assessment, development and evaluation (GRADE) approach is widely implemented in systematic reviews, health technology assessment and guideline development organisations throughout the world. We have previously reported on the development of the Semi-Automated Quality Assessment Tool (SAQAT), which enables a semi-automated validity assessment based on GRADE criteria. The main advantage to our approach is the potential to improve inter-rater agreement of GRADE assessments particularly when used by less experienced researchers, because such judgements can be complex and challenging to apply without training. This is the first study examining the inter-rater agreement of the SAQAT. We conducted two studies to compare: (a) the inter-rater agreement of two researchers using the SAQAT independently on 28 meta-analyses and (b) the inter-rater agreement between a researcher using the SAQAT (who had no experience of using GRADE) and an experienced member of the GRADE working group conducting a standard GRADE assessment on 15 meta-analyses. There was substantial agreement between independent researchers using the Quality Assessment Tool for all domains (for example, overall GRADE rating: weighted kappa 0.79; 95% CI 0.65 to 0.93). Comparison between the SAQAT and a standard GRADE assessment suggested that inconsistency was parameterised too conservatively by the SAQAT. Therefore the tool was amended. Following amendment we found fair-to-moderate agreement between the standard GRADE assessment and the SAQAT (for example, overall GRADE rating: weighted kappa 0.35; 95% CI 0.09 to 0.87). Despite a need for further research, the SAQAT may aid consistent application of GRADE, particularly by less experienced researchers.

  1. The Problems with the Kappa Statistic as a Metric of Interobserver Agreement on Lesion Detection Using a Third-reader Approach When Locations Are Not Prespecified.

    Science.gov (United States)

    Shih, Joanna H; Greer, Matthew D; Turkbey, Baris

    2018-03-16

    To point out the problems with Cohen kappa statistic and to explore alternative metrics to determine interobserver agreement on lesion detection when locations are not prespecified. Use of kappa and two alternative methods, namely index of specific agreement (ISA) and modified kappa, for measuring interobserver agreement on the location of detected lesions are presented. These indices of agreement are illustrated by application to a retrospective multireader study in which nine readers detected and scored prostate cancer lesions in 163 consecutive patients (n = 110 cases, n = 53 controls) using the guideline of Prostate Imaging Reporting and Data System version 2 on multiparametric magnetic resonance imaging. The proposed modified kappa, which properly corrects for the amount of agreement by chance, is shown to be approximately equivalent to the ISA. In the prostate cancer data, average kappa, modified kappa, and ISA equaled 30%, 55%, and 57%, respectively, for all lesions and 20%, 87%, and 87%, respectively, for index lesions. The application of kappa could result in a substantial downward bias in reader agreement on lesion detection when locations are not prespecified. ISA is recommended for assessment of reader agreement on lesion detection. Published by Elsevier Inc.
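
    For two readers, the index of specific agreement can be sketched as twice the number of lesions marked by both readers divided by the total number of lesions marked by either reader, which needs no count of "jointly negative" locations. The code below is a simplified illustration: it assumes lesions have already been matched to shared location labels, which is itself a non-trivial step, and the function name and labels are hypothetical.

```python
def index_of_specific_agreement(lesions_a, lesions_b):
    """Two-reader index of specific agreement: 2 * |A and B| / (|A| + |B|)."""
    set_a, set_b = set(lesions_a), set(lesions_b)
    total_marked = len(set_a) + len(set_b)
    if total_marked == 0:
        raise ValueError("ISA is undefined when neither reader marks a lesion")
    return 2 * len(set_a & set_b) / total_marked


# Hypothetical matched lesion locations, written as "patient:location".
reader_1 = {"P1:left-apex", "P1:right-mid", "P2:left-base", "P3:right-apex"}
reader_2 = {"P1:left-apex", "P2:left-base", "P3:left-mid"}

isa = index_of_specific_agreement(reader_1, reader_2)
print(f"ISA = {isa:.2f}")
```

    With more than two readers, one option is to average this quantity over reader pairs; the abstract reports averages but does not spell out the procedure, so that extension is left as an assumption.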

  2. Validity and Interrater Reliability of the Visual Quarter-Waste Method for Assessing Food Waste in Middle School and High School Cafeteria Settings.

    Science.gov (United States)

    Getts, Katherine M; Quinn, Emilee L; Johnson, Donna B; Otten, Jennifer J

    2017-11-01

    Measuring food waste (ie, plate waste) in school cafeterias is an important tool to evaluate the effectiveness of school nutrition policies and interventions aimed at increasing consumption of healthier meals. Visual assessment methods are frequently applied in plate waste studies because they are more convenient than weighing. The visual quarter-waste method has become a common tool in studies of school meal waste and consumption, but previous studies of its validity and reliability have used correlation coefficients, which measure association but not necessarily agreement. The aims of this study were to determine, using a statistic measuring interrater agreement, whether the visual quarter-waste method is valid and reliable for assessing food waste in a school cafeteria setting when compared with the gold standard of weighed plate waste. To evaluate validity, researchers used the visual quarter-waste method and weighed food waste from 748 trays at four middle schools and five high schools in one school district in Washington State during May 2014. To assess interrater reliability, researcher pairs independently assessed 59 of the same trays using the visual quarter-waste method. Both validity and reliability were assessed using a weighted κ coefficient. For validity, as compared with the measured weight, 45% of foods assessed using the visual quarter-waste method were in almost perfect agreement, 42% of foods were in substantial agreement, 10% were in moderate agreement, and 3% were in slight agreement. For interrater reliability between pairs of visual assessors, 46% of foods were in perfect agreement, 31% were in almost perfect agreement, 15% were in substantial agreement, and 8% were in moderate agreement. These results suggest that the visual quarter-waste method is a valid and reliable tool for measuring plate waste in school cafeteria settings. Copyright © 2017 Academy of Nutrition and Dietetics. Published by Elsevier Inc. All rights reserved.
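
    A weighted κ of the kind mentioned above gives partial credit when two ratings on an ordinal scale are close. The sketch below is a minimal pure-Python version that assumes linear weights, w = 1 - |i - j| / (q - 1), and ratings coded as integers 0..q-1; the abstract does not state which weighting scheme the authors used, and the function name is hypothetical.

```python
def linear_weighted_kappa(r1, r2, n_categories):
    """Linearly weighted kappa for two raters on an ordinal scale.

    r1, r2: equal-length sequences of integer codes in 0..n_categories-1.
    """
    if len(r1) != len(r2) or len(r1) == 0:
        raise ValueError("ratings must be non-empty and of equal length")
    q = n_categories
    if q < 2:
        raise ValueError("need at least two categories")
    n = len(r1)

    # joint proportions of (rater 1 category, rater 2 category)
    obs = [[0.0] * q for _ in range(q)]
    for x, y in zip(r1, r2):
        obs[x][y] += 1.0 / n
    row = [sum(obs[i]) for i in range(q)]
    col = [sum(obs[i][j] for i in range(q)) for j in range(q)]

    p_o = 0.0  # weighted observed agreement
    p_e = 0.0  # weighted agreement expected by chance
    for i in range(q):
        for j in range(q):
            w = 1.0 - abs(i - j) / (q - 1)  # linear agreement weight
            p_o += w * obs[i][j]
            p_e += w * row[i] * col[j]
    if p_e == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (p_o - p_e) / (1.0 - p_e)


if __name__ == "__main__":
    # illustrative five-level codes (e.g. none, 1/4, 1/2, 3/4, all wasted)
    rater_a = [0, 0, 1, 1, 2, 2, 3, 3, 4, 4]
    near = [0, 1, 1, 1, 2, 2, 3, 3, 4, 4]  # one disagreement, one level off
    far = [0, 4, 1, 1, 2, 2, 3, 3, 4, 4]   # one disagreement, four levels off
    print(linear_weighted_kappa(rater_a, near, 5))
    print(linear_weighted_kappa(rater_a, far, 5))
```

    The two example calls differ only in how far apart the single disagreement is, so the near miss scores higher than the far miss, which an unweighted κ would not distinguish.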

  3. Clinical Functional Capacity Testing in Patients With Facioscapulohumeral Muscular Dystrophy: Construct Validity and Interrater Reliability of Antigravity Tests.

    Science.gov (United States)

    Rijken, Noortje H; van Engelen, Baziel G; Weerdesteyn, Vivian; Geurts, Alexander C

    2015-12-01

    To evaluate the construct validity and interrater reliability of 4 simple antigravity tests in a small group of patients with facioscapulohumeral muscular dystrophy (FSHD). Case-control study. University medical center. Patients with various severity levels of FSHD (n=9) and healthy control subjects (n=10) were included (N=19). Not applicable. A 4-point ordinal scale was designed to grade performance on the following 4 antigravity tests: sit to stance, stance to sit, step up, and step down. In addition, the 6-minute walk test, 10-m walking test, Berg Balance Scale, and timed Up and Go test were administered as conventional tests. Construct validity was determined by linear regression analysis using the Clinical Severity Score (CSS) as the dependent variable. Interrater agreement was tested using a κ analysis. Patients with FSHD performed worse on all 4 antigravity tests compared with the controls. Stronger correlations were found within than between test categories (antigravity vs conventional). The antigravity tests revealed the highest explained variance with regard to the CSS (R²=.86, P=.014). Interrater agreement was generally good. The results of this exploratory study support the construct validity and interrater reliability of the proposed antigravity tests for the assessment of functional capacity in patients with FSHD taking into account the use of compensatory strategies. Future research should further validate these results in a larger sample of patients with FSHD. Copyright © 2015 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.

  4. Interrater Reliability of mHealth App Rating Measures: Analysis of Top Depression and Smoking Cessation Apps.

    Science.gov (United States)

    Powell, Adam C; Torous, John; Chan, Steven; Raynor, Geoffrey Stephen; Shwarts, Erik; Shanahan, Meghan; Landman, Adam B

    2016-02-10

    There are over 165,000 mHealth apps currently available to patients, but few have undergone an external quality review. Furthermore, no standardized review method exists, and little has been done to examine the consistency of the evaluation systems themselves. We sought to determine which measures for evaluating the quality of mHealth apps have the greatest interrater reliability. We identified 22 measures for evaluating the quality of apps from the literature. A panel of 6 reviewers reviewed the top 10 depression apps and 10 smoking cessation apps from the Apple iTunes App Store on these measures. Krippendorff's alpha was calculated for each of the measures and reported by app category and in aggregate. The measure for interactiveness and feedback was found to have the greatest overall interrater reliability (alpha=.69). Presence of password protection (alpha=.65), whether the app was uploaded by a health care agency (alpha=.63), the number of consumer ratings (alpha=.59), and several other measures had moderate interrater reliability (alphas>.5). There was the least agreement over whether apps had errors or performance issues (alpha=.15), stated advertising policies (alpha=.16), and were easy to use (alpha=.18). There were substantial differences in the interrater reliabilities of a number of measures when they were applied to depression versus smoking apps. We found wide variation in the interrater reliability of measures used to evaluate apps, and some measures are more robust across categories of apps than others. The measures with the highest degree of interrater reliability tended to be those that involved the least rater discretion. Clinical quality measures such as effectiveness, ease of use, and performance had relatively poor interrater reliability. Subsequent research is needed to determine consistent means for evaluating the performance of apps. Patients and clinicians should consider conducting their own assessments of apps, in conjunction with

  5. Inter-rater reliability of healthcare professional skills' portfolio assessments: The Andalusian Agency for Healthcare Quality model

    Directory of Open Access Journals (Sweden)

    Antonio Almuedo-Paz

    2014-07-01

    Full Text Available This study aims to determine the reliability of assessment criteria used for a portfolio at the Andalusian Agency for Healthcare Quality (ACSA). Data: all competences certification processes, regardless of their discipline. Period: 2010-2011. Three types of tests are used: 368 certificates, 17,895 reports and 22,642 clinical practice reports (N = 3,010 candidates). The tests were evaluated in pairs by the ACSA team of raters using two categories: valid and invalid. Results: The percentage agreement in assessments of certificates was 89.9%, while for the reports of clinical practice it was 85.1% and for clinical practice reports it was 81.7%. The inter-rater agreement coefficients (kappa) ranged from 0.468 to 0.711. Discussion: The results of this study show that the inter-rater reliability of assessments varies from fair to good. Compared with other similar studies, the results put the reliability of the model in a comfortable position. Among the improvements incorporated, progressive automation of evaluations must be highlighted.

  6. Measuring the quality of life in mild to very severe dementia: testing the inter-rater and intra-rater reliability of the German version of the QUALIDEM.

    Science.gov (United States)

    Dichter, Martin Nikolaus; Schwab, Christian G G; Meyer, Gabriele; Bartholomeyczik, Sabine; Dortmann, Olga; Halek, Margareta

    2014-05-01

    Quality of life (Qol) is an increasingly used outcome measure in dementia research. The QUALIDEM is a dementia-specific and proxy-rated Qol instrument. We aimed to determine the inter-rater and intra-rater reliability in residents with dementia in German nursing homes. The QUALIDEM consists of nine subscales that were applied to a sample of 108 people with mild to severe dementia and six consecutive subscales that were applied to a sample of 53 people with very severe dementia. The proxy raters were 49 registered nurses and nursing assistants. Inter-rater and intra-rater reliability scores were calculated on the subscale and item level. None of the QUALIDEM subscales showed strong inter-rater reliability based on the single-measure Intra-Class Correlation Coefficient (ICC) for absolute agreement ≥ 0.70. Based on the average-measure ICC for four raters, eight subscales for people with mild to severe dementia (care relationship, positive affect, negative affect, restless tense behavior, social relations, social isolation, feeling at home and having something to do) and five subscales for very severe dementia (care relationship, negative affect, restless tense behavior, social relations and social isolation) yielded a strong inter-rater agreement (ICC: 0.72-0.86). All of the QUALIDEM subscales, regardless of dementia severity, showed strong intra-rater agreement. The ICC values ranged between 0.70 and 0.79 for people with mild to severe dementia and between 0.75 and 0.87 for people with very severe dementia. This study demonstrated insufficient inter-rater reliability and sufficient intra-rater reliability for all subscales of both versions of the German QUALIDEM. The degree of inter-rater reliability can be improved by collaborative Qol rating by more than one nurse. The development of a measurement manual with accurate item definitions and a standardized education program for proxy raters is recommended.
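
    The gap between the single-measure and average-measure ICCs reported above can be sketched with the Spearman-Brown relation, ICC_avg = k * ICC_single / (1 + (k - 1) * ICC_single). The sketch assumes both coefficients come from the same ANOVA model and that k raters are averaged; the function names are hypothetical, and the back-calculated numbers are arithmetic illustrations rather than values reported by the study.

```python
def average_measure_icc(single, k):
    """Average-measure ICC for k raters from a single-measure ICC
    (Spearman-Brown relation, same ICC model assumed)."""
    if k < 1:
        raise ValueError("k must be at least 1")
    return k * single / (1 + (k - 1) * single)


def single_measure_icc(average, k):
    """Inverse of average_measure_icc: single-measure ICC implied by an
    average-measure ICC over k raters."""
    if k < 1:
        raise ValueError("k must be at least 1")
    return average / (k - (k - 1) * average)


if __name__ == "__main__":
    k = 4  # number of raters averaged, as in the abstract
    for avg in (0.72, 0.86):  # reported range of average-measure ICCs
        print(avg, "->", single_measure_icc(avg, k))
```

    Under these assumptions, an average-measure ICC between 0.72 and 0.86 for four raters implies single-measure values below 0.70, which is consistent with the abstract's statement that no subscale reached the single-measure threshold.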

  7. Multirater agreement for grading the femoral and tibial cartilage surface lesions at CT arthrography and analysis of causes of disagreement

    International Nuclear Information System (INIS)

    Omoumi, Patrick; Michoux, Nicolas; Larbi, Ahmed; Lacoste, Laure; Lecouvet, Frédéric E.; Perlepe, Vasiliki; Vande Berg, Bruno C.

    2017-01-01

    Highlights: • The multirater agreement of the modified Outerbridge system is only fair when readers of varying levels of experience are taken into account. • Inter- and intra-observer agreement increase with readers' experience. • Interobserver agreement is substantial for grade 4 lesions. • Knowledge of normal variations of cartilage thickness is required to help avoid pitfalls and improve interobserver agreement in reporting cartilage lesions. - Abstract: Objective: To assess the multirater agreement of the modified Outerbridge system for the grading of predefined areas of femorotibial cartilage at CT arthrography with multiple readers, with varying experience. Design: Five readers with varying experience (two junior radiologists, three musculoskeletal radiologists including two experts in cartilage imaging) separately analyzed 962 cartilage sectors from pre-divided knee CT arthrograms with femorotibial osteoarthritis (Kellgren/Lawrence = 3). Each cartilage area was graded twice by each reader, at a three-month interval, according to the modified 5-grade Outerbridge system. Interobserver and intraobserver agreement were assessed. After the second reading, 121 areas exhibiting the highest interobserver disagreement were reviewed in consensus to determine the sources of disagreement. Results: The global interobserver agreement was fair (k = 0.35), and increased with the grade (from k = 0.14 to k = 0.76 from grade 0–4). The intraobserver agreement varied with the readers’ experience from moderate (k = 0.59) to almost perfect (k = 0.92). The majority of cases of disagreement (44%) was due to difficulties in assessing the normal variations of cartilage thickness, including diffuse cartilage thinning (23%) and normal variants of cartilage thickness (22%). 32% of cases of disagreement were due to retrospectively avoidable interpretation errors. Conclusions: The multirater agreement of the modified Outerbridge system is only fair when readers of different

  8. Multirater agreement for grading the femoral and tibial cartilage surface lesions at CT arthrography and analysis of causes of disagreement

    Energy Technology Data Exchange (ETDEWEB)

    Omoumi, Patrick, E-mail: patrick.omoumi@chuv.ch [Department of Radiology, Cliniques Universitaires St Luc − UC Louvain, Hippocrate Avenue 10/2942, B-1200 Brussels (Belgium); Department of Diagnostic and Interventional Radiology, Lausanne University Hospital, Bugnon 46, CH-1011 Lausanne (Switzerland); Michoux, Nicolas; Larbi, Ahmed; Lacoste, Laure; Lecouvet, Frédéric E.; Perlepe, Vasiliki; Vande Berg, Bruno C. [Department of Radiology, Cliniques Universitaires St Luc − UC Louvain, Hippocrate Avenue 10/2942, B-1200 Brussels (Belgium)

    2017-03-15

    Highlights: • The multirater agreement of the modified Outerbridge system is only fair when readers of varying levels of experience are taken into account. • Inter- and intra-observer agreement increase with readers' experience. • Interobserver agreement is substantial for grade 4 lesions. • Knowledge of normal variations of cartilage thickness is required to help avoid pitfalls and improve interobserver agreement in reporting cartilage lesions. - Abstract: Objective: To assess the multirater agreement of the modified Outerbridge system for the grading of predefined areas of femorotibial cartilage at CT arthrography with multiple readers, with varying experience. Design: Five readers with varying experience (two junior radiologists, three musculoskeletal radiologists including two experts in cartilage imaging) separately analyzed 962 cartilage sectors from pre-divided knee CT arthrograms with femorotibial osteoarthritis (Kellgren/Lawrence = 3). Each cartilage area was graded twice by each reader, at a three-month interval, according to the modified 5-grade Outerbridge system. Interobserver and intraobserver agreement were assessed. After the second reading, 121 areas exhibiting the highest interobserver disagreement were reviewed in consensus to determine the sources of disagreement. Results: The global interobserver agreement was fair (k = 0.35), and increased with the grade (from k = 0.14 to k = 0.76 from grade 0–4). The intraobserver agreement varied with the readers’ experience from moderate (k = 0.59) to almost perfect (k = 0.92). The majority of cases of disagreement (44%) was due to difficulties in assessing the normal variations of cartilage thickness, including diffuse cartilage thinning (23%) and normal variants of cartilage thickness (22%). 32% of cases of disagreement were due to retrospectively avoidable interpretation errors. Conclusions: The multirater agreement of the modified Outerbridge system is only fair when readers of different

  9. Interrater and intrarater reliability of the Knosp scale for pituitary adenoma grading.

    Science.gov (United States)

    Mooney, Michael A; Hardesty, Douglas A; Sheehy, John P; Bird, Robert; Chapple, Kristina; White, William L; Little, Andrew S

    2017-05-01

    OBJECTIVE The goal of this study was to determine the interrater and intrarater reliability of the Knosp grading scale for predicting pituitary adenoma cavernous sinus (CS) involvement. METHODS Six independent raters (3 neurosurgery residents, 2 pituitary surgeons, and 1 neuroradiologist) participated in the study. Each rater scored 50 unique pituitary MRI scans (with contrast) of biopsy-proven pituitary adenoma. Reliabilities for the full scale were determined 3 ways: 1) using all 50 scans, 2) using scans with midrange scores versus end scores, and 3) using a dichotomized scale that reflects common clinical practice. The performance of resident raters was compared with that of faculty raters to assess the influence of training level on reliability. RESULTS Overall, the interrater reliability of the Knosp scale was "strong" (0.73, 95% CI 0.56-0.84). However, the percent agreement for all 6 reviewers was only 10% (26% for faculty members, 30% for residents). The reliability of the middle scores (i.e., average rated Knosp Grades 1 and 2) was "very weak" (0.18, 95% CI -0.27 to 0.56) and the percent agreement for all reviewers was only 5%. When the scale was dichotomized into tumors unlikely to have intraoperative CS involvement (Grades 0, 1, and 2) and those likely to have CS involvement (Grades 3 and 4), the reliability was "strong" (0.60, 95% CI 0.39-0.75) and the percent agreement for all raters improved to 60%. There was no significant difference in reliability between residents and faculty (residents 0.72, 95% CI 0.55-0.83 vs faculty 0.73, 95% CI 0.56-0.84). Intrarater reliability was moderate to strong and increased with the level of experience. CONCLUSIONS Although these findings suggest that the Knosp grading scale has acceptable interrater reliability overall, it raises important questions about the "very weak" reliability of the scale's middle grades. By dichotomizing the scale into clinically useful groups, the authors were able to address the poor

  10. Ischemic lesion volume determination on diffusion weighted images vs. apparent diffusion coefficient maps.

    Science.gov (United States)

    Bråtane, Bernt Tore; Bastan, Birgul; Fisher, Marc; Bouley, James; Henninger, Nils

    2009-07-07

    Though diffusion weighted imaging (DWI) is frequently used for identifying the ischemic lesion in focal cerebral ischemia, the understanding of spatiotemporal evolution patterns observed with different analysis methods remains imprecise. DWI and calculated apparent diffusion coefficient (ADC) maps were serially obtained in rat stroke models (MCAO): permanent, 90 min, and 180 min temporary MCAO. Lesion volumes were analyzed in a blinded and randomized manner by 2 investigators using (i) a previously validated ADC threshold, (ii) visual determination of hypointense regions on ADC maps, and (iii) visual determination of hyperintense regions on DWI. Lesion volumes were correlated with 24 hour 2,3,5-triphenyltetrazoliumchloride (TTC)-derived infarct volumes. TTC-derived infarct volumes were not significantly different from the ADC and DWI-derived lesion volumes at the last imaging time points except for significantly smaller DWI lesions in the pMCAO model (p=0.02). Volumetric calculation based on TTC-derived infarct also correlated significantly more strongly with volumetric calculation based on last imaging time point derived lesions on ADC maps than on DWI (p…). Visually determined lesion volumes on ADC maps and DWI by both investigators correlated significantly with threshold-derived lesion volumes on ADC maps, with the former method demonstrating a stronger correlation. There was also a better interrater agreement for ADC map analysis than for DWI analysis. Ischemic lesion determination by ADC was more accurate in final infarct prediction, rater independent, and provided exclusive information on ischemic lesion reversibility.

  11. Inter-rater reliability of assessment of levator ani muscle strength and attachment to the pubic bone in nulliparous women.

    Science.gov (United States)

    van Delft, K; Schwertner-Tiepelmann, N; Thakar, R; Sultan, A H

    2013-09-01

    The modified Oxford scale (MOS) has been found previously to have poor inter-rater reliability, whereas digital assessment of levator ani muscle (LAM) attachment to the pubic bone has been shown to have acceptable reliability. Our aim was to evaluate inter-rater reliability of the validated MOS and to develop a reliable classification system for digital assessment of LAM attachment, correlating this to findings on transperineal ultrasound (TPUS) examination. Evaluation of the MOS by palpation was performed in nulliparous women by two investigators. LAM attachment was evaluated using digital palpation, for which a novel classification system was developed with four grades based on the position of the attachment and presence of discernible muscle. Findings were compared with those on TPUS examination. Inter-rater reliability was assessed using Cohen's kappa statistic. Twenty-five nulliparous women were examined. There was agreement in MOS scores between the investigators in 64% of women (n = 16), with a kappa of 0.66 (indicating substantial agreement). There was agreement in palpation of LAM attachment using the new grading system in 96% of women (n = 24), with a kappa of 0.90 (indicating almost perfect agreement). TPUS examination did not show LAM avulsion in any woman, with the exception of one with a partial avulsion. In this group of nulliparous patients, there was substantial agreement between the two investigators in evaluation of the MOS and there was good agreement between grades of LAM attachment using the new classification system, which correlated with findings on TPUS examination. It therefore appears that these results are reproducible in nulliparous women and the techniques can be readily learned and reliably incorporated into clinical practice and research after appropriate training. Further research is required to establish clinical utility of the grading system for LAM attachment in postpartum women and in women with symptomatic pelvic organ

  12. The inter-rater reliability of the incontinence-associated dermatitis intervention tool-D (IADIT-D) between two independent registered nurses of nursing home residents in long-term care facilities.

    Science.gov (United States)

    Braunschmidt, Brigitte; Müller, Gerhard; Jukic-Puntigam, Margareta; Steininger, Alfred

    2013-01-01

    Incontinence-associated dermatitis (IAD) is the clinical manifestation of moisture related skin damage (Beeckman, Woodward, & Gray, 2011). Valid assessment instruments are needed for risk assessment and classification of IAD. The aim of this quantitative-descriptive cross-sectional study was to determine the inter-rater reliability of the item scores of the German Incontinence Associated Dermatitis Intervention Tool (IADIT-D) between two independent assessors of nursing home residents (n = 381) in long-term care facilities. The 19 pairs of assessors consisted of registered nurses. The data analysis first calculated the total percentage of agreement. Because this value is not corrected for chance agreement, kappa coefficients and the AC1 statistic were calculated as well. The total percentage of the inter-rater agreement was 84% (n = 319). In a second step of the analysis, calculation across all items showed high (kappa = .70) and very high (AC1 = .83) agreement levels, respectively. For the risk assessment (kappa = .82; AC1 = .94), the values amounted to very high agreement levels and for the classification (kappa(w) = .70; AC1 = .76) to high agreement levels. The high to very high agreement values of IADIT-D demonstrate that the items can be regarded as stable with regard to inter-rater reliability for use in long-term care facilities. In addition, further validation studies are needed.
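
    The three statistics named in this abstract can be compared side by side in a short sketch for two raters and nominal categories. It assumes the standard two-rater definitions: Cohen's kappa takes chance agreement from the product of each rater's marginal proportions, while Gwet's AC1 takes it as the sum of pi_k * (1 - pi_k) over categories divided by (q - 1), where pi_k is the mean of the two raters' proportions for category k. The function name and the example ratings are hypothetical and are not data from the study.

```python
from collections import Counter


def agreement_stats(r1, r2):
    """Return (percent agreement, Cohen's kappa, Gwet's AC1) for two raters."""
    if len(r1) != len(r2) or len(r1) == 0:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(r1)
    cats = sorted(set(r1) | set(r2))
    q = len(cats)
    if q < 2:
        raise ValueError("need at least two observed categories")

    p_a = sum(x == y for x, y in zip(r1, r2)) / n  # raw agreement
    c1, c2 = Counter(r1), Counter(r2)

    # Cohen: chance agreement from the product of the raters' marginals
    pe_kappa = sum((c1[k] / n) * (c2[k] / n) for k in cats)
    kappa = (p_a - pe_kappa) / (1 - pe_kappa)

    # Gwet: chance agreement from the averaged marginal proportions
    pi = [(c1[k] + c2[k]) / (2 * n) for k in cats]
    pe_ac1 = sum(p * (1 - p) for p in pi) / (q - 1)
    ac1 = (p_a - pe_ac1) / (1 - pe_ac1)
    return p_a, kappa, ac1


if __name__ == "__main__":
    # made-up ratings with a rare "yes" category
    nurse_1 = ["no"] * 90 + ["yes"] * 5 + ["no"] * 3 + ["yes"] * 2
    nurse_2 = ["no"] * 90 + ["yes"] * 5 + ["yes"] * 3 + ["no"] * 2
    print(agreement_stats(nurse_1, nurse_2))
```

    With a rare category, as in this made-up example, kappa comes out clearly lower than AC1 even though raw agreement is high, which matches the direction of the differences reported in the abstract.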

  13. Blinded evaluation of interrater reliability of an operative competency assessment tool for direct laryngoscopy and rigid bronchoscopy.

    Science.gov (United States)

    Ishman, Stacey L; Benke, James R; Johnson, Kaalan Erik; Zur, Karen B; Jacobs, Ian N; Thorne, Marc C; Brown, David J; Lin, Sandra Y; Bhatti, Nasir; Deutsch, Ellen S

    2012-10-01

    OBJECTIVES To confirm interrater reliability using blinded evaluation of a skills-assessment instrument to assess the surgical performance of resident and fellow trainees performing pediatric direct laryngoscopy and rigid bronchoscopy in simulated models. DESIGN Prospective, paired, blinded observational validation study. SUBJECTS Paired observers from multiple institutions simultaneously evaluated residents and fellows who were performing surgery in an animal laboratory or using high-fidelity manikins. The evaluators had no previous affiliation with the residents and fellows and did not know their year of training. INTERVENTIONS One- and 2-page versions of an objective structured assessment of technical skills (OSATS) assessment instrument composed of global and task-specific surgical items were used to evaluate surgical performance. RESULTS Fifty-two evaluations were completed by 17 attending evaluators. The instrument agreement for the 2-page assessment was 71.4% when measured as a binary variable (ie, competent vs not competent) (κ = 0.38; P = .08). Evaluation as a continuous variable revealed a 42.9% percentage agreement (κ = 0.18; P = .14). The intraclass correlation was 0.53, considered substantial/good interrater reliability (69% reliable). For the 1-page instrument, agreement was 77.4% when measured as a binary variable (κ = 0.53, P = .0015). Agreement when evaluated as a continuous measure was 71.0% (κ = 0.54, P…) … formative feedback on operational competency.

  14. Measurement of the Inter-Rater Reliability Rate Is Mandatory for Improving the Quality of a Medical Database: Experience with the Paulista Lung Cancer Registry.

    Science.gov (United States)

    Lauricella, Leticia L; Costa, Priscila B; Salati, Michele; Pego-Fernandes, Paulo M; Terra, Ricardo M

    2018-06-01

    Database quality measurement should be considered a mandatory step to ensure an adequate level of confidence in data used for research and quality improvement. Several metrics have been described in the literature, but no standardized approach has been established. We aimed to describe a methodological approach applied to measure the quality and inter-rater reliability of a regional multicentric thoracic surgical database (Paulista Lung Cancer Registry). Data from the first 3 years of the Paulista Lung Cancer Registry underwent an audit process with 3 metrics: completeness, consistency, and inter-rater reliability. The first 2 methods were applied to the whole data set, and the last method was calculated using 100 cases randomized for direct auditing. Inter-rater reliability was evaluated using percentage of agreement between the data collector and auditor and through calculation of Cohen's κ and intraclass correlation. The overall completeness per section ranged from 0.88 to 1.00, and the overall consistency was 0.96. Inter-rater reliability showed many variables with high disagreement (>10%). For numerical variables, intraclass correlation was a better metric than inter-rater reliability. Cohen's κ showed that most variables had moderate to substantial agreement. The methodological approach applied to the Paulista Lung Cancer Registry showed that completeness and consistency metrics did not sufficiently reflect the real quality status of a database. The inter-rater reliability associated with κ and intraclass correlation was a better quality metric than completeness and consistency metrics because it could determine the reliability of specific variables used in research or benchmark reports. This report can be a paradigm for future studies of data quality measurement. Copyright © 2018 American College of Surgeons. Published by Elsevier Inc. All rights reserved.

  15. Inter-Rater Reliability of Neck Reflex Points in Women with Chronic Neck Pain.

    Science.gov (United States)

    Weinschenk, Stefan; Göllner, Richard; Hollmann, Markus W; Hotz, Lorenz; Picardi, Susanne; Hubbert, Katharina; Strowitzki, Thomas; Meuser, Thomas

    2016-01-01

    Neck reflex points (NRP) are tender soft tissue areas of the cervical region that display reflectory changes in response to chronic inflammations of correlated regions in the visceral cranium. Six bilateral areas, NRP C0, C1, C2, C3, C4 and C7, are detectable by palpating the lateral neck. We investigated the inter-rater reliability of NRP to assess their potential clinical relevance. 32 consecutive patients with chronic neck pain were examined for NRP tenderness by an experienced physician and an inexperienced medical student in a blinded design. A detailed description of the palpation technique is included in this section. Absence of pain was defined as pain index (PI) = 0, slight tenderness = 1, and marked pain = 2. Findings were evaluated either by pair-wise Cohen's kappa (κ) or by percentage of agreement (PA). Examiners identified 40% and 41% of positive NRP, respectively (PI > 0, physician: 155, student: 157) with a slight preference for the left side (1.2:1). The number of patients identified with >6 positive NRP by the examiners was similar (13 vs. 12 patients). κ values ranged from 0.52 to 0.95. The overall kappa was κ = 0.80 for the left and κ = 0.74 for the right side. PA varied from 78.1% to 96.9% with strongest agreement at NRP C0, NRP C2, and NRP C7. Inter-rater agreement was independent of patients' age, gender, body mass index and examiner's experience. The high reproducibility suggests the clinical relevance of NRP in women. © 2016 S. Karger GmbH, Freiburg.

  16. Interrater reliability of the Volume-Viscosity Swallow Test; screening for dysphagia among hospitalized elderly medical patients.

    Science.gov (United States)

    Jørgensen, Lise Walther; Søndergaard, Kasper; Melgaard, Dorte; Warming, Susan

    2017-12-01

    Oropharyngeal dysphagia (OD) is prevalent among medical and geriatric patients admitted due to acute illness and it is associated with malnutrition, increased length of stay and increased mortality. A valid and reliable bedside screening test for patients at risk of OD is essential in order to detect patients in need of further assessment. The Volume-Viscosity Swallow Test (V-VST) has been shown to be a valid screening test for OD in mixed outpatient populations. However, as reliability of the test has yet to be investigated in a population of medical and geriatric patients admitted due to acute illness, we aimed to determine the interrater reliability of the V-VST in this clinical setting. Reporting in this study is in accordance with proposed guidelines for the reporting of reliability and agreement studies (GRRAS). In three Danish hospitals (CRD-BFH, CRD-GH, NDR-H) 11 skilled occupational therapists examined an unselected group of 110 patients admitted to geriatric or medical wards. In an overall agreement phase raters reached ≥80% agreement before the data collection phase commenced. The V-VST was applied to patients twice within a maximum of one hour by raters who administered the test in an order based on randomization, blinded to each other's results. Agreement, Kappa values, weighted Kappa values and Kappa adjusted for bias and prevalence are reported. The interrater reliability of V-VST as screening test for OD in patients admitted to geriatric or medical wards was substantial with an overall Kappa value of 0.77 (95% CI 0.65-0.89); however, interrater reliability varied among hospitals ranging from 0.37 (95% CI -0.01 to 0.41) to 0.85 (95% CI 0.75-1.00). Interrater reliability of the accompanying recommendations of volume and viscosity was moderate with a weighted kappa value of 0.55 (95% CI 0.37-0.73) for viscosity and 0.53 (95% CI 0.36-0.7) for volume. The overall prevalence of OD was 34.5%, ranging from 8% to 53.6% across hospitals. The prevalence and bias

  17. Quality of nursing intensity data: inter-rater reliability of the patient classification after two decades in clinical use.

    Science.gov (United States)

    Liljamo, Pia; Kinnunen, Ulla-Mari; Ohtonen, Pasi; Saranto, Kaija

    2017-09-01

    The aim of this study was to measure the inter-rater reliability of the Oulu Patient Classification and to discuss existing methods of reliability testing. The Oulu Patient Classification, part of the RAFAELA® System, has been developed to assist nursing managers with the proper allocation of nursing resources. Due to the increased intensity of inpatient care during recent years, there is a need for the reliability testing of the classification, which has been in clinical use for 20 years. Retrospective statistical study. To test inter-rater reliability, a pair of nurses classified the same patients, without knowledge of each other's ratings, as a part of annually conducted standardization. Data on the parallel classifications (n = 19,997) was obtained from inpatient units (n = 32) with different specialties at a university hospital in Finland during 2010-2015. Parallel classification practices were also analysed. The reliability of the overall classification and its subareas were calculated using suitable statistical coefficients. Inter-rater reliability coefficients were a reliable or almost perfect means of considering the nursing intensity category and various practices, but there were detectable differences between subareas. The lowest agreement levels occurred in the subareas 'Planning and Coordination of Nursing Care' and 'Guiding of Care/Continued Care and Emotional Support'. There is a need to develop the descriptions of subareas and to clarify the related concepts. Precise nursing documentation can promote a high level of agreement and reliable results. The traditional overall proportion of agreement does not provide an adequate picture of reliability - weighted kappa coefficients should be used instead. © 2017 John Wiley & Sons Ltd.

  18. An Investigation of Interrater Reliability for the Rorschach Performance Assessment System (R-PAS) in a Nonpatient U.S. Sample.

    Science.gov (United States)

    Kivisalu, Trisha M; Lewey, Jennifer H; Shaffer, Thomas W; Canfield, Merle L

    2016-01-01

    The Rorschach Performance Assessment System (R-PAS) aims to provide an evidence-based approach to administration, coding, and interpretation of the Rorschach Inkblot Method (RIM). R-PAS analyzes individualized communications given by respondents to each card to code a wide pool of possible variables. Due to the large number of possible codes that can be assigned to these responses, it is important to consider the concordance rates among different assessors. This study investigated interrater reliability for R-PAS protocols. Data were analyzed from a nonpatient convenience sample of 50 participants who were recruited through networking, local marketing, and advertising efforts from January 2013 through October 2014. Blind recoding was used and discrepancies between the initial and blind coders' ratings were analyzed for each variable with SPSS yielding percent agreement and intraclass correlation values. Data for Location, Space, Contents, Synthesis, Vague, Pairs, Form Quality, Populars, Determinants, and Cognitive and Thematic codes are presented. Rates of agreement for 1,168 responses were higher for more simplistic coding (e.g., Location), whereas agreement was lower for more complex codes (e.g., Cognitive and Thematic codes). Overall, concordance rates achieved good to excellent agreement. Results suggest R-PAS is an effective method with high interrater reliability supporting its empirical basis.

  19. The interrater and intrarater reliability of the Philpott-Javer staging system based on level of training.

    Science.gov (United States)

    Parhar, Harman S; Thamboo, Andrew; Habib, Al-Rahim; Chang, Brent; Gan, Eng Cern; Javer, Amin R

    2014-04-01

    The Philpott-Javer postoperative endoscopic mucosal staging system for allergic fungal rhinosinusitis has previously demonstrated acceptable interrater reliability among rhinologists. There are, however, numerous learners involved in patient care at tertiary centers. This study aims to analyze the interrater and intrarater reliability of this system among learners in otolaryngology at different stages in training. A prospective analysis of retrospectively collected endoscopic photographs. A tertiary care teaching hospital (January 2013). Fifty patients undergoing routine follow-up. Three photographs from each of 50 patients undergoing routine postsurgical nasoendoscopy were reviewed. Images were played twice, 1 week apart, in 2 differently randomized cycles and scored according to Philpott-Javer criteria by a rhinologist, a rhinology fellow, a senior otolaryngology resident, a junior otolaryngology resident, and a medical student. Interobserver reliability was assessed using the intraclass correlation coefficient, while intrarater reliability was assessed by Shrout-Fleiss κ values. Agreement between each learner and the rhinologist was also assessed using κ values. The intraclass correlation among the 5 raters was 0.7600 (95% confidence interval, 0.6917-0.8161) for the Philpott-Javer scoring system, suggesting substantial reliability. Intrarater data showed substantial to almost-perfect reliability (κ values between 0.668 and 0.815) among all raters using this system. There was also moderate to substantial agreement between the learners and the rhinologist (κ values between 0.534 and 0.710). Results suggest that the Philpott-Javer staging system has acceptable intrarater and interrater reliability among learners of differing levels of clinical experience and is suitable for evaluating progress following surgery.

  20. Interrater reliability of the Saint-Anne Dargassies Scale in assessing the neurological patterns of healthy preterm newborns

    Directory of Open Access Journals (Sweden)

    Carla Ismirna Santos Alves

    Full Text Available Abstract Objectives: to assess the interrater reliability of the Saint-Anne Dargassies Scale in assessing neurological patterns of healthy preterm newborns. Methods: twenty preterm newborns met the inclusion criteria for participation in this prospective study. The neurologic examination was performed using the Saint-Anne Dargassies Scale in newborns showing normal serial cranial ultrasound examinations. In order to test the reliability, the study was structured as follows: group I (rater 1/physiotherapist; rater 2/neonatologist); group II (rater 3/physiotherapist; rater 4/child neurologist) and the gold standard (expert and professor in pediatric neurology). Results: high interrater agreement was observed between groups I - II compared with the gold standard in assessing postural pattern (p<0.01). Regarding the assessment of primitive reflexes, greater agreement was observed in the evaluation of palmar grasp reflex and Moro reflex (p<0.01) for group I compared with the gold standard. An analysis of tone demonstrated heterogeneous agreement, without compromising the reliability of the scale. The probability of equality between measurements of head circumference in the two groups, compared with the gold standard, was observed. Conclusions: the Saint-Anne Dargassies Scale demonstrated high reliability and homogeneity with significant power of reproducibility and may be capable of identifying preterm newborns suspected of having neurological deficits.

  1. Inter-Rater Reliability of Provider Interpretations of Irritable Bowel Syndrome Food and Symptom Journals.

    Science.gov (United States)

    Zia, Jasmine; Chung, Chia-Fang; Xu, Kaiyuan; Dong, Yi; Schenk, Jeanette M; Cain, Kevin; Munson, Sean; Heitkemper, Margaret M

    2017-11-04

    There are currently no standardized methods for identifying trigger food(s) from irritable bowel syndrome (IBS) food and symptom journals. The primary aim of this study was to assess the inter-rater reliability of providers' interpretations of IBS journals. A second aim was to describe whether these interpretations varied for each patient. Eight providers reviewed 17 IBS journals and rated how likely key food groups (fermentable oligo-di-monosaccharides and polyols, high-calorie, gluten, caffeine, high-fiber) were to trigger IBS symptoms for each patient. Agreement of trigger food ratings was calculated using Krippendorff's α-reliability estimate. Providers were also asked to write down recommendations they would give to each patient. Estimates of agreement of trigger food likelihood ratings were poor (average α = 0.07). Most providers gave similar trigger food likelihood ratings for over half the food groups. Four providers gave the exact same written recommendation(s) (range 3-7) to over half the patients. Inter-rater reliability of provider interpretations of IBS food and symptom journals was poor. Providers favored certain trigger food likelihood ratings and written recommendations. This supports the need for a more standardized method for interpreting these journals and/or more rigorous techniques to accurately identify personalized IBS food triggers.

  2. Inter-Rater Reliability of Provider Interpretations of Irritable Bowel Syndrome Food and Symptom Journals

    Directory of Open Access Journals (Sweden)

    Jasmine Zia

    2017-11-01

    Full Text Available There are currently no standardized methods for identifying trigger food(s) from irritable bowel syndrome (IBS) food and symptom journals. The primary aim of this study was to assess the inter-rater reliability of providers’ interpretations of IBS journals. A second aim was to describe whether these interpretations varied for each patient. Eight providers reviewed 17 IBS journals and rated how likely key food groups (fermentable oligo-di-monosaccharides and polyols, high-calorie, gluten, caffeine, high-fiber) were to trigger IBS symptoms for each patient. Agreement of trigger food ratings was calculated using Krippendorff’s α-reliability estimate. Providers were also asked to write down recommendations they would give to each patient. Estimates of agreement of trigger food likelihood ratings were poor (average α = 0.07). Most providers gave similar trigger food likelihood ratings for over half the food groups. Four providers gave the exact same written recommendation(s) (range 3–7) to over half the patients. Inter-rater reliability of provider interpretations of IBS food and symptom journals was poor. Providers favored certain trigger food likelihood ratings and written recommendations. This supports the need for a more standardized method for interpreting these journals and/or more rigorous techniques to accurately identify personalized IBS food triggers.

  3. The impact of revised DSM-5 criteria on the relative distribution and inter-rater reliability of eating disorder diagnoses in a residential treatment setting.

    Science.gov (United States)

    Thomas, Jennifer J; Eddy, Kamryn T; Murray, Helen B; Tromp, Marilou D P; Hartmann, Andrea S; Stone, Melissa T; Levendusky, Philip G; Becker, Anne E

    2015-09-30

    This study evaluated the relative distribution and inter-rater reliability of revised DSM-5 criteria for eating disorders in a residential treatment program. Consecutive adolescent and young adult females (N=150) admitted to a residential eating disorder treatment facility were assigned both DSM-IV and DSM-5 diagnoses by a clinician (n=14) via routine clinical interview and a research assessor (n=4) via structured interview. We compared the frequency of diagnostic assignments under each taxonomy and by type of assessor. We evaluated concordance between clinician and researcher assignment through inter-rater reliability kappa and percent agreement. Significantly fewer patients received either clinician or researcher diagnoses of a residual eating disorder under DSM-5 (clinician-12.0%; researcher-31.3%) versus DSM-IV (clinician-28.7%; researcher-59.3%), with the majority of reassigned DSM-IV residual cases reclassified as DSM-5 anorexia nervosa. Researcher and clinician diagnoses showed moderate inter-rater reliability under DSM-IV (κ=.48) and DSM-5 (κ=.57), though agreement for specific DSM-5 other specified feeding or eating disorder (OSFED) presentations was poor (κ=.05). DSM-5 revisions were associated with significantly less frequent residual eating disorder diagnoses, but not with reduced inter-rater reliability. Findings support specific dimensions of clinical utility for revised DSM-5 criteria for eating disorders. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  4. Inter-rater reliability of three standardized functional tests in patients with low back pain

    Science.gov (United States)

    Tidstrand, Johan; Horneij, Eva

    2009-01-01

    Background Of all patients with low back pain, 85% are diagnosed as "non-specific lumbar pain". Lumbar instability has been described as one specific diagnosis which several authors have described as delayed muscular responses, impaired postural control as well as impaired muscular coordination among these patients. This has mostly been measured and evaluated in a laboratory setting. There are few standardized and evaluated functional tests, examining functional muscular coordination which are also applicable in the non-laboratory setting. In ordinary clinical work, tests of functional muscular coordination should be easy to apply. The aim of this present study was to therefore standardize and examine the inter-rater reliability of three functional tests of muscular functional coordination of the lumbar spine in patients with low back pain. Methods Nineteen consecutive individuals, ten men and nine women were included. (Mean age 42 years, SD ± 12 yrs). Two independent examiners assessed three tests: "single limb stance", "sitting on a Bobath ball with one leg lifted" and "unilateral pelvic lift" on the same occasion. The standardization procedure took altered positions of the spine or pelvis and compensatory movements of the free extremities into account. The inter-rater reliability was analyzed by Cohen's kappa coefficient (κ) and by percentage agreement. Results The inter-rater reliability for the right and the left leg respectively was: for the single limb stance very good (κ: 0.88–1.0), for sitting on a Bobath ball good (κ: 0.79) and very good (κ: 0.88) and for the unilateral pelvic lift: good (κ: 0.61) and moderate (κ: 0.47). Conclusion The present study showed good to very good inter-rater reliability for two standardized tests, that is, the single-limb stance and sitting on a Bobath-ball with one leg lifted. Inter-rater reliability for the unilateral pelvic lift test was moderate to good. Validation of the tests in their ability to evaluate lumbar

  5. Inter-rater reliability of three standardized functional tests in patients with low back pain

    Directory of Open Access Journals (Sweden)

    Tidstrand Johan

    2009-06-01

    Full Text Available Abstract Background Of all patients with low back pain, 85% are diagnosed as "non-specific lumbar pain". Lumbar instability has been described as one specific diagnosis which several authors have described as delayed muscular responses, impaired postural control as well as impaired muscular coordination among these patients. This has mostly been measured and evaluated in a laboratory setting. There are few standardized and evaluated functional tests, examining functional muscular coordination which are also applicable in the non-laboratory setting. In ordinary clinical work, tests of functional muscular coordination should be easy to apply. The aim of this present study was to therefore standardize and examine the inter-rater reliability of three functional tests of muscular functional coordination of the lumbar spine in patients with low back pain. Methods Nineteen consecutive individuals, ten men and nine women were included. (Mean age 42 years, SD ± 12 yrs). Two independent examiners assessed three tests: "single limb stance", "sitting on a Bobath ball with one leg lifted" and "unilateral pelvic lift" on the same occasion. The standardization procedure took altered positions of the spine or pelvis and compensatory movements of the free extremities into account. The inter-rater reliability was analyzed by Cohen's kappa coefficient (κ) and by percentage agreement. Results The inter-rater reliability for the right and the left leg respectively was: for the single limb stance very good (κ: 0.88–1.0), for sitting on a Bobath ball good (κ: 0.79) and very good (κ: 0.88) and for the unilateral pelvic lift: good (κ: 0.61) and moderate (κ: 0.47). Conclusion The present study showed good to very good inter-rater reliability for two standardized tests, that is, the single-limb stance and sitting on a Bobath-ball with one leg lifted. Inter-rater reliability for the unilateral pelvic lift test was moderate to good. Validation of the tests in their

  6. Inter-rater and intrarater reliability of the South African Triage Scale in low-resource settings of Haiti and Afghanistan.

    Science.gov (United States)

    Dalwai, Mohammed; Tayler-Smith, Katie; Twomey, Michèle; Nasim, Masood; Popal, Abdul Qayum; Haqdost, Waliul Haq; Gayraud, Olivia; Cheréstal, Sophia; Wallis, Lee; Valles, Pola

    2018-03-16

    The South African Triage Scale (SATS) has demonstrated good validity in the EDs of Médecins Sans Frontières (MSF)-supported sites in Afghanistan and Haiti; however, corresponding reliability in these settings has not yet been reported on. This study set out to assess the inter-rater and intrarater reliability of the SATS in four MSF-supported EDs in Afghanistan and Haiti (two trauma-only EDs and two mixed (including both medical and trauma cases) EDs). Under classroom conditions between December 2013 and February 2014, ED nurses at each site assigned triage ratings to a set of context-specific vignettes (written case reports of ED patients). Inter-rater reliability was assessed by comparing triage ratings among nurses; intrarater reliability was assessed by asking the nurses to retriage 10 random vignettes from the original set and comparing these duplicate ratings. Inter-rater reliability was calculated using the unweighted kappa, linearly weighted kappa and quadratically weighted kappa (QWK) statistics, and the intraclass correlation coefficient (ICC). Intrarater reliability was calculated according to the percentage of exact agreement and the percentage of agreement allowing for one level of discrepancy in triage ratings. The correlation between years of nursing experience and reliability of the SATS was assessed based on comparison of ICCs and the respective 95% CIs. A total of 67 nurses agreed to participate in the study: In Afghanistan there were 19 nurses from Kunduz Trauma Centre and nine from Ahmed Shah Baba; in Haiti, there were 20 nurses from Martissant Emergency Centre and 19 from Tabarre Surgical and Trauma Centre. Inter-rater agreement was moderate across all sites (ICC range: 0.50-0.60; QWK range: 0.50-0.59) apart from the trauma ED in Haiti where it was moderate to substantial (ICC: 0.58; QWK: 0.61). Intrarater agreement was similar across the four sites (68%-74% exact agreement); when allowing for a one-level discrepancy in triage ratings

  7. Inter-rater reliability of h-index scores calculated by Web of Science and Scopus for clinical epidemiology scientists.

    Science.gov (United States)

    Walker, Benjamin; Alavifard, Sepand; Roberts, Surain; Lanes, Andrea; Ramsay, Tim; Boet, Sylvain

    2016-06-01

    We investigated the inter-rater reliability of Web of Science (WoS) and Scopus when calculating the h-index of 25 senior scientists in the Clinical Epidemiology Program of the Ottawa Hospital Research Institute. Bibliometric information and the h-indices for the subjects were computed by four raters using the automatic calculators in WoS and Scopus. Correlation and agreement between ratings was assessed using Spearman's correlation coefficient and a Bland-Altman plot, respectively. Data could not be gathered from Google Scholar due to feasibility constraints. The Spearman's rank correlation between the h-index of scientists calculated with WoS was 0.81 (95% CI 0.72-0.92) and with Scopus was 0.95 (95% CI 0.92-0.99). The Bland-Altman plot showed no significant rater bias in WoS and Scopus; however, the agreement between ratings is higher in Scopus compared to WoS. Our results showed a stronger relationship and increased agreement between raters when calculating the h-index of a scientist using Scopus compared to WoS. The higher inter-rater reliability and simple user interface used in Scopus may render it the more effective database when calculating the h-index of senior scientists in epidemiology. © 2016 Health Libraries Group.

  8. Assessment of apraxia: inter-rater reliability of a new apraxia test, association between apraxia and other cognitive deficits and prevalence of apraxia in a rehabilitation setting.

    Science.gov (United States)

    Zwinkels, Angeliek; Geusgens, Chantal; van de Sande, Peter; Van Heugten, Caroline

    2004-11-01

    To investigate the inter-rater reliability of a new apraxia test. Furthermore to examine the association of apraxia with other neuropsychological impairments and the prevalence of apraxia in a rehabilitation setting on the basis of the new test. Cross-sectional cohort study, involving 100 patients with a first stroke admitted to a rehabilitation centre in the Netherlands. General patient characteristics and stroke-related aspects. Cognitive screening involving apraxia, visuospatial scanning, abstract thinking and reasoning, memory, attention, planning and aphasia. The indices for inter-rater agreement range from excellent to poor. Significant correlations are found between apraxia and visuospatial scanning, memory, attention, planning and aphasia. The patients with apraxia perform significantly worse than the patients without apraxia on memory, the time needed to complete the tests for scanning and attention, and aphasia. The prevalence of apraxia is 25.3% in the total group, 51.3% in the left hemisphere stroke patients and 6.0% in the right hemisphere stroke patients. Patients with and without apraxia do not differ significantly concerning age, gender and type of stroke. The apraxia test has been shown to be a reliable instrument. Apraxia is often associated with aphasia, memory problems and mental slowness. This study shows that on the basis of the apraxia test, the prevalence of apraxia among patients in the rehabilitation centre is high, especially among patients with left hemisphere lesions.

  9. Intra and Inter-Rater Reliability of Screening for Movement Impairments: Movement Control Tests from The Foundation Matrix

    Science.gov (United States)

    Mischiati, Carolina R.; Comerford, Mark; Gosford, Emma; Swart, Jacqueline; Ewings, Sean; Botha, Nadine; Stokes, Maria; Mottram, Sarah L.

    2015-01-01

    Pre-season screening is well established within the sporting arena, and aims to enhance performance and reduce injury risk. With the increasing need to identify potential injury with greater accuracy, a new risk assessment process has been produced; The Performance Matrix (battery of movement control tests). As with any new method of objective testing, it is fundamental to establish whether the same results can be reproduced between examiners and by the same examiner on consecutive occasions. This study aimed to determine the intra-rater test re-test and inter-rater reliability of tests from a component of The Performance Matrix, The Foundation Matrix. Twenty participants were screened by two experienced musculoskeletal therapists using nine tests to assess the ability to control movement during specific tasks. Movement evaluation criteria for each test were rated as pass or fail. The therapists observed participants real-time and tests were recorded on video to enable repeated ratings four months later to examine intra-rater reliability (videos rated two weeks apart). Overall test percentage agreement was 87% for inter-rater reliability; 98% Rater 1, 94% Rater 2 for test re-test reliability; and 75% for real-time versus video. Intraclass-correlation coefficients (ICCs) were excellent between raters (0.81) and within raters (Rater 1, 0.96; Rater 2, 0.88) but poor for real-time versus video (0.23). Reliability for individual components of each test was more variable: inter-rater, 68-100%; intra-rater, 88-100% Rater 1, 75-100% Rater 2; and real-time versus video 31-100%. Cohen’s Kappa values for inter-rater reliability were 0.0-1.0; intra-rater 0.6-1.0 for Rater 1; -0.1-1.0 for Rater 2; and -0.1-1 for real-time versus video. It is concluded that both inter and intra-rater reliability of tests in The Foundation Matrix are acceptable when rated by experienced therapists. Recommendations are made for modifying some of the criteria to improve reliability where

  10. Inter-rater reliability of AMSTAR is dependent on the pair of reviewers.

    Science.gov (United States)

    Pieper, Dawid; Jacobs, Anja; Weikert, Beate; Fishta, Alba; Wegewitz, Uta

    2017-07-11

    Inter-rater reliability (IRR) is mainly assessed based on only two reviewers of unknown expertise. The aim of this paper is to examine differences in the IRR of the Assessment of Multiple Systematic Reviews (AMSTAR) and R(evised)-AMSTAR depending on the pair of reviewers. Five reviewers independently applied AMSTAR and R-AMSTAR to 16 systematic reviews (eight Cochrane reviews and eight non-Cochrane reviews) from the field of occupational health. Responses were dichotomized and reliability measures were calculated by applying Holsti's method (r) and Cohen's kappa (κ) to all potential pairs of reviewers. Given that five reviewers participated in the study, there were ten possible pairs of reviewers. Inter-rater reliability varied for AMSTAR between r = 0.82 and r = 0.98 (median r = 0.88) using Holsti's method and κ = 0.41 and κ = 0.69 (median κ = 0.52) using Cohen's kappa and for R-AMSTAR between r = 0.77 and r = 0.89 (median r = 0.82) and κ = 0.32 and κ = 0.67 (median κ = 0.45) depending on the pair of reviewers. The same pair of reviewers yielded the highest IRR for both instruments. Pairwise Cohen's kappa reliability measures showed a moderate correlation between AMSTAR and R-AMSTAR (Spearman's ρ =0.50). The mean inter-rater reliability for AMSTAR was highest for item 1 (κ = 1.00) and item 5 (κ = 0.78), while lowest values were found for items 3, 8, 9 and 11, which showed only fair agreement. Inter-rater reliability varies widely depending on the pair of reviewers. There may be some shortcomings associated with conducting reliability studies with only two reviewers. Further studies should include additional reviewers and should probably also take account of their level of expertise.
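
    The pairwise design described in the record above (five reviewers, hence ten possible pairs, each scored with Holsti's method and Cohen's kappa) can be sketched in Python as follows. The dichotomized responses are hypothetical and invented for illustration; they are not the study's data, and the reviewer labels and function names are assumptions. Holsti's coefficient is computed as 2M / (N1 + N2), which reduces to simple percent agreement when both reviewers rate the same items.

```python
from itertools import combinations
from statistics import median


def holsti(a, b):
    """Holsti's coefficient: 2 * agreements / (decisions by a + decisions by b)."""
    agreements = sum(x == y for x, y in zip(a, b))
    return 2 * agreements / (len(a) + len(b))


def cohen_kappa(a, b):
    """Unweighted Cohen's kappa for two raters over the same items."""
    n = len(a)
    p_observed = sum(x == y for x, y in zip(a, b)) / n
    categories = set(a) | set(b)
    p_expected = sum((a.count(c) / n) * (b.count(c) / n) for c in categories)
    # Undefined (division by zero) if both raters always give the same answer.
    return (p_observed - p_expected) / (1 - p_expected)


# Hypothetical dichotomized responses (1 = yes, 0 = no) on eight items.
responses = {
    "R1": [1, 1, 0, 1, 0, 1, 1, 0],
    "R2": [1, 1, 0, 1, 0, 1, 0, 0],
    "R3": [1, 0, 0, 1, 0, 1, 1, 0],
    "R4": [1, 1, 1, 1, 0, 1, 1, 0],
    "R5": [1, 1, 0, 0, 0, 1, 1, 1],
}

# Score every unordered pair of reviewers: C(5, 2) = 10 pairs.
pairwise = {
    (p, q): (holsti(responses[p], responses[q]),
             cohen_kappa(responses[p], responses[q]))
    for p, q in combinations(responses, 2)
}

for (p, q), (r, kappa) in pairwise.items():
    print(f"{p}-{q}: Holsti r = {r:.2f}, kappa = {kappa:.2f}")

median_r = median(r for r, _ in pairwise.values())
median_kappa = median(k for _, k in pairwise.values())
print(f"median r = {median_r:.2f}, median kappa = {median_kappa:.2f}")
```

    Because kappa subtracts chance agreement, it can never exceed the raw agreement for the same pair. That is consistent with the abstract's Holsti values sitting well above its kappa values.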

  11. Examining Design and Inter-Rater Reliability of a Rubric Measuring Research Quality across Multiple Disciplines

    Directory of Open Access Journals (Sweden)

    Marilee J. Bresciani

    2009-05-01

    Full Text Available The paper presents a rubric to help evaluate the quality of research projects. The rubric was applied in a competition across a variety of disciplines during a two-day research symposium at one institution in the southwest region of the United States of America. It was collaboratively designed by a faculty committee at the institution and was administered to 204 undergraduate, master, and doctoral oral presentations by approximately 167 different evaluators. No training or norming of the rubric was given to 147 of the evaluators prior to the competition. The findings of the inter-rater reliability analysis reveal substantial agreement among the judges, which contradicts literature describing the fact that formal norming must occur prior to seeing substantial levels of inter-rater reliability. By presenting the rubric along with the methodology used in its design and evaluation, it is hoped that others will find this to be a useful tool for evaluating documents and for teaching research methods.

  12. Inter-rater reliability of the evaluation of muscular chains associated with posture alterations in scoliosis

    Directory of Open Access Journals (Sweden)

    Fortin Carole

    2012-05-01

    Full Text Available Abstract Background In the Global postural re-education (GPR) evaluation, posture alterations are associated with anterior or posterior muscular chain impairments. Our goal was to assess the reliability of the GPR muscular chain evaluation. Methods Design: Inter-rater reliability study. Fifty physical therapists (PTs) and two experts trained in GPR assessed the standing posture from photographs of five youths with idiopathic scoliosis using a posture analysis grid with 23 posture indices (PI). The PTs and experts indicated the muscular chain associated with posture alterations. The PTs were also divided into three groups according to their experience in GPR. Experts’ results (after consensus) were used to verify agreement between PTs and experts for muscular chain and posture assessments. We used Kappa coefficients (K) and the percentage of agreement (%A) to assess inter-rater reliability and intra-class coefficients (ICC) for determining agreement between PTs and experts. Results For the muscular chain evaluation, reliability was moderate to substantial for 12 PI for the PTs (%A: 56 to 82; K: 0.42 to 0.76) and perfect for 19 PI for the experts. For posture assessment, reliability was moderate to substantial for 12 PI for the PTs (%A > 60%; K: 0.42 to 0.75) and moderate to perfect for 18 PI for the experts (%A: 80 to 100; K: 0.55 to 1.00). The agreement between PTs and experts was good for most muscular chain evaluations (18 PI; ICC: 0.82 to 0.99) and PI (19 PI; ICC: 0.78 to 1.00). Conclusions The GPR muscular chain evaluation has good reliability for most posture indices. GPR evaluation should help guide physical therapists in targeting affected muscles for treatment of abnormal posture patterns.

  13. The intra- and inter-rater reliability of five clinical muscle performance tests in patients with and without neck pain

    Science.gov (United States)

    2013-01-01

    Background This study investigates the reliability of muscle performance tests using cost- and time-effective methods similar to those used in clinical practice. When conducting reliability studies, great effort goes into standardising test procedures to facilitate a stable outcome. Therefore, several test trials are often performed. However, when muscle performance tests are applied in the clinical setting, clinicians often only conduct a muscle performance test once as repeated testing may produce fatigue and pain, thus variation in test results. We aimed to investigate whether cervical muscle performance tests, which have shown promising psychometric properties, would remain reliable when examined under conditions similar to those of daily clinical practice. Methods The intra-rater (between-day) and inter-rater (within-day) reliability was assessed for five cervical muscle performance tests in patients with (n = 33) and without neck pain (n = 30). The five tests were joint position error, the cranio-cervical flexion test, the neck flexor muscle endurance test performed in supine and in a 45°-upright position and a new neck extensor test. Results Intra-rater reliability ranged from moderate to almost perfect agreement for joint position error (ICC ≥ 0.48-0.82), the cranio-cervical flexion test (ICC ≥ 0.69), the neck flexor muscle endurance test performed in supine (ICC ≥ 0.68) and in a 45°-upright position (ICC ≥ 0.41) with the exception of a new test (neck extensor test), which ranged from slight to moderate agreement (ICC = 0.14-0.41). Likewise, inter-rater reliability ranged from moderate to almost perfect agreement for joint position error (ICC ≥ 0.51-0.75), the cranio-cervical flexion test (ICC ≥ 0.85), the neck flexor muscle endurance test performed in supine (ICC ≥ 0.70) and in a 45°-upright position (ICC ≥ 0.56). However, only slight to fair agreement was found for the neck extensor test (ICC

  14. [Inter-rater agreement on self-reported exposure to ergonomic risk factors for the upper extremities among mechanic assemblers in an automotive industry].

    Science.gov (United States)

    d'Errico, Angelo; Fontana, Dario; Merogno, Angela

    2016-01-01

    To assess reproducibility of self-reported exposure to ergonomic hazards for the upper limbs, measured through a questionnaire based on a widely used checklist for the assessment of ergonomic risk (OCRA), in a sample of mechanical assemblers in an automotive industry. Cross-sectional study; reproducibility was assessed as interrater agreement of a composite index of ergonomic risk, estimated through the intraclass correlation coefficient (ICC). 58 mechanical assemblers, working in 29 twin areas characterised by the same work stations and tasks. Composite index of ergonomic risk for the upper limbs. Reproducibility of the ergonomic index was high in the overall sample (ICC: 0.81) and it was higher for the twin areas employing same-gender workers (ICC: 0.96), compared to those with workers of the opposite gender (ICC: 0.66). These results indicate that a questionnaire measuring in great detail the exposure to the main ergonomic risk factors for the upper limbs, such as the one based on the OCRA checklist used for this study, would yield a highly reproducible ergonomic index. If future studies find its validity against the corresponding observational checklist to be high, this questionnaire may represent a useful tool for a preliminary assessment of workers' exposure to ergonomic hazards for the upper limbs.

  15. Intra-rater and inter-rater reliability of the standardized ultrasound protocol for assessing subacromial structures

    DEFF Research Database (Denmark)

    Hougs Kjær, Birgitte; Ellegaard, Karen; Wieland, Ina

    2017-01-01

    BACKGROUND: US-examinations related to shoulder impingement (SI) often vary due to methodological differences, examiner positions, transducers, and recording parameters. Reliable US protocols for examination of different structures related to shoulder impingement are therefore needed. OBJECTIVES...... of the supraspinatus tendon (SUPRA) and subacromial subdeltoid (SASD) bursa in two imaging positions, and the acromial humeral distance (AHD) in one position. Additionally, agreement on dynamic impingement (DI) examination was performed. The intra- and inter-rater reliability was carried out on the same day...

  16. The inter-rater reliability and prognostic value of coma scales in Nepali children with acute encephalitis syndrome.

    Science.gov (United States)

    Ray, Stephen; Rayamajhi, Ajit; Bonnett, Laura J; Solomon, Tom; Kneen, Rachel; Griffiths, Michael J

    2018-02-01

    Background Acute encephalitis syndrome (AES) is a common cause of coma in Nepali children. The Glasgow coma scale (GCS) is used to assess the level of coma in these patients and predict outcome. Alternative coma scales may have better inter-rater reliability and prognostic value in encephalitis in Nepali children, but this has not been studied. The Adelaide coma scale (ACS), Blantyre coma scale (BCS) and the Alert, Verbal, Pain, Unresponsive scale (AVPU) are alternatives to the GCS which can be used. Methods Children aged 1-14 years who presented to Kanti Children's Hospital, Kathmandu with AES between September 2010 and November 2011 were recruited. All four coma scales (GCS, ACS, BCS and AVPU) were applied on admission, 48 h later and on discharge. Inter-rater reliability (unweighted kappa) was measured for each. Correlation and agreement between total coma score and outcome (Liverpool outcome score) were measured by Spearman's rank correlation and Bland-Altman plot. The prognostic value of coma scales alone and in combination with physiological variables was investigated in a subgroup (n = 22). A multivariable logistic regression model was fitted by backward stepwise selection. Results Fifty children were recruited. Inter-rater reliability using the various scales was fair to moderate. However, the scales poorly predicted clinical outcome. Combining the scales with physiological parameters such as systolic blood pressure improved outcome prediction. Conclusion This is the first study to compare four coma scales in Nepali children with AES. The scales exhibited fair to moderate inter-rater reliability. However, the study is inadequately powered to answer the question on the relationship between coma scales and outcome. Further larger studies are required.

  17. Intra- and inter-rater reliability of 3D passive intervertebral motion in subjects with nonspecific neck pain assessed by physical therapy students: A pilot study.

    Science.gov (United States)

    Rossettini, Giacomo; Rondoni, Angie; Lovato, Tommaso; Strobe, Marco; Verzè, Elisa; Vicentini, Marco; Testa, Marco

    2016-06-03

    Passive Intervertebral Movements (PIVMs) are commonly used to assess and treat patients with nonspecific neck pain. Only very few studies have investigated 3D movements until now. This study assessed intra- and inter-rater reliability of three-dimensional (3D) cervical PIVMs performed by physical therapy students in patients with nonspecific neck pain. Thirty-one patients, mean age 47.2 ± 7.2 years, were independently evaluated by 2 physical therapy students. The raters (A and B) assessed mobility, end-feel and pain provocation, performing the 3D cervical segmental side-bending test (3D CSSB) bilaterally from levels C2-C3 to C6-C7. Percentage agreement (raw, positive and negative), Cohen's kappa (95% CI), prevalence index and bias index were calculated to estimate intra- and inter-rater reliability. Intra-rater reliability showed kappa values ranging between fair and substantial (k 0.29-0.80) for pain provocation, mobility and end-feel, with percentage agreements between 61% and 90%. Inter-rater reliability presented kappa values ranging between fair and substantial (k 0.22-0.62) for pain provocation, mobility and end-feel, with percentage agreements between 61% and 80%. Intra-rater reliability of 3D PIVMs was superior to inter-rater reliability in patients with nonspecific neck pain. The most repeatable evaluation parameter was pain. However, the overall poor reliability suggests avoiding the use of these techniques alone to examine patients and measure their outcome. Further studies are needed to investigate the reliability of PIVMs in combination with other assessment procedures in symptomatic patients.
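
    The agreement statistics named in this abstract can be illustrated for a single binary finding rated by two raters. The sketch below is not the authors' code: the function name and the cell counts are hypothetical, and the prevalence and bias indices use the common definitions (absolute cell differences divided by the number of subjects), which the study may or may not have followed exactly.

```python
def two_by_two_agreement(a, b, c, d):
    """Agreement statistics for two raters and one binary finding.

    a: both raters positive
    b: rater A positive, rater B negative
    c: rater A negative, rater B positive
    d: both raters negative
    """
    n = a + b + c + d
    p_observed = (a + d) / n
    # Chance agreement from the raters' marginal totals.
    p_expected = ((a + b) * (a + c) + (c + d) * (b + d)) / n**2
    return {
        "raw_agreement": p_observed,
        "positive_agreement": 2 * a / (2 * a + b + c),
        "negative_agreement": 2 * d / (2 * d + b + c),
        "kappa": (p_observed - p_expected) / (1 - p_expected),
        "prevalence_index": abs(a - d) / n,
        "bias_index": abs(b - c) / n,
    }


# Hypothetical counts for 31 subjects (illustration only, not study data).
result = two_by_two_agreement(a=12, b=4, c=3, d=12)
for name, value in result.items():
    print(f"{name}: {value:.3f}")
```

    Reporting the prevalence and bias indices next to kappa helps explain cases where raw agreement is high but kappa is low, because kappa shrinks when one category dominates.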

  18. Intra-individual comparison of CAIPIRINHA VIBE technique with conventional VIBE sequences in contrast-enhanced MRI of focal liver lesions

    Energy Technology Data Exchange (ETDEWEB)

    Albrecht, M.H., E-mail: MoritzAlbrecht@gmx.net [University Hospital Frankfurt, Department of Diagnostic and Interventional Radiology, Frankfurt (Germany); Medical University of South Carolina, Department of Radiology and Radiological Science, Charleston, SC (United States); Bodelle, B., E-mail: boris.bodelle@kgu.de [University Hospital Frankfurt, Department of Diagnostic and Interventional Radiology, Frankfurt (Germany); Varga-Szemes, A., E-mail: vargaasz@musc.edu [Medical University of South Carolina, Department of Radiology and Radiological Science, Charleston, SC (United States); Dewes, P., E-mail: patricia.dewes@kgu.de [University Hospital Frankfurt, Department of Diagnostic and Interventional Radiology, Frankfurt (Germany); Bucher, A.M., E-mail: andreas.bucher@kgu.de [University Hospital Frankfurt, Department of Diagnostic and Interventional Radiology, Frankfurt (Germany); Ball, B.D., E-mail: BallBr@musc.edu [Medical University of South Carolina, Department of Radiology and Radiological Science, Charleston, SC (United States); De Cecco, C.N., E-mail: dececco@musc.edu [Medical University of South Carolina, Department of Radiology and Radiological Science, Charleston, SC (United States); Schoepf, U.J., E-mail: schoepf@musc.edu [Medical University of South Carolina, Department of Radiology and Radiological Science, Charleston, SC (United States); Zhu, X., E-mail: zhuxun@shzu.edu.cn [Shihezi University, Department of Psychology, Beisi Road, Xinjiang (China); Zangos, S., E-mail: Zangos@em.uni-frankfurt.de [University Hospital Frankfurt, Department of Diagnostic and Interventional Radiology, Frankfurt (Germany); Gruber-Rouh, T., E-mail: tatjanagruber2004@yahoo.de [University Hospital Frankfurt, Department of Diagnostic and Interventional Radiology, Frankfurt (Germany); Wichmann, J.L., E-mail: dochwichmann@gmail.com [University Hospital Frankfurt, Department of Diagnostic and Interventional Radiology, Frankfurt (Germany); and others

    2017-01-15

    Purpose: To evaluate the impact of controlled aliasing in parallel imaging results in higher acceleration (CAIPIRINHA) volume interpolated breath-hold examination (VIBE) magnetic resonance imaging (MRI) technique on image quality, reader confidence, and inter-observer agreement for the assessment of focal liver lesions in comparison with the standard VIBE approach. Material and methods: In this IRB-approved intra-individual comparison study, abdominal arterial and portal-venous contrast-enhanced MRI studies were retrospectively analyzed in 38 patients with malignant liver lesions. Each patient underwent both CAIPIRINHA and conventional VIBE 3 T MRI within 3 months, showing stable disease. Images were evaluated using 5-point rating scales by two blinded radiologists with more than 20 and 5 years of experience in MRI, respectively. Readers scored dignity of liver lesions and assessed which liver segments were affected by malignancy (ranging from 1 = definitely benign/not affected to 5 = definitely malignant/affected by malignancy). Readers also rated overall image quality, sharpness of intrahepatic veins, and diagnostic confidence (ranging from 1 = poor to 5 = excellent). Results: Reviewers achieved a higher inter-observer reliability using CAIPIRINHA when they reported which liver segments were affected by malignancy compared to traditional VIBE series (κ = 0.62 and 0.54, respectively, p < 0.05). Similarly, CAIPIRINHA showed a slightly higher inter-rater agreement for the dignity of focal liver lesions versus the standard VIBE images (κ = 0.50 and 0.49, respectively, p < 0.05). CAIPIRINHA series also scored higher in comparison to standard VIBE sequences (mean scores: image quality, 4.2 and 3.5; sharpness of intrahepatic vessels, 3.8 and 3.2, respectively, p < 0.05) for both reviewers and allowed for higher subjective diagnostic confidence (ratings, 3.8 and 3.2, respectively, p < 0.05). Conclusion: Compared to the standard VIBE approach, CAIPIRINHA VIBE technique

  19. Development and interrater reliability testing of a telephone interview training programme for Australian nurse interviewers.

    Science.gov (United States)

    Ahern, Tracey; Gardner, Anne; Gardner, Glenn; Middleton, Sandy; Della, Phillip

    2013-05-01

    The final phase of a three phase study analysing the implementation and impact of the nurse practitioner role in Australia (the Australian Nurse Practitioner Project or AUSPRAC) was undertaken in 2009, requiring nurse telephone interviewers to gather information about health outcomes directly from patients and their treating nurse practitioners. A team of several registered nurses was recruited and trained as telephone interviewers. The aim of this paper is to report on development and evaluation of the training process for telephone interviewers. The training process involved planning the content and methods to be used in the training session; delivering the session; testing skills and understanding of interviewers post-training; collecting and analysing data to determine the degree to which the training process was successful in meeting objectives and post-training follow-up. All aspects of the training process were informed by established educational principles. Interrater reliability between interviewers was high for well-validated sections of the survey instrument resulting in 100% agreement between interviewers. Other sections with unvalidated questions showed lower agreement (between 75% and 90%). Overall the agreement between interviewers was 92%. Each interviewer was also measured against a specifically developed master script or gold standard and for this each interviewer achieved a percentage of correct answers of 94.7% or better. This equated to a Kappa value of 0.92 or better. The telephone interviewer training process was very effective and achieved high interrater reliability. We argue that the high reliability was due to the use of well validated instruments and the carefully planned programme based on established educational principles. There is limited published literature on how to successfully operationalise educational principles and tailor them for specific research studies; this report addresses this knowledge gap. 
    Copyright © 2012 Elsevier

  20. Quantification of benign lesion regression as a function of 532-nm pulsed potassium titanyl phosphate laser parameter selection.

    Science.gov (United States)

    Mallur, Pavan S; Tajudeen, Bobby A; Aaronson, Nicole; Branski, Ryan C; Amin, Milan R

    2011-03-01

    Although the potassium titanyl phosphate (KTP) laser is versatile, the variability in laser parameters for laryngeal pathologies and the lack of clinical efficacy data remain problematic. We provide preliminary data regarding these parameters for benign lesion regression. In addition, we describe a novel method for the quantification of the effects of the KTP laser on vocal fold (VF) lesions. Retrospective chart review. Images were captured from examinations before and after in-office KTP treatment in patients with a range of benign lesions. Laser settings were noted for each patient. Imaging software was then used to calculate a ratio of lesion area to VF length. Ten percent of images were requantified to determine inter-rater reliability. Thirty-two patients underwent 47 procedures for lesions including hemorrhagic polyp, nonhemorrhagic polyp, vocal process granuloma, Reinke's edema, cyst/pseudocyst, leukoplakia, and squamous cell carcinoma in situ. No statistically significant differences were observed with regard to the laser parameters used as a function of lesion type. Regardless, by 1 month following treatment, all lesions had significantly decreased in size, except nonhemorrhagic polyps. Similar data were obtained at 2-month follow-up. We then compared the pre-KTP lesion size with the smallest lesion size quantified during the 1-year follow-up period. All lesions were significantly smaller, with the exception of Reinke's edema. Inter-rater reliability was quite good. KTP laser effectively reduced VF lesion size, irrespective of the laser parameters used. In addition, our quantification method for lesion size appeared to be both viable and reliable. Copyright © 2011 The American Laryngological, Rhinological, and Otological Society, Inc.

  1. Inter-Rater Reliability and Validity of the Australian Football League’s Kicking and Handball Tests

    Science.gov (United States)

    Cripps, Ashley J.; Hopper, Luke S.; Joyce, Christopher

    2015-01-01

    Talent identification tests used at the Australian Football League’s National Draft Combine assess the capacities of athletes to compete at a professional level. Tests created for the National Draft Combine are also commonly used for talent identification and athlete development in development pathways. The skills tests created by the Australian Football League required players to either handball (striking the ball with the hand) or kick to a series of 6 randomly generated targets. Assessors subjectively rate each skill execution, giving a 0-5 score for each disposal. This study aimed to investigate the inter-rater reliability and validity of the skills tests at an adolescent sub-elite level. Male Australian footballers were recruited from sub-elite adolescent teams (n = 121, age = 15.7 ± 0.3 years, height = 1.77 ± 0.07 m, mass = 69.17 ± 8.08 kg). The coaches (n = 7) of each team were also recruited. Inter-rater reliability was assessed using intra-class correlation coefficients (ICC) and Limits of Agreement statistics. Both the kicking (ICC = 0.96, p [...] handball tests (ICC = 0.89, p [...] handball test. Key points The skill tests created by the AFL demonstrated acceptable levels of relative and absolute inter-rater reliability. Both of the AFL’s skills tests are able to differentiate between athletes’ dominant and non-dominant limbs. However, only the kicking test could consistently differentiate between score outcomes over a range of Australian Football specific disposal distances. Both tests demonstrated poor concurrent validity, with no correlation found between coaches’ perceptions of technical skills and actual skill outcomes measured. PMID:26336356

  2. Interrater reliability and accuracy of clinicians and trained research assistants performing prospective data collection in emergency department patients with potential acute coronary syndrome.

    Science.gov (United States)

    Cruz, Carlos O; Meshberg, Emily B; Shofer, Frances S; McCusker, Christine M; Chang, Anna Marie; Hollander, Judd E

    2009-07-01

    Clinical research requires high-quality data collection. Data collected at the emergency department evaluation is generally considered more precise than data collected through chart abstraction but is cumbersome and time consuming. We test whether trained research assistants without a medical background can obtain clinical research data as accurately as physicians. We hypothesize that they would be at least as accurate because they would not be distracted by clinical requirements. We conducted a prospective comparative study of 33 trained research assistants and 39 physicians (35 residents) to assess interrater reliability with respect to guideline-recommended clinical research data. Immediately after the research assistant and clinician evaluation, the data were compared by a tiebreaker third person who forced the patient to choose one of the 2 answers as the correct one when responses were discordant. Crude percentage agreement and interrater reliability were assessed (kappa statistic). One hundred forty-three patients were recruited (mean age 50.7 years; 47% female patients). Overall, the median agreement was 81% (interquartile range [IQR] 73% to 92%) and interrater reliability was fair (kappa value 0.36 [IQR 0.26 to 0.52]) but varied across categories of data: cardiac risk factors (median 86% [IQR 81% to 93%]; median 0.69 [IQR 0.62 to 0.83]), other cardiac history (median 93% [IQR 79% to 95%]; median 0.56 [IQR 0.29 to 0.77]), pain location (median 92% [IQR 86% to 94%]; median 0.37 [IQR 0.25 to 0.29]), radiation (median 86% [IQR 85% to 87%]; median 0.37 [IQR 0.26 to 0.42]), quality (median 85% [IQR 75% to 94%]; median 0.29 [IQR 0.23 to 0.40]), and associated symptoms (median 74% [IQR 65% to 78%]; median 0.28 [IQR 0.20 to 0.40]). When discordant information was obtained, the research assistant was more often correct (median 64% [IQR 53% to 72%]). The relatively fair interrater reliability observed in our study is consistent with previous studies evaluating

  3. Intrarater and interrater reliability for measurements in videofluoroscopy of swallowing

    International Nuclear Information System (INIS)

    Baijens, Laura; Barikroo, Ali; Pilz, Walmari

    2013-01-01

    Objective: Intrarater and interrater reliability is crucial to the quality of diagnostic and therapy-effect studies. This paper reports on a systematic review of studies on intrarater and interrater reliability for measurements in videofluoroscopy of swallowing. The aim of this review was to summarize and qualitatively analyze published studies on that topic. Materials and methods: Those published up to March 2013 were found through a comprehensive electronic database search using PubMed, Embase, and The Cochrane Library. Two reviewers independently assessed the studies using strict inclusion criteria. Results: Nineteen studies were included and then qualitatively analyzed. In several of these, methodological problems were found. Moreover, intrarater and interrater reliability varied with the measure applied. A meta-analysis was not carried out as studies were not of sufficient quality to warrant doing so. Conclusion: In order to achieve reliable measurements in videofluoroscopy of swallowing, it is recommended that raters use well-defined guidelines for the levels of ordinal visuoperceptual variables. Furthermore, in order to make the measurements reliable (intrarater and interrater) it is recommended that, following protocolled pre-experimental training, the raters should have maximum consensus about the definition of the measured variables

  4. Intrarater and interrater reliability for measurements in videofluoroscopy of swallowing

    Energy Technology Data Exchange (ETDEWEB)

    Baijens, Laura, E-mail: laura.baijens@mumc.nl [Department of Otorhinolaryngology, Head and Neck Surgery, Maastricht University Medical Center, Maastricht (Netherlands); Barikroo, Ali, E-mail: a.Barikroo@ufl.edu [Swallowing Research Laboratory, Department of Speech, Language and Hearing Sciences, College of Public Health and Health Professions, University of Florida, Gainesville, FL (United States); Pilz, Walmari, E-mail: walmari.pilz@mumc.nl [Department of Otorhinolaryngology, Head and Neck Surgery, Maastricht University Medical Center, Maastricht (Netherlands)

    2013-10-01

    Objective: Intrarater and interrater reliability is crucial to the quality of diagnostic and therapy-effect studies. This paper reports on a systematic review of studies on intrarater and interrater reliability for measurements in videofluoroscopy of swallowing. The aim of this review was to summarize and qualitatively analyze published studies on that topic. Materials and methods: Those published up to March 2013 were found through a comprehensive electronic database search using PubMed, Embase, and The Cochrane Library. Two reviewers independently assessed the studies using strict inclusion criteria. Results: Nineteen studies were included and then qualitatively analyzed. In several of these, methodological problems were found. Moreover, intrarater and interrater reliability varied with the measure applied. A meta-analysis was not carried out as studies were not of sufficient quality to warrant doing so. Conclusion: In order to achieve reliable measurements in videofluoroscopy of swallowing, it is recommended that raters use well-defined guidelines for the levels of ordinal visuoperceptual variables. Furthermore, in order to make the measurements reliable (intrarater and interrater) it is recommended that, following protocolled pre-experimental training, the raters should have maximum consensus about the definition of the measured variables.

  5. Inter-Rater Reliability and Agreement of the 6-Minute Walk Test in Women With Hip Fracture

    DEFF Research Database (Denmark)

    Larsen, Camilla Marie; Overgaard, Jan; Tange Kristensen, Morten

    MWT in individuals with hip fractures. Methods: Two senior physiotherapy students independently examined (randomized order) a convenient sample of 20 participants; their assessments were separated by two days, and testing followed instructions from the American Thoracic Society(1). Hip pain...... was assessed with the Verbal Ranking Scale. Results: Participants (all women) with a mean (SD) age of 78.1 ± 5.9 years performed the test within a mean of 31.5 ± 5.8 days post-surgery; 10 had a cervical and 10 a trochanteric fracture. Excellent inter-rater reliability; ICC2.1 =0.92 (95% CI, 0.81 - 0...... = -0.196, P = 0.41). On the contrary, participants walked a mean of 21.7 ± 22.6 meters longer, at the second trial (P = 0.002). Participants with moderate hip fracture- related pain walked a shorter distance than those with no or light pain during the first test (P = 0.04), while this was not the case...

  6. Accuracy of chemical shift MR imaging in diagnosing indeterminate bone marrow lesions in the pelvis: review of a single institution's experience

    International Nuclear Information System (INIS)

    Kohl, Chad A.; Chivers, F.S.; Lorans, Roxanne; Roberts, Catherine C.; Kransdorf, Mark J.

    2014-01-01

    To re-assess the accuracy of chemical shift imaging in diagnosing indeterminate bone marrow lesions as benign or malignant. We retrospectively reviewed our experience with MR imaging of the pelvis to assess the accuracy of chemical shift imaging in distinguishing benign from malignant bone lesions. Two musculoskeletal radiologists retrospectively reviewed all osseous lesions biopsied since 2006, when chemical shift imaging was added to our routine pelvic imaging protocol. Study inclusion criteria required (1) MR imaging of an indeterminate bone marrow lesion about the pelvis and (2) subsequent histologic confirmation. The study group included 50 patients (29 male, 21 female) with an average age of 67 years (range, 41-89 years). MR imaging results were evaluated using biopsy results as the "gold standard." There were 27 malignant and 23 benign lesions. Chemical shift imaging, using an opposed-phase signal loss criterion of less than 20% to indicate a malignant lesion, correctly diagnosed 27/27 malignant lesions and 14/23 benign lesions, yielding 100% sensitivity, 61% specificity, 75% PPV, 100% NPV, and 82% accuracy. The area under the receiver operator characteristic (ROC) curve was 0.88. The inter-rater and intra-rater agreement kappa values were both 1.0. Chemical shift imaging is a useful adjunct MR technique to characterize focal and diffuse marrow abnormalities on routine non-contrast pelvic imaging. It is highly sensitive in identifying malignant disease. Despite its lower specificity, the need for biopsy could be eliminated in more than 60% of patients with benign disease. (orig.)
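
    The accuracy figures in this abstract follow directly from the reported counts, which the short sketch below reproduces. The function name is hypothetical; the counts are taken from the abstract (27 of 27 malignant lesions and 14 of 23 benign lesions correctly diagnosed, so 9 benign lesions were called malignant), with malignancy treated as the positive class.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Standard accuracy measures from the four cells of a 2x2 table."""
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "accuracy": (tp + tn) / (tp + fn + tn + fp),
    }


# Counts derived from the abstract: 27/27 malignant, 14/23 benign correct.
metrics = diagnostic_metrics(tp=27, fn=0, tn=14, fp=9)
for name, value in metrics.items():
    print(f"{name}: {value:.0%}")
```

    Rounded to whole percentages, the results are 100% sensitivity, 61% specificity, 75% PPV, 100% NPV and 82% accuracy, matching the values reported above.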

  7. Computerized back postural assessment in physiotherapy practice: Intra-rater and inter-rater reliability of the MIDAS system.

    Science.gov (United States)

    McAlpine, R T; Bettany-Saltikov, J A; Warren, J G

    2009-01-01

    Assessment of spinal posture during physiotherapy practice is routine, yet few objective measures exist to this end. The Middlesbrough Integrated Digital Assessment System (MIDAS) is a low cost portable system able to record 3D information on posture. The purpose of this study was to assess both the intra-rater and inter-rater reliability of the MIDAS system. Twenty-five healthy subjects were recruited. A repeated measures design was used to record fifteen pre-palpated landmarks on the back of each subject. To limit the sources of variability, the principal researcher palpated the landmarks for each subject. Each of three raters took two measurements on each subject in a standardized upright posture. X (medio-lateral), Y (antero-posterior) and Z (height) landmark positions were recorded via a computer interface. Both intra-rater agreement (mean ICCs - rater 1 r=0.970, rater 2 r=0.965 and rater 3 r=0.965, p [...] MIDAS demonstrated both high inter-rater and intra-rater reliability and provides an objective method for the assessment of posture in physiotherapy practice.

  8. Chest computed tomography-based scoring of thoracic sarcoidosis: Inter-rater reliability of CT abnormalities

    Energy Technology Data Exchange (ETDEWEB)

    Heuvel, D.A.V. den; Es, H.W. van; Heesewijk, J.P. van; Spee, M. [St. Antonius Hospital Nieuwegein, Department of Radiology, Nieuwegein (Netherlands); Jong, P.A. de [University Medical Center Utrecht, Department of Radiology, Utrecht (Netherlands); Zanen, P.; Grutters, J.C. [University Medical Center Utrecht, Division Heart and Lungs, Utrecht (Netherlands); St. Antonius Hospital Nieuwegein, Center of Interstitial Lung Diseases, Department of Pulmonology, Nieuwegein (Netherlands)

    2015-09-15

    To determine inter-rater reliability of sarcoidosis-related computed tomography (CT) findings that can be used for scoring of thoracic sarcoidosis. CT images of 51 patients with sarcoidosis were scored by five chest radiologists for various abnormal CT findings (22 in total) encountered in thoracic sarcoidosis. Using intra-class correlation coefficient (ICC) analysis, inter-rater reliability was analysed and reported according to the Guidelines for Reporting Reliability and Agreement Studies (GRRAS) criteria. A pre-specified sub-analysis was performed to investigate the effect of training. Scoring was trained in a distinct set of 15 scans in which all abnormal CT findings were represented. Median age of the 51 patients (36 men, 70 %) was 43 years (range 26 - 64 years). All radiographic stages were present in this group. ICC ranged from 0.91 for honeycombing to 0.11 for nodular margin (sharp versus ill-defined). The ICC was above 0.60 in 13 of the 22 abnormal findings. Sub-analysis for the best-trained observers demonstrated an ICC improvement for all abnormal findings and values above 0.60 for 16 of the 22 abnormalities. In our cohort, reliability between raters was acceptable for 16 thoracic sarcoidosis-related abnormal CT findings. (orig.)
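
    As an illustration of how an intra-class correlation coefficient can be computed for several raters scoring the same scans, the sketch below implements ICC(2,1) (two-way random effects, absolute agreement, single rater) from the usual ANOVA mean squares. The abstract does not state which ICC form was used, so this choice is an assumption, and the function name and scores are hypothetical.

```python
def icc_2_1(scores):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    `scores` is a list of rows, one per subject, each with one score per rater.
    """
    n = len(scores)      # number of subjects
    k = len(scores[0])   # number of raters
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Sums of squares for subjects (rows), raters (columns) and residual error.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Hypothetical severity scores: 5 scans (rows) rated by 3 readers (columns).
example_scores = [
    [3, 3, 2],
    [1, 2, 1],
    [4, 4, 4],
    [2, 1, 2],
    [0, 1, 0],
]
print(f"ICC(2,1) = {icc_2_1(example_scores):.2f}")
```

    Because this form penalises systematic differences between raters as well as random disagreement, it is usually lower than a consistency-type ICC computed on the same data.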

  9. Chest computed tomography-based scoring of thoracic sarcoidosis: Inter-rater reliability of CT abnormalities

    International Nuclear Information System (INIS)

    Heuvel, D.A.V. den; Es, H.W. van; Heesewijk, J.P. van; Spee, M.; Jong, P.A. de; Zanen, P.; Grutters, J.C.

    2015-01-01

    To determine inter-rater reliability of sarcoidosis-related computed tomography (CT) findings that can be used for scoring of thoracic sarcoidosis. CT images of 51 patients with sarcoidosis were scored by five chest radiologists for various abnormal CT findings (22 in total) encountered in thoracic sarcoidosis. Using intra-class correlation coefficient (ICC) analysis, inter-rater reliability was analysed and reported according to the Guidelines for Reporting Reliability and Agreement Studies (GRRAS) criteria. A pre-specified sub-analysis was performed to investigate the effect of training. Scoring was trained in a distinct set of 15 scans in which all abnormal CT findings were represented. Median age of the 51 patients (36 men, 70 %) was 43 years (range 26 - 64 years). All radiographic stages were present in this group. ICC ranged from 0.91 for honeycombing to 0.11 for nodular margin (sharp versus ill-defined). The ICC was above 0.60 in 13 of the 22 abnormal findings. Sub-analysis for the best-trained observers demonstrated an ICC improvement for all abnormal findings and values above 0.60 for 16 of the 22 abnormalities. In our cohort, reliability between raters was acceptable for 16 thoracic sarcoidosis-related abnormal CT findings. (orig.)

  10. SPSS Macros for Assessing the Reliability and Agreement of Student Evaluations of Teaching

    Science.gov (United States)

    Morley, Donald D.

    2009-01-01

    This article reports and demonstrates two SPSS macros for calculating Krippendorff's alpha and intraclass reliability coefficients in repetitive situations where numerous coefficients are needed. Specifically, the reported SPSS macros were used to evaluate the interrater agreement and reliability of student evaluations of teaching in thousands of…

  11. Inter-rater reliability of shoulder measurements in middle-aged women.

    Science.gov (United States)

    De Groef, A; Van Kampen, M; Vervloesem, N; Clabau, E; Christiaens, M-R; Neven, P; Geraerts, I; Struyf, F; Devoogdt, N

    2017-06-01

    To investigate inter-rater reliability of a set of shoulder measurements including inclinometry [shoulder range of motion (ROM)], acromion-table distance and pectoralis minor muscle length (static scapular positioning), upward rotation with two inclinometers (scapular kinematics) and pain pressure thresholds (muscle tenderness) in middle-aged women. Observational study. Thirty symptom-free middle-aged women (first cohort) were measured by two raters. All measurements with an intraclass correlation coefficient (ICC) below 0.75 were retested after an additional training period in a second cohort of 30 symptom-free middle-aged women. Inter-rater reliability of all variables was measured with the ICC (95% confidence interval) and standard error of measurement (SEM). Acromion-table distance (ICC=0.91, SEM 0.22 to 0.28% of body length), pectoralis minor muscle length (ICC=0.91, SEM 0.16% of body length), pain pressure thresholds (ICC=0.78 to 0.85, SEM 0.39 to 0.70kg) and abduction ROM (ICC=0.77, SEM 5°) showed good to excellent inter-rater reliability in the first cohort. After an additional training period, forward flexion ROM showed good inter-rater reliability (ICC=0.83, SEM 5°), scapular upward rotation in resting position showed moderate reliability (ICC=0.52, SEM 2°), and other scaption angles showed weak reliability (ICC=0.26 to 0.43, SEM 3 to 8°). In a battery of clinical tools to evaluate factors contributing to shoulder pain, static scapular positioning and pressure pain thresholds were found to have good to excellent inter-rater reliability in middle-aged women. Additional training is recommended for measurements with a gravity inclinometer. Copyright © 2016 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.
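
    The standard error of measurement reported alongside each ICC is commonly derived as SEM = SD × sqrt(1 − ICC), where SD is the between-subject standard deviation of the measurement. The sketch below applies that formula; whether the authors used exactly this derivation is an assumption, and the function name and the SD value are hypothetical.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from a between-subject SD and an ICC."""
    if not 0.0 <= icc <= 1.0:
        raise ValueError("icc must lie between 0 and 1")
    return sd * math.sqrt(1.0 - icc)


# Hypothetical SD of 12 degrees combined with the ICC of 0.83 reported for
# forward flexion ROM; the SD is an illustrative value, not study data.
sem = sem_from_icc(sd=12.0, icc=0.83)
print(f"SEM = {sem:.1f} degrees")
```

    The formula shows why a measure with a modest ICC can still have a small SEM when the sample is homogeneous, and why the two statistics are best reported together.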

  12. Assessing the suitability of written stroke materials: an evaluation of the interrater reliability of the suitability assessment of materials (SAM) checklist.

    Science.gov (United States)

    Hoffmann, Tammy; Ladner, Yvette

    2012-01-01

    Written materials are frequently used to provide education to stroke patients and their carers. However, poor quality materials are a barrier to effective information provision. A quick and reliable method of evaluating material quality is needed. This study evaluated the interrater reliability of the Suitability Assessment of Materials (SAM) checklist in a sample of written stroke education materials. Two independent raters evaluated the materials (n = 25) using the SAM, and ratings were analyzed to reveal total percentage agreements and weighted kappa values for individual items and overall SAM rating. The majority of the individual SAM items had high interrater reliability, with 17 of the 22 items achieving substantial, almost perfect, or perfect weighted kappa value scores. The overall SAM rating achieved a weighted kappa value of 0.60, with a percentage total agreement of 96%. Health care professionals should evaluate the content and design characteristics of written education materials before using them with patients. A tool such as the SAM checklist can be used; however, raters should exercise caution when interpreting results from items with more subjective scoring criteria. Refinements to the scoring criteria for these items are recommended. The value of the SAM is that it can be used to identify specific elements that should be modified before education materials are provided to patients.
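
    A weighted kappa gives partial credit when two raters choose adjacent ordinal categories instead of treating every disagreement as equally severe. The sketch below is a minimal implementation with linear or quadratic disagreement weights. The abstract does not say which weighting scheme was used, and the function name, the 0-2 category coding and the ratings are hypothetical.

```python
def weighted_kappa(ratings_a, ratings_b, n_categories, weighting="linear"):
    """Weighted kappa for two raters using ordinal codes 0..n_categories-1."""
    n = len(ratings_a)
    if n == 0 or n != len(ratings_b):
        raise ValueError("both raters must rate the same, non-empty item set")
    k = n_categories

    # Observed joint proportions and each rater's marginal proportions.
    observed = [[0.0] * k for _ in range(k)]
    for a, b in zip(ratings_a, ratings_b):
        observed[a][b] += 1.0 / n
    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    power = 1 if weighting == "linear" else 2
    observed_disagreement = 0.0
    expected_disagreement = 0.0
    for i in range(k):
        for j in range(k):
            weight = (abs(i - j) / (k - 1)) ** power
            observed_disagreement += weight * observed[i][j]
            expected_disagreement += weight * row[i] * col[j]

    # Undefined (division by zero) if both raters use a single category only.
    return 1.0 - observed_disagreement / expected_disagreement


# Hypothetical ratings of 8 items on a 0-2 ordinal scale (illustration only).
rater_a = [2, 1, 0, 2, 1, 1, 0, 2]
rater_b = [2, 1, 1, 2, 0, 1, 0, 1]
print(f"linear weighted kappa = {weighted_kappa(rater_a, rater_b, 3):.2f}")
```

    With only two categories the weights reduce to 0 and 1, so the result equals the unweighted Cohen's kappa.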

  13. Exploring Differences in Measurement and Reporting of Classroom Observation Inter-Rater Reliability

    Science.gov (United States)

    Wilhelm, Anne Garrison; Gillespie Rouse, Amy; Jones, Francesca

    2018-01-01

    Although inter-rater reliability is an important aspect of using observational instruments, it has received little theoretical attention. In this article, we offer some guidance for practitioners and consumers of classroom observations so that they can make decisions about inter-rater reliability, both for study design and in the reporting of data…

  14. Reliability of MRI assessment of acute musculotendinous groin injuries in athletes

    Energy Technology Data Exchange (ETDEWEB)

    Serner, Andreas; Hoelmich, Per [Aspetar, Orthopaedic and Sports Medicine Hospital, Sports City Street, P.O. Box 29222, Doha (Qatar); Copenhagen University Hospital, Sports Orthopaedic Research Center-Copenhagen (SORC-C), Department of Orthopedic Surgery, Amager-Hvidovre (Denmark); Roemer, Frank W. [University of Erlangen-Nuremberg, Department of Radiology, Erlangen (Germany); Boston University School of Medicine, Quantitative Imaging Center (QIC), Department of Radiology, Boston, MA (United States); Thorborg, Kristian [Copenhagen University Hospital, Sports Orthopaedic Research Center-Copenhagen (SORC-C), Department of Orthopedic Surgery, Amager-Hvidovre (Denmark); Niu, Jingbo [Boston University School of Medicine, Clinical Epidemiology and Training Unit, Department of Medicine, Boston, MA (United States); Weir, Adam [Aspetar, Orthopaedic and Sports Medicine Hospital, Sports City Street, P.O. Box 29222, Doha (Qatar); Tol, Johannes L. [Aspetar, Orthopaedic and Sports Medicine Hospital, Sports City Street, P.O. Box 29222, Doha (Qatar); OLVG West, The Sports Physician Group, Department of Sports Medicine, Amsterdam (Netherlands); Academic Medical Center, Amsterdam Center of Evidence Based Sports Medicine, Amsterdam (Netherlands); Guermazi, Ali [Boston University School of Medicine, Quantitative Imaging Center (QIC), Department of Radiology, Boston, MA (United States)

    2017-04-15

    To describe a multi-dimensional MRI assessment approach with a focus on acute musculotendinous groin lesions, and to evaluate scoring reproducibility. Male athletes who participated in competitive sports and presented within 7 days of an acute onset of sports-related groin pain were included. All athletes underwent MRI (1.5 T) according to a standardized groin-centred protocol. From several calibration sessions, a system was developed assessing grade, location and extent of muscle strains, peri-lesional haematoma, as well as other non-acute findings commonly associated with long-standing groin pain. Kappa (K) statistics and intraclass correlation coefficients (ICCs) were used to describe intra- and inter-rater reproducibility. Seventy-five athletes (mean age 26.6 ± 4.4 years) were included in the analyses, and 85 different acute lesions were observed. Adductor longus lesions were most common (42.7 %) followed by rectus femoris lesions (16.3 %). Kappa values ranged between 0.70 and 1.00 for almost all categorical features for acute lesions, with almost perfect intra- and inter-rater agreement (K = 0.89-1.00) for presence, number, location and grading of lesions. ICCs ranged between 0.77 and 1.00 for continuous measures of acute lesion extent. A standardized MRI assessment approach of acute groin injuries was described and showed good intra- and inter-rater reproducibility. (orig.)

  15. Reliability of MRI assessment of acute musculotendinous groin injuries in athletes

    International Nuclear Information System (INIS)

    Serner, Andreas; Hoelmich, Per; Roemer, Frank W.; Thorborg, Kristian; Niu, Jingbo; Weir, Adam; Tol, Johannes L.; Guermazi, Ali

    2017-01-01

    To describe a multi-dimensional MRI assessment approach with a focus on acute musculotendinous groin lesions, and to evaluate scoring reproducibility. Male athletes who participated in competitive sports and presented within 7 days of an acute onset of sports-related groin pain were included. All athletes underwent MRI (1.5 T) according to a standardized groin-centred protocol. From several calibration sessions, a system was developed assessing grade, location and extent of muscle strains, peri-lesional haematoma, as well as other non-acute findings commonly associated with long-standing groin pain. Kappa (K) statistics and intraclass correlation coefficients (ICCs) were used to describe intra- and inter-rater reproducibility. Seventy-five athletes (mean age 26.6 ± 4.4 years) were included in the analyses, and 85 different acute lesions were observed. Adductor longus lesions were most common (42.7 %) followed by rectus femoris lesions (16.3 %). Kappa values ranged between 0.70 and 1.00 for almost all categorical features for acute lesions, with almost perfect intra- and inter-rater agreement (K = 0.89-1.00) for presence, number, location and grading of lesions. ICCs ranged between 0.77 and 1.00 for continuous measures of acute lesion extent. A standardized MRI assessment approach of acute groin injuries was described and showed good intra- and inter-rater reproducibility. (orig.)

  16. Reevaluating Interrater Reliability in Offender Risk Assessment

    NARCIS (Netherlands)

    van der Knaap, L.M.; Leenarts, L.E.W.; Born, M.P.; Oosterveld, P.

    2012-01-01

    Offender risk and needs assessment, one of the pillars of the risk-need-responsivity model of offender rehabilitation, usually depends on raters assessing offender risk and needs. The few available studies of interrater reliability in offender risk assessment are, however, limited in the

  17. Reproducibility of range of motion and muscle strength measurements in patients with hip osteoarthritis – an interrater study

    DEFF Research Database (Denmark)

    Poulsen, Erik; Christensen, Henrik Wulff; Penny, Jeannette Østergaard

    2012-01-01

    ABSTRACT: BACKGROUND: Assessment of range of motion (ROM) and muscle strength is fundamental in the clinical diagnosis of hip osteoarthritis (OA) but reproducibility of these measurements has mostly involved clinicians from secondary care and has rarely reported agreement parameters. Therefore......, the primary objective of the study was to determine the inter-rater reproducibility of ROM and muscle strength measurements. Furthermore, the reliability of the overall assessment of clinical hip OA was evaluated. Reporting is in accordance with proposed guidelines for the reporting of reliability...... differentiate between hips with or without osteoarthritis....

  18. Inter-Rater Reliability of Cyclotorsion Measurements Using Fundus Photography.

    Science.gov (United States)

    Dysli, Muriel; Kanku, Madeleine; Traber, Ghislaine L

    2018-04-01

    The foveo-papillary angle (FPA) on fundus photographs is the accepted standard for the measurement of ocular cyclotorsion. We assessed the inter-rater reliability of this method in healthy subjects and in patients with trochlear nerve palsies. In this methodological study, fundus photographs of healthy subjects and of patients with trochlear nerve palsies were made with a fundus camera (Zeiss Fundus Camera FF 450 plus, Jena, Germany). Three independent observers measured the FPA on the fundus photographs of all subjects in synedra View (synedra View 16, Version 16.0.0.11, Innsbruck, Austria). One hundred and four eyes of 52 subjects (26 healthy controls and 26 patients) were assessed. The mean FPA of the healthy controls was 5.80 degrees (°) [± 0.44 standard error of the mean (SEM)] compared to 11.55° (± 0.80 SEM) for patients with trochlear nerve palsies. The inter-rater reliability of all measured FPAs showed an intraclass correlation coefficient (ICC) of 0.98 (95% CI 0.97 - 0.98). The inter-rater reliability of objective cyclotorsion measurements using fundus photographs was very high. Georg Thieme Verlag KG Stuttgart · New York.

  19. The Reliability of Assessing Radiographic Healing of Osteochondritis Dissecans of the Knee.

    Science.gov (United States)

    Wall, Eric J; Milewski, Matthew D; Carey, James L; Shea, Kevin G; Ganley, Theodore J; Polousky, John D; Grimm, Nathan L; Eismann, Emily A; Jacobs, Jake C; Murnaghan, Lucas; Nissen, Carl W; Myer, Gregory D; Weiss, Jennifer; Edmonds, Eric W; Anderson, Allen F; Lyon, Roger M; Heyworth, Benton E; Fabricant, Peter D; Zbojniewicz, Andy

    2017-05-01

    The reliability of assessing healing on plain radiographs has not been well-established for knee osteochondritis dissecans (OCD). To determine the inter- and intrarater reliability of specific radiographic criteria in judging healing of femoral condyle OCD. Cohort study (Diagnosis); Level of evidence, 3. Ten orthopedic sports surgeons rated the radiographic healing of 30 knee OCD lesions at 2 time points, a minimum of 1 month apart. First, raters compared pretreatment and 2-year follow-up radiographs on "overall healing" and on 5 subfeatures of healing, including OCD boundary, sclerosis, size, shape, and ossification using a continuous slider scale. "Overall healing" was also rated using a 7-tier ordinal scale. Raters then compared the same 30 pretreatment knee radiographs in a stepwise progression to the 2-, 4-, 7-, 12-, and 24-month follow-up radiographs on "overall healing" using a continuous slider scale. Interrater and intrarater reliability were assessed using intraclass correlations (ICC) derived from a 2-way mixed effects analysis of variance for absolute agreement. Overall healing of the OCD lesions from pretreatment to 2-year follow-up radiographs was rated with excellent interrater reliability (ICC = 0.94) and intrarater reliability (ICC = 0.84) when using a continuous scale. The reliability of the 5 subfeatures of healing was also excellent (interrater ICCs of 0.87-0.89; intrarater ICCs of 0.74-0.84). The 7-tier ordinal scale rating of overall healing had lower interrater (ICC = 0.61) and intrarater (ICC = 0.68) reliability. The overall healing of OCD lesions at the 5 time points up to 24 months had interrater ICCs of 0.81-0.88 and intrarater ICCs of 0.65-0.70. Interrater reliability was excellent when judging the overall healing of OCD femoral condyle lesions on radiographs as well as on 5 specific features of healing on 2-year follow-up radiographs. 
    Continuous scale rating of OCD radiographic healing yielded higher reliability than the ordinal scale

  20. Poor Agreement Among Expert Witnesses in Bile Duct Injury Malpractice Litigation An Expert Panel Survey

    NARCIS (Netherlands)

    de Reuver, Philip R.; Dijkgraaf, Marcel G. W.; Gevers, Sjef K. M.; Gouma, Dirk J.

    2008-01-01

    Objective: To determine the inter-rater agreement of expert witness testimonies in bile duct injury malpractice litigation. Background Data: Malpractice litigation is an increasing concern in modern surgical practice. As most of the lawyers are not educated in medicine, expert witnesses are asked to

  1. Poor agreement among expert witnesses in bile duct injury malpractice litigation: an expert panel survey.

    NARCIS (Netherlands)

    Reuver, P.R. de; Dijkgraaf, M.G.; Gevers, S.K.; Gouma, D.J.; Bleichrodt, R.P.; Cuesta, M.A.; Erp, W.F. van; Gerritsen, J.; Hesselink, E.J.; Laarhoven, C.J.H.M. van; Lange, J. de; Obertop, H.; Stassen, L.P.; Terpstra, O.T.; Tilanus, H.W.; Vroonhoven, T.J.; Wit, L. de

    2008-01-01

    OBJECTIVE: To determine the inter-rater agreement of expert witness testimonies in bile duct injury malpractice litigation. BACKGROUND DATA: Malpractice litigation is an increasing concern in modern surgical practice. As most of the lawyers are not educated in medicine, expert witnesses are asked to

  2. Inter-rater reliability of the Greek version of CAARMS among two groups of mental health professionals.

    Science.gov (United States)

    Kollias, C; Kontaxakis, V; Havaki-Kontaxaki, B; Simmons, M B; Stefanis, N; Papageorgiou, C

    2015-01-01

    There is increasing interest within the Greek psychiatric community in the early detection and prevention of psychotic disorders. To support this, there is a need for a valid and reliable tool to identify young people who may be at risk of developing a psychotic disorder. Our team has previously translated the Comprehensive Assessment of At-Risk Mental States (CAARMS). The validity of the CAARMS was ensured by the translation procedure, and the aim of the current study was to estimate the interrater reliability of the Greek translation of the CAARMS among residents in psychiatry and specialized mental health professionals. Forty-three mental health workers (27 residents in psychiatry and 16 specialized mental health professionals, i.e. 11 psychiatrists and 5 psychologists) participated in two seminars that covered theoretical information about the ultra-high-risk concept and training in the CAARMS. During the seminars, 10 vignettes with psychiatric case histories were presented, including healthy, ultra-high-risk and first-episode psychosis cases. The mean correlated percentage of agreement with the correct answers regarding diagnosis of the presented case histories was 81.42 among all subjects, 77.88 among specialized mental health professionals, and 84.46 among residents. Intraclass correlation coefficients were 0.994 for specialized mental health professionals and 0.997 for residents. The translated Greek version of the CAARMS shows satisfactory interrater reliability when used by both residents and specialized mental health professionals. Residents showed even higher intraclass correlation coefficients and mean correlated percentage of agreement than specialized mental health professionals, which indicates that residents are capable of using the CAARMS in early intervention units.

  3. Validation and inter-rater reliability of a three item falls risk screening tool

    Directory of Open Access Journals (Sweden)

    Catherine Maree Said

    2017-11-01

    Background: Falls screening tools are routinely used in hospital settings, and the psychometric properties of tools should be examined in the setting in which they are used. The aim of this study was to explore the concurrent and predictive validity of the Austin Health Falls Risk Screening Tool (AHFRST), compared with The Northern Hospital Modified St Thomas’s Risk Assessment Tool (TNH-STRATIFY), and the inter-rater reliability of the AHFRST. Methods: A research physiotherapist used the AHFRST and TNH-STRATIFY to classify 130 participants admitted to Austin Health (five acute wards, n = 115; two subacute wards, n = 15; median length of stay 6 days, IQR 3–12) as ‘High’ or ‘Low’ falls risk. The AHFRST was also completed by nursing staff on patient admission. Falls data were collected from the hospital incident reporting system. Results: Six falls occurred during the study period (fall rate of 4.6 falls per 1000 bed days). There was substantial agreement between the AHFRST and the TNH-STRATIFY (Kappa = 0.68, 95% CI 0.52–0.78). Both tools had poor predictive validity, with low specificity (AHFRST 46.0%, 95% CI 37.0–55.1; TNH-STRATIFY 34.7%, 95% CI 26.4–43.7) and positive predictive values (AHFRST 5.6%, 95% CI 1.6–13.8; TNH-STRATIFY 6.9%, 95% CI 2.6–14.4). The AHFRST showed moderate inter-rater reliability (Kappa = 0.54, 95% CI 0.36–0.67, p < 0.001), although 18 patients did not have the AHFRST completed by nursing staff. Conclusions: There was an acceptable level of agreement between the 3-item AHFRST classification of falls risk and the longer, 9-item TNH-STRATIFY classification. However, both tools demonstrated limited predictive validity in the Austin Health population. The results highlight the importance of evaluating the validity of falls screening tools, and the clinical utility of these tools should be reconsidered.

  4. Inter-rater reliability of case-note audit: a systematic review.

    Science.gov (United States)

    Lilford, Richard; Edwards, Alex; Girling, Alan; Hofer, Timothy; Di Tanna, Gian Luca; Petty, Jane; Nicholl, Jon

    2007-07-01

    The quality of clinical care is often assessed by retrospective examination of case-notes (charts, medical records). Our objective was to determine the inter-rater reliability of case-note audit. We conducted a systematic review of the inter-rater reliability of case-note audit. Analysis was restricted to 26 papers reporting comparisons of two or three raters making independent judgements about the quality of care. Sixty-six separate comparisons were possible, since some papers reported more than one measurement of reliability. Mean kappa values ranged from 0.32 to 0.70. These may be inflated due to publication bias. Measured reliabilities were found to be higher for case-note reviews based on explicit, as opposed to implicit, criteria and for reviews that focused on outcome (including adverse effects) rather than process errors. We found an association between kappa and the prevalence of errors (poor quality care), suggesting alternatives such as tetrachoric and polychoric correlation coefficients be considered to assess inter-rater reliability. Comparative studies should take into account the relationship between kappa and the prevalence of the events being measured.
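    The association between kappa and prevalence noted above can be shown with two hypothetical 2x2 tables that share the same raw agreement. The sketch below uses the standard two-rater Cohen's kappa and the prevalence-adjusted bias-adjusted kappa (PABAK = 2 * observed agreement - 1). The cell counts and function names are illustrative assumptions, not data from the review.

```python
def kappa_2x2(both_yes, only_r1, only_r2, both_no):
    """Cohen's kappa for two raters and a binary judgement."""
    n = both_yes + only_r1 + only_r2 + both_no
    p_obs = (both_yes + both_no) / n
    p1_yes = (both_yes + only_r1) / n      # rater 1 marginal
    p2_yes = (both_yes + only_r2) / n      # rater 2 marginal
    p_exp = p1_yes * p2_yes + (1 - p1_yes) * (1 - p2_yes)
    return (p_obs - p_exp) / (1 - p_exp)


def pabak(both_yes, only_r1, only_r2, both_no):
    """Prevalence-adjusted bias-adjusted kappa: 2 * observed agreement - 1."""
    n = both_yes + only_r1 + only_r2 + both_no
    return 2 * (both_yes + both_no) / n - 1


# Hypothetical audits of 100 case-notes, both with 90% raw agreement.
balanced = (45, 5, 5, 45)   # errors flagged in about half the notes
rare = (5, 5, 5, 85)        # errors flagged in about 10% of the notes

print(round(kappa_2x2(*balanced), 2))   # 0.8
print(round(kappa_2x2(*rare), 2))       # 0.44
print(round(pabak(*balanced), 2), round(pabak(*rare), 2))   # 0.8 0.8
```

    With identical raw agreement, kappa drops sharply when the rated event is rare, whereas PABAK depends only on observed agreement and is therefore the same for both tables.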

  5. Inter-radiologist agreement for CT scoring of pediatric splenic injuries and effect on an established clinical practice guideline.

    Science.gov (United States)

    Leschied, Jessica R; Mazza, Michael B; Davenport, Matthew; Chong, Suzanne T; Smith, Ethan A; Hoff, Carrie N; Ladino-Torres, Maria F; Khalatbari, Shokoufeh; Ehrlich, Peter F; Dillman, Jonathan R

    2016-02-01

    The American Pediatric Surgical Association (APSA) advocates for the use of a clinical practice guideline to direct management of hemodynamically stable pediatric spleen injuries. The clinical practice guideline is based on the CT score of the spleen injury according to the American Association for the Surgery of Trauma (AAST) CT scoring system. To determine the potential effect of radiologist agreement for CT scoring of pediatric spleen injuries on an established APSA clinical practice guideline. We retrospectively analyzed blunt splenic injuries occurring in children from January 2007 to January 2012 at a single level 1 trauma center (n = 90). Abdominal CT exams performed at clinical presentation were reviewed by four radiologists who documented the following: (1) splenic injury grade (AAST system), (2) arterial extravasation and (3) pseudoaneurysm. Inter-rater agreement for AAST injury grade was assessed using the multi-rater Fleiss kappa and Kendall coefficient of concordance. Inter-rater agreement was assessed using weighted (AAST injury grade) or prevalence-adjusted bias-adjusted (binary measures) kappa statistics; 95% confidence intervals were calculated. We evaluated the hypothetical effect of radiologist disagreement on an established APSA clinical practice guideline. Inter-rater agreement was good for absolute AAST injury grade (kappa: 0.64 [0.59–0.69]) and excellent for relative AAST injury grade (Kendall w: 0.90). All radiologists agreed on the AAST grade in 52% of cases. Based on an established clinical practice guideline, radiologist disagreement could have changed the decision for intensive care management in 11% (10/90) of children, changed the length of hospital stay in 44% (40/90), and changed the time to return to normal activity in 44% (40/90). Radiologist agreement when assigning splenic AAST injury grades is less than perfect, and disagreements have the potential to change management in a substantial number of pediatric patients.
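    For readers unfamiliar with the multi-rater statistics named above, the following minimal sketch computes Fleiss' kappa and the share of cases on which all raters agree. The grades in the example are hypothetical, and the function names are assumptions. Kendall's coefficient of concordance and the confidence intervals reported in the abstract are not reproduced here.

```python
def fleiss_kappa(ratings, categories):
    """Fleiss' kappa for a fixed number of raters per case.

    `ratings` holds one list per case with one grade per rater.
    """
    n_cases = len(ratings)
    n_raters = len(ratings[0])
    if any(len(row) != n_raters for row in ratings):
        raise ValueError("every case needs the same number of ratings")

    # counts[i][j] = number of raters who gave case i the category j
    counts = [[row.count(c) for c in categories] for row in ratings]

    # Per-case agreement, then its mean over all cases.
    p_i = [(sum(x * x for x in row) - n_raters) / (n_raters * (n_raters - 1))
           for row in counts]
    p_bar = sum(p_i) / n_cases

    # Chance agreement from the overall category proportions.
    p_j = [sum(row[j] for row in counts) / (n_cases * n_raters)
           for j in range(len(categories))]
    p_exp = sum(p * p for p in p_j)
    return (p_bar - p_exp) / (1 - p_exp)


def full_agreement_rate(ratings):
    """Share of cases in which every rater gave the same grade."""
    return sum(len(set(row)) == 1 for row in ratings) / len(ratings)


# Hypothetical injury grades (1 to 5) from four readers on six scans.
grades = [
    [2, 2, 2, 2],
    [3, 3, 4, 3],
    [1, 1, 1, 1],
    [4, 4, 4, 5],
    [3, 3, 3, 3],
    [2, 3, 2, 2],
]
print(full_agreement_rate(grades))   # 0.5
print(fleiss_kappa(grades, categories=[1, 2, 3, 4, 5]))
```

    Fleiss' kappa treats the grades as unordered categories, which is one reason an ordinal measure is often reported alongside it.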

  6. Inter-radiologist agreement for CT scoring of pediatric splenic injuries and effect on an established clinical practice guideline

    International Nuclear Information System (INIS)

    Leschied, Jessica R.; Smith, Ethan A.; Ladino-Torres, Maria F.; Dillman, Jonathan R.; Mazza, Michael B.; Chong, Suzanne T.; Hoff, Carrie N.; Davenport, Matthew S.; Khalatbari, Shokoufeh; Ehrlich, Peter F.

    2016-01-01

    The American Pediatric Surgical Association (APSA) advocates for the use of a clinical practice guideline to direct management of hemodynamically stable pediatric spleen injuries. The clinical practice guideline is based on the CT score of the spleen injury according to the American Association for the Surgery of Trauma (AAST) CT scoring system. To determine the potential effect of radiologist agreement for CT scoring of pediatric spleen injuries on an established APSA clinical practice guideline. We retrospectively analyzed blunt splenic injuries occurring in children from January 2007 to January 2012 at a single level 1 trauma center (n = 90). Abdominal CT exams performed at clinical presentation were reviewed by four radiologists who documented the following: (1) splenic injury grade (AAST system), (2) arterial extravasation and (3) pseudoaneurysm. Inter-rater agreement for AAST injury grade was assessed using the multi-rater Fleiss kappa and Kendall coefficient of concordance. Inter-rater agreement was assessed using weighted (AAST injury grade) or prevalence-adjusted bias-adjusted (binary measures) kappa statistics; 95% confidence intervals were calculated. We evaluated the hypothetical effect of radiologist disagreement on an established APSA clinical practice guideline. Inter-rater agreement was good for absolute AAST injury grade (kappa: 0.64 [0.59-0.69]) and excellent for relative AAST injury grade (Kendall w: 0.90). All radiologists agreed on the AAST grade in 52% of cases. Based on an established clinical practice guideline, radiologist disagreement could have changed the decision for intensive care management in 11% (10/90) of children, changed the length of hospital stay in 44% (40/90), and changed the time to return to normal activity in 44% (40/90). Radiologist agreement when assigning splenic AAST injury grades is less than perfect, and disagreements have the potential to change management in a substantial number of pediatric patients. (orig.)

  7. Inter-radiologist agreement for CT scoring of pediatric splenic injuries and effect on an established clinical practice guideline

    Energy Technology Data Exchange (ETDEWEB)

    Leschied, Jessica R.; Smith, Ethan A.; Ladino-Torres, Maria F.; Dillman, Jonathan R. [University of Michigan Health System, Department of Radiology, Section of Pediatric Radiology, C.S. Mott Children's Hospital, Ann Arbor, MI (United States); Mazza, Michael B.; Chong, Suzanne T.; Hoff, Carrie N. [University of Michigan Health System, Department of Radiology, Division of Emergency Radiology, C.S. Mott Children's Hospital, Ann Arbor, MI (United States); Davenport, Matthew S. [University of Michigan Health System, Department of Radiology, Division of Abdominal Imaging, C.S. Mott Children's Hospital, Ann Arbor, MI (United States); Khalatbari, Shokoufeh [University of Michigan, Michigan Institute for Clinical and Health Research, Ann Arbor, MI (United States); Ehrlich, Peter F. [University of Michigan Health System, Department of Surgery, Section of Pediatric Surgery, C.S. Mott Children's Hospital, Ann Arbor, MI (United States)

    2016-02-15

    The American Pediatric Surgical Association (APSA) advocates for the use of a clinical practice guideline to direct management of hemodynamically stable pediatric spleen injuries. The clinical practice guideline is based on the CT score of the spleen injury according to the American Association for the Surgery of Trauma (AAST) CT scoring system. To determine the potential effect of radiologist agreement for CT scoring of pediatric spleen injuries on an established APSA clinical practice guideline. We retrospectively analyzed blunt splenic injuries occurring in children from January 2007 to January 2012 at a single level 1 trauma center (n = 90). Abdominal CT exams performed at clinical presentation were reviewed by four radiologists who documented the following: (1) splenic injury grade (AAST system), (2) arterial extravasation and (3) pseudoaneurysm. Inter-rater agreement for AAST injury grade was assessed using the multi-rater Fleiss kappa and Kendall coefficient of concordance. Inter-rater agreement was assessed using weighted (AAST injury grade) or prevalence-adjusted bias-adjusted (binary measures) kappa statistics; 95% confidence intervals were calculated. We evaluated the hypothetical effect of radiologist disagreement on an established APSA clinical practice guideline. Inter-rater agreement was good for absolute AAST injury grade (kappa: 0.64 [0.59-0.69]) and excellent for relative AAST injury grade (Kendall w: 0.90). All radiologists agreed on the AAST grade in 52% of cases. Based on an established clinical practice guideline, radiologist disagreement could have changed the decision for intensive care management in 11% (10/90) of children, changed the length of hospital stay in 44% (40/90), and changed the time to return to normal activity in 44% (40/90). Radiologist agreement when assigning splenic AAST injury grades is less than perfect, and disagreements have the potential to change management in a substantial number of pediatric patients. (orig.)

  8. A toolbox for multiple sclerosis lesion segmentation

    International Nuclear Information System (INIS)

    Roura, Eloy; Oliver, Arnau; Valverde, Sergi; Llado, Xavier; Cabezas, Mariano; Pareto, Deborah; Rovira, Alex; Vilanova, Joan C.; Ramio-Torrenta, Lluis

    2015-01-01

    Lesion segmentation plays an important role in the diagnosis and follow-up of multiple sclerosis (MS). This task is very time-consuming and subject to intra- and inter-rater variability. In this paper, we present a new tool for automated MS lesion segmentation using T1w and fluid-attenuated inversion recovery (FLAIR) images. Our approach is based on two main steps, initial brain tissue segmentation according to the gray matter (GM), white matter (WM), and cerebrospinal fluid (CSF) performed in T1w images, followed by a second step where the lesions are segmented as outliers to the normal apparent GM brain tissue on the FLAIR image. The tool has been validated using data from more than 100 MS patients acquired with different scanners and at different magnetic field strengths. Quantitative evaluation provided a better performance in terms of precision while maintaining similar results on sensitivity and Dice similarity measures compared with those of other approaches. Our tool is implemented as a publicly available SPM8/12 extension that can be used by both the medical and research communities. (orig.)
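    The evaluation measures mentioned above (precision, sensitivity and Dice similarity) can be sketched for binary lesion masks as follows. The flat 0/1 lists and the function name are hypothetical simplifications. The toolbox itself is an SPM8/12 extension, and this snippet does not reproduce its code.

```python
def overlap_metrics(predicted, reference):
    """Precision, sensitivity and Dice for two binary masks of equal length.

    Each mask is a flat sequence of 0/1 voxel labels (1 = lesion).
    """
    if len(predicted) != len(reference):
        raise ValueError("masks must have the same number of voxels")
    tp = sum(1 for p, r in zip(predicted, reference) if p == 1 and r == 1)
    fp = sum(1 for p, r in zip(predicted, reference) if p == 1 and r == 0)
    fn = sum(1 for p, r in zip(predicted, reference) if p == 0 and r == 1)
    return {
        "precision": tp / (tp + fp),          # share of predicted lesion voxels that are correct
        "sensitivity": tp / (tp + fn),        # share of reference lesion voxels that were found
        "dice": 2 * tp / (2 * tp + fp + fn),  # overlap between the two masks
    }


# Hypothetical five-voxel masks: automated output vs. a manual annotation.
automated = [1, 1, 0, 0, 1]
manual = [1, 0, 0, 1, 1]
print(overlap_metrics(automated, manual))
```

    The same function applied to two manual annotations gives a simple view of inter-rater variability in segmentation.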

  9. A toolbox for multiple sclerosis lesion segmentation

    Energy Technology Data Exchange (ETDEWEB)

    Roura, Eloy; Oliver, Arnau; Valverde, Sergi; Llado, Xavier [University of Girona, Computer Vision and Robotics Group, Girona (Spain); Cabezas, Mariano; Pareto, Deborah; Rovira, Alex [Vall d'Hebron University Hospital, Magnetic Resonance Unit, Dept. of Radiology, Barcelona (Spain); Vilanova, Joan C. [Girona Magnetic Resonance Center, Girona (Spain); Ramio-Torrenta, Lluis [Dr. Josep Trueta University Hospital, Institut d'Investigacio Biomedica de Girona, Multiple Sclerosis and Neuroimmunology Unit, Girona (Spain)

    2015-10-15

    Lesion segmentation plays an important role in the diagnosis and follow-up of multiple sclerosis (MS). This task is very time-consuming and subject to intra- and inter-rater variability. In this paper, we present a new tool for automated MS lesion segmentation using T1w and fluid-attenuated inversion recovery (FLAIR) images. Our approach is based on two main steps, initial brain tissue segmentation according to the gray matter (GM), white matter (WM), and cerebrospinal fluid (CSF) performed in T1w images, followed by a second step where the lesions are segmented as outliers to the normal apparent GM brain tissue on the FLAIR image. The tool has been validated using data from more than 100 MS patients acquired with different scanners and at different magnetic field strengths. Quantitative evaluation provided a better performance in terms of precision while maintaining similar results on sensitivity and Dice similarity measures compared with those of other approaches. Our tool is implemented as a publicly available SPM8/12 extension that can be used by both the medical and research communities. (orig.)

  10. Evaluating observer agreement of scoring systems for foot integrity and footrot lesions in sheep

    Directory of Open Access Journals (Sweden)

    Foddai Alessandro

    2012-05-01

    Background: A scoring scale with five ordinal categories is used for visual diagnosis of footrot in sheep and to study its epidemiology and control. More recently, a 4-point ordinal scale has been used by researchers to score foot integrity (wall and sole horn damage) in sheep. There is no information on observer agreement using either of these scales. Observer agreement for ordinal scores is usually estimated by single-measure values such as weighted kappa or Kendall’s coefficient of concordance, which provide no information on where the disagreement lies. Modeling techniques such as latent class models provide information on both observer bias and whether observers have different thresholds at which they change the score given. In this paper we use weighted kappa and located latent class modeling to explore observer agreement when scoring footrot lesions (using photographs and videos) and foot integrity (using post mortem specimens) in sheep. We used 3 observers and 80 photographs and videos and 80 feet, respectively. Results: Both footrot and foot integrity scoring scales were more consistent within observers than between. The weighted kappa values between observers for both footrot and integrity scoring scales ranged from moderate to substantial. There was disagreement between observers, with both observer bias and different thresholds between score values. The between-observer thresholds were different for scores 1 and 2 for footrot (using photographs and videos) and for all scores for integrity (both walls and soles). The within-observer agreement was higher, with weighted kappa values ranging from substantial to almost perfect. Within-observer thresholds were also more consistent than between-observer thresholds. Scoring using photographs was less variable than scoring using video clips or feet. Conclusions: Latent class modeling is a useful method for exploring components of disagreement within and between observers and this information could

  11. Intra- and inter-rater reliability of the Knee Society Knee Score when used by two physiotherapists in patients post total knee arthroplasty

    Directory of Open Access Journals (Sweden)

    S. Gopal

    2010-01-01

    Background and Purpose: It has yet to be shown whether routine physiotherapy plays a role in the rehabilitation of patients post total knee arthroplasty (Rajan et al 2004). Physiotherapists should be using valid outcome measures to provide evidence of the benefit of their intervention. The aim of this study was to establish the intra- and inter-rater reliability of the Knee Society Knee Score (KSKS), a scoring system developed by Insall et al (1989). The Knee Society Knee Score can be used to assess the integrity of the knee joint of patients undergoing total knee arthroplasty. Since the score involves clinical testing, the intra-rater reliability of the clinician should be established prior to using the scores as data in clinical research. Where multiple clinicians are involved, inter-rater reliability should also be established. Design: This was a correlation study. Subjects: A sample of thirty patients post total knee arthroplasty attending the arthroplasty clinic at Johannesburg Hospital between six weeks and twelve months postoperatively. Method: Recruited patients were evaluated twice with a time interval of one hour between each assessment. Statistical Analysis: The intra- and inter-rater reliability were estimated using the Intraclass Correlation Coefficient (ICC). Results: The intra-rater reliability showed excellent reliability (h = 0.95) for Examiner A and good reliability (h = 0.71) for Examiner B. The inter-rater reliability showed moderate reliability (h = 0.67 during test one and h = 0.66 during test two). Conclusion: The KSKS has good intra-rater reliability when tested within a period of one hour. The KSKS demonstrated moderate agreement for inter-rater reliability.

  12. Reevaluating Interrater Reliability in Offender Risk Assessment

    Science.gov (United States)

    van der Knaap, Leontien M.; Leenarts, Laura E. W.; Born, Marise Ph.; Oosterveld, Paul

    2012-01-01

    Offender risk and needs assessment, one of the pillars of the risk-need-responsivity model of offender rehabilitation, usually depends on raters assessing offender risk and needs. The few available studies of interrater reliability in offender risk assessment are, however, limited in the generalizability of their results. The present study…

  13. "A Comparison of Consensus, Consistency, and Measurement Approaches to Estimating Interrater Reliability"

    OpenAIRE

    Steven E. Stemler

    2004-01-01

    This article argues that the general practice of describing interrater reliability as a single, unified concept is at best imprecise, and at worst potentially misleading. Rather than representing a single concept, different statistical methods for computing interrater reliability can be more accurately classified into one of three categories based upon the underlying goals of analysis. The three general categories introduced and described in this paper are: 1) consensus estimates, 2) cons...

  14. Methods to achieve high interrater reliability in data collection from primary care medical records.

    Science.gov (United States)

    Liddy, Clare; Wiens, Miriam; Hogg, William

    2011-01-01

    We assessed interrater reliability (IRR) of chart abstractors within a randomized trial of cardiovascular care in primary care. We report our findings, and outline issues and provide recommendations related to determining sample size, frequency of verification, and minimum thresholds for 2 measures of IRR: the κ statistic and percent agreement. We designed a data quality monitoring procedure having 4 parts: use of standardized protocols and forms, extensive training, continuous monitoring of IRR, and a quality improvement feedback mechanism. Four abstractors checked a 5% sample of charts at 3 time points for a predefined set of indicators of the quality of care. We set our quality threshold for IRR at a κ of 0.75, a percent agreement of 95%, or both. Abstractors reabstracted a sample of charts in 16 of 27 primary care practices, checking a total of 132 charts with 38 indicators per chart. The overall κ across all items was 0.91 (95% confidence interval, 0.90-0.92) and the overall percent agreement was 94.3%, signifying excellent agreement between abstractors. We gave feedback to the abstractors to highlight items that had a κ of less than 0.70 or a percent agreement less than 95%. No practice had to have its charts abstracted again because of poor quality. A 5% sampling of charts for quality control using IRR analysis yielded κ and agreement levels that met or exceeded our quality thresholds. Using 3 time points during the chart audit phase allows for early quality control as well as ongoing quality monitoring. Our results can be used as a guide and benchmark for other medical chart review studies in primary care.
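
    The two IRR measures named in this abstract, the κ statistic and percent agreement, can be illustrated for a pair of abstractors. The sketch below is not the study's procedure: it assumes Cohen's kappa for two raters, and the function names and sample data are hypothetical. The default thresholds mirror the values quoted in the abstract (κ of 0.75, agreement of 95%).

```python
from collections import Counter


def percent_agreement(rater_a, rater_b):
    """Share of items on which the two raters gave the same code."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(rater_a, rater_b))
    return matches / len(rater_a)


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    n = len(rater_a)
    p_observed = percent_agreement(rater_a, rater_b)
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    # Chance agreement from each rater's marginal category frequencies.
    p_expected = sum(
        counts_a[c] * counts_b[c] for c in set(counts_a) | set(counts_b)
    ) / (n * n)
    if p_expected == 1.0:
        return 1.0  # both raters used one identical category throughout
    return (p_observed - p_expected) / (1.0 - p_expected)


def irr_flags(rater_a, rater_b, min_kappa=0.75, min_agreement=0.95):
    """Report which of the two thresholds (assumed defaults) are met."""
    return {
        "kappa_ok": cohen_kappa(rater_a, rater_b) >= min_kappa,
        "agreement_ok": percent_agreement(rater_a, rater_b) >= min_agreement,
    }


# Hypothetical codes for 10 chart indicators (1 = met, 0 = not met).
abstractor_1 = [1, 1, 0, 0, 1, 0, 1, 1, 0, 0]
abstractor_2 = [1, 1, 0, 0, 1, 0, 1, 0, 0, 0]
print(round(cohen_kappa(abstractor_1, abstractor_2), 2))
print(percent_agreement(abstractor_1, abstractor_2))
print(irr_flags(abstractor_1, abstractor_2))
```

    Reporting both measures is useful because percent agreement ignores chance agreement, while κ can be low when one category dominates even if raw agreement is high.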

  15. Effects of questionnaire-based diagnosis and training on inter-rater reliability among practitioners of traditional Chinese medicine.

    Science.gov (United States)

    Mist, Scott; Ritenbaugh, Cheryl; Aickin, Mikel

    2009-07-01

    To investigate whether a training process that focused on a questionnaire-based diagnosis in Traditional Chinese Medicine (TCM), and developing diagnostic consensus, would improve the agreement of TCM diagnoses among 10 TCM practitioners evaluating patients with temporomandibular joint disorder (TMJD). Evaluation of a diagnostic training program at the Department of Family and Community Medicine, University of Arizona, Tucson, Arizona, and the Oregon College of Oriental Medicine, Portland, Oregon. Screened participants for a study of TCM for TMJD. PRACTITIONERS: Ten (10) licensed acupuncturists with a minimum of 5 years licensure and education in Chinese herbs. A training session using a questionnaire-based diagnostic form was conducted, followed by waves of diagnostic sessions. Between sessions, practitioners discussed the results of the previous round of participants with a focus on reducing variability in primary diagnosis and severity rating of each diagnosis: 3 waves of 5 patients were assessed by 4 practitioner pairs for a total of 120 diagnoses. At 18 months, practitioners completed a recalibration exercise with a similar format with a total of 32 diagnoses. These diagnoses were then examined with respect to the rate of agreement among the 10 practitioners using inter-rater correlations and kappas. The inter-rater correlation with respect to the TCM diagnoses among the 10 practitioners increased from 0.112 to 0.618 with training. Statistically significant improvements were found between the baseline and 18 month exercises (p reliability of TCM diagnosis may be improved through a training process and a questionnaire-based diagnosis process. The improvements varied by diagnosis, with the greatest congruence among primary and more severe diagnoses. Future TCM studies should consider including calibration training to improve the validity of results.

  16. Generative adversarial networks for brain lesion detection

    Science.gov (United States)

    Alex, Varghese; Safwan, K. P. Mohammed; Chennamsetty, Sai Saketh; Krishnamurthi, Ganapathy

    2017-02-01

    Manual segmentation of brain lesions from Magnetic Resonance Images (MRI) is cumbersome and introduces errors due to inter-rater variability. This paper introduces a semi-supervised technique for detection of brain lesions from MRI using Generative Adversarial Networks (GANs). A GAN comprises a Generator network and a Discriminator network, which are trained simultaneously with the objective of one bettering the other. The networks were trained using non-lesion patches (n=13,000) from 4 different MR sequences. The network was trained on the BraTS dataset, and patches were extracted from regions excluding the tumor region. The Generator network generates data by modeling the underlying probability distribution of the training data (P_Data). The Discriminator learns the posterior probability P(Label | Data) by classifying training data and generated data as "Real" or "Fake", respectively. The Generator, upon learning the joint distribution, produces images/patches such that the performance of the Discriminator on them is random, i.e. P(Label | Data = GeneratedData) = 0.5. During testing, the Discriminator assigns posterior probability values close to 0.5 for patches from non-lesion regions, while patches centered on a lesion arise from a different distribution (P_Lesion) and hence are assigned a lower posterior probability value by the Discriminator. On the test set (n=14), the proposed technique achieves a whole tumor Dice score of 0.69, sensitivity of 91% and specificity of 59%. Additionally, the generator network was capable of generating non-lesion patches from various MR sequences.
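
    The three figures reported at the end of this abstract (Dice score, sensitivity, specificity) all derive from the same confusion counts. The following is a minimal sketch under stated assumptions: masks are flat lists of 0/1 labels, and the function names and toy data are hypothetical, not taken from the paper.

```python
def confusion_counts(predicted, truth):
    """Count true/false positives/negatives for two binary label lists."""
    if len(predicted) != len(truth):
        raise ValueError("masks must have the same length")
    tp = fp = tn = fn = 0
    for p, t in zip(predicted, truth):
        if p and t:
            tp += 1
        elif p and not t:
            fp += 1
        elif not p and t:
            fn += 1
        else:
            tn += 1
    return tp, fp, tn, fn


def dice_score(tp, fp, fn):
    """Dice = 2*TP / (2*TP + FP + FN); overlap between the two masks."""
    return 2 * tp / (2 * tp + fp + fn)


def sensitivity(tp, fn):
    """Fraction of true lesion elements that were detected."""
    return tp / (tp + fn)


def specificity(tn, fp):
    """Fraction of non-lesion elements correctly left unlabeled."""
    return tn / (tn + fp)


# Toy example: 8 elements, 1 = lesion, 0 = non-lesion.
truth = [1, 1, 1, 0, 0, 0, 0, 1]
predicted = [1, 1, 0, 1, 0, 0, 0, 1]
tp, fp, tn, fn = confusion_counts(predicted, truth)
print(dice_score(tp, fp, fn), sensitivity(tp, fn), specificity(tn, fp))
```

    A pattern like the one reported, with high sensitivity and lower specificity, corresponds to few missed lesion elements but many false positives, which also pulls the Dice score down.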

  17. THE INTRA- AND INTER-RATER RELIABILITY OF THE SOCCER INJURY MOVEMENT SCREEN (SIMS).

    Science.gov (United States)

    McCunn, Robert; Aus der Fünten, Karen; Govus, Andrew; Julian, Ross; Schimpchen, Jan; Meyer, Tim

    2017-02-01

    The growing volume of movement screening research reveals a belief among practitioners and researchers alike that movement quality may have an association with injury risk. However, existing movement screening tools have not considered the sport-specific movement and injury patterns relevant to soccer. The present study introduces the Soccer Injury Movement Screen (SIMS), which has been designed specifically for use within soccer. Furthermore, the purpose of the present study was to assess the intra- and inter-rater reliability of the SIMS and determine its suitability for use in further research. The study utilized a test-retest design to discern reliability. Twenty-five (11 males, 14 females) healthy, recreationally active university students (age 25.5 ± 4.0 years, height 171 ± 9 cm, weight 64.7 ± 12.6 kg) agreed to participate. The SIMS contains five sub-tests: the anterior reach, single-leg deadlift, in-line lunge, single-leg hop for distance and tuck jump. Each movement was scored out of 10 points and summed to produce a composite score out of 50. The anterior reach and single-leg hop for distance were scored in real-time while the remaining tests were filmed and scored retrospectively. Three raters conducted the SIMS with each participant on three occasions separated by an average of three and a half days (minimum one day, maximum seven days). Rater 1 re-scored the filmed movements for all participants on all occasions six months later to establish the 'pure' intra-rater (intra-occasion) reliability for those movements. Intraclass correlation coefficient (ICC) values for intra- and inter-rater composite score reliability ranged from 0.66-0.72 and 0.79-0.86 respectively. Weighted kappa values representing the intra- and inter-rater reliability of the individual sub-tests ranged from 0.35-0.91 indicating fair to almost perfect agreement. Establishing the reliability of the SIMS is a prerequisite for further research seeking to investigate

  18. The definition of polytrauma: variable interrater versus intrarater agreement--a prospective international study among trauma surgeons.

    Science.gov (United States)

    Butcher, Nerida E; Enninghorst, Natalie; Sisak, Krisztian; Balogh, Zsolt J

    2013-03-01

    The international trauma community has recognized the lack of a validated consensus definition of "polytrauma." We hypothesized that using a subjective definition, trauma surgeons will not have substantial agreement; thus, an objective definition is needed. A prospective observational study was conducted between December 2010 and June 2011 (John Hunter Hospital, Level I trauma center). Inclusion criteria were all trauma call patients with subsequent intensive care unit admission. The study was composed of four stages as follows: (1) four trauma surgeons assessed patients until 24 hours, then coded as either "yes" or "no" for polytrauma, and results compared for agreement; (2) eight trauma surgeons representing the United States, Germany, and the Netherlands graded the same prospectively assessed patients and coded as either "yes" or "no" for polytrauma; (3) 12 months later, the original four trauma surgeons repeated assessment via data sheets to test intrarater variability; and (4) individual subjective definitions were compared with three anatomic scores, namely, (a) Injury Severity Score (ISS) of greater than 15, (b) ISS of greater than 17, and (c) Abbreviated Injury Scale (AIS) score of greater than 2 in at least two ISS body regions. A total of 52 trauma patients were included. Results for each stage were as follows: (1) κ score of 0.50, moderate agreement; (2) κ score of 0.41, moderate agreement; (3) Rater 1 had moderate intrarater agreement (κ score, 0.59), while Raters 2, 3, 4 had substantial intrarater agreement (κ scores, 0.75, 0.66, and 0.71, respectively); and (4) none had most agreement with ISS of greater than 15 (κ score, 0.16), while both definitions ISS greater than 17 and Abbreviated Injury Scale (AIS) score of greater than 2 in at least two ISS body regions had on average fair agreement (κ scores, 0.27 and 0.39, respectively). Based on subjective assessments, trauma surgeons do not agree on the definition of polytrauma, with the subjective

  19. Location of Osteochondritis Dissecans Lesions of the Capitellum.

    Science.gov (United States)

    Johnson, Christine C; Roberts, Susanne; Mintz, Douglas; Fabricant, Peter D; Hotchkiss, Robert N; Daluiski, Aaron

    2018-04-17

    The location of capitellar osteochondritis dissecans (OCD) lesions in the sagittal plane guides the surgical approach, and lesion location in the coronal plane influences surgical management. Although most lesions have been reported to occur between 4 o'clock and 4:30 (120° to 135° anterior to the humerus), some lesions are located elsewhere in the capitellum. The primary aim was to define the region of the capitellum affected by OCD lesions using a novel clock-face localization system. We reviewed 104 magnetic resonance imaging examinations diagnosing a nontraumatic capitellar OCD lesion. In the sagittal plane, lesion margins were recorded as degrees on the capitellum and converted into a clock-face format in which 0° corresponds to 12:00 with the forearm facing to the right. The 0° axis (12-o'clock axis) was defined as a line parallel to the anterior humeral line that intersects the capitellum center. The following coronal measurements were recorded: lesion width, capitellar width, and distance between the lateral capitellum and lateral lesion. Two independent observers took measurements. In the sagittal plane, average lesion location was 92° to 150° (3:04-5:00, clock face) and ranged from 52.1° to 249.5° (1:44-8:19, clock face). Average lesion dimensions were 10.7 mm (mediolateral width) and 5.2 mm (anteroposterior depth). Interrater reliability was high (intraclass correlation coefficient = 0.98). Using a magnetic resonance imaging-based clock-face localization system, we found that capitellar OCD lesions affect a broad region of the capitellum in the sagittal plane. The clock-face localization system allows for precise description of capitellar OCD lesion location, which may facilitate intraoperative decision and longitudinal monitoring. Copyright © 2018 American Society for Surgery of the Hand. Published by Elsevier Inc. All rights reserved.
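
    The clock-face format described in this abstract maps 0° to 12:00, so a full 360° turn spans 12 hours: 30° per hour, or 0.5° per minute. The sketch below reproduces the conversions quoted in the abstract. The function name is hypothetical, and truncating to whole minutes is an assumption, since the abstract does not state a rounding rule.

```python
def degrees_to_clock(degrees):
    """Convert an angle (0 deg = 12:00, clockwise) to an h:mm clock-face string.

    One degree corresponds to 2 clock minutes (360 deg = 720 minutes).
    Fractional minutes are truncated (an assumed convention).
    """
    total_minutes = int((degrees % 360) * 2)
    hours, minutes = divmod(total_minutes, 60)
    if hours == 0:
        hours = 12  # angles below 30 deg fall in the 12 o'clock hour
    return f"{hours}:{minutes:02d}"


# Values quoted in the abstract: 92 -> 3:04, 150 -> 5:00,
# 52.1 -> 1:44, 249.5 -> 8:19, 120 -> 4:00, 135 -> 4:30.
for angle in (92, 150, 52.1, 249.5, 120, 135):
    print(angle, degrees_to_clock(angle))
```

    The same factor of 2 minutes per degree works in reverse: a clock reading of h:mm corresponds to (60·h + mm) / 2 degrees, taking 12 as 0 hours.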

  20. Quantitative measurement of hypertrophic scar: interrater reliability and concurrent validity.

    Science.gov (United States)

    Nedelec, Bernadette; Correa, José A; Rachelska, Grazyna; Armour, Alexis; LaSalle, Léo

    2008-01-01

    Research into the pathophysiology and treatment of hypertrophic scar (HSc) remains limited by the heterogeneity of scar and the imprecision with which its severity is measured. The objective of this study was to test the interrater reliability and concurrent validity of the Cutometer measurement of elasticity, the Mexameter measurement of erythema and pigmentation, and total thickness measure of the DermaScan C relative to the modified Vancouver Scar Scale (mVSS) in patient-matched normal skin, normal scar, and HSc. Three independent investigators evaluated 128 sites (severe HSc, moderate or mild HSc, donor site, and normal skin) on 32 burn survivors using all of the above measurement tools. The intraclass correlation coefficient, which was used to measure interrater reliability, reflects the inherent amount of error in the measure and is considered acceptable when it is >0.75. Interrater reliability of the totals of the height, pliability, and vascularity subscales of the mVSS fell below the acceptable limit (≈0.50). The individual subscales of the mVSS fell well below the acceptable level (0.89) for each study site with the exception of severe scar. Mexameter and DermaScan C reliability measurements were acceptable for all sites (>0.82). Concurrent validity correlations with the mVSS were significant except for the comparison of the mVSS pliability subscale and the Cutometer maximum deformation measure comparison in severe scar. In conclusion, the Mexameter and DermaScan C measurements of scar color and thickness of all sites, as well as the Cutometer measurement of elasticity in all but the most severe scars, show high interrater reliability. Their significant concurrent validity with the mVSS confirms that these tools are measuring the same traits as the mVSS, and in a more objective way.

  1. Face validity and inter-rater reliability of the Danish version of the modified-Yale Preoperative Anxiety Scale

    DEFF Research Database (Denmark)

    Skovby, Pernille; Rask, Charlotte Ulrikka; Dall, Rolf

    2014-01-01

    -YPAS to Danish cultural and linguistic conditions and to test face validity and inter-reliability in a clinical setting. Materials and methods The translation was performed in accordance with WHO guidelines. Face validity as well as linguistic difficulties of the Danish version was tested and solved in a focus...... of the m-YPAS as suitable and relevant, i.e. the face validity satisfactory. Inter-rater reliability analysis revealed that inter-observer agreement at induction 1 were good to very good (kw: 0.63–0.98) and at induction 2, the agreement was good to very good (kw: 0.72–0.96). ICC for the overall weighted...... anxiety score was in: induction 1:0.92 and induction 2: 0.92 Conclusion Standardized and validated assessment tools are needed to evaluate interventions aiming to reduce preoperative anxiety in children. The Danish m-YPAS had a satisfactory face validity and inter-reliability, based on a minor empirical...

  2. Interobserver agreement on the interpretation of automated whole breast ultrasonography

    International Nuclear Information System (INIS)

    Kim, Eun Jeong; Kim, Sung Hun; Kang, Bong Joo; Kim, Yun Ju

    2014-01-01

    The purpose of this study was to prospectively evaluate the interobserver agreement on lesion characterization and the final assessment of automated whole breast ultrasonography (ABUS) images. Between March and August 2012, 172 women underwent bilateral ABUS before biopsy guided by handheld ultrasonography (HHUS) and mammography. A total of 206 breast lesions were confirmed histopathologically by biopsy. Three-dimensional volume data from ABUS scans were analyzed by two radiologists without the knowledge of HHUS results or patient clinical information. The two readers described the type, shape, orientation, margin, echogenicity, posterior acoustic features, and categorization of the final assessment of detected breast lesions. Kappa statistics were used to analyze the described characteristics of the breast lesions detected by both of the two readers. Of the 206 histopathologically confirmed lesions, reader 1 detected 166 lesions and reader 2 detected 150 lesions. A total of 145 lesions were detected by both readers using ABUS images. There was substantial agreement on shape (k=0.707), and moderate agreement on type, margin, mass orientation, echogenicity, and posterior acoustic features (k=0.592, 0.438, 0.472, 0.524, and 0.541, respectively). Breast Imaging Reporting and Data System final assessment values yielded a kappa value of 0.3971 when category subdivisions 4A, 4B, and 4C were included. With respect to the C2, C3, C4, and C5 categories, the interobserver agreement was moderate (k=0.505). ABUS is a promising diagnostic tool with a good interobserver agreement, comparable to that of HHUS.

  3. Interobserver agreement on the interpretation of automated whole breast ultrasonography

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Eun Jeong; Kim, Sung Hun; Kang, Bong Joo; Kim, Yun Ju [Dept. of Radiology, Seoul St. Mary's Hospital, The Catholic University of Korea College of Medicine, Seoul (Korea, Republic of)]

    2014-12-15

    The purpose of this study was to prospectively evaluate the interobserver agreement on lesion characterization and the final assessment of automated whole breast ultrasonography (ABUS) images. Between March and August 2012, 172 women underwent bilateral ABUS before biopsy guided by handheld ultrasonography (HHUS) and mammography. A total of 206 breast lesions were confirmed histopathologically by biopsy. Three-dimensional volume data from ABUS scans were analyzed by two radiologists without the knowledge of HHUS results or patient clinical information. The two readers described the type, shape, orientation, margin, echogenicity, posterior acoustic features, and categorization of the final assessment of detected breast lesions. Kappa statistics were used to analyze the described characteristics of the breast lesions detected by both of the two readers. Of the 206 histopathologically confirmed lesions, reader 1 detected 166 lesions and reader 2 detected 150 lesions. A total of 145 lesions were detected by both readers using ABUS images. There was substantial agreement on shape (k=0.707), and moderate agreement on type, margin, mass orientation, echogenicity, and posterior acoustic features (k=0.592, 0.438, 0.472, 0.524, and 0.541, respectively). Breast Imaging Reporting and Data System final assessment values yielded a kappa value of 0.3971 when category subdivisions 4A, 4B, and 4C were included. With respect to the C2, C3, C4, and C5 categories, the interobserver agreement was moderate (k=0.505). ABUS is a promising diagnostic tool with a good interobserver agreement, comparable to that of HHUS.

  4. Inter-rater and intra-rater reliability of a clinical protocol for measuring turnout in collegiate dancers.

    Science.gov (United States)

    Greene, Amanda; Lasner, Andrea; Deu, Rajwinder; Oliphant, Seth; Johnson, Kenneth

    2018-02-02

    Reliable methods of measuring turnout in dancers and comparing active turnout (used in class) with functional (uncompensated) turnout are needed. Authors have suggested measurement techniques but there is no clinically useful, easily reproducible technique with established inter-rater and intra-rater reliability. We adapted a technique based on previous research, which is easily reproducible. We hypothesized excellent inter-rater and intra-rater reliability between experienced physical therapists (PTs) and a briefly trained faculty member from a university's department of dance. Thirty-two participants were recruited from the same dance department. Dancers' active and functional turnout was measured by each rater. We found that our technique for measuring active and functional turnout has excellent inter-rater and intra-rater reliability when performed by two experienced PTs and by one briefly trained university-level dance faculty member. For active turnout, inter-rater reliability was 0.78 among all raters and 0.82 among only the PT raters; intra-rater reliability was 0.82 among all raters and 0.85 among only the PT raters. For functional turnout, inter-rater reliability was 0.86 among all raters and 0.88 among only the PT raters; intra-rater reliability was 0.87 among all raters and 0.88 among only the PT raters. The measurement technique described provides a standardized protocol with excellent inter-rater and intra-rater reliability when performed by experienced PTs or by a briefly trained university-level dance faculty member.

  5. Simulated patient training: Using inter-rater reliability to evaluate simulated patient consistency in nursing education.

    Science.gov (United States)

    MacLean, Sharon; Geddes, Fiona; Kelly, Michelle; Della, Phillip

    2018-03-01

    Simulated patients (SPs) are frequently used for training nursing students in communication skills. An acknowledged benefit of using SPs is the opportunity to provide a standardized approach by which participants can demonstrate and develop communication skills. However, relatively little evidence is available on how to best facilitate and evaluate the reliability and accuracy of SPs' performances. The aim of this study is to investigate the effectiveness of an evidenced based SP training framework to ensure standardization of SPs. The training framework was employed to improve inter-rater reliability of SPs. A quasi-experimental study was employed to assess SP post-training understanding of simulation scenario parameters using inter-rater reliability agreement indices. Two phases of data collection took place. Initially a trial phase including audio-visual (AV) recordings of two undergraduate nursing students completing a simulation scenario is rated by eight SPs using the Interpersonal Communication Assessments Scale (ICAS) and Quality of Discharge Teaching Scale (QDTS). In phase 2, eight SP raters and four nursing faculty raters independently evaluated students' (N=42) communication practices using the QDTS. Intraclass correlation coefficients (ICC) were >0.80 for both stages of the study in clinical communication skills. The results support the premise that if trained appropriately, SPs have a high degree of reliability and validity to both facilitate and evaluate student performance in nurse education. Crown Copyright © 2018. Published by Elsevier Ltd. All rights reserved.

  6. Inter-rater and test-retest reliability of quality assessments by novice student raters using the Jadad and Newcastle-Ottawa Scales.

    Science.gov (United States)

    Oremus, Mark; Oremus, Carolina; Hall, Geoffrey B C; McKinnon, Margaret C

    2012-01-01

    Quality assessment of included studies is an important component of systematic reviews. The authors investigated inter-rater and test-retest reliability for quality assessments conducted by inexperienced student raters. Student raters received a training session on quality assessment using the Jadad Scale for randomised controlled trials and the Newcastle-Ottawa Scale (NOS) for observational studies. Raters were randomly assigned into five pairs and they each independently rated the quality of 13-20 articles. These articles were drawn from a pool of 78 papers examining cognitive impairment following electroconvulsive therapy to treat major depressive disorder. The articles were randomly distributed to the raters. Two months later, each rater re-assessed the quality of half of their assigned articles. McMaster Integrative Neuroscience Discovery and Study Program. 10 students taking McMaster Integrative Neuroscience Discovery and Study Program courses. The authors measured inter-rater reliability using κ and the intraclass correlation coefficient type 2,1 or ICC(2,1). The authors measured test-retest reliability using ICC(2,1). Inter-rater reliability varied by scale question. For the six-item Jadad Scale, question-specific κs ranged from 0.13 (95% CI -0.11 to 0.37) to 0.56 (95% CI 0.29 to 0.83). The ranges were -0.14 (95% CI -0.28 to 0.00) to 0.39 (95% CI -0.02 to 0.81) for the NOS cohort and -0.20 (95% CI -0.49 to 0.09) to 1.00 (95% CI 1.00 to 1.00) for the NOS case-control. For overall scores on the six-item Jadad Scale, ICC(2,1)s for inter-rater and test-retest reliability (accounting for systematic differences between raters) were 0.32 (95% CI 0.08 to 0.52) and 0.55 (95% CI 0.41 to 0.67), respectively. Corresponding ICC(2,1)s for the NOS cohort were -0.19 (95% CI -0.67 to 0.35) and 0.62 (95% CI 0.25 to 0.83), and for the NOS case-control, the ICC(2,1)s were 0.46 (95% CI -0.13 to 0.92) and 0.83 (95% CI 0.48 to 0.95). Inter-rater reliability was generally poor

  7. Hypsarrhythmia assessment exhibits poor interrater reliability: a threat to clinical trial validity.

    Science.gov (United States)

    Hussain, Shaun A; Kwong, Grace; Millichap, John J; Mytinger, John R; Ryan, Nicole; Matsumoto, Joyce H; Wu, Joyce Y; Lerner, Jason T; Sankar, Raman

    2015-01-01

    Hypsarrhythmia is the classic interictal electroencephalographic pattern associated with infantile spasms, and characterized by high voltage, disorganization, and multifocal independent epileptiform discharges. Given this seemingly simple definition, one might expect excellent interrater reliability (IRR) in the identification of this pattern. Alternatively, it may be argued that assessments of voltage and disorganization are fairly subjective, and thus quite challenging in borderline cases. We sought to test the IRR of hypsarrhythmia assessment in a systematic fashion. Six blinded pediatric electroencephalographers from four centers reviewed 22 electroencephalography (EEG) samples from patients with infantile spasms. Each sample was 5 min in duration and included only wakefulness. Raters determined if each EEG was abnormal and if hypsarrhythmia was present/absent, and characterized relevant features: voltage, organization, epileptiform discharges, slowing, interictal attenuations, symmetry, and synchrony. In addition, raters indicated their level of confidence for each assessment. Multirater kappa statistics (κ) were calculated for the assessment of hypsarrhythmia and each feature. Although IRR was favorable in determining whether a study was normal or abnormal (κ=0.89), reliability was unfavorable for assessment of hypsarrhythmia (κ=0.40), modified hypsarrhythmia (κ=0.47), high voltage (κ=0.37), disorganization (κ=0.22), multifocal epileptiform discharges (κ=0.68), interictal voltage attenuations (κ=0.21), slowing (κ=0.20), asymmetry (κ=0.26), and asynchrony (κ=0.08). Despite generally unsatisfactory interrater agreement, raters consistently reported high confidence in assessments. This study contradicts the view that hypsarrhythmia assessment is straightforward. Even small variability in the identification of hypsarrhythmia has potentially deleterious consequences for clinical care, as its presence or absence impacts decisions to pursue high-risk and
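
    The abstract reports multirater kappa statistics for six raters without naming the estimator. One widely used multirater form is Fleiss' kappa; the sketch below assumes that choice, and the function name and toy counts are hypothetical. Each row of the input gives, for one EEG sample, how many raters chose each category.

```python
def fleiss_kappa(counts):
    """Fleiss' kappa from a subjects x categories table of rater counts.

    counts[i][j] = number of raters assigning subject i to category j.
    Every row must sum to the same number of raters (at least 2).
    """
    n_subjects = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2 or any(sum(row) != n_raters for row in counts):
        raise ValueError("each subject needs the same number (>= 2) of ratings")

    # Per-subject agreement: proportion of rater pairs that agree.
    per_subject = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_bar = sum(per_subject) / n_subjects

    # Chance agreement from the overall category proportions.
    n_categories = len(counts[0])
    proportions = [
        sum(row[j] for row in counts) / (n_subjects * n_raters)
        for j in range(n_categories)
    ]
    p_expected = sum(p * p for p in proportions)
    if p_expected == 1.0:
        return 1.0  # every rating fell in a single category
    return (p_bar - p_expected) / (1.0 - p_expected)


# Toy data: 4 samples, 6 raters, columns = (pattern present, pattern absent).
table = [[6, 0], [5, 1], [1, 5], [3, 3]]
print(round(fleiss_kappa(table), 3))
```

    With binary categories and evenly split raters on every sample, this estimator drops below zero, which is why borderline cases weigh so heavily on the reported values.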

  8. Magnetic resonance cholangiographic (MRCP) features of ischemic-type biliary lesions (ITBL). A case-control study; Magnetresonanz-Cholangiografie (MRCP) bei Patienten mit Ischemic-Type Biliary Lesions (ITBL). Eine Fall-Kontroll-Studie

    Energy Technology Data Exchange (ETDEWEB)

    Collettini, F.; Kroencke, T.J.; Renz, D.M.; Bucourt, M. de; Poellinger, A. [Charite, Berlin (Germany). Klinik fuer Radiologie; Heidenhain, C.; Neuhaus, P. [Charite, Berlin (Germany). Klinik fuer Allgemein-, Visceral- und Transplantationschirurgie CVK

    2011-08-15

    Purpose: To evaluate the spectrum of MR cholangiography (MRCP) features of ischemic-type biliary lesions (ITBL) after orthotopic liver transplantation (OLT). Materials and Methods: 30 patients (16 m, 14 f) with an average age of 52 years (9 - 69 y) were examined in two 1.5 MR units using breath-hold 2D-SS-FSE-sequences and 3D-MRCP sequences. 20 of the 30 patients had an established ITBL, and the remaining 10 patients were post-OLT controls. MRCPs were evaluated independently by two experienced radiologists that were blinded to the clinical history as well as the results of other imaging modalities. All images were analyzed for the presence of 16 different pathological features. Differences between ITBL patients and controls were analyzed using the Mann-Whitney-U Test. Inter-rater variability was tested using the Cohen's Kappa test. Results: Abnormal findings of bile ducts were seen in all patients. The most common findings were (in percentage for reader 1/2) intrahepatic bile duct dilatation (95 %/95 %) and extrahepatic bile duct stenoses (95 %/85 %), followed by intrahepatic main duct stenoses (90 %/95 %) and segmental duct stenoses (85 %/85 %). Differences between ITBL patients and controls were significant for most of the analyzed features (Mann-Whitney-U test, p < 0.05). For 12 of 16 features, there was substantial or almost perfect agreement (K = 0.61 - 1.00), for 2 of 16 features moderate agreement (K = 0.41- 0.60) and for 2 of 16 features fair agreement (K < 0.40). Conclusion: In patients with ITBL, MR cholangiography reveals characteristic features that may allow differentiation from other biliary complications after liver transplantation. (orig.)
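
    The verbal labels attached to kappa ranges in this abstract (fair, moderate, substantial, almost perfect) match the commonly used Landis and Koch bands. The abstract does not cite that convention, so the mapping below is an assumption, and the function name is hypothetical.

```python
def kappa_label(kappa):
    """Map a kappa value to a Landis and Koch style agreement label."""
    if not -1.0 <= kappa <= 1.0:
        raise ValueError("kappa must lie between -1 and 1")
    if kappa < 0.0:
        return "poor"
    if kappa <= 0.20:
        return "slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


for value in (0.30, 0.50, 0.61, 0.90):
    print(value, kappa_label(value))
```

    These cut-offs are conventions rather than statistical tests, and other records in this listing use different wording (for example "good to very good") for similar values.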

  9. Orthopaedic nurses' knowledge and interrater reliability of neurovascular assessments with 2-point discrimination test.

    Science.gov (United States)

    Turney, Jennifer; Raley Noble, Deana; Kim, Son Chae

    2013-01-01

    This study was conducted to evaluate the effects of education on knowledge and interrater reliability of neurovascular assessments with 2-point discrimination (2-PD) test among pediatric orthopaedic nurses. A pre- and posttest study was done among 60 nurses attending 2-hour educational sessions. Neurovascular assessments with 2-PD test were performed on 64 casted pediatric patients by the nurses and 5 nurse experts before and after the educational sessions. The mean neurovascular assessment knowledge score improved at posteducation compared with preeducation (p < .001). The 2-PD test interrater reliability also improved, from a Cohen's kappa value of 0.24 to 0.48 at posteducation. The 2-hour educational session may be effective in improving nurses' knowledge and the interrater reliability of neurovascular assessment with 2-PD test.

  10. Interrater reliability of videotaped observational gait-analysis assessments.

    Science.gov (United States)

    Eastlack, M E; Arvidson, J; Snyder-Mackler, L; Danoff, J V; McGarvey, C L

    1991-06-01

    The purpose of this study was to determine the interrater reliability of videotaped observational gait-analysis (VOGA) assessments. Fifty-four licensed physical therapists with varying amounts of clinical experience served as raters. Three patients with rheumatoid arthritis who demonstrated an abnormal gait pattern served as subjects for the videotape. The raters analyzed each patient's most severely involved knee during the four subphases of stance for the kinematic variables of knee flexion and genu valgum. Raters were asked to determine whether these variables were inadequate, normal, or excessive. The temporospatial variables analyzed throughout the entire gait cycle were cadence, step length, stride length, stance time, and step width. Generalized kappa coefficients ranged from .11 to .52. Intraclass correlation coefficients (2,1) and (3,1) were slightly higher. Our results indicate that physical therapists' VOGA assessments are only slightly to moderately reliable and that improved interrater reliability of the assessments of physical therapists utilizing this technique is needed. Our data suggest that there is a need for greater standardization of gait-analysis training.

  11. The interrater and test-retest reliability of the Home Falls and Accidents Screening Tool (HOME FAST) in Malaysia: Using raters with a range of professional backgrounds.

    Science.gov (United States)

    Romli, Muhammad Hibatullah; Mackenzie, Lynette; Lovarini, Meryl; Tan, Maw Pin; Clemson, Lindy

    2017-06-01

    Falls can be a devastating issue for older people living in the community, including those living in Malaysia. Health professionals and community members have a responsibility to ensure that older people have a safe home environment to reduce the risk of falls. Using a standardised screening tool is beneficial to intervene early with this group. The Home Falls and Accidents Screening Tool (HOME FAST) should be considered for this purpose; however, its use in Malaysia has not been studied. Therefore, the aim of this study was to evaluate the interrater and test-retest reliability of the HOME FAST with multiple professionals in the Malaysian context. A cross-sectional design was used to evaluate interrater reliability where the HOME FAST was used simultaneously in the homes of older people by 2 raters and a prospective design was used to evaluate test-retest reliability with a separate group of older people at different times in their homes. Both studies took place in an urban area of Kuala Lumpur. Professionals from 9 professional backgrounds participated as raters in this study, and a group of 51 community older people were recruited for the interrater reliability study and another group of 30 for the test-retest reliability study. The overall agreement was moderate for interrater reliability and good for test-retest reliability. The HOME FAST was consistently rated by different professionals, and no bias was found among the multiple raters. The HOME FAST can be used with confidence by a variety of professionals across different settings. The HOME FAST can become a universal tool to screen for home hazards related to falls. © 2017 John Wiley & Sons, Ltd.

  12. Interrater Reliability in Analysis of Laryngoscopic Features for Unilateral Vocal Fold Paresis.

    Science.gov (United States)

    Isseroff, Tova F; Parasher, Arjun K; Richards, Amanda; Sivak, Mark; Woo, Peak

    2016-11-01

    The diagnosis of paresis in patients with vocal fold motion impairment remains a challenge. In particular, laryngoscopy examination may result in significant disagreement in diagnosis among providers. We hypothesize that systematically evaluating for a standard set of clinical parameters will increase the diagnostic concordance among providers. Prospective case series conducted at a Tertiary referral Laryngology office. Two laryngologists (rater 1) and two trainees (rater 2) rated laryngoscopy findings in 19 patients suspected of paresis. The diagnosis was confirmed with laryngeal electromyogram. A standard set of 27 ratings was used for each examination that included movement, laryngeal configuration, and stroboscopy signs. A kappa coefficient was calculated for agreement in laryngoscopy findings and effectiveness in predicting the laterality of paresis. A substantial agreement (kappa coefficient > 0.61) existed between the raters for vocal fold length, vocal fold thickness, bowing, and reduction in movement. A moderate agreement (kappa coefficient > 0.41) existed between raters for piriform opening and reduced kinesis. The senior author was accurately able to diagnose the side of paresis in 89.5% of cases for a kappa coefficient of 0.78, whereas the trainees correctly predicted the side of paresis in 63.1% for a kappa coefficient of 0.35. The raters agreed on the diagnosis in 73.7% of cases for a kappa coefficient of 0.50. Using a standard set of laryngoscopy findings may improve the provider's ability to identify the laterality of vocal fold paresis and increase interrater reliability compared with other series. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
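
    The agreement labels used in this abstract ("substantial" above 0.61, "moderate" above 0.41) match the widely used Landis and Koch bands for kappa. The following is a minimal sketch of that mapping, assuming those conventional cut-offs; the function name `landis_koch_label` is hypothetical and the abstract does not say which scheme the authors applied.

```python
def landis_koch_label(kappa):
    """Map a kappa value to the conventional Landis and Koch agreement label.

    Assumed bands: <0 poor, 0-0.20 slight, 0.21-0.40 fair,
    0.41-0.60 moderate, 0.61-0.80 substantial, 0.81-1.00 almost perfect.
    """
    if not -1.0 <= kappa <= 1.0:
        raise ValueError("kappa must lie in [-1, 1]")
    if kappa < 0:
        return "poor"
    if kappa <= 0.20:
        return "slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


# Kappa values reported in the abstract above.
for value in (0.78, 0.50, 0.35):
    print(value, landis_koch_label(value))
```

    Under these assumed bands, the senior author's 0.78 falls in the substantial range, the between-rater 0.50 in the moderate range, and the trainees' 0.35 in the fair range.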

  13. Intra- and interrater reliability of the Chicago Classification of achalasia subtypes in pediatric high-resolution esophageal manometry (HRM) recordings.

    Science.gov (United States)

    Singendonk, M M J; Rosen, R; Oors, J; Rommel, N; van Wijk, M P; Benninga, M A; Nurko, S; Omari, T I

    2017-11-01

    Subtyping achalasia by high-resolution manometry (HRM) is clinically relevant as response to therapy and prognosis have been shown to vary accordingly. The aim of this study was to assess inter- and intrarater reliability of diagnosing achalasia and achalasia subtyping in children using the Chicago Classification (CC) V3.0. Six observers analyzed 40 pediatric HRM recordings (22 achalasia and 18 non-achalasia) twice by using dedicated analysis software (ManoView 3.0, Given Imaging, Los Angeles, CA, USA). Integrated relaxation pressure (IRP4s), distal contractile integral (DCI), intrabolus pressurization pattern (IBP), and distal latency (DL) were extracted and analyzed hierarchically. Cohen's κ (2 raters) and Fleiss' κ (>2 raters) and the intraclass correlation coefficient (ICC) were used for categorical and ordinal data, respectively. Based on the results of dedicated analysis software only, intra- and interrater reliability was excellent and moderate (κ=0.89 and κ=0.52, respectively) for differentiating achalasia from non-achalasia. For subtyping achalasia, reliability decreased to substantial and fair (κ=0.72 and κ=0.28, respectively). When observers were allowed to change the software-driven diagnosis according to their own interpretation of the manometric patterns, intra- and interrater reliability increased for diagnosing achalasia (κ=0.98 and κ=0.92, respectively) and for subtyping achalasia (κ=0.79 and κ=0.58, respectively). Intra- and interrater agreement for diagnosing achalasia when using HRM and the CC was very good to excellent when results of automated analysis software were interpreted by experienced observers. More variability was seen when relying solely on the software-driven diagnosis and for subtyping achalasia. Therefore, diagnosing and subtyping achalasia should be performed in pediatric motility centers with significant expertise. © 2017 John Wiley & Sons Ltd.
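
    Fleiss' κ, named above for comparisons among more than two raters, can be sketched as follows. This is an illustrative implementation of the standard formula, not the authors' analysis code; the function name `fleiss_kappa` and the small count table are hypothetical.

```python
def fleiss_kappa(counts):
    """Fleiss' kappa for a table of rating counts.

    counts[i][j] is the number of raters who put subject i in category j.
    Every subject must be rated by the same number of raters (at least 2).
    """
    n_subjects = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2 or any(sum(row) != n_raters for row in counts):
        raise ValueError("each subject needs the same number (>= 2) of ratings")

    # Per-subject agreement: share of rater pairs that agree on that subject.
    per_subject = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_bar = sum(per_subject) / n_subjects

    # Chance agreement from the overall proportion of ratings per category.
    n_categories = len(counts[0])
    total = n_subjects * n_raters
    proportions = [sum(row[j] for row in counts) / total for j in range(n_categories)]
    p_e = sum(p * p for p in proportions)
    if p_e == 1.0:
        raise ValueError("kappa is undefined when all ratings fall in one category")
    return (p_bar - p_e) / (1 - p_e)


# Hypothetical table: 4 recordings, 3 raters, 2 categories
# (achalasia, non-achalasia). Not data from the study.
example = [[3, 0], [2, 1], [1, 2], [0, 3]]
print(round(fleiss_kappa(example), 3))
```

    The table format (one row per recording, one column per category) is a design choice that keeps the function independent of which rater gave which rating, which is all Fleiss' κ requires.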

  14. Reliability of Patient-Led Screening with the Malnutrition Screening Tool: Agreement between Patient and Health Care Professional Scores in the Cancer Care Ambulatory Setting.

    Science.gov (United States)

    Di Bella, Alexandra; Blake, Claire; Young, Adrienne; Pelecanos, Anita; Brown, Teresa

    2018-02-01

    The prevalence of malnutrition in patients with cancer is reported as high as 60% to 80%, and malnutrition is associated with lower survival, reduced response to treatment, and poorer functional status. The Malnutrition Screening Tool (MST) is a validated tool when administered by health care professionals; however, it has not been evaluated for patient-led screening. This study aims to assess the reliability of patient-led MST screening through assessment of inter-rater reliability between patient-led and dietitian-researcher-led screening and intra-rater reliability between an initial and a repeat patient screening. This cross-sectional study included 208 adults attending ambulatory cancer care services in a metropolitan teaching hospital in Queensland, Australia, in October 2016 (n=160 inter-rater reliability; n=48 intra-rater reliability measured in a separate sample). Primary outcome measures were MST risk categories (MST 0-1: not at risk, MST ≥2: at risk) as determined by screening completed by patients and a dietitian-researcher, patient test-retest screening, and patient acceptability. Percent and chance-corrected agreement (Cohen's kappa coefficient, κ) were used to determine agreement between patient-MST and dietitian-MST (inter-rater reliability) and MST completed by patient on admission to unit (patient-MSTA) and MST completed by patient 1 to 3 hours after completion of initial MST (patient-MSTB) (intra-rater reliability). High inter-rater reliability and intra-rater reliability were observed. Agreement between patient-MST and dietitian-MST was 96%, with "almost perfect" chance-adjusted agreement (κ=0.92, 95% CI 0.84 to 0.97). Agreement between repeated patient-MSTA and patient-MSTB was 94%, with "almost perfect" chance-adjusted agreement (κ=0.88, 95% CI 0.71 to 1.00). Based on dietitian-MST, 33% (n=53) of patients were identified as being at risk for malnutrition, and 40% of these reported not seeing a dietitian. Of 156 patients who provided

  15. An adaptation of the Interpersonal Problem Areas Rating Scale: pilot and interrater agreement study

    Directory of Open Access Journals (Sweden)

    Ana Claudia Fontes de Andrade

    2008-12-01

    OBJECTIVE: This article describes the adaptation of a rating scale of interpersonal psychotherapy problem areas to include a fifth problem area appropriate to bipolar disorder and an interrater agreement study in identifying interpersonal problem areas and selecting a primary treatment focus if patients were to engage in treatment. METHOD: Five research interpersonal psychotherapists assessed nine audiotapes of a single interview with five bipolar and four unipolar patients in which the interpersonal inventory and identification of problem areas were undertaken. RESULTS: Raters agreed on presence and absence of problem areas in seven tapes. Kappas for identification of problem areas were 1.00 (grief), 0.77 (role dispute), 0.61 (role transition), 0.57 (interpersonal deficits) and 1.00 (loss of healthy self). Kappa for agreement on a primary clinical focus if patients were to engage in interpersonal psychotherapy treatment was 0.64. CONCLUSIONS: The adaptation of the original scale to include an area pertinent to bipolar disorder proved to be applicable and relevant for use with this population. The results show substantial interrater agreement in identifying problem areas and potential treatment focus.

  16. Inter-rater and test–retest reliability of quality assessments by novice student raters using the Jadad and Newcastle–Ottawa Scales

    Science.gov (United States)

    Oremus, Carolina; Hall, Geoffrey B C; McKinnon, Margaret C

    2012-01-01

    ,1)s were 0.46 (95% CI −0.13 to 0.92) and 0.83 (95% CI 0.48 to 0.95). Conclusions Inter-rater reliability was generally poor to fair and test–retest reliability was fair to excellent. A pilot rating phase following rater training may be one way to improve agreement. PMID:22855629

  17. Introducing a new definition of a near fall: intra-rater and inter-rater reliability.

    Science.gov (United States)

    Maidan, I; Freedman, T; Tzemah, R; Giladi, N; Mirelman, A; Hausdorff, J M

    2014-01-01

    Near falls (NFs) are more frequent than falls, and may occur before falls, potentially predicting fall risk. As such, identification of a NF is important. We aimed to assess intra and inter-rater reliability of the traditional definition of a NF and to demonstrate the potential utility of a new definition. To this end, 10 older adults, 10 idiopathic elderly fallers, and 10 patients with Parkinson's disease (PD) walked in an obstacle course while wearing a safety harness. All walks were videotaped. Forty-nine video segments were extracted to create 2 clips each of 8.48 min. Four raters scored each event using the traditional definition and, two weeks later, using the new definition. A fifth rater used only the new definition. Intra-rater reliability was determined using Kappa (K) statistics and inter-rater reliability was determined using ICC. Using the traditional definition, three raters had poor intra-rater reliability (K0.137) and one rater had moderate intra-rater reliability (K=0.624, pdefinition, inter-rater reliability between the four raters was moderate (ICC=0.667, pdefinition showed high intra-rater (K>0.601, pdefinition of NF is required. The results of the present study suggest that the proposed new definition increases intra and inter-rater reliability, a critical step for using NFs to quantify fall risk. Copyright © 2013 Elsevier B.V. All rights reserved.

  18. Interrater and Intrarater Reliability of the Balance Computerized Adaptive Test in Patients With Stroke.

    Science.gov (United States)

    Chiang, Hsin-Yu; Lu, Wen-Shian; Yu, Wan-Hui; Hsueh, I-Ping; Hsieh, Ching-Lin

    2018-04-11

    To examine the interrater and intrarater reliability of the Balance Computerized Adaptive Test (Balance CAT) in patients with chronic stroke having a wide range of balance functions. Repeated assessments design (1 wk apart). Seven teaching hospitals. A pooled sample (N=102) including 2 independent groups of outpatients (n=50 for the interrater reliability study; n=52 for the intrarater reliability study) with chronic stroke. Not applicable. Balance CAT. For the interrater reliability study, the values of intraclass correlation coefficient, minimal detectable change (MDC), and percentage of MDC (MDC%) for the Balance CAT were .84, 1.90, and 31.0%, respectively. For the intrarater reliability study, the values of intraclass correlation coefficient, MDC, and MDC% ranged from .89 to .91, from 1.14 to 1.26, and from 17.1% to 18.6%, respectively. The Balance CAT showed sufficient intrarater reliability in patients with chronic stroke having balance functions ranging from sitting with support to independent walking. Although the Balance CAT may have good interrater reliability, we found substantial random measurement error between different raters. Accordingly, if the Balance CAT is used as an outcome measure in clinical or research settings, the same raters are suggested over different time points to ensure reliable assessments. Copyright © 2018 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
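
    The abstract reports MDC and MDC% alongside the intraclass correlation coefficient but does not state how they were derived. A common convention, assumed here, computes the standard error of measurement as SEM = SD × sqrt(1 − ICC) and the 95% minimal detectable change as MDC = 1.96 × sqrt(2) × SEM. The sketch below follows that convention; the function names, the standard deviation, and the reference score are hypothetical.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement under the assumed SD * sqrt(1 - ICC) form."""
    if not 0.0 <= icc <= 1.0:
        raise ValueError("icc must lie in [0, 1]")
    return sd * math.sqrt(1.0 - icc)


def mdc95(sd, icc):
    """95% minimal detectable change: 1.96 * sqrt(2) * SEM (assumed convention)."""
    return 1.96 * math.sqrt(2.0) * sem_from_icc(sd, icc)


def mdc_percent(mdc, reference_score):
    """MDC as a percentage of a reference score (e.g. a mean or maximum score)."""
    return 100.0 * mdc / reference_score


# ICC of .84 is the interrater value from the abstract; the SD of 2.0 and the
# reference score of 6.0 are hypothetical placeholders, not study data.
mdc = mdc95(sd=2.0, icc=0.84)
print(round(mdc, 2), round(mdc_percent(mdc, reference_score=6.0), 1))
```

    Under this convention a lower ICC inflates the SEM and therefore the MDC, which is in line with the abstract's larger MDC for the interrater study (ICC .84) than for the intrarater study (ICC .89 to .91).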

  19. Interrater and Intrarater Reliability of the Tuck Jump Assessment by Health Professionals of Varied Educational Backgrounds

    Directory of Open Access Journals (Sweden)

    Lisa A. Dudley

    2013-01-01

    Objective. The Tuck Jump Assessment (TJA), a clinical plyometric assessment, identifies 10 jumping and landing technique flaws. The study objective was to investigate TJA interrater and intrarater reliability with raters of different educational and clinical backgrounds. Methods. 40 participants were video recorded performing the TJA using published protocol and instructions. Five raters of varied educational and clinical backgrounds scored the TJA. Each score of the 10 technique flaws was summed for the total TJA score. Approximately one month later, 3 raters scored the videos again. Intraclass correlation coefficients determined interrater (5 and 3 raters for first and second session, resp.) and intrarater (3 raters) reliability. Results. Interrater reliability with 5 raters was poor (ICC = 0.47; 95% confidence intervals (CI) 0.33–0.62). Interrater reliability between 3 raters who completed 2 scoring sessions improved from 0.52 (95% CI 0.35–0.68) for session one to 0.69 (95% CI 0.55–0.81) for session two. Intrarater reliability was poor to moderate, ranging from 0.44 (95% CI 0.22–0.68) to 0.72 (95% CI 0.55–0.84). Conclusion. Published protocol and training of raters were insufficient to allow consistent TJA scoring. There may be a learned effect with the TJA since interrater reliability improved with repetition. TJA instructions and training should be modified and enhanced before clinical implementation.

  20. Central nervous system imaging in childhood Langerhans cell histiocytosis – a reference center analysis

    International Nuclear Information System (INIS)

    Porto, Luciana; Schöning, Stefan; Hattingen, Elke; Sörensen, Jan; Jurcoane, Alina; Lehrnbecher, Thomas

    2015-01-01

    The aim of our study was (1) to describe central nervous system (CNS) manifestations in children with Langerhans cell histiocytosis (LCH) based on images sent to a reference center and meeting minimum requirements and (2) to assess the inter-rater agreement of CNS-MRI results, which represents the overall reproducibility of this investigation. We retrospectively reviewed brain MRI examinations in children with LCH, for which MRI minimum requirements were met. Abnormalities were rated by two experienced neuroradiologists, and the inter-rater agreement was assessed. Out of a total of 94 imaging studies, only 31 MRIs met the minimum criteria, which included T2w, FLAIR, T1w images before/after contrast in at least two different section planes, and thin post contrast sagittal slices T1w through the sella. The most common changes were osseous abnormalities, followed by solid enlargement of the pineal gland, thickened enhancing stalk and signal changes of the dentate nucleus. Whereas inter-rater agreement in assessing most of the CNS lesions was relatively high (κ > 0.61), the application of minimum criteria often did not allow evaluation of the posterior pituitary. The diversity of radiological protocols from different institutions leads to difficulties in the diagnosis of CNS abnormalities in children with LCH. Although the inter-rater agreement between neuroradiologists was high, not all the LCH manifestations could be completely ruled out when using the minimum criteria. Brain MRIs should therefore follow LCH guideline protocols and include T1 pre-gadolinium sagittal images, and be centrally reviewed in order to improve the comparison of clinical trials.

  1. Inter-rater reliability of PATH observations for assessment of ergonomic risk factors in hospital work.

    Science.gov (United States)

    Park, Jung-Keun; Boyer, Jon; Tessler, Jamie; Casey, Jeffrey; Schemm, Linda; Gore, Rebecca; Punnett, Laura

    2009-07-01

    This study examined the inter-rater reliability of expert observations of ergonomic risk factors by four analysts. Ten jobs were observed at a hospital using a newly expanded version of the PATH method (Buchholz et al. 1996), to which selected upper extremity exposures had been added. Two of the four raters simultaneously observed each worker onsite for a total of 443 observation pairs containing 18 categorical exposure items each. For most exposure items, kappa coefficients were 0.4 or higher. For some items, agreement was higher both for the jobs with less rapid hand activity and for the analysts with a higher level of ergonomic job analysis experience. These upper extremity exposures could be characterised reliably with real-time observation, given adequate experience and training of the observers. The revised version of PATH is applicable to the analysis of jobs where upper extremity musculoskeletal strain is of concern.

  2. Magnetic resonance cholangiographic (MRCP) features of ischemic-type biliary lesions (ITBL). A case-control study

    International Nuclear Information System (INIS)

    Collettini, F.; Kroencke, T.J.; Renz, D.M.; Bucourt, M. de; Poellinger, A.; Heidenhain, C.; Neuhaus, P.

    2011-01-01

    Purpose: To evaluate the spectrum of MR cholangiography (MRCP) features of ischemic-type biliary lesions (ITBL) after orthotopic liver transplantation (OLT). Materials and Methods: 30 patients (16 m, 14 f) with an average age of 52 years (9 - 69 y) were examined in two 1.5 MR units using breath-hold 2D-SS-FSE-sequences and 3D-MRCP sequences. 20 of the 30 patients had an established ITBL, and the remaining 10 patients were post-OLT controls. MRCPs were evaluated independently by two experienced radiologists who were blinded to the clinical history as well as the results of other imaging modalities. All images were analyzed for the presence of 16 different pathological features. Differences between ITBL patients and controls were analyzed using the Mann-Whitney-U Test. Inter-rater variability was tested using Cohen's kappa test. Results: Abnormal findings of bile ducts were seen in all patients. The most common findings were (in percentage for reader 1/2) intrahepatic bile duct dilatation (95 %/95 %) and extrahepatic bile duct stenoses (95 %/85 %), followed by intrahepatic main duct stenoses (90 %/95 %) and segmental duct stenoses (85 %/85 %). Differences between ITBL patients and controls were significant for most of the analyzed features (Mann-Whitney-U test, p < 0.05). For 12 of 16 features, there was substantial or almost perfect agreement (K = 0.61 - 1.00), for 2 of 16 features moderate agreement (K = 0.41 - 0.60) and for 2 of 16 features fair agreement (K < 0.40). Conclusion: In patients with ITBL, MR cholangiography reveals characteristic features that may allow differentiation from other biliary complications after liver transplantation. (orig.)

  3. Inter-Rater Reliability of Historical Data Collected by Non-Medical Research Assistants and Physicians in Patients with Acute Abdominal Pain

    Directory of Open Access Journals (Sweden)

    Mills, Angela M

    2009-02-01

    OBJECTIVES: In many academic emergency departments (ED), physicians are asked to record clinical data for research that may be time consuming and distracting from patient care. We hypothesized that non-medical research assistants (RAs) could obtain historical information from patients with acute abdominal pain as accurately as physicians. METHODS: Prospective comparative study conducted in an academic ED of 29 RAs to 32 resident physicians (RPs) to assess inter-rater reliability in obtaining historical information in abdominal pain patients. Historical features were independently recorded on standardized data forms by a RA and RP blinded to each other's answers. Discrepancies were resolved by a third person (RA) who asked the patient to state the correct answer on a third questionnaire, constituting the "criterion standard." Inter-rater reliability was assessed using kappa statistics (kappa) and percent crude agreement (CrA). RESULTS: Sixty-five patients were enrolled (mean age 43). Of 43 historical variables assessed, the median agreement was moderate (kappa 0.59 [Interquartile range 0.37-0.69]; CrA 85.9%) and varied across data categories: initial pain location (kappa 0.61 [0.59-0.73]; CrA 87.7%), current pain location (kappa 0.60 [0.47-0.67]; CrA 82.8%), past medical history (kappa 0.60 [0.48-0.74]; CrA 93.8%), associated symptoms (kappa 0.38 [0.37-0.74]; CrA 87.7%), and aggravating/alleviating factors (kappa 0.09 [-0.01-0.21]; CrA 61.5%). When there was disagreement between the RP and the RA, the RA more often agreed with the criterion standard (64% [55-71%]) than the RP (36% [29-45%]). CONCLUSION: Non-medical research assistants who focus on clinical research are often more accurate than physicians, who may be distracted by patient care responsibilities, at obtaining historical information from ED patients with abdominal pain.

  4. The Outdoor MEDIA DOT: The development and inter-rater reliability of a tool designed to measure food and beverage outlets and outdoor advertising.

    Science.gov (United States)

    Poulos, Natalie S; Pasch, Keryn E

    2015-07-01

    Few studies of the food environment have collected primary data, and even fewer have reported reliability of the tool used. This study focused on the development of an innovative electronic data collection tool used to document outdoor food and beverage (FB) advertising and establishments near 43 middle and high schools in the Outdoor MEDIA Study. Tool development used GIS based mapping, an electronic data collection form on handheld devices, and an easily adaptable interface to efficiently collect primary data within the food environment. For the reliability study, two teams of data collectors documented all FB advertising and establishments within one half-mile of six middle schools. Inter-rater reliability was calculated overall and by advertisement or establishment category using percent agreement. A total of 824 advertisements (n=233), establishment advertisements (n=499), and establishments (n=92) were documented (range=8-229 per school). Overall inter-rater reliability of the developed tool ranged from 69-89% for advertisements and establishments. Results suggest that the developed tool is highly reliable and effective for documenting the outdoor FB environment. Copyright © 2015 Elsevier Ltd. All rights reserved.

  5. Cross-Layer Design of the Inter-RAT Handover between UMTS and WiMAX

    Directory of Open Access Journals (Sweden)

    Bertin Philippe

    2010-01-01

    In future mobile networks, different radio access technologies will have to coexist. IEEE 802.21 MIH (Media-Independent Handover) provides primitive mechanisms that ease the implementation of a seamless vertical handover (inter-RAT handover) between different radio access technologies. However, it does not specify any handover execution mechanism. The first objective of this paper is to propose a novel MIHF (Media-Independent Handover Function) variant, which is renamed interworking (IW) sublayer. IW sublayer provides a seamless inter-RAT handover procedure between UMTS and WiMAX systems. It relies on a new intersystem retransmission mechanism with cross-layer interaction ability providing lossless handover while keeping acceptable delays. The second objective of this paper is to design a new TCP snoop agent (TCP Snoop), which interacts with the IW layer in order to mitigate BDP (Bandwidth Delay Product) mismatch and to solve spurious RTO (Retransmission TimeOut) problems. The cross-layer effects on the handover performance are evaluated by simulations. Our results show that cross-layer interaction between IW layer and TCP Snoop smoothes the handover procedure for TCP traffic. Additionally, this novel inter-RAT cross-layer scheme has the merit of keeping existing TCP protocol stacks unchanged.

  6. Novel techniques for enhancement and segmentation of acne vulgaris lesions.

    Science.gov (United States)

    Malik, A S; Humayun, J; Kamel, N; Yap, F B-B

    2014-08-01

    More than 99% of acne patients suffer from acne vulgaris. While diagnosing the severity of acne vulgaris lesions, dermatologists have observed inter-rater and intra-rater variability in diagnosis results. This is because during assessment, identifying lesion types and counting them is a tedious job for dermatologists. To make the assessment job objective and easier for dermatologists, an automated system based on image processing methods is proposed in this study. There are two main objectives: (i) to develop an algorithm for the enhancement of various acne vulgaris lesions; and (ii) to develop a method for the segmentation of enhanced acne vulgaris lesions. For the first objective, an algorithm is developed based on the theory of high dynamic range (HDR) images. The proposed algorithm uses local rank transform to generate the HDR images from a single acne image followed by the log transformation. Then, segmentation is performed by clustering the pixels based on Mahalanobis distance of each pixel from spectral models of acne vulgaris lesions. Two metrics are used to evaluate the enhancement of acne vulgaris lesions, i.e., contrast improvement factor (CIF) and image contrast normalization (ICN). The proposed algorithm is compared with two other methods. The proposed enhancement algorithm shows better results than both of the other methods based on CIF and ICN. In addition, sensitivity and specificity are calculated for the segmentation results. The proposed segmentation method shows higher sensitivity and specificity than other methods. This article specifically discusses the contrast enhancement and segmentation for an automated diagnosis system of acne vulgaris lesions. The results are promising and can be used for further classification of acne vulgaris lesions for final grading of the lesions. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  7. The PRECIS-2 tool has good interrater reliability and modest discriminant validity.

    Science.gov (United States)

    Loudon, Kirsty; Zwarenstein, Merrick; Sullivan, Frank M; Donnan, Peter T; Gágyor, Ildikó; Hobbelen, Hans J S M; Althabe, Fernando; Krishnan, Jerry A; Treweek, Shaun

    2017-08-01

    PRagmatic Explanatory Continuum Indicator Summary (PRECIS)-2 is a tool that could improve design insight for trialists. Our aim was to validate the PRECIS-2 tool, unlike its predecessor, testing the discriminant validity and interrater reliability. Over 80 international trialists, methodologists, clinicians, and policymakers created PRECIS-2 helping to ensure face validity and content validity. The interrater reliability of PRECIS-2 was measured using 19 experienced trialists who used PRECIS-2 to score a diverse sample of 15 randomized controlled trial protocols. Discriminant validity was tested with two raters to independently determine if the trial protocols were more pragmatic or more explanatory, with scores from the 19 raters for the 15 trials as predictors of pragmatism. Interrater reliability was generally good, with seven of nine domains having an intraclass correlation coefficient over 0.65. Flexibility (adherence) and recruitment had wide confidence intervals, but raters found these difficult to rate and wanted more information. Each of the nine PRECIS-2 domains could be used to differentiate between trials taking more pragmatic or more explanatory approaches with better than chance discrimination for all domains. We have assessed the validity and reliability of PRECIS-2. An elaboration study and web site provide guidance to help future users of the tool which is continuing to be tested by trial teams, systematic reviewers, and funders. Copyright © 2017 Elsevier Inc. All rights reserved.

  8. Candida spp. in oral cancer and oral precancerous lesions.

    Science.gov (United States)

    Gall, Francesca; Colella, Giuseppe; Di Onofrio, Valeria; Rossiello, Raffaele; Angelillo, Italo Francesco; Liguori, Giorgio

    2013-07-01

    To assess the presence of Candida spp. in lesions of the oral cavity in a sample of patients with precancer or cancer of the mouth and evaluate the limitations and advantages of microbiological and histological methods, 103 subjects with precancerous or cancerous lesions and not treated were observed between 2007 and 2009. The presence of Candida in the lesions was analyzed by microbiological and histological methods. Cohen's k statistic was used to assess the agreement between culture method and staining techniques. Forty-eight (47%) patients had cancer and 55 (53%) patients had precancerous lesions. Candida spp. were isolated from 31 (30%) patients with cancerous lesions and 33 (32%) with precancerous lesions. C. albicans was the most frequent species isolated in the lesions. The k value showed a fair overall agreement for comparisons between culture method and PAS (0.2825) or GMS (0.3112). This study supports the frequent presence of Candida spp. in cancer and precancerous lesions of the oral cavity. Both microbiological investigations and histological techniques were reliable for detection of Candida spp. It would be desirable for the two techniques to be considered complementary in the detection of yeast infections in these types of lesions.
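
    As an illustration of the agreement statistic named in this abstract, the following is a minimal sketch of unweighted Cohen's kappa for two methods scoring the same lesions. The function name `cohens_kappa` and the ten paired results are hypothetical and are not data from the study; the abstract does not say which software the authors used.

```python
from collections import Counter


def cohens_kappa(ratings_a, ratings_b):
    """Unweighted Cohen's kappa for two raters (or methods) scoring the same items."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("both raters must score the same non-empty set of items")
    n = len(ratings_a)
    # Observed agreement: fraction of items given the same label by both raters.
    p_observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Chance agreement, computed from each rater's marginal label frequencies.
    freq_a = Counter(ratings_a)
    freq_b = Counter(ratings_b)
    p_expected = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    if p_expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (p_observed - p_expected) / (1 - p_expected)


# Hypothetical results for 10 lesions (1 = Candida detected, 0 = not detected).
culture = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]
stain = [1, 1, 1, 0, 0, 1, 0, 0, 0, 0]
kappa = cohens_kappa(culture, stain)
print(f"kappa = {kappa:.2f}")
```

    In this made-up example the two methods agree on 7 of 10 lesions, while agreement expected by chance is 0.5, so kappa is well below the raw agreement of 0.7.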

  9. MRI of the stener lesion

    International Nuclear Information System (INIS)

    Haramati, N.; Hiller, N.; Dowdle, J.; Jacobson, M.; Barax, C.N.; Lieberfarb, R.I.; Lester, B.; Kulick, R.G.

    1995-01-01

    Six cadaver thumbs had ulnar collateral ligament (UCL) tears created surgically. MR examinations (2D STIR and 3D GRASS) were performed identically on all specimens both before displacement (non-Stener) and after displacement (Stener lesion) of the UCL. The MR images were then randomly numbered. Each image was evaluated separately in blinded fashion by four musculoskeletal radiologists for the presence or absence of a Stener lesion. Each radiologist reinterpreted the images after an interval of several days. The interpretation was based on previously published criteria for Stener lesion diagnosis by MR. The sensitivity of GRASS ranged from 0.17 to 0.67 with the most experienced reader scoring the lowest. The specificity of GRASS ranged from 0.33 to 1.0 (most experienced reader 0.67, 0.83). STIR had a sensitivity of 0.00-0.17 and a specificity of 0.53-0.83. The K values for inter- and intraobserver agreement were measured. The intraobserver K for GRASS was 0.27-0.75 (most experienced reader 0.75). 2D imaging is probably inadequate for the evaluation of Stener lesions. The most likely reason is that the STIR slice thickness of 3 mm limits resolution of small UCLs. The poor sensitivity and specificity of GRASS as well as poor interobserver agreement suggest that MR may not be sufficiently accurate for Stener lesion evaluation. (orig./MG)
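
    The sensitivity and specificity ranges quoted above can be read as simple proportions. The sketch below assumes each reader scores six Stener and six non-Stener images per sequence, which is consistent with the six specimens imaged before and after displacement; the reader calls are hypothetical, not the study's data, and `sensitivity_specificity` is an illustrative name.

```python
def sensitivity_specificity(truth, calls):
    """Return (sensitivity, specificity) for paired boolean truth labels and reader calls."""
    if len(truth) != len(calls):
        raise ValueError("truth and calls must have the same length")
    pairs = list(zip(truth, calls))
    true_pos = sum(1 for t, c in pairs if t and c)
    false_neg = sum(1 for t, c in pairs if t and not c)
    true_neg = sum(1 for t, c in pairs if not t and not c)
    false_pos = sum(1 for t, c in pairs if not t and c)
    if true_pos + false_neg == 0 or true_neg + false_pos == 0:
        raise ValueError("need at least one positive and one negative case")
    sensitivity = true_pos / (true_pos + false_neg)
    specificity = true_neg / (true_neg + false_pos)
    return sensitivity, specificity


# Assumed design: 6 images with a Stener lesion, 6 without.
truth = [True] * 6 + [False] * 6
# Hypothetical reader: finds 4 of 6 lesions and wrongly flags 1 of 6 normal images.
calls = [True, True, True, True, False, False,
         False, False, False, False, False, True]
sens, spec = sensitivity_specificity(truth, calls)
print(f"sensitivity = {sens:.2f}, specificity = {spec:.2f}")
```

    With only six cases per class, each reading shifts either figure by about 0.17, which helps explain why the reported ranges are so wide.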

  10. Test-retest and interrater reliability of the functional lower extremity evaluation.

    Science.gov (United States)

    Haitz, Karyn; Shultz, Rebecca; Hodgins, Melissa; Matheson, Gordon O

    2014-12-01

    Repeated-measures clinical measurement reliability study. To establish the reliability and face validity of the Functional Lower Extremity Evaluation (FLEE). The FLEE is a 45-minute battery of 8 standardized functional performance tests that measures 3 components of lower extremity function: control, power, and endurance. The reliability and normative values for the FLEE in healthy athletes are unknown. A face validity survey for the FLEE was sent to sports medicine personnel to evaluate the level of importance and frequency of clinical usage of each test included in the FLEE. The FLEE was then administered and rated for 40 uninjured athletes. To assess test-retest reliability, each athlete was tested twice, 1 week apart, by the same rater. To assess interrater reliability, 3 raters scored each athlete during 1 of the testing sessions. Intraclass correlation coefficients were used to assess the test-retest and interrater reliability of each of the FLEE tests. In the face validity survey, the FLEE tests were rated as highly important by 58% to 71% of respondents but frequently used by only 26% to 45% of respondents. Interrater reliability intraclass correlation coefficients ranged from 0.83 to 1.00, and test-retest reliability ranged from 0.71 to 0.95. The FLEE tests are considered clinically important for assessing lower extremity function by sports medicine personnel but are underused. The FLEE also is a reliable assessment tool. Future studies are required to determine if use of the FLEE to make return-to-play decisions may reduce reinjury rates.

  11. The Radiographic Union Score for Hip (RUSH): the use of a checklist to evaluate hip fracture healing improves agreement between radiologists and orthopedic surgeons

    Energy Technology Data Exchange (ETDEWEB)

    Chiavaras, Mary M. [McMaster University, Department of Radiology, Hamilton, Ontario (Canada); Hamilton General Hospital, Department of Diagnostic Imaging, Hamilton, Ontario (Canada)]; Bains, Simrit [University of Western Ontario Medical School, London, Ontario (Canada)]; Choudur, Hema; Parasu, Naveen [McMaster University, Department of Radiology, Hamilton, Ontario (Canada)]; Jacobson, Jon [University of Michigan, Department of Radiology, Ann Arbor, MI (United States)]; Ayeni, Olufemi; Petrisor, Brad; Sprague, Sheila; Bhandari, Mohit [McMaster University, Department of Orthopedic Surgery, Hamilton, Ontario (Canada)]; Chakravertty, Rajesh [University of Toronto, Department of Orthopedic Surgery, Toronto, Ontario (Canada)]

    2013-08-15

    The assessment of fracture healing following intertrochanteric fracture fixation is highly variable with no validated standards. Agreement with respect to fracture healing following surgery is important for optimal patient management. The purpose of this study was to (1) assess reliability of intertrochanteric fracture healing assessment and (2) determine if a novel radiographic scoring system for hip fractures improves agreement between radiologists and orthopedic surgeons. A panel of three radiologists and three orthopedic surgeons assessed fracture healing in 150 cases of intertrochanteric fractures at two separate time points to determine inter-rater and intra-rater agreement. Reviewers, blinded to the time after injury, first subjectively assessed overall healing using frontal and lateral radiographs for each patient at a single time point. Reviewers then scored each fracture using a Radiographic Union Score for Hip (RUSH) form to determine whether this improves agreement regarding hip fracture healing. Inter-rater agreement for the overall subjective impression of fracture healing between reviewer groups was only fair (intraclass coefficient [ICC] = 0.34, 95 % CI: 0.11-0.52). Use of the RUSH score improved overall agreement between groups to substantial (ICC = 0.66, 95 % CI: 0.53-0.75). Across reviewers, healing of the medial cortex and overall RUSH score itself demonstrated high correlations with overall perceptions of healing (r = 0.53 and r = 0.72, respectively). The RUSH score improves agreement of fracture healing assessment between orthopedic surgeons and radiologists, offers a systematic approach to evaluating intertrochanteric hip fracture radiographs, and may ultimately provide prognostic information that could predict healing outcomes in patients with femoral neck fractures. (orig.)

  12. The Radiographic Union Score for Hip (RUSH): the use of a checklist to evaluate hip fracture healing improves agreement between radiologists and orthopedic surgeons

    International Nuclear Information System (INIS)

    Chiavaras, Mary M.; Bains, Simrit; Choudur, Hema; Parasu, Naveen; Jacobson, Jon; Ayeni, Olufemi; Petrisor, Brad; Sprague, Sheila; Bhandari, Mohit; Chakravertty, Rajesh

    2013-01-01

    The assessment of fracture healing following intertrochanteric fracture fixation is highly variable with no validated standards. Agreement with respect to fracture healing following surgery is important for optimal patient management. The purpose of this study was to (1) assess reliability of intertrochanteric fracture healing assessment and (2) determine if a novel radiographic scoring system for hip fractures improves agreement between radiologists and orthopedic surgeons. A panel of three radiologists and three orthopedic surgeons assessed fracture healing in 150 cases of intertrochanteric fractures at two separate time points to determine inter-rater and intra-rater agreement. Reviewers, blinded to the time after injury, first subjectively assessed overall healing using frontal and lateral radiographs for each patient at a single time point. Reviewers then scored each fracture using a Radiographic Union Score for Hip (RUSH) form to determine whether this improves agreement regarding hip fracture healing. Inter-rater agreement for the overall subjective impression of fracture healing between reviewer groups was only fair (intraclass coefficient [ICC] = 0.34, 95 % CI: 0.11-0.52). Use of the RUSH score improved overall agreement between groups to substantial (ICC = 0.66, 95 % CI: 0.53-0.75). Across reviewers, healing of the medial cortex and overall RUSH score itself demonstrated high correlations with overall perceptions of healing (r = 0.53 and r = 0.72, respectively). The RUSH score improves agreement of fracture healing assessment between orthopedic surgeons and radiologists, offers a systematic approach to evaluating intertrochanteric hip fracture radiographs, and may ultimately provide prognostic information that could predict healing outcomes in patients with femoral neck fractures. (orig.)

  13. Concurrent validity and interrater reliability of a new smartphone application to assess 3D active cervical range of motion in patients with neck pain.

    Science.gov (United States)

    Stenneberg, Martijn S; Busstra, Harm; Eskes, Michel; van Trijffel, Emiel; Cattrysse, Erik; Scholten-Peeters, Gwendolijne G M; de Bie, Rob A

    2018-04-01

    There is a lack of valid, reliable, and feasible instruments for measuring planar active cervical range of motion (aCROM) and associated 3D coupling motions in patients with neck pain. Smartphones have advanced sensors and appear to be suitable for these measurements. To estimate the concurrent validity and interrater reliability of a new iPhone application for assessing planar aCROM and associated 3D coupling motions in patients with neck pain, using an electromagnetic tracking device as a reference test. Cross-sectional study. Two samples of neck pain patients were recruited; 30 patients for the validity study and 26 patients for the reliability study. Validity was estimated using intraclass correlation coefficients (ICCs), and by calculating 95% limits of agreement (LoA). To estimate interrater reliability, ICCs were calculated. Cervical 3D coupling motions were analyzed by calculating the cross-correlation coefficients and ratio between the main motions and coupled motions for both instruments. ICCs for concurrent validity and interrater reliability ranged from 0.90 to 0.99. The width of the 95% LoA ranged from about 5° for right lateral bending to 11° for total rotation. No significant differences were found between both devices for associated coupling motion analysis. The iPhone application appears to be a useful discriminative tool for the measurement of planar aCROM and associated coupling motions in patients with neck pain. It fulfills the need for a valid, reliable, and feasible instrument in clinical practice and research. Therapists and researchers should consider measurement error when interpreting scores. Copyright © 2017 Elsevier Ltd. All rights reserved.
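
    The 95% limits of agreement mentioned above are conventionally computed, following Bland and Altman, as the mean paired difference plus or minus 1.96 standard deviations of the differences. The sketch below assumes that convention; the abstract does not spell out the exact computation. The paired rotation angles are hypothetical and `limits_of_agreement` is an illustrative name.

```python
from statistics import mean, stdev


def limits_of_agreement(method_a, method_b):
    """Return (lower, upper) 95% limits of agreement for paired measurements."""
    if len(method_a) != len(method_b) or len(method_a) < 2:
        raise ValueError("need at least two paired measurements")
    # Paired differences between the two instruments.
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)
    # Assumes the differences are roughly normally distributed.
    half_width = 1.96 * stdev(diffs)
    return bias - half_width, bias + half_width


# Hypothetical cervical rotation in degrees for five patients.
phone_app = [70.0, 65.0, 80.0, 75.0, 60.0]
reference_tracker = [68.0, 66.0, 77.0, 76.0, 59.0]
lower, upper = limits_of_agreement(phone_app, reference_tracker)
print(f"95% LoA: {lower:.1f} to {upper:.1f} degrees (width {upper - lower:.1f})")
```

    The width `upper - lower` is the quantity the abstract reports as ranging from about 5° to 11°; a narrow width means the two instruments rarely differ by much for any single patient.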

  14. Feasibility and Inter-Rater Reliability of Physical Performance Measures in Acutely Admitted Older Medical Patients

    DEFF Research Database (Denmark)

    Bodilsen, Ann Christine; Juul-Larsen, Helle Gybel; Petersen, Janne

    2015-01-01

    OBJECTIVE: Physical performance measures can be used to predict functional decline and increased dependency in older persons. However, few studies have assessed the feasibility or reliability of such measures in hospitalized older patients. Here we assessed the feasibility and inter-rater reliability of four simple measures of physical performance in acutely admitted older medical patients. DESIGN: During the first 24 hours of hospitalization, the following were assessed twice by different raters in 52 (≥ 65 years) patients admitted for acute medical illness: isometric hand grip strength, 4......, and 30-s chair stand were 8%, 7%, and 18%, and the SRD95% values were 22%, 17%, and 49%. CONCLUSION: In acutely admitted older medical patients, grip strength, gait speed, and the Cumulated Ambulation Score measurements were feasible and showed high inter-rater reliability when administered by different...

  15. An evaluation by midwives and gynecologists of treatability of cervical lesions by cryotherapy among human papillomavirus-positive women.

    Science.gov (United States)

    Gage, Julia C; Rodriguez, Ana Cecilia; Schiffman, Mark; Adadevoh, Sydney; Larraondo, Manuel J Alvarez; Chumworathayi, Bandit; Lejarza, Sandra Vargas; Araya, Luis Villegas; Garcia, Francisco; Budihas, Scott R; Long, Rodney; Katki, Hormuzd A; Herrero, Rolando; Burk, Robert D; Jeronimo, Jose

    2009-05-01

    To estimate efficacy of a visual triage of human papillomavirus (HPV)-positive women to either immediate cryotherapy or referral if not treatable (eg, invasive cancer, large precancers). We evaluated visual triage in the HPV-positive women aged 25 to 55 years from the 10,000-woman Guanacaste Cohort Study (n = 552). Twelve Peruvian midwives and 5 international gynecologists assessed treatability by cryotherapy using digitized high-resolution cervical images taken at enrollment. The reference standard of treatability was determined by 2 lead gynecologists from the entire 7-year follow-up of the women. Women diagnosed with histologic cervical intraepithelial neoplasia grade 2 or worse or 5-year persistence of carcinogenic HPV infection were defined as needing treatment. Midwives and gynecologists judged 30.8% and 41.2% of women not treatable by cryotherapy, respectively (P cryotherapy. The proportion of women judged not treatable by a reviewer varied widely and ranged from 18.6% to 61.1%. Interrater agreement was poor with mean pairwise overall agreement of 71.4% and 66.3% and kappa's of 0.33 and 0.30 for midwives and gynecologists, respectively. In future "screen-and-treat" cervical cancer prevention programs using HPV testing and cryotherapy, practitioners will visually triage HPV-positive women. The suboptimal performance of visual triage suggests that screen-and-treat programs using cryotherapy might be insufficient for treating precancerous lesions. Improved, low-technology triage methods and/or improved safe and low-technology treatment options are needed.

  16. Inter-rater reliability of three musculoskeletal physical examination techniques used to assess motion in three planes while standing.

    Science.gov (United States)

    Prather, Heidi; Hunt, Devyani; Steger-May, Karen; Hayes, Marcie Harris; Knaus, Evan; Clohisy, John

    2009-07-01

    The objective of the study was to measure the reliability between examiners of 3 basic maneuvers of the Total Body Functional Profile physical examination test. The hypothesis was musculoskeletal health care providers of different disciplines could reliably use the 3 basic maneuvers as part of the musculoskeletal physical examination. A prospective observational study was conducted. Twenty-eight adult volunteers were measured on both the left and right side by 2 independent raters on a single occasion. The subjects were recruited through advertisements placed by the orthopedic department at a tertiary university. Twenty-eight volunteers were recruited and completed the study. The volunteers were between the ages of 18 and 51 years of age, had no symptoms in the lower extremity or spine, had no previous history of surgery or tumor involving the lower extremity, and no medical conditions that would preclude participation. On a single occasion, 2 examiners per 1 volunteer were blinded to their own and each others' measurements. Each examiner assessed the distance of frontal and sagittal plane lunge and angle of motion for transverse plane testing. Inter-rater agreement is expressed with intraclass correlation coefficients (ICCs) and corresponding 95% confidence intervals (CIs). The difference between raters is reported with 95% CIs. Baseline demographics, University of California Los Angeles (UCLA), and Harris hip questionnaires were completed by all participants. The UCLA and Harris hip scores showed no significant activity restrictions or pain limitations in all participants. The inter-rater reliability for sagittal, frontal, and transverse plane matrix testing was good with ICCs of 0.86 (95% CI 0.77-0.91), 0.90 (95% CI 0.84-0.94), and 0.85 (95% CI 0.75-0.91), respectively. The rater reliability between disciplines for transverse, sagittal, and frontal plane matrix testing was good with ICCs of 0.89 (95% CI 0.80-0.94), 0.88 (95% CI 0.79-0.94), and 0.90 (95% CI 0

  17. Grant Peer Review: Improving Inter-Rater Reliability with Training.

    Science.gov (United States)

    Sattler, David N; McKnight, Patrick E; Naney, Linda; Mathis, Randy

    2015-01-01

    This study developed and evaluated a brief training program for grant reviewers that aimed to increase inter-rater reliability, rating scale knowledge, and effort to read the grant review criteria. Enhancing reviewer training may improve the reliability and accuracy of research grant proposal scoring and funding recommendations. Seventy-five Public Health professors from U.S. research universities watched the training video we produced and assigned scores to the National Institutes of Health scoring criteria proposal summary descriptions. For both novice and experienced reviewers, the training video increased scoring accuracy (the percentage of scores that reflect the true rating scale values), inter-rater reliability, and the amount of time reading the review criteria compared to the no video condition. The increase in reliability for experienced reviewers is notable because it is commonly assumed that reviewers--especially those with experience--have good understanding of the grant review rating scale. The findings suggest that both experienced and novice reviewers who had not received the type of training developed in this study may not have appropriate understanding of the definitions and meaning for each value of the rating scale and that experienced reviewers may overestimate their knowledge of the rating scale. The results underscore the benefits of and need for specialized peer reviewer training.

  18. International Endometrial Tumor Analysis (IETA) terminology in women with postmenopausal bleeding and sonographic endometrial thickness ≥ 4.5 mm: agreement and reliability study.

    Science.gov (United States)

    Sladkevicius, P; Installé, A; Van Den Bosch, T; Timmerman, D; Benacerraf, B; Jokubkiene, L; Di Legge, A; Votino, A; Zannoni, L; De Moor, B; De Cock, B; Van Calster, B; Valentin, L

    2018-02-01

    To estimate intra- and interrater agreement and reliability with regard to describing ultrasound images of the endometrium using the International Endometrial Tumor Analysis (IETA) terminology. Four expert and four non-expert raters assessed videoclips of transvaginal ultrasound examinations of the endometrium obtained from 99 women with postmenopausal bleeding and sonographic endometrial thickness ≥ 4.5 mm but without fluid in the uterine cavity. The following features were rated: endometrial echogenicity, endometrial midline, bright edge, endometrial-myometrial junction, color score, vascular pattern, irregularly branching vessels and color splashes. The color content of the endometrial scan was estimated using a visual analog scale graded from 0 to 100. To estimate intrarater agreement and reliability, the same videoclips were assessed twice with a minimum of 2 months' interval. The raters were blinded to their own results and to those of the other raters. Interrater differences in the described prevalence of most IETA variables were substantial, and some variable categories were observed rarely. Specific agreement was poor for variables with many categories. For binary variables, specific agreement was better for absence than for presence of a category. For variables with more than two outcome categories, specific agreement for expert and non-expert raters was best for not-defined endometrial midline (93% and 96%), regular endometrial-myometrial junction (72% and 70%) and three-layer endometrial pattern (67% and 56%). The grayscale ultrasound variable with the best reliability was uniform vs non-uniform echogenicity (multirater kappa (κ), 0.55 for expert and 0.52 for non-expert raters), and the variables with the lowest reliability were appearance of the endometrial-myometrial junction (κ, 0.25 and 0.16) and the nine-category endometrial echogenicity variable (κ, 0.29 and 0.28). The most reliable color Doppler variable was color score (mean weighted

  19. Seven-Tesla Magnetization Transfer Imaging to Detect Multiple Sclerosis White Matter Lesions.

    Science.gov (United States)

    Chou, I-Jun; Lim, Su-Yin; Tanasescu, Radu; Al-Radaideh, Ali; Mougin, Olivier E; Tench, Christopher R; Whitehouse, William P; Gowland, Penny A; Constantinescu, Cris S

    2018-03-01

    Fluid-attenuated inversion recovery (FLAIR) imaging at 3 Tesla (T) field strength is the most sensitive modality for detecting white matter lesions in multiple sclerosis. While 7T FLAIR is effective in detecting cortical lesions, it has not been fully optimized for visualization of white matter lesions and thus has not been used for delineating lesions in quantitative magnetic resonance imaging (MRI) studies of the normal appearing white matter in multiple sclerosis. Therefore, we aimed to evaluate the sensitivity of 7T magnetization-transfer-weighted (MTw) images in the detection of white matter lesions compared with 3T-FLAIR. Fifteen patients with clinically isolated syndrome, 6 with multiple sclerosis, and 10 healthy participants were scanned with 7T 3-dimensional (D) MTw and 3T-2D-FLAIR sequences on the same day. White matter lesions visible on either sequence were delineated. Of 662 lesions identified on 3T-2D-FLAIR images, 652 were detected on 7T-3D-MTw images (sensitivity, 98%; 95% confidence interval, 97% to 99%). The Spearman correlation coefficient between lesion loads estimated by the two sequences was .910. The intrarater and interrater reliability for 7T-3D-MTw images was good with an intraclass correlation coefficient (ICC) of 98.4% and 81.8%, which is similar to that for 3T-2D-FLAIR images (ICC 96.1% and 96.7%). Seven-Tesla MTw sequences detected most of the white matter lesions identified by FLAIR at 3T. This suggests that 7T-MTw imaging is a robust alternative for detecting demyelinating lesions in addition to 3T-FLAIR. Future studies need to compare the roles of optimized 7T-FLAIR and of 7T-MTw imaging. © 2017 The Authors. Journal of Neuroimaging published by Wiley Periodicals, Inc. on behalf of American Society of Neuroimaging.

  20. The Surgical Safety Checklist and Teamwork Coaching Tools: a study of inter-rater reliability.

    Science.gov (United States)

    Huang, Lyen C; Conley, Dante; Lipsitz, Stu; Wright, Christopher C; Diller, Thomas W; Edmondson, Lizabeth; Berry, William R; Singer, Sara J

    2014-08-01

    To assess the inter-rater reliability (IRR) of two novel observation tools for measuring surgical safety checklist performance and teamwork. Surgical safety checklists can promote adherence to standards of care and improve teamwork in the operating room. Their use has been associated with reductions in mortality and other postoperative complications. However, checklist effectiveness depends on how well they are performed. Authors from the Safe Surgery 2015 initiative developed a pair of novel observation tools through literature review, expert consultation and end-user testing. In one South Carolina hospital participating in the initiative, two observers jointly attended 50 surgical cases and independently rated surgical teams using both tools. We used descriptive statistics to measure checklist performance and teamwork at the hospital. We assessed IRR by measuring percent agreement, Cohen's κ, and weighted κ scores. The overall percent agreement and κ between the two observers was 93% and 0.74 (95% CI 0.66 to 0.79), respectively, for the Checklist Coaching Tool and 86% and 0.84 (95% CI 0.77 to 0.90) for the Surgical Teamwork Tool. Percent agreement for individual sections of both tools was 79% or higher. Additionally, κ scores for six of eight sections on the Checklist Coaching Tool and for two of five domains on the Surgical Teamwork Tool achieved the desired 0.7 threshold. However, teamwork scores were high and variation was limited. There were no significant changes in the percent agreement or κ scores between the first 10 and last 10 cases observed. Both tools demonstrated substantial IRR and required limited training to use. These instruments may be used to observe checklist performance and teamwork in the operating room. However, further refinement and calibration of observer expectations, particularly in rating teamwork, could improve the utility of the tools. Published by the BMJ Publishing Group Limited. For permission to use (where not already

  1. Interrater reliability of schizoaffective disorder compared with schizophrenia, bipolar disorder, and unipolar depression - A systematic review and meta-analysis.

    Science.gov (United States)

    Santelmann, Hanno; Franklin, Jeremy; Bußhoff, Jana; Baethge, Christopher

    2016-10-01

    Schizoaffective disorder is a common diagnosis in clinical practice but its nosological status has been subject to debate ever since it was conceptualized. Although it is key that diagnostic reliability is sufficient, schizoaffective disorder has been reported to have low interrater reliability. Evidence based on systematic review and meta-analysis methods, however, is lacking. Using a highly sensitive literature search in Medline, Embase, and PsycInfo we identified studies measuring the interrater reliability of schizoaffective disorder in comparison to schizophrenia, bipolar disorder, and unipolar disorder. Out of 4126 records screened we included 25 studies reporting on 7912 patients diagnosed by different raters. The interrater reliability of schizoaffective disorder was moderate (meta-analytic estimate of Cohen's kappa 0.57 [95% CI: 0.41-0.73]), and substantially lower than that of its main differential diagnoses (difference in kappa between 0.22 and 0.19). Although there was considerable heterogeneity, analyses revealed that the interrater reliability of schizoaffective disorder was consistently lower in the overwhelming majority of studies. The results remained robust in subgroup and sensitivity analyses (e.g., diagnostic manual used) as well as in meta-regressions (e.g., publication year) and analyses of publication bias. Clinically, the results highlight the particular importance of diagnostic re-evaluation in patients diagnosed with schizoaffective disorder. They also quantify a widely held clinical impression of lower interrater reliability and agree with earlier meta-analysis reporting low test-retest reliability. Copyright © 2016. Published by Elsevier B.V.

  2. Agreement and reproducibility in identification of endometriosis using magnetic resonance imaging

    International Nuclear Information System (INIS)

    Saba, Luca; Sulcis, Rosa; Mallarini, Giorgio; Guerriero, Stefano; Ajossa, Silvia; Melis, Gianbenedetto

    2010-01-01

    Background: Magnetic resonance imaging (MRI) is a promising technique in the study of endometriosis, allowing a complete mapping of lesions before surgery. However, the value of MRI in the diagnosis of endometriosis in the bladder, in superficial peritoneal lesions, and in ovarian foci and uterosacral ligaments is still under debate. Purpose: To assess inter- and intra-observer agreement in the evaluation of endometriosis in different anatomical locations using MRI. Material and Methods: From June 2006 to February 2008, 83 female patients (mean age 39, range 19-49 years) who had undergone MRI examination for suspected endometriosis were evaluated by two radiologists. MRI at 1.5 Tesla was performed with SE and TSE sequences, T1- and T2-weighted with and without fat suppression. Each examination was completed with gadolinium administration. Each dataset was independently evaluated by the radiologists for the presence or absence of endometriosis. The location (ovaries, uterosacral ligaments (USLs), pouch of Douglas, vagina, rectosigmoid, rectovaginal septum, and bladder) of suspected lesions was recorded. Cohen kappa statistical analysis was performed to calculate agreement between measurements. After 2 months the data were analyzed again by the two observers to assess intra-observer agreement. Results: Of the 83 MRI examinations performed, 12 patients demonstrated no evidence of endometriosis. In the remaining 71 studies, 157 endometriotic lesions ranging in size from 0.4 to 6.2 cm were detected. Of the 157 lesions, 53 (33.75% incidence) were smaller than 1 cm. In the ovaries, the inter-observer agreement was 92.77% and the kappa value was 0.802 (95% CI, 0.695-0.91). In the bladder the inter-observer agreement was 96.39% and the kappa value was 0.553 (95% CI 0.056-1). In the USLs the inter-observer agreement was 90.96% and the kappa value 0.583 (95% CI, 0.381-0.784). In the rectovaginal septum the inter-observer agreement was 94.58% and the kappa value 0.739 (95% CI

  3. Agreement and reproducibility in identification of endometriosis using magnetic resonance imaging

    Energy Technology Data Exchange (ETDEWEB)

    Saba, Luca; Sulcis, Rosa; Mallarini, Giorgio (Dept. of Science of the Images, Azienda Ospedaliero Universitaria di Cagliari, Cagliari (Italy)), e-mail: lucasaba@tiscali.it; Guerriero, Stefano; Ajossa, Silvia; Melis, Gianbenedetto (Dept. of Gynaecology, Azienda Ospedaliero Universitaria di Cagliari, Cagliari (Italy))

    2010-06-15

    Background: Magnetic resonance imaging (MRI) is a promising technique in the study of endometriosis, allowing a complete mapping of lesions before surgery. However, the value of MRI in the diagnosis of endometriosis in the bladder, in superficial peritoneal lesions, and in ovarian foci and uterosacral ligaments is still under debate. Purpose: To assess inter- and intra-observer agreement in the evaluation of endometriosis in different anatomical locations using MRI. Material and Methods: From June 2006 to February 2008, 83 female patients (mean age 39, range 19-49 years) who had undergone MRI examination for suspected endometriosis were evaluated by two radiologists. MRI at 1.5 Tesla was performed with SE and TSE sequences, T1- and T2-weighted with and without fat suppression. Each examination was completed with gadolinium administration. Each dataset was independently evaluated by the radiologists for the presence or absence of endometriosis. The location (ovaries, uterosacral ligaments (USLs), pouch of Douglas, vagina, rectosigmoid, rectovaginal septum, and bladder) of suspected lesions was recorded. Cohen kappa statistical analysis was performed to calculate agreement between measurements. After 2 months the data were analyzed again by the two observers to assess intra-observer agreement. Results: Of the 83 MRI examinations performed, 12 patients demonstrated no evidence of endometriosis. In the remaining 71 studies, 157 endometriotic lesions ranging in size from 0.4 to 6.2 cm were detected. Of the 157 lesions, 53 (33.75% incidence) were smaller than 1 cm. In the ovaries, the inter-observer agreement was 92.77% and the kappa value was 0.802 (95% CI, 0.695-0.91). In the bladder the inter-observer agreement was 96.39% and the kappa value was 0.553 (95% CI 0.056-1). In the USLs the inter-observer agreement was 90.96% and the kappa value 0.583 (95% CI, 0.381-0.784). 
    In the rectovaginal septum the inter-observer agreement was 94.58% and the kappa value 0.739 (95% CI

  4. Intra and inter-rater reliability study of pelvic floor muscle dynamometric measurements

    Directory of Open Access Journals (Sweden)

    Natalia M. Martinho

    2015-04-01

    OBJECTIVE: The aim of this study was to evaluate the intra- and inter-rater reliability of pelvic floor muscle (PFM) dynamometric measurements for maximum and average strengths, as well as endurance. METHOD: A convenience sample of 18 nulliparous women, without any urogynecological complaints, aged between 19 and 31 (mean age of 25.4±3.9) participated in this study. They were evaluated using a pelvic floor dynamometer based on load cell technology. The dynamometric evaluations were repeated in three successive sessions: two on the same day with a rest period of 30 minutes between them, and the third on the following day. All participants were evaluated twice in each session; first by examiner 1 followed by examiner 2. The vaginal dynamometry data were analyzed using three parameters: maximum strength, average strength, and endurance. The Intraclass Correlation Coefficient (ICC) was applied to estimate the PFM dynamometric measurement reliability, considering a good level as being above 0.75. RESULTS: The intra- and inter-rater analyses showed good reliability for maximum strength (ICCintra-rater1=0.96, ICCintra-rater2=0.95, and ICCinter-rater=0.96), average strength (ICCintra-rater1=0.96, ICCintra-rater2=0.94, and ICCinter-rater=0.97), and endurance (ICCintra-rater1=0.88, ICCintra-rater2=0.86, and ICCinter-rater=0.92) dynamometric measurements. CONCLUSIONS: The PFM dynamometric measurements showed good intra- and inter-rater reliability for maximum strength, average strength and endurance, which demonstrates that this is a reliable device that can be used in clinical practice.
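
    The record above reports ICC values without stating which ICC form was used. As a rough illustration only, the sketch below computes one common form, the two-way random-effects, absolute-agreement, single-measure ICC (often written ICC(2,1)), from ANOVA mean squares. The function name `icc_2_1` and the sample strength readings are hypothetical and are not taken from the study.

```python
def icc_2_1(scores):
    """ICC(2,1): two-way random effects, absolute agreement, single measure.

    scores: list of rows, one row per subject, one rating per rater.
    Assumes a complete table (every rater rated every subject).
    """
    n = len(scores)          # number of subjects
    k = len(scores[0])       # number of raters
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(scores[i][j] for i in range(n)) / n for j in range(k)]

    # Sums of squares for subjects (rows), raters (columns), and residual.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)                 # between-subject mean square
    msc = ss_cols / (k - 1)                 # between-rater mean square
    mse = ss_err / ((n - 1) * (k - 1))      # residual mean square
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


# Hypothetical readings: five participants, two examiners.
hypothetical = [[4.1, 4.3], [6.0, 5.8], [3.2, 3.5], [7.4, 7.1], [5.5, 5.6]]
print(round(icc_2_1(hypothetical), 3))
```

    Because this form penalizes systematic offsets between raters, two examiners who differ by a constant amount score below 1 even when their rankings are identical.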

  5. Evaluation of the reliability and accuracy of using cone-beam computed tomography for diagnosing periapical cysts from granulomas.

    Science.gov (United States)

    Guo, Jing; Simon, James H; Sedghizadeh, Parish; Soliman, Osman N; Chapman, Travis; Enciso, Reyes

    2013-12-01

    The purpose of this study was to evaluate the reliability and accuracy of cone-beam computed tomographic (CBCT) imaging against the histopathologic diagnosis for the differential diagnosis of periapical cysts (cavitated lesions) from (solid) granulomas. Thirty-six periapical lesions were imaged using CBCT scans. Apicoectomy surgeries were conducted for histopathological examination. Evaluator 1 examined each CBCT scan for the presence of 6 radiologic characteristics of a cyst (ie, location, periphery, shape, internal structure, effects on surrounding structure, and perforation of the cortical plate). Not every cyst showed all radiologic features (eg, not all cysts perforate the cortical plate). For the purpose of finding the minimum number of diagnostic criteria present in a scan to diagnose a lesion as a cyst, we conducted 6 receiver operating characteristic curve analyses comparing CBCT diagnoses with the histopathologic diagnosis. Two other independent evaluators examined the CBCT lesions. Statistical tests were conducted to examine the accuracy, inter-rater reliability, and intrarater reliability of CBCT images. Findings showed that a score of ≥4 positive findings was the optimal scoring system. The accuracies of differential diagnoses of 3 evaluators were moderate (area under the curve = 0.76, 0.70, and 0.69 for evaluators 1, 2, and 3, respectively). The inter-rater agreement of the 3 evaluators was excellent (α = 0.87). The intrarater agreement was good to excellent (κ = 0.71, 0.76, and 0.77). CBCT images can provide a moderately accurate diagnosis between cysts and granulomas. Copyright © 2013 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.

  6. Development and inter-rater reliability of a standardized verbal instruction manual for the Chinese Geriatric Depression Scale-short form.

    Science.gov (United States)

    Wong, M T P; Ho, T P; Ho, M Y; Yu, C S; Wong, Y H; Lee, S Y

    2002-05-01

    The Geriatric Depression Scale (GDS) is a common screening tool for elderly depression in Hong Kong. This study aimed at (1) developing a standardized manual for the verbal administration and scoring of the GDS-SF, and (2) comparing the inter-rater reliability between the standardized and non-standardized verbal administration of GDS-SF. Two studies were reported. In Study 1, the process of developing the manual was described. In Study 2, we compared the inter-rater reliabilities of GDS-SF scores using the standardized verbal instructions and the traditional non-standardized administration. Results of Study 2 indicated that the standardized procedure in verbal administration and scoring improved the inter-rater reliabilities of GDS-SF. Copyright 2002 John Wiley & Sons, Ltd.

  7. [Inter-rater concordance of the "Nursing Activities Score" in intensive care].

    Science.gov (United States)

    Valls-Matarín, Josefa; Salamero-Amorós, Maria; Roldán-Gil, Carmen; Quintana-Riera, Salvador

    2015-01-01

    To evaluate inter-rater concordance in scoring with the "Nursing Activities Score". Cross-sectional descriptive study conducted from December 2012 until June 2013 in a general intensive care unit with twelve beds. Three evaluator nurses, simultaneously and independently, through the patient daily charts, scored the nursing workload using the Nursing Activities Score scale in all admitted patients over 18 years old. Three hundred and thirty-nine records were collected. The intra-class correlation coefficient (ICC) between evaluators was 0.92 (0.89-0.94). A perfect concordance was obtained in 39.1% of the items, with 52.2% having a high, and 8.7% having lower concordance, corresponding to two of the items with multiple scoring options. Significant differences between two of the evaluators (P=.049) were found. Although the inter-rater concordance was high, more accurate records are needed to reduce the variability of the items with multiple options and to allow more accuracy in the interpretation and measurement of the data regarding nursing workload. Copyright © 2015 Elsevier España, S.L.U. All rights reserved.

  8. Predictors of parent-child agreement on child anxiety diagnoses on the ADIS-IV-C/P.

    Science.gov (United States)

    Hamblin, Rebecca J; Salloum, Alison; Andel, Ross; Nadeau, Joshua M; McBride, Nicole M; Lewin, Adam B; Storch, Eric A

    2016-11-30

    Diagnostic agreement between parents' and children's reports on children's anxiety problems is notoriously poor; however, very few investigations have examined specific predictors of inter-rater agreement on child anxiety diagnoses. This study examined predictors of categories of parent and child diagnostic endorsement on the Anxiety Disorders Interview Schedule for Children-IV. One hundred eight children (ages 7-13) and their parents completed structured diagnostic interviews for non-OCD/PTSD anxiety diagnoses and paper and pencil measures of functioning and impairment in a variety of domains. Parent-child agreement was statistically significant for social phobia and separation anxiety disorder, but was overall poor for all anxiety diagnoses. Externalizing disorder status, family accommodation frequency, and child rated impairment in various domains differentially predicted informant discrepancies for different anxiety disorders. These data are among the first to suggest variables that may explain parent-child concordance. Copyright © 2016. Published by Elsevier Ireland Ltd.

  9. Inter-rater reliability and stability of diagnoses of autism spectrum disorder in children identified through screening at a very young age

    NARCIS (Netherlands)

    van Daalen, Emma; Kemner, Chantal; Dietz, Claudine; Swinkels, Sophie H. N.; Buitelaar, Jan K.; van Engeland, Herman

    2009-01-01

    To examine the inter-rater reliability and stability of autism spectrum disorder (ASD) diagnoses made at a very early age in children identified through a screening procedure around 14 months of age. In a prospective design, preschoolers were recruited from a screening study for ASD. The inter-rater

  10. Inter-rater reliability and stability of diagnoses of autism spectrum disorder in children identified through screening at a very young age.

    NARCIS (Netherlands)

    Daalen, E. van; Kemner, C.; Dietz, C.; Swinkels, S.H.N.; Buitelaar, J.K.; Engeland, H.M. van

    2009-01-01

    To examine the inter-rater reliability and stability of autism spectrum disorder (ASD) diagnoses made at a very early age in children identified through a screening procedure around 14 months of age. In a prospective design, preschoolers were recruited from a screening study for ASD. The inter-rater

  11. Agreement between T2 and haste sequences in the evaluation of thoracolumbar intervertebral disc disease in dogs.

    Science.gov (United States)

    Mankin, Joseph M; Hecht, Silke; Thomas, William B

    2012-01-01

    The purpose of this study was to compare half-Fourier-acquisition single-shot turbo spin-echo (HASTE) and T2-weighted (T2-W) sequences in dogs with thoracolumbar disc extrusion. MRI studies in 60 dogs (767 individual intervertebral disc spaces) were evaluated. Agreement between T2-W and HASTE sequences was assessed for two criteria: presence of an extradural lesion and treatment recommendation. There was moderate agreement between T2-W and HASTE sequences as to presence of an extradural lesion (kappa = 0.575). HASTE was in agreement in 96.1% of the sites where no extradural lesion was identified on T2-W images, but only in 58.1% of the sites where extradural lesions were identified on T2-W images. There was also moderate agreement between T2-W and HASTE sequences as to treatment recommendations (kappa = 0.476). HASTE was in agreement in 98.4% of the sites where a lesion was considered nonsurgical on T2 but in only 82.1% of the sites where a lesion was considered surgical on T2. In 1.0% of sites considered not surgical and in 9.8% of sites considered equivocal based on T2-W images, a surgical lesion was identified on HASTE. Acquisition of a HASTE sequence in addition to conventional sequences may be beneficial in determining the severity of spinal cord compression in some cases when evaluating the canine spine.
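
    Several records in this list report Cohen's kappa for two readings of the same sites. A minimal sketch of that statistic follows, assuming paired categorical labels; the function name `cohen_kappa` and the example labels are hypothetical and do not reproduce any study's data.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Unweighted Cohen's kappa for two paired sequences of category labels."""
    n = len(labels_a)
    # Observed agreement: share of items given the same label by both readings.
    p_obs = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of the marginal proportions, summed over categories.
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    categories = set(labels_a) | set(labels_b)
    p_exp = sum(count_a[c] * count_b[c] for c in categories) / n ** 2
    # Undefined (division by zero) when both readings use a single identical label.
    return (p_obs - p_exp) / (1 - p_exp)


# Hypothetical example: lesion present (1) or absent (0) at eight sites.
sequence_1 = [1, 1, 1, 0, 0, 0, 0, 0]
sequence_2 = [1, 1, 0, 0, 0, 0, 0, 1]
print(round(cohen_kappa(sequence_1, sequence_2), 3))
```

    Kappa discounts the agreement expected by chance from the marginal frequencies, which is why raw percent agreement and kappa can tell quite different stories for the same data.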

  12. Nurses assessing pain with the Nociception Coma Scale: interrater reliability and validity

    NARCIS (Netherlands)

    Vink, Peter; Eskes, Anne Maria; Lindeboom, Robert; van den Munckhof, Pepijn; Vermeulen, Hester

    2014-01-01

    The Nociception Coma Scale (NCS) is a pain observation tool, developed for patients with disorders of consciousness (DOC) due to acquired brain injury (ABI). The aim of this study was to assess the interrater reliability of the NCS and NCS-R among nurses for the assessment of pain in ABI patients

  13. An Evaluation by Midwives and Gynecologists of Treatability of Cervical Lesions by Cryotherapy Among Human Papillomavirus–Positive Women

    Science.gov (United States)

    Gage, Julia C.; Rodriguez, Ana Cecilia; Schiffman, Mark; Adadevoh, Sydney; Alvarez Larraondo, Manuel J.; Chumworathayi, Bandit; Lejarza, Sandra Vargas; Araya, Luis Villegas; Garcia, Francisco; Budihas, Scott R.; Long, Rodney; Katki, Hormuzd A.; Herrero, Rolando; Burk, Robert D.; Jeronimo, Jose

    2010-01-01

    Objectives: To estimate efficacy of a visual triage of human papillomavirus (HPV)-positive women to either immediate cryotherapy or referral if not treatable (eg, invasive cancer, large precancers). Methods: We evaluated visual triage in the HPV-positive women aged 25 to 55 years from the 10,000-woman Guanacaste Cohort Study (n = 552). Twelve Peruvian midwives and 5 international gynecologists assessed treatability by cryotherapy using digitized high-resolution cervical images taken at enrollment. The reference standard of treatability was determined by 2 lead gynecologists from the entire 7-year follow-up of the women. Women diagnosed with histologic cervical intraepithelial neoplasia grade 2 or worse or 5-year persistence of carcinogenic HPV infection were defined as needing treatment. Results: Midwives and gynecologists judged 30.8% and 41.2% of women not treatable by cryotherapy, respectively (P cryotherapy. The proportion of women judged not treatable by a reviewer varied widely and ranged from 18.6% to 61.1%. Interrater agreement was poor with mean pairwise overall agreement of 71.4% and 66.3% and κ values of 0.33 and 0.30 for midwives and gynecologists, respectively. Conclusions: In future “screen-and-treat” cervical cancer prevention programs using HPV testing and cryotherapy, practitioners will visually triage HPV-positive women. The suboptimal performance of visual triage suggests that screen-and-treat programs using cryotherapy might be insufficient for treating precancerous lesions. Improved, low-technology triage methods and/or improved safe and low-technology treatment options are needed. PMID:19509579

  14. Intra- and interrater reliability of the 'lumbar-locked thoracic rotation test' in competitive swimmers ages 10 through 18 years.

    Science.gov (United States)

    Feijen, Stef; Kuppens, Kevin; Tate, Angela; Baert, Isabel; Struyf, Thomas; Struyf, Filip

    2018-04-17

    Measuring thoracic spine mobility can be of interest to competitive swimmers as it has been associated with shoulder girdle function and scapular position in subjects with and without shoulder pain. At present, no reliability data of thoracic spine mobility measurements are available in the swimming population. This study aims to evaluate the within-session intra- and interrater reliability of the "lumbar-locked rotation test" for thoracic spine rotation in competitive swimmers aged 10 to 18 years. This reliability study is part of a larger prospective cohort study investigating potential risk factors for the development of shoulder pain in competitive swimmers. Within-session, intra- and inter-rater reliability. Competitive swimming clubs in Belgium. 21 competitive swimmers. Intra- and inter-rater reliability of the lumbar-locked thoracic rotation test. Intraclass correlation coefficients (ICCs) ranged from 0.91 (95% CI 0.78 to 0.96) to 0.96 (0.89-0.98) for intra-rater reliability. Inter-rater reliability was 0.89 (0.72-0.95) and 0.86 (0.65-0.94) for right and left thoracic rotation, respectively. Results suggest good to excellent reliability of the lumbar-locked thoracic rotation test, indicating this test can be used reliably in clinical practice. Copyright © 2018 Elsevier Ltd. All rights reserved.

  15. Inter-rater reliability of a modified version of Delitto et al.’s classification-based system for low back pain : a pilot study

    NARCIS (Netherlands)

    Apeldoorn, Adri T.; van Helvoirt, Hans; Ostelo, Raymond W.; Meihuizen, Hanneke; Kamper, Steven J.; van Tulder, Maurits W.; de Vet, Henrica C W

    2016-01-01

    Study design: Observational inter-rater reliability study. Objectives: To examine: (1) the inter-rater reliability of a modified version of Delitto et al.’s classification-based algorithm for patients with low back pain; (2) the influence of different levels of familiarity with the system; and (3)

  16. Quality of the Critical Incident Technique in practice: Interrater reliability and users' acceptance under real conditions

    Directory of Open Access Journals (Sweden)

    ANNA KOCH

    2009-03-01

    The Critical Incident Technique (CIT) is a widely used task analysis method in personnel psychology. While studies on psychometric properties of the CIT so far primarily took into account relevance ratings of task-lists or attributes, and hence, only a smaller or adapted part of the CIT, little is known about the psychometric properties of the complete CIT in its most meaningful and fruitful way. Therefore, the aim of the present study was to assess interrater reliability and the participants’ view of the CIT under real conditions and especially to provide data for the key step of the CIT: the classification of behavior descriptions into requirements. Additionally, the cost-benefit-ratio and practicability were rated from the participants’ views as an important indicator for the acceptance of the task analysis approach in practice. Instructors of German Institutions for Statutory Accidents Insurance and Prevention as well as their supervisors took part in a job analysis with the CIT. Moderate interrater reliability for the relevance rating was found while the classification step yielded unexpectedly low coefficients for interrater reliability. The cost-benefit-ratio and practicability of the complete CIT were rated very positively. The results are discussed in relation to determinants that facilitate or impede the application of task analysis procedures.

  17. Interrater reliability of the mind map assessment rubric in a cohort of medical students

    Directory of Open Access Journals (Sweden)

    Zipp Genevieve

    2009-04-01

    Background: Learning strategies are thinking tools that students can use to actively acquire information. Examples of learning strategies include mnemonics, charts, and maps. One strategy that may help students master the tsunami of information presented in medical school is the mind map learning strategy. Currently, there is no valid and reliable rubric to grade mind maps and this may contribute to their underutilization in medicine. Because concept maps and mind maps engage learners similarly at a metacognitive level, a valid and reliable concept map assessment scoring system was adapted to form the mind map assessment rubric (MMAR). The MMAR can assess mind map depth based upon concept-links, cross-links, hierarchies, examples, pictures, and colors. The purpose of this study was to examine interrater reliability of the MMAR. Methods: This exploratory study was conducted at a US medical school as part of a larger investigation on learning strategies. Sixty-six (N = 66) first-year medical students were given a 394-word text passage followed by a 30-minute presentation on mind mapping. After the presentation, subjects were again given the text passage and instructed to create mind maps based upon the passage. The mind maps were collected and independently scored using the MMAR by 3 examiners. Interrater reliability was measured using the intraclass correlation coefficient (ICC) statistic. Statistics were calculated using SPSS version 12.0 (Chicago, IL). Results: Analysis of the mind maps revealed the following: concept-links ICC = .05 (95% CI, -.42 to .38), cross-links ICC = .58 (95% CI, .37 to .73), hierarchies ICC = .23 (95% CI, -.15 to .50), examples ICC = .53 (95% CI, .29 to .69), pictures ICC = .86 (95% CI, .79 to .91), colors ICC = .73 (95% CI, .59 to .82), and total score ICC = .86 (95% CI, .79 to .91). Conclusion: The high ICC value for total mind map score indicates strong MMAR interrater reliability.
    Pictures and colors demonstrated moderate

  18. Interrater reliability of the mind map assessment rubric in a cohort of medical students.

    Science.gov (United States)

    D'Antoni, Anthony V; Zipp, Genevieve Pinto; Olson, Valerie G

    2009-04-28

    Learning strategies are thinking tools that students can use to actively acquire information. Examples of learning strategies include mnemonics, charts, and maps. One strategy that may help students master the tsunami of information presented in medical school is the mind map learning strategy. Currently, there is no valid and reliable rubric to grade mind maps and this may contribute to their underutilization in medicine. Because concept maps and mind maps engage learners similarly at a metacognitive level, a valid and reliable concept map assessment scoring system was adapted to form the mind map assessment rubric (MMAR). The MMAR can assess mind map depth based upon concept-links, cross-links, hierarchies, examples, pictures, and colors. The purpose of this study was to examine interrater reliability of the MMAR. This exploratory study was conducted at a US medical school as part of a larger investigation on learning strategies. Sixty-six (N = 66) first-year medical students were given a 394-word text passage followed by a 30-minute presentation on mind mapping. After the presentation, subjects were again given the text passage and instructed to create mind maps based upon the passage. The mind maps were collected and independently scored using the MMAR by 3 examiners. Interrater reliability was measured using the intraclass correlation coefficient (ICC) statistic. Statistics were calculated using SPSS version 12.0 (Chicago, IL). Analysis of the mind maps revealed the following: concept-links ICC = .05 (95% CI, -.42 to .38), cross-links ICC = .58 (95% CI, .37 to .73), hierarchies ICC = .23 (95% CI, -.15 to .50), examples ICC = .53 (95% CI, .29 to .69), pictures ICC = .86 (95% CI, .79 to .91), colors ICC = .73 (95% CI, .59 to .82), and total score ICC = .86 (95% CI, .79 to .91). The high ICC value for total mind map score indicates strong MMAR interrater reliability. Pictures and colors demonstrated moderate to strong interrater reliability. 
    We conclude that the

  19. Inter-rater Reliability of the Dysphagia Outcome and Severity Scale (DOSS): Effects of Clinical Experience, Audio-Recording and Training.

    Science.gov (United States)

    Zarkada, Angeliki; Regan, Julie

    2017-10-19

    The Dysphagia Outcome and Severity Scale (DOSS) is widely used to measure dysphagia severity based on videofluoroscopy (VFSS). This study investigated inter-rater reliability (IRR) of the DOSS. It also determined the effect of clinical experience, VFSS audio-recording and training on DOSS IRR. A quantitative prospective research design was used. Seventeen speech and language pathologists (SLPs) were recruited from an acute teaching hospital, Dublin (> 3 years' VFSS experience, n = 10) and from a postgraduate dysphagia programme in a university setting (training session on DOSS rating after which DOSS IRR was re-tested. Cohen's kappa coefficient was used to establish IRR. IRR of the DOSS presented only fair agreement (κ = 0.36, p training (κ = 0.328) was significantly better compared with post-training (κ = 0.218) (p < 0.05). Findings raise concerns as the DOSS is frequently used in clinical practice to capture dysphagia severity and to monitor changes.

  20. IRR (Inter-Rater Reliability) of a COP (Classroom Observation Protocol)--A Critical Appraisal

    Science.gov (United States)

    Rui, Ning; Feldman, Jill M.

    2012-01-01

    Notwithstanding broad utility of COPs (classroom observation protocols), there has been limited documentation of the psychometric properties of even the most popular COPs. This study attempted to fill this void by closely examining the item and domain-level IRR (inter-rater reliability) of a COP that was used in a federally funded striving readers…

  1. Interobserver and intraobserver agreement of ligamentous injuries on conventional MRI after simple elbow dislocation

    Directory of Open Access Journals (Sweden)

    Marc Schnetzke

    2017-02-01

    Background: The primary objective of this study was to assess the interobserver and intraobserver agreement on ligamentous injuries on conventional magnetic resonance imaging (MRI) in acute simple elbow dislocation. The secondary objectives were to determine the interobserver agreement on the assessment of joint congruity, joint effusion, loose bodies and chondral lesions on conventional MRI. Methods: Conventional MRIs (1.5 Tesla, elbow-specific surface coil) of 30 patients (40.7 years; range 14–72) with simple elbow dislocations were evaluated by four blinded examiners. An analysis of the interobserver agreement of all raters and for several subgroups (radiologists, orthopaedics, experienced, non-experienced) was performed. The examiners assessed the integrity (intact, partial tear, complete tear) of the lateral collateral ligament (LCL), medial collateral ligament (MCL), extensor and flexor tendons, as well as the presence of joint congruity, joint effusion, loose bodies and chondral lesions. Agreement strength, correlation and proportion of exact agreement were determined for the interobserver and intraobserver agreement analyses. Results: Interobserver agreement of all examiners was fair to moderate for collateral ligaments (LCL: 0.441, MCL: 0.275). Exact agreement of all raters was found in 33.3% for the LCL and in 26.7% for the MCL. The two experienced examiners showed the highest agreement strength for the LCL (0.619) and the radiologists showed the highest agreement strength for the MCL (0.627); the proportion of exact agreement was 60.0% in both categories. A high proportion of exact agreement regarding joint congruity (90%), joint effusion (100%), loose bodies (96.7%) and chondral lesion (80%) was found among the radiologists. The evaluation of the intraobserver agreement revealed slight to substantial agreement (0.227 to 0.718) for the collateral ligaments. Conclusions: This study shows difficulties in the evaluation of ligaments by

  2. Inter-rater reliability of the German version of the Nurses' Global Assessment of Suicide Risk scale.

    Science.gov (United States)

    Kozel, Bernd; Grieser, Manuela; Abderhalden, Christoph; Cutcliffe, John R

    2016-10-01

    In comparison to the general population, the suicide rates of psychiatric inpatient populations in Germany and Switzerland are very high. An important preventive contribution to the lowering of the suicide rates in mental health care is to ensure that the risk of suicide of psychiatric inpatients is assessed as accurately as possible. While risk-assessment instruments can serve an important function in determining such risk, very few have been translated into German. Therefore, in the present study, we reported on the German version of Nurses' Global Assessment of Suicide Risk (NGASR) scale. After translating the original instrument into German and pretesting the German version, we tested the inter-rater reliability of the instrument. Twelve video case studies were evaluated by 13 raters with the NGASR scale in a 'laboratory' trial. In each case, the observer's agreement was calculated for the single items, the overall scale, the risk levels, and the sum scores. The statistical data analysis was conducted with kappa and AC1 statistics for dichotomous (items, scale) scales. A high-to-very high observers' agreement (AC1: 0.62-1.00, kappa: 0.00-1.00) was determined for 16 items of the German version of the NGASR scale. We conclude that the German version of the NGASR scale is a reliable instrument for evaluating risk factors for suicide. A reliable application in clinical practice appears to be enhanced by training in the use of the instrument and the right implementation instructions. © 2016 Australian College of Mental Health Nurses Inc.
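
    The record above reports kappa values as low as 0.00 alongside AC1 values of at least 0.62. One way this can arise is when a rated trait is rare, so that chance-expected agreement under kappa approaches the observed agreement. The sketch below illustrates this for the simplified case of two raters and one dichotomous item (the study itself used 13 raters); the function name `kappa_and_ac1` and the counts are hypothetical.

```python
def kappa_and_ac1(both_yes, yes_no, no_yes, both_no):
    """Cohen's kappa and Gwet's AC1 for two raters on a yes/no item.

    Arguments are the four cell counts of the 2x2 agreement table:
    yes_no means rater 1 said yes and rater 2 said no, and so on.
    """
    n = both_yes + yes_no + no_yes + both_no
    p_obs = (both_yes + both_no) / n            # observed agreement
    p1 = (both_yes + yes_no) / n                # rater 1 "yes" proportion
    p2 = (both_yes + no_yes) / n                # rater 2 "yes" proportion

    # Kappa: chance agreement from the product of each rater's marginals.
    pe_kappa = p1 * p2 + (1 - p1) * (1 - p2)
    # AC1: chance agreement from the average "yes" prevalence.
    prevalence = (p1 + p2) / 2
    pe_ac1 = 2 * prevalence * (1 - prevalence)

    kappa = (p_obs - pe_kappa) / (1 - pe_kappa)
    ac1 = (p_obs - pe_ac1) / (1 - pe_ac1)
    return kappa, ac1


# Hypothetical table: 12 cases, the raters disagree on one and both say "no" on 11.
kappa, ac1 = kappa_and_ac1(both_yes=0, yes_no=1, no_yes=0, both_no=11)
print(f"kappa ~ {kappa:.3f}, AC1 ~ {ac1:.3f}")
```

    With these counts the raters agree on 11 of 12 cases, yet kappa is approximately zero while AC1 stays high, which mirrors the pattern of the reported ranges.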

  3. The Bath metrology index as assessed by a trained and an untrained rater in patients with spondylarthropathy: a study of intra- and inter-rater agreements

    DEFF Research Database (Denmark)

    Madsen, O R; Hansen, L B; Rytter, A

    2008-01-01

    The Bath ankylosing spondylitis metrology index (BASMI; range 0-10) has gained widespread use in daily clinical practice as an objective measure of spinal stiffness not only in patients with ankylosing spondylitis (AS) but also in patients with other spondylarthropathies (SpA). We examined intra-rater and inter-rater reproducibility of BASMI scoring in 30 Danish patients with SpA (median age 40 years, range 22-56 years) fulfilling the European Spondylarthropathy Study Group criteria, 25 of them satisfying the modified New York Criteria for AS. Measurements were performed twice on two different days (median interval 7 days, range 4-11) by a trained physiotherapist (PT) and by an untrained nurse who had undergone a single 1-h training session with the PT. The median BASMI score obtained by the PT on the two test days was 3.5 (range 1-8) and 3.0 (range 1-8), respectively (NS). Test-retest BASMI scores...

  4. OC10 - Inter-rater agreement of the Paediatric Early Warning Score tools used in the central Denmark region

    DEFF Research Database (Denmark)

    Jensen, Claus Sixtus; Aagaard, Hanne; Vebert Olesen, Hanne

    2016-01-01

    through simultaneous blinded PEWS assessment on the same patients by two nurses. Fleiss' kappa was utilized to determine the level of agreement among the raters. CONCLUSION: With a paucity of published reliability testing studies, this research attempts to address identified research gaps and will thus...

  5. Inter-rater reliability of data elements from a prototype of the Paul Coverdell National Acute Stroke Registry

    Directory of Open Access Journals (Sweden)

    Wehner Susan

    2008-06-01

    Background: The Paul Coverdell National Acute Stroke Registry (PCNASR) is a U.S.-based national registry designed to monitor and improve the quality of acute stroke care delivered by hospitals. The registry monitors care through specific performance measures, the accuracy of which depends in part on the reliability of the individual data elements used to construct them. This study describes the inter-rater reliability of data elements collected in Michigan's state-based prototype of the PCNASR. Methods: Over a 6-month period, 15 hospitals participating in the Michigan PCNASR prototype submitted data on 2566 acute stroke admissions. Trained hospital staff prospectively identified acute stroke admissions, abstracted chart information, and submitted data to the registry. At each hospital 8 randomly selected cases were re-abstracted by an experienced research nurse. Inter-rater reliability was estimated by the kappa statistic for nominal variables, and intraclass correlation coefficient (ICC) for ordinal and continuous variables. Factors that can negatively impact the kappa statistic (i.e., trait prevalence and rater bias) were also evaluated. Results: A total of 104 charts were available for re-abstraction. Excellent reliability (kappa or ICC > 0.75) was observed for many registry variables including age, gender, black race, hemorrhagic stroke, discharge medications, and modified Rankin Score. Agreement was at least moderate (i.e., 0.75 > kappa ≥ 0.40) for ischemic stroke, TIA, white race, non-ambulance arrival, hospital transfer and direct admit. However, several variables had poor reliability (kappa Conclusion: The excellent reliability of many of the data elements supports the use of the PCNASR to monitor and improve care. However, the poor reliability for several variables, particularly time-related events in the emergency department, indicates the need for concerted efforts to improve the quality of data collection.
    Specific recommendations

  6. Ultrasound assessment for grading structural tendon changes in supraspinatus tendinopathy: an inter-rater reliability study

    DEFF Research Database (Denmark)

    Ingwersen, Kim Gordon; Hjarbæk, John; Eshøj, Henrik

    2016-01-01

    Aim To evaluate the inter-rater reliability of measuring structural changes in the tendon of patients, clinically diagnosed with supraspinatus tendinopathy (cases) and healthy participants (controls), on ultrasound (US) images captured by standardised procedures. Methods A total of 40 participant...

  7. US-guided diffuse optical tomography for breast lesions: the reliability of clinical experience

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Min Jung; Kim, Ji Youn; Youn, Jung Hyun; Kim, Myung Hyun; Koo, Hye Ryoung; Kim, Soo Jin; Sohn, Yu-Mee; Moon, Hee Jung; Kim, Eun-Kyung [Yonsei University College of Medicine, Institute of Radiological Science, Seoul (Korea, Republic of); Yonsei University College of Medicine, Department of Radiology, Seoul (Korea, Republic of)

    2011-07-15

    To prospectively assess the reliability of US-guided diffuse optical tomography (US-DOT) using interobserver agreement for the diagnosis of breast lesions with individual real-time imaging and to assess the interobserver agreement of conventional sonography (US) combined with US-DOT for differentiation between benign and malignant breast lesions. An Institutional Review Board approved this study, and all subjects provided written informed consent. 122 breast lesions in 111 patients evaluated with US-guided core biopsy were included. Assessments with US and US-DOT for cases subjected to biopsy were obtained by two radiologists using individual real-time imaging prior to biopsy and were prospectively recorded by each performer. With DOT, the total haemoglobin concentration (THC) for each breast lesion was measured. Histopathological results from US-guided biopsies were used as a reference standard. To assess measurement interobserver agreement, the intraclass correlation coefficient (ICC) and the Bland-Altman plot were used for THC in US-DOT and the kappa values and ROC analysis were used to evaluate the diagnostic performances of the US BI-RADS final assessment in US and combined US and US-DOT. Of 122 US-guided core biopsied lesions, 83 (68.0%) were diagnosed as benign, and 39 (32.0%) as malignant. Excellent correlation was seen in the THC in US-DOT (ICC score 0.796; 95% confidence interval, 0.708-0.857). The interobserver agreement in BI-RADS final assessment with US and US-DOT (almost perfect; κ = 0.8618) was improved compared with that of US (substantial agreement, κ = 0.6574). However, the overall areas under the ROC curve did not show significant differences between US and combined US and US-DOT, 0.8894 and 0.8975, respectively (P = 0.981). The reliability of THC in US-DOT showed excellent correlation in overall real-time performance. Although the inter-observer agreement for BI-RADS final assessment of US was improved by using US-DOT, the

  8. Examination of anomalous self-experience in first-episode psychosis: interrater reliability

    DEFF Research Database (Denmark)

    Møller, Paul; Haug, Elisabeth; Raballo, Andrea

    2011-01-01

    -rater correlation above 0.80 (Spearman's rho, p values at an item level were very good in 9 items, good in 20 items, moderate in 11 items and fair in 4 items. Conclusion: The EASE provides a reliable and internally......) is a phenomenologically inspired checklist, specifically designed to support the comprehensive assessment of these characteristic subjective experiences. Aim: To assess the interrater reliability of the EASE. Sampling and Methods: Twenty-five first-episode psychosis (FEP) patients were interviewed with the EASE...

  9. Analyses of inter-rater reliability between professionals, medical students and trained school children as assessors of basic life support skills.

    Science.gov (United States)

    Beck, Stefanie; Ruhnke, Bjarne; Issleib, Malte; Daubmann, Anne; Harendza, Sigrid; Zöllner, Christian

    2016-10-07

    Training of lay-rescuers is essential to improve survival-rates after cardiac arrest. Multiple campaigns emphasise the importance of basic life support (BLS) training for school children. Trainings require a valid assessment to give feedback to school children and to compare the outcomes of different training formats. Considering these requirements, we developed an assessment of BLS skills using MiniAnne and tested the inter-rater reliability between professionals, medical students and trained school children as assessors. Fifteen professional assessors, 10 medical students and 111 trained school children (peers) assessed 1087 school children at the end of a CPR-training event using the new assessment format. Analyses of inter-rater reliability (intraclass correlation coefficient; ICC) were performed. Overall inter-rater reliability of the summative assessment was high (ICC = 0.84, 95% CI: 0.84 to 0.86, n = 889). The number of comparisons between peer-peer assessors (n = 303), peer-professional assessors (n = 339), and peer-student assessors (n = 191) was adequate to demonstrate high inter-rater reliability between peer- and professional-assessors (ICC: 0.76), peer- and student-assessors (ICC: 0.88) and peer- and other peer-assessors (ICC: 0.91). Systematic variation in rating of specific items was observed for three items between professional- and peer-assessors. Using this assessment and integrating peers and medical students as assessors gives the opportunity to assess hands-on skills of school children with high reliability.

  10. Method of Quantifying Size of Retinal Hemorrhages in Eyes with Branch Retinal Vein Occlusion Using 14-Square Grid: Interrater and Intrarater Reliability

    Directory of Open Access Journals (Sweden)

    Yuko Takashima

    2016-01-01

    Full Text Available Purpose. To describe a method of quantifying the size of the retinal hemorrhages in branch retinal vein occlusion (BRVO) and to determine the interrater and intrarater reliabilities of these measurements. Methods. Thirty-five fundus photographs from 35 consecutive eyes with BRVO were studied. The fundus images were analyzed with PowerPoint® software, and a grid of 14 squares was laid over the fundus image. Raters were asked to judge the percentage of each of the 14 squares that was covered by the hemorrhages, and the average of the 14 squares was taken to be the relative size of the retinal hemorrhage. Results. Interrater reliability between three raters was higher when a grid with 14 squares was used (intraclass correlation coefficient (ICC), 0.96) than that when a box with no grid was used (ICC, 0.78). Intrarater reliability, which was calculated by the retinal hemorrhage area measured on two different days, was also higher (ICC, 0.97) than that with no grid (ICC, 0.86). Interrater reliability for five fundus pictures with poor image quality was also good when a grid with 14 squares was used (ICC, 0.88). Conclusions. Although our method is subjective, excellent interrater and intrarater reliabilities indicate that this method can be adapted for clinical use.
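
The scoring rule described in the abstract above (average the rater's coverage estimate over the 14 grid squares) can be sketched as follows. This is an illustration only; the function name `relative_hemorrhage_size` and the sample percentages are hypothetical and not taken from the study.

```python
def relative_hemorrhage_size(square_percentages):
    """Average the per-square coverage estimates (0-100) over a 14-square grid."""
    if len(square_percentages) != 14:
        raise ValueError("expected one percentage for each of the 14 squares")
    if any(p < 0 or p > 100 for p in square_percentages):
        raise ValueError("each percentage must lie between 0 and 100")
    return sum(square_percentages) / 14


# Hypothetical rater judgments: hemorrhage confined to four squares.
rater_estimates = [0] * 10 + [50, 50, 100, 80]
print(relative_hemorrhage_size(rater_estimates))
```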

  11. Intrarater and interrater reliability of pulse examination in traditional Indian Ayurvedic medicine.

    Science.gov (United States)

    Kurande, Vrinda; Waagepetersen, Rasmus; Toft, Egon; Prasad, Ramjee

    2013-09-01

    In Ayurveda, pulse examination (nadipariksha) is an important tool to assess the status of three doshas: vata, pitta, and kapha. Long historical use has been seen as a documentation of its efficacy; however, there is a lack of a quantitative measure of the reliability of the pulse examination method. The objective of this study was to test the intrarater and interrater reliability of pulse examination in Ayurveda. Fifteen registered Ayurvedic doctors with 3-15 years of experience examined the pulse of 20 healthy volunteers twice, for a total of 600 examinations. The examinations were performed blind and in a random order. Only the current status of dosha-specific methods of pulse examination were considered. Cohen's weighted κ statistic was used as a measure of intrarater and interrater reliability, and a hypothesis of homogeneous diagnosis (random rating) was tested. Following this, we tested whether proportions of ratings were equal between doctors. According to the Landis and Koch scale, the level of reliability ranged from poor to moderate. It was observed that the doctors more frequently diagnosed a combination of two doshas than a single dosha. The κ values were generally larger for experienced doctors (p = 0.04). Experience and proper training have important roles in pulse examination.
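
The abstract above reports Cohen's weighted κ without stating the weighting scheme. As a rough sketch, the statistic for two raters can be computed as below, assuming linear disagreement weights and categories coded 0 to k-1. The function name `weighted_kappa` and the example ratings are hypothetical.

```python
def weighted_kappa(ratings_1, ratings_2, n_categories):
    """Cohen's weighted kappa with linear weights (an assumed weighting scheme).

    Ratings are integers in range(n_categories); the two lists are paired by subject.
    """
    n = len(ratings_1)
    k = n_categories
    # Observed proportion for every (rater 1 category, rater 2 category) cell.
    observed = [[0.0] * k for _ in range(k)]
    for x, y in zip(ratings_1, ratings_2):
        observed[x][y] += 1 / n
    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]
    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(k):
        for j in range(k):
            weight = abs(i - j) / (k - 1)            # 0 on the diagonal, 1 at the extremes
            weighted_observed += weight * observed[i][j]
            weighted_expected += weight * row[i] * col[j]
    return 1 - weighted_observed / weighted_expected


# Hypothetical ratings of six subjects on a three-level scale.
print(weighted_kappa([0, 0, 1, 1, 2, 2], [0, 1, 1, 2, 2, 1], 3))
```

Linear weights penalise a two-step disagreement twice as much as a one-step disagreement; quadratic weights are a common alternative and would give different values.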

  12. Interrater reliability of Violence Risk Appraisal Guide scores provided in Canadian criminal proceedings.

    Science.gov (United States)

    Edens, John F; Penson, Brittany N; Ruchensky, Jared R; Cox, Jennifer; Smith, Shannon Toney

    2016-12-01

    Published research suggests that most violence risk assessment tools have relatively high levels of interrater reliability, but recent evidence of inconsistent scores among forensic examiners in adversarial settings raises concerns about the "field reliability" of such measures. This study specifically examined the reliability of Violence Risk Appraisal Guide (VRAG) scores in Canadian criminal cases identified in the legal database, LexisNexis. Over 250 reported cases were located that made mention of the VRAG, with 42 of these cases containing 2 or more scores that could be submitted to interrater reliability analyses. Overall, scores were skewed toward higher risk categories. The intraclass correlation (ICC(A,1)) was .66, with pairs of forensic examiners placing defendants into the same VRAG risk "bin" in 68% of the cases. For categorical risk statements (i.e., low, moderate, high), examiners provided converging assessment results in most instances (86%). In terms of potential predictors of rater disagreement, there was no evidence for adversarial allegiance in our sample. Rater disagreement in the scoring of 1 VRAG item (Psychopathy Checklist-Revised; Hare, 2003), however, strongly predicted rater disagreement in the scoring of the VRAG (r = .58). (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  13. Lesion detection and quantitation of positron emission mammography

    International Nuclear Information System (INIS)

    Qi, Jinyi; Huesman, Ronald H.

    2001-01-01

    A Positron Emission Mammography (PEM) scanner dedicated to breast imaging is being developed at our laboratory. We have developed a list mode likelihood reconstruction algorithm for this scanner. Here we theoretically study the lesion detection and quantitation. The lesion detectability is studied theoretically using computer observers. We found that for the zero-order quadratic prior, the region of interest observer can achieve the performance of the prewhitening observer with a properly selected smoothing parameter. We also study the lesion quantitation using the test statistic of the region of interest observer. The theoretical expressions for the bias, variance, and ensemble mean squared error of the quantitation are derived. Computer simulations show that the theoretical predictions are in good agreement with the Monte Carlo results for both lesion detection and quantitation

  14. Reliability of one-repetition maximum performance in people with chronic heart failure.

    Science.gov (United States)

    Ellis, Rachel; Holland, Anne E; Dodd, Karen; Shields, Nora

    2018-02-24

    Evaluate intra-rater and inter-rater reliability of the one-repetition maximum strength test in people with chronic heart failure. Intra-rater and inter-rater reliability study. A public tertiary hospital in northern metropolitan Melbourne. Twenty-four participants (nine female, mean age 71.8 ± 13.1 years) with mild to moderate heart failure of any aetiology. Lower limb strength was assessed by determining the maximum weight that could be lifted using a leg press. Intra-rater reliability was tested by one assessor on two separate occasions. Inter-rater reliability was tested by two assessors in random order. Intra-class correlation coefficients and 95% confidence intervals were calculated. Bland and Altman analyses were also conducted, including calculation of mean differences between measures and limits of agreement. Ten intra-rater and 21 inter-rater assessments were completed. Excellent intra-rater (intra-class correlation coefficient (2,1) = 0.96) and inter-rater (intra-class correlation coefficient (2,1) = 0.93) reliability was found. Intra-rater assessment showed less variability (mean difference 4.5 kg, limits of agreement -8.11 to 17.11 kg) than inter-rater agreement (mean difference -3.81 kg, limits of agreement -23.39 to 15.77 kg). One-repetition maximum determined using a leg press is a reliable measure in people with heart failure. Given its smaller limits of agreement, intra-rater testing is recommended. Implications for Rehabilitation Using a leg press to determine a one-repetition maximum we were able to demonstrate excellent inter-rater and intra-rater reliability using an intra-class correlation coefficient. The Bland and Altman levels of agreement were wide for inter-rater reliability and so we recommend using one assessor if measuring change in strength within an individual over time.
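
The two statistics named in the abstract above can be sketched as follows. This is a minimal illustration, assuming the standard two-way random-effects, absolute-agreement, single-measure form for ICC(2,1) and the usual mean difference ± 1.96 SD for the Bland and Altman limits of agreement. The function names and all data values are hypothetical and are not the study's measurements.

```python
from statistics import mean, stdev


def icc_2_1(scores):
    """ICC(2,1) from a subjects-by-raters table (list of equal-length rows)."""
    n, k = len(scores), len(scores[0])
    grand = mean(x for row in scores for x in row)
    row_means = [mean(row) for row in scores]
    col_means = [mean(row[j] for row in scores) for j in range(k)]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)   # between subjects
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)   # between raters
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


def limits_of_agreement(first, second):
    """Bland and Altman mean difference with 95% limits of agreement."""
    diffs = [a - b for a, b in zip(first, second)]
    centre = mean(diffs)
    half_width = 1.96 * stdev(diffs)          # sample standard deviation
    return centre, centre - half_width, centre + half_width


# Hypothetical ratings: six subjects scored by four raters.
ratings = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
print(icc_2_1(ratings))

# Hypothetical paired leg-press loads (kg) from two assessors.
print(limits_of_agreement([10, 12, 14, 16], [9, 12, 13, 18]))
```

Because the limits are symmetric about the mean difference, the reported intra-rater limits (-8.11 to 17.11 kg) sit 12.61 kg either side of the 4.5 kg mean difference, and the inter-rater limits sit 19.58 kg either side of -3.81 kg.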

  15. Inter-rater reliability in the classification of supraspinatus tendon tears using 3D ultrasound – a question of experience?

    Directory of Open Access Journals (Sweden)

    Giorgio Tamborrini

    2016-09-01

    Full Text Available Background: Three-dimensional (3D) ultrasound of the shoulder is characterized by a comparable accuracy to two-dimensional (2D) ultrasound. No studies investigating 2D versus 3D inter-rater reliability in the detection of supraspinatus tendon tears taking into account the level of experience of the raters have been carried out so far. Objectives: The aim of this study was to determine the inter-rater reliability in the analysis of 3D ultrasound image sets of the supraspinatus tendon between sonographers with different levels of experience. Patients and methods: Non-interventional, prospective, observational pilot study of 2309 images of 127 adult patients suffering from unilateral shoulder pain. 3D ultrasound image sets were scored by three raters independently. The intra- and inter-rater reliabilities were calculated. Results: There was an excellent intra-rater reliability of rater A in the overall classification of supraspinatus tendon tears (2D vs 3D κ = 0.892, pairwise reliability 93.81%; 3D scoring round 1 vs 3D scoring round 2 κ = 0.875, pairwise reliability 92.857%). The inter-rater reliability was only moderate compared to rater B on 3D (κ = 0.497, pairwise reliability 70.95%) and fair compared to rater C (κ = 0.238, pairwise reliability 42.38%). Conclusions: The reliability of 3D ultrasound of the supraspinatus tendon depends on the level of experience of the sonographer. Experience in 2D ultrasound does not seem to be sufficient for the analysis of 3D ultrasound imaging sets. Therefore, for a 3D ultrasound analysis new diagnostic criteria have to be established and taught even to experienced 2D sonographers to improve reproducibility.

  16. Inter-rater reliability of nursing home quality indicators in the U.S

    Directory of Open Access Journals (Sweden)

    Roy Jason

    2003-11-01

    Full Text Available Abstract Background In the US, Quality Indicators (QI's) profiling and comparing the performance of hospitals, health plans, nursing homes and physicians are routinely published for consumer review. We report the results of the largest study of inter-rater reliability done on nursing home assessments which generate the data used to derive publicly reported nursing home quality indicators. Methods We sampled nursing homes in 6 states, selecting up to 30 residents per facility who were observed and assessed by research nurses on 100 clinical assessment elements contained in the Minimum Data Set (MDS) and compared these with the most recent assessment in the record done by facility nurses. Kappa statistics were generated for all data items and derived for 22 QI's over the entire sample and for each facility. Finally, facilities with many QI's with poor Kappa levels were compared to those with many QI's with excellent Kappa levels on selected characteristics. Results A total of 462 facilities in 6 states were approached and 219 agreed to participate, yielding a response rate of 47.4%. A total of 5758 residents were included in the inter-rater reliability analyses, around 27.5 per facility. Patients resembled the traditional nursing home resident; only 43.9% were continent of urine and only 25.2% were rated as likely to be discharged within the next 30 days. Results of resident level comparative analyses reveal high inter-rater reliability levels (most items >.75). Using the research nurses as the "gold standard", we compared composite quality indicators based on their ratings with those based on facility nurses. All but two QI's have adequate Kappa levels and 4 QI's have average Kappa values in excess of .80. We found that 16% of participating facilities performed poorly (Kappa .75 on 12 or more QI's. No facility characteristics were related to reliability of the data on which QI's are based. 
Conclusion While a few QI's being used for public reporting

  17. The Effect of Instrument-Specific Rater Training on Interrater Reliability and Counseling Skills Performance Differentiation

    Science.gov (United States)

    Meacham, Paul Douglas, Jr.

    2013-01-01

    The purpose of this study was to explore the effect of instrument-specific rater training on interrater reliability (IRR) and counseling skills performance differentiation. Strong IRR is of primary concern to effective program evaluation (McCullough, Kuhn, Andrews, Valen, Hatch, & Osimo, 2003; Schanche, Nielsen, McCullough, Valen, &…

  18. Inter-rater reliability and stability of diagnoses of autism spectrum disorder in children identified through screening at a very young age.

    Science.gov (United States)

    van Daalen, Emma; Kemner, Chantal; Dietz, Claudine; Swinkels, Sophie H N; Buitelaar, Jan K; van Engeland, Herman

    2009-11-01

    To examine the inter-rater reliability and stability of autism spectrum disorder (ASD) diagnoses made at a very early age in children identified through a screening procedure around 14 months of age. In a prospective design, preschoolers were recruited from a screening study for ASD. The inter-rater reliability of the diagnosis of ASD was measured through an independent assessment of a randomly selected subsample of 38 patients by two other psychiatrists. The diagnoses at 23 months and 42 months of 131 patients, based on the clinical assessment and the diagnostic classifications of standardised instruments, were compared to evaluate stability of the diagnosis of ASD. Inter-rater reliability on a diagnosis of ASD versus non-ASD at 23 months was 87% with a weighted kappa of 0.74 (SE 0.11). The stability of the different diagnoses in the autism spectrum was 63% for autistic disorder, 54% for pervasive developmental disorder, not otherwise specified (PDD-NOS), and 91% for the whole category of ASD. Most diagnostic changes at 42 months were within the autism spectrum from autistic disorder to PDD-NOS and were mainly due to diminished symptom severity. Children who moved outside the ASD category at 42 months made significantly larger gains in cognitive and language skills than children with a stable ASD diagnosis. In conclusion, the inter-rater reliability and stability of the diagnoses of ASD established at 23 months in this population-based sample of very young children are good.

  19. Interrater Reliability of the Power Mobility Road Test in the Virtual Reality-Based Simulator-2.

    Science.gov (United States)

    Kamaraj, Deepan C; Dicianno, Brad E; Mahajan, Harshal P; Buhari, Alhaji M; Cooper, Rory A

    2016-07-01

    To assess interrater reliability of the Power Mobility Road Test (PMRT) when administered through the Virtual Reality-based SIMulator-version 2 (VRSIM-2). Within-subjects repeated-measures design. Participants interacted with VRSIM-2 through 2 display options (desktop monitor vs immersive virtual reality screens) using 2 control interfaces (roller system vs conventional movement-sensing joystick), providing 4 different driving scenarios (driving conditions 1-4). Participants performed 3 virtual driving sessions for each of the 2 display screens and 1 session through a real-world driving course (driving condition 5). The virtual PMRT was conducted in a simulated indoor office space, and an equivalent course was charted in an open space for the real-world assessment. After every change in driving condition, participants completed a self-reported workload assessment questionnaire, the Task Load Index, developed by the National Aeronautics and Space Administration. A convenience sample of electric-powered wheelchair (EPW) athletes (N=21) recruited at the 31st National Veterans Wheelchair Games. Not applicable. Total composite PMRT score. The PMRT had high interrater reliability (intraclass correlation coefficient [ICC]>.75) between the 2 raters in all 5 driving conditions. Post hoc analyses revealed that the reliability analyses had >80% power to detect high ICCs in driving conditions 1 and 4. The PMRT has high interrater reliability in conditions 1 and 4 and could be used to assess EPW driving performance virtually in VRSIM-2. However, further psychometric assessment is necessary to assess the feasibility of administering the PMRT using the different interfaces of VRSIM-2. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.

  20. Development of standardized image interpretation for 68Ga-PSMA PET/CT to detect prostate cancer recurrent lesions

    Energy Technology Data Exchange (ETDEWEB)

    Fanti, Stefano; Ceci, Francesco; Castellucci, Paolo [University of Bologna, S. Orsola Hospital Bologna, Nuclear Medicine Unit, Bologna (Italy); Minozzi, Silvia [Lazio Regional Health Service, Department of Epidemiology, Rome (Italy); Morigi, Joshua James; Emmett, Louise [St. Vincent's Public Hospital, Department of Diagnostic Imaging, Sydney (Australia); Giesel, Frederik; Haberkorn, Uwe [University Hospital Heidelberg, Department of Nuclear Medicine, Heidelberg (Germany); Uprimny, Christian; Virgolini, Irene [Medical University Innsbruck, Department of Nuclear Medicine, Innsbruck (Austria); Hofman, Michael S.; Hicks, Rodney J. [Peter MacCallum Cancer Centre, Centre for Molecular Imaging, Department of Cancer Imaging, Melbourne (Australia); Eiber, Matthias; Schwaiger, Markus [Technical University Munich, Department of Nuclear Medicine, Munich (Germany); Schwarzenbock, Sarah; Krause, Bernd J. [University Medical Centre, Department of Nuclear Medicine, Rostock (Germany); Bellisario, Cristina [University Hospital "Citta della Salute e della Scienza di Torino", Department of Cancer Screening, Centre for Epidemiology and Prevention in Oncology (CPO), Turin (Italy); Chauvie, Stephane; Bergesio, Fabrizio [Santa Croce e Carle Hospital, Medical Physics Division, Cuneo (Italy); Chiti, Arturo [Humanitas Clinical and Research Hospital, Nuclear Medicine, Humanitas Cancer Center, Rozzano, MI (Italy)

    2017-09-15

    After primary treatment, biochemical relapse (BCR) occurs in a substantial number of patients with prostate cancer (PCa). PET/CT imaging with prostate-specific membrane antigen based tracers (68Ga-PSMA) has shown promising results for BCR patients. However, a standardized image interpretation methodology has yet to be properly agreed. The aim of this study, which was promoted and funded by European Association of Nuclear Medicine (EANM), is to define standardized image interpretation criteria for 68Ga-PSMA PET/CT to detect recurrent PCa lesions in patients treated with primary curative intent therapy (radical prostatectomy or radiotherapy) who presented a biochemical recurrence. In the first phase inter-rater agreement between seven readers from seven international centers was calculated on the reading of 68Ga-PSMA PET/CT images of 49 patients with BCR. Each reader evaluated findings in five different sites of recurrence (local, loco-regional lymph nodes, distant lymph nodes, bone, and other). In the second phase the re-analysis was limited to cases with poor, slight, fair, or moderate agreement [Krippendorff's (K) alpha<0.61]. Finally, on the basis of the consensus readings, we sought to define a list of revised consensus criteria for 68Ga-PSMA PET/CT interpretation. Between-reader agreement for the presence of anomalous findings in any of the five sites was only moderate (K's alpha: 0.47). The agreement improved and became substantial when readers had to judge whether the anomalous findings were suggestive for a pathologic, uncertain, or non-pathologic image (K's alpha: 0.64). K's alpha calculations for each of the five sites of recurrence were also performed and evaluated. First Delphi round was thus conducted. A more detailed definition of the criteria was proposed by the project coordinator, which was then discussed and finally agreed by the seven readers. After the second Delphi round only four cases of disagreement still remained. These

  1. Inter-operator and inter-device agreement and reliability of the SEM Scanner.

    Science.gov (United States)

    Clendenin, Marta; Jaradeh, Kindah; Shamirian, Anasheh; Rhodes, Shannon L

    2015-02-01

    The SEM Scanner is a medical device designed for use by healthcare providers as part of pressure ulcer prevention programs. The objective of this study was to evaluate the inter-rater and inter-device agreement and reliability of the SEM Scanner. Thirty-one (31) volunteers free of pressure ulcers or broken skin at the sternum, sacrum, and heels were assessed with the SEM Scanner. Each of three operators utilized each of three devices to collect readings from four anatomical sites (sternum, sacrum, left and right heels) on each subject for a total of 108 readings per subject collected over approximately 30 min. For each combination of operator-device-anatomical site, three SEM readings were collected. Inter-operator and inter-device agreement and reliability were estimated. Over the course of this study, more than 3000 SEM Scanner readings were collected. Agreement between operators was good with mean differences ranging from -0.01 to 0.11. Inter-operator and inter-device reliability exceeded 0.80 at all anatomical sites assessed. The results of this study demonstrate the high reliability and good agreement of the SEM Scanner across different operators and different devices. Given the limitations of current methods to prevent and detect pressure ulcers, the SEM Scanner shows promise as an objective, reliable tool for assessing the presence or absence of pressure-induced tissue damage such as pressure ulcers. Copyright © 2015 Bruin Biometrics, LLC. Published by Elsevier Ltd. All rights reserved.

  2. Automated detection of fundus photographic red lesions in diabetic retinopathy.

    Science.gov (United States)

    Larsen, Michael; Godt, Jannik; Larsen, Nicolai; Lund-Andersen, Henrik; Sjølie, Anne Katrin; Agardh, Elisabet; Kalm, Helle; Grunkin, Michael; Owens, David R

    2003-02-01

    To compare a fundus image-analysis algorithm for automated detection of hemorrhages and microaneurysms with visual detection of retinopathy in patients with diabetes. Four hundred fundus photographs (35-mm color transparencies) were obtained in 200 eyes of 100 patients with diabetes who were randomly selected from the Welsh Community Diabetic Retinopathy Study. A gold standard reference was defined by classifying each patient as having or not having diabetic retinopathy based on overall visual grading of the digitized transparencies. A single-lesion visual grading was made independently, comprising meticulous outlining of all single lesions in all photographs and used to develop the automated red lesion detection system. A comparison of visual and automated single-lesion detection in replicating the overall visual grading was then performed. Automated red lesion detection demonstrated a specificity of 71.4% and a resulting sensitivity of 96.7% in detecting diabetic retinopathy when applied at a tentative threshold setting for use in diabetic retinopathy screening. The accuracy of 79% could be raised to 85% by adjustment of a single user-supplied parameter determining the balance between the screening priorities, for which a considerable range of options was demonstrated by the receiver-operating characteristic (area under the curve 90.3%). The agreement of automated lesion detection with overall visual grading (0.659) was comparable to the mean agreement of six ophthalmologists (0.648). Detection of diabetic retinopathy by automated detection of single fundus lesions can be achieved with a performance comparable to that of experienced ophthalmologists. The results warrant further investigation of automated fundus image analysis as a tool for diabetic retinopathy screening.
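
The screening metrics quoted in the abstract above (sensitivity, specificity, accuracy) all derive from a 2x2 table of test result against reference standard. The sketch below shows the arithmetic. The function name `screening_metrics` and the counts are hypothetical: they were chosen only because they are numerically consistent with the reported percentages for 100 patients, and the abstract does not give the underlying table.

```python
def screening_metrics(true_pos, false_neg, true_neg, false_pos):
    """Sensitivity, specificity and accuracy from confusion-matrix counts."""
    sensitivity = true_pos / (true_pos + false_neg)   # detected among diseased
    specificity = true_neg / (true_neg + false_pos)   # cleared among non-diseased
    total = true_pos + false_neg + true_neg + false_pos
    accuracy = (true_pos + true_neg) / total
    return sensitivity, specificity, accuracy


# Hypothetical counts for 100 patients (not taken from the abstract).
sens, spec, acc = screening_metrics(true_pos=29, false_neg=1, true_neg=50, false_pos=20)
print(f"sensitivity={sens:.3f} specificity={spec:.3f} accuracy={acc:.2f}")
```

Accuracy is a prevalence-weighted mix of sensitivity and specificity, so a screening threshold tuned for high sensitivity can show modest accuracy when most patients are disease-free.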

  3. Inter-rater reliability of an observation-based ergonomics assessment checklist for office workers.

    Science.gov (United States)

    Pereira, Michelle Jessica; Straker, Leon Melville; Comans, Tracy Anne; Johnston, Venerina

    2016-12-01

    To establish the inter-rater reliability of an observation-based ergonomics assessment checklist for computer workers. A 37-item (38-item if a laptop was part of the workstation) comprehensive observational ergonomics assessment checklist comparable to government guidelines and up to date with empirical evidence was developed. Two trained practitioners assessed full-time office workers performing their usual computer-based work and evaluated the suitability of workstations used. Practitioners assessed each participant consecutively. The order of assessors was randomised, and the second assessor was blinded to the findings of the first. Unadjusted kappa coefficients between the raters were obtained for the overall checklist and subsections that were formed from question-items relevant to specific workstation equipment. Twenty-seven office workers were recruited. The inter-rater reliability between two trained practitioners achieved moderate to good reliability for all except one checklist component. This checklist has mostly moderate to good reliability between two trained practitioners. Practitioner Summary: This reliable ergonomics assessment checklist for computer workers was designed using accessible government guidelines and supplemented with up-to-date evidence. Employers in Queensland (Australia) can fulfil legislative requirements by using this reliable checklist to identify and subsequently address potential risk factors for work-related injury to provide a safe working environment.

  4. Test-re-test reliability and inter-rater reliability of a digital pelvic inclinometer in young, healthy males and females.

    Science.gov (United States)

    Beardsley, Chris; Egerton, Tim; Skinner, Brendon

    2016-01-01

    Objective. The purpose of this study was to investigate the reliability of a digital pelvic inclinometer (DPI) for measuring sagittal plane pelvic tilt in 18 young, healthy males and females. Method. The inter-rater reliability and test-re-test reliabilities of the DPI for measuring pelvic tilt in standing on both the right and left sides of the pelvis were measured by two raters carrying out two rating sessions of the same subjects, three weeks apart. Results. For measuring pelvic tilt, inter-rater reliability was designated as good on both sides (ICC = 0.81-0.88), test-re-test reliability within a single rating session was designated as good on both sides (ICC = 0.88-0.95), and test-re-test reliability between two rating sessions was designated as moderate on the left side (ICC = 0.65) and good on the right side (ICC = 0.85). Conclusion. Inter-rater reliability and test-re-test reliability within a single rating session of the DPI in measuring pelvic tilt were both good, while test-re-test reliability between rating sessions was moderate-to-good. Caution is required regarding the interpretation of the test-re-test reliability within a single rating session, as the raters were not blinded. Further research is required to establish validity.

  5. Translation, adaptation and inter-rater reliability of the administration manual for the Fugl-Meyer assessment.

    Science.gov (United States)

    Michaelsen, Stella M; Rocha, André S; Knabben, Rodrigo J; Rodrigues, Luciano P; Fernandes, Claudia G C

    2011-01-01

    Recently, the reliability of the Brazilian version of the Fugl-Meyer Assessment (FMA) was assessed through the scoring given according to observations made by a single evaluator who applied the test. When different raters apply the scale, the reliability may depend on the interpretation given to the assessment sheet. In such cases, a clear administration manual is essential for ensuring homogeneity of application. To translate and adapt the French Canadian version of the FMA administration manual into Brazilian Portuguese and to evaluate the inter-rater reliability when different evaluators apply the FMA on the basis of the information contained in the manual. Eighteen adults (59±10 years) with chronic hemiparesis (38±35 months after a stroke) took part in this study. Eight patients participated in the first part of the study and 10 in the second part. Based on analyzing the results from part 1, an adapted version was developed, in which information and photos were added to illustrate the positions of the patient and evaluator. The inter-rater reliability was assessed using the intraclass correlation coefficient (ICC). The reliability of the FMA based on the adapted version of the manual was excellent for the total motor scores for the upper limbs (ICC=0.98) and lower limbs (ICC=0.90), as well as for movement sense (ICC=0.98) and upper and lower-limb passive range of motion (ICC=0.84 and 0.90, respectively). The reliability was moderate for tactile sensitivity (0.75). The joint pain assessment presented low reliability. The results showed that, except for pain assessment, application of the FMA based on the adapted version of the application manual for Brazilian Portuguese presented adequate inter-rater reliability.

  6. Can Physicians Identify Inappropriate Nuclear Stress Tests? An Examination of Inter-rater Reliability for the 2009 Appropriate Use Criteria for Radionuclide Imaging

    Science.gov (United States)

    Ye, Siqin; Rabbani, LeRoy E.; Kelly, Christopher R.; Kelly, Maureen R.; Lewis, Matthew; Paz, Yehuda; Peck, Clara L.; Rao, Shaline; Bokhari, Sabahat; Weiner, Shepard D.; Einstein, Andrew J.

    2014-01-01

    Background: We sought to determine inter-rater reliability of the 2009 Appropriate Use Criteria (AUC) for radionuclide imaging (RNI) and whether physicians at various levels of training can effectively identify nuclear stress tests with inappropriate indications. Methods and Results: Four hundred patients were randomly selected from a consecutive cohort of patients undergoing nuclear stress testing at an academic medical center. Raters with different levels of training (including cardiology attending physicians, cardiology fellows, internal medicine hospitalists, and internal medicine interns) classified individual nuclear stress tests using the 2009 AUC. Consensus classification by two cardiologists was considered the operational gold standard, and sensitivity and specificity of individual raters for identifying inappropriate tests were calculated. Inter-rater reliability of the AUC was assessed using Cohen’s kappa statistics for pairs of different raters. The mean age of patients was 61.5 years; 214 (54%) were female. The cardiologists rated 256 (64%) of 400 NSTs as appropriate, 68 (18%) as uncertain, 55 (14%) as inappropriate; 21 (5%) tests could not be classified. Inter-rater reliability for non-cardiologist raters was modest (unweighted Cohen’s kappa, 0.51, 95% confidence interval, 0.45 to 0.55). Sensitivity of individual raters for identifying inappropriate tests ranged from 47% to 82%, while specificity ranged from 85% to 97%. Conclusions: Inter-rater reliability for the 2009 AUC for RNI is modest, and there is considerable variation in the ability of raters at different levels of training to identify inappropriate tests. PMID:25563660
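    The unweighted Cohen's kappa used in this record compares the observed agreement between two raters with the agreement expected by chance from each rater's label frequencies. The following is a minimal sketch of that calculation; the function name `cohens_kappa` and the toy ratings are hypothetical illustrations, not data from the study.

```python
from collections import Counter


def cohens_kappa(ratings_a, ratings_b):
    """Unweighted Cohen's kappa for two raters labelling the same items."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("need two equally long, non-empty rating lists")
    n = len(ratings_a)
    # Observed agreement: share of items given the same label by both raters.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Chance agreement: product of the raters' marginal label frequencies.
    freq_a = Counter(ratings_a)
    freq_b = Counter(ratings_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (observed - expected) / (1.0 - expected)


# Hypothetical toy ratings (assumed for illustration only).
rater_1 = ["appropriate", "appropriate", "inappropriate",
           "uncertain", "appropriate", "inappropriate"]
rater_2 = ["appropriate", "uncertain", "inappropriate",
           "uncertain", "appropriate", "appropriate"]
kappa = cohens_kappa(rater_1, rater_2)
```

    In this toy example the raters agree on 4 of 6 items (observed 0.667), chance agreement is 13/36, and kappa works out to 11/23, or about 0.48.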

  7. Acute stroke: automatic perfusion lesion outlining using level sets.

    Science.gov (United States)

    Mouridsen, Kim; Nagenthiraja, Kartheeban; Jónsdóttir, Kristjana Ýr; Ribe, Lars R; Neumann, Anders B; Hjort, Niels; Østergaard, Leif

    2013-11-01

    To develop a user-independent algorithm for the delineation of hypoperfused tissue on perfusion-weighted images and evaluate its performance relative to a standard threshold method in simulated data, as well as in acute stroke patients. The study was approved by the local ethics committee, and patients gave written informed consent prior to their inclusion in the study. The algorithm identifies hypoperfused tissue in mean transit time maps by simultaneously minimizing the mean square error between individual and mean perfusion values inside and outside a smooth boundary. In 14 acute stroke patients, volumetric agreement between automated outlines and manual outlines determined in consensus among four neuroradiologists was assessed with Bland-Altman analysis, while spatial agreement was quantified by using lesion overlap relative to mean lesion volume (Dice coefficient). Performance improvement relative to a standard threshold approach was tested with the Wilcoxon signed rank test. The mean difference in lesion volume between automated outlines and manual outlines was -9.0 mL ± 44.5 (standard deviation). The lowest mean volume difference for the threshold approach was -25.8 mL ± 88.2. A significantly higher Dice coefficient was observed with the algorithm (0.71; interquartile range [IQR], 0.42-0.75) compared with the threshold approach (0.50; IQR, 0.27-0.57; P < .001). The corresponding agreement among experts was 0.79 (IQR, 0.69-0.83). The perfusion lesions outlined by the automated algorithm agreed well with those defined manually in consensus by four experts and were superior to those obtained by using the standard threshold approach. This user-independent algorithm may improve the assessment of perfusion images as part of acute stroke treatment. http://radiology.rsna.org/lookup/suppl/doi:10.1148/radiol.13121622/-/DC1. RSNA, 2013
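    The Dice coefficient described in this record, lesion overlap relative to mean lesion volume, can be sketched for two binary outlines as follows. The flat 0/1 mask representation, the function name `dice_coefficient`, and the toy masks are assumptions made for illustration; they are not taken from the study.

```python
def dice_coefficient(mask_a, mask_b):
    """Dice overlap of two binary masks given as equal-length sequences of 0/1."""
    if len(mask_a) != len(mask_b):
        raise ValueError("masks must cover the same voxels")
    overlap = sum(1 for a, b in zip(mask_a, mask_b) if a and b)
    size_a = sum(1 for a in mask_a if a)
    size_b = sum(1 for b in mask_b if b)
    if size_a + size_b == 0:
        # Assumed convention: two empty outlines count as perfect agreement.
        return 1.0
    # Overlap divided by the mean of the two volumes: 2*|A∩B| / (|A| + |B|).
    return 2.0 * overlap / (size_a + size_b)


# Hypothetical toy outlines over eight voxels (not study data).
automated = [0, 1, 1, 1, 0, 0, 1, 0]
manual = [0, 0, 1, 1, 1, 0, 1, 0]
dice = dice_coefficient(automated, manual)
```

    Here both outlines contain four voxels and share three, so the coefficient is 2 × 3 / (4 + 4) = 0.75.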

  8. Automatic segmentation and volumetry of multiple sclerosis brain lesions from MR images

    Directory of Open Access Journals (Sweden)

    Saurabh Jain

    2015-01-01

    The location and extent of white matter lesions on magnetic resonance imaging (MRI) are important criteria for diagnosis, follow-up and prognosis of multiple sclerosis (MS). Clinical trials have shown that quantitative values, such as lesion volumes, are meaningful in MS prognosis. Manual lesion delineation for the segmentation of lesions is, however, time-consuming and suffers from observer variability. In this paper, we propose MSmetrix, an accurate and reliable automatic method for lesion segmentation based on MRI, independent of scanner or acquisition protocol and without requiring any training data. In MSmetrix, 3D T1-weighted and FLAIR MR images are used in a probabilistic model to detect white matter (WM) lesions as an outlier to normal brain while segmenting the brain tissue into grey matter, WM and cerebrospinal fluid. The actual lesion segmentation is performed based on prior knowledge about the location (within WM) and the appearance (hyperintense on FLAIR) of lesions. The accuracy of MSmetrix is evaluated by comparing its output with expert reference segmentations of 20 MRI datasets of MS patients. Spatial overlap (Dice) between the MSmetrix and the expert lesion segmentation is 0.67 ± 0.11. The intraclass correlation coefficient (ICC) equals 0.8 indicating a good volumetric agreement between the MSmetrix and expert labelling. The reproducibility of MSmetrix' lesion volumes is evaluated based on 10 MS patients, scanned twice with a short interval on three different scanners. The agreement between the first and the second scan on each scanner is evaluated through the spatial overlap and absolute lesion volume difference between them. The spatial overlap was 0.69 ± 0.14 and absolute total lesion volume difference between the two scans was 0.54 ± 0.58 ml. Finally, the accuracy and reproducibility of MSmetrix compare favourably with other publicly available MS lesion segmentation algorithms, applied on the same data using default

  9. Education Research: Bias and poor interrater reliability in evaluating the neurology clinical skills examination

    Science.gov (United States)

    Schuh, L A.; London, Z; Neel, R; Brock, C; Kissela, B M.; Schultz, L; Gelb, D J.

    2009-01-01

    Objective: The American Board of Psychiatry and Neurology (ABPN) has recently replaced the traditional, centralized oral examination with the locally administered Neurology Clinical Skills Examination (NEX). The ABPN postulated the experience with the NEX would be similar to the Mini-Clinical Evaluation Exercise, a reliable and valid assessment tool. The reliability and validity of the NEX have not been established. Methods: NEX encounters were videotaped at 4 neurology programs. Local faculty and ABPN examiners graded the encounters using 2 different evaluation forms: an ABPN form and one with a contracted rating scale. Some NEX encounters were purposely failed by residents. Cohen’s kappa and intraclass correlation coefficients (ICC) were calculated for local vs ABPN examiners. Results: Ninety-eight videotaped NEX encounters of 32 residents were evaluated by 20 local faculty evaluators and 18 ABPN examiners. The interrater reliability for a determination of pass vs fail for each encounter was poor (kappa 0.32; 95% confidence interval [CI] = 0.11, 0.53). ICC between local faculty and ABPN examiners for each performance rating on the ABPN NEX form was poor to moderate (ICC range 0.14-0.44), and did not improve with the contracted rating form (ICC range 0.09-0.36). ABPN examiners were more likely than local examiners to fail residents. Conclusions: There is poor interrater reliability between local faculty and American Board of Psychiatry and Neurology examiners. A bias was detected for favorable assessment locally, which is concerning for the validity of the examination. Further study is needed to assess whether training can improve interrater reliability and offset bias. GLOSSARY ABIM = American Board of Internal Medicine; ABPN = American Board of Psychiatry and Neurology; CI = confidence interval; HFH = Henry Ford Hospital; ICC = intraclass correlation coefficients; IM = internal medicine; mini-CEX = Mini-Clinical Evaluation Exercise; NEX = Neurology Clinical

  10. Interrater reliability of the Melbourne Assessment of Unilateral Upper Limb Function for children with hemiplegic cerebral palsy.

    LENUS (Irish Health Repository)

    Spirtos, Michelle

    2012-02-01

    OBJECTIVE: We examined the interrater reliability of the Melbourne Assessment of Unilateral Upper Limb Function. METHOD: Three occupational therapists independently scored 34 videotaped assessments of children with hemiplegic cerebral palsy aged 6 yr, 1 mo, to 14 yr, 5 mo. Intraclass correlation coefficients (ICCs) at a 95% confidence interval were calculated for total scores, category scores, and item scores. RESULTS: The correlation between raters' total scores was high (ICC = .961). The highest correlation for test components between raters was found for fluency (ICC = .902), followed by range of movement (ICC = .866), and the lowest correlation was found for quality of movement (ICC = .683). The ICCs for individual test item scores varied and ranged from .368 to .899. CONCLUSION: This study demonstrated high interrater reliability for total scores, with scoring of some individual components and items requiring further consideration from both a clinical and a research perspective.

  11. The Automated Breast Volume Scanner (ABVS): initial experiences in lesion detection compared with conventional handheld B-mode ultrasound: a pilot study of 50 cases

    Directory of Open Access Journals (Sweden)

    Wojcinski S

    2011-10-01

    Sebastian Wojcinski1, Andre Farrokh1, Ursula Hille2, Jakub Wiskirchen3, Samuel Gyapong1, Amr A Soliman1,4, Friedrich Degenhardt1, Peter Hillemanns2. 1Department of OB/GYN, Franziskus Hospital, Bielefeld, Germany; 2Department of OB/GYN, Hannover Medical School, Hannover, Germany; 3Department of Radiology, Franziskus Hospital, Bielefeld, Germany; 4Department of OB/GYN, Faculty of Medicine, University of Alexandria, Alexandria, Egypt. Abstract: The idea of an automated whole breast ultrasound was developed three decades ago. We present our initial experiences with the latest technical advance in this technique, the automated breast volume scanner (ABVS, ACUSON S2000™). Volume data sets were collected from 50 patients and a database containing 23 women with no detectable lesions in conventional ultrasound (BI-RADS®-US 1), 13 women with clearly benign lesions (BI-RADS®-US 2), and 14 women with known breast cancer (BI-RADS®-US 5) was created. An independent examiner evaluated the ABVS data on a separate workstation without any prior knowledge of the patients’ histories. The diagnostic accuracy for the experimental ABVS was 66.0% (95% confidence interval [CI]: 52.9–79.1). The independent examiner detected all breast cancers in the volume data resulting in a calculated sensitivity of 100% in the described setting (95% CI: 73.2%–100%). After the ABVS examination, there were a high number of requests for second-look ultrasounds in 47% (95% CI: 30.9–63.5) of the healthy women (with either a clearly benign lesion or no breast lesions at all in conventional handheld ultrasound). Therefore, the specificity remained at 52.8% (95% CI: 35.7–69.2). When comparing the concordance of the ABVS with the gold standard (conventional handheld ultrasound), Cohen’s Kappa value as an estimation of the inter-rater reliability was κ = 0.37, indicating fair agreement. In conclusion, the ABVS must still be regarded as an experimental technique for breast ultrasound, which
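    The accuracy, sensitivity and specificity reported in this record can be related to a 2×2 table. The sketch below assumes counts that are inferred from the reported figures rather than stated in the abstract: 14 cancers all detected, and 36 healthy women of whom 17 (about 47%) prompted a second-look request, leaving 19 correctly cleared. The function name `diagnostic_metrics` is hypothetical.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Return (sensitivity, specificity, accuracy) from 2x2 table counts."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return sensitivity, specificity, accuracy


# Counts inferred from the reported percentages (an assumption, see lead-in).
sensitivity, specificity, accuracy = diagnostic_metrics(tp=14, fn=0, tn=19, fp=17)
```

    Under these assumed counts the three values come out at 100%, 52.8% and 66.0%, which matches the figures quoted in the abstract.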

  12. Low Cost Technology for Screening Early Cancerous Lesions of ...

    African Journals Online (AJOL)

    hanumantp

    [1] The detection rate for early cancerous lesions improved. 60% by unaided visual ... no agreement as how to screen and where to refer patients with oral cancer for ... External and internal examination of the oral cavity was done by the dentist ...

  13. INTER-RATER RELIABILITY FOR MOVEMENT PATTERN ANALYSIS (MPA): MEASURING PATTERNING OF BEHAVIORS VERSUS DISCRETE BEHAVIOR COUNTS AS INDICATORS OF DECISION-MAKING STYLE

    Directory of Open Access Journals (Sweden)

    Brenda L Connors

    2014-06-01

    The unique yield of collecting observational data on human movement has received increasing attention in a number of domains, including the study of decision-making style. As such, interest has grown in the nuances of core methodological issues, including the best ways of assessing inter-rater reliability. In this paper we focus on one key topic – the distinction between establishing reliability for the patterning of behaviors as opposed to the computation of raw counts – and suggest that reliability for each be compared empirically rather than determined a priori. We illustrate by assessing inter-rater reliability for key outcome measures derived from Movement Pattern Analysis (MPA), an observational methodology that records body movements as indicators of decision-making style with demonstrated predictive validity. While reliability ranged from moderate to good for raw counts of behaviors reflecting each of two Overall Factors generated within MPA (Assertion and Perspective), inter-rater reliability for patterning (proportional indicators of each factor) was significantly higher and excellent (ICC = .89). Furthermore, patterning, as compared to raw counts, provided better prediction of observable decision-making process assessed in the laboratory. These analyses support the utility of using an empirical approach to inform the consideration of measuring discrete behavioral counts versus patterning of behaviors when determining inter-rater reliability of observable behavior. They also speak to the substantial reliability that may be achieved via application of theoretically grounded observational systems such as MPA that reveal thinking and action motivations via visible movement patterns.

  14. Test–re-test reliability and inter-rater reliability of a digital pelvic inclinometer in young, healthy males and females

    Directory of Open Access Journals (Sweden)

    Chris Beardsley

    2016-03-01

    Objective. The purpose of this study was to investigate the reliability of a digital pelvic inclinometer (DPI) for measuring sagittal plane pelvic tilt in 18 young, healthy males and females. Method. The inter-rater reliability and test–re-test reliabilities of the DPI for measuring pelvic tilt in standing on both the right and left sides of the pelvis were measured by two raters carrying out two rating sessions of the same subjects, three weeks apart. Results. For measuring pelvic tilt, inter-rater reliability was designated as good on both sides (ICC = 0.81–0.88), test–re-test reliability within a single rating session was designated as good on both sides (ICC = 0.88–0.95), and test–re-test reliability between two rating sessions was designated as moderate on the left side (ICC = 0.65) and good on the right side (ICC = 0.85). Conclusion. Inter-rater reliability and test–re-test reliability within a single rating session of the DPI in measuring pelvic tilt were both good, while test–re-test reliability between rating sessions was moderate-to-good. Caution is required regarding the interpretation of the test–re-test reliability within a single rating session, as the raters were not blinded. Further research is required to establish validity.

  15. Vibration Response Imaging: evaluation of rater agreement in healthy subjects and subjects with pneumonia

    International Nuclear Information System (INIS)

    Bartziokas, Konstantinos; Daenas, Christos; Preau, Sebastien; Zygoulis, Paris; Triantaris, Apostolos; Kerenidi, Theodora; Makris, Demosthenes; Gourgoulianis, Konstantinos I; Daniil, Zoe

    2010-01-01

    We evaluated pulmonologists' variability in the interpretation of Vibration Response Imaging (VRI) obtained from healthy subjects and patients hospitalized for community-acquired pneumonia. This is a prospective study conducted in a tertiary university hospital. Twenty healthy subjects and twenty-three pneumonia cases were included in this study. Six pulmonologists blindly analyzed images of normal subjects and pneumonia cases and evaluated different aspects of VRI images related to the quality of data acquisition, synchronization of the progression of breath sound distribution and agreement between the maximal energy frame (MEF) of VRI (which is the maximal geographical area of lung vibrations produced at maximal inspiration) and chest radiography. For qualitative assessment of VRI images, the raters' evaluations were analyzed by degree of consistency and agreement. The average value for overall identical evaluations of twelve features of the VRI image evaluation ranged from 87% to 95% per rater (94% to 97% in control cases and from 79% to 93% per rater in pneumonia cases). Inter-rater median (IQR) agreement was 91% (82-96). The level of agreement according to VRI feature evaluated was in most cases over 80%; intra-class correlation (ICC) obtained by using a model of subject/rater for the averaged features was overall 0.86 (0.92 in normal and 0.73 in pneumonia cases). Our findings suggest good agreement in the interpretation of VRI data between different raters. In this respect, VRI might be helpful as a radiation-free diagnostic tool for the management of pneumonia.

  16. Vibration Response Imaging: evaluation of rater agreement in healthy subjects and subjects with pneumonia

    Directory of Open Access Journals (Sweden)

    Makris Demosthenes

    2010-03-01

    Background: We evaluated pulmonologists' variability in the interpretation of Vibration Response Imaging (VRI) obtained from healthy subjects and patients hospitalized for community-acquired pneumonia. Methods: This is a prospective study conducted in a tertiary university hospital. Twenty healthy subjects and twenty-three pneumonia cases were included in this study. Six pulmonologists blindly analyzed images of normal subjects and pneumonia cases and evaluated different aspects of VRI images related to the quality of data acquisition, synchronization of the progression of breath sound distribution and agreement between the maximal energy frame (MEF) of VRI (which is the maximal geographical area of lung vibrations produced at maximal inspiration) and chest radiography. For qualitative assessment of VRI images, the raters' evaluations were analyzed by degree of consistency and agreement. Results: The average value for overall identical evaluations of twelve features of the VRI image evaluation ranged from 87% to 95% per rater (94% to 97% in control cases and from 79% to 93% per rater in pneumonia cases). Inter-rater median (IQR) agreement was 91% (82-96). The level of agreement according to VRI feature evaluated was in most cases over 80%; intra-class correlation (ICC) obtained by using a model of subject/rater for the averaged features was overall 0.86 (0.92 in normal and 0.73 in pneumonia cases). Conclusions: Our findings suggest good agreement in the interpretation of VRI data between different raters. In this respect, VRI might be helpful as a radiation-free diagnostic tool for the management of pneumonia.

  17. A pediatric FOUR score coma scale: interrater reliability and predictive validity.

    Science.gov (United States)

    Czaikowski, Brianna L; Liang, Hong; Stewart, C Todd

    2014-04-01

    The Full Outline of UnResponsiveness (FOUR) Score is a coma scale that consists of four components (eye and motor response, brainstem reflexes, and respiration). It was originally validated among the adult population and recently in a pediatric population. To enhance clinical assessment of pediatric intensive care unit patients, including those intubated and/or sedated, at our children's hospital, we modified the FOUR Score Scale for this population. This modified scale would provide many of the same advantages as the original, such as interrater reliability, simplicity, and elimination of the verbal component that is not compatible with the Glasgow Coma Scale (GCS), creating a more valuable neurological assessment tool for the nursing community. Our goal was to potentially provide greater information than the formally used GCS when assessing critically ill, neurologically impaired patients, including those sedated and/or intubated. Experienced pediatric intensive care unit nurses were trained as "expert raters." Two different nurses assessed each subject using the Pediatric FOUR Score Scale (PFSS), GCS, and Richmond Agitation Sedation Scale at three different time points. Data were compared with the Pediatric Cerebral Performance Category (PCPC) assessed by another nurse. Our hypothesis was that the PFSS and PCPC should highly correlate and the GCS and PCPC should correlate lower. Study results show that the PFSS is excellent for interrater reliability for trained nurse-rater pairs and prediction of poor outcome and in-hospital mortality, under various situations, but there were no statistically significant differences between the PFSS and the GCS. However, the PFSS does have the potential to provide greater neurological assessment in the intubated and/or sedated patient based on the outcomes of our study.

  18. Clinical Functional Capacity Testing in Patients With Facioscapulohumeral Muscular Dystrophy: Construct Validity and Interrater Reliability of Antigravity Tests

    NARCIS (Netherlands)

    Rijken, N.H.M.; Engelen, B.G.M. van; Weerdesteyn, V.G.M.; Geurts, A.C.H.

    2015-01-01

    OBJECTIVE: To evaluate the construct validity and interrater reliability of 4 simple antigravity tests in a small group of patients with facioscapulohumeral muscular dystrophy (FSHD). DESIGN: Case-control study. SETTING: University medical center. PARTICIPANTS: Patients with various severity levels

  19. Usefulness of US-guided automated gun biopsy of nonpalpable breast lesions

    Energy Technology Data Exchange (ETDEWEB)

    Kwak, Min Sook; Kim, Hak Soo; Lee, Han Kyung; Koh, Sung Hye; O, Eun Young; Yoon, Myung Hwan; Yang, Dal Mo; Kim, Hyung Sik [Chungang Gil Hospital, Incheon (Korea, Republic of)

    1997-11-01

    To evaluate the clinical usefulness of ultrasonography (US)-guided automated gun biopsy of nonpalpable breast lesions. In 30 nonpalpable breast lesions over 0.6 cm and detected on US, we performed US-guided biopsy using an 18-gauge automated biopsy gun. Two to four specimens were obtained from each lesion. We analyzed the site, size and depth of the lesions, and the length and histopathologic results of the specimens. In four lesions, surgical biopsy and gun biopsy results were compared. In 29 of 30 lesions (96.7%), specimens were adequate for histopathologic diagnosis, with the following results: one case of infiltrating ductal carcinoma, 13 of fibrocystic disease, 10 of fibrocystic disease versus fibroadenoma and one of fibroadenoma. There were also one case of reactive hyperplasia of a lymph node, one of fatty tissue and two of normal tissue; in these four lesions, agreement between gun and surgical biopsy results was 100%. The only complication was minor bleeding, which was controlled by compression. US-guided automated gun biopsy is a clinically useful and safe procedure for evaluating nonpalpable breast lesions detected on US.

  20. Usefulness of US-guided automated gun biopsy of nonpalpable breast lesions

    International Nuclear Information System (INIS)

    Kwak, Min Sook; Kim, Hak Soo; Lee, Han Kyung; Koh, Sung Hye; O, Eun Young; Yoon, Myung Hwan; Yang, Dal Mo; Kim, Hyung Sik

    1997-01-01

    To evaluate the clinical usefulness of ultrasonography (US)-guided automated gun biopsy of nonpalpable breast lesions. In 30 nonpalpable breast lesions over 0.6 cm and detected on US, we performed US-guided biopsy using an 18-gauge automated biopsy gun. Two to four specimens were obtained from each lesion. We analyzed the site, size and depth of the lesions, and the length and histopathologic results of the specimens. In four lesions, surgical biopsy and gun biopsy results were compared. In 29 of 30 lesions (96.7%), specimens were adequate for histopathologic diagnosis, with the following results: one case of infiltrating ductal carcinoma, 13 of fibrocystic disease, 10 of fibrocystic disease versus fibroadenoma and one of fibroadenoma. There were also one case of reactive hyperplasia of a lymph node, one of fatty tissue and two of normal tissue; in these four lesions, agreement between gun and surgical biopsy results was 100%. The only complication was minor bleeding, which was controlled by compression. US-guided automated gun biopsy is a clinically useful and safe procedure for evaluating nonpalpable breast lesions detected on US.

  1. Inter- and intraobserver variability of MR arthrography in the detection and classification of superior labral anterior posterior (SLAP) lesions: evaluation in 78 cases with arthroscopic correlation

    Energy Technology Data Exchange (ETDEWEB)

    Holzapfel, Konstantin; Waldt, Simone; Bruegel, Melanie; Rummeny, Ernst J.; Woertler, Klaus [Technische Universitaet Muenchen, Department of Radiology, Klinikum rechts der Isar, Munich (Germany); Paul, Jochen; Imhoff, Andreas B. [Technische Universitaet Muenchen, Department of Sports Orthopedics, Klinikum rechts der Isar, Munich (Germany); Heinrich, Petra [Technische Universitaet Muenchen, Institute of Medical Statistics and Epidemiology, Klinikum rechts der Isar, Munich (Germany)

    2010-03-15

    The purpose of this study was to determine inter- and intraobserver variability of MR arthrography of the shoulder in the detection and classification of superior labral anterior posterior (SLAP) lesions. MR arthrograms of 78 patients who underwent MR arthrography before arthroscopy were retrospectively analysed by three blinded readers for the presence and type of SLAP lesions. MR arthrograms were reviewed twice by each reader with a time interval of 4 months between the two readings. Inter- and intraobserver agreement for detection and classification of SLAP lesions were calculated using κ coefficients. Arthroscopy confirmed 48 SLAP lesions: type I (n = 4), type II (n = 37), type III (n = 3), type IV (n = 4). Sensitivity and specificity for detecting SLAP lesions with MR arthrography for each reader were 88.6%/93.3%, 90.9%/80.0% and 86.4%/76.7%. MR arthrographic and arthroscopic grading were concurrent for 72.7%, 68.2% and 70.5% of SLAP lesions for readers 1-3, respectively. Interobserver agreement was excellent (κ = 0.82) for detection and substantial (κ = 0.63) for classification of SLAP lesions. For each reader intraobserver agreement was excellent for detection (κ = 0.93, κ = 0.97, κ = 0.97) and classification (κ = 0.94, κ = 0.84, κ = 0.93) of SLAP lesions. MR arthrography allows reliable and accurate detection of SLAP lesions. In addition, SLAP lesions can be diagnosed and classified with substantial to excellent inter- and intraobserver agreement. (orig.)

  2. Inter- and intraobserver variability of MR arthrography in the detection and classification of superior labral anterior posterior (SLAP) lesions: evaluation in 78 cases with arthroscopic correlation

    International Nuclear Information System (INIS)

    Holzapfel, Konstantin; Waldt, Simone; Bruegel, Melanie; Rummeny, Ernst J.; Woertler, Klaus; Paul, Jochen; Imhoff, Andreas B.; Heinrich, Petra

    2010-01-01

    The purpose of this study was to determine inter- and intraobserver variability of MR arthrography of the shoulder in the detection and classification of superior labral anterior posterior (SLAP) lesions. MR arthrograms of 78 patients who underwent MR arthrography before arthroscopy were retrospectively analysed by three blinded readers for the presence and type of SLAP lesions. MR arthrograms were reviewed twice by each reader with a time interval of 4 months between the two readings. Inter- and intraobserver agreement for detection and classification of SLAP lesions were calculated using κ coefficients. Arthroscopy confirmed 48 SLAP lesions: type I (n = 4), type II (n = 37), type III (n = 3), type IV (n = 4). Sensitivity and specificity for detecting SLAP lesions with MR arthrography for each reader were 88.6%/93.3%, 90.9%/80.0% and 86.4%/76.7%. MR arthrographic and arthroscopic grading were concurrent for 72.7%, 68.2% and 70.5% of SLAP lesions for readers 1-3, respectively. Interobserver agreement was excellent (κ = 0.82) for detection and substantial (κ = 0.63) for classification of SLAP lesions. For each reader intraobserver agreement was excellent for detection (κ = 0.93, κ = 0.97, κ = 0.97) and classification (κ = 0.94, κ = 0.84, κ = 0.93) of SLAP lesions. MR arthrography allows reliable and accurate detection of SLAP lesions. In addition, SLAP lesions can be diagnosed and classified with substantial to excellent inter- and intraobserver agreement. (orig.)

  3. Interobserver variation in the diagnosis of fibroepithelial lesions of the breast: a multicentre audit by digital pathology.

    Science.gov (United States)

    Dessauvagie, Benjamin F; Lee, Andrew H S; Meehan, Katie; Nijhawan, Anju; Tan, Puay Hoon; Thomas, Jeremy; Tie, Bibiana; Treanor, Darren; Umar, Seemeen; Hanby, Andrew M; Millican-Slater, Rebecca

    2018-02-13

    Fibroepithelial lesions (FELs) of the breast span a morphological continuum including lesions where distinction between cellular fibroadenoma (FA) and benign phyllodes tumour (PT) is difficult. The distinction is clinically important with FAs managed conservatively while equivocal lesions and PTs are managed with surgery. We sought to audit core biopsy diagnoses of equivocal FELs by digital pathology and to investigate whether digital point counting is useful in clarifying FEL diagnoses. Scanned slide images from cores and subsequent excisions of 69 equivocal FELs were examined in a multicentre audit by eight pathologists to determine the agreement and accuracy of core needle biopsy (CNB) diagnoses and by digital point counting of stromal cellularity and expansion to determine if classification could be improved. Interobserver variation was high on CNB with a unanimous diagnosis from all pathologists in only eight cases of FA, diagnoses of both FA and PT on the same CNB in 15 and a 'weak' mean kappa agreement between pathologists (k=0.36). 'Moderate' agreement was observed on CNBs among breast specialists (k=0.44) and on excision samples (k=0.49). Up to 23% of lesions confidently diagnosed as FA on CNB were PT on excision and up to 30% of lesions confidently diagnosed as PT on CNB were FA on excision. Digital point counting did not aid in the classification of FELs. Accurate and reproducible diagnosis of equivocal FELs is difficult, particularly on CNB, resulting in poor interobserver agreement and suboptimal accuracy. Given the diagnostic difficulty, and surgical implications, equivocal FELs should be reported in consultation with experienced breast pathologists as a small number of benign FAs can be selected out from equivocal lesions.

  4. [Quality assurance in coding expertise of hospital cases in the German DRG system. Evaluation of inter-rater reliability in MDK expertise].

    Science.gov (United States)

    Huber, H; Brambrink, M; Funk, R; Rieger, M

    2012-10-01

    The purpose of this study was to evaluate differences in the G-DRG results of a hospital case coded independently by 2 MDK raters. Inter-rater reliability between the 2 raters was calculated by examining the coding of individual hospital cases. The reasons for the non-agreement of the expert evaluations and suggestions to improve the process are discussed. From the expert evaluation pool of the MDK-WL a random sample of 0.7% of the 57,375 expertises was taken. Equality of distribution with the overall pool was tested by the χ² test or Fisher's exact test, as appropriate. For the total of 402 individual hospital cases, the G-DRG case sums of 2 experts of the MDK were determined independently and the results checked for each individual case for agreement or non-agreement. The corresponding confidence intervals with standard errors were analysed to test if certain major diagnosis categories (MDC) were statistically significantly more affected by differing expertise results than others. In 280 of the total 402 tested hospital cases, the 2 MDK raters independently reached the same G-DRG results; in 122 cases the G-DRG case sums determined by the 2 raters differed (agreement 70%; CI 65.2-74.1). Different DRG results between the 2 experts occurred regularly in the entire MDC spectrum. No MDC chapter in which significant differences between the 2 raters arose could be identified. The results of our study demonstrate an almost 70% agreement in the evaluation of hospital cost accounts by 2 independently operating MDK raters. This result leaves room for improvement. Optimisation potentials can be recognised on the basis of the results. Potential for improvement was established in combination with regular further training and the expansion of binding internal code recommendations as well as exchange of code-relevant information among experts in internal forums. The presented model is in principle suitable for cross-border examinations within the MDK system with the advantage that
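
    The agreement figure in this record (280 of 402 cases) can be checked with a short calculation. The sketch below assumes a normal-approximation (Wald) interval at the 95% level; the abstract does not say which interval method the authors used, and the function name `wald_ci` is illustrative only.

```python
import math


def wald_ci(successes, n, z=1.96):
    """Normal-approximation (Wald) confidence interval for a proportion.

    Assumption: z = 1.96 for a 95% interval; the abstract does not state
    the method or level that was actually used.
    """
    p = successes / n
    half_width = z * math.sqrt(p * (1 - p) / n)
    return p, p - half_width, p + half_width


# Counts reported in the abstract: 280 concordant cases out of 402.
p, low, high = wald_ci(280, 402)
print(f"agreement = {p:.1%}, 95% CI = {low:.1%} to {high:.1%}")
```

    Under these assumptions the interval rounds to the 65.2-74.1 range given in the abstract. A Wilson or exact interval would give slightly different bounds.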

  5. [Differentiation between moisture lesions and pressure ulcers using photographs in a critical area].

    Science.gov (United States)

    Valls-Matarín, Josefa; Del Cotillo-Fuente, Mercedes; Pujol-Vila, María; Ribal-Prior, Rosa; Sandalinas-Mulero, Inmaculada

    2016-01-01

    To identify the difficulties nurses have in differentiating between moisture lesions and pressure ulcers and in correctly classifying pressure ulcers according to the Grupo Nacional para el Estudio y Asesoramiento de Úlceras por Presión y Heridas Crónicas (GNEAUPP) system, and to assess the degree of agreement in the correct assessment by type and category of injury. Cross-sectional study in a critical area during 2014. All nurses who agreed to participate were included. They completed a questionnaire with 14 expert-validated photographs of moisture lesions or pressure ulcers in the sacral area and buttocks, with 6 possible answers: pressure ulcer category I, II, III or IV, moisture lesion, and unknown. Demographics and knowledge of the GNEAUPP pressure ulcer classification system were collected. Participation was 98% of the population (n=56); 98.2% knew the GNEAUPP classification system; 35.2% of moisture lesions were considered pressure ulcers, most of them category II (18.9%). Of the pressure ulcer photographs, 14.8% were identified as moisture lesions and 16.1% were classified in another category. The agreement between nurses yielded a global Kappa index of .38 (95% CI: .29-.57). There are difficulties differentiating between pressure ulcers and moisture lesions, especially within initial categories. Nurses have the perception that they know the pressure ulcer classification, but they do not classify ulcers correctly. The degree of concordance in the diagnosis of skin lesions was low. Copyright © 2016 Elsevier España, S.L.U. All rights reserved.

  6. Theoretical and experimental analysis of amplitude control ablation and bipolar ablation in creating linear lesion and discrete lesions for treating atrial fibrillation.

    Science.gov (United States)

    Yan, Shengjie; Wu, Xiaomei; Wang, Weiqi

    2017-09-01

    Radiofrequency (RF) energy is often used to create a linear lesion or discrete lesions for blocking the accessory conduction pathways for treating atrial fibrillation. Using finite element analysis, we study the ablation effect of amplitude control ablation mode (AcM) and bipolar ablation mode (BiM) in creating a linear lesion and discrete lesions in a 5-mm-thick atrial wall; in particular, the characteristics of lesion shape were investigated in amplitude control ablation. Computer models of a multipolar catheter were developed to study the lesion dimensions in atrial walls created through AcM, BiM and special electrodes activated ablation methods in AcM and BiM. To validate the theoretical results in this study, an in vitro experiment with porcine cardiac tissue was performed. At 40 V/20 V root mean squared (RMS) of the RF voltage for AcM, a continuous and transmural lesion was created by AcM-15s, AcM-5s and AcM-ad-20V ablation in a 5-mm-thick atrial wall. At 20 V RMS for BiM, a continuous but not transmural lesion was created. AcM ablation yielded asymmetrical and discrete lesion shapes, whereas the lesion shape became more symmetrical and continuous as the alternating electrode activation period decreased from 15 s to 5 s. Two discrete lesions were created when using AcM, AcM-ad-40V, BiM-ad-20V and BiM-ad-40V. The experimental and computational thermal lesion shapes created in cardiac tissue were in agreement. Amplitude control ablation technology and bipolar ablation technology are feasible methods to create continuous or discrete lesions for pulmonary vein isolation.

  7. TH-E-BRF-08: Subpopulations of Similarly-Responding Lesions in Metastatic Prostate Cancer

    International Nuclear Information System (INIS)

    Lin, C; Harmon, S; Perk, T; Jeraj, R

    2014-01-01

    Purpose: In patients with multiple lesions, resistance to cancer treatments and subsequent disease recurrence may be due to heterogeneity of response across lesions. This study aims to identify subpopulations of similarly-responding metastatic prostate cancer lesions in bone using quantitative PET metrics. Methods: Seven metastatic prostate cancer patients treated with AR-directed therapy received pre-treatment and mid-treatment [F-18]NaF PET/CT scans. Images were registered using an articulated CT registration algorithm and transformations were applied to PET segmentations. Mid-treatment response was calculated on PET-based texture features. Hierarchical agglomerative clustering was used to form groups of similarly-responding lesions, with the number of natural clusters (K) determined by the inconsistency coefficient. Lesion clustering was performed within each patient, and for the pooled population. The cophenetic coefficient (C) quantified how well the data were clustered. The Jaccard Index (JI) assessed similarity of cluster assignments from patient clustering and from population clustering. Results: 188 lesions in seven patients were identified for analysis (between 6 and 53 lesions per patient). Lesion response was defined as percent change relative to pre-treatment for 23 uncorrelated PET-based feature identifiers. High response heterogeneity was found across all lesions (i.e. range ΔSUVmax = −95.98% to 775.00%). For intra-patient clustering, K ranged from 1–20. Population-based clustering resulted in 75 clusters, of 1-6 lesions each. Intra-patient clustering resulted in higher quality clusters than population clustering (mean C=0.95, range=0.89 to 1.00). For all patients, cluster assignments from population clustering showed good agreement with intra-patient clustering (mean JI=0.87, range=0.68 to 1.00). Conclusion: Subpopulations of similarly-responding lesions were identified in patients with multiple metastatic lesions. Good agreement was found between

  8. Manual muscle testing and hand-held dynamometry in people with inflammatory myopathy: An intra- and interrater reliability and validity study.

    Science.gov (United States)

    Baschung Pfister, Pierrette; de Bruin, Eling D; Sterkele, Iris; Maurer, Britta; de Bie, Rob A; Knols, Ruud H

    2018-01-01

    Manual muscle testing (MMT) and hand-held dynamometry (HHD) are commonly used in people with inflammatory myopathy (IM), but their clinimetric properties have not yet been sufficiently studied. To evaluate the reliability and validity of MMT and HHD, maximum isometric strength was measured in eight muscle groups across three measurement events. To evaluate reliability of HHD, intra-class correlation coefficients (ICC), the standard error of measurements (SEM) and smallest detectable changes (SDC) were calculated. To measure reliability of MMT, linear Cohen's Kappa was computed for single muscle groups and ICC for total score. Additionally, correlations between MMT8 and HHD were evaluated with Spearman Correlation Coefficients. Fifty people with myositis (56±14 years, 76% female) were included in the study. Intra- and interrater reliability of HHD yielded excellent ICCs (0.75-0.97) for all muscle groups, except for interrater reliability of ankle extension (0.61). The corresponding SEMs% ranged from 8 to 28% and the SDCs% from 23 to 65%. MMT8 total score revealed excellent intra- and interrater reliability (ICC>0.9). Intrarater reliability of single muscle groups was substantial for shoulder and hip abduction, elbow and neck flexion, and hip extension (0.64-0.69); moderate for wrist (0.53) and knee extension (0.49) and fair for ankle extension (0.35). Interrater reliability was moderate for neck flexion (0.54) and hip abduction (0.44); fair for shoulder abduction, elbow flexion, wrist and ankle extension (0.20-0.33); and slight for knee extension (0.08). Correlations between the two tests were low for wrist, knee, ankle, and hip extension; moderate for elbow flexion, neck flexion and hip abduction; and good for shoulder abduction. In conclusion, the MMT8 total score is a reliable assessment to consider general muscle weakness in people with myositis but not for single muscle groups. In contrast, our results confirm that HHD can be recommended to evaluate strength of
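
    The SEM and SDC quantities named in this record can be illustrated with a short calculation. The sketch below assumes the commonly used definitions SEM = SD × sqrt(1 − ICC) and SDC = 1.96 × sqrt(2) × SEM; the abstract does not state which formulas the authors applied, and the input values are hypothetical, not data from the study.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from the between-subject SD and the ICC.

    Assumption: SEM = SD * sqrt(1 - ICC), one common definition.
    """
    return sd * math.sqrt(1.0 - icc)


def sdc95(sem):
    """Smallest detectable change at the 95% level: 1.96 * sqrt(2) * SEM."""
    return 1.96 * math.sqrt(2.0) * sem


# Hypothetical values for one muscle group (not taken from the study).
sd_newton, icc, mean_newton = 40.0, 0.91, 150.0

sem = sem_from_icc(sd_newton, icc)
sdc = sdc95(sem)
print(f"SEM = {sem:.1f} N ({sem / mean_newton:.0%} of the mean)")
print(f"SDC = {sdc:.1f} N ({sdc / mean_newton:.0%} of the mean)")
```

    Expressing both values as a percentage of the mean, as in the last two lines, corresponds to the SEM% and SDC% style of reporting used in the abstract.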

  9. Computing Inter-Rater Reliability for Observational Data: An Overview and Tutorial

    Directory of Open Access Journals (Sweden)

    Kevin A. Hallgren

    2012-02-01

    Many research designs require the assessment of inter-rater reliability (IRR) to demonstrate consistency among observational ratings provided by multiple coders. However, many studies use incorrect statistical procedures, fail to fully report the information necessary to interpret their results, or do not address how IRR affects the power of their subsequent analyses for hypothesis testing. This paper provides an overview of methodological issues related to the assessment of IRR with a focus on study design, selection of appropriate statistics, and the computation, interpretation, and reporting of some commonly-used IRR statistics. Computational examples include SPSS and R syntax for computing Cohen’s kappa and intra-class correlations to assess IRR.
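
    Cohen's kappa for two coders, the first statistic this record mentions, can be sketched in a few lines. The example below is a minimal illustration in Python rather than the SPSS or R syntax the paper provides; the function name `cohen_kappa` and the ratings are hypothetical.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Unweighted Cohen's kappa for two raters who rated the same items."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("both raters must rate the same non-empty set of items")
    n = len(ratings_a)
    # Observed proportion of items on which the raters agree.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Chance agreement from the product of each rater's marginal frequencies.
    freq_a, freq_b = Counter(ratings_a), Counter(ratings_b)
    categories = freq_a.keys() | freq_b.keys()
    expected = sum(freq_a[c] * freq_b[c] for c in categories) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when both raters use a single category")
    return (observed - expected) / (1.0 - expected)


# Hypothetical codes from two coders for ten observations.
coder_1 = ["y", "y", "y", "y", "y", "y", "n", "n", "n", "n"]
coder_2 = ["y", "y", "y", "y", "y", "n", "n", "n", "n", "y"]
print(round(cohen_kappa(coder_1, coder_2), 3))
```

    In this made-up example the coders agree on 8 of 10 items (observed agreement 0.80) while chance agreement is 0.52, which shows why kappa is lower than raw percent agreement.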

  10. The health preoccupation diagnostic interview: inter-rater reliability of a structured interview for diagnostic assessment of DSM-5 somatic symptom disorder and illness anxiety disorder.

    Science.gov (United States)

    Axelsson, Erland; Andersson, Erik; Ljótsson, Brjánn; Wallhed Finn, Daniel; Hedman, Erik

    2016-06-01

    Somatic symptom disorder (SSD) and illness anxiety disorder (IAD) are two new diagnoses introduced in the DSM-5. There is a need for reliable instruments to facilitate the assessment of these disorders. We therefore developed a structured diagnostic interview, the Health Preoccupation Diagnostic Interview (HPDI), which we hypothesized would reliably differentiate between SSD, IAD, and no diagnosis. Persons with clinically significant health anxiety (n = 52) and healthy controls (n = 52) were interviewed using the HPDI. Diagnoses were then compared with those made by an independent assessor, who listened to audio recordings of the interviews. Ratings generally indicated moderate to almost perfect inter-rater agreement, as illustrated by an overall Cohen's κ of .85. Disagreements primarily concerned (a) the severity of somatic symptoms, (b) the differential diagnosis of panic disorder, and (c) SSD specifiers. We conclude that the HPDI can be used to reliably diagnose DSM-5 SSD and IAD.

  11. HER-2, ER, PR status concordance in primary breast cancer and corresponding metastatic lesion in lymph node in Chinese women.

    Science.gov (United States)

    Li, Min Hua; Hou, Chuan Ling; Wang, Cheng; Sun, Ai Jing

    2016-04-01

    To compare the expression of estrogen receptor (ER), progesterone receptor (PR) and human epidermal growth factor receptor 2 (HER-2) in the primary site and the metastatic lesion of lymph nodes in invasive breast cancer for investigating whether the expression of these biomarkers in the primary site could act as a surrogate to the lymphatic metastatic lesion in the same patient. In lymphatic metastatic lesion and corresponding primary lesion of 107 cases of invasive breast cancer, ER and PR statuses were assessed by immunohistochemistry (IHC). HER-2 expression level was evaluated by IHC and/or fluorescence in situ hybridization (FISH). In the primary lesions, 43.9% were ER positive; 46.7% were PR positive; 34.6% were HER-2 positive. In corresponding lymphatic metastatic lesions, the HER-2 status was concordant in 90 patients; 9 patients were diagnosed positive in metastatic lesion while negative in primary lesion; 8 patients were negative in metastatic lesion while positive in primary site (agreement, 84.1%; κ=0.647). A change in ER status was observed in 24 cases: 17 cases positive in metastatic site while negative in primary site; 7 cases negative in metastatic site while positive in primary site (agreement, 77.6%; κ=0.534). PR status discordance between the primary lesion and the metastatic regional lymph nodes was reported in 19 cases (agreement, 82.2%; κ=0.640). This study revealed that there was only a moderate concordance of ER, PR and HER-2 status between primary tumors and metastatic lymph nodes. These results indicate that it was inappropriate to predict the status of ER, PR and HER-2 in metastatic lymph nodes based on the results of evaluation of that in primary lesions. Copyright © 2015 Elsevier GmbH. All rights reserved.

  12. Dosimetric analysis of radiation sources for use dermatological lesions

    International Nuclear Information System (INIS)

    Tada, Ariane

    2010-01-01

    Skin lesions undergoing therapy with radiation sources may have different patterns of malignancy. Malignant lesions or cancer most commonly found in radiotherapy services are carcinomas. Radiation therapy in skin lesions is performed with low penetration beams and orthovoltage X-rays, electron beams and radioactive sources (192Ir, 198Au and 90Sr) arranged on a surface mold or in a metal applicator. This study aims to analyze the therapeutic radiation dose profile produced by radiation sources used in skin lesion radiotherapy procedures. Experimental measurements for the dosimetric analysis of radiation sources were compared with calculations obtained from a computer system based on the Monte Carlo Method. Computational results were in good agreement with the experimental measurements. Experimental measurements and computational results by the MCNP4C code were both physically consistent as expected. These experimental measurements, compared with calculations using the MCNP4C code, have been used to validate the calculations obtained by the MCNP code and to provide a reliable medical application for each clinical case. (author)

  13. Developing a digital photography-based method for dietary analysis in self-serve dining settings.

    Science.gov (United States)

    Christoph, Mary J; Loman, Brett R; Ellison, Brenna

    2017-07-01

    Current population-based methods for assessing dietary intake, including food frequency questionnaires, food diaries, and 24-h dietary recall, are limited in their ability to objectively measure food intake. Digital photography has been identified as a promising addition to these techniques but has rarely been assessed in self-serve settings. We utilized digital photography to examine university students' food choices and consumption in a self-serve dining hall setting. Research assistants took pre- and post-photos of students' plates during lunch and dinner to assess selection (presence), servings, and consumption of MyPlate food groups. Four coders rated the same set of approximately 180 meals for inter-rater reliability analyses; approximately 50 additional meals were coded twice by each coder to assess intra-rater agreement. Inter-rater agreement on the selection, servings, and consumption of food groups was high at 93.5%; intra-rater agreement was similarly high with an average of 95.6% agreement. Coders achieved the highest rates of agreement in assessing if a food group was present on the plate (95-99% inter-rater agreement, depending on food group) and estimating the servings of food selected (81-98% inter-rater agreement). Estimating consumption, particularly for items such as beans and cheese that were often in mixed dishes, was more challenging (77-94% inter-rater agreement). Results suggest that the digital photography method presented is feasible for large studies in real-world environments and can provide an objective measure of food selection, servings, and consumption with a high degree of agreement between coders; however, to make accurate claims about the state of dietary intake in all-you-can-eat, self-serve settings, researchers will need to account for the possibility of diners taking multiple trips through the serving line. Copyright © 2017 Elsevier Ltd. All rights reserved.

  14. Inter-rater Reliability for Metrics Scored in a Binary Fashion-Performance Assessment for an Arthroscopic Bankart Repair.

    Science.gov (United States)

    Gallagher, Anthony G; Ryu, Richard K N; Pedowitz, Robert A; Henn, Patrick; Angelo, Richard L

    2018-05-02

    To determine the inter-rater reliability (IRR) of a procedure-specific checklist scored in a binary fashion for the evaluation of surgical skill and whether it meets a minimum level of agreement (≥0.8 between 2 raters) required for high-stakes assessment. In a prospective randomized and blinded fashion, and after detailed assessment training, 10 Arthroscopy Association of North America Master/Associate Master faculty arthroscopic surgeons (in 5 pairs) with an average of 21 years of surgical experience assessed the video-recorded 3-anchor arthroscopic Bankart repair performance of 44 postgraduate year 4 or 5 residents from 21 Accreditation Council for Graduate Medical Education orthopaedic residency training programs from across the United States. No paired scores of resident surgeon performance evaluated by the 5 teams of faculty assessors dropped below the 0.8 IRR level (mean = 0.93; range 0.84-0.99; standard deviation = 0.035). A comparison between the 5 assessor groups with 1-factor analysis of variance showed that there was no significant difference between the groups (P = .205). Pearson's product-moment correlation coefficient revealed a strong and statistically significant negative correlation, that is, -0.856 (P [...] fashion meet the need and can show a high (>80%) IRR. Copyright © 2018 Arthroscopy Association of North America. Published by Elsevier Inc. All rights reserved.

  15. Dosimetric analysis of radiation sources to use in dermatological lesions

    International Nuclear Information System (INIS)

    Tada, Ariane

    2010-01-01

    Skin lesions undergoing therapy with radiation sources may have different patterns of malignancy. Malignant lesions or cancer most commonly found in radiotherapy services are carcinomas. Radiation therapy in skin lesions is performed with low penetration beams and orthovoltage X-rays, electron beams and radioactive sources (192Ir, 198Au and 90Sr) arranged on a surface mold or in a metal applicator. This study aims to analyze the therapeutic radiation dose profile produced by radiation sources used in skin lesion radiotherapy procedures. Experimental measurements for the dosimetric analysis of radiation sources were compared with calculations obtained from a computer system based on the Monte Carlo Method. Computational results were in good agreement with the experimental measurements. Experimental measurements and computational results by the MCNP4C code have been used to validate the calculations obtained by the MCNP code and to provide a reliable medical application for each clinical case. (author)

  16. Transillumination and HDR Imaging for Proximal Caries Detection.

    Science.gov (United States)

    Lederer, A; Kunzelmann, K H; Hickel, R; Litzenburger, F

    2018-02-01

    The purpose was to develop an in vitro model for the validation of near-infrared transillumination (NIRT) for proximal caries detection, to enhance NIRT with high-dynamic-range imaging (HDRI), and to compare both methods, using micro-computed tomography (µCT) as a reference standard. Both proximal surfaces of 53 healthy or decayed permanent human teeth were examined using the Diagnocam (DC) (KaVo) and NIRT with HDRI (NIRT-HDRI). NIRT was combined with HDRI to improve the diagnostic performance by reducing under- and overexposed image areas. For NIRT-HDRI, an exposure series was captured and merged into a single HDR image. A classification was applied according to lesion depth. All surfaces were assessed twice by 2 trained examiners, and additionally with µCT for validation. The Kappa statistic was used to calculate inter-rater reliability and agreement between DC and NIRT-HDRI. Inter-rater reliability (weighted Kappa, wκ) showed very good agreement for the DC (0.90) and NIRT-HDRI (0.96). The overall agreement (wκ) was almost perfect (0.85). In the individual categories (0 to 4), the agreement (simple Kappa) ranged from almost perfect (category 4) to moderate (1 and 2) to substantial (categories 0 and 3). Sensitivity and specificity of sound surfaces, enamel, and dentin caries ranged from 0.57 to 0.99 and were similar for both methods in the different categories. NIRT-HDRI had a higher sensitivity for sound surfaces and enamel caries, as well as a higher specificity for dentin caries. Regarding the obtained images, HDRI allowed for the detection of caries within a greater range of luminance levels, resulting in a more detailed visualization of structures without under- or overexposure. However, HDRI did not improve the diagnostics significantly. Distinguishing between a processed demineralized enamel and dentin lesions appears to be a problem specific to NIRT and cannot be balanced using HDRI.
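
    The weighted kappa reported in this record gives partial credit when two examiners choose neighbouring ordinal categories. The sketch below assumes linear disagreement weights |i − j| / (k − 1); the abstract does not say whether linear or quadratic weights were used, and the scores are hypothetical.

```python
def weighted_kappa(ratings_a, ratings_b, n_categories):
    """Linearly weighted Cohen's kappa for ordinal categories 0..n_categories-1.

    Assumption: linear disagreement weights; quadratic weights are another
    common choice and would give a different value.
    """
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("both raters must rate the same non-empty set of items")
    n, k = len(ratings_a), n_categories
    # Observed joint proportions of (rater A category, rater B category).
    joint = [[0.0] * k for _ in range(k)]
    for a, b in zip(ratings_a, ratings_b):
        joint[a][b] += 1.0 / n
    row = [sum(joint[i]) for i in range(k)]
    col = [sum(joint[i][j] for i in range(k)) for j in range(k)]
    observed_disagreement = 0.0
    expected_disagreement = 0.0
    for i in range(k):
        for j in range(k):
            weight = abs(i - j) / (k - 1)
            observed_disagreement += weight * joint[i][j]
            expected_disagreement += weight * row[i] * col[j]
    return 1.0 - observed_disagreement / expected_disagreement


# Hypothetical lesion-depth scores (categories 0 to 4) from two examiners.
examiner_1 = [0, 0, 1, 1, 2, 2, 3, 3, 4, 4]
examiner_2 = [0, 0, 1, 2, 2, 2, 3, 4, 4, 4]
print(round(weighted_kappa(examiner_1, examiner_2, 5), 3))
```

    Both disagreements in this made-up data are between adjacent categories, so they are penalised only lightly and the weighted value stays high.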

  17. MRI lesion and epileptogenic focus in temporal lobe epilepsy

    International Nuclear Information System (INIS)

    Matsuda, Kazumi; Yagi, Kazuichi; Mihara, Tadahiro; Tottori, Takayasu; Watanabe, Yutaka; Seino, Masakazu

    1989-01-01

    The spatial relationship between a circumscribed lesion in the temporal lobe detected by MRI and an epileptogenic focus identified by ictal depth EEG, along with the correlation of the MRI lesion with neuropathological findings, was investigated in patients with medically intractable temporal lobe epilepsy but without any focal lesion on CT. Four parameters (the areal ratio of the temporal lobe to the hemisphere, and the area and calculated T1 and T2 values of the hippocampus) were used to determine the abnormal MRI side. In 67-72% of 18 patients, the side with abnormal values of hippocampal area and of calculated T1 and T2 agreed with the side of the epileptogenic focus. In 14 of 17 patients, typical hippocampal sclerosis was demonstrated in resected tissue in accordance with the MRI lesions (atrophy and/or prolonged T2 of the hippocampus). These results imply that 1) the MRI abnormality thus defined may, though not in all cases, indicate the side of the epileptogenic focus, and 2) it may also indicate the presence of hippocampal sclerosis. It was emphasized that the MRI lesion would be a useful instrument for exploring the causal relationship of hippocampal sclerosis to the generation of epileptogenic lesions as well as for presurgical evaluation. (author)

  18. Evaluation of interobserver agreement in Albertoni's classification for mallet finger

    Directory of Open Access Journals (Sweden)

    Vinícius Alexandre de Souza Almeida

    Objective: To measure the reliability of Albertoni's classification for mallet finger. Methods: Agreement study. Forty-three radiographs of patients with mallet finger were assessed by 19 responders (12 hand surgeons and seven residents). Injuries were classified by Albertoni's classification. For agreement comparison, lesions were grouped as (A) tendon avulsion; (B) avulsion fracture; (C) fracture of the dorsal lip; and (D) physis injury, and into subgroups (each group divided into two subgroups). Agreement was assessed by Fleiss's modification of the kappa statistic. Results: Agreement was excellent for Group A (k = 0.95 (0.93-0.97)) and remained good when separated into A1 and A2. Group B was moderate (k = 0.42 (0.39-0.44)) and poor when separated into B1 and B2. In Group C, agreement was good (k = 0.72 (0.70-0.74)), but when separated into C1 and C2, it became moderate. Group D was always poor (k = 0.16 (0.14-0.19)). The general agreement was moderate (k = 0.57 (0.56-0.58)). Conclusion: Albertoni's classification evaluated for interobserver agreement is considered a reproducible classification by the method used in the research.
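
    Fleiss's kappa, the statistic used in this record, extends chance-corrected agreement to more than two raters. The sketch below is a minimal illustration that assumes every radiograph is rated by the same number of raters; the count table is hypothetical and much smaller than the study's 43 radiographs and 19 responders.

```python
def fleiss_kappa(counts):
    """Fleiss's kappa from a subjects-by-categories table of rating counts.

    counts[i][j] is the number of raters who assigned subject i to category j.
    Assumption: every subject is rated by the same number of raters (>= 2).
    """
    n_subjects = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2 or any(sum(row) != n_raters for row in counts):
        raise ValueError("each subject needs the same number (>= 2) of ratings")
    n_categories = len(counts[0])
    # Overall share of all ratings that fell into each category.
    share = [
        sum(row[j] for row in counts) / (n_subjects * n_raters)
        for j in range(n_categories)
    ]
    # Per-subject agreement: proportion of rater pairs that agree.
    per_subject = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    mean_agreement = sum(per_subject) / n_subjects
    chance_agreement = sum(s * s for s in share)
    return (mean_agreement - chance_agreement) / (1.0 - chance_agreement)


# Hypothetical table: 5 radiographs, 4 raters, groups A to D as columns.
table = [
    [4, 0, 0, 0],
    [0, 4, 0, 0],
    [0, 3, 1, 0],
    [0, 1, 3, 0],
    [0, 0, 0, 4],
]
print(round(fleiss_kappa(table), 3))
```

    Per-group values such as those reported for Groups A to D would need each category to be treated as its own binary outcome, which this overall sketch does not attempt.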

  19. Stereotoracic skin biopsy in non-palpable woman mammary glands lesions

    International Nuclear Information System (INIS)

    Leborgne, Francisco; Mezzera, Julieta

    1999-01-01

    Seventy stereotactic breast biopsies of non-palpable lesions were performed in asymptomatic patients studied with mammography, physical examination and high-resolution ultrasonography. A 14-gauge cutting needle was used with excursion of 2 and 20 mm in high-likelihood probably benign lesions and low-likelihood probably malignant lesions that were not sonographically evident and not less than 4 mm in diameter. Long-term follow-up or surgical correlation is available for only 46 biopsies, with 35 negative results and 11 positive results for cancer. Follow-up mammography was recommended in 34 of 35 negative results, and one case showed mammographic lesion progression, a false negative result, giving a sensitivity of 90.9% and an NPV of 97%. Needle wire localization and open surgery were recommended in the 11 positive results, with total agreement between stereotactic core biopsy and open surgery, giving a specificity of 100% and a PPV of 100%. Stereotactic biopsy is indicated in non-palpable lesions as an alternative to surgical biopsy in order to decrease the number of surgical biopsies with negative results and consequently lower the costs of diagnosing asymptomatic malignant lesions, with an error of 2.1% and 2.8% minor complications

  20. Inter-rater variability of visual interpretation and comparison with quantitative evaluation of 11C-PiB PET amyloid images of the Japanese Alzheimer's Disease Neuroimaging Initiative (J-ADNI) multicenter study

    International Nuclear Information System (INIS)

    Yamane, Tomohiko; Ishii, Kenji; Sakata, Muneyuki; Ikari, Yasuhiko; Nishio, Tomoyuki; Ishii, Kazunari; Kato, Takashi; Ito, Kengo; Senda, Michio

    2017-01-01

    The aim of this study was to assess the inter-rater variability of the visual interpretation of 11C-PiB PET images regarding the positivity/negativity of amyloid deposition that were obtained in a multicenter clinical research project, Japanese Alzheimer's Disease Neuroimaging Initiative (J-ADNI). The results of visual interpretation were also compared with a semi-automatic quantitative analysis using mean cortical standardized uptake value ratio to the cerebellar cortex (mcSUVR). A total of 162 11C-PiB PET scans, including 45 mild Alzheimer's disease, 60 mild cognitive impairment, and 57 normal cognitive control cases that had been acquired as J-ADNI baseline scans were analyzed. Based on visual interpretation by three independent raters followed by consensus read, each case was classified into positive, equivocal, and negative deposition (ternary criteria) and further dichotomized by merging the former two (binary criteria). Complete agreement of visual interpretation by the three raters was observed for 91.3% of the cases (Cohen κ = 0.88 on average) in ternary criteria and for 92.3% (κ = 0.89) in binary criteria. Cases that were interpreted as visually positive in the consensus read showed significantly higher mcSUVR than those visually negative (2.21 ± 0.37 vs. 1.27 ± 0.09, p < 0.001), and positive or negative decision by visual interpretation was dichotomized by a cut-off value of mcSUVR = 1.5. Significant positive/negative associations were observed between mcSUVR and the number of raters who evaluated as positive (ρ = 0.87, p < 0.0001) and negative (ρ = -0.85, p < 0.0001) interpretation. Cases of disagreement among raters showed generally low mcSUVR. Inter-rater agreement was almost perfect in 11C-PiB PET scans. Positive or negative decision by visual interpretation was dichotomized by a cut-off value of mcSUVR = 1.5. As some cases of disagreement among raters tended to show low mcSUVR, referring to quantitative method may facilitate

  1. Diagnostic performance of direct traction MR arthrography of the hip: detection of chondral and labral lesions with arthroscopic comparison

    International Nuclear Information System (INIS)

    Schmaranzer, Florian; Klauser, Andrea; Henninger, Benjamin; Kogler, Michael; Schmaranzer, Ehrenfried; Forstner, Thomas; Reichkendler, Markus

    2015-01-01

    To assess diagnostic performance of traction MR arthrography of the hip in detection and grading of chondral and labral lesions with arthroscopic comparison. Seventy-five MR arthrograms obtained ± traction of 73 consecutive patients (mean age, 34.5 years; range, 14-54 years) who underwent arthroscopy were included. Traction technique included weight-adapted traction (15-23 kg), a supporting plate for the contralateral leg, and intra-articular injection of 18-27 ml (local anaesthetic and contrast agent). Patients reported on neuropraxia and on pain. Two blinded readers independently assessed femoroacetabular cartilage and labrum lesions which were correlated with arthroscopy. Interobserver agreement was calculated using κ values. Joint distraction ± traction was evaluated in consensus. No procedure had to be stopped. There were no cases of neuropraxia. Accuracy for detection of labral lesions was 92 %/93 %, 91 %/83 % for acetabular lesions, and 92 %/88 % for femoral cartilage lesions for reader 1/reader 2, respectively. Interobserver agreement was moderate (κ = 0.58) for grading of labrum lesions and substantial (κ = 0.7, κ = 0.68) for grading of acetabular and femoral cartilage lesions. Joint distraction was achieved in 72/75 and 14/75 hips with/without traction, respectively. Traction MR arthrography safely enabled accurate detection and grading of labral and chondral lesions. (orig.)

  2. Diagnostic performance of direct traction MR arthrography of the hip: detection of chondral and labral lesions with arthroscopic comparison

    Energy Technology Data Exchange (ETDEWEB)

    Schmaranzer, Florian; Klauser, Andrea; Henninger, Benjamin [Medical University Innsbruck, Department of Radiology, Innsbruck (Austria); Kogler, Michael; Schmaranzer, Ehrenfried [District Hospital St. Johann in Tyrol, Department of Radiology, St. Johann in Tyrol (Austria); Forstner, Thomas [Johannes Kepler University, Department for Applied Systems Research and Statistics, Linz (Austria); Reichkendler, Markus [District Hospital St. Johann in Tyrol, Department of Orthopedic Surgery, St. Johann in Tyrol (Austria)

    2015-06-01

    To assess diagnostic performance of traction MR arthrography of the hip in detection and grading of chondral and labral lesions with arthroscopic comparison. Seventy-five MR arthrograms obtained ± traction of 73 consecutive patients (mean age, 34.5 years; range, 14-54 years) who underwent arthroscopy were included. Traction technique included weight-adapted traction (15-23 kg), a supporting plate for the contralateral leg, and intra-articular injection of 18-27 ml (local anaesthetic and contrast agent). Patients reported on neuropraxia and on pain. Two blinded readers independently assessed femoroacetabular cartilage and labrum lesions which were correlated with arthroscopy. Interobserver agreement was calculated using κ values. Joint distraction ± traction was evaluated in consensus. No procedure had to be stopped. There were no cases of neuropraxia. Accuracy for detection of labral lesions was 92 %/93 %, 91 %/83 % for acetabular lesions, and 92 %/88 % for femoral cartilage lesions for reader 1/reader 2, respectively. Interobserver agreement was moderate (κ = 0.58) for grading of labrum lesions and substantial (κ = 0.7, κ = 0.68) for grading of acetabular and femoral cartilage lesions. Joint distraction was achieved in 72/75 and 14/75 hips with/without traction, respectively. Traction MR arthrography safely enabled accurate detection and grading of labral and chondral lesions. (orig.)

  3. Inter-observer agreement for the evaluation of bone involvement on Whole Body Low Dose Computed Tomography (WBLDCT) in Multiple Myeloma (MM)

    Energy Technology Data Exchange (ETDEWEB)

    Zacchino, M.; Minetti, V.; Dore, R.; Calliada, F. [University of Pavia, Fondazione IRCCS Policlinico San Matteo, Institute of Radiology, Pavia (Italy); Bonaffini, P.A.; Nasatti, A.; Sironi, S. [University of Milano Bicocca, San Gerardo Hospital, Department of Diagnostic Radiology, Monza (Italy); Corso, A. [University of Pavia, Fondazione IRCCS Policlinico San Matteo, Division of Hematology, Pavia (Italy); Tinelli, C. [University of Pavia, Fondazione IRCCS Policlinico San Matteo, Service of Biometry and Statistics, Pavia (Italy)

    2015-11-15

    We aimed to assess inter-observer agreement in bone involvement evaluation and define accuracy and reproducibility of MDCT images analysis in Multiple Myeloma (MM), by comparing two acquisition protocols at two different institutions. A total of 100 MM patients underwent whole body low-dose computed tomography (WB-LDCT), with two protocols: Group I (50 patients), 80 kV and 200-230 mAs; Group II, 120 kV-40 mAs. Four readers (two experts) retrospectively reviewed 22 anatomical districts, reporting the following for each patient: 1) osteolytic lesions; 2) cortical bone integrity; 3) fractures; 4) risk of vertebral collapse; 5) hyperattenuating bone lesions; and 6) extraosseous extension. Inter-observer agreement (by all readers, expert and young observers and comparison of the two protocols) was then statistically analyzed. According to Cohen's criteria, inter-observer agreement among the four readers and between experts and residents was good for the detection of bone lesions and extra-medullary extension, and for the evaluation of risk of collapse and cortical integrity. There was good agreement when comparing the two protocols. A greater variability was found for the evaluation of hyperattenuating lesions and the presence of fractures. WB-LDCT represents a reproducible and reliable technique that is helpful for defining bone disease in MM patients, with partial influence of readers' experience. (orig.)

  4. Inter-observer agreement for the evaluation of bone involvement on Whole Body Low Dose Computed Tomography (WBLDCT) in Multiple Myeloma (MM)

    International Nuclear Information System (INIS)

    Zacchino, M.; Minetti, V.; Dore, R.; Calliada, F.; Bonaffini, P.A.; Nasatti, A.; Sironi, S.; Corso, A.; Tinelli, C.

    2015-01-01

    We aimed to assess inter-observer agreement in bone involvement evaluation and define accuracy and reproducibility of MDCT images analysis in Multiple Myeloma (MM), by comparing two acquisition protocols at two different institutions. A total of 100 MM patients underwent whole body low-dose computed tomography (WB-LDCT), with two protocols: Group I (50 patients), 80 kV and 200-230 mAs; Group II, 120 kV-40 mAs. Four readers (two experts) retrospectively reviewed 22 anatomical districts, reporting the following for each patient: 1) osteolytic lesions; 2) cortical bone integrity; 3) fractures; 4) risk of vertebral collapse; 5) hyperattenuating bone lesions; and 6) extraosseous extension. Inter-observer agreement (by all readers, expert and young observers and comparison of the two protocols) was then statistically analyzed. According to Cohen's criteria, inter-observer agreement among the four readers and between experts and residents was good for the detection of bone lesions and extra-medullary extension, and for the evaluation of risk of collapse and cortical integrity. There was good agreement when comparing the two protocols. A greater variability was found for the evaluation of hyperattenuating lesions and the presence of fractures. WB-LDCT represents a reproducible and reliable technique that is helpful for defining bone disease in MM patients, with partial influence of readers' experience. (orig.)

  5. Chronic anterior cruciate ligament tears and associated meniscal and traumatic cartilage lesions: evaluation with morphological sequences at 3.0 T

    Energy Technology Data Exchange (ETDEWEB)

    Vlychou, Marianna; Fezoulidis, Ioannis V. [University Hospital of Larissa, Department of Radiology, Medical School of Thessaly, Larissa (Greece); Hantes, Michalis; Michalitsis, Sotirios; Malizos, Konstantinos [University Hospital of Larissa, Department of Orthopaedic Surgery, Medical School of Thessaly, Larissa (Greece); Tsezou, Aspasia [University Hospital of Larissa, Department of Molecular Genetics and Cytogenetics, Medical School of Thessaly, Larissa (Greece)

    2011-06-15

    To investigate the diagnostic efficacy of morphological sequences at 3.0 T MR imaging in detecting anterior cruciate ligament (ACL), meniscal pathology and traumatic cartilage lesions in young patients with chronic deficient anterior cruciate ligament knees. This prospective study included 43 patients (39 male) between the age of 15 and 37 years (mean age 22.6 years) with a history of knee injury sustained at least 3 months prior to the decision to repair a torn ACL. All patients underwent a 3.0 T MR scan with the same standard protocol, including intermediate-weighted and three-dimensional spoiled gradient-recalled T1-weighted sequences with fat saturation and subsequently surgical reconstruction of the ACL, along with meniscal and cartilage repair, when necessary. All ACL tears were correctly interpreted by 3.0 T MR images. The sensitivity of the MR scans regarding tears of the medial meniscus was 93.7%, the specificity 92.6%, the positive predictive value 88.2% and the negative predictive value 95.8%. The sensitivity of the MR scans regarding tears of lateral meniscus was 85.7%, the specificity was 93.1%, the positive predictive value 85.7% and the negative predictive value 93.1%. With regard to the grading of the cartilage lesions, Cohen's kappa coefficient indicated moderate agreement for grade I and II cartilage lesions (0.5), substantial agreement for grade III and IV cartilage lesions (0.70 and 0.66) and substantial agreement for normal regions (0.75). Regarding location of the cartilage lesions, Cohen's kappa coefficient varied between almost perfect agreement in the lateral femoral condyle and no agreement in the trochlea. In the setting of chronic ACL deficiency, MR imaging at 3.0 T achieves satisfactory diagnostic performance regarding meniscal and ligamentous pathology. In the detection of cartilage lesions MRI is less successful. (orig.)
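
    The sensitivity, specificity and predictive values quoted in this abstract all derive from a 2x2 table of imaging findings against the surgical reference. The abstract does not report the underlying counts, so the sketch below uses hypothetical counts (the `ConfusionCounts` values are assumptions chosen for illustration, not data from the study) to show how the four measures relate.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Counts of a diagnostic test against a reference standard."""
    tp: int  # test positive, reference positive
    fp: int  # test positive, reference negative
    fn: int  # test negative, reference positive
    tn: int  # test negative, reference negative


def diagnostic_metrics(c: ConfusionCounts) -> dict:
    """Return sensitivity, specificity, PPV and NPV as fractions."""
    return {
        "sensitivity": c.tp / (c.tp + c.fn),
        "specificity": c.tn / (c.tn + c.fp),
        "ppv": c.tp / (c.tp + c.fp),
        "npv": c.tn / (c.tn + c.fn),
    }


# Hypothetical counts, NOT taken from the abstract above.
counts = ConfusionCounts(tp=18, fp=2, fn=2, tn=28)
metrics = diagnostic_metrics(counts)

for name, value in metrics.items():
    print(f"{name}: {value:.1%}")
```

    Sensitivity and specificity are properties of the test, whereas the predictive values also depend on how common the condition is in the sample, which is one reason such figures are reported together.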

  6. Qualitative soil moisture assessment in semi-arid Africa - the role of experience and training on inter-rater reliability

    Science.gov (United States)

    Rinderer, M.; Komakech, H. C.; Müller, D.; Wiesenberg, G. L. B.; Seibert, J.

    2015-08-01

    Soil and water management is particularly relevant in semi-arid regions to enhance agricultural productivity. During periods of water scarcity, soil moisture differences are important indicators of the soil water deficit and are traditionally used for allocating water resources among farmers of a village community. Here we present a simple, inexpensive soil wetness classification scheme based on qualitative indicators which one can see or touch on the soil surface. It incorporates the local farmers' knowledge on the best soil moisture conditions for seeding and brick making in the semi-arid environment of the study site near Arusha, Tanzania. The scheme was tested twice in 2014 with farmers, students and experts (April: 40 persons, June: 25 persons) for inter-rater reliability, bias of individuals and functional relation between qualitative and quantitative soil moisture values. During the test in April farmers assigned the same wetness class in 46 % of all cases, while students and experts agreed on about 60 % of all cases. Students who had been trained in how to apply the method gained higher inter-rater reliability than their colleagues with only a basic introduction. When repeating the test in June, participants were given improved instructions, organized in small subgroups, which resulted in a higher inter-rater reliability among farmers. In 66 % of all classifications, farmers assigned the same wetness class and the spread of class assignments was smaller. This study demonstrates that a wetness classification scheme based on qualitative indicators is a robust tool and can be applied successfully regardless of experience in crop growing and education level when an in-depth introduction and training is provided. The use of a simple and clear layout of the assessment form is important for reliable wetness class assignments.

  7. Qualitative soil moisture assessment in semi-arid Africa: the role of experience and training on inter-rater reliability

    Science.gov (United States)

    Rinderer, M.; Komakech, H.; Müller, D.; Seibert, J.

    2015-03-01

    Soil and water management is particularly relevant in semi-arid regions to enhance agricultural productivity. During periods of water scarcity soil moisture differences are important indicators of the soil water deficit and are traditionally used for allocating water resources among farmers of a village community. Here we present a simple, inexpensive soil wetness classification scheme based on qualitative indicators which one can see or touch on the soil surface. It incorporates the local farmers' knowledge on the best soil moisture conditions for seeding and brick making in the semi-arid environment of the study site near Arusha, Tanzania. The scheme was tested twice in 2014 with farmers, students and experts (April: 40 persons, June: 25 persons) for inter-rater reliability, bias of individuals and functional relation between qualitative and quantitative soil moisture values. During the test in April farmers assigned the same wetness class in 46% of all cases while students and experts agreed in about 60% of all cases. Students who had been trained in how to apply the method gained higher inter-rater reliability than their colleagues with only a basic introduction. When repeating the test in June, participants were given improved instructions, organized in small sub-groups, which resulted in a higher inter-rater reliability among farmers. In 66% of all classifications farmers assigned the same wetness class and the spread of class assignments was smaller. This study demonstrates that a wetness classification scheme based on qualitative indicators is a robust tool and can be applied successfully regardless of experience in crop growing and education level when an in-depth introduction and training is provided. The use of a simple and clear layout of the assessment form is important for reliable wetness class assignments.

  8. Comparison of Digital Tomosynthesis and Chest Radiography for the Detection of Noncalcified Pulmonary and Hilar Lesions.

    Science.gov (United States)

    Galea, Angela; Adlan, Tarig; Gay, David; Roobottom, Carl; Dubbins, Paul; Riordan, Richard

    2015-09-01

    The aim of this study was to compare the sensitivity and specificity of chest digital tomosynthesis (DTS) with chest radiography (CXR) for the detection of noncalcified pulmonary nodules and hilar lesions using computed tomography (CT) as the reference standard. A total of 78 patients with suspected noncalcified pulmonary lesions on CXR were included in the study. Two radiologists, blinded to the history and CT, analyzed the CXR and the DTS images (separately), whereas a third radiologist analyzed the CXR and DTS images together. Noncalcified intrapulmonary nodules and hilar lesions were recorded for analysis. The interobserver agreement for CXR and DTS was assessed, and the time taken to report the images was recorded. A total of 202 lesions were recorded in 78 patients. There were 111 true lesions confirmed on CT in 53 patients; in 25 patients subsequent CT excluded a lesion. The overall sensitivity was 32% for CXR and 49% for DTS. This improved to 54% when the posteroanterior CXR and DTS were reviewed together (CXR-DTS). The overall specificities for CXR, DTS, and CXR-DTS were 49%, 96%, and 98%, respectively. There were 56 suspected hilar lesions with subgroup sensitivities of 76% for CXR, 65% for DTS, and 76% for CXR-DTS. The specificity for hilar lesions was 59%, 92%, and 97% for CXR, DTS, and CXR-DTS, respectively. DTS significantly improves the detectability of noncalcified nodules when compared with and when used in combination with CXR. The specificity and interobserver agreement of DTS in the diagnosis of suspected noncalcified pulmonary nodules and hilar lesions are significantly better than those of CXR and approaches those of CT.

  9. Interrater reliability of quantitative ultrasound using force feedback among examiners with varied levels of experience

    Directory of Open Access Journals (Sweden)

    Michael O. Harris-Love

    2016-06-01

    Background. Quantitative ultrasound measures are influenced by multiple external factors including examiner scanning force. Force feedback may foster the acquisition of reliable morphometry measures under a variety of scanning conditions. The purpose of this study was to determine the reliability of force-feedback image acquisition and morphometry over a range of examiner-generated forces using a muscle tissue-mimicking ultrasound phantom. Methods. Sixty material thickness measures were acquired from a muscle tissue-mimicking phantom using B-mode ultrasound scanning by six examiners with varied experience levels (i.e., experienced, intermediate, and novice). Estimates of interrater reliability and measurement error with force feedback scanning were determined for the examiners. In addition, criterion-based reliability was determined using material deformation values across a range of examiner scanning forces (1–10 Newtons) via automated and manually acquired image capture methods using force feedback. Results. All examiners demonstrated acceptable interrater reliability (intraclass correlation coefficient, ICC = .98, p .90, p < .001), independent of their level of experience. The measurement error among all examiners was 1.5%–2.9% across all applied stress conditions. Conclusion. Manual image capture with force feedback may aid the reliability of morphometry measures across a range of examiner scanning forces, and allow for consistent performance among examiners with differing levels of experience.

  10. Inter-rater reliability of kinesthetic measurements with the KINARM robotic exoskeleton.

    Science.gov (United States)

    Semrau, Jennifer A; Herter, Troy M; Scott, Stephen H; Dukelow, Sean P

    2017-05-22

    Kinesthesia (sense of limb movement) has been extremely difficult to measure objectively, especially in individuals who have survived a stroke. The development of valid and reliable measurements for proprioception is important to developing a better understanding of proprioceptive impairments after stroke and their impact on the ability to perform daily activities. We recently developed a robotic task to evaluate kinesthetic deficits after stroke and found that the majority (~60%) of stroke survivors exhibit significant deficits in kinesthesia within the first 10 days post-stroke. Here we aim to determine the inter-rater reliability of this robotic kinesthetic matching task. Twenty-five neurologically intact control subjects and 15 individuals with first-time stroke were evaluated on a robotic kinesthetic matching task (KIN). Subjects sat in a robotic exoskeleton with their arms supported against gravity. In the KIN task, the robot moved the subjects' stroke-affected arm at a preset speed, direction and distance. As soon as subjects felt the robot begin to move their affected arm, they matched the robot movement with the unaffected arm. Subjects were tested in two sessions on the KIN task: initial session and then a second session (within an average of 18.2 ± 13.8 h of the initial session for stroke subjects), which were supervised by different technicians. The task was performed both with and without the use of vision in both sessions. We evaluated intra-class correlations of spatial and temporal parameters derived from the KIN task to determine the reliability of the robotic task. We evaluated 8 spatial and temporal parameters that quantify kinesthetic behavior. We found that the parameters exhibited moderate to high intra-class correlations between the initial and retest conditions (Range, r-value = [0.53-0.97]). The robotic KIN task exhibited good inter-rater reliability. This validates the KIN task as a reliable, objective method for quantifying

  11. Inter-Rater and Test-Retest (Between-Sessions) Reliability of the 4-Skills Scan for Dutch Elementary School Children

    Science.gov (United States)

    van Kernebeek, Willem G.; de Schipper, Antoine W.; Savelsbergh, Geert J. P.; Toussaint, Huub M.

    2018-01-01

    In The Netherlands, the 4-Skills Scan is an instrument for physical education teachers to assess gross motor skills of elementary school children. Little is known about its reliability. Therefore, in this study the test-retest and inter-rater reliability was determined. Respectively, 624 and 557 Dutch 6- to 12-year-old children were analyzed for…

  12. Interrater and Test-Retest Reliability and Minimal Detectable Change of the Balance Evaluation Systems Test (BESTest) and Subsystems With Community-Dwelling Older Adults.

    Science.gov (United States)

    Wang-Hsu, Elizabeth; Smith, Susan S

    2017-01-10

    Falls are a common cause of injuries and hospital admissions in older adults. Balance limitation is a potentially modifiable factor contributing to falls. The Balance Evaluation Systems Test (BESTest), a clinical balance measure, categorizes balance into 6 underlying subsystems. Each of the subsystems is scored individually and summed to obtain a total score. The reliability of the BESTest and its individual subsystems has been reported in patients with various neurological disorders and cancer survivors. However, the reliability and minimal detectable change (MDC) of the BESTest with community-dwelling older adults have not been reported. The purposes of our study were to (1) determine the interrater and test-retest reliability of the BESTest total and subsystem scores; and (2) estimate the MDC of the BESTest and its individual subsystem scores with community-dwelling older adults. We used a prospective cohort methodological design. Community-dwelling older adults (N = 70; aged 70-94 years; mean = 85.0 [5.5] years) were recruited from a senior independent living community. Trained testers (N = 3) administered the BESTest. All participants were tested with the BESTest by the same tester initially and then retested 7 to 14 days later. With 32 of the participants, a second tester concurrently scored the retest for interrater reliability. Testers were blinded to each other's scores. Intraclass correlation coefficients [ICC(2,1)] were used to determine the interrater and test-retest reliability. Test-retest reliability was also analyzed using method error and the associated coefficients of variation (CVME). MDC was calculated using standard error of measurement. Interrater reliability (N = 32) of the BESTest total score was ICC(2, 1) = 0.97 (95% confidence interval [CI], 0.94-0.99). The ICCs for the individual subsystem scores ranged from 0.85 to 0.94. Test-retest reliability (N = 70) of the BESTest total score was ICC(2,1) = 0.93 (95% CI, 0.89-0.96). ICCs for the
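
    The abstract states that the MDC was calculated from the standard error of measurement (SEM) but does not give the formula or confidence level. A commonly used formulation is SEM = SD * sqrt(1 - ICC) and MDC95 = 1.96 * sqrt(2) * SEM; the sketch below assumes that formulation. The ICC of 0.93 is the test-retest value reported above, while the standard deviation of 10 points is a hypothetical value chosen only for illustration.

```python
import math


def standard_error_of_measurement(sd: float, icc: float) -> float:
    """SEM = SD * sqrt(1 - ICC), a commonly used formulation."""
    return sd * math.sqrt(1.0 - icc)


def minimal_detectable_change(sem: float, z: float = 1.96) -> float:
    """MDC at the confidence level implied by z (1.96 for 95%).

    The sqrt(2) factor accounts for measurement error in both the
    initial test and the retest.
    """
    return z * math.sqrt(2.0) * sem


icc_test_retest = 0.93   # reported in the abstract above
sd_total_score = 10.0    # hypothetical SD, NOT from the abstract

sem = standard_error_of_measurement(sd_total_score, icc_test_retest)
mdc95 = minimal_detectable_change(sem)

print(f"SEM = {sem:.2f} points, MDC95 = {mdc95:.2f} points")
```

    Under these assumptions, a change in score smaller than the MDC95 cannot be distinguished from measurement error at the 95% level.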

  13. Comparison of Narrowband Imaging with Autofluorescence Imaging for Endoscopic Visualization of Superficial Squamous Cell Carcinoma Lesions of the Esophagus

    Directory of Open Access Journals (Sweden)

    Haruhisa Suzuki

    2012-01-01

    Aim. To compare narrowband imaging (NBI) and autofluorescence imaging (AFI) endoscopic visualization for identifying superficial esophageal squamous cell carcinoma (SCC). Methods. Twenty-four patients with superficial esophageal carcinomas diagnosed at previous hospitals were enrolled in this study. Lesions were initially detected using white-light endoscopy and then observed with both NBI and AFI. Endoscopic images documented each method, and three endoscopists experienced in esophageal imaging retrospectively reviewed respective images of histologically confirmed esophageal SCCs. Images were assessed for quality in identifying superficial SCCs and rated as excellent, fair, or poor by the three reviewers with interobserver agreement calculated using kappa (κ) statistics. Results. Thirty-one lesions histologically confirmed as superficial esophageal SCCs were detected in 24 patients. NBI images of 27 lesions (87%) were rated as excellent, three as fair, and one as poor compared to AFI images of 19 lesions (61%) rated as excellent, 10 as fair and two as poor (P<0.05). Moderate interobserver agreement (κ=0.42, 95% CI 0.24–0.60) resulted in NBI while fair agreement (κ=0.35, 95% CI 0.18–0.51) was achieved using AFI. Conclusion. NBI may be more effective than AFI for visualization of esophageal SCC.

  14. Stereotaxic percutaneous core biopsy versus surgical biopsy of nonpalpable breast lesions using a standard mammographic table with an add-on device

    International Nuclear Information System (INIS)

    Levin, M.F.; Papoff, W.J.; Doan, L.; Eliasziw, M.

    2001-01-01

    To determine the accuracy of using a regular mammographic table with an add-on device for biopsy of nonpalpable breast lesions in women in a community hospital setting. During a 3-year period, 70 consenting women (39-80 years of age) with a nonpalpable mammographically suspicious lesion on routine screening mammography underwent 14-gauge automated percutaneous core biopsies, immediate needle localization and lumpectomy. The needle and surgical biopsy results were independently classified into 1 of 5 categories: cancer, fibroadenoma, fibrocystic change, normal or other. The procedure was well tolerated, and all core specimens yielded adequate tissue for pathologic evaluation. There were 3 episodes of vasovagal reaction. There was complete agreement in histologic findings in 64 cases (91%), including 22 of 24 cancers (92%). The overall agreement for categorizing lesions was 91% (k = 0.88), and there was 97% agreement (k = 0.94) for the classification of cancer versus benign lesion. The results are similar to those of studies performed with dedicated prone equipment. Stereotaxic core biopsies can be done safely and accurately in a community hospital setting with relatively inexpensive nondedicated mammographic equipment. (author)

  15. Stereotaxic percutaneous core biopsy versus surgical biopsy of nonpalpable breast lesions using a standard mammographic table with an add-on device

    Energy Technology Data Exchange (ETDEWEB)

    Levin, M.F.; Papoff, W.J.; Doan, L.; Eliasziw, M. [St. Thomas-Elgin General Hospital, St. Thomas, Ontario (Canada)

    2001-02-01

    To determine the accuracy of using a regular mammographic table with an add-on device for biopsy of nonpalpable breast lesions in women in a community hospital setting. During a 3-year period, 70 consenting women (39-80 years of age) with a nonpalpable mammographically suspicious lesion on routine screening mammography underwent 14-gauge automated percutaneous core biopsies, immediate needle localization and lumpectomy. The needle and surgical biopsy results were independently classified into 1 of 5 categories: cancer, fibroadenoma, fibrocystic change, normal or other. The procedure was well tolerated, and all core specimens yielded adequate tissue for pathologic evaluation. There were 3 episodes of vasovagal reaction. There was complete agreement in histologic findings in 64 cases (91%), including 22 of 24 cancers (92%). The overall agreement for categorizing lesions was 91% (k = 0.88), and there was 97% agreement (k = 0.94) for the classification of cancer versus benign lesion. The results are similar to those of studies performed with dedicated prone equipment. Stereotaxic core biopsies can be done safely and accurately in a community hospital setting with relatively inexpensive nondedicated mammographic equipment. (author)

  16. Changing pattern of oral cavity lesions and personal habits over a decade: Hospital based record analysis from Allahabad

    Directory of Open Access Journals (Sweden)

    Misra Vatsala

    2009-01-01

    Aim: To do a prospective clinicohistological study of premalignant and malignant lesions of the oral cavity, and compare it with 10-year retrospective data, especially in terms of incidence, age distribution, personal habits, and site and type of lesion. Material and Methods: Sections from 776 lesions of the oral cavity, which included 647 lesions of a 10-year (1993 - 2003) retrospective study and 129 lesions of a one-year (2003 - 2004) prospective study, were observed clinically, and a histological correlation was carried out. Results: Premalignant lesions included 78 cases of leukoplakia, 68 cases of oral submucous fibrosis, and 76 cases of squamous papilloma. Their incidence has increased in the last decade from 0.15 to 0.53. These lesions commonly presented in the fourth decade of life, as white patches in leukoplakia and oral submucous fibrosis, and as a growth in squamous cell papilloma. Squamous cell carcinoma was the commonest lesion (57%). Its incidence has increased significantly in the last decade. The mean age of presentation was the sixth decade. A personal history of tobacco chewing was given by most of the patients in the retrospective group, while the use of pan masala was found to be maximum in the prospective group. The overall agreement between the clinical and histological diagnosis was 95.36% (740 / 776) and the kappa coefficient of agreement was 0.9256. Conclusion: Histology along with a detailed clinical workup was found to be a useful, reliable, and accurate diagnostic technique for lesions of the oral cavity. An increase in premalignant lesions in the prospective study, associated with increased pan masala intake is alarming and needs to be taken care of.

  17. Lesion progression in post-treatment persistent endodontic lesions.

    Science.gov (United States)

    Yu, Victoria Soo Hoon; Messer, Harold Henry; Shen, Liang; Yee, Robert; Hsu, Chin-ying Stephen

    2012-10-01

    Radiographic lesions related to root-filled teeth may persist for long periods after treatment and are considered to indicate failure of initial treatment. Persistent lesions are found in a proportion of cases, but information on lesion progression is lacking. This study examined the incidence of lesion improvement, remaining unchanged, and deterioration among persistent lesions in a group of patients recruited from a university-based clinic and identified potential predictors for lesion progression. Patients of a university clinic with persistent endodontic lesions at least 4 years since treatment and with original treatment radiographs available were recruited with informed consent. Data were obtained by interview and from dental records and clinical and radiographic examinations. Univariate and multivariate statistical analyses were carried out by using SPSS (version 19). One hundred fifty-one persistent lesions were identified in 114 patients. A majority of the lesions (107, 70.9%) received treatment between 4 and 5 years prior. Eighty-six lesions (57.0%) improved, 18 (11.9%) remained unchanged, and 47 (31.1%) deteriorated since treatment. Potential predictors for lesions that did not improve included recall lesion size, pain on biting at recall examination, history of a postobturation flare-up, and a non-ideal root-filling length (P < .05). Lesions that had persisted for a longer period appeared less likely to be improving (relative risk, 1.038; 95% confidence interval, 1.000-1.077). A specific time interval alone should not be used to conclude that a lesion will not resolve without intervention. This study identified several clinical factors that are associated with deteriorating persistent lesions, which should aid in identifying lesions that require further intervention. Copyright © 2012 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.

  18. Radiation planning in small complex lesions and experimental verification

    International Nuclear Information System (INIS)

    Jess-Hempen, A.; Wowra, B.; Mack, A.; Kreiner, H.J.; Heck, B.

    2003-01-01

    The Gamma Knife is used as a stereotactic tool for the conformal treatment of very small, complex-shaped cranial lesions. The combination of planning software and treatment equipment enables a highly precise conformal dose distribution and positioning. The purpose of the present study was to experimentally verify the precision actually achievable in case of extremely irregular, small target volumes. For this purpose, a complete treatment procedure was performed using a standard head phantom complemented with a specially developed insert that simulates an L-shaped lesion. The spatial precision of the irradiation was recorded by means of high-resolution film dosimetry using GafChromic™ films. The analysis of the films showed for the film in the center plane an excellent conformity of the 75% isodose line used to circumscribe the lesion. A very good agreement between planning and measurement resulted also for isodose lines residing outside of the target volume. (orig.) [de

  19. Cartilage lesions in the ankle joint: comparison of MR arthrography and CT arthrography

    International Nuclear Information System (INIS)

    Schmid, M.R.; Pfirrmann, C.W.A.; Hodler, J.; Zanetti, M.; Vienne, P.

    2003-01-01

    To compare MR arthrography and CT arthrography for the evaluation of cartilage lesions in the ankle joint. Design and patients: Thirty-six consecutive patients with clinically suspected cartilage lesions were prospectively included in the study. A 1:1 mixture of diluted gadoteridol (4 mmol/l) and iopamidol (300 mg iodine/ml) was injected. The articular cartilages of the talus, tibia, and fibula were analyzed separately by two musculoskeletal radiologists. A review panel consisting of two musculoskeletal radiologists and an orthopedic surgeon represented the standard of reference. For reader 1 accuracy of MR arthrography in the talus/tibia/fibula (88%/88%/94%) was slightly inferior to CT arthrography (90%/94%/92%). For reader 2, the accuracy was 76%/78%/83% for MR arthrography, and 92%/93%/92% for CT arthrography, respectively. Interobserver agreement for MR arthrography was 79%/74%/89% (kappa 0.47/0.34/0.27), while interobserver agreement for CT arthrography was 89%/90%/89% (kappa 0.69/0.54/0.54). CT arthrography appears to be more reliable than MR arthrography for the detection of cartilage lesions in the ankle joint. (orig.)

  20. Emergency Physicians Are Able to Detect Right Ventricular Dilation With Good Agreement Compared to Cardiology.

    Science.gov (United States)

    Rutz, Matt A; Clary, Julie M; Kline, Jeffrey A; Russell, Frances M

    2017-07-01

    Focused cardiac ultrasound (FOCUS) is a useful tool in evaluating patients presenting to the emergency department (ED) with acute dyspnea. Prior work has shown that right ventricular (RV) dilation is associated with repeat hospitalizations and shorter life expectancy. Traditionally, the RV has been assessed by cardiologist-interpreted comprehensive echocardiography. The primary goal of this study was to determine the inter-rater reliability between emergency physicians (EPs) and a cardiologist for determining RV dilation on FOCUS performed on ED patients with acute dyspnea. This was a prospective, observational study at two urban academic EDs; patients were enrolled if they had acute dyspnea and a computed tomographic pulmonary angiogram without acute disease. All patients had an EP-performed FOCUS to assess for RV dilation. RV dilation was defined as an RV to left ventricular ratio greater than 1. FOCUS interpretations were compared to a blinded cardiologist FOCUS interpretation using agreement and kappa statistics. Of 84 FOCUS examinations performed on 83 patients, 17% had RV dilation. Agreement and kappa for EP-performed FOCUS for RV dilation were 89% (95% confidence interval [CI] 80-95%) and 0.68 (95% CI 0.48-0.88), respectively. Emergency physician sonographers are able to detect RV dilation with good agreement when compared to cardiology. These results support the wider use of EP-performed FOCUS to evaluate for RV dilation in ED patients with dyspnea. © 2017 by the Society for Academic Emergency Medicine.

  1. An Investigation Into The Interrater Reliability Of The Modified Ashworth Scale In The Assessment Of Muscle Spasticity In Hemiplegic Patients

    Directory of Open Access Journals (Sweden)

    N. Nokhostin-Ansari

    2006-06-01

    Background and Aim: Spasticity is a velocity-dependent increase in tonic stretch reflexes (muscle tone) with exaggerated tendon jerks, resulting from hyperexcitability of the stretch reflex. The measurement of spasticity is necessary to determine the effect of treatments. The Modified Ashworth Scale is the most widely used method for assessing muscle spasticity in clinical practice and research. The purpose of this study was to investigate the interrater reliability of the Modified Ashworth Scale in hemiplegic patients. Materials and Methods: Thirty subjects (16 males, 14 females) with a mean age of 59.40 (SD = 14.013) were recruited. The shoulder adductor, elbow flexor, wrist dorsiflexor, hip adductor, knee extensor and ankle plantarflexor on the hemiplegic side were tested by two physiotherapists. Results: In the upper limb, the interrater reliability for the shoulder adductor and elbow flexor muscles was fair (0.372 and 0.369, respectively). The reliability for the wrist flexors was good (0.612). The difference in kappa value between the proximal muscle (shoulder adductor; 0.372) and the distal muscle (wrist flexor; 0.612) was significant (χ² = 33.87, df = 1, p < 0.05). The mean values for the upper limb (0.505) and the lower limb (0.516) were not significantly different (χ² = 0.1407, df = 1, p > 0.05). Conclusion: The interrater reliability of the Modified Ashworth Scale was not good. The limb, upper or lower, had no significant effect on the reliability. In the upper limb, the reliability for the proximal and distal muscles was significantly different. However, the difference in the lower limb was not significant. When using the scale, one should consider its limitations.

  2. [Symptoms and lesion localization in visual agnosia].

    Science.gov (United States)

    Suzuki, Kyoko

    2004-11-01

    There are two cortical visual processing streams, the ventral and dorsal stream. The ventral visual stream plays the major role in constructing our perceptual representation of the visual world and the objects within it. Disturbance of visual processing at any stage of the ventral stream could result in impairment of visual recognition. Thus we need systematic investigations to diagnose visual agnosia and its type. Two types of category-selective visual agnosia, prosopagnosia and landmark agnosia, are different from others in that patients could recognize a face as a face and buildings as buildings, but could not identify an individual person or building. Neuronal bases of prosopagnosia and landmark agnosia are distinct. Importance of the right fusiform gyrus for face recognition was confirmed by both clinical and neuroimaging studies. Landmark agnosia is related to lesions in the right parahippocampal gyrus. Enlarged lesions including both the right fusiform and parahippocampal gyri can result in prosopagnosia and landmark agnosia at the same time. Category non-selective visual agnosia is related to bilateral occipito-temporal lesions, which is in agreement with the results of neuroimaging studies that revealed activation of the bilateral occipito-temporal during object recognition tasks.

  3. The Achievement of Therapeutic Objectives Scale: Interrater Reliability and Sensitivity to Change in Short-Term Dynamic Psychotherapy and Cognitive Therapy

    Science.gov (United States)

    Valen, Jakob; Ryum, Truls; Svartberg, Martin; Stiles, Tore C.; McCullough, Leigh

    2011-01-01

    This study examined interrater reliability and sensitivity to change of the Achievement of Therapeutic Objectives Scale (ATOS; McCullough, Larsen, et al., 2003) in short-term dynamic psychotherapy (STDP) and cognitive therapy (CT). The ATOS is a process scale originally developed to assess patients' achievements of treatment objectives in STDP,…

  4. Radiologists' Performance for Detecting Lesions and the Interobserver Variability of Automated Whole Breast Ultrasound

    International Nuclear Information System (INIS)

    Kim, Sung Hun; Kang, Bong Joo; Choi, Byung Gil; Choi, Jae Jung; Lee, Ji Hye; Song, Byung Joo; Choe, Byung Joo; Park, Sarah; Kim, Hyunbin

    2013-01-01

    To compare the detection performance of the automated whole breast ultrasound (AWUS) with that of the hand-held breast ultrasound (HHUS) and to evaluate the interobserver variability in the interpretation of the AWUS. AWUS was performed in 38 breast cancer patients. A total of 66 lesions were included: 38 breast cancers, 12 additional malignancies and 16 benign lesions. Three breast radiologists independently reviewed the AWUS data and analyzed the breast lesions according to the BI-RADS classification. The detection rate of malignancies was 98.0% for HHUS and 90.0%, 88.0% and 96.0% for the three readers of the AWUS. The sensitivity and the specificity were 98.0% and 62.5% in HHUS, 90.0% and 87.5% for reader 1, 88.0% and 81.3% for reader 2, and 96.0% and 93.8% for reader 3, in AWUS. There was no significant difference in the radiologists' detection performance, sensitivity and specificity (p > 0.05) between the two modalities. The interobserver agreement was fair to good for the ultrasonographic features, categorization, size, and the location of breast masses. AWUS is thought to be useful for detecting breast lesions. In comparison with HHUS, AWUS shows no significant difference in the detection rate, sensitivity and the specificity, with high degrees of interobserver agreement

  5. Radiologists' Performance for Detecting Lesions and the Interobserver Variability of Automated Whole Breast Ultrasound

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Sung Hun; Kang, Bong Joo; Choi, Byung Gil; Choi, Jae Jung; Lee, Ji Hye [Department of Radiology, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul 137-701 (Korea, Republic of); Song, Byung Joo; Choe, Byung Joo [Department of General Surgery, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul 137-701 (Korea, Republic of); Park, Sarah [Department of Internal Medicine, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul 137-701 (Korea, Republic of); Kim, Hyunbin [CMC Clinical Research Coordinating Center, College of Medicine, The Catholic University of Korea, Seoul 137-701 (Korea, Republic of)

    2013-07-01

    To compare the detection performance of the automated whole breast ultrasound (AWUS) with that of the hand-held breast ultrasound (HHUS) and to evaluate the interobserver variability in the interpretation of the AWUS. AWUS was performed in 38 breast cancer patients. A total of 66 lesions were included: 38 breast cancers, 12 additional malignancies and 16 benign lesions. Three breast radiologists independently reviewed the AWUS data and analyzed the breast lesions according to the BI-RADS classification. The detection rate of malignancies was 98.0% for HHUS and 90.0%, 88.0% and 96.0% for the three readers of the AWUS. The sensitivity and the specificity were 98.0% and 62.5% in HHUS, 90.0% and 87.5% for reader 1, 88.0% and 81.3% for reader 2, and 96.0% and 93.8% for reader 3, in AWUS. There was no significant difference in the radiologists' detection performance, sensitivity and specificity (p > 0.05) between the two modalities. The interobserver agreement was fair to good for the ultrasonographic features, categorization, size, and the location of breast masses. AWUS is thought to be useful for detecting breast lesions. In comparison with HHUS, AWUS shows no significant difference in the detection rate, sensitivity and the specificity, with high degrees of interobserver agreement.

  6. Partnering to Establish and Study Simulation in International Nursing Education.

    Science.gov (United States)

    Garner, Shelby L; Killingsworth, Erin; Raj, Leena

    The purpose of this article was to describe an international partnership to establish and study simulation in India. A pilot study was performed to determine interrater reliability among faculty new to simulation when evaluating nursing student competency performance. Interrater reliability was below the ideal agreement level. Findings in this study underscore the need to obtain baseline interrater reliability data before integrating competency evaluation into a simulation program.

  7. Diagnostic Performance and Confidence of Contrast-Enhanced Ultrasound in the Differential Diagnosis of Cystic and Cysticlike Liver Lesions.

    Science.gov (United States)

    Corvino, Antonio; Catalano, Orlando; Corvino, Fabio; Sandomenico, Fabio; Petrillo, Antonella

    2017-09-01

    The aims of this study were to assess the diagnostic performance of contrast-enhanced ultrasound (CEUS) in the characterization of atypical cystic and cysticlike focal liver lesions in comparison with conventional US and to determine whether the use of CEUS can reduce the need for further diagnostic workup. In a 3-year period 48 patients with 50 atypical cystic and cysticlike lesions found at conventional US underwent CEUS. Diagnostic confirmation was obtained in cytohistopathologic examinations, with other imaging modalities, and in follow-up. Overall, there were 24 cystic lesions and 26 cysticlike solid lesions, specifically 32 benign and 18 malignant lesions. The conventional US and CEUS images and cine loops were reviewed by two blinded readers independently. Sensitivity, specificity, area under the ROC curve (Az), and interobserver agreement were calculated. Diagnostic performance improved after review of CEUS examinations by both readers (conventional US Az = 0.781 vs 0.972; CEUS Az = 0.734 vs 0.957). Interreader agreement increased, although slightly (conventional US weighted κ = 0.894; CEUS weighted κ = 0.953). In terms of differential diagnosis, the occurrence of correctly characterized lesions increased after CEUS for both readers (reader 1, 62% to 98%; reader 2, 56% to 96%). The development of low-acoustic-power CEUS has made it possible to identify several imaging features of cystic and cysticlike focal liver lesions that, in association with history and clinical findings, may help to correctly characterize them. Our data indicate the usefulness of CEUS in the evaluation of patients with these lesions.

  8. Transcultural Adaptation of GRID Hamilton Rating Scale For Depression (GRID-HAMD) to Brazilian Portuguese and Evaluation of the Impact of Training Upon Inter-Rater Reliability.

    Science.gov (United States)

    Henrique-Araújo, Ricardo; Osório, Flávia L; Gonçalves Ribeiro, Mônica; Soares Monteiro, Ivandro; Williams, Janet B W; Kalali, Amir; Alexandre Crippa, José; Oliveira, Irismar Reis De

    2014-07-01

    GRID-HAMD is a semi-structured interview guide developed to overcome flaws in HAM-D, and has been incorporated into an increasing number of studies. Carry out the transcultural adaptation of GRID-HAMD into the Brazilian Portuguese language, evaluate the inter-rater reliability of this instrument and the training impact upon this measure, and verify the raters' opinions of said instrument. The transcultural adaptation was conducted by appropriate methodology. The measurement of inter-rater reliability was done by way of videos that were evaluated by 85 professionals before and after training for the use of this instrument. The intraclass correlation coefficient (ICC) remained between 0.76 and 0.90 for GRID-HAMD-21 and between 0.72 and 0.91 for GRID-HAMD-17. The training did not have an impact on the ICC, except for a few groups of participants with a lower level of experience. Most of the participants showed high acceptance of GRID-HAMD, when compared to other versions of HAM-D. The scale presented adequate inter-rater reliability even before training began. Training did not have an impact on this measure, except for a few groups with less experience. GRID-HAMD received favorable opinions from most of the participants.

  9. Actors' portrayals of depression to test interrater reliability in clinical trials.

    Science.gov (United States)

    Rosen, Jules; Mulsant, Benoit H; Bruce, Martha L; Mittal, Vikas; Fox, Debra

    2004-10-01

    This study determined if actors could portray depressed patients to establish the interrater reliability of raters using the Hamilton Depression Rating Scale (HDRS). Actors portrayed depressed patients using scripts derived from HDRS assessments obtained at three points during treatment. Four experienced raters blindly viewed videotapes of two patients and two actors. They guessed if each interviewee was a patient or an actor and rated the certainty of their guesses. For each interview, they also rated the realism of the portrayal and completed the HDRS. Experienced raters could not distinguish actors and patients better than chance and were equally certain of their right and wrong guesses. Actors and patients received high scores on the realism of their portrayals. The HDRS scores of the actor-patient pairs were correlated. Actors can effectively portray depressed patients. Future studies will determine if actors can accurately portray patients with anxiety and psychosis.

  10. Comparison between Radiographic (2-dimensional and 3-dimensional) and Histologic Findings of Periapical Lesions Treated with Apical Surgery.

    Science.gov (United States)

    Bornstein, Michael M; Bingisser, Andreas C; Reichart, Peter A; Sendi, Pedram; Bosshardt, Dieter D; von Arx, Thomas

    2015-06-01

    The aim of this study was to evaluate the concordance of 2- and 3-dimensional radiography and histopathology in the diagnosis of periapical lesions. Patients were consecutively enrolled in this study provided that preoperative periapical radiography (PR) and cone-beam computed tomographic imaging of the tooth to be treated with apical surgery were performed. The periapical lesional tissue was histologically analyzed by 2 blinded examiners. The final histologic diagnosis was compared with the radiographic assessments of 4 blinded observers. The initial study material included 62 teeth in the same number of patients. Four lesions had to be excluded during processing, resulting in a final number of 58 evaluated cases (31 women and 27 men, mean age = 55 years). The final histologic diagnosis of the periapical lesions included 55 granulomas (94.8%) and 3 cysts (5.2%). Histologic analysis of the tissue samples from the apical lesions exhibited an almost perfect agreement between the 2 experienced investigators with an overall agreement of 94.83% (kappa = 0.8011). Radiographic assessment overestimated cysts by 28.4% (cone-beam computed tomographic imaging) and 20.7% (periapical radiography), respectively. Comparing the correlation of the radiographic diagnosis of 4 observers with the final histologic diagnosis, 2-dimensional (kappa = 0.104) and 3-dimensional imaging (kappa = 0.111) provided only minimum agreement. To establish a final diagnosis of an apical radiolucency, the tissue specimen should be evaluated histologically and specified as a granuloma (with/without epithelium) or a cyst. Analysis of 2-dimensional and 3-dimensional radiographic images alike results only in a tentative diagnosis that should be confirmed with biopsy. Copyright © 2015 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.

  11. Potential Use of American College of Radiology BI-RADS Mammography Atlas for Reporting and Assessing Lesions Detected on Dedicated Breast CT Imaging: Preliminary Study.

    Science.gov (United States)

    Jung, Hae Kyoung; Kuzmiak, Cherie M; Kim, Keum Won; Choi, Na Mi; Kim, Hye Jeong; Langman, Eun Lee; Yoon, Sora; Steen, Doreen; Zeng, Donglin; Gao, Fei

    2017-11-01

    Dedicated breast computed tomography (DBCT) is an emerging and promising modality for breast lesions. The objective of this study was to evaluate the potential use of the BI-RADS Mammography Atlas 5th Edition for reporting and assessing breast lesions on DBCT. Currently, no atlas exists for DBCT. Four radiologists trained in breast imaging were recruited in this institutional review board-approved, Health Insurance Portability and Accountability Act-compliant study. The enrolled radiologists, who were blinded to mammographic and histopathologic findings, individually reviewed 30 randomized DBCT cases that contained marked lesions. Thirty-four lesions were included in this study: 24 (70.6%) masses, 7 (20.6%) calcifications, and 3 (8.8%) architectural distortions. Eight (23.5%) lesions were malignant and 26 (76.5%) were benign. For each marked DBCT lesion, each reader was asked to specify, according to the BI-RADS Mammography Atlas, the primary findings, features, breast density, and final assessment. We calculated readers' diagnostic performances for differentiating between benign and malignant lesions and interobserver variability for reporting and assessing lesions using a generalized estimating equation and the Fleiss kappa (κ) statistic. The estimated overall sensitivity of the readers was 0.969, and the specificity was 0.529. There were no significant differences in the sensitivity and the specificity between lesion types. For reporting the presence of a primary finding, overall agreement was substantial (κ = 0.70). In assigning the breast density and the final assessment, the overall agreement was moderate (κ = 0.53) and fair (κ = 0.30), respectively. The use of the BI-RADS Mammography Atlas 5th Edition for DBCT showed high performance and good agreement among readers. Copyright © 2017 The Association of University Radiologists. Published by Elsevier Inc. All rights reserved.
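
    For readers unfamiliar with the Fleiss kappa statistic mentioned above, which generalizes chance-corrected agreement to more than two raters, the following Python sketch shows the standard calculation. It is an illustration under stated assumptions, not the study's code or data: the function name `fleiss_kappa` is hypothetical, and the three-lesion, four-reader table is invented purely to make the arithmetic easy to follow.

```python
def fleiss_kappa(counts):
    """Fleiss' kappa for a table where counts[i][j] is the number of raters
    who assigned subject i to category j. Every row must sum to the same
    number of raters (at least 2)."""
    n_subjects = len(counts)
    n_raters = sum(counts[0])
    n_categories = len(counts[0])

    # Share of all ratings that fell into each category.
    p_category = [
        sum(row[j] for row in counts) / (n_subjects * n_raters)
        for j in range(n_categories)
    ]

    # Per-subject agreement: fraction of rater pairs that agree on that subject.
    p_subject = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]

    p_observed = sum(p_subject) / n_subjects
    p_expected = sum(p * p for p in p_category)

    # Undefined (division by zero) if every rating is in a single category.
    return (p_observed - p_expected) / (1 - p_expected)


# Invented example: 3 lesions, 4 readers, 2 categories (benign, malignant).
example_counts = [
    [4, 0],  # all four readers say benign
    [2, 2],  # readers split evenly
    [0, 4],  # all four readers say malignant
]
example_kappa = fleiss_kappa(example_counts)
print(f"Fleiss kappa = {example_kappa:.3f}")
```

    In this invented table the observed agreement is 7/9 and the chance-expected agreement is 1/2, so the statistic works out to 5/9, or roughly 0.56.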

  12. A method for reducing misclassification in the extended Glasgow Outcome Score.

    Science.gov (United States)

    Lu, Juan; Marmarou, Anthony; Lapane, Kate; Turf, Elizabeth; Wilson, Lindsay

    2010-05-01

    The eight-point extended Glasgow Outcome Scale (GOSE) is commonly used as the primary outcome measure in traumatic brain injury (TBI) clinical trials. The outcome is conventionally collected through a structured interview with the patient alone or together with a caretaker. Although structured interview questionnaires help raters reach agreement in GOSE assessment, significant variation remains among different raters. We introduce an alternate GOSE rating system as an aid in determining GOSE scores, with the objective of reducing inter-rater variation in the primary outcome assessment in TBI trials. Forty-five trauma centers were randomly assigned to three groups to assess GOSE scores on sample cases, using the alternative GOSE rating system coupled with central quality control (Group 1), the alternative system alone (Group 2), or conventional structured interviews (Group 3). The inter-rater variation between an expert and untrained raters was assessed for each group and reported through raw agreement and with weighted kappa (κ) statistics. Groups 2 and 3 without central review yielded inter-rater agreements of 83% (weighted kappa = 0.81; 95% CI 0.69, 0.92) and 83% (weighted kappa = 0.76, 95% CI 0.63, 0.89), respectively, in GOS scores. In GOSE, the groups had an agreement of 76% (weighted kappa = 0.79; 95% CI 0.69, 0.89), and 63% (weighted kappa = 0.70; 95% CI 0.60, 0.81), respectively. The group using the alternative rating system coupled with central monitoring yielded the highest inter-rater agreement among the three groups in rating GOS (97%; weighted kappa = 0.95; 95% CI 0.89, 1.00), and GOSE (97%; weighted kappa = 0.97; 95% CI 0.91, 1.00). The alternate system is an improved GOSE rating method that reduces inter-rater variations and provides, for the first time, source documentation and structured narratives that allow a thorough central review of information. The data suggest that a collective effort can be made to

  13. HPV and oral lesions: preventive possibilities, vaccines and early diagnosis of malignant lesions.

    Science.gov (United States)

    Testi, D; Nardone, M; Melone, P; Cardelli, P; Ottria, L; Arcuri, C

    2015-01-01

    HPV is of major importance to world health; high-risk HPV types contribute significantly to virus-associated neoplasms. In this article we analyze the various benign and malignant manifestations of HPV in the oral cavity, their prevalence, and their importance for early diagnosis and prevention. The classical oral lesions associated with human papillomavirus are squamous cell papilloma, condyloma acuminatum, verruca vulgaris and focal epithelial hyperplasia. Overall, HPV types 2, 4, 6, 11, 13 and 32 have been associated with benign oral lesions, while HPV types 16 and 18 have been associated with malignant lesions, especially in cancers of the tonsils and elsewhere in the oropharynx. Transmission of the virus can occur through direct contact, genital contact, and anal and oral sex; the latest studies suggest salivary transmission and transmission from mother to child during delivery. The number of lifetime sexual partners is an important risk factor for the development of HPV-positive head and neck cancer. Oral/oropharyngeal cancers etiologically associated with HPV have increased survival and a better prognosis (85%-90% at five years). There is no cure for the virus. There are two commercially available prophylactic vaccines against HPV today: the bivalent (16 and 18) Cervarix® and the tetravalent (6, 11, 16 and 18) Gardasil®; a new vaccine, Gardasil 9 (6, 11, 16, 18, 31, 33, 45, 52, 58), was approved in the United States. To be effective, such vaccination should start before "sexual puberty". The vaccine could be an important preventive strategy; the scientific community agrees on the hypothesis that blocking transmission may also limit long-term complications such as oropharyngeal cancer.

  14. Characterization of the enhancing lesions on dynamic contrast enhanced magnetic resonance imaging in patients with interstitial mammoplasty

    International Nuclear Information System (INIS)

    Kim, Tae Yun; Kim, Sung Hun; Kang, Bong Joo; Kim, Hyeon Sook; Cha, Eun Suk; Kim, Ji Youn; Song, Byung Joo

    2013-01-01

    Purpose: The purpose of this study was to categorize the morphologic and kinetic features of enhancing lesions in breasts with interstitial mammoplasty using dynamic contrast-enhanced magnetic resonance imaging and to assess factors predictive of breast cancer. Materials and methods: We retrospectively reviewed the clinical and radiological data of 21 enhancing lesions in 19 patients with interstitial mammoplasty, who underwent breast magnetic resonance imaging and biopsy or an operation in our hospital from September 2008 to July 2012. These lesions were sorted by morphological and kinetic features and final assessment category according to the BI-RADS lexicon. Results: Nine cases were confirmed to be ductal carcinoma in situ (n = 2) and invasive ductal carcinoma (n = 7), and the remaining 12 cases were fibrocystic disease (n = 2), fibroadenoma (n = 2), fat necrosis (n = 1), foreign body granuloma (n = 3) and silicone mastitis (n = 1). Common features of malignancy included irregular shape (50.0%), spiculated margins (75.0%), heterogeneous enhancement (50.0%) and type III kinetic pattern (87.5%). The correlations of margins and kinetic curve pattern with benignity and malignancy approached statistical significance (p = 0.02, respectively). We found no correlation for shape (p = 0.33) or internal enhancement (p = 0.42) between lesion types. The malignancy rate of enhancing lesions was 42.8% (9/21). The sensitivity and specificity of dynamic contrast-enhanced magnetic resonance imaging were 100% and 16.67%, respectively. The positive predictive value, negative predictive value and accuracy of magnetic resonance imaging were 47.38%, 100% and 52.38%. Overall inter-observer agreement for the kinetic curve pattern was good (κ = 0.67). Moderate agreement was seen in describing the shape, margin, enhancement and assessing the final category (κ = 0.59, 0.46, 0.58 and 0.49, respectively). Conclusion: Dynamic contrast-enhanced magnetic resonance imaging had a high
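
    The sensitivity, specificity, predictive values and accuracy quoted above can be reproduced from a 2x2 table. The abstract does not give that table, so the counts in the Python sketch below are an assumption inferred from the reported figures (9 malignant and 12 non-malignant lesions, 100% sensitivity, 16.67% specificity); the function name `diagnostic_metrics` is likewise hypothetical.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard diagnostic accuracy measures from a 2x2 table.

    tp/fn: malignant lesions called positive/negative
    fp/tn: non-malignant lesions called positive/negative
    """
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
    }


# Assumed counts (not stated in the abstract): all 9 malignant lesions
# positive on MRI, and 2 of the 12 non-malignant lesions negative.
mri = diagnostic_metrics(tp=9, fp=10, fn=0, tn=2)

for name, value in mri.items():
    print(f"{name}: {value:.2%}")
```

    Under these assumed counts the calculation gives 100% sensitivity, 16.67% specificity, 100% negative predictive value and 52.38% accuracy, matching the abstract; the positive predictive value comes out as 9/19, about 47.37%, against the 47.38% reported, a difference in the last digit only.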

  15. Characterization of the enhancing lesions on dynamic contrast enhanced magnetic resonance imaging in patients with interstitial mammoplasty

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Tae Yun [Department of Radiology, Seoul St. Mary's Hospital, The Catholic University of Korea (Korea, Republic of); Kim, Sung Hun, E-mail: rad-ksh@catholic.ac.kr [Department of Radiology, Seoul St. Mary's Hospital, The Catholic University of Korea (Korea, Republic of); Kang, Bong Joo [Department of Radiology, Seoul St. Mary's Hospital, The Catholic University of Korea (Korea, Republic of); Kim, Hyeon Sook [Department of Radiology, St. Paul Hospital, The Catholic University of Korea (Korea, Republic of); Cha, Eun Suk [Department of Radiology, Ewha Womans University, School of Medicine, Mokdong Hospital (Korea, Republic of); Kim, Ji Youn [Department of Radiology, Yeouido St. Mary's Hospital, The Catholic University of Korea (Korea, Republic of); Song, Byung Joo [Department of Surgery, Seoul St. Mary's Hospital, The Catholic University of Korea, Seoul (Korea, Republic of)

    2013-12-01

    Purpose: The purpose of this study was to categorize the morphologic and kinetic features of enhancing lesions in breasts with interstitial mammoplasty using dynamic contrast-enhanced magnetic resonance imaging and to assess factors predictive of breast cancer. Materials and methods: We retrospectively reviewed the clinical and radiological data of 21 enhancing lesions in 19 patients with interstitial mammoplasty, who underwent breast magnetic resonance imaging and biopsy or an operation in our hospital from September 2008 to July 2012. These lesions were sorted by morphological and kinetic features and final assessment category according to the BI-RADS lexicon. Results: Nine cases were confirmed to be ductal carcinoma in situ (n = 2) and invasive ductal carcinoma (n = 7), and the remaining 12 cases were fibrocystic disease (n = 2), fibroadenoma (n = 2), fat necrosis (n = 1), foreign body granuloma (n = 3) and silicone mastitis (n = 1). Common features of malignancy included irregular shape (50.0%), spiculated margins (75.0%), heterogeneous enhancement (50.0%) and type III kinetic pattern (87.5%). The correlations of margins and kinetic curve pattern with benignity and malignancy approached statistical significance (p = 0.02, respectively). We found no correlation for shape (p = 0.33) or internal enhancement (p = 0.42) between lesion types. The malignancy rate of enhancing lesions was 42.8% (9/21). The sensitivity and specificity of dynamic contrast-enhanced magnetic resonance imaging were 100% and 16.67%, respectively. The positive predictive value, negative predictive value and accuracy of magnetic resonance imaging were 47.38%, 100% and 52.38%. Overall inter-observer agreement for the kinetic curve pattern was good (κ = 0.67). Moderate agreement was seen in describing the shape, margin, enhancement and assessing the final category (κ = 0.59, 0.46, 0.58 and 0.49, respectively). Conclusion: Dynamic contrast-enhanced magnetic resonance imaging had a high

  16. Inter-rater variability of visual interpretation and comparison with quantitative evaluation of ¹¹C-PiB PET amyloid images of the Japanese Alzheimer's Disease Neuroimaging Initiative (J-ADNI) multicenter study

    Energy Technology Data Exchange (ETDEWEB)

    Yamane, Tomohiko [Saitama Medical University Saitama International Center, Department of Nuclear Medicine, Hidaka (Japan); Institute of Biomedical Research and Innovation, Division of Molecular Imaging, Kobe (Japan); Tokyo Metropolitan Institute of Gerontology, Team for Neuroimaging Research, Tokyo (Japan); Ishii, Kenji; Sakata, Muneyuki [Tokyo Metropolitan Institute of Gerontology, Team for Neuroimaging Research, Tokyo (Japan); Ikari, Yasuhiko; Nishio, Tomoyuki [Institute of Biomedical Research and Innovation, Division of Molecular Imaging, Kobe (Japan); Research Association for Biotechnology, Tokyo (Japan); Ishii, Kazunari [Kinki University Hospital, Department of Radiology, Osaka, Sayama (Japan); Kato, Takashi; Ito, Kengo [National Center for Geriatrics and Gerontology, Department of Brain Science and Molecular Imaging, Obu (Japan); Senda, Michio [Institute of Biomedical Research and Innovation, Division of Molecular Imaging, Kobe (Japan); Collaboration: J-ADNI Study Group

    2017-05-15

    The aim of this study was to assess the inter-rater variability of the visual interpretation of ¹¹C-PiB PET images regarding the positivity/negativity of amyloid deposition that were obtained in a multicenter clinical research project, Japanese Alzheimer's Disease Neuroimaging Initiative (J-ADNI). The results of visual interpretation were also compared with a semi-automatic quantitative analysis using mean cortical standardized uptake value ratio to the cerebellar cortex (mcSUVR). A total of 162 ¹¹C-PiB PET scans, including 45 mild Alzheimer's disease, 60 mild cognitive impairment, and 57 normal cognitive control cases that had been acquired as J-ADNI baseline scans were analyzed. Based on visual interpretation by three independent raters followed by consensus read, each case was classified into positive, equivocal, and negative deposition (ternary criteria) and further dichotomized by merging the former two (binary criteria). Complete agreement of visual interpretation by the three raters was observed for 91.3% of the cases (Cohen κ = 0.88 on average) in ternary criteria and for 92.3% (κ = 0.89) in binary criteria. Cases that were interpreted as visually positive in the consensus read showed significantly higher mcSUVR than those visually negative (2.21 ± 0.37 vs. 1.27 ± 0.09, p < 0.001), and positive or negative decision by visual interpretation was dichotomized by a cut-off value of mcSUVR = 1.5. Significant positive/negative associations were observed between mcSUVR and the number of raters who evaluated as positive (ρ = 0.87, p < 0.0001) and negative (ρ = -0.85, p < 0.0001) interpretation. Cases of disagreement among raters showed generally low mcSUVR. Inter-rater agreement was almost perfect in ¹¹C-PiB PET scans. Positive or negative decision by visual interpretation was dichotomized by a cut-off value of mcSUVR = 1.5. As some cases of disagreement among raters tended to show low mcSUVR, referring to quantitative method may

  17. Cervical pre-malignant lesions in HIV infected women attending Care and Treatment Centre in a tertiary hospital, Dar es Salaam, Tanzania.

    Science.gov (United States)

    Balandya, Belinda S; Pembe, Andrea B; Mwakyoma, Henry A

    2011-09-01

    The aims of this study were to determine the proportion of HIV infected women with cervical pre-malignant lesions and to compare the use of Visual Inspection of the cervix after application of Acetic acid (VIA) and Papanicolaou (Pap) smear in screening for cervical premalignant lesions in HIV positive women attending Care and Treatment Centre (CTC) at Muhimbili National Hospital (MNH), Dar es Salaam, Tanzania. A total of 316 women aged 18-70 years had a Pap smear taken for cytology, followed by spraying the cervix with 4% acetic acid and then inspecting it. Cytology was considered negative when there was no Cervical Intraepithelial Neoplasia (CIN) lesion reported from the Pap smear taken, and positive if CIN lesion 1, 2 or 3 was reported. Detection of a well-defined, opaque acetowhite lesion close to the squamocolumnar junction or close to the external cervical os constituted a positive VIA. Out of 316 women, 132 women had acetowhite lesions on VIA, making the proportion of abnormal cervical lesions 42.4%. One hundred and one out of 312 women (32.4%) had CIN lesions detected on Pap smear. The proportion of agreement between these two tests was 0.3. The proportion of agreement was moderate in women with advanced WHO HIV clinical stage of the disease and in women not on ART (Anti Retroviral Therapy). Women with CD4 count less than 200 cells/mm3 had more abnormal cervical lesions. There is a considerable proportion of HIV positive women with premalignant lesions of the cervix. Considering the proportion of HIV positive women with abnormal lesions and the logistical difficulty of doing Pap smears in low resource settings, these results support the recommendation to introduce screening of premalignant lesions of the cervix using VIA to all HIV infected women.

  18. Comparative measurement of collagen bundle orientation by Fourier analysis and semiquantitative evaluation: reliability and agreement in Masson's trichrome, Picrosirius red and confocal microscopy techniques.

    Science.gov (United States)

    Marcos-Garcés, V; Harvat, M; Molina Aguilar, P; Ferrández Izquierdo, A; Ruiz-Saurí, A

    2017-08-01

    Measurement of collagen bundle orientation in histopathological samples is a widely used and useful technique in many research and clinical scenarios. Fourier analysis is the preferred method for performing this measurement, but the most appropriate staining and microscopy technique remains unclear. Some authors advocate the use of Haematoxylin-Eosin (H&E) and confocal microscopy, but there are no studies comparing this technique with other classical collagen stainings. In our study, 46 human skin samples were collected, processed for histological analysis and stained with Masson's trichrome, Picrosirius red and H&E. Five microphotographs of the reticular dermis were taken at 200× magnification with light microscopy, polarized microscopy and confocal microscopy, respectively. Two independent observers measured collagen bundle orientation with semiautomated Fourier analysis with the Image-Pro Plus 7.0 software and three independent observers performed a semiquantitative evaluation of the same parameter. The average orientation for each case was calculated with the values of the five pictures. We analyzed the interrater reliability, the consistency between Fourier analysis and average semiquantitative evaluation and the consistency between measurements in Masson's trichrome, Picrosirius red and H&E-confocal. Statistical analysis for reliability and agreement was performed with the SPSS 22.0 software and consisted of intraclass correlation coefficient (ICC), Bland-Altman plots and limits of agreement and coefficient of variation. Interrater reliability was almost perfect (ICC > 0.8) with all three histological and microscopy techniques and always higher with Fourier analysis than with average semiquantitative evaluation. Measurements were consistent between Fourier analysis by one observer and average semiquantitative evaluation by three observers, with an almost perfect agreement with Masson's trichrome and Picrosirius red techniques (ICC > 0.8) and a strong

  19. The Berg Balance Scale has high intra- and inter-rater reliability but absolute reliability varies across the scale: a systematic review.

    Science.gov (United States)

    Downs, Stephen; Marquez, Jodie; Chiarelli, Pauline

    2013-06-01

    What is the intra-rater and inter-rater relative reliability of the Berg Balance Scale? What is the absolute reliability of the Berg Balance Scale? Does the absolute reliability of the Berg Balance Scale vary across the scale? Systematic review with meta-analysis of reliability studies. Any clinical population that has undergone assessment with the Berg Balance Scale. Relative intra-rater reliability, relative inter-rater reliability, and absolute reliability. Eleven studies involving 668 participants were included in the review. The relative intra-rater reliability of the Berg Balance Scale was high, with a pooled estimate of 0.98 (95% CI 0.97 to 0.99). Relative inter-rater reliability was also high, with a pooled estimate of 0.97 (95% CI 0.96 to 0.98). A ceiling effect of the Berg Balance Scale was evident for some participants. In the analysis of absolute reliability, all of the relevant studies had an average score of 20 or above on the 0 to 56 point Berg Balance Scale. The absolute reliability across this part of the scale, as measured by the minimal detectable change with 95% confidence, varied between 2.8 points and 6.6 points. The Berg Balance Scale has a higher absolute reliability when close to 56 points due to the ceiling effect. We identified no data that estimated the absolute reliability of the Berg Balance Scale among participants with a mean score below 20 out of 56. The Berg Balance Scale has acceptable reliability, although it might not detect modest, clinically important changes in balance in individual subjects. The review was only able to comment on the absolute reliability of the Berg Balance Scale among people with moderately poor to normal balance. Copyright © 2013 Australian Physiotherapy Association. All rights reserved.
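
    The minimal detectable change quoted above is conventionally derived from the standard error of measurement. The abstract does not state which formula the included studies used, so the following is only a sketch under the usual textbook assumption, where SD is the between-subject standard deviation and ICC the relative reliability coefficient:

```latex
\mathrm{SEM} = \mathrm{SD}\,\sqrt{1 - \mathrm{ICC}},
\qquad
\mathrm{MDC}_{95} = 1.96 \times \sqrt{2} \times \mathrm{SEM} \approx 2.77 \times \mathrm{SEM}
```

    If that convention applies, the reported range of 2.8 to 6.6 points would correspond to a standard error of measurement of roughly 1.0 to 2.4 points (2.8 / 2.77 and 6.6 / 2.77).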

  20. A supervised framework for lesion segmentation and automated VLSM analyses in left hemispheric stroke

    Directory of Open Access Journals (Sweden)

    Dorian Pustina

    2015-05-01

    INTRODUCTION: Voxel-based lesion-symptom mapping (VLSM) is conventionally performed using skill and knowledge of experts to manually delineate brain lesions. This process requires time, and is likely to have substantial inter-rater variability. Here, we propose a supervised machine learning framework for lesion segmentation capable of learning from a single modality and existing manual segmentations in order to delineate lesions in new patients. METHODS: Data from 60 patients with chronic stroke aphasia were utilized in the study (age: 59.7±11.5yrs, post-stroke interval: 5±2.9yrs, male/female ratio: 34/26). Using a single T1 image of each subject, additional features were created that provided complementary information, such as difference from template, tissue segmentation, brain asymmetries, gradient magnitude, and deviances of these images from 80 age and gender matched controls. These features were fed into the MRV-NRF (multi-resolution voxel-wise neighborhood random forest; Tustison et al., 2014) prediction algorithm implemented in ANTsR (Avants, 2015). The algorithm incorporates information from each voxel and its surrounding neighbors from all above features, in a hierarchy of random forest predictions from low to high resolution. The validity of the framework was tested with a 6-fold cross validation (i.e., train from 50 subjects, predict 10). The process was repeated ten times, producing ten segmentations for each subject, from which the average solution was binarized. Predicted lesions were compared to manually defined lesions, and VLSM models were built on 4 language measures: repetition and comprehension subscores from the WAB (Kertesz, 1982), WAB-AQ, and PNT naming accuracy (Roach, Schwartz, Martin, Grewal, & Brecher, 1996). RESULTS: Manual and predicted lesion size showed high correlation (r=0.96). Compared to manual lesions, the predicted lesions had a dice overlap of 0.72 (±0.14 STD), a case-wise maximum distance (Hausdorff) of 21mm (±16
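
    The Dice overlap used above to compare predicted and manual lesions can be illustrated with a minimal sketch. The function name and the toy masks are hypothetical and are not taken from the study, which worked on 3-D images in ANTsR; here each mask is simply a flat list of 0/1 voxel labels.

```python
def dice_overlap(mask_a, mask_b):
    """Dice coefficient between two binary masks given as equal-length 0/1 sequences.

    Dice = 2 * |A and B| / (|A| + |B|). Two empty masks are treated as a
    perfect match (an assumption made here to avoid dividing by zero).
    """
    if len(mask_a) != len(mask_b):
        raise ValueError("masks must have the same number of voxels")
    size_a = sum(1 for v in mask_a if v)
    size_b = sum(1 for v in mask_b if v)
    intersection = sum(1 for a, b in zip(mask_a, mask_b) if a and b)
    if size_a + size_b == 0:
        return 1.0
    return 2.0 * intersection / (size_a + size_b)


# Toy example: 8 voxels, 4 labelled as lesion in each mask, 3 in common.
manual = [0, 1, 1, 1, 0, 0, 1, 0]
predicted = [0, 1, 1, 0, 0, 1, 1, 0]
print(dice_overlap(manual, predicted))
```

    In this toy case the overlap is 2 × 3 / (4 + 4) = 0.75, close to the mean value the abstract reports.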

  1. Number of test trials needed for performance stability and interrater reliability of the one leg stand test in patients with a major non-traumatic lower limb amputation

    DEFF Research Database (Denmark)

    Kristensen, Morten Tange; Nielsen, Anni Østergaard; Madsen Topp, Ulla

    2014-01-01

    Balance is beneficial for daily functioning of patients with a lower limb amputation and sometimes assessed by the one-leg stand test (OLST). The aims of the study were to examine (1) the number of trials needed to achieve performance stability, (2) the interrater reliability of the OLST in patients with a major non-traumatic lower limb amputation, and (3) to provide a test procedure.

  2. Exploring precarious employment and women's health within the context of U.S. microenterprise using focus groups.

    Science.gov (United States)

    Salt, Rebekah; Lee, Jongwon

    2014-01-01

    Nursing has been a leader in exploring social determinants of health within the context of U.S. microenterprise and women's health. The purpose of this study was to explore precarious employment within the context of microenterprise and women's health using focus groups with clientele from New Mexico (NM). The specific aims were to identify (1) the health concerns of low-income women who utilized resources from Women's Economic Self-Sufficiency Team (WESST), and (2) the meaning of precarious employment in low-income women's lives. Fourteen women, ranging in age from 21-65 years, who were affiliated with regional WESST sites around NM participated in focus groups and completed a demographic questionnaire. Focus group data were analyzed using content analysis. The degree of interrater agreement was determined by calculating Cohen's kappa, percentage agreement, and the prevalence-adjusted and bias-adjusted kappa (PABAK). Two broad themes emerged from these data: (1) Working for Yourself and (2) Strategies. Although the women identified concerns about participation in microenterprise, flexibility, freedom, and feeling purposeful were motivators to pursue a small business. The kappa statistics on the five transcripts revealed poor inter-rater agreement, yet PABAK, which adjusts for prevalence and rater bias, indicated that inter-rater agreement between the two raters was satisfactory. Despite the challenges associated with microenterprise in the US, women found value in working for themselves. © 2014 Wiley Periodicals, Inc.
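
    The gap between a poor kappa and a satisfactory PABAK reported above is a known effect of skewed prevalence, and a small sketch can show how it arises. The data below are invented for illustration (they are not the study's transcripts), the function name is hypothetical, and the two-category form PABAK = 2·Po − 1 is assumed.

```python
def agreement_stats(rater1, rater2):
    """Return (observed agreement, Cohen's kappa, PABAK) for two raters.

    Ratings are equal-length sequences of 0/1 codes. PABAK uses the
    two-category form 2 * Po - 1.
    """
    n = len(rater1)
    p_obs = sum(1 for a, b in zip(rater1, rater2) if a == b) / n
    p1_yes = sum(rater1) / n
    p2_yes = sum(rater2) / n
    # Chance agreement expected from each rater's marginal proportions.
    p_exp = p1_yes * p2_yes + (1 - p1_yes) * (1 - p2_yes)
    kappa = (p_obs - p_exp) / (1 - p_exp) if p_exp < 1 else float("nan")
    pabak = 2 * p_obs - 1
    return p_obs, kappa, pabak


# Invented example: a code that is rarely applied across 20 text segments.
# 18 segments: both raters say "absent"; 2 segments: the raters disagree.
rater1 = [0] * 18 + [1, 0]
rater2 = [0] * 18 + [0, 1]
p_obs, kappa, pabak = agreement_stats(rater1, rater2)
print(round(p_obs, 2), round(kappa, 3), round(pabak, 2))
```

    Here the raters agree on 90% of segments, yet kappa is slightly negative because almost all agreement is expected by chance when one category dominates, while PABAK is 0.8.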

  3. 4-Meter Gait Speed Test in Chronic Obstructive Pulmonary Disease: INTERRATER RELIABILITY USING A STOPWATCH.

    Science.gov (United States)

    Bisca, Gianna Waldrich; Fava, Lucas Rodrigues; Morita, Andrea Akemi; Machado, Felipe Vilaça Cavallari; Pitta, Fabio; Hernandes, Nidia Aparecida

    2017-12-14

    4-meter gait speed (4MGS) is increasingly used to assess functional performance in patients with chronic obstructive pulmonary disease. However, the current literature lacks information regarding some technical standards for this test. Therefore, the purpose of this study was to compare and to evaluate the interrater reliability between a stopwatch and video recording used as timing systems for the 4MGS in patients with chronic obstructive pulmonary disease, as well as to verify the interrater reliability between 2 observers measuring the 4MGS time using a manual stopwatch. Fifty-one patients performed the 4MGS using 4 different protocols (random order): walking at the usual and maximum speed in a 4-meter course and walking at the same 2 speeds on an 8-m course using a 2-m acceleration zone, a 4-meter timing area, and a 2-m deceleration zone. Gait speed was measured simultaneously using a stopwatch and a video recording. In a subanalysis (n = 24), 2 independent observers timed the 4MGS using a stopwatch. There was no significant difference in comparison between the 2 timing methods (P > .05 for all), and the reliability between video recording and stopwatch was excellent in all 4MGS studied protocols (intraclass correlation coefficient ≥ 0.91). Moreover, when comparing gait speed measured by 2 observers using a stopwatch, no significant difference was found among all proposed protocols (P > .05 for all), and there was also excellent reliability between the 2 independent observers (intraclass correlation coefficient ≥ 0.94). The stopwatch, a low-cost and feasible tool, is reliable as a timing device for the 4MGS in patients with chronic obstructive pulmonary disease.

  4. Association of human papilloma virus with atypical and malignant oral papillary lesions.

    Science.gov (United States)

    McCord, Christina; Xu, Jing; Xu, Wei; Qiu, Xin; Muhanna, Nidal; Irish, Jonathan; Leong, Iona; McComb, Richard John; Perez-Ordonez, Bayardo; Bradley, Grace

    2014-06-01

    This study aimed to examine atypical and malignant papillary oral lesions for low- and high-risk human papillomavirus (HPV) infection and to correlate HPV infection with clinical and pathologic features. Sections of 28 atypical papillary lesions (APLs) and 14 malignant papillary lesions (MPLs) were examined for HPV by in situ hybridization and for p16 and MIB-1 by immunohistochemistry; 24 conventional papillomas were studied for comparison. Low-risk HPV was found in 10 of 66 cases, including 9 APLs and 1 papilloma. All low-risk HPV-positive cases showed suprabasilar MIB-1 staining, and the agreement was statistically significant (P < .0001). Diffuse p16 staining combined with high-risk HPV was not seen in any of the cases. A subset of HPV(-) APLs progressed to carcinoma. Oral papillary lesions are a heterogeneous group. Low-risk HPV infection is associated with a subset of APLs with a benign clinical course. Potentially malignant APLs and MPLs are not associated with low- or high-risk HPV. Copyright © 2014 Elsevier Inc. All rights reserved.

  5. Low Interrater Reliability in Grading of Rectal Bleeding Using National Cancer Institute Common Toxicity Criteria and Radiation Therapy Oncology Group Toxicity Scales: A Survey of Radiation Oncologists

    International Nuclear Information System (INIS)

    Huynh-Le, Minh-Phuong; Zhang, Zhe; Tran, Phuoc T.; DeWeese, Theodore L.; Song, Daniel Y.

    2014-01-01

    Purpose: To measure concordance among genitourinary radiation oncologists in using the National Cancer Institute Common Toxicity Criteria (NCI CTC) and Radiation Therapy Oncology Group (RTOG) grading scales to grade rectal bleeding. Methods and Materials: From June 2013 to January 2014, a Web-based survey was sent to 250 American and Canadian academic radiation oncologists who treat prostate cancer. Participants were provided 4 case vignettes in which patients received radiation therapy and developed rectal bleeding and were asked for management plans and to rate the bleeding according to NCI CTC v.4 and RTOG late toxicity grading (scales provided). In 2 cases, participants were also asked whether they would send the patient for colonoscopy. A multilevel, random intercept modeling approach was used to assess sources of variation (case, respondent) in toxicity grading to calculate the intraclass correlation coefficient (ICC). Agreement on a dichotomous grading scale (low grades 1-2 vs high grades 3-4) was also assessed, using the κ statistic for multiple respondents. Results: Seventy-two radiation oncologists (28%) completed the survey. Forty-seven (65%) reported having either written or been principal investigator on a study using these scales. Agreement between respondents was moderate (ICC 0.52, 95% confidence interval [CI] 0.47-0.58) when using NCI CTC and fair using the RTOG scale (ICC 0.28, 95% CI 0.20-0.40). Respondents who chose an invasive management were more likely to select a higher toxicity grade (P<.0001). Using the dichotomous scale, we observed moderate agreement (κ = 0.42, 95% CI 0.40-0.44) with the NCI CTC scale, but only slight agreement with the RTOG scale (κ = 0.19, 95% CI 0.17-0.21). Conclusion: Low interrater reliability was observed among radiation oncologists grading rectal bleeding using 2 common scales. Clearer definitions of late rectal bleeding toxicity should be constructed to reduce this variability and avoid ambiguity in both

  6. Evaluating the reliability of an injury prevention screening tool: Test-retest study.

    Science.gov (United States)

    Gittelman, Michael A; Kincaid, Madeline; Denny, Sarah; Wervey Arnold, Melissa; FitzGerald, Michael; Carle, Adam C; Mara, Constance A

    2016-10-01

    A standardized injury prevention (IP) screening tool can identify family risks and allow pediatricians to address behaviors. To assess behavior changes on later screens, the tool must be reliable for an individual and ideally between household members. Little research has examined the reliability of safety screening tool questions. This study utilized test-retest reliability of parent responses on an existing IP questionnaire and also compared responses between household parents. Investigators recruited parents of children 0 to 1 year of age during admission to a tertiary care children's hospital. When both parents were present, one was chosen as the "primary" respondent. Primary respondents completed the 30-question IP screening tool after consent, and they were re-screened approximately 4 hours later to test individual reliability. The "second" parent, when present, only completed the tool once. All participants received a 10-dollar gift card. Cohen's Kappa was used to estimate test-retest reliability and inter-rater agreement. Standard test-retest criteria consider Kappa values: 0.0 to 0.40 poor to fair, 0.41 to 0.60 moderate, 0.61 to 0.80 substantial, and 0.81 to 1.00 as almost perfect reliability. One hundred five families participated, with five lost to follow-up. Thirty-two (30.5%) parent dyads completed the tool. Primary respondents were generally mothers (88%) and Caucasian (72%). Test-retest of the primary respondents showed their responses to be almost perfect; average 0.82 (SD = 0.13, range 0.49-1.00). Seventeen questions had almost perfect test-retest reliability and 11 had substantial reliability. However, inter-rater agreement between household members for 12 objective questions showed little agreement between responses; inter-rater agreement averaged 0.35 (SD = 0.34, range -0.19-1.00). One question had almost perfect inter-rater agreement and two had substantial inter-rater agreement. 
    The IP screening tool used by a single individual had excellent
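
    The Kappa bands quoted in this abstract can be written as a small lookup, sketched below. The function name is hypothetical, and two choices are assumptions not stated in the source: negative values get their own label, and values falling between two quoted bands (for example 0.405) are assigned to the higher band.

```python
def kappa_band(kappa):
    """Map a kappa value to the descriptive bands quoted in the abstract.

    Assumptions (not from the source): negative values are labelled
    separately, and band edges are applied as upper limits (<= 0.40,
    <= 0.60, <= 0.80).
    """
    if kappa > 1.0 or kappa < -1.0:
        raise ValueError("kappa must lie in [-1, 1]")
    if kappa < 0.0:
        return "below chance"
    if kappa <= 0.40:
        return "poor to fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


# Averages reported in the abstract: test-retest 0.82, inter-rater 0.35.
print(kappa_band(0.82))
print(kappa_band(0.35))
```

    Applied to the reported averages, 0.82 falls in the almost perfect band and 0.35 in the poor to fair band, matching the wording of the abstract.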

  7. Reliability of Lactation Assessment Tools Applied to Overweight and Obese Women.

    Science.gov (United States)

    Chapman, Donna J; Doughty, Katherine; Mullin, Elizabeth M; Pérez-Escamilla, Rafael

    2016-05-01

    The interrater reliability of lactation assessment tools has not been evaluated in overweight/obese women. This study aimed to compare the interrater reliability of 4 lactation assessment tools in this population. A convenience sample of 45 women (body mass index > 27.0) was videotaped while breastfeeding (twice daily on days 2, 4, and 7 postpartum). Three International Board Certified Lactation Consultants independently rated each videotaped session using 4 tools (Infant Breastfeeding Assessment Tool [IBFAT], modified LATCH [mLATCH], modified Via Christi [mVC], and Riordan's Tool [RT]). For each day and tool, we evaluated interrater reliability with 1-way repeated-measures analyses of variance, intraclass correlation coefficients (ICCs), and percentage absolute agreement between raters. Analyses of variance showed significant differences between raters' scores on day 2 (all scales) and day 7 (RT). Intraclass correlation coefficient values reflected good (mLATCH) to excellent reliability (IBFAT, mVC, and RT) on days 2 and 7. All day 4 ICCs reflected good reliability. The ICC for mLATCH was significantly lower than all others on day 2 and was significantly lower than IBFAT (day 7). Percentage absolute interrater agreement for scale components ranged from 31% (day 2: observable swallowing, RT) to 92% (day 7: IBFAT, fixing; and mVC, latch time). Swallowing scores on all scales had the lowest levels of interrater agreement (31%-64%). We demonstrated differences in the interrater reliability of 4 lactation assessment tools when applied to overweight/obese women, with the lowest values observed on day 4. Swallowing assessment was particularly unreliable. Researchers and clinicians using these scales should be aware of the differences in their psychometric behavior. © The Author(s) 2015.

  8. Reliability of the craniocervical posture assessment: visual and angular measurements using photographs and radiographs.

    Science.gov (United States)

    Gadotti, Inae C; Armijo-Olivo, Susan; Silveira, Anelise; Magee, David

    2013-01-01

    The purposes of this study were to determine the intrarater and interrater reliability of the craniocervical posture in a sagittal view using quantitative measurements on photographs and radiographs and to determine the agreement of the visual assessment of posture between raters. One photograph and 1 radiograph of the sagittal craniocervical posture were simultaneously taken from 39 healthy female subjects. Three angles were measured on the photographs and 10 angles on the radiographs of 22 subjects using Alcimage software (Alcimage; Uberlândia, MG, Brazil). Two repeated measurements were performed by 2 raters. The measurements were compared within and between raters to test the intrarater and interrater reliability, respectively. Intraclass correlation coefficient and SEM were used. κ Agreement was calculated for the visual assessment of 39 subjects using photographs and radiographs between 2 raters. Good to excellent intrarater and interrater intraclass correlation coefficient values were found on both photographs and radiographs. Interrater SEM was large and clinically significant for cervical lordosis photogrammetry and for 1 angle measuring cervical lordosis on radiographs. Interrater κ agreement for the visual assessment using photographs was poor (κ = 0.37). The raters were reliable to measure angles in photographs and radiographs to quantify craniocervical posture with exception of 2 angles measuring lordosis of the cervical spine when compared between raters. The visual assessment of posture between raters was not reliable. © 2013. Published by National University of Health Sciences All rights reserved.

  9. A nomenclature paradigm for benign midmembranous vocal fold lesions.

    Science.gov (United States)

    Rosen, Clark A; Gartner-Schmidt, Jackie; Hathaway, Bridget; Simpson, C Blake; Postma, Gregory N; Courey, Mark; Sataloff, Robert T

    2012-06-01

    There is a significant lack of uniform agreement regarding nomenclature for benign vocal fold lesions (BVFLs). This confusion results in difficulty for clinicians communicating with their patients and with each other. In addition, BVFL research and comparison of treatment methods are hampered by the lack of a detailed and uniform BVFL nomenclature. Clinical consensus conferences were held to develop an initial BVFL nomenclature paradigm. Perceptual video analysis was performed to validate the stroboscopy component of the paradigm. The culmination of the consensus conferences and the video-perceptual analysis was used to evaluate the BVFL nomenclature paradigm using a retrospective review of patients with BVFL. An initial BVFL nomenclature paradigm was proposed utilizing detailed definitions relating to vocal fold lesion morphology, stroboscopy, response to voice therapy and intraoperative findings. Video-perceptual analysis of stroboscopy demonstrated that the proposed binary stroboscopy system used in the BVFL nomenclature paradigm was valid and widely applicable. Retrospective review of 45 patients with BVFL followed to the conclusion of treatment demonstrated that slight modifications of the initial BVFL nomenclature paradigm were required. With the modified BVFL nomenclature paradigm, 96% of the patients fit into the predicted pattern and definitions of the BVFL nomenclature system. This study has validated a multidimensional BVFL nomenclature paradigm. This vocal fold nomenclature paradigm includes nine distinct vocal fold lesions: vocal fold nodules, vocal fold polyp, pseudocyst, vocal fold cyst (subepithelial or ligament), nonspecific vocal fold lesion, vocal fold fibrous mass (subepithelial or ligament), and reactive lesion. Copyright © 2011 The American Laryngological, Rhinological, and Otological Society, Inc.

  10. Agreements

    International Nuclear Information System (INIS)

    2001-01-01

    These columns summarize the different bilateral and multilateral agreements concluded recently between the different OECD countries and concerning the nuclear energy domain: Argentina - Australia: Agreement concerning Co-operation in the Peaceful Uses of Nuclear Energy (2001). Argentina - Brazil: Joint Declaration regarding the Creation of the Argentinean-Brazilian Agency for Nuclear Energy Applications (2001). Australia - Czech Republic / Australia - Hungary: Agreements on Co-operation in Peaceful Uses of Nuclear Energy and the Transfer of Nuclear Material (2001). Australia - Indonesia: Arrangement Concerning Co-operation on Nuclear Safeguards and Related Matters (2001). Austria - Switzerland: Agreement on the Early Exchange of Information in the Field of Nuclear Safety and Radiation. Brazil - United States: Extension of the Agreement concerning Research and Development in Nuclear Material Control, Accountancy, Verification, Physical Protection, and Advanced Containment and Surveillance Technologies for International Safeguards Applications (2001). Czech Republic - Republic of Korea: Agreement for Co-operation in the Peaceful Uses of Nuclear Energy (2001). European Union - Russian Federation: Agreements on Nuclear Safety and Controlled Nuclear Fusion (2001). France - United States: Agreement for Co-operation in Advanced Nuclear Reactor Science and Technology (2001). Japan - United Kingdom: Co-operation Agreement on Advanced Nuclear Fuel Cycle, Fast Breeder Reactor and Other Related Technologies (2001). Republic of Korea - United States: Annex IV Joint Project on Cintichem Technology (2000). Morocco - United States: Protocol amending the Co-operation Agreement on the Peaceful Uses of Nuclear Energy (2001). Multilateral Agreements: Agreement for Information Exchange on Radiological Surveillance in Northern Europe (2001). Status of Conventions in the Field of Nuclear Energy. (author)

  11. MRI assessment of the alar ligaments in the late stage of whiplash injury - a study of structural abnormalities and observer agreement

    Energy Technology Data Exchange (ETDEWEB)

    Krakenes, J.; Moen, G.; Nordli, H. [Section of Neuroradiology, Department of Radiology, Haukeland University Hospital, University of Bergen Medical School (Norway); Kaale, B.R. [Firda Medical Centre, Sandane (Norway); Gilhus, N.E. [Department of Neurology, Haukeland University Hospital, University of Bergen Medical School (Norway); Rorvik, J. [Department of Radiology, Haukeland University Hospital, University of Bergen Medical School (Norway)

    2002-07-01

    Our aim was to characterise and classify structural changes in the alar ligaments in the late stage of whiplash injuries by use of a new MRI protocol, and to evaluate the reliability and the validity of this classification. We studied 92 whiplash-injured and 30 uninjured individuals who underwent proton density-weighted MRI of the craniovertebral junction in three orthogonal planes. Changes in the alar ligaments (grades 0-3) based on the ratio between the high signal area and the total cross-sectional area were rated twice at a 4-month interval, independently by three radiologists. Inter- and intraobserver statistics were calculated by ordinary and weighted kappa. Cases classified differently were reviewed to identify potential causes for disagreement. The alar ligaments were satisfactorily demonstrated in all cases (244 ligaments in 122 individuals). The lesions, 2-9 years after the injury, varied from small high-signal spots to high signal throughout the cross-sectional area. Signal was highest near the condylar insertion in 82 of 94 ligaments, indicating a lesion near that insertion, and near the dental insertion in eight, indicating a medial lesion. No grade 2 or 3 lesion was found in the control group. At least two observers assigned the same grade to 214 ligaments (87.7%) on the second occasion. In 30 ligaments (12.3%) this agreement was not obtained. Pair-wise interobserver agreement (weighted kappa) was fair to moderate (0.31-0.54) in the first grading, improving to moderate (0.49-0.57) in the second. Intraobserver agreement (weighted kappa) was moderate to good (0.43-0.70). Whiplash trauma can cause permanent damage to the alar ligaments, which can be shown by high-resolution proton density-weighted MRI. Reliability of classification of alar ligament lesions needs to be improved. (orig.)

  12. MRI assessment of the alar ligaments in the late stage of whiplash injury - a study of structural abnormalities and observer agreement

    International Nuclear Information System (INIS)

    Krakenes, J.; Moen, G.; Nordli, H.; Kaale, B.R.; Gilhus, N.E.; Rorvik, J.

    2002-01-01

    Our aim was to characterise and classify structural changes in the alar ligaments in the late stage of whiplash injuries by use of a new MRI protocol, and to evaluate the reliability and the validity of this classification. We studied 92 whiplash-injured and 30 uninjured individuals who underwent proton density-weighted MRI of the craniovertebral junction in three orthogonal planes. Changes in the alar ligaments (grades 0-3) based on the ratio between the high signal area and the total cross-sectional area were rated twice at a 4-month interval, independently by three radiologists. Inter- and intraobserver statistics were calculated by ordinary and weighted kappa. Cases classified differently were reviewed to identify potential causes for disagreement. The alar ligaments were satisfactorily demonstrated in all cases (244 ligaments in 122 individuals). The lesions, 2-9 years after the injury, varied from small high-signal spots to high signal throughout the cross-sectional area. Signal was highest near the condylar insertion in 82 of 94 ligaments, indicating a lesion near that insertion, and near the dental insertion in eight, indicating a medial lesion. No grade 2 or 3 lesion was found in the control group. At least two observers assigned the same grade to 214 ligaments (87.7%) on the second occasion. In 30 ligaments (12.3%) this agreement was not obtained. Pair-wise interobserver agreement (weighted kappa) was fair to moderate (0.31-0.54) in the first grading, improving to moderate (0.49-0.57) in the second. Intraobserver agreement (weighted kappa) was moderate to good (0.43-0.70). Whiplash trauma can cause permanent damage to the alar ligaments, which can be shown by high-resolution proton density-weighted MRI. Reliability of classification of alar ligament lesions needs to be improved. (orig.)

  13. Validity and inter-rater reliability of medio-lateral knee motion observed during a single-limb mini squat

    DEFF Research Database (Denmark)

    Ageberg, Eva; Bennell, Kim L; Hunt, Michael A

    2010-01-01

    Muscle function may influence the risk of knee injury and outcomes following injury. Clinical tests, such as a single-limb mini squat, resemble conditions of daily life and are easy to administer. Fewer squats per 30 seconds indicate poorer function. However, the quality of movement, such as the medio-lateral knee motion, may also be important. The aim was to validate an observational clinical test of assessing the medio-lateral knee motion, using a three-dimensional (3-D) motion analysis system. In addition, the inter-rater reliability was evaluated....

  14. CT grading scales for splenic injury: why can't we agree?

    International Nuclear Information System (INIS)

    Clark, Rhys; Hird, Kathryn

    2011-01-01

    Full text: Computed tomography (CT) has been proven to be able to accurately diagnose splenic injury. Many have published CT splenic injury grading scales to quantify the extent of injury. However, these scales have failed at predicting clinical outcomes and therefore cannot be used to accurately predict the need for intervention. We hypothesised that low interrater reliability is the reason why these scales have failed at predicting clinical outcomes. This is a retrospective study of patients who were admitted to the Royal Perth Hospital with blunt splenic injury as coded in the trauma registry. The abdominal CT images of these patients were reviewed by three consultant radiologists and were graded using the six different splenic injury grading scales. We assessed interrater reliability between each of the scales using generalised kappa and proportion of agreement calculations. Results: The images of 64 patients were reviewed. The interrater reliability yielded a generalised kappa score of 0.32-0.60 and proportion of agreement ranging from 34.4% to 65.5%. The six studied CT splenic injury grading scales did not have a high enough interrater reliability to be adequate for clinical use. The poor interrater reliability is likely to contribute to the failure of the scales at predicting clinical outcomes. Further research to improve the interrater reliability is recommended.

  15. Theoretical evaluation of the detectability of random lesions in bayesian emission reconstruction

    International Nuclear Information System (INIS)

    Qi, Jinyi

    2003-01-01

    Detecting cancerous lesions is an important task in positron emission tomography (PET). Bayesian methods based on the maximum a posteriori principle (also called penalized maximum likelihood methods) have been developed to deal with the low signal-to-noise ratio in the emission data. Similar to the filter cut-off frequency in the filtered backprojection method, the prior parameters in Bayesian reconstruction control the resolution and noise trade-off and hence affect detectability of lesions in reconstructed images. Bayesian reconstructions are difficult to analyze because the resolution and noise properties are nonlinear and object-dependent. Most research has been based on Monte Carlo simulations, which are very time consuming. Building on the recent progress on the theoretical analysis of image properties of statistical reconstructions and the development of numerical observers, here we develop a theoretical approach for fast computation of lesion detectability in Bayesian reconstruction. The results can be used to choose the optimum hyperparameter for the maximum lesion detectability. New in this work is the use of theoretical expressions that explicitly model the statistical variation of the lesion and background without assuming that the object variation is (locally) stationary. The theoretical results are validated using Monte Carlo simulations. The comparisons show good agreement between the theoretical predictions and the Monte Carlo results.

  16. Dental anatomy grading: comparison between conventional visual and a novel digital assessment technique.

    Science.gov (United States)

    Kwon, So Ran; Restrepo-Kennedy, Natalia; Dawson, Deborah V; Hernandez, Marcela; Denehy, Gerald; Blanchette, Derek; Gratton, David G; Aquilino, Steven A; Armstrong, Steven R

    2014-12-01

    Conventional grading of dental students' projects in preclinical courses has mainly relied on visual evaluation by experienced instructors. The purpose of this study was to compare conventional visual grading in a dental anatomy course at one U.S. dental school to a novel digital assessment technique. A total of sixty samples comprised of two sets of faculty wax-ups (n=30), student wax-ups (n=15), and dentoform teeth of tooth #14 (n=15) were used for this study. Two additional faculty members visually graded the samples according to a checklist and then repeated the grading after one week. The sample wax-up with the highest score based on the visual grading was selected as the master model for the digital grading, which was also performed twice with an interim period of one week. Descriptive statistics and signed rank tests for systematic bias were used for intra- and interrater comparisons. The intraclass correlation (ICC) was used as a measure of intra- and interrater reliability. None of the faculty members achieved the minimum acceptable intrarater agreement of 0.8. Interrater agreement was substantially less than intrarater agreement for the visual grading, whereas all measures of intrarater agreement were greater than 0.9 and considered excellent for the digital assessment technique. These results suggest that visual grading is limited by modest intrarater reliability and low interrater agreement. Digital grading is a promising evaluation method showing excellent intrarater reliability and correlation. Correlation for visual and digital grading was consistently modest, partly supporting the potential use of digital technology in dental anatomy grading.

  17. Inter-rater reliability of the South African Triage Scale: Assessing two different cadres of health care workers in a real time environment

    Directory of Open Access Journals (Sweden)

    Michèle Twomey

    2011-09-01

    Conclusion: The inter-rater reliability of SATS ratings is excellent within individual HCWs, but significantly lower between different HCWs. This confirms previous reliability studies of the SATS using vignettes and if validated by larger studies would support the feasibility of further implementation of the SATS in primary health care settings across the Western Cape.

  18. Breast imaging reporting and data system (BI-RADS) US lexicon and final assessment category for solid breast masses: the rates of inter-and intraobserver agreement

    International Nuclear Information System (INIS)

    Lee, Eun Hye; Cha, Joo Hee; Koh, Young Hwan; Youn, Byung Jae; Cho, Byung Jae; Moon, Woo Kyung

    2007-01-01

    To evaluate the rates of inter- and intraobserver agreement of the BI-RADS US lexicon. Two radiologists reviewed 60 sonograms of solid breast masses to evaluate interobserver agreement. After four weeks, the radiologists reinterpreted the series to evaluate the intraobserver agreement. The radiologists described shape, orientation, margin, lesion boundary, echo pattern, posterior acoustic features and microcalcifications. Final assessment categories and management plans were suggested for each case. The rates of inter- and intraobserver agreement were measured by the use of kappa statistics. Interobserver agreement ranged from the highest for orientation (κ = 0.65) and shape (κ = 0.61) to the lowest for posterior acoustic features (κ = 0.42). For the final assessment categories (κ = 0.46) and management (κ = 0.49), interobserver agreement was moderate. Intraobserver agreement ranged from the highest for microcalcifications in mass (κ = 0.90, 0.82) and orientation (κ = 0.87, 0.83) to the lowest for echo patterns (κ = 0.62, 0.57) and posterior acoustic features (κ = 0.59, 0.65). In the final assessment category and management, intraobserver agreement was substantial or nearly complete (κ = 0.65-0.83). Inter- and intraobserver agreement varied across the descriptors of the BI-RADS US lexicon for solid breast masses. Among them, margin and lesion boundary showed lower agreement. A modification of the BI-RADS US lexicon with more detailed guidelines, followed by continuous education, is suggested.

  19. Fully automatic detection of deep white matter T1 hypointense lesions in multiple sclerosis

    Science.gov (United States)

    Spies, Lothar; Tewes, Anja; Suppa, Per; Opfer, Roland; Buchert, Ralph; Winkler, Gerhard; Raji, Alaleh

    2013-12-01

    A novel method is presented for fully automatic detection of candidate white matter (WM) T1 hypointense lesions in three-dimensional high-resolution T1-weighted magnetic resonance (MR) images. By definition, T1 hypointense lesions have similar intensity as gray matter (GM) and thus appear darker than surrounding normal WM in T1-weighted images. The novel method uses a standard classification algorithm to partition T1-weighted images into GM, WM and cerebrospinal fluid (CSF). As a consequence, T1 hypointense lesions are assigned an increased GM probability by the standard classification algorithm. The GM component image of a patient is then tested voxel-by-voxel against GM component images of a normative database of healthy individuals. Clusters (≥0.1 ml) of significantly increased GM density within a predefined mask of deep WM are defined as lesions. The performance of the algorithm was assessed on voxel level by a simulation study. A maximum dice similarity coefficient of 60% was found for a typical T1 lesion pattern with contrasts ranging from WM to cortical GM, indicating substantial agreement between ground truth and automatic detection. Retrospective application to 10 patients with multiple sclerosis demonstrated that 93 out of 96 T1 hypointense lesions were detected. On average 3.6 false positive T1 hypointense lesions per patient were found. The novel method is promising to support the detection of hypointense lesions in T1-weighted images which warrants further evaluation in larger patient samples.
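
    The Dice similarity coefficient reported above compares two segmentations voxel by voxel. The following is a minimal sketch, assuming each segmentation is given as a collection of voxel coordinates; the function name and the toy masks are hypothetical and are not taken from the study.

```python
def dice_coefficient(voxels_a, voxels_b):
    """Dice similarity coefficient between two sets of lesion voxels.

    Each argument is an iterable of hashable voxel coordinates, e.g. (x, y, z).
    Returns 2*|A & B| / (|A| + |B|), a value between 0 and 1.
    """
    a = set(voxels_a)
    b = set(voxels_b)
    if not a and not b:
        # Two empty segmentations agree perfectly (a convention of this sketch).
        return 1.0
    return 2.0 * len(a & b) / (len(a) + len(b))


# Toy example: ground truth and automatic detection share 3 of 5 voxels each.
ground_truth = [(0, 0, 0), (0, 0, 1), (0, 1, 0), (1, 0, 0), (1, 1, 0)]
detected = [(0, 0, 0), (0, 0, 1), (0, 1, 0), (2, 2, 2), (3, 3, 3)]
print(dice_coefficient(ground_truth, detected))
```

    A coefficient of 1 means the two masks coincide exactly and 0 means they do not overlap at all.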

  20. A novel algorithm for a precise analysis of subchondral bone alterations

    Science.gov (United States)

    Gao, Liang; Orth, Patrick; Goebel, Lars K. H.; Cucchiarini, Magali; Madry, Henning

    2016-01-01

    Subchondral bone alterations are emerging as considerable clinical problems associated with articular cartilage repair. Their analysis exposes a pattern of variable changes, including intra-lesional osteophytes, residual microfracture holes, peri-hole bone resorption, and subchondral bone cysts. A precise distinction between them is becoming increasingly important. Here, we present a tailored algorithm based on continuous data to analyse subchondral bone changes using micro-CT images, allowing for a clear definition of each entity. We evaluated this algorithm using data sets originating from two large animal models of osteochondral repair. Intra-lesional osteophytes were detected in 3 of 10 defects in the minipig and in 4 of 5 defects in the sheep model. Peri-hole bone resorption was found in 22 of 30 microfracture holes in the minipig and in 17 of 30 microfracture holes in the sheep model. Subchondral bone cysts appeared in 1 microfracture hole in the minipig and in 5 microfracture holes in the sheep model (n = 30 holes each). Calculation of inter-rater agreement (90% agreement) and Cohen’s kappa (kappa = 0.874) revealed that the novel algorithm is highly reliable, reproducible, and valid. Comparison analysis with the best existing semi-quantitative evaluation method was also performed, supporting the enhanced precision of this algorithm. PMID:27596562
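
    Percent agreement and Cohen's kappa, the two statistics quoted above, differ in that kappa corrects for agreement expected by chance. The sketch below shows the standard unweighted calculation for two raters; the labels and ratings are invented for illustration and do not reproduce the study data.

```python
from collections import Counter


def percent_agreement(ratings_a, ratings_b):
    """Fraction of cases on which the two raters give the same label."""
    if len(ratings_a) != len(ratings_b) or len(ratings_a) == 0:
        raise ValueError("both raters must rate the same non-empty set of cases")
    matches = sum(1 for a, b in zip(ratings_a, ratings_b) if a == b)
    return matches / len(ratings_a)


def cohens_kappa(ratings_a, ratings_b):
    """Unweighted Cohen's kappa: (p_o - p_e) / (1 - p_e)."""
    n = len(ratings_a)
    p_o = percent_agreement(ratings_a, ratings_b)

    # Chance agreement from each rater's marginal label frequencies.
    freq_a = Counter(ratings_a)
    freq_b = Counter(ratings_b)
    p_e = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)

    if p_e == 1.0:
        raise ValueError("kappa is undefined when both raters use one identical label")
    return (p_o - p_e) / (1.0 - p_e)


# Hypothetical ratings of ten microfracture holes by two raters.
rater_1 = ["cyst"] * 5 + ["none"] * 5
rater_2 = ["cyst"] * 4 + ["none"] * 6
print(percent_agreement(rater_1, rater_2), cohens_kappa(rater_1, rater_2))
```

    In this toy example the raters agree on 9 of 10 cases, yet kappa is lower than the raw agreement because half of the agreement would be expected by chance alone.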

  1. Effects of Rating Training on Inter-Rater Consistency for Developing a Dental Hygiene Clinical Rater Qualification System

    Directory of Open Access Journals (Sweden)

    Jeong Ran Park

    2007-12-01

    We tried to develop itemized evaluation criteria and a clinical rater qualification system through rating training of inter-rater consistency for experienced clinical dental hygienists and dental hygiene clinical educators. A total of 15 clinical dental hygienists with 1-year careers participated as clinical examination candidates, while 5 dental hygienists with 3-year educations and clinical careers or longer participated as clinical raters. They all took the clinical examination as examinees. The results were compared, and the consistency of competence was measured. The comparison of clinical competence between candidates and clinical raters showed that the candidate group's mean clinical competence ranged from 2.96 to 3.55 on a 5-point system in a total of 3 instruments (Probe, Explorer, Curet), while the clinical rater group's mean clinical competence ranged from 4.05 to 4.29. There was a higher inter-rater consistency after education of raters in the following 4 items: Probe, Explorer, Curet, and insertion on distal surface. The mean score distribution of clinical raters ranged from 75% to 100%, which was more uniform in the competence to detect an artificial calculus than that of candidates (25% to 100%). These results indicate the need to operate a clinical rater qualification system for comprehensive dental hygiene clinicians. Furthermore, in order to execute the clinical rater qualification system, it will be necessary to keep conducting a series of studies on educational content, time, frequency, and educator level.

  2. A Spanish validation of the Coma Recovery Scale-Revised (CRS-R).

    Science.gov (United States)

    Tamashiro, Mercedes; Rivas, Maria Elisa; Ron, Melania; Salierno, Fernando; Dalera, Marisol; Olmos, Lisandro

    2014-01-01

    Analysis of inter-rater reliability and concurrent validity. To determine measurement properties of a Spanish version of The Coma Recovery Scale-Revised (CRS-R). A sample of 35 in-patients with severe acquired brain injury. To test concurrent validity of the translated scale, the Glasgow Coma Scale (GCS) and Disability Rating Scale (DRS) were also administered. Two experts in the field were recruited to assess inter-rater agreement. Inter-rater reliability was good for total CRS-R scores (Cronbach α = 0.973, p = 0.001). Sub-scale analysis showed moderate-to-high inter-rater agreement. Total CRS-R scores correlated significantly (p < 0.05) with total GCS (r = 0.74) and DRS (r = 0.54) scores, indicating acceptable concurrent validity. The Spanish version of CRS-R can be administered reliably by trained and experienced examiners. CRS-R appears capable of differentiating patients in Emergence from Minimally Conscious State (EMCS) or in Minimally Conscious State (MCS) from those in a Vegetative State (VS).

  3. Performance of five research-domain automated WM lesion segmentation methods in a multi-center MS study

    DEFF Research Database (Denmark)

    de Sitter, Alexandra; Steenwijk, Martijn D; Ruet, Aurélie

    2017-01-01

    (Lesion-TOADS); and k-Nearest Neighbor with Tissue Type Priors (kNN-TTP). Main software parameters were optimized using a training set (N = 18), and formal testing was performed on the remaining patients (N = 52). To evaluate volumetric agreement with the reference segmentations, intraclass correlation ...... BACKGROUND AND PURPOSE: In vivo identification of white matter lesions plays a key role in evaluation of patients with multiple sclerosis (MS). Automated lesion segmentation methods have been developed to substitute manual outlining, but evidence of their performance in multi-center investigations ...... -one-center-out design to exclude the center of interest from the training phase to evaluate the performance of the method on an 'unseen' center. RESULTS: Compared to the reference mean lesion volume (4.85 ± 7.29 mL), the methods displayed a mean difference of 1.60 ± 4.83 (Cascade), 2.31 ± 7.66 (LGA), 0.44 ± 4.68 (LPA), 1...

  4. Three-dimensional computed tomography measurement accuracy of varying Hill-Sachs lesion size.

    Science.gov (United States)

    Ho, Anthony; Kurdziel, Michael D; Koueiter, Denise M; Wiater, J Michael

    2018-02-01

    The glenoid track concept has been proposed to correlate shoulder stability with bone loss. Accurate assessment of Hill-Sachs lesion size preoperatively may affect surgical planning and postoperative outcomes; however, no measurement method has been universally accepted. This study aimed to assess the accuracy and reliability of measuring Hill-Sachs lesion sizes using 3-dimensional (3D) computed tomography (CT). Nine polyurethane humerus bone substitutes were used to create Hill-Sachs lesions of varying sizes with a combination of lesion depth (shallow, intermediate, and deep) and width (small, medium, and large). Specimens were scanned with a clinical CT scanner for size measurements and a micro-CT scanner for measurement of true lesion size. Six evaluators repeated measurements twice in a 2-week interval. Scans were measured by use of 3D CT reconstructions for length, width, and Hill-Sachs interval and with use of 2D CT for depth. The interclass correlation coefficient evaluated interobserver and intraobserver variability and percentage error, and Student t-tests assessed measurement accuracy. Interclass correlation coefficient reliability demonstrated strong agreement for all variables measured (0.856-0.975). Percentage error between measured length and measured depth and the true measurement significantly varied with respect to both lesion depth (P = .003 and P = .005, respectively) and lesion size (P = .049 and P = .004, respectively). The 3D CT imaging is effective and reproducible in determining lesion size. Determination of Hill-Sachs interval width is also reliable when it is applied to the glenoid track concept. Measured values on 3D and 2-dimensional imaging using a conventional CT scanner may slightly underestimate true measurements. Copyright © 2017 Journal of Shoulder and Elbow Surgery Board of Trustees. Published by Elsevier Inc. All rights reserved.

  5. Evaluation of agreement between transvaginal ultrasonography and magnetic resonance imaging of the pelvis in deep endometriosis with emphasis on intestinal involvement

    International Nuclear Information System (INIS)

    Cardoso, Maene Marcondes; Coutinho Junior, Antonio Carlos; Domingues, Marisa Nassar Aidar; Werner Junior, Heron

    2009-01-01

    Objective: To compare sonographic and magnetic resonance imaging findings in deep endometriosis with emphasis on intestinal involvement. Materials and methods: Eighteen women aged between 23 and 49 years with clinical suspicion and gynecological signs suggestive of deep endometriosis were submitted to ultrasonography and magnetic resonance imaging for correlation between findings. Results: Ultrasonography detected 40 lesions while magnetic resonance imaging detected 53 lesions in the pelvis. A comparative study has not shown any statistically significant intermethod difference in the detection of lesions (respectively p > 0.19 and p > 0.14). In the rectosigmoid junction, magnetic resonance imaging has detected one (5.6%) lesion, while ultrasonography has detected four lesions (22.2%). In the rectum, ultrasonography has detected eight lesions (44.4%), and magnetic resonance imaging has detected seven lesions (38.9%). Conclusion: The intermethod agreement has not been good for lesions in the rectosigmoid junction, considering that ultrasonography has detected a higher number of lesions in this region, but a lower number of lesions in the pelvis as compared with magnetic resonance imaging. The global comparative analysis has demonstrated no statistically significant intermethod difference in the detection of lesions. Low cost, good tolerability and high availability make ultrasonography a valuable diagnostic tool in cases of deep endometriosis. (author)

  6. Multiparametric MRI of the prostate: diagnostic performance and interreader agreement of two scoring systems.

    Science.gov (United States)

    Lin, Wei-Ching; Muglia, Valdair F; Silva, Gyl E B; Chodraui Filho, Salomão; Reis, Rodolfo B; Westphalen, Antonio C

    2016-06-01

    To compare the diagnostic accuracies and interreader agreements of the Prostate Imaging Reporting and Data System (PI-RADS) v. 2 and University of California San Francisco (UCSF) multiparametric prostate MRI scale for diagnosing clinically significant prostate cancer. This institutional review board-approved retrospective study included 49 males who had 1.5 T endorectal MRI and prostatectomy. Two radiologists scored suspicious lesions on MRI using PI-RADS v. 2 and the UCSF scale. Percent agreement, 2 × 2 tables and the area under the receiver operating characteristic curves (Az) were used to assess and compare the individual and overall scores of these scales. Interreader agreements were estimated with kappa statistics. Reader 1 (R1) detected 78 lesions, and Reader 2 (R2) detected 80 lesions. Both identified 52 of 65 significant cancers. The Az for PI-RADS v. 2 and UCSF scale for R1 were 0.68 and 0.69 [T2 weighted imaging (T2WI)], 0.75 and 0.68 [diffusion-weighted imaging (DWI)] and 0.64 and 0.72 (overall score), respectively, and were 0.72 and 0.75 (T2WI), 0.73 and 0.67 (DWI) and 0.66 and 0.75 (overall score) for R2. The dynamic contrast-enhanced percent agreements between scales were 100% (R1) and 95% (R2). PI-RADS v. 2 DWI of R1 performed better than UCSF DWI (Az = 0.75 vs Az = 0.68; p = 0.05); no other differences were found. The interreader agreements were higher for PI-RADS v. 2 (T2WI: 0.56 vs 0.42; DWI: 0.60 vs 0.46; overall: 0.61 vs 0.42). The UCSF approach to derive the overall PI-RADS v. 2 scores increased the Az for the identification of significant cancer (R1 to 0.76, p < 0.05; R2 to 0.71, p = 0.35). Although PI-RADS v. 2 DWI score may have a higher discriminatory performance than the UCSF scale counterpart to diagnose clinically significant cancer, the utilization of the UCSF scale weighing system for the integration of PI-RADS v. 2 individual parameter scores improved the accuracy of its overall score. PI-RADS v. 2 is

  7. Impact of Lesion Length on Functional Significance in Intermediate Coronary Lesions

    Directory of Open Access Journals (Sweden)

    Morteza Safi

    2017-07-01

    Introduction: The present study aimed at assessing the role of lesion length in predicting Fractional Flow Reserve (FFR) value for physiological evaluation of intermediate coronary lesions. Methods: In the current study, 68 patients with 83 coronary lesions were enrolled. All of the patients in this study underwent routine coronary angiography, according to appropriate indications. To evaluate physiologically significant intermediate coronary stenosis (defined as between 40% and 70% on visual estimation), the Fractional Flow Reserve (FFR) study was performed and the Quantitative Coronary Angiography (QCA) data were also assessed for measurement of lesion length. The correlation between QCA data and FFR values was also examined. Results: Eighty-three lesions were evaluated from 68 patients. Stenosis was considered physiologically significant when FFR was lower than 0.75. The FFR was significant in twelve lesions (14.5%). There was a negative correlation between FFR value and lesion length (r = -0.294 and P = 0.013). Moreover, lesion length in the physiologically significant FFR group (21.07 ± 6.9) was greater than that of the non-significant FFR group (15.23 ± 6.5) (P value < 0.05). Furthermore, the correlation between QCA data and FFR values was also investigated, yet there was only a positive correlation between FFR and Minimum Luminal Diameter (MLD) values (r = 0.248 and P value = 0.04). The Receiver Operating Characteristic (ROC) curve analysis for predicting the significant FFR value demonstrated that a lesion length greater than 17.5 mm was the best cut-off point for prediction of the significant FFR value with acceptable sensitivity and specificity of 83.3% and 68.8%, respectively. Conclusions: There is a negative correlation between lesion length and FFR value in intermediate coronary lesions. In addition, a lesion length greater than 17.5 mm is the best cut-off point for prediction of significant FFR values.
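
    The sensitivity and specificity quoted for the 17.5 mm cut-off come from classifying each lesion as test-positive when its length exceeds the threshold. The sketch below shows that calculation in general form; the lesion lengths and FFR labels are invented for illustration and are not the study data.

```python
def sensitivity_specificity(values, is_positive, cutoff):
    """Sensitivity and specificity of the rule 'value > cutoff predicts positive'.

    values      : measured quantity per case (here: hypothetical lesion lengths in mm)
    is_positive : reference standard per case (here: whether FFR was significant)
    """
    tp = fp = tn = fn = 0
    for value, positive in zip(values, is_positive):
        predicted = value > cutoff
        if positive and predicted:
            tp += 1
        elif positive and not predicted:
            fn += 1
        elif not positive and predicted:
            fp += 1
        else:
            tn += 1
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative case")
    return tp / (tp + fn), tn / (tn + fp)


# Invented example data: eight lesions, four with significant FFR.
lengths_mm = [22.0, 19.5, 16.0, 25.1, 12.0, 18.0, 14.5, 10.2]
significant_ffr = [True, True, True, True, False, False, False, False]
print(sensitivity_specificity(lengths_mm, significant_ffr, cutoff=17.5))
```

    An ROC analysis repeats this calculation over all candidate cut-offs and selects the one giving the preferred trade-off between the two quantities.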

  8. Test–re-test reliability and inter-rater reliability of a digital pelvic inclinometer in young, healthy males and females

    OpenAIRE

    Chris Beardsley; Tim Egerton; Brendon Skinner

    2016-01-01

    Objective. The purpose of this study was to investigate the reliability of a digital pelvic inclinometer (DPI) for measuring sagittal plane pelvic tilt in 18 young, healthy males and females. Method. The inter-rater reliability and test–re-test reliabilities of the DPI for measuring pelvic tilt in standing on both the right and left sides of the pelvis were measured by two raters carrying out two rating sessions of the same subjects, three weeks apart. Results. For measuring pel...

  9. Agreement Between an Automated Volume Breast Scanner and Handheld Ultrasound for Diagnostic Breast Examinations.

    Science.gov (United States)

    Barr, Richard G; DeVita, Robert; Destounis, Stamatia; Manzoni, Federica; De Silvestri, Annalisa; Tinelli, Carmine

    2017-10-01

    To compare the agreement and interobserver variability of diagnostic handheld ultrasound (US) and a single volume on an automated breast volume scanner (ABVS) and to determine whether there was a significant difference if the ABVS was used by a sonographer or mammographic technologist. Ninety patients scheduled for diagnostic US examinations were randomized to either handheld US or the ABVS first. The ABVS was randomized between a sonographer and a mammographic technologist performing the study. The studies were blinded, randomized, and read by 2 radiologists. The lesion with the highest Breast Imaging Reporting and Data System (BI-RADS) score was used in the analysis. Final diagnoses were made by core biopsy or follow-up for 2 years. Lesions included 9 malignant and 81 benign. The 90 patients had a mean age ± SD of 53.1 ± 16.3 years. The κ value for agreement between the ABVS and handheld US was 0.831 (95% confidence interval, 0.744-0.925), whereas the global agreement for a 7-point BI-RADS score was 0.488 (0.372-0.560). The agreement between the ABVS and handheld US was nearly the same when the ABVS was used by a mammographic technologist (κ = 0.858 [0.723-0.963]) or sonographer (κ = 0.803 [0.596-1.000]; P = .47). The areas under the receiver operating characteristic curves for characterization by the ABVS were 0.91 (0.84-0.96) for reader 1 and 0.91 (0.83-0.96) for reader 2; those for handheld US were 0.91 (0.84-0.96) for reader 1 and 0.83 (0.74-0.90) for reader 2, with no statistical difference. The agreement based on pathologic images was κ = 0.831 (0.718-0.944); for handheld US, κ = 0.795 (0.623-0.967); and for the ABVS, κ = 0.869 (0.725-1.000). Performing a single-view diagnostic ABVS examination has good agreement with a handheld diagnostic US workup. There is no difference if the ABVS is used by a sonographer or mammographic technologist. © 2017 by the American Institute of Ultrasound in Medicine.

  10. Assessment of pituitary micro-lesions using 3D sampling perfection with application-optimized contrasts using different flip-angle evolutions.

    Science.gov (United States)

    Wang, Jing; Wu, Yue; Yao, Zhenwei; Yang, Zhong

    2014-12-01

    The aim of this study was to explore the value of the three-dimensional sampling perfection with application-optimized contrasts using different flip-angle evolutions (3D-SPACE) sequence in the assessment of pituitary micro-lesions. Coronal 3D-SPACE as well as routine T1- and dynamic contrast-enhanced (DCE) T1-weighted images of the pituitary gland were acquired in 52 patients (48 women and four men; mean age, 32 years; age range, 17-50 years) with clinically suspected pituitary abnormality at 3.0 T, retrospectively. The interobserver agreement of assessment results was analyzed with κ statistics. Qualitative analyses were compared using the Wilcoxon signed-rank test. Interobserver agreement of the independent evaluations was good for 3D-SPACE images (κ = 0.892) and fair for routine MR images (κ = 0.649). At 3.0 T, 3D-SPACE provided significantly better images than routine MR images in terms of the boundary of the pituitary gland, definition of pituitary lesions, and overall image quality. The evaluation of pituitary micro-lesions using combined routine and 3D-SPACE MR imaging was superior to that using only routine or 3D-SPACE imaging. The 3D-SPACE sequence can be used for appropriate and successful evaluation of the pituitary gland. We suggest that the 3D-SPACE sequence is a powerful supplemental sequence in MR examinations of suspected pituitary micro-lesions.

  11. Human papillomavirus in oral lesions Virus papiloma humano en lesiones orales

    Directory of Open Access Journals (Sweden)

    Joaquín V. Gónzalez

    2007-08-01

    Growing evidence suggests a role for human papillomavirus (HPV) in oral cancer; however, its involvement is still controversial. This study evaluates the frequency of HPV DNA in a variety of oral lesions in patients from Argentina. A total of 77 oral tissue samples from 66 patients were selected (cases); the clinical-histopathological diagnoses corresponded to: 11 HPV-associated benign lesions, 8 non-HPV-associated benign lesions, 33 premalignant lesions and 25 cancers. Sixty exfoliated cell samples from normal oral mucosa were used as controls. HPV detection and typing were performed by polymerase chain reaction (PCR) using primers MY09/11 combined with RFLP or, alternatively, PCR using primers GP5+/6+ combined with dot blot hybridization. HPV was detected in 91.0% of HPV-associated benign lesions, 14.3% of non-HPV-associated benign lesions, 51.5% of preneoplasias and 60.0% of cancers. No control sample tested HPV positive. In benign HPV-associated lesions, 30.0% of HPV-positive samples harbored high-risk types, while in preneoplastic lesions the value rose to 59.9%. In cancer lesions, HPV detection in verrucous carcinoma was 88.9% and in squamous cell carcinoma 43.8%, with high-risk type rates of 75.5% and 85.6%, respectively. The high HPV frequency detected in preneoplastic and neoplastic lesions supports an HPV etiological role in at least a subset of oral cancers.

  12. OPTION(5) versus OPTION(12) instruments to appreciate the extent to which healthcare providers involve patients in decision-making.

    Science.gov (United States)

    Stubenrouch, Fabienne E; Pieterse, Arwen H; Falkenberg, Rijan; Santema, T Katrien B; Stiggelbout, Anne M; van der Weijden, Trudy; Aarts, J Annemijn W M; Ubbink, Dirk T

    2016-06-01

    The 12-item "observing patient involvement" (OPTION(12))-instrument is commonly used to assess the extent to which healthcare providers involve patients in health-related decision-making. The five-item version (OPTION(5)) claims to be a more efficient measure. In this study we compared the Dutch versions of the OPTION-instruments in terms of inter-rater agreement and correlation in outpatient doctor-patient consultations in various settings, to learn if we can safely switch to the shorter OPTION(5)-instrument. Two raters coded 60 audiotaped vascular surgery and oncology patient consultations using OPTION(12) and OPTION(5). Unweighted Cohen's kappa was used to compute inter-rater agreement on item-level. The association between the total scores of the two OPTION-instruments was investigated using Pearson's correlation coefficient (r) and a Bland & Altman plot. After fine-tuning the OPTION-manuals, inter-rater agreement for OPTION(12) and OPTION(5) was good to excellent (kappa range 0.69-0.85 and 0.63-0.72, respectively). Mean total scores were 23.7 (OPTION(12); SD=7.8) and 39.3 (OPTION(5); SD=12.7). Correlation between the total scores was high (r=0.71; p=0.01). OPTION(5) scored systematically higher with a wider range than OPTION(12). Both OPTION-instruments had a good inter-rater agreement and correlated well. OPTION(5) seems to differentiate better between various levels of patient involvement. The OPTION(5)-instrument is recommended for clinical application. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
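
    A Bland & Altman analysis, as mentioned above, summarises how far two instruments differ on the same cases by the mean difference (bias) and the 95% limits of agreement. The following is a minimal sketch of that calculation; the paired scores are invented and the function name is hypothetical.

```python
import statistics


def bland_altman_limits(scores_a, scores_b):
    """Return (bias, lower_limit, upper_limit) for paired measurements.

    bias   : mean of the pairwise differences (a - b)
    limits : bias +/- 1.96 * sample standard deviation of the differences
    """
    if len(scores_a) != len(scores_b) or len(scores_a) < 2:
        raise ValueError("need at least two paired measurements")
    differences = [a - b for a, b in zip(scores_a, scores_b)]
    bias = statistics.mean(differences)
    spread = 1.96 * statistics.stdev(differences)
    return bias, bias - spread, bias + spread


# Invented paired total scores from two instruments on four consultations.
instrument_a = [1.0, 2.0, 3.0, 4.0]
instrument_b = [2.0, 2.0, 4.0, 5.0]
print(bland_altman_limits(instrument_a, instrument_b))
```

    A bias that differs clearly from zero indicates that one instrument scores systematically higher than the other, which a correlation coefficient alone would not reveal.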

  13. Intraosseous osteolytic lesions

    Energy Technology Data Exchange (ETDEWEB)

    Adler, C.P.; Wenz, W.

    1981-10-01

    Any pathological damage occurring in a bone will produce either an osteolytic or osteosclerotic lesion which can be seen in the macroscopic specimen as well as in the roentgenogram. Various bone lesions may lead to local destruction of the bone. An osteoma or osteoplastic osteosarcoma produces an osteosclerotic lesion showing a dense mass in the roentgenogram; a chondroblastoma or an osteoclastoma, on the other hand, induces an osteolytic focal lesion. This paper presents examples of different osteolytic lesions of the humerus. An osteolytic lesion seen in the roentgenogram may be produced by an underlying non-ossifying fibroma of the bone, fibrous dysplasia, osteomyelitis or Ewing's sarcoma. Differential diagnostic considerations based on the radiological picture include eosinophilic bone granuloma, juvenile or aneurysmal bone cyst, multiple myeloma or bone metastases. Serious differential diagnostic problems may be involved in case of osteolytic lesions occurring in the humerus. Cases of this type involving complications have been reported and include the presence of a telangiectatic osteosarcoma as well as that of a hemangiosarcoma of the bone.

  14. Periodontal bone lesions

    International Nuclear Information System (INIS)

    Linden, L.W.J. van der.

    1985-01-01

    In the course of life the periodontium is subject to changes which may be physiological or pathological. Intraoral radiographs give insight into the hard structures of the dentomaxillary region and provide information on lesions in the bone of the periodontium in that they show radiopacities and radiolucencies caused by such lesions. In this thesis the relation is investigated between the true shape and dimensions of periodontal bone lesions and their radiographic images. A method of making standardized and reproducible radiographs suitable for longitudinal studies of periodontal lesions is developed and tested. The possibility of an objective and reproducible interpretation of radiographic characteristics of periodontal bone lesions is also demonstrated. (Auth.)

  15. Reliability and reproducibility of subaxial cervical injury description system: a standardized nomenclature schema.

    Science.gov (United States)

    Bono, Christopher M; Schoenfeld, Andrew; Gupta, Giri; Harrop, James S; Anderson, Paul; Patel, Alpesh A; Dimar, John; Aarabi, Bizhan; Dailey, Andrew; Vaccaro, Alexander R; Gahr, Ralf; Shaffrey, Christopher; Anderson, David G; Rampersaud, Raj

    2011-08-01

    Radiographic measurement study. To develop a standardized cervical injury nomenclature system to facilitate description, communication, and classification among health care providers. The reliability and reproducibility of this system was then examined. Description of subaxial cervical injuries is critical for treatment decision making and comparing scientific reports of outcomes. Despite a number of available classification systems, surgeons, and researchers continue to use descriptive nomenclature, such as "burst" and "teardrop" fractures, to describe injuries. However, there is considerable inconsistency with use of such terms in the literature. Eleven distinct injury types and associated definitions were established for the subaxial cervical spine and subsequently refined by members of the Spine Trauma Study Group. A series of 18 cases of patients with a broad spectrum of subaxial cervical spine injuries was prepared and distributed to surgeon raters. Each rater was provided with the full nomenclature document and asked to select primary and secondary injury types for each case. After receipt of the raters' first round of classifications, the cases were resorted and returned to the raters for a second round of review. Interrater and intrarater reliabilities were calculated as percent agreement and Cohen kappa (κ) values. Intrarater reliability was assessed by comparing a given rater's diagnosis from the first and second rounds. Nineteen surgeons completed the first and second rounds of the study. Overall, the system demonstrated 56.4% interrater agreement and 72.8% intrarater agreement. Overall, interrater κ values demonstrated moderate agreement while intrarater κ values showed substantial agreement. Analyzed by injury types, only four (burst fractures, lateral mass fractures, flexion teardrop fractures, and anterior distraction injuries) demonstrated greater than 50% interrater agreement. This study demonstrated that, even in ideal circumstances, there is

  16. Prospective comparison of 3D FIESTA versus fat-suppressed 3D SPGR MRI in evaluating knee cartilage lesions

    International Nuclear Information System (INIS)

    Li, X.; Yu, C.; Wu, H.; Daniel, K.; Hu, D.; Xia, L.; Pan, C.; Xu, A.; Hu, J.; Wang, L.; Peng, W.; Li, F.

    2009-01-01

    Aim: To prospectively compare the accuracy of three-dimensional fast imaging employing steady-state acquisition (3D FIESTA) sequences with that of fat-suppressed three-dimensional spoiled gradient-recalled (3D SPGR) in the diagnosis of knee articular cartilage lesions, using arthroscopy as the reference standard. Materials and methods: Fifty-eight knees in 54 patients (age range 21-82 years; mean 36 years) were prospectively evaluated by using sagittal 3D FIESTA and sagittal fat-suppressed 3D SPGR sequences. Articular cartilage lesions were graded on MRI and during arthroscopy with a modified Noyes scoring system. Sensitivity, specificity, and accuracy were assessed. Interobserver agreement was determined with κ statistics. Results: The performance of 3D FIESTA sequences (sensitivity, specificity, and accuracy were 80, 94, and 92%, respectively, for reader 1 and 76, 94, and 90%, respectively, for reader 2) was similar to that of fat-suppressed 3D SPGR sequences (sensitivity, specificity, and accuracy were 82, 92, and 90%, respectively, for reader 1 and 82, 90, and 88%, respectively, for reader 2) in the detection of knee articular cartilage lesions. The interobserver agreement varied from fair to good to excellent (kappa values from 0.43-0.83). Conclusion: 3D FIESTA has good diagnostic performance, comparable with fat-suppressed 3D SPGR in evaluating knee cartilage lesions, and it can be incorporated into routine knee MRI protocols due to the short acquisition time.

  17. Prospective comparison of 3D FIESTA versus fat-suppressed 3D SPGR MRI in evaluating knee cartilage lesions

    Energy Technology Data Exchange (ETDEWEB)

    Li, X.; Yu, C. [Department of Radiology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei 430030 (China); Wu, H. [Department of Orthopaedics, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei 430030 (China)], E-mail: lilyboston2002@163.com; Daniel, K. [Department of Radiology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115 (United States); Hu, D.; Xia, L.; Pan, C.; Xu, A.; Hu, J.; Wang, L.; Peng, W. [Department of Radiology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei 430030 (China); Li, F. [Department of Orthopaedics, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei 430030 (China)

    2009-10-15

    Aim: To prospectively compare the accuracy of three-dimensional fast imaging employing steady-state acquisition (3D FIESTA) sequences with that of fat-suppressed three-dimensional spoiled gradient-recalled (3D SPGR) in the diagnosis of knee articular cartilage lesions, using arthroscopy as the reference standard. Materials and methods: Fifty-eight knees in 54 patients (age range 21-82 years; mean 36 years) were prospectively evaluated by using sagittal 3D FIESTA and sagittal fat-suppressed 3D SPGR sequences. Articular cartilage lesions were graded on MRI and during arthroscopy with a modified Noyes scoring system. Sensitivity, specificity, and accuracy were assessed. Interobserver agreement was determined with κ statistics. Results: The performance of 3D FIESTA sequences (sensitivity, specificity, and accuracy were 80, 94, and 92%, respectively, for reader 1 and 76, 94, and 90%, respectively, for reader 2) was similar to that of fat-suppressed 3D SPGR sequences (sensitivity, specificity, and accuracy were 82, 92, and 90%, respectively, for reader 1 and 82, 90, and 88%, respectively, for reader 2) in the detection of knee articular cartilage lesions. The interobserver agreement varied from fair to good to excellent (kappa values from 0.43-0.83). Conclusion: 3D FIESTA has good diagnostic performance, comparable with fat-suppressed 3D SPGR in evaluating knee cartilage lesions, and it can be incorporated into routine knee MRI protocols due to the short acquisition time.

  18. Training less-experienced faculty improves reliability of skills assessment in cardiac surgery.

    Science.gov (United States)

    Lou, Xiaoying; Lee, Richard; Feins, Richard H; Enter, Daniel; Hicks, George L; Verrier, Edward D; Fann, James I

    2014-12-01

    Previous work has demonstrated high inter-rater reliability in the objective assessment of simulated anastomoses among experienced educators. We evaluated the inter-rater reliability of less-experienced educators and the impact of focused training with a video-embedded coronary anastomosis assessment tool. Nine less-experienced cardiothoracic surgery faculty members from different institutions evaluated 2 videos of simulated coronary anastomoses (1 by a medical student and 1 by a resident) at the Thoracic Surgery Directors Association Boot Camp. They then underwent a 30-minute training session using an assessment tool with embedded videos to anchor rating scores for 10 components of coronary artery anastomosis. Afterward, they evaluated 2 videos of a different student and resident performing the task. Components were scored on a 1 to 5 Likert scale, yielding an average composite score. Inter-rater reliabilities of component and composite scores were assessed using intraclass correlation coefficients (ICCs) and overall pass/fail ratings with kappa. All components of the assessment tool exhibited improvement in reliability, with 4 (bite, needle holder use, needle angles, and hand mechanics) improving the most from poor (ICC range, 0.09-0.48) to strong (ICC range, 0.80-0.90) agreement. After training, inter-rater reliabilities for composite scores improved from moderate (ICC, 0.76) to strong (ICC, 0.90) agreement, and for overall pass/fail ratings, from poor (kappa = 0.20) to moderate (kappa = 0.78) agreement. Focused, video-based anchor training facilitates greater inter-rater reliability in the objective assessment of simulated coronary anastomoses. Among raters with less teaching experience, such training may be needed before objective evaluation of technical skills. Published by Elsevier Inc.

  19. Growing Region Segmentation Software (GRES) for quantitative magnetic resonance imaging of multiple sclerosis: intra- and inter-observer agreement variability: a comparison with manual contouring method

    International Nuclear Information System (INIS)

    Parodi, Roberto C.; Sardanelli, Francesco; Renzetti, Paolo; Rosso, Elisabetta; Losacco, Caterina; Ferrari, Alessandra; Levrero, Fabrizio; Pilot, Alberto; Inglese, Matilde; Mancardi, Giovanni L.

    2002-01-01

    Lesion area measurement in multiple sclerosis (MS) is one of the key points in evaluating the natural history and in monitoring the efficacy of treatments. This study was performed to check the intra- and inter-observer agreement variability of a locally developed Growing Region Segmentation Software (GRES), comparing them to those obtained using manual contouring (MC). From routine 1.5-T MRI study of clinically definite multiple sclerosis patients, 36 lesions seen on proton-density-weighted images (PDWI) and 36 enhancing lesions on Gd-DTPA-BMA-enhanced T1-weighted images (Gd-T1WI) were randomly chosen and were evaluated by three observers. The mean range of lesion size was 9.9-536.0 mm² on PDWI and 3.6-57.2 mm² on Gd-T1WI. The median intra- and inter-observer agreement were, respectively, 97.1 and 90.0% using GRES on PDWI, 81.0 and 70.0% using MC on PDWI, 88.8 and 80.0% using GRES on Gd-T1WI, and 85.8 and 70.0% using MC on Gd-T1WI. The intra- and inter-observer agreements were significantly greater for GRES compared with MC (P<0.0001 and P=0.0023, respectively) for PDWI, while no difference was found between GRES and MC for Gd-T1WI. The intra-observer variability for GRES was significantly lower on both PDWI (P=0.0001) and Gd-T1WI (P=0.0067), whereas for MC the same result was found only for PDWI (P=0.0147). These data indicate that GRES reduces both the intra- and the inter-observer variability in assessing the area of MS lesions on PDWI and may prove useful in multicentre studies. (orig.)

  20. Managing Carious Lesions

    DEFF Research Database (Denmark)

    Schwendicke, F; Frencken, J E; Bjørndal, L

    2016-01-01

    should be prioritized, while in shallow or moderately deep lesions, restoration longevity becomes more important. For teeth with shallow or moderately deep cavitated lesions, carious tissue removal is performed according to selective removal to firm dentine. In deep cavitated lesions in primary... The International Caries Consensus Collaboration undertook a consensus process and here presents clinical recommendations for carious tissue removal and managing cavitated carious lesions, including restoration, based on texture of demineralized dentine. Dentists should manage the disease dental...

  1. Exploring rater agreement: configurations of agreement and disagreement

    Directory of Open Access Journals (Sweden)

    ALEXANDER VON EYE

    2006-03-01

    At the level of manifest categorical variables, a large number of coefficients and models for the examination of rater agreement have been proposed and used for descriptive and explanatory purposes. This article focuses on exploring rater agreement. Configural Frequency Analysis (CFA) is proposed as a method of exploration of cross-classifications of raters’ judgements. CFA allows researchers to (1) examine individual cells and sets of cells in agreement tables; (2) examine cells that indicate disagreement; and (3) explore agreement and disagreement among three or more raters. Four CFA base models are discussed. The first is the model of rater agreement that is also used for Cohen’s (1960) κ (kappa). This model proposes independence of raters’ judgements. Deviations from this model suggest agreement or disagreement beyond chance. The second CFA model is based on a log-linear null model. This model is also used for Brennan and Prediger’s (1981) κn. It proposes a uniform distribution of ratings. The third model is that of Tanner and Young (1985). This model proposes equal weights for agreement cases and independence otherwise. The fourth model is the quasi-independence model. This model allows one to blank out agreement cells and thus to focus solely on patterns of disagreement. Examples use data from applicant selection.
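
    The two chance models named in this abstract (independence of raters, as in Cohen's kappa, and a uniform distribution of ratings, as in Brennan and Prediger's coefficient) can be contrasted in a short Python sketch. It is only an illustration of the two chance-agreement baselines, not the CFA procedure itself; the function name and the example table are hypothetical and do not come from the article.

```python
def chance_corrected_agreement(table):
    """Return (kappa, kappa_n) for a square two-rater agreement table.

    table[i][j] = number of items rater A placed in category i
    and rater B placed in category j (hypothetical input layout).
    """
    k = len(table)
    n = sum(sum(row) for row in table)

    # Observed proportion of agreement: the diagonal of the table.
    p_obs = sum(table[i][i] for i in range(k)) / n

    # Marginal proportions for each rater.
    row_marg = [sum(table[i]) / n for i in range(k)]
    col_marg = [sum(table[i][j] for i in range(k)) / n for j in range(k)]

    # Chance agreement under independence of the raters (Cohen's kappa).
    p_chance_indep = sum(r * c for r, c in zip(row_marg, col_marg))
    # Chance agreement under a uniform distribution of ratings
    # (Brennan and Prediger's coefficient).
    p_chance_unif = 1 / k

    kappa = (p_obs - p_chance_indep) / (1 - p_chance_indep)
    kappa_n = (p_obs - p_chance_unif) / (1 - p_chance_unif)
    return kappa, kappa_n


# Made-up table with one dominant category: 80% raw agreement.
example = [[40, 5],
           [5, 0]]
kappa, kappa_n = chance_corrected_agreement(example)
print(f"kappa = {kappa:.3f}, kappa_n = {kappa_n:.3f}")
```

    With strongly unbalanced marginals, as in this made-up table, the independence baseline is already high, so the two coefficients can differ sharply even though the raw agreement is the same.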

  2. Diagnosing the severity of buccal caries lesions in orthodontic patients at de-bonding using digital photographs.

    Science.gov (United States)

    Almosa, Naif A; Lundgren, Ted; Bresin, Andrea; Birkhed, Dowen; Kjellberg, Heidrun

    2014-08-01

    The aims of this study were (1) to evaluate the severity of buccal caries lesions according to the International Caries Detection and Assessment System (ICDAS) criteria via scoring buccal caries lesions on digital photographs at the time of de-bonding and (2) to compare this method with clinical examination. In total, 89 patients treated with upper and lower fixed appliances were clinically examined immediately after de-bonding by the first author and buccal caries were scored according to the ICDAS-II. Close-up digital photographs were taken of 245 teeth with different buccal caries lesion scores according to the ICDAS-II. Thirteen postgraduate orthodontics students independently scored the buccal caries lesions in the digital photographs using the modified clinical criteria (ICDAS-II). Intra- and inter-examiner reliabilities were evaluated by calculating the weighted kappa. To evaluate the validity of diagnosing the severity of buccal caries lesions using digital photographs compared to clinical examination, Spearman's correlation coefficient was calculated. Intra-examiner reliability and the reliability between each examiner and the clinical examination showed moderate-to-excellent agreement, with kappa values of 0.52-0.83. Scoring buccal caries lesions via clinical examinations and scoring via photographs were well correlated according to the modified ICDAS-II criteria (Spearman's correlation coefficient, 0.76). Thus, scoring buccal caries lesions on digital photographs according to ICDAS-II criteria is a reliable and valid method for assessing the severity of buccal caries lesions.
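
    For ordinal scores such as ICDAS codes, a weighted kappa gives partial credit to near-misses. The abstract does not say which weighting scheme was used, so the following Python sketch assumes linear disagreement weights by default (with quadratic weights available through a hypothetical `power` parameter); the function name and the sample ratings are illustrative only.

```python
def weighted_kappa(ratings_a, ratings_b, n_categories, power=1):
    """Weighted kappa for two raters scoring items on 0..n_categories-1.

    power=1 gives linear disagreement weights, power=2 quadratic weights.
    Assumes the ratings are not all in one single category
    (otherwise the expected disagreement is zero).
    """
    k = n_categories
    n = len(ratings_a)

    # Observed joint proportions.
    observed = [[0.0] * k for _ in range(k)]
    for a, b in zip(ratings_a, ratings_b):
        observed[a][b] += 1 / n

    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    weighted_obs = 0.0   # observed weighted disagreement
    weighted_exp = 0.0   # weighted disagreement expected by chance
    for i in range(k):
        for j in range(k):
            w = (abs(i - j) / (k - 1)) ** power
            weighted_obs += w * observed[i][j]
            weighted_exp += w * row[i] * col[j]

    return 1 - weighted_obs / weighted_exp


# Made-up scores for four teeth on a three-level scale.
print(weighted_kappa([0, 1, 2, 2], [0, 2, 2, 1], n_categories=3))
```

    Linear weights penalize a two-step disagreement twice as much as a one-step disagreement; quadratic weights penalize it four times as much.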

  3. Human papillomavirus in oral lesions Virus papiloma humano en lesiones orales

    OpenAIRE

    Joaquín V. Gónzalez; Rafael A. Gutiérrez; Alicia Keszler; Maria Del Carmen Colacino; Lidia V. Alonio; Angélica R. Teyssie; Maria Alejandra Picconi

    2007-01-01

    Growing evidence suggests a role for human papillomavirus (HPV) in oral cancer; however its involvement is still controversial. This study evaluates the frequency of HPV DNA in a variety of oral lesions in patients from Argentina. A total of 77 oral tissue samples from 66 patients were selected (cases); the clinical-histopathological diagnoses corresponded to: 11 HPV- associated benign lesions, 8 non-HPV associated benign lesions, 33 premalignant lesions and 25 cancers. Sixty exfoliated cell ...

  4. Radiographic and histological evaluation of persistent periapical lesions associated with endodontic failures after apical microsurgery.

    Science.gov (United States)

    Çalışkan, M K; Kaval, M E; Tekin, U; Ünal, T

    2016-11-01

    To determine the histology of persistent periapical lesions associated with nonsurgical endodontic treatment failures and to compare radiographically the sizes of periapical lesions and the presence or absence of the radiopaque lamina with the histological findings. Ninety-three anterior teeth designated for apical microsurgery were included in the study. After taking standard radiographs of all cases using the parallel technique, the films were scanned and evaluated for the size of periapical radiolucent lesions and the presence or absence of radiopaque lamina by two calibrated observers. Biopsy specimens were obtained during apical microsurgery and examined under light microscopy by oral pathologists. Histological analysis established diagnoses of granuloma, cyst, abscess and scar tissue. Interobserver agreement was evaluated by the kappa test, and the relationship between histological diagnosis and lesion size was analysed by Pearson's chi-square test. The 93 specimens consisted of 72% periradicular granulomas; 21.5% radicular cysts, including two keratocysts; 4.3% abscesses; and 2.2% scar tissue. Cystic prevalence increased as the size of the periapical lesion increased; however, there was no correlation between the presence of a radiopaque lamina and histological diagnosis of cyst. Neither radiographic size nor presence of an associated radiopaque line alone was sufficient to determine the type of lesion. Histological examination is required in order to reach a definitive diagnosis. © 2015 International Endodontic Journal. Published by John Wiley & Sons Ltd.

  5. Bilateral agreements

    International Nuclear Information System (INIS)

    1998-01-01

    Ten bilateral agreements are presented. These are: 1) Co-operation agreement relating to the peaceful uses of nuclear energy between Argentina and EURATOM (1996); 2) Agreement on co-operation in the peaceful uses of nuclear energy between Argentina and Greece (1997); 3) Implementing arrangement for technical exchange and co-operation in the area of peaceful uses of nuclear energy between Argentina and the United States (1997); 4) Agreement concerning co-operation in nuclear science and technology between Australia and Indonesia (1997); 5) Implementation of the 1985 Agreement for co-operation concerning the peaceful uses of nuclear energy between the People's Republic of China and the United States (1998); 6) Protocol of co-operation between France and Lithuania (1997); 7) Agreement on co-operation in energy research, science and technology, and development between Germany and the United States (1998); 8) Agreement on early notification of a nuclear accident and exchange of information on nuclear facilities between Greece and Romania (1997); 9) Agreement on early notification of nuclear accidents and co-operation in the field of nuclear safety between Hungary and the Ukraine (1997); 10) Agreement in the field of radioactive waste management between Switzerland and the United States (1997). (K.A.)

  6. Can we see enough? A comparative study of film-screen vs digital radiographs in small lesions in rheumatoid arthritis

    International Nuclear Information System (INIS)

    Jagt, E.J. van der; Hofman, S.; Kraft, B.M.; Leeuwen, M.A. van

    2000-01-01

    The aim of this study was to evaluate the performance of digital radiography in the detection of early very small erosions and joint space narrowing in the hands and feet in rheumatoid arthritis. Fifty-three sets of film-screen and digital radiographs of the same hands and feet with very small and sometimes questionable lesions (possible erosions and cysts) were scored independently two times by four investigators. The percentage of lesions found in exactly the same position for each investigator was calculated. Intra-observer agreement between first and second reading in film-screen radiography was 64-76 % (mean 67 %), and in digital radiographs 60-71 % (mean 64 %). Agreement between film-screen and digital radiographs ranged from 54 to 64 % (mean 58 %) in the first reading and from 56 to 66 % (mean 62 %) in the second reading. Overall agreement between both techniques between first and second reading ranged between 62 and 73 % (mean 65 %). Digital radiography of the hands and feet can be used in patients suspected of rheumatoid arthritis and in follow-up of those patients, because small and early erosions can be seen equally well with the digital technique as compared with the conventional film-screen technique. (orig.)

  7. Evaluation of agreement among digital dermatitis scoring methods in the milking parlor, pen, and hoof trimming chute.

    Science.gov (United States)

    Cramer, G; Winders, T; Solano, L; Kleinschmit, D H

    2018-03-01

    Digital dermatitis (DD) is the most common infectious foot lesion affecting welfare and productivity of dairy cattle. The key to DD control programs is routine and frequent identification of DD lesions. The objective was to evaluate accuracy of detecting and scoring DD lesions in 3 milking parlor designs and in 3 alternative settings compared with scoring in the hoof trimming chute as reference. A total of 552 cows and 1,104 hind feet from 17 freestall farms were scored by 1 observer in the milking parlor and in 1 other setting: pen, headlocks, or management rail. After being scored in the milking parlor and at least 1 other setting, cows were examined in the hoof trimming chute, considered the gold standard. In every setting, all hind feet were inspected visually using a flashlight and without prior washing of feet. Agreement of the scoring settings was assessed using the 5 M-stage scoring system and a dichotomous absence (M0 or M0/M1) or presence (M1 to M4.1 or M2 to M4.1) system. At trimming chute inspection, 44% of feet had a DD lesion, with estimates of 11, 5, 2, 10, and 16% for M1, M2, M3, M4, and M4.1 lesions, respectively. Apparent DD foot-level prevalence at the milking parlor, pen, management rail, and headlocks was 28, 22, 16, and 22%, respectively. M-stages were less discernible in the pen, management rail, and headlocks (apparent prevalence of M1, M2, M3, and M4.1 was ≤1%) compared with the trimming chute and milking parlor. Agreement beyond chance between any scoring setting and trimming chute scoring ranged from 0.48 to 0.70 for the dichotomous scoring system (absence = M0/M1 vs. presence = M2 to M4.1). Diagnostic test performance varied greatly among DD scoring settings but, in general, it had low sensitivity (93%) for detecting any DD lesion. Agreement and test characteristics were not affected by the type of milking parlor. Although the milking parlor and headlocks were the most reliable settings in which to detect DD, none of the settings were

  8. Thermophotonic lock-in imaging of early demineralized and carious lesions in human teeth

    Science.gov (United States)

    Tabatabaei, Nima; Mandelis, Andreas; Amaechi, Bennett Tochukwu

    2011-07-01

    As an extension of frequency-domain photothermal radiometry, a novel dental-imaging modality, thermophotonic lock-in imaging (TPLI), is introduced. This methodology uses photothermal wave principles and is capable of detecting early carious lesions and cracks on occlusal and approximal surfaces as well as early caries induced by artificial demineralizing solutions. The increased light scattering and absorption within early carious lesions increases the thermal-wave amplitude and shifts the thermal-wave centroid, producing contrast between the carious lesion and the intact enamel in both amplitude and phase images. Samples with artificial and natural occlusal and approximal caries were examined in this study. Thermophotonic effective detection depth is controlled by the modulation frequency according to the well-known concept of thermal diffusion length. TPLI phase images are emissivity normalized and therefore insensitive to the presence of stains. Amplitude images, on the other hand, provide integrated information from deeper enamel regions. It is concluded that the results of our noninvasive, noncontacting imaging methodology exhibit higher sensitivity to very early demineralization than dental radiographs and are in agreement with the destructive transverse microradiography mineral density profiles.

  9. Benign fibroosseous lesions

    Directory of Open Access Journals (Sweden)

    Cansu Köseoğlu Seçgin

    2016-05-01

    Benign fibroosseous lesions represent a group of lesions that share the same basic evolutive mechanism and are characterized by replacement of normal bone with a fibrous connective tissue that gradually undergoes mineralization. These lesions are represented by a variety of diseases, including developmental and reactive-dysplastic processes and neoplasms. Depending on the nature and amount of calcified tissue, they can be observed as radiolucent, mixed or radiopaque. Their radiographic features could be well-defined or indistinguishable from the surrounding bone tissue. They can be asymptomatic, as in osseous dysplasias, and be detected incidentally on radiographs, or they can lead to expansion in the affected bone, as in ossifying fibroma. All fibroosseous lesions seen in the jaws and face are variations of the same histological pattern. Therefore, detailed clinical and radiographic evaluation in differential diagnosis is important. In this review, fibroosseous benign lesions are classified as osseous dysplasia, fibrous dysplasia and fibroosseous tumors; and radiographic features and differential diagnosis of these lesions are reviewed taking into account this classification.

  10. Can CT and MR Shape and Textural Features Differentiate Benign Versus Malignant Pleural Lesions?

    Science.gov (United States)

    Pena, Elena; Ojiaku, MacArinze; Inacio, Joao R; Gupta, Ashish; Macdonald, D Blair; Shabana, Wael; Seely, Jean M; Rybicki, Frank J; Dennie, Carole; Thornhill, Rebecca E

    2017-10-01

    The study aimed to identify a radiomic approach based on CT and or magnetic resonance (MR) features (shape and texture) that may help differentiate benign versus malignant pleural lesions, and to assess if the radiomic model may improve confidence and accuracy of radiologists with different subspecialty backgrounds. Twenty-nine patients with pleural lesions studied on both contrast-enhanced CT and MR imaging were reviewed retrospectively. Three texture and three shape features were extracted. Combinations of features were used to generate logistic regression models using histopathology as outcome. Two thoracic and two abdominal radiologists evaluated their degree of confidence in malignancy. Diagnostic accuracy of radiologists was determined using contingency tables. Cohen's kappa coefficient was used to assess inter-reader agreement. Using optimal threshold criteria, sensitivity, specificity, and accuracy of each feature and combination of features were obtained and compared to the accuracy and confidence of radiologists. The CT model that best discriminated malignant from benign lesions revealed an AUC CT  = 0.92 ± 0.05 (P textural and shape analysis may help distinguish malignant from benign lesions. A radiomics-based approach may increase diagnostic confidence of abdominal radiologists on CT and MR and may potentially improve radiologists' accuracy in the assessment of pleural lesions characterized by MR. Copyright © 2017 The Association of University Radiologists. Published by Elsevier Inc. All rights reserved.

  11. Optical coherence tomography to evaluate variance in the extent of carious lesions in depth.

    Science.gov (United States)

    Park, Kyung-Jin; Schneider, Hartmut; Ziebolz, Dirk; Krause, Felix; Haak, Rainer

    2018-05-03

    Evaluation of variance in the extent of carious lesions in depth at smooth surfaces within the same ICDAS code group using optical coherence tomography (OCT) in vitro and in vivo. (1) Verification/validation of OCT to assess non-cavitated caries: 13 human molars with ICDAS code 2 at smooth surfaces were imaged using OCT and light microscopy. Regions of interest (ROI) were categorized according to the depth of carious lesions. Agreement between histology and OCT was determined by unweighted Cohen's Kappa and Wilcoxon test. (2) Assessment of 133 smooth surfaces using ICDAS and OCT in vitro, 49 surfaces in vivo. ROI were categorized according to the caries extent (ICDAS: codes 0-4, OCT: scoring based on lesion depth). A frequency distribution of the OCT scores for each ICDAS code was determined. (1) Histology and OCT agreed moderately (κ = 0.54, p ≤ 0.001) with no significant difference between both methods (p = 0.25). The lesions (76.9%, 10 of 13) were equally scored. (2) In vitro, OCT revealed caries in 42% of ROI clinically assessed as sound. OCT detected dentin-caries in 40% of ROIs visually assessed as enamel-caries. In vivo, large differences between ICDAS and OCT were observed. Carious lesions of ICDAS codes 1 and 2 vary largely in their extent in depth.

  12. Evaluation of radiology as a tool to diagnose pulmonic lesions in calves, for example prior to experimental infection studies

    DEFF Research Database (Denmark)

    Tegtmeier, Conny; Arnbjerg, J.

    2000-01-01

    The aim of the study was to evaluate radiology as a technique to visualize pulmonary lesions in young calves, e.g. as a selection criterion for research animals in order to eliminate animals with lung lesions prior to experimental studies of pneumonia. Five calves with acute clinical signs...... of pneumonia were included in a direct comparative study of radiological and post mortem findings. Also, a number of animals with no signs of pneumonia were included as controls. The study revealed good agreement between the radiological and post mortem findings. Thus, in conclusion, radiology should...

  13. Lesions of juxtacortical origin (surface lesions of bone)

    International Nuclear Information System (INIS)

    Kenan, S.; Abdelwahab, I.F.; Klein, M.J.; Hermann, G.; Lewis, M.M.

    1993-01-01

    A large variety of tumors and tumor-like conditions have been shown to originate from the surface of bone. Most surface lesions are associated with periosteal reaction. The periosteum is a multipotential membrane. Its cellular composition may give rise to a variety of both neoplasms and tumor-like conditions. To avoid misinterpretation, the orthopedist, radiologist, and pathologist should be familiar with the entire spectrum of surface lesions. A better understanding of the natural history and biological behavior at different lesional maturity stages and correlation of the history with the radiographic and pathological findings is essential to establish the correct diagnosis. A history of injury or blunt trauma is very important. A stress fracture may produce a periosteal reaction and callus that can be difficult to distinguish from osteosarcoma. In this review article, the authors describe and define each term by its anatomy and radiographic features while discussing the entire spectrum of surface lesions. All the illustrative cases in this review article have been proven histologically. (orig.)

  14. Quality assessment of observational studies in a drug-safety systematic review, comparison of two tools: the Newcastle–Ottawa Scale and the RTI item bank

    Directory of Open Access Journals (Sweden)

    Margulis AV

    2014-10-01

    Andrea V Margulis,1 Manel Pladevall,1 Nuria Riera-Guardia,1 Cristina Varas-Lorenzo,1 Lorna Hazell,2,3 Nancy D Berkman,4 Meera Viswanathan,4 Susana Perez-Gutthann,1 1RTI Health Solutions, Barcelona, Spain; 2Drug Safety Research Unit, Southampton, UK; 3Associate Department of the School of Pharmacy and Biomedical Sciences, University of Portsmouth, Portsmouth, UK; 4RTI International, Research Triangle Park, NC, USA Background: The study objective was to compare the Newcastle–Ottawa Scale (NOS) and the RTI item bank (RTI-IB) and estimate interrater agreement using the RTI-IB within a systematic review on the cardiovascular safety of glucose-lowering drugs. Methods: We tailored both tools and added four questions to the RTI-IB. Two reviewers assessed the quality of the 44 included studies with both tools (independently for the RTI-IB) and agreed on which responses conveyed low, unclear, or high risk of bias. For each question in the RTI-IB (n=31), the observed interrater agreement was calculated as the percentage of studies given the same bias assessment by both reviewers; chance-adjusted interrater agreement was estimated with the first-order agreement coefficient (AC1) statistic. Results: The NOS required less tailoring and was easier to use than the RTI-IB, but the RTI-IB produced a more thorough assessment. The RTI-IB includes most of the domains measured in the NOS. Median observed interrater agreement for the RTI-IB was 75% (25th percentile [p25] = 61%; p75 = 89%); median AC1 statistic was 0.64 (p25 = 0.51; p75 = 0.86). Conclusion: The RTI-IB facilitates a more complete quality assessment than the NOS but is more burdensome. The observed agreement and AC1 statistic in this study were higher than those reported by the RTI-IB's developers. Keywords: systematic review, meta-analysis, quality assessment, AC1
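
    The two agreement measures named in this abstract can be sketched as follows. This is a minimal illustration, assuming the usual two-rater form of Gwet's AC1, in which chance agreement is computed from the average marginal proportion of each category; the abstract does not give the formula, and the function names and the six example ratings are hypothetical, not data from the study.

```python
from collections import Counter

def observed_agreement(rater1, rater2):
    """Fraction of items given the same category by both raters."""
    return sum(a == b for a, b in zip(rater1, rater2)) / len(rater1)

def gwet_ac1(rater1, rater2, categories):
    """Two-rater AC1 (assumed standard form, not taken from the abstract)."""
    n = len(rater1)
    q = len(categories)
    pa = observed_agreement(rater1, rater2)
    c1, c2 = Counter(rater1), Counter(rater2)
    # pi_k: average proportion of items placed in category k by the two raters
    pis = [(c1[k] + c2[k]) / (2 * n) for k in categories]
    pe = sum(pi * (1 - pi) for pi in pis) / (q - 1)
    return (pa - pe) / (1 - pe)

# Hypothetical risk-of-bias ratings for six studies (illustration only)
reviewer_a = ["low", "low", "high", "unclear", "low", "high"]
reviewer_b = ["low", "low", "high", "low", "low", "unclear"]
levels = ["low", "unclear", "high"]

print(f"observed agreement: {observed_agreement(reviewer_a, reviewer_b):.2f}")
print(f"AC1: {gwet_ac1(reviewer_a, reviewer_b, levels):.3f}")
```

    In the study itself these quantities were computed per RTI-IB question across the 44 studies and then summarized by median and quartiles.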

  15. An examination of the interrater reliability between practitioners and researchers on the static-99.

    Science.gov (United States)

    Quesada, Stephen P; Calkins, Cynthia; Jeglic, Elizabeth L

    2014-11-01

    Many studies have validated the psychometric properties of the Static-99, the most widely used measure of sexual offender recidivism risk. However, much of this research relied on instrument coding completed by well-trained researchers. This study is the first to examine the interrater reliability (IRR) of the Static-99 between practitioners in the field and researchers. Using archival data from a sample of 1,973 formerly incarcerated sex offenders, field raters' scores on the Static-99 were compared with those of researchers. Overall, clinicians and researchers had excellent IRR on Static-99 total scores, with IRR coefficients ranging from "substantial" to "outstanding" for the 10 individual items of the scale. The most common causes of discrepancies were coding manual errors, followed by item subjectivity, inaccurate item scoring, and calculation errors. These results offer important data with regard to the frequency and perceived nature of scoring errors. © The Author(s) 2013.

  16. Incidental enchondromas at knee magnetic resonance imaging: intraobserver and interobserver agreement and prevalence of imaging findings

    Directory of Open Access Journals (Sweden)

    Sandra Akemi Nakamura

    2013-06-01

    Objective: To evaluate intra- and interobserver agreement in the identification of incidental enchondromas at knee magnetic resonance imaging, and to assess the prevalence of imaging findings. Materials and Methods: Retrospective study reviewing 326 knee magnetic resonance images acquired in the period between November 2009 and September 2010. The images were independently and blindly analyzed by two specialists in musculoskeletal radiology, with the objective of identifying incidental enchondromas, the presence of foci with signal similar to bone marrow, and foci of signal absence suggestive of calcifications within the enchondromas. Inter- and intraobserver agreements were analyzed. Results: Eleven lesions compatible with enchondromas (3.3%) were identified. The interobserver agreement for the presence of enchondroma was high. The prevalence of foci of bone marrow signal inside the enchondromas was 54.55%, and foci suggestive of calcification were present in 36.36%. The intraobserver agreement for foci of bone marrow signal in enchondromas was perfect, and interobserver agreement was high. Conclusion: The prevalence of incidental enchondromas in the current study was compatible with data in the literature. Excellent agreement was observed in the identification of enchondromas and in the assessment of imaging findings. A higher prevalence of fat signal foci was observed as compared with signal absence suggestive of calcifications.

  17. Three-dimensional volumetric MRI with isotropic resolution: improved speed of acquisition, spatial resolution and assessment of lesion conspicuity in patients with recurrent soft tissue sarcoma

    Energy Technology Data Exchange (ETDEWEB)

    Ahlawat, Shivani [The Johns Hopkins Medical Institutions, The Russell H. Morgan Department of Radiology and Radiological Science, Baltimore, MD (United States); Morris, Carol [The Johns Hopkins Medical Institutions, Department of Orthopedic Surgery, Baltimore, MD (United States); The Johns Hopkins Medical Institutions, Department of Oncology, Baltimore, MD (United States); Fayad, Laura M. [The Johns Hopkins Medical Institutions, The Russell H. Morgan Department of Radiology and Radiological Science, Baltimore, MD (United States); The Johns Hopkins Medical Institutions, Department of Orthopedic Surgery, Baltimore, MD (United States); The Johns Hopkins Medical Institutions, Department of Oncology, Baltimore, MD (United States)

    2016-05-15

    To assess the acquisition speed, lesion conspicuity, and inter-observer agreement associated with volumetric T1-weighted MR sequences with isotropic resolution for detecting recurrent soft-tissue sarcoma (STS). Fifteen subjects with histologically proven recurrent STS underwent MRI, including axial and coronal T1-weighted spin echo (T1-WSE) (5-mm slice thickness) and coronal 3D volumetric T1-weighted (fat-suppressed, volume-interpolated, breath-hold examination; repetition time/echo time, 3.7/1.4 ms; flip angle, 9.5°; 1-mm slice thickness) sequences before and after intravenous contrast administration. Subtraction imaging and multiplanar reformations (MPRs) were performed. Acquisition times for T1-WSE in two planes and 3D sequences were reported. Two radiologists reviewed images for quality (>50% artifacts, 25-50% artifacts, <25% artifacts, and no substantial artifacts), lesion conspicuity, contrast-to-noise ratio (CNR, muscle), recurrence size, and recurrence-to-joint distance. Descriptive and intraclass correlation (ICC) statistics are given. Mean acquisition times were significantly less for 3D imaging compared with 2-plane T1-WSE (183.6 vs 342.6 s; P = 0.012). Image quality was rated as having no substantial artifacts in 13/15 and <25% artifacts in 2/15. Lesion conspicuity was significantly improved for subtracted versus unsubtracted images (CNR, muscle, 100 ± 138 vs 181 ± 199; P = 0.05). Mean recurrent lesion size was 2.5 cm (range, 0.7-5.7 cm), and measurements on 3D sequences offered excellent interobserver agreement (ICC, 0.98 for lesion size and 0.96 for recurrence-to-joint distance with MPR views). Three-dimensional volumetric sequences offer faster acquisition times, higher spatial resolution, and MPR capability compared with 2D T1-WSE for postcontrast imaging. Subtraction imaging provides higher lesion conspicuity for detecting recurrent STS in skeletal muscle, with excellent interobserver

  18. Relative and Absolute Interrater Reliabilities of a Hand-Held Myotonometer to Quantify Mechanical Muscle Properties in Patients with Acute Stroke in an Inpatient Ward

    Directory of Open Access Journals (Sweden)

    Wai Leung Ambrose Lo

    2017-01-01

    Introduction. The reliability of using MyotonPRO to quantify muscle mechanical properties in a ward setting for the acute stroke population remains unknown. Aims. To investigate the within-session relative and absolute interrater reliability of MyotonPRO. Methods. Mechanical properties of biceps brachii, brachioradialis, rectus femoris, and tibialis anterior were recorded at bedside. Participants were within 1 month of the first occurrence of stroke. Relative reliability was assessed by intraclass correlation coefficient (ICC). Absolute reliability was assessed by standard error of measurement (SEM, SEM%), smallest real difference (SRD, SRD%), and the Bland-Altman 95% limits of agreement. Results. ICCs of all studied muscles ranged between 0.63 and 0.97. The SEM of all muscles ranged within 0.30–0.88 Hz for tone, 0.07–0.19 for decrement, 6.42–20.20 N/m for stiffness, and 0.04–0.07 for creep. The SRD of all muscles ranged within 0.70–2.05 Hz for tone, 0.16–0.45 for decrement, 14.98–47.15 N/m for stiffness, and 0.09–0.17 for creep. Conclusions. MyotonPRO demonstrated acceptable relative and absolute reliability in a ward setting for patients with acute stroke. However, results must be interpreted with caution, due to the varying level of consistency between different muscles, as well as between different parameters within a muscle.

  19. Interrater and test-retest reliability and validity of the Norwegian version of the BESTest and mini-BESTest in people with increased risk of falling.

    Science.gov (United States)

    Hamre, Charlotta; Botolfsen, Pernille; Tangen, Gro Gujord; Helbostad, Jorunn L

    2017-04-20

    The Balance Evaluation Systems Test (BESTest) was developed to assess underlying systems for balance control in order to be able to individually tailor rehabilitation interventions to people with balance disorders. A short form, the Mini-BESTest, was developed as a screening test. The study aimed to assess interrater and test-retest reliability of the Norwegian version of the BESTest and the Mini-BESTest in community-dwelling people with increased risk of falling and to assess concurrent validity with the Fall Efficacy Scale-International (FES-I). It was an observational study with a cross-sectional design. Forty-two persons with increased risk of falling (elderly over 65 years of age, persons with a history of stroke or multiple sclerosis) were assessed twice by two raters. Relative reliability was analysed with the Intraclass Correlation Coefficient (ICC), and absolute reliability with standard error of measurement (SEM) and smallest detectable change (SDC). Concurrent validity was assessed against the FES-I using Spearman's rho. The BESTest showed very good interrater reliability (ICC = 0.98, SEM = 1.79, SDC95 = 5.0) and test-retest reliability (rater A/rater B: ICC = 0.89/0.89, SEM = 3.9/4.3, SDC95 = 10.8/11.8). The Mini-BESTest also showed very good interrater reliability (ICC = 0.95, SEM = 1.19, SDC95 = 3.3) and test-retest reliability (rater A/rater B: ICC = 0.85/0.84, SEM = 1.8/1.9, SDC95 = 4.9/5.2). The correlations were moderate between the FES-I and both the BESTest and the Mini-BESTest (Spearman's rho -0.51 and -0.50, p test-retest reliability when assessed in a heterogeneous sample of people with increased risk of falling. The concurrent validity measured against the FES-I showed moderate correlation. The results are comparable with earlier studies and indicate that the Norwegian versions can be used in daily clinic and in research.
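
    The relation between the SEM and SDC95 values reported here can be checked with a short sketch. It assumes the conventional formula SDC95 = 1.96 × √2 × SEM, which the abstract does not state explicitly; the function name is illustrative.

```python
import math

def sdc95(sem):
    """Smallest detectable change at the 95% level, assuming the
    conventional relation SDC95 = 1.96 * sqrt(2) * SEM."""
    return 1.96 * math.sqrt(2.0) * sem

# SEM values as reported in the abstract (interrater reliability)
reported = {
    "BESTest": (1.79, 5.0),       # (SEM, reported SDC95)
    "Mini-BESTest": (1.19, 3.3),
}

for test_name, (sem, reported_sdc) in reported.items():
    print(f"{test_name}: computed {sdc95(sem):.2f}, reported {reported_sdc}")
```

    Under that assumption the interrater figures reproduce the reported values to one decimal place. Some of the test-retest figures differ in the last digit, which would be consistent with the SEM values having been rounded before publication.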

  20. Interobserver agreement for the spine instability neoplastic score varies according to the experience of the evaluator

    Directory of Open Access Journals (Sweden)

    William Gemio Jacobsen Teixeira

    2013-01-01

    OBJECTIVES: To evaluate the interobserver agreement for the Neoplastic Spine Instability Score (SINS) among spine surgeons with or without experience in vertebral metastasis treatment and physicians in other specialties. METHODS: Case descriptions were produced based on the medical records of 40 patients with vertebral metastases. The descriptions were then published online. Physicians were invited to evaluate the descriptions by answering questions according to the Neoplastic Spine Instability Score (SINS). The agreement among physicians was calculated using the kappa coefficient. RESULTS: Seventeen physicians agreed to participate: three highly experienced spine surgeons, seven less-experienced spine surgeons, three surgeons of other specialties, and four general practitioners (n = 17). The agreement for the final SINS score among all participants was fair, and it varied according to the SINS component. The agreement was substantial for the spine location only. The agreement was higher among experienced surgeons. The agreement was nearly perfect for spinal location among the spine surgeons who were highly experienced in vertebral metastases. CONCLUSIONS: This study demonstrates that the experience of the evaluator has an impact on SINS scale classification. The interobserver agreement was only fair among physicians who were not spine surgeons and among spine surgeons who were not experienced in the treatment of vertebral metastases, which may limit the use of the SINS scale for the screening of unstable lesions by less-experienced evaluators.

  1. Whole-procedure clinical accuracy of Gamma Knife treatments of large lesions

    International Nuclear Information System (INIS)

    Ma Lijun; Chuang, Cynthia; Descovich, Martina; Petti, Paula; Smith, Vernon; Verhey, Lynn

    2008-01-01

    The mechanical accuracy of Gamma Knife radiosurgery based on single-isocenter measurement has been established to within 0.3 mm. However, the full delivery accuracy for Gamma Knife treatments of large lesions has only been estimated via the quadrature-sum analysis. In this study, the authors directly measured the whole-procedure accuracy for Gamma Knife treatments of large lesions to examine the validity of such estimation. The measurements were conducted on a head-phantom simulating the whole treatment procedure that included frame placement, computed tomography imaging, treatment planning, and treatment delivery. The results of the measurements were compared with the dose calculations from the treatment planning system. Average agreements of 0.1-1.6 mm for the isodose lines ranging from 25% to 90% of the maximum dose were found despite potentially large contributing uncertainties such as 3-mm imaging resolution, 2-mm dose grid size, 1-mm frame registration, multi-isocenter deliveries, etc. The results of our measurements were found to be significantly smaller (>50%) than the calculated value based on the quadrature-sum analysis. In conclusion, Gamma Knife treatments of large lesions can be delivered much more accurately than predicted from the quadrature-sum analysis of major sources of uncertainties from each step of the delivery chain.
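
    For orientation, a quadrature-sum estimate of the kind this abstract compares against can be sketched as below. This is only an illustration: it treats the three uncertainty sources named in the abstract as independent and ignores the others covered by "etc.", so the result is not the authors' calculated value.

```python
import math

def quadrature_sum(uncertainties_mm):
    """Combine independent uncertainty components as the root sum of squares."""
    return math.sqrt(sum(u ** 2 for u in uncertainties_mm))

# Components named in the abstract (mm); the full list used by the authors
# is not given, so this subset is an assumption for illustration.
components = {
    "imaging resolution": 3.0,
    "dose grid size": 2.0,
    "frame registration": 1.0,
}

total = quadrature_sum(components.values())
print(f"quadrature sum of named components: {total:.2f} mm")
```

    Even this partial sum is well above the 0.1-1.6 mm agreement measured on the phantom, which is the direction of the discrepancy the authors report.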

  2. Localization of lesions in aphasia

    International Nuclear Information System (INIS)

    Hojo, Kei; Watanabe, Shunzo; Tasaki, Hiroichi; Sato, Tokijiro; Metoki, Hirobumi.

    1984-01-01

    Using a microcomputer, the locus and extent of the lesions, as demonstrated by computed tomography for 127 cases with various types of aphasia, were superimposed onto standardized matrices. The relationship between the foci of the lesions and the types of aphasia was investigated. Broca aphasics (n=39): Although the accumulated site of the lesions highly involved the deep structures of the lower part of the precentral gyrus as well as the insula and lenticular nucleus, only 60% of the Broca aphasics had lesions in these areas. This finding has proved to have little localizing value. Wernicke aphasics (n=23): The size of the lesion was significantly smaller than in Broca's aphasia. At least 70% of the patients had superior temporal lesions involving Wernicke's area and subcortical lesions of the superior and middle temporal gyri. Amnestic aphasics (n=18): The size of the lesion was smaller than in any other type. While there was some concentration of the lesions (maximum 40%) in the subcortical region of the anterior temporal gyrus adjacent to Wernicke's area and the lenticular nucleus, the lesions were distributed throughout the left hemisphere. Amnestic aphasia was thought to be the least localizable. Conduction aphasics (n=11): The lesions were relatively small in size. Many patients had posterior speech area lesions involving at least partially Wernicke's area. In particular, more than 80% of the conduction aphasics had lesions of the supramarginal gyrus and its adjacent deep structures. Global aphasics (n=36): In general, the size of the lesion was very large and 70% of the global aphasics had extensive lesions involving both Broca's and Wernicke's areas. However, there were observations showing that the lesions can be small and confined. (J.P.N.)

  3. Magnetic resonance imaging of lower abdominal and pelvic lesions

    International Nuclear Information System (INIS)

    Van Beers, Bernard

    1992-01-01

    To determine the value of oral magnetic particles (OMP) as a superparamagnetic MR contrast agent for the gastrointestinal tract in lower abdominal and pelvic lesions, 30 patients underwent spin-echo imaging before and after ingestion of OMP at a dose of approximately 80 mg of iron in 800 ml water. The preparation was divided into 4 portions and taken by the patient over a 2-h period. Two readers independently reviewed the MR images. The contrast material was well tolerated and the distribution of the contrast material was good to excellent in the proximal and pelvic small bowel, but was not sufficient in the colon with the dose and timing used in the study. Postcontrast images showed a significantly better delineation of the lesions, the small bowel and the paraaortic region, but no significant improvement in delineation of the colon, the iliac vessels, bladder or genital tract. Compared with precontrast images, confidence in defining or excluding disease on postcontrast images was better, equal or worse in 40, 60 and 0% respectively (P<0.001), with substantial agreement between readers (κ = 0.71). OMP produced susceptibility artefacts of significant intensity in only one case. These results indicate that OMP may be useful in the delineation of lower abdominal and pelvic lesions at MR imaging. Marking of the colon by a contrast agent might improve the results. (author). 22 refs.; 3 figs.; 1 tab

  4. Analysis of pulmonary coin lesions

    International Nuclear Information System (INIS)

    Kim, O; Kim, K. H.; Oh, K. K.; Park, C. Y.

    1979-01-01

    For a long time the solitary pulmonary nodule has remained a difficult problem to solve, and it has attracted a great deal of attention in recent years. Circumscribed coin lesions of the lung are generally peripheral in location with respect to the pulmonary hilus. Because of this, important clinical problems in management and diagnosis arise. Such a lesion is discovered through roentgenologic examination, so the roentgenologist is the first to be in a position to offer advice. This presentation is an attempt to correlate a useful diagnosis with the roentgenologic findings of pulmonary coin lesions, enabling a differential diagnosis of benign and malignant lesions. A total of 120 histologically proven cases of pulmonary coin lesion over a period of 8 years were reviewed through plain films, tomograms, bronchoscopy, various laboratory findings, and clinical history. The results are as follows: 1. The male-to-female sex ratio was 3 : 1. In age distribution, most malignant pulmonary coin lesions appeared in the 6th decade (39%) and 5th decade (27%); most benign lesions appeared in the 3rd decade. 2. Pathological cell types were as follows: primary bronchogenic cancer 43.3%, tuberculoma 25.8%, inflammatory lesion 17.5%, benign tumor 10%, and bronchial adenoma, hamartoma, A.V. malformation, and mesothelioma, 1 case each. As a result, benign and malignant lesions showed an approximately equal distribution (49.1% : 50.3%). 3. In symptom analysis, cough was the most common symptom in malignant lesions (43.5%), followed by hemoptysis (20.9%) and chest pain (14.5%). In benign lesions, the largest group of patients (32.7%) did not complain of any symptom. 4. In malignant lesions, the most common nodule size was 4 cm (32.3%), and in benign lesions a 2 cm coin was most common (39.3%). 5. In general, the margin of the nodule was very sharp and well demarcated in benign lesions (83.3%), and less demarcated and poorly defined in malignant lesions. 6. Most cases of calcification (82.7%) were seen in benign

  5. Uterine Vascular Lesions

    Science.gov (United States)

    Vijayakumar, Abhishek; Srinivas, Amruthashree; Chandrashekar, Babitha Moogali; Vijayakumar, Avinash

    2013-01-01

    Vascular lesions of the uterus are rare; most reported in the literature are arteriovenous malformations (AVMs). Uterine AVMs can be congenital or acquired. In recent years, there has been an increasing number of reports of acquired vascular lesions of the uterus following pregnancy, abortion, cesarean delivery, and curettage. It can be seen from these reports that there is confusion concerning the terminology of uterine vascular lesions. There is also a lack of diagnostic criteria and management guidelines, which has led to an increased number of unnecessary invasive procedures (eg, angiography, uterine artery embolization, hysterectomy for abnormal vaginal bleeding). This article familiarizes readers with various vascular lesions of the uterus and their management. PMID:24340126

  6. [Correlation of hybrid II capture cytologic exam in diagnosis of cervical lesions related to HPV].

    Science.gov (United States)

    Saturnino, Ana Conceição Ribeiro Dantas; Diniz, Veruska Cassandra; Sisenando, Herbert Ary Arzabe Anteza Costa Nóbrega; de Souza, Eroisa Maria Conceição Oliveira; de Paiva, Magnus Sérgio Martins; de Brito, Maria Helena Marques Fonseca; Ramos, Eleni Souto Nóbrega

    2005-01-01

    The aim of the present study was to compare the results of cytological examination with those of hybrid capture II in the diagnosis of HPV-induced cervical intraepithelial lesions, correlating the cytological findings with the biomolecular findings. The study was carried out in a group of 160 sexually active women who had spontaneously sought their gynecologists for a routine consultation and who underwent collection of cervicovaginal material for cytology and for hybrid capture II examination at the Centro de Patologia Clínica and the Hospital e Maternidade Promater, in the city of Natal-RN. The results showed relatively high numbers of HPV-positive cases by hybrid capture II (41.87%) and by cytology (23.75%). The agreement between the two methods was relatively high (59.38%). Viruses with high oncogenic potential were most often found with cytology compatible with low-risk lesions (11.88%), followed by high-risk lesions (CIN II and III); viruses with low oncogenic potential were more often associated with low-risk lesions (6.25%), followed by high-risk lesions. Cytology, even with its limitations, is an important method for detecting pathologies attributable to HPV, and the molecular method complements it and consolidates the cytological findings.

  7. The impact of precise robotic lesion length measurement on stent length selection: ramifications for stent savings.

    Science.gov (United States)

    Campbell, Paul T; Kruse, Kevin R; Kroll, Christopher R; Patterson, Janet Y; Esposito, Michele J

    2015-09-01

    Coronary stent deployment outcomes can be negatively impacted by inaccurate lesion measurement and inappropriate stent length selection (SLS). We compared visual estimate of these parameters to those provided by the CorPath 200® Robotic PCI System. Sixty consecutive patients who underwent coronary stent placement utilizing the CorPath System were evaluated. The treating physician assessed orthogonal images and provided visual estimates of lesion length and SLS. The robotic system was then used for the same measures. SLS was considered to be accurate when visual estimate and robotic measures were in agreement. Visual estimate SLSs were considered to be "short" or "long" if they were below or above the robotic-selected stents, respectively. Only 35% (21/60) of visually estimated lesions resulted in accurate SLS, whereas 33% (20/60) and 32% (19/60) of the visually estimated SLSs were long and short, respectively. In 5 cases (8.3%), 1 less stent was placed based on the robotic lesion measurement being shorter than the visual estimate. Visual estimate assessment of lesion length and SLS is highly variable with 65% of the cases being inaccurately measured when compared to objective measures obtained from the robotic system. The 32% of the cases where lesions were visually estimated to be short represents cases that often require the use of extra stents after the full lesion is not covered by 1 stent [longitudinal geographic miss (LGM)]. Further, these data showed that the use of the robotic system prevented the use of extra stents in 8.3% of the cases. Measurement of lesions with robotic PCI may reduce measurement errors, need for extra stents, and LGM. Copyright © 2015 Elsevier Inc. All rights reserved.

  8. The relationship between post-traumatic ossicular injuries and conductive hearing loss: A 3D-CT study.

    Science.gov (United States)

    Maillot, Olivier; Attyé, Arnaud; Boutet, Claire; Boubagra, Kamel; Perolat, Romain; Zanolla, Marion; Grand, Sylvie; Schmerber, Sébastien; Krainik, Alexandre

    2017-09-01

    After a trauma, the conductive ossicular chain may be disrupted by ossicular luxation or fracture. Recent developments in 3D-CT allow a better understanding of ossicular injuries. In this retrospective study, we compared patients with post-traumatic conductive hearing loss (CHL) with those referred without CHL to evaluate the relationship between ossicular injuries and CHL. We also assessed the added value of 3D reconstructions over 2D-CT scans for detecting ossicular lesions in surgically managed patients. The CT scans were performed using a 40-section spiral CT scanner in 49 patients with post-traumatic CHL (n=29) and without CHL (n=20). Three radiologists performed independent blind evaluations of 2D-CT and 3D reconstructions to detect ossicular chain injury. We used the t-test to explore differences regarding the number of subjects with ossicular injury in the two groups. We also estimated the diagnostic accuracy and the inter-rater agreement of the 3D-CT reconstructions combined with the 2D-CT scan. We identified ossicular abnormality in 14 of 29 patients in the CHL group and in 1 of 20 patients in the non-CHL group. There was a significant difference regarding the number of subjects with ossicular lesions between the two groups (P≤0.01). The diagnostic sensitivity of 3D-CT reconstructions combined with 2D-CT ranged from 66% to 100% and the inter-reader agreement ranged from 0.85 to 1, depending on the type of lesion. Ossicular lesions and the presence of CHL were tightly correlated. 3D-CT reconstructions of the temporal bone are useful to assess patients in a post-traumatic context. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

  9. Therapeutic benefits of carbon dioxide (CO2) laser on single-site HPV lesions in the lower female genital tract

    Science.gov (United States)

    Urru, Giovanni; Moretti, Gianfranco

    1998-01-01

    Numerous studies have reported contradictory and variable rates of recurrent HPV lesions after various therapies. The present study therefore evaluates the effectiveness of CO2 laser vaporization in the treatment of single-site HPV lesions of the lower female genital tract, in order to confirm the view, in agreement with some findings reported in the literature, that physical therapy alone is capable of guaranteeing a high cure rate in selected patients. From January 1995 to June 1996, seventy-five female patients were treated with CO2 laser vaporization for single-site genital HPV lesions, some of which were associated with low-grade intra-epithelial neoplasia. The success rate after 12 months proved to be 97%. The pre-existing clinical symptoms disappeared in all the patients treated. No complication of the vaporization procedure was encountered.

  10. Cervical digital photography for screening of uterine cervix cancer and its precursor lesions in developing countries.

    Science.gov (United States)

    Hillmann, Elise de Castro; Dos Reis, Ricardo; Monego, Heleusa; Appel, Márcia; Hammes, Luciano Serpa; Rivoire, Waldemar Augusto; Capp, Edison

    2013-07-01

    This study aims to evaluate and compare the performance of cervical digital photography (CDP) with that of visual inspection with acetic acid (VIA) and visual inspection with Lugol's iodine (VILI) for screening for uterine cervix cancer and its precursor lesions in developing countries. A cross-sectional study was performed in Brazil. 176 women were evaluated by VIA, VILI, CDP with acetic acid and CDP with Lugol's iodine. The kappa statistic was used to estimate the interobserver and intermethod agreement. Sensitivity, specificity and diagnostic accuracy of the four methods (VIA, VILI, CDP with acetic acid, CDP with Lugol's iodine) were calculated. Interobserver agreement for CDP with acetic acid was K = 0.441 and for CDP with Lugol's iodine was K = 0.533; intermethod agreement of VIA and CDP with acetic acid was K = 0.559, and of VILI and CDP with Lugol's iodine, K = 0.507. Sensitivity and specificity of CDP with acetic acid were 84.00 and 95.83%, and of CDP with Lugol's iodine were 88.00 and 97.26%, respectively. The diagnostic accuracy of CDP with acetic acid and CDP with Lugol's iodine was 92.78 and 94.90%, respectively. This was the first study to assess the performance of CDP with Lugol's iodine, which was similar to that of CDP with acetic acid. CDP is considered a promising method for screening for uterine cervix cancer and its precursor lesions in developing countries.

  11. Comparison between biphasic helical CT and dynamic gadolinium-enhanced MR in the detection and characterization of focal hepatic lesions in cirrhotic patients; Estudio comparativo entre TC helicoidal bifásica y RM dinámica con gadolinio en la detección y caracterización de lesiones focales hepáticas en pacientes cirróticos

    Energy Technology Data Exchange (ETDEWEB)

    Puig, J; Martín, J; Donoso, L; Falco, J; Rue, M [Consorcio Hospitalario del Parc Taulí, Sabadell, Barcelona (Spain)]

    1998-12-31

    To assess the agreement between biphasic helical computerized tomography (BHCT) and dynamic gadolinium-enhanced magnetic resonance (MR) in the detection and characterization of focal hepatic lesions in a group of cirrhotic patients. A prospective study was done in 50 cirrhotic patients suspected of having hepatocarcinoma (HC) on the basis of ultrasonographic images, evaluated by means of BHCT and dynamic gadolinium-enhanced MR using fast low-angle shot (FLASH) sequences (110-135/4/90°). The images were assessed jointly by four radiologists. Between the two techniques, a total of 83 lesions were detected. MR disclosed more lesions (n=79) than BHCT (n=67) (p<0.005). Moreover, 25 of the lesions that were visible by both techniques were more clearly evident in MR images (p<0.01). MR correctly classified 6 of 7 benign lesions (86%) and 49 of 66 malignant ones (74%). BHCT correctly classified 2 of 7 benign lesions (29%) and 32 of 66 malignant ones (48%). The sensitivities of MR and BHCT for the characterization of these lesions were 74% and 48%, respectively (p=0.0009), while the respective specificities were 86% and 29% (p<0.001). Dynamic gadolinium-enhanced MR with FLASH sequences is more efficient than BHCT in the detection and characterization of focal lesions in cirrhotic patients. (Author) 37 refs.
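
    The sensitivities and specificities quoted in this record follow directly from the reported lesion counts. As a minimal sketch (the function and variable names are illustrative and not part of the record), the arithmetic can be reproduced as:

```python
# Lesion counts as reported in the abstract: 66 malignant and 7 benign
# lesions were characterized by each technique.
N_MALIGNANT = 66
N_BENIGN = 7

# Number of lesions each technique classified correctly (from the abstract).
CORRECT = {
    "MR": {"malignant": 49, "benign": 6},
    "BHCT": {"malignant": 32, "benign": 2},
}


def percent(correct: int, total: int) -> float:
    """Return correct/total as a percentage."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * correct / total


def summarize(method: str) -> dict:
    """Sensitivity = correctly classified malignant lesions / all malignant.
    Specificity = correctly classified benign lesions / all benign."""
    counts = CORRECT[method]
    return {
        "sensitivity": percent(counts["malignant"], N_MALIGNANT),
        "specificity": percent(counts["benign"], N_BENIGN),
    }


if __name__ == "__main__":
    for method in CORRECT:
        result = summarize(method)
        print(method, {name: round(value) for name, value in result.items()})
```

    With only 7 benign lesions, each specificity estimate rests on a very small denominator, so a single reclassified lesion shifts it by about 14 percentage points.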

  12. Intra-Rater, Inter-Rater and Test-Retest Reliability of an Instrumented Timed Up and Go (iTUG) Test in Patients with Parkinson's Disease.

    Directory of Open Access Journals (Sweden)

    Rob C van Lummel

    The "Timed Up and Go" (TUG) is a widely used measure of physical functioning in older people and in neurological populations, including Parkinson's Disease. When using an inertial sensor measurement system (instrumented TUG [iTUG]), the individual components of the iTUG and the trunk kinematics can be measured separately, which may provide relevant additional information. The aim of this study was to determine intra-rater, inter-rater and test-retest reliability of the iTUG in patients with Parkinson's Disease. Twenty-eight PD patients, aged 50 years or older, were included. For the iTUG the DynaPort Hybrid (McRoberts, The Hague, The Netherlands) was worn at the lower back. The device measured acceleration and angular velocity in three directions at a rate of 100 samples/s. Patients performed the iTUG five times on two consecutive days. Repeated measurements by the same rater on the same day were used to calculate intra-rater reliability. Repeated measurements by different raters on the same day were used to calculate intra-rater and inter-rater reliability. Repeated measurements by the same rater on different days were used to calculate test-retest reliability. Nineteen ICC values (15%) were ≥ 0.9, which is considered excellent reliability. Sixty-four ICC values (49%) were ≥ 0.70 and < 0.90, which is considered good reliability. Thirty-one ICC values (24%) were ≥ 0.50 and < 0.70, indicating moderate reliability. Sixteen ICC values (12%) were ≥ 0.30 and < 0.50, indicating poor reliability. Two ICC values (2%) were < 0.30, indicating very poor reliability. In conclusion, in patients with Parkinson's disease the intra-rater, inter-rater, and test-retest reliability of the individual components of the instrumented TUG (iTUG) was excellent to good for total duration and for turning durations, and good to low for the sub durations and for the kinematics of the SiSt and StSi. The results of this fully automated analysis of instrumented TUG movements
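
    The reliability bands used in this record can be written down as a small lookup. The sketch below assumes only the cut-offs stated in the abstract (0.90, 0.70, 0.50, 0.30); the function name and the sample ICC values are hypothetical and are not study data.

```python
from collections import Counter


def icc_category(icc: float) -> str:
    """Map an ICC value to the reliability band named in the abstract."""
    if icc >= 0.90:
        return "excellent"
    if icc >= 0.70:
        return "good"
    if icc >= 0.50:
        return "moderate"
    if icc >= 0.30:
        return "poor"
    return "very poor"


def tally(icc_values) -> Counter:
    """Count how many ICC values fall in each band."""
    return Counter(icc_category(value) for value in icc_values)


if __name__ == "__main__":
    # Hypothetical ICC values, for illustration only.
    example = [0.95, 0.88, 0.72, 0.61, 0.45, 0.12]
    for band, count in tally(example).items():
        print(f"{band}: {count}")
```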

  13. Detection of retinal lesions in diabetic retinopathy: comparative evaluation of 7-field digital color photography versus red-free photography.

    Science.gov (United States)

    Venkatesh, Pradeep; Sharma, Reetika; Vashist, Nagender; Vohra, Rajpal; Garg, Satpal

    2015-10-01

    Red-free light allows better detection of vascular lesions as this wavelength is absorbed by hemoglobin; however, the current gold standard for the detection and grading of diabetic retinopathy remains 7-field color fundus photography. The goal of this study was to compare the ability of 7-field fundus photography using red-free light to detect retinopathy lesions with corresponding images captured using standard 7-field color photography. Non-stereoscopic standard 7-field 30° digital color fundus photography and 7-field 30° digital red-free fundus photography were performed in 200 eyes of 103 patients with various grades of diabetic retinopathy ranging from mild to moderate non-proliferative diabetic retinopathy to proliferative diabetic retinopathy. The color images (n = 1,400) were studied with corresponding red-free images (n = 1,400) by one retina consultant (PV) and two senior residents training in retina. The various retinal lesions [microaneurysms, hemorrhages, hard exudates, soft exudates, intra-retinal microvascular anomalies (IRMA), neovascularization of the retina elsewhere (NVE), and neovascularization of the disc (NVD)] detected by all three observers in each of the photographs were noted, followed by determination of agreement scores using κ values (range 0-1). The kappa coefficient was categorized as poor (≤0), slight (0.01-0.20), fair (0.21-0.40), moderate (0.41-0.60), substantial (0.61-0.80), and almost perfect (0.81-1). The number of lesions detected by red-free images alone was higher for all observers and all abnormalities except hard exudates. Detection of IRMA was especially higher for all observers with red-free images. Between image pairs, there was substantial agreement for detection of hard exudates (average κ = 0.62, range 0.60-0.65) and moderate agreement for detection of hemorrhages (average κ = 0.52, range 0.45-0.58), soft exudates (average κ = 0.51, range 0.42-0.61), NVE (average κ = 0.47, range 0.39-0.53), and NVD
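
    To make the κ values in this record concrete, here is a minimal sketch of Cohen's kappa for one pair of raters, together with the agreement bands listed in the abstract. The abstract does not state which kappa variant was used, so Cohen's kappa is an assumption; the function names and the ratings are hypothetical.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b) -> float:
    """Cohen's kappa: (p_o - p_e) / (1 - p_e) for two raters."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(rater_a)
    # Observed agreement: share of items given the same label by both raters.
    p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement from each rater's marginal label frequencies.
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    labels = set(freq_a) | set(freq_b)
    p_e = sum(freq_a[label] * freq_b[label] for label in labels) / (n * n)
    if p_e == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (p_o - p_e) / (1.0 - p_e)


def kappa_band(kappa: float) -> str:
    """Agreement band, using the cut-offs listed in the abstract."""
    if kappa <= 0.0:
        return "poor"
    if kappa <= 0.20:
        return "slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


if __name__ == "__main__":
    # Hypothetical lesion present (1) / absent (0) calls on ten images.
    color = [1, 1, 0, 0, 1, 0, 1, 0, 1, 1]
    red_free = [1, 0, 0, 0, 1, 0, 1, 1, 1, 1]
    kappa = cohen_kappa(color, red_free)
    print(f"kappa = {kappa:.2f} ({kappa_band(kappa)})")
```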

  14. Inter- and intra-observer variability associated with the use of the Mirels' scoring system for metastatic bone lesions.

    LENUS (Irish Health Repository)

    Mac Niocaill, Ruairi F

    2011-01-01

    Metastatic bone disease is increasing in association with ever-improving medical management of osteophilic malignant conditions. The precise timing of surgical intervention for secondary lesions in long bones can be difficult to determine. This paper aims to evaluate a classic scoring system. All radiographs were examined twice by three orthopaedic oncologists and scored according to the Mirels' scoring system. The Kappa statistic was used for the purpose of statistical analysis. The results show agreement between observers (κ = 0.35-0.61) for overall scores at the two time intervals. Inter-observer agreement was also seen with subset analysis of size (κ = 0.27-0.60), site (κ = 0.77-1.0) and nature of the lesion (κ = 0.55-0.81). Similarly, low levels of intra-observer variability were noted for each of the three surgeons (κ = 0.34, 0.39, and 0.78, respectively). These results indicate a reliable, repeatable assessment of bony metastases. We continue to advocate its use in the management of patients with long bone metastases.

  15. QNOTE: an instrument for measuring the quality of EHR clinical notes.

    Science.gov (United States)

    Burke, Harry B; Hoang, Albert; Becher, Dorothy; Fontelo, Paul; Liu, Fang; Stephens, Mark; Pangaro, Louis N; Sessums, Laura L; O'Malley, Patrick; Baxi, Nancy S; Bunt, Christopher W; Capaldi, Vincent F; Chen, Julie M; Cooper, Barbara A; Djuric, David A; Hodge, Joshua A; Kane, Shawn; Magee, Charles; Makary, Zizette R; Mallory, Renee M; Miller, Thomas; Saperstein, Adam; Servey, Jessica; Gimbel, Ronald W

    2014-01-01

    The outpatient clinical note documents the clinician's information collection, problem assessment, and patient management, yet there is currently no validated instrument to measure the quality of the electronic clinical note. This study evaluated the validity of the QNOTE instrument, which assesses 12 elements in the clinical note, for measuring the quality of clinical notes. It also compared its performance with a global instrument that assesses the clinical note as a whole. Retrospective multicenter blinded study of the clinical notes of 100 outpatients with type 2 diabetes mellitus who had been seen in clinic on at least three occasions. The 300 notes were rated by eight general internal medicine and eight family medicine practicing physicians. The QNOTE instrument scored the quality of the note as the sum of a set of 12 note element scores, and its inter-rater agreement was measured by the intraclass correlation coefficient. The Global instrument scored the note in its entirety, and its inter-rater agreement was measured by the Fleiss κ. The overall QNOTE inter-rater agreement was 0.82 (CI 0.80 to 0.84), and its note quality score was 65 (CI 64 to 66). The Global inter-rater agreement was 0.24 (CI 0.19 to 0.29), and its note quality score was 52 (CI 49 to 55). The QNOTE quality scores were consistent, and the overall QNOTE score was significantly higher than the overall Global score (p=0.04). We found the QNOTE to be a valid instrument for evaluating the quality of electronic clinical notes, and its performance was superior to that of the Global instrument.
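
    For readers unfamiliar with the Fleiss κ used for the Global instrument, the following is a minimal sketch of the standard calculation. The function name and the tiny ratings table are hypothetical and are not taken from the study; each row is one rated item and each column holds the number of raters who chose that category.

```python
def fleiss_kappa(table) -> float:
    """Fleiss' kappa for a table of per-item category counts.

    Every row must sum to the same number of raters (at least 2).
    """
    if not table or not table[0]:
        raise ValueError("table must be non-empty")
    n_items = len(table)
    n_raters = sum(table[0])
    if n_raters < 2 or any(sum(row) != n_raters for row in table):
        raise ValueError("each item needs the same number (>= 2) of ratings")
    n_categories = len(table[0])

    # Share of all ratings that fell into each category.
    p_cat = [
        sum(row[j] for row in table) / (n_items * n_raters)
        for j in range(n_categories)
    ]
    # Per-item agreement: proportion of rater pairs that agree on the item.
    p_item = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in table
    ]
    p_bar = sum(p_item) / n_items        # mean observed agreement
    p_exp = sum(p * p for p in p_cat)    # agreement expected by chance
    if p_exp == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (p_bar - p_exp) / (1.0 - p_exp)


if __name__ == "__main__":
    # Hypothetical: 4 notes, 3 raters, categories (poor, fair, good).
    ratings = [
        [3, 0, 0],
        [0, 2, 1],
        [0, 0, 3],
        [1, 1, 1],
    ]
    print(f"Fleiss kappa = {fleiss_kappa(ratings):.3f}")
```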

  16. Cone-beam computed tomography versus digital periapical radiography in the detection of artificially created periapical lesions: A pilot study of the diagnostic accuracy of endodontists using both techniques

    Energy Technology Data Exchange (ETDEWEB)

    Campello, Andrea Fagundes; Goncalves, Lucio Souza; Marques, Fabio Vidal [Faculty of Dentistry, Estacio de Sa University, Rio de Janeiro (Brazil)]; Guedes, Fabio Ribeiro [Faculty of Dentistry, Federal University of Rio de Janeiro, Rio de Janeiro (Brazil)]

    2017-03-15

    The aim of this study was to compare the diagnostic accuracy of previously trained endodontists in the detection of artificially created periapical lesions using cone-beam computed tomography (CBCT) and digital periapical radiography (DPR). An ex vivo model using dry skulls was used, in which simulated apical lesions were created and then progressively enlarged using no.1/2, no.2, no.4, and no.6 round burs. A total of 11 teeth were included in the study, and 110 images were obtained with CBCT and with an intraoral digital periapical radiographic sensor (Instrumentarium dental, Tuusula, Finland) initially and after each bur was used. Specificity and sensitivity were calculated. All images were evaluated by 10 previously trained, certified endodontists. Agreement was calculated using the kappa coefficient. The accuracy of each method in detecting apical lesions was compared using the chi-square test. The kappa coefficient between examiners showed low agreement (range, 0.17-0.64). No statistical difference was found between CBCT and DPR in teeth without apical lesions (P=.15). The accuracy of CBCT was significantly higher than that of DPR in all corresponding simulated lesions (P<.001). The correct diagnostic rate for CBCT ranged between 56.9% and 73.6%. The greatest difference between CBCT and DPR was seen in the maxillary teeth (CBCT, 71.4%; DPR, 28.6%; P<.01) and multi-rooted teeth (CBCT, 83.3%; DPR, 33.3%; P<.01). CBCT was more accurate than DPR in detecting all simulated lesions tested. Endodontists need to be properly trained in interpreting CBCT scans to achieve higher diagnostic accuracy.

  17. Evaluation of the female pelvic floor in pelvic organ prolapse using 3.0-Tesla diffusion tensor imaging and fibre tractography

    Energy Technology Data Exchange (ETDEWEB)

    Zijta, F.M. [University of Amsterdam, Department of Radiology, Academic Medical Centre, Amsterdam (Netherlands); Onze Lieve Vrouwe Gasthuis, Amsterdam and Department of Radiology, Amsterdam (Netherlands); Academic Medical Center, Department of Radiology, Amsterdam, AZ (Netherlands); Lakeman, M.M.E.; Roovers, J.P. [University of Amsterdam the Netherlands and Biomedical NMR, Amsterdam and Department of Gynaecology, Academic Medical Centre, Amsterdam (Netherlands); Froeling, M. [University of Amsterdam, Department of Radiology, Academic Medical Centre, Amsterdam (Netherlands); Eindhoven University of Technology, Department of Biomedical Engineering, Eindhoven (Netherlands); Paardt, M.P. van der; Borstlap, C.S.V.; Bipat, S.; Nederveen, A.J.; Stoker, J. [University of Amsterdam, Department of Radiology, Academic Medical Centre, Amsterdam (Netherlands); Montauban van Swijndregt, A.D. [Onze Lieve Vrouwe Gasthuis, Amsterdam and Department of Radiology, Amsterdam (Netherlands); Strijkers, G.J. [Eindhoven University of Technology, Department of Biomedical Engineering, Eindhoven (Netherlands)

    2012-12-15

    To prospectively explore the clinical application of diffusion tensor imaging (DTI) and fibre tractography in evaluating the pelvic floor. Ten patients with pelvic organ prolapse, ten with pelvic floor symptoms and ten asymptomatic women were included. A two-dimensional (2D) spin-echo (SE) echo-planar imaging (EPI) sequence of the pelvic floor was acquired. Offline fibre tractography and morphological analysis of pelvic magnetic resonance imaging (MRI) were performed. Inter-rater agreement for quality assessment of fibre tracking results was evaluated using weighted kappa (κ). From agreed tracking results, eigenvalues (λ1, λ2, λ3), mean diffusivity (MD) and fractional anisotropy (FA) were calculated. MD and FA values were compared using ANOVA. Inter-rater reliability of DTI parameters was interpreted using the intra-class correlation coefficient (ICC). Substantial inter-rater agreement was found (κ = 0.71 [95% CI 0.63-0.78]). Four anatomical structures were reliably identified. Substantial inter-rater agreement was found for MD and FA (ICC 0.60-0.91). No significant differences between groups were observed for anal sphincter, perineal body and puboperineal muscle. A significant difference in FA was found for internal obturator muscle between the prolapse group and the asymptomatic group (0.27 ± 0.05 vs 0.22 ± 0.03; P = 0.015). DTI with fibre tractography permits identification of part of the clinically relevant pelvic structures. Overall, no significant differences in DTI parameters were found between groups. Diffusion tensor MRI offers new insights into female pelvic floor problems. (orig.)
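
    The abstract derives MD and FA from the three tensor eigenvalues without stating the formulas. The sketch below uses the standard textbook definitions, which is an assumption about the authors' processing; the function names and the example eigenvalues are hypothetical.

```python
import math


def mean_diffusivity(l1: float, l2: float, l3: float) -> float:
    """MD is the arithmetic mean of the three eigenvalues."""
    return (l1 + l2 + l3) / 3.0


def fractional_anisotropy(l1: float, l2: float, l3: float) -> float:
    """FA = sqrt(3/2) * ||lambda - MD|| / ||lambda||, in the range [0, 1]."""
    md = mean_diffusivity(l1, l2, l3)
    deviation = (l1 - md) ** 2 + (l2 - md) ** 2 + (l3 - md) ** 2
    magnitude = l1 ** 2 + l2 ** 2 + l3 ** 2
    if magnitude == 0.0:
        return 0.0  # no diffusion signal at all: treat as isotropic
    return math.sqrt(1.5 * deviation / magnitude)


if __name__ == "__main__":
    # Hypothetical eigenvalues in units of 10^-3 mm^2/s, illustration only.
    eigenvalues = (1.9, 1.4, 1.1)
    print(f"MD = {mean_diffusivity(*eigenvalues):.3f}")
    print(f"FA = {fractional_anisotropy(*eigenvalues):.3f}")
```

    FA is 0 when all three eigenvalues are equal (isotropic diffusion) and approaches 1 when diffusion occurs along a single direction.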

  18. The Orientation of Gastric Biopsy Samples Improves the Inter-observer Agreement of the OLGA Staging System.

    Science.gov (United States)

    Cotruta, Bogdan; Gheorghe, Cristian; Iacob, Razvan; Dumbrava, Mona; Radu, Cristina; Bancila, Ion; Becheanu, Gabriel

    2017-12-01

    Evaluation of severity and extension of gastric atrophy and intestinal metaplasia is recommended to identify subjects with a high risk for gastric cancer. The inter-observer agreement for the assessment of gastric atrophy is reported to be low. The aim of the study was to evaluate the inter-observer agreement for the assessment of severity and extension of gastric atrophy using oriented and unoriented gastric biopsy samples. Furthermore, the quality of biopsy specimens in oriented and unoriented samples was analyzed. A total of 35 subjects with dyspeptic symptoms who were referred for gastrointestinal endoscopy and agreed to enter the study were prospectively enrolled. The OLGA/OLGIM gastric biopsies protocol was used. From each subject two sets of biopsies were obtained (four from the antrum, two oriented and two unoriented; two from the gastric incisure, one oriented and one unoriented; four from the gastric body, two oriented and two unoriented). The orientation of the biopsy samples was completed using nitrocellulose filters (Endokit®, BioOptica, Milan, Italy). The samples were blindly examined by two experienced pathologists. Inter-observer agreement was evaluated using the kappa statistic for inter-rater agreement. The quality of histopathology specimens, taking into account the identification of lamina propria, was analyzed in oriented vs. unoriented samples. The samples with detectable lamina propria mucosae were defined as good quality specimens. Categorical data were analyzed using the chi-square test and a two-sided p value <0.05 was considered statistically significant. A total of 350 biopsy samples were analyzed (175 oriented / 175 unoriented). The kappa index values for oriented/unoriented OLGA 0/I/II/III and IV stages were 0.62/0.13, 0.70/0.20, 0.61/0.06, 0.62/0.46, and 0.77/0.50, respectively. For OLGIM 0/I/II/III stages the kappa index values for oriented/unoriented samples were 0.83/0.83, 0.88/0.89, 0.70/0.88 and 0.83/1, respectively. No case of OLGIM IV

  19. Ghost cell lesions

    Directory of Open Access Journals (Sweden)

    E Rajesh

    2015-01-01

    Ghost cells have been a controversy for a long time. A ghost cell is a swollen/enlarged epithelial cell with eosinophilic cytoplasm, but without a nucleus. In routine H and E staining these cells give a shadowy appearance. Hence these cells are also called shadow cells or translucent cells. The appearance of these cells varies from lesion to lesion, involving odontogenic and nonodontogenic lesions. This article reviews the origin, nature and significance of ghost cells in different neoplasms.

  20. Critical evaluation of the specificity of MRI and TVUS for differentiation of malignant from benign adnexal lesions

    International Nuclear Information System (INIS)

    Reuter, M.; Steffens, J.C.; Muhle, C.; Brinkmann, G.; Spielmann, R.P.; Heller, M.; Schueppler, U.; Weisner, D.; Luettges, J.; Maschek, A.

    1998-01-01

    The aim of our work was to study the specificity of MRI in comparison with transvaginal US for differentiation of malignant from benign adnexal lesions. A total of 67 patients with clinically suspicious adnexal lesions were evaluated by MRI. Transaxial and coronal images were acquired using T1-weighted sequences before and following IV contrast and T2-weighted sequences. In all patients transvaginal ultrasound examinations (TVUS) were performed. For both imaging modalities each lesion was classified separately as either benign or malignant according to previously published criteria. Pathologic findings were available in 65 cases. Both MRI and TVUS correctly classified the 12 malignant lesions (sensitivity 100 %). Specificity (MRI: 78.2 %, TVUS: 65.5 %) and accuracy (MRI: 82 %, TVUS: 71.6 %) were higher with MRI than with TVUS, but the differences were not statistically significant (p = 0.18 and p = 0.20, chi-square test). There was agreement/disagreement between findings of MRI and US in 52/15 lesions. The macroscopic criteria for malignancy are nonspecific and limit the specificity of both MRI and TVUS. The MRI technique is a valuable adjunct to TVUS by enabling further clarification of adnexal tumors with equivocal complex or solid vaginal sonographic findings. (orig.)

  1. Interobserver variability of ultrasound elastography and the ultrasound BI-RADS lexicon of breast lesions.

    Science.gov (United States)

    Park, Chang Suk; Kim, Sung Hun; Jung, Na Young; Choi, Jae Jung; Kang, Bong Joo; Jung, Hyun Seouk

    2015-03-01

    Elastography is a newly developed noninvasive imaging technique that uses ultrasound (US) to evaluate tissue stiffness. The interpretation of the same elastographic images may vary between reviewers. Because breast lesions are usually reported according to American College of Radiology Breast Imaging Reporting and Data System (ACR BI-RADS) lexicons and final category, we compared observer variability between the lexicons and final categorization of US BI-RADS and the elasticity score of US elastography. From April 2009 to February 2010, 1356 breast lesions in 1330 patients underwent ultrasound-guided core biopsy. Among them, 63 breast lesions in 55 patients (mean age, 45.7 years; range, 21-79 years) underwent both conventional ultrasound and elastography and were included in this study. Two radiologists independently performed conventional ultrasound and elastography, and another three observers reviewed conventional ultrasound images and elastography videos. Observers independently recorded the elasticity score on the 5-point scoring system proposed by Itoh et al., and the BI-RADS lexicons and final category using ultrasound BI-RADS. The histopathologic results were obtained and used as the reference standard. Interobserver variability was evaluated. Of the 63 lesions, 42 (66.7 %) were benign, and 21 (33.3 %) were malignant. The highest concordance among all variables was achieved for the elasticity score (k = 0.59), followed by shape (k = 0.54), final category (k = 0.48), posterior acoustic features (k = 0.44), and echogenicity and orientation (k = 0.43). The lowest concordances were for margin (k = 0.26), lesion boundary (k = 0.29) and calcification (k = 0.3). Elasticity score showed a higher level of interobserver agreement for the diagnosis of breast lesions than BI-RADS lexicons and final category.

  2. Agreement between physicians and non-physician clinicians in starting antiretroviral therapy in rural Uganda

    Directory of Open Access Journals (Sweden)

    Vasan Ashwin

    2009-08-01

    Background: The scarcity of physicians in sub-Saharan Africa – particularly in rural clinics staffed only by non-physician health workers – is constraining access to HIV treatment, as only physicians are legally allowed to start antiretroviral therapy in the HIV-positive patient. Here we present a pilot study from Uganda assessing agreement between non-physician clinicians (nurses and clinical officers) and physicians in their decisions as to whether to start therapy. Methods: We conducted the study at 12 government antiretroviral therapy sites in three regions of Uganda, all of which had staff trained in delivery of antiretroviral therapy using the WHO Integrated Management of Adult and Adolescent Illness guidelines for chronic HIV care. We collected seven key variables to measure patient assessment and the decision as to whether to start antiretroviral therapy, the primary variable of interest being the Final Antiretroviral Therapy Recommendation. Patients saw either a clinical officer or nurse first, and then were screened identically by a blinded physician during the same clinic visit. We measured inter-rater agreement between the decisions of the non-physician health workers and physicians in the antiretroviral therapy assessment variables using simple and weighted Kappa analysis. Results: Two hundred fifty-four patients were seen by a nurse and physician, while 267 were seen by a clinical officer and physician. The majority (> 50%) in each arm of the study were in World Health Organization Clinical Stages I and II and therefore not currently eligible for antiretroviral therapy according to national antiretroviral therapy guidelines. Nurses and clinical officers both showed moderate to almost perfect agreement with physicians in their Final Antiretroviral Therapy Recommendation (unweighted κ = 0.59 and κ = 0.91, respectively). Agreement was also substantial for nurses versus physicians for assigning World Health Organization Clinical

  3. Detection of infarct lesions from single MRI modality using inconsistency between voxel intensity and spatial location--a 3-D automatic approach.

    Science.gov (United States)

    Shen, Shan; Szameitat, André J; Sterr, Annette

    2008-07-01

    Detection of infarct lesions using traditional segmentation methods is always problematic due to intensity similarity between lesions and normal tissues, so multispectral MRI modalities were often employed for this purpose. However, the high costs of MRI scans and the severity of patient conditions restrict the collection of multiple images. Therefore, in this paper, a new 3-D automatic lesion detection approach was proposed, which required only a single type of anatomical MRI scan. It was developed on a theory that, when lesions were present, the voxel-intensity-based segmentation and the spatial-location-based tissue distribution should be inconsistent in the regions of lesions. The degree of this inconsistency was calculated, which indicated the likelihood of tissue abnormality. Lesions were identified when the inconsistency exceeded a defined threshold. In this approach, the intensity-based segmentation was implemented by the conventional fuzzy c-means (FCM) algorithm, while the spatial location of tissues was provided by prior tissue probability maps. The use of simulated MRI lesions allowed us to quantitatively evaluate the performance of the proposed method, as the size and location of lesions were prespecified. The results showed that our method effectively detected lesions with 40-80% signal reduction compared to normal tissues (similarity index > 0.7). The capability of the proposed method in practice was also demonstrated on real infarct lesions from 15 stroke patients, where the lesions detected were in broad agreement with true lesions. Furthermore, a comparison to a statistical segmentation approach presented in the literature suggested that our 3-D lesion detection approach was more reliable. Future work will focus on adapting the current method to multiple sclerosis lesion detection.
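
    The abstract reports a "similarity index > 0.7" without defining it. A common choice for comparing a detected lesion mask with a reference mask is the Dice coefficient, so the sketch below assumes that definition; this is an assumption rather than something the record confirms, and the function name and voxel coordinates are hypothetical.

```python
def dice_similarity(detected, reference) -> float:
    """Dice coefficient 2|A ∩ B| / (|A| + |B|) for two sets of voxel indices."""
    detected, reference = set(detected), set(reference)
    if not detected and not reference:
        return 1.0  # two empty masks are treated as identical
    overlap = len(detected & reference)
    return 2.0 * overlap / (len(detected) + len(reference))


if __name__ == "__main__":
    # Hypothetical (x, y, z) voxel coordinates, illustration only.
    detected_voxels = {(10, 12, 5), (10, 13, 5), (11, 12, 5), (11, 13, 5)}
    true_voxels = {(10, 13, 5), (11, 12, 5), (11, 13, 5), (12, 13, 5)}
    score = dice_similarity(detected_voxels, true_voxels)
    print(f"similarity index = {score:.2f}")
```

    Representing masks as sets of coordinates keeps the sketch dependency-free; with array-based masks the same ratio would be computed from element-wise logical operations.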

  4. Oral White Lesions: Presentation and Comparison of Oral Submucous Fibrosis with Other Lesions

    International Nuclear Information System (INIS)

    Maqsood, A.; Aman, N.

    2013-01-01

    Objective: To compare oral submucous fibrosis with other white oral lesions for presentation and associated factors. Study Design: Cross-sectional study. Place and Duration of Study: The Departments of Oral Medicine and Oral and Maxillofacial Surgery, Dr. Ishrat-ul-Ibad Institute of Oral Health Sciences (DIKIOHS), Karachi, from May 2008 to May 2009. Methodology: Patients presenting with oral white lesions were selected by consecutive non-purposive sampling and clinico-demographic data were collected. For patients with oral submucous fibrosis (OSF), additional information such as duration of habits, maximal incisal opening (MIO), and presence of any other associated lesion was noted. OSF was compared with other white lesions for any association between characteristics of subjects. Chi-square and independent t-tests were used to determine statistical significance at p < 0.05. Results: OSF was present in 59.6% (n = 106) of the 178 patients; other white lesions were 40.4% (n = 72). The mean age was 34 ± 12.7 years in patients with OSF and 45.81 ± 16.2 years in patients with other white lesions (p < 0.0001). Items containing areca nut were consumed significantly more by patients with OSF than by patients with other white lesions (p < 0.0001). Conclusion: OSF was the predominant white lesion in patients examined at DIKIOHS. Areca nut was found to be chewed more by patients with OSF and still longer by patients with SCC. (author)

  5. Establishing the distribution of satellite lesions in intermediate- and high-risk prostate cancer: implications for focused radiotherapy.

    Science.gov (United States)

    Hegde, J V; Margolis, D J; Wang, P-C; Reiter, R E; Huang, J; Steinberg, M L; Kamrava, M

    2017-06-01

    In focused radiotherapy for prostate cancer (PC), a full dose of radiation is delivered to the index lesion while reduced dose is delivered to the remaining prostate to reduce morbidity. As PC is commonly multifocal, we investigated whether baseline clinical characteristics or multiparametric magnetic resonance imaging (mpMRI) may be useful to predict the actual pathologic distribution of PC in men with intermediate- or high-risk PC, which may better inform how to deliver focused radiotherapy. A retrospective single-institutional study was performed on 71 consecutive men with clinically localized, intermediate- or high-risk PC who underwent mpMRI followed by radical prostatectomy (RP) from January 2012 to December 2012. Logistic regression analysis was performed to evaluate preoperative predictors for satellite lesions. Performance characteristics of mpMRI to detect satellite lesions and the extent of prostate disease (one hemi-gland vs both) were also evaluated. In all, 50.7% had satellite lesions on mpMRI. On RP specimen analysis, 66.2% had satellite lesions and 55.3% of these satellite lesions had pathologic Gleason score (pGS)⩾3+4. The sensitivity, specificity, positive predictive value, negative predictive value and accuracy for mpMRI detecting a satellite lesion being present in the RP specimen were 59.6%, 66.7%, 77.8%, 45.7% and 62.0%, respectively. The presence of MRI satellite lesions was the only preoperative predictor significantly associated with finding satellite lesions on final pathology (hazard ratio (HR), 2.95, P=0.040). There was agreement in 76.1% of the entire cohort for unilateral vs bilateral disease when incorporating both biopsy and mpMRI information and comparing with the RP specimen. In intermediate risk or greater PC, only the presence of mpMRI satellite lesions could predict for pathologic satellite lesions. While combining biopsy and mpMRI information may improve preoperative disease localization, the relatively high incidence of
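
    The five performance figures in this record are mutually consistent with a single 2x2 table. The counts below are not reported in the abstract; they are inferred here from the stated cohort size (71 men) and percentages, and should be read as an assumption. The sketch (class and property names are illustrative) shows how each figure follows from such a table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionTable:
    """2x2 table of mpMRI finding vs. pathology (satellite lesion present)."""
    tp: int  # mpMRI positive, pathology positive
    fn: int  # mpMRI negative, pathology positive
    fp: int  # mpMRI positive, pathology negative
    tn: int  # mpMRI negative, pathology negative

    @property
    def sensitivity(self) -> float:
        return self.tp / (self.tp + self.fn)

    @property
    def specificity(self) -> float:
        return self.tn / (self.tn + self.fp)

    @property
    def ppv(self) -> float:
        return self.tp / (self.tp + self.fp)

    @property
    def npv(self) -> float:
        return self.tn / (self.tn + self.fn)

    @property
    def accuracy(self) -> float:
        return (self.tp + self.tn) / (self.tp + self.fn + self.fp + self.tn)


# Inferred counts (assumption, not reported): 47 of 71 men had pathologic
# satellite lesions and 36 of 71 had satellite lesions on mpMRI.
INFERRED = ConfusionTable(tp=28, fn=19, fp=8, tn=16)

if __name__ == "__main__":
    for name in ("sensitivity", "specificity", "ppv", "npv", "accuracy"):
        print(f"{name}: {100 * getattr(INFERRED, name):.1f}%")
```

    Because PPV and NPV depend on how common satellite lesions are in the cohort, these two values would not transfer directly to a population with a different prevalence.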

  6. Detection of organ dysfunction by hypotension and/or hyperlactemia in septic patients

    DEFF Research Database (Denmark)

    Nissen, Janet Yde; Dynesen, Jens Jacob Østergaard; Pedersen, Marie Kristine Jessen

    physician documented a suspicion/confirmation of infection within the first 24h of admission. Diagnoses were confirmed by expert proof reading, and calculation of inter-rater agreement. Severe sepsis-defining organ dysfunction cut-offs were adapted from SSC-2012 (Gold Standard). “Simple tool” hypotension...... fulfilling the inclusion criteria. 494 patients (51%) were suspected to have infection within the first 24h of admission. Inter-rater agreement regarding suspected infection was 81% (pSimple tool” detected 73...... Background: The definitions of sepsis were updated in February 2016 [1]; organ dysfunctions remain the turning point between “simple infection” and sepsis (previously severe sepsis). Hypotension and hyperlactatemia define two of many organ dysfunctions presented in the most recent Surviving Sepsis...

  7. Imaging review of lipomatous musculoskeletal lesions

    Directory of Open Access Journals (Sweden)

    Burt Ashley M.

    2017-01-01

    Lipomatous lesions are common musculoskeletal lesions that can arise within the soft tissues, bone, neurovascular structures, and synovium. The majority of these lesions are benign, and many of the benign lesions can be diagnosed by radiologic evaluation. However, radiologic differences between benign and malignant lipomatous lesions may be subtle and pathologic correlation is often needed. Sonography, computed tomography (CT), and magnetic resonance imaging (MRI) are useful not only in portraying fat within the lesion, but also for evaluating the presence and extent of soft tissue components. Lipomas make up most soft tissue lipomatous lesions, but careful evaluation must be performed to distinguish these lesions from a low-grade liposarcoma. In addition to the imaging appearance, the location of the lesion and the patient demographics can be utilized to help diagnose other soft tissue lipomatous lesions, such as elastofibroma dorsi, angiolipoma, lipoblastoma, and hibernoma. Osseous lipomatous lesions such as parosteal lipoma and intraosseous lipoma occur less commonly than their soft tissue counterparts, but are also benign. Neurovascular and synovial lipomatous lesions are much rarer but demonstrate more classic radiologic findings, particularly on MRI. A review of the clinical, radiologic, and pathologic characteristics of these lesions is presented.

  8. Quantitative performance evaluation of 124I PET/MRI lesion dosimetry in differentiated thyroid cancer

    Science.gov (United States)

    Wierts, R.; Jentzen, W.; Quick, H. H.; Wisselink, H. J.; Pooters, I. N. A.; Wildberger, J. E.; Herrmann, K.; Kemerink, G. J.; Backes, W. H.; Mottaghy, F. M.

    2018-01-01

    The aim was to investigate the quantitative performance of 124I PET/MRI for pre-therapy lesion dosimetry in differentiated thyroid cancer (DTC). Phantom measurements were performed on a PET/MRI system (Biograph mMR, Siemens Healthcare) using 124I and 18F. The PET calibration factor and the influence of radiofrequency coil attenuation were determined using a cylindrical phantom homogeneously filled with radioactivity. The calibration factor was 1.00 ± 0.02 for 18F and 0.88 ± 0.02 for 124I. Near the radiofrequency surface coil an underestimation of less than 5% in radioactivity concentration was observed. Soft-tissue sphere recovery coefficients were determined using the NEMA IEC body phantom. Recovery coefficients were systematically higher for 18F than for 124I. In addition, the six spheres of the phantom were segmented using a PET-based iterative segmentation algorithm. For all 124I measurements, the deviations in segmented lesion volume and mean radioactivity concentration relative to the actual values were smaller than 15% and 25%, respectively. The effect of MR-based attenuation correction (three- and four-segment µ-maps) on bone lesion quantification was assessed using radioactive spheres filled with a K2HPO4 solution mimicking bone lesions. The four-segment µ-map resulted in an underestimation of the imaged radioactivity concentration of up to 15%, whereas the three-segment µ-map resulted in an overestimation of up to 10%. For twenty lesions identified in six patients, a comparison of 124I PET/MRI to PET/CT was performed with respect to segmented lesion volume and radioactivity concentration. The interclass correlation coefficients showed excellent agreement in segmented lesion volume and radioactivity concentration (0.999 and 0.95, respectively). In conclusion, it is feasible that accurate quantitative 124I PET/MRI could be used to perform radioiodine pre-therapy lesion dosimetry in DTC.

  9. Inter-rater reliability of the Sødring Motor Evaluation of Stroke patients (SMES).

    Science.gov (United States)

    Halsaa, K E; Sødring, K M; Bjelland, E; Finsrud, K; Bautz-Holter, E

    1999-12-01

    The Sødring Motor Evaluation of Stroke patients is an instrument for physiotherapists to evaluate motor function and activities in stroke patients. The rating reflects quality as well as quantity of the patient's unassisted performance within three domains: leg, arm and gross function. The inter-rater reliability of the method was studied in a sample of 30 patients admitted to a stroke rehabilitation unit. Three therapists were involved in the study; two therapists assessed the same patient on two consecutive days in a balanced design. Cohen's weighted kappa and McNemar's test of symmetry were used as measures of item reliability, and the intraclass correlation coefficient was used to express the reliability of the sumscores. For 24 out of 32 items the weighted kappa statistic was excellent (0.75-0.98), while 7 items had a kappa statistic within the range 0.53-0.74 (fair to good). The reliability of one item was poor (0.13). The intraclass correlation coefficient for the three sumscores was 0.97, 0.91 and 0.97. We conclude that the Sødring Motor Evaluation of Stroke patients is a reliable measure of motor function in stroke patients undergoing rehabilitation.
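
    The item-level statistic named in this abstract, Cohen's weighted kappa, can be sketched as follows. This is a minimal illustration, not the authors' analysis: the abstract does not say whether linear or quadratic weights were used, so both are offered, and the function name `weighted_kappa` and its parameters are hypothetical.

```python
def weighted_kappa(rater_a, rater_b, categories, weights="linear"):
    """Cohen's weighted kappa for two raters scoring the same subjects.

    `categories` lists the ordinal scale in order, lowest to highest.
    `weights` is "linear" or "quadratic" (an assumption; the abstract
    does not state which scheme the study used).
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must score the same non-empty set of subjects")
    k = len(categories)
    if k < 2:
        raise ValueError("at least two categories are required")
    index = {c: i for i, c in enumerate(categories)}
    n = len(rater_a)

    # Observed joint proportions of (rater A category, rater B category).
    observed = [[0.0] * k for _ in range(k)]
    for a, b in zip(rater_a, rater_b):
        observed[index[a]][index[b]] += 1.0 / n

    # Marginal proportions for each rater.
    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    def disagreement(i, j):
        d = abs(i - j) / (k - 1)
        return d if weights == "linear" else d * d

    # Weighted observed disagreement vs. weighted chance disagreement.
    num = sum(disagreement(i, j) * observed[i][j] for i in range(k) for j in range(k))
    den = sum(disagreement(i, j) * row[i] * col[j] for i in range(k) for j in range(k))
    if den == 0:
        raise ValueError("kappa is undefined when chance disagreement is zero")
    return 1.0 - num / den
```

    With only two categories the weighted and unweighted statistics coincide, which gives a quick sanity check for the sketch.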

  10. Indeterminate lesions on planar bone scintigraphy in lung cancer patients: SPECT, CT or SPECT-CT?

    International Nuclear Information System (INIS)

    Sharma, Punit; Kumar, Rakesh; Singh, Harmandeep; Bal, Chandrasekhar; Malhotra, Arun; Julka, Pramod Kumar; Thulkar, Sanjay

    2012-01-01

    The objective of the present study was to compare the role of single photon emission computed tomography (SPECT), computed tomography (CT) and SPECT-CT of selected volume in lung cancer patients with indeterminate lesions on planar bone scintigraphy (BS). The data of 50 lung cancer patients (53 ± 10.3 years; range 30-75; male/female 38/12) with 65 indeterminate lesions on planar BS (January 2010 to November 2010) were retrospectively evaluated. All of them underwent SPECT-CT of a selected volume. SPECT, CT and SPECT-CT images were independently evaluated by two experienced readers (experience in musculoskeletal imaging, including CT: 5 and 7 years) in separate sessions. A scoring scale of 1 to 5 was used, in which 1 is definitely metastatic, 2 is probably metastatic, 3 is indeterminate, 4 is probably benign and 5 is definitely benign. Sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) were calculated for each modality, taking a score ≤2 as metastatic. With receiver operating characteristic (ROC) curve analysis, areas under the curve (AUC) were calculated for each modality and compared. Clinical and imaging follow-up and/or histopathology were taken as reference standard. For both readers SPECT was inferior to CT (P = 0.004, P = 0.022) and SPECT-CT (P = 0.003, P = 0.037). However, no significant difference was found between CT and SPECT-CT for reader 1 (P = 0.847) and reader 2 (P = 0.592). The findings were similar for lytic as well as sclerotic lesions. Moderate inter-observer agreement was seen for SPECT images (κ = 0.426), while almost perfect agreement was seen for CT (κ = 0.834) and SPECT-CT (κ = 0.971). CT alone and SPECT-CT are better than SPECT for accurate characterisation of indeterminate lesions on planar BS in lung cancer patients. CT alone is not inferior to SPECT-CT for this purpose and might be preferred because of shorter acquisition time and wider availability. (orig.)

  11. Hock lesions and free-stall design.

    Science.gov (United States)

    Weary, D M; Taszkun, I

    2000-04-01

    We compared the prevalence and severity of skin lesions on the hocks of lactating dairy cows in southern British Columbia across 20 farms using three common bedding surfaces: sawdust, sand, and geotextile mattresses. Skin lesions were scored at five positions on the hock. For each position we noted if the lesion showed inflammatory attributes, and then assigned a severity score. Of the 1752 lactating cows scored, 1267 cows (73%) had at least one hock lesion. Of those cows with lesions, 87% had lesions on both legs, 76% had lesions on more than one location on the hock, and 78% had a lesion of at least moderate severity (i.e., evidence of skin breakage or an area of hair loss >10 cm²). Lesions were most prevalent on farms that used geotextile mattresses (91% of cows) and least common on farms that used sand (24% of cows). Moreover, lesions on cows from farms using mattresses were more numerous and more severe than those on cows from sand-bedded farms. The prevalence and severity of lesions on farms using sawdust was intermediate. Lesions also varied in relation to location on the hock. For farms using geotextile mattresses, lesions were more common and more severe on the lateral surfaces of both the tuber calcis and the tarsal joint. On farms using sawdust, lesions were common on the dorsal surface of the tuber calcis and the lateral surfaces of both the tuber calcis and the tarsal joint. Lesions were rare on all five positions for cows from sand-bedded farms. Among the 10 farms sampled using sawdust, we found a significant negative relationship between the length of the stall and severity of lesions. For cows with lesions, the number and severity of lesions increased with age.

  12. Diagnosis of Esophageal Motility Disorders: Esophageal Pressure Topography vs. Conventional Line Tracing.

    Science.gov (United States)

    Carlson, Dustin A; Ravi, Karthik; Kahrilas, Peter J; Gyawali, C Prakash; Bredenoord, Arjan J; Castell, Donald O; Spechler, Stuart J; Halland, Magnus; Kanuri, Navya; Katzka, David A; Leggett, Cadman L; Roman, Sabine; Saenz, Jose B; Sayuk, Gregory S; Wong, Alan C; Yadlapati, Rena; Ciolino, Jody D; Fox, Mark R; Pandolfino, John E

    2015-07-01

    Enhanced characterization of esophageal peristaltic and sphincter function provided by esophageal pressure topography (EPT) offers a potential diagnostic advantage over conventional line tracings (CLT). However, high-resolution manometry (HRM) and EPT require increased equipment costs over conventional systems and evidence demonstrating a significant diagnostic advantage of EPT over CLT is limited. Our aim was to investigate whether the inter-rater agreement and/or accuracy of esophageal motility diagnosis differed between EPT and CLT. Forty previously completed patient HRM studies were selected for analysis using a customized software program developed to perform blinded independent interpretation in either EPT or CLT (six pressure sensors) format. Six experienced gastroenterologists with a clinical focus in esophageal disease (attendings) and six gastroenterology trainees with minimal manometry experience (fellows) from three academic centers interpreted each of the 40 studies using both EPT and CLT formats. Rater diagnoses were assessed for inter-rater agreement and diagnostic accuracy, both for exact diagnosis and for correct identification of a major esophageal motility disorder. The total group agreement was moderate (κ=0.57; 95% CI: 0.56-0.59) for EPT and fair (κ=0.32; 0.30-0.33) for CLT. Inter-rater agreement between attendings was good (κ=0.68; 0.65-0.71) for EPT and moderate (κ=0.46; 0.43-0.50) for CLT. Inter-rater agreement between fellows was moderate (κ=0.48; 0.45-0.50) for EPT and poor to fair (κ=0.20; 0.17-0.24) for CLT. Among all raters, the odds of an incorrect exact esophageal motility diagnosis were 3.3 times higher with CLT assessment than with EPT (OR: 3.3; 95% CI: 2.4-4.5; PCLT than with EPT (OR: 3.4; 2.4-5.0; PCLT among our selected raters. On the basis of these findings, EPT may be the preferred assessment modality of esophageal motility.

  13. Agreement and Anti-Agreement in Berber: A Multiple-Feature Inheritance Account

    Directory of Open Access Journals (Sweden)

    Saleem Abdelhady

    2017-10-01

    This study proposes understanding Agreement and Anti-Agreement in Berber from a Multi-Feature Inheritance (MFI) account, following Branigan’s (2016) recent approach. The study aims to address a misunderstanding in the literature, where it has been assumed that Feature Inheritance (FI) can only account for Agreement relations while Anti-agreement relations can only be established by a direct Agree between C as a probe and its goal. Under MFI, it can be argued that FI can account for all aspects of Agreement/Anti-agreement. The study, therefore, presents a unification of the three mechanisms proposed by Ouali (2008). The new approach is tested on data from Berber.

  14. 99mTc-3PRGD2 Scintimammography in Palpable and Nonpalpable Breast Lesions

    Directory of Open Access Journals (Sweden)

    Lin Liu

    2014-07-01

    The aim of this study was to explore the diagnostic performance of 99mTc-3(poly-(ethylene glycol),PEG4)-RGD2 (99mTc-3PRGD2) scintimammography (SMM) in patients with either palpable or nonpalpable breast lesions and compare SMM to mammography to assess the possible incremental value of SMM in breast cancer detection. We also investigated the αvβ3 expression in malignant and benign breast lesions. Ninety-four patients with 110 lesions were included in this study. Mammograms were evaluated according to the Breast Imaging Reporting and Data System (BI-RADS) by a specialized imaging radiologist. Prone SMM was performed 1 hour after injection of 99mTc-3PRGD2. Scintigraphic images were interpreted independently by two experienced nuclear medicine physicians using a three-point system, and the kappa value was calculated to determine the interreader agreement. The McNemar test was used to compare SMM and mammography with respect to sensitivity, specificity, and accuracy. Diagnostic values for breast cancer detection were evaluated for each lesion. Immunohistochemistry was performed to evaluate integrin αvβ3 expression. Histopathology revealed 46 malignant lesions and 64 benign lesions. The overall sensitivity, specificity, accuracy, positive predictive value, and negative predictive value of SMM were 83%, 73%, 77%, 69%, and 85%, respectively. The kappa value between the two reviewers was 0.63. The diagnostic values of SMM were higher than those of mammography in evaluating overall breast lesions. A sensitivity of 91% was achieved when SMM and mammography results were combined, with 60% of all false-negative mammography findings classified as true-positive results by SMM. Integrin αvβ3 expression was positively identified using SMM imaging. SMM is a promising tool to avoid unnecessary biopsies when used in addition to mammography and can be used to image αvβ3 expression in breast cancer with good image quality.
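
    The five diagnostic values quoted in this abstract all derive from one 2x2 table, as the sketch below illustrates. The abstract reports only the lesion totals (46 malignant, 64 benign) and the percentages; the individual cell counts used here are an assumption, back-calculated so that they reproduce those percentages, and the function name `diagnostic_metrics` is hypothetical.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard diagnostic accuracy measures from a 2x2 confusion table."""
    total = tp + fp + fn + tn
    return {
        "sensitivity": tp / (tp + fn),  # true positives among diseased
        "specificity": tn / (tn + fp),  # true negatives among non-diseased
        "accuracy": (tp + tn) / total,
        "ppv": tp / (tp + fp),          # positive predictive value
        "npv": tn / (tn + fn),          # negative predictive value
    }


# Assumed cell counts (not reported in the abstract): 38 + 8 = 46 malignant
# lesions and 47 + 17 = 64 benign lesions, chosen to match the stated totals.
metrics = diagnostic_metrics(tp=38, fp=17, fn=8, tn=47)
for name, value in metrics.items():
    print(f"{name}: {value:.1%}")
```

    Rounded to whole percentages, these assumed counts give the 83%, 73%, 77%, 69% and 85% stated above, but other counts could round to the same figures.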

  15. Theoretical Analysis of Penalized Maximum-Likelihood Patlak Parametric Image Reconstruction in Dynamic PET for Lesion Detection.

    Science.gov (United States)

    Yang, Li; Wang, Guobao; Qi, Jinyi

    2016-04-01

    Detecting cancerous lesions is a major clinical application of emission tomography. In a previous work, we studied penalized maximum-likelihood (PML) image reconstruction for lesion detection in static PET. Here we extend our theoretical analysis of static PET reconstruction to dynamic PET. We study both the conventional indirect reconstruction and direct reconstruction for Patlak parametric image estimation. In indirect reconstruction, Patlak parametric images are generated by first reconstructing a sequence of dynamic PET images, and then performing Patlak analysis on the time activity curves (TACs) pixel-by-pixel. In direct reconstruction, Patlak parametric images are estimated directly from raw sinogram data by incorporating the Patlak model into the image reconstruction procedure. PML reconstruction is used in both the indirect and direct reconstruction methods. We use a channelized Hotelling observer (CHO) to assess lesion detectability in Patlak parametric images. Simplified expressions for evaluating the lesion detectability have been derived and applied to the selection of the regularization parameter value to maximize detection performance. The proposed method is validated using computer-based Monte Carlo simulations. Good agreements between the theoretical predictions and the Monte Carlo results are observed. Both theoretical predictions and Monte Carlo simulation results show the benefit of the indirect and direct methods under optimized regularization parameters in dynamic PET reconstruction for lesion detection, when compared with the conventional static PET reconstruction.

  16. An Assessment of Reliability and Validity of a Rubric for Grading APA-Style Introductions

    Science.gov (United States)

    Stellmack, Mark A.; Konheim-Kalkstein, Yasmine L.; Manor, Julia E.; Massey, Abigail R.; Schmitz, Julie Ann P.

    2009-01-01

    This article describes the empirical evaluation of the reliability and validity of a grading rubric for grading APA-style introductions of undergraduate students. Levels of interrater agreement and intrarater agreement were not extremely high but were similar to values reported in the literature for comparably structured rubrics. Rank-order…

  17. The Mayo-Portland Participation Index: A brief and psychometrically sound measure of brain injury outcome.

    Science.gov (United States)

    Malec, James F

    2004-12-01

    To evaluate the internal consistency, interrater agreement, concurrent validity, and floor and ceiling effects of the 8-item Participation Index (M2PI) of the Mayo-Portland Adaptability Inventory (MPAI). M2PI data derived from MPAIs completed independently by the people with acquired brain injury undergoing evaluation, their significant others, and rehabilitation staff were submitted to Rasch Facets analysis to determine the internal consistency of each independent rater group and of composite measures that combined rater groups. Correlations with the full-scale MPAI were examined to assess concurrent validity, as was interrater agreement. Outpatient rehabilitation in academic physical medicine and rehabilitation department. People with acquired brain injury (N=134) consecutively seen for evaluation, significant others, and evaluating staff. Not applicable. The MPAI and M2PI. The M2PI showed satisfactory internal consistency, concurrent validity, interrater agreement, and minimal floor and ceiling effects, although evidence of rater bias was also apparent. Composite indices showed more desirable psychometric properties than ratings by individual rater groups. The M2PI, particularly in composite indices and with attention to rater biases, provides an outcome measure with satisfactory psychometric qualities and the potential to represent the varying perspectives of people with acquired brain injury, significant others, and rehabilitation staff.

  18. Fast lesion mapping during HIFU treatment using harmonic motion imaging guided focused ultrasound (HMIgFUS) in vitro and in vivo

    Science.gov (United States)

    Han, Yang; Wang, Shutao; Payen, Thomas; Konofagou, Elisa

    2017-04-01

    The successful clinical application of high intensity focused ultrasound (HIFU) ablation depends on reliable monitoring of the lesion formation. Harmonic motion imaging guided focused ultrasound (HMIgFUS) is an ultrasound-based elasticity imaging technique, which monitors HIFU ablation based on the stiffness change of the tissue instead of the echo intensity change in conventional B-mode monitoring, rendering it potentially more sensitive to lesion development. Our group has shown that predicting the lesion location based on the radiation force-excited region is feasible during HMIgFUS. In this study, the feasibility of a fast lesion mapping method is explored to directly monitor the lesion map during HIFU. The harmonic motion imaging (HMI) lesion map was generated by subtracting the reference HMI image from the present HMI peak-to-peak displacement map, as streamed on the computer display. The dimensions of the HMIgFUS lesions were compared against gross pathology. Excellent agreement was found for lesion depth (r² = 0.81, slope = 0.90), width (r² = 0.85, slope = 1.12) and area (r² = 0.58, slope = 0.75). In vivo feasibility was assessed in a mouse with a pancreatic tumor. These findings demonstrate that HMIgFUS can successfully map thermal lesions and monitor lesion development in real time in vitro and in vivo. The HMIgFUS technique may therefore constitute a novel clinical tool for HIFU treatment monitoring.
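
    The r² and slope pairs reported here are the usual outputs of a simple least-squares line fitted to paired measurements. A minimal sketch follows, assuming the gross pathology measurement is the independent variable and the HMI measurement the dependent one (the abstract does not state the orientation); the function name `slope_and_r2` is hypothetical.

```python
def slope_and_r2(x, y):
    """Least-squares slope and coefficient of determination for paired data."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need at least two paired measurements")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    if sxx == 0 or syy == 0:
        raise ValueError("both variables must vary")
    slope = sxy / sxx
    r2 = (sxy * sxy) / (sxx * syy)  # squared Pearson correlation
    return slope, r2
```

    A slope near 1 with a high r² indicates that the imaging measurement tracks the reference closely; a slope below 1, as reported for depth and area, suggests the imaging measurement grows more slowly than the reference under this assumed orientation.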

  19. A simple classification system (the Tree flowchart) for breast MRI can reduce the number of unnecessary biopsies in MRI-only lesions

    Energy Technology Data Exchange (ETDEWEB)

    Woitek, Ramona; Spick, Claudio; Schernthaner, Melanie; Kapetas, Panagiotis; Bernathova, Maria; Furtner, Julia; Pinker, Katja; Helbich, Thomas H.; Baltzer, Pascal A.T. [Medical University of Vienna, Department of Biomedical Imaging and Image-Guided Therapy, Vienna (Austria); Rudas, Margaretha [Medical University of Vienna, Clinical Institute of Pathology, Vienna (Austria)

    2017-09-15

    To assess whether using the Tree flowchart obviates unnecessary magnetic resonance imaging (MRI)-guided biopsies in breast lesions only visible on MRI. This retrospective IRB-approved study evaluated consecutive suspicious (BI-RADS 4) breast lesions only visible on MRI that were referred to our institution for MRI-guided biopsy. All lesions were evaluated according to the Tree flowchart for breast MRI by experienced readers. The Tree flowchart is a decision rule that assigns levels of suspicion to specific combinations of diagnostic criteria. Receiver operating characteristic (ROC) curve analysis was used to evaluate diagnostic accuracy. To assess reproducibility by kappa statistics, a second reader rated a subset of 82 patients. There were 454 patients with 469 histopathologically verified lesions included (98 malignant, 371 benign lesions). The area under the curve (AUC) of the Tree flowchart was 0.873 (95% CI: 0.839-0.901). The inter-reader agreement was almost perfect (kappa: 0.944; 95% CI 0.889-0.998). ROC analysis revealed exclusively benign lesions if the Tree node was ≤2, potentially avoiding unnecessary biopsies in 103 cases (27.8%). Using the Tree flowchart in breast lesions only visible on MRI, more than 25% of biopsies could be avoided without missing any breast cancer. (orig.)

  20. Oropharynx lesion biopsy

    Science.gov (United States)

    ... as papilloma) Fungal infections (such as candida) Histoplasmosis Oral lichen planus Precancerous sore (leukoplakia) Viral infections (such as Herpes simplex) Risks Risks of the procedure may ... Throat lesion biopsy; Biopsy - mouth or throat; Mouth lesion biopsy; Oral cancer - biopsy ...

  1. Inter-algorithm lesion volumetry comparison of real and 3D simulated lung lesions in CT

    Science.gov (United States)

    Robins, Marthony; Solomon, Justin; Hoye, Jocelyn; Smith, Taylor; Ebner, Lukas; Samei, Ehsan

    2017-03-01

    The purpose of this study was to establish volumetric exchangeability between real and computational lung lesions in CT. We compared the overall relative volume estimation performance of segmentation tools when used to measure real lesions in actual patient CT images and computational lesions virtually inserted into the same patient images (i.e., hybrid datasets). Pathologically confirmed malignancies from 30 thoracic patient cases from Reference Image Database to Evaluate Therapy Response (RIDER) were modeled and used as the basis for the comparison. Lesions included isolated nodules as well as those attached to the pleura or other lung structures. Patient images were acquired using a 16 detector row or 64 detector row CT scanner (Lightspeed 16 or VCT; GE Healthcare). Scans were acquired using standard chest protocols during a single breath-hold. Virtual 3D lesion models based on real lesions were developed in Duke Lesion Tool (Duke University), and inserted using a validated image-domain insertion program. Nodule volumes were estimated using multiple commercial segmentation tools (iNtuition, TeraRecon, Inc., Syngo.via, Siemens Healthcare, and IntelliSpace, Philips Healthcare). Consensus based volume comparison showed consistent trends in volume measurement between real and virtual lesions across all software. The average percent bias (+/- standard error) shows -9.2+/-3.2% for real lesions versus -6.7+/-1.2% for virtual lesions with tool A, 3.9+/-2.5% and 5.0+/-0.9% for tool B, and 5.3+/-2.3% and 1.8+/-0.8% for tool C, respectively. Virtual lesion volumes were statistically similar to those of real lesions (.05 in most cases. Results suggest that hybrid datasets had similar inter-algorithm variability compared to real datasets.

  2. Interobserver agreement of semi-automated and manual measurements of functional MRI metrics of treatment response in hepatocellular carcinoma

    International Nuclear Information System (INIS)

    Bonekamp, David; Bonekamp, Susanne; Halappa, Vivek Gowdra; Geschwind, Jean-Francois H.; Eng, John; Corona-Villalobos, Celia Pamela; Pawlik, Timothy M.; Kamel, Ihab R.

    2014-01-01

    Purpose: To assess the interobserver agreement in 50 patients with hepatocellular carcinoma (HCC) before and 1 month after intra-arterial therapy (IAT) using two semi-automated methods and a manual approach for the following functional, volumetric and morphologic parameters: (1) apparent diffusion coefficient (ADC), (2) arterial phase enhancement (AE), (3) portal venous phase enhancement (VE), (4) tumor volume, and assessment according to (5) the Response Evaluation Criteria in Solid Tumors (RECIST), and (6) the European Association for the Study of the Liver (EASL). Materials and methods: This HIPAA-compliant retrospective study had institutional review board approval. The requirement for patient informed consent was waived. Tumor ADC, AE, VE, volume, RECIST, and EASL in 50 index lesions were measured by three observers. Interobserver reproducibility was evaluated using intraclass correlation coefficients (ICC). P < 0.05 was considered to indicate a significant difference. Results: Semi-automated volumetric measurements of functional parameters (ADC, AE, and VE) before and after IAT as well as change in tumor ADC, AE, or VE had better interobserver agreement (ICC = 0.830–0.974) compared with manual ROI-based axial measurements (ICC = 0.157–0.799). Semi-automated measurements of tumor volume and size in the axial plane before and after IAT had better interobserver agreement (ICC = 0.854–0.996) compared with manual size measurements (ICC = 0.543–0.596), and interobserver agreement for change in tumor RECIST size was also higher using semi-automated measurements (ICC = 0.655) compared with manual measurements (ICC = 0.169). EASL measurements of tumor enhancement in the axial plane before and after IAT (ICC = 0.758–0.809) and changes in EASL after IAT (ICC = 0.653) had good interobserver agreement. Conclusion: Semi-automated measurements of functional changes assessed by ADC and VE based on whole-lesion segmentation demonstrated better reproducibility than
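
    An intraclass correlation coefficient of the kind used in this record can be sketched from a subjects-by-observers table. The abstract does not say which ICC form was computed; the version below is the two-way random-effects, absolute-agreement, single-measurement form, often written ICC(2,1), and is chosen purely as an assumption. The function name `icc_2_1` is hypothetical.

```python
def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    `ratings` is a list of rows, one per subject (e.g. lesion), each row
    holding one measurement per observer.
    """
    n = len(ratings)     # number of subjects
    k = len(ratings[0])  # number of observers
    if n < 2 or k < 2 or any(len(r) != k for r in ratings):
        raise ValueError("need a complete table with >= 2 subjects and >= 2 observers")

    grand = sum(sum(r) for r in ratings) / (n * k)
    row_means = [sum(r) / k for r in ratings]
    col_means = [sum(r[j] for r in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares without replication.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((v - grand) ** 2 for r in ratings for v in r)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)             # between-subject mean square
    msc = ss_cols / (k - 1)             # between-observer mean square
    mse = ss_err / ((n - 1) * (k - 1))  # residual mean square

    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)
```

    Because this form measures absolute agreement, a constant offset between observers lowers the coefficient even when their measurements are perfectly correlated.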

  3. MR-imaging of anterior tibiotalar impingement syndrome: Agreement, sensitivity and specificity of MR-imaging and indirect MR-arthrography

    Energy Technology Data Exchange (ETDEWEB)

    Haller, Joerg [Department of Radiology, Hanusch Hospital, Heinrich Collin-Strasse 30, A-1140 Vienna (Austria); Ludwig Boltzmann Institute for Osteology, Hanusch Hospital, Heinrich Collin-Strasse 30, A-1140 Vienna (Austria); Bernt, Reinhard [Department of Radiology, Hanusch Hospital, Heinrich Collin-Strasse 30, A-1140 Vienna (Austria)]. E-mail: reinhard.bernt@wgkk.sozvers.at; Seeger, Thomas [Department of Trauma Surgery, Hanusch Hospital, Heinrich Collin-Strasse 30, A-1140 Vienna (Austria); Weissenbaeck, Alexander [Department of Trauma Surgery, Hanusch Hospital, Heinrich Collin-Strasse 30, A-1140 Vienna (Austria); Tuechler, Heinrich [Ludwig Boltzmann Institute for Hematology, Hanusch Hospital, Heinrich Collin-Strasse 30, A-1140 Vienna (Austria); Resnick, Donald [Department of Radiology, VA Medical Center, UCSD, 3350 La Jolla Village Dr, San Diego, CA 92161 (United States)

    2006-06-15

    Objective: To clarify the role of MR-imaging in the diagnosis of anterior ankle impingement syndromes. Materials and methods: We prospectively examined 51 consecutive patients with chronic ankle pain by MR-imaging. Arthroscopy was performed in 29 patients who previously underwent non-enhanced MR-imaging; in 11 patients, indirect MR-arthrography additionally was performed. MR-examinations were correlated with clinical findings; MR and arthroscopy scores were statistically compared, agreement was measured. Results: Arthroscopy demonstrated granulation tissue in the lateral gutter (38%) and anterior recess (31%), lesions of the anterior tibiofibular (31%) and the anterior talofibular ligament (21%) as well as intraarticular bodies (10%). Stenosing tenosynovitis and a ganglionic cyst were revealed as extraarticular causes for chronic ankle pain by MR-examination (17%). Agreement of MR-imaging and arthroscopy was fair for the anterior talofibular ligament and the anterior joint cavity (kappa 0.40). Major discrepancy was found for non-enhanced MR scans (kappa 0.49) when compared with indirect MR-arthrography (kappa 0.03) in the anterior cavity. The sensitivity for lesions of the anterior talofibular and calcaneofibular ligament and the anterior cavity (0.91-0.87) detected by MR-imaging was superior in comparison to lesions of the anterior tibiofibular ligament and anteromedial cavity (0.50-0.24). Conclusion: MR-imaging provides additional information about the mechanics of chronic ankle impingement rather than an accurate diagnosis of this clinical entity. The method is helpful in differentiating extra- from intra-articular causes of ankle impingement. Indirect MR-arthrography has little or no additional value in patients with ankle impingement syndrome.

  4. MR-imaging of anterior tibiotalar impingement syndrome: Agreement, sensitivity and specificity of MR-imaging and indirect MR-arthrography

    International Nuclear Information System (INIS)

    Haller, Joerg; Bernt, Reinhard; Seeger, Thomas; Weissenbaeck, Alexander; Tuechler, Heinrich; Resnick, Donald

    2006-01-01

    Objective: To clarify the role of MR-imaging in the diagnosis of anterior ankle impingement syndromes. Materials and methods: We prospectively examined 51 consecutive patients with chronic ankle pain by MR-imaging. Arthroscopy was performed in 29 patients who previously underwent non-enhanced MR-imaging; in 11 patients, indirect MR-arthrography additionally was performed. MR-examinations were correlated with clinical findings; MR and arthroscopy scores were statistically compared, agreement was measured. Results: Arthroscopy demonstrated granulation tissue in the lateral gutter (38%) and anterior recess (31%), lesions of the anterior tibiofibular (31%) and the anterior talofibular ligament (21%) as well as intraarticular bodies (10%). Stenosing tenosynovitis and a ganglionic cyst were revealed as extraarticular causes for chronic ankle pain by MR-examination (17%). Agreement of MR-imaging and arthroscopy was fair for the anterior talofibular ligament and the anterior joint cavity (kappa 0.40). Major discrepancy was found for non-enhanced MR scans (kappa 0.49) when compared with indirect MR-arthrography (kappa 0.03) in the anterior cavity. The sensitivity for lesions of the anterior talofibular and calcaneofibular ligament and the anterior cavity (0.91-0.87) detected by MR-imaging was superior in comparison to lesions of the anterior tibiofibular ligament and anteromedial cavity (0.50-0.24). Conclusion: MR-imaging provides additional information about the mechanics of chronic ankle impingement rather than an accurate diagnosis of this clinical entity. The method is helpful in differentiating extra- from intra-articular causes of ankle impingement. Indirect MR-arthrography has little or no additional value in patients with ankle impingement syndrome

  5. Exploration of the (Interrater) Reliability and Latent Factor Structure of the Alcohol Use Disorders Identification Test (AUDIT) and the Drug Use Disorders Identification Test (DUDIT) in a Sample of Dutch Probationers.

    Science.gov (United States)

    Hildebrand, Martin; Noteborn, Mirthe G C

    2015-01-01

    The use of brief, reliable, valid, and practical measures of substance use is critical for conducting individual (risk and need) assessments in probation practice. In this exploratory study, the basic psychometric properties of the Alcohol Use Disorders Identification Test (AUDIT) and the Drug Use Disorders Identification Test (DUDIT) are evaluated. The instruments were administered as an oral interview instead of a self-report questionnaire. The sample comprised 383 offenders (339 men, 44 women). A subset of 56 offenders (49 men, 7 women) participated in the interrater reliability study. Data collection took place between September 2011 and November 2012. Overall, both instruments have acceptable levels of interrater reliability for total scores and acceptable to good interrater reliabilities for most of the individual items. Confirmatory factor analyses (CFA) indicated that the a priori one-, two- and three-factor solutions for the AUDIT did not fit the observed data very well. Principal axis factoring (PAF) supported a two-factor solution for the AUDIT that included a level of alcohol consumption/consequences factor (Factor 1) and a dependence factor (Factor 2), with both factors explaining substantial variance in AUDIT scores. For the DUDIT, CFA and PAF suggest that a one-factor solution is the preferred model (accounting for 62.61% of total variance). The Dutch language versions of the AUDIT and the DUDIT are reliable screening instruments for use with probationers and both instruments can be reliably administered by probation officers in probation practice. However, future research on concurrent and predictive validity is warranted.

  6. The MacArthur Competence Assessment Tool-Criminal Adjudication: Factor structure, interrater reliability, and association with clinician opinion of competence in a forensic inpatient sample.

    Science.gov (United States)

    Wood, Mary E; Anderson, Jaime L; Glassmire, David M

    2017-06-01

    Adjudicative competence is the most frequently referred evaluation in the forensic context, and it is because of this that periodic evaluation of competence assessment instruments is imperative. Among those instruments, the MacArthur Competence Assessment Tool-Criminal Adjudication (MacCAT-CA) has demonstrated adequate psychometric properties suggesting its utility in informing the forensic inquiry. The purpose of the current study was to further investigate the psychometric properties and ultimate utility of subscale scores using archival data from a sample of 103 male and female forensic patients who were hospitalized for competence restoration treatment. Results of the present study suggested adequate internal consistency and good model fit for the factor structure. Interrater reliability was evaluated by comparing the absolute agreement of scores derived from 2 independent research assistants for each of the subscales; 2 of the 3 subscales fell within the acceptable range given established interpretative benchmarks for forensic assessment. Of particular interest was that the Appreciation subscale, while heralding the lowest intraclass correlation coefficient, explained the largest proportion of variance in clinician opinion relative to the other 2 subscales. In other words, the most subjective subscale (as evidenced by the lowest intraclass correlation), explained the largest proportion of variance in ultimate opinion. The authors argue that, although these results are an important consideration in these assessments, they are neither surprising nor entirely problematic when considering the case-specific nature of the inquiries on the subscale, as well as the subjectivity of scoring criteria for each of the Appreciation items. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  7. The Reliability of a Novel Mobile 3-dimensional Wound Measurement Device.

    Science.gov (United States)

    Anghel, Ersilia L; Kumar, Anagha; Bigham, Thomas E; Maselli, Kathryn M; Steinberg, John S; Evans, Karen K; Kim, Paul J; Attinger, Christopher E

    2016-11-01

    Objective assessment of wound dimensions is essential for tracking progression and determining treatment effectiveness. A reliability study was designed to establish intrarater and interrater reliability of a novel mobile 3-dimensional wound measurement (3DWM) device. Forty-five wounds were assessed by 2 raters using a 3DWM device to obtain length, width, area, depth, and volume measurements. Wounds were also measured manually, using a disposable ruler and digital planimetry. The intraclass correlation coefficient (ICC) was used to establish intrarater and interrater reliability. High levels of intrarater and interrater agreement were observed for area, length, and width; ICC = 0.998, 0.977, 0.955 and 0.999, 0.997, 0.995, respectively. Moderate levels of intrarater (ICC = 0.888) and interrater (ICC = 0.696) agreement were observed for volume. Lastly, depth yielded an intrarater ICC of 0.360 and an interrater ICC of 0.649. Measures from the 3DWM device were highly correlated with those obtained from scaled photography for length, width, and area (ρ = 0.997, 0.988, 0.997, P [...] device yielded correlations of ρ = 0.990, 0.987, 0.996 with P [...] device was found to be highly reliable for measuring wound areas for a range of wound sizes and types as compared to manual measurement and digital planimetry. The depth and therefore volume measurement using the 3DWM device was found to have a lower ICC, but volume ICC alone was moderate. Overall, this device offers a mobile option for objective wound measurement in the clinical setting.

  8. Reliability and validity of a nutrition and physical activity environmental self-assessment for child care

    Directory of Open Access Journals (Sweden)

    Ammerman Alice S

    2007-07-01

    Background: Few assessment instruments have examined the nutrition and physical activity environments in child care, and none are self-administered. Given the emerging focus on child care settings as a target for intervention, a valid and reliable measure of the nutrition and physical activity environment is needed. Methods: To measure inter-rater reliability, 59 child care center directors and 109 staff completed the self-assessment concurrently, but independently. Three weeks later, a repeat self-assessment was completed by a sub-sample of 38 directors to assess test-retest reliability. To assess criterion validity, a researcher-administered environmental assessment was conducted at 69 centers and was compared to a self-assessment completed by the director. A weighted kappa test statistic and percent agreement were calculated to assess agreement for each question on the self-assessment. Results: For inter-rater reliability, kappa statistics ranged from 0.20 to 1.00 across all questions. Test-retest reliability of the self-assessment yielded kappa statistics that ranged from 0.07 to 1.00. The inter-quartile kappa statistic ranges for inter-rater and test-retest reliability were 0.45 to 0.63 and 0.27 to 0.45, respectively. When percent agreement was calculated, questions ranged from 52.6% to 100% for inter-rater reliability and 34.3% to 100% for test-retest reliability. Kappa statistics for validity ranged from -0.01 to 0.79, with an inter-quartile range of 0.08 to 0.34. Percent agreement for validity ranged from 12.9% to 93.7%. Conclusion: This study provides estimates of criterion validity, inter-rater reliability and test-retest reliability for an environmental nutrition and physical activity self-assessment instrument for child care. Results indicate that the self-assessment is a stable and reasonably accurate instrument for use with child care interventions. We therefore recommend the Nutrition and Physical Activity Self-Assessment for

  9. Assessment of simulated mandibular condyle bone lesions by cone beam computed tomography

    Energy Technology Data Exchange (ETDEWEB)

    Marques, Alexandre Perez; Perrella, Andreia; Arita, Emiko Saito; Pereira, Marlene Fenyo Soeiro de Matos; Cavalcanti, Marcelo de Gusmao Paraiso [Universidade de Sao Paulo (USP), SP (Brazil). Faculdade de Odontologia. Dept. de Estomatologia]

    2010-10-15

    There are many limitations to image acquisition, using conventional radiography, of the temporomandibular joint (TMJ) region. The Computed Tomography (CT) scan is a better option, due to its higher accuracy, for purposes of diagnosis, surgical planning and treatment of bone injuries. The aim of the present study was to analyze two protocols of cone beam computed tomography for the evaluation of simulated mandibular condyle bone lesions. Spherical lesions were simulated in 30 dry mandibular condyles, using dentist drills and drill bits sizes 1, 3 and 6. Each of the mandibular condyles was submitted to cone beam computed tomography (CBCT) using two protocols: axial, coronal and sagittal multiplanar reconstruction (MPR); and sagittal plus coronal slices throughout the longitudinal axis of the mandibular condyles. For these protocols, 2 observers independently analyzed the CBCT images for the presence or absence of lesions. Only one of the observers, however, performed the analysis on 2 different occasions. The results were compared to the gold standard, evaluating the percentage of agreement, degree of accuracy of CBCT protocols and observers' examination. The z test was used for the statistical analysis. The results showed there were no statistically significant differences between the 2 protocols. There was greater difficulty in the assessment of small-size simulated lesions (drill no. 1). From the results of this study, it can be concluded that CBCT is an accurate tool for analyzing mandibular condyle bone lesions, with the MPR protocol showing slightly better results than the sagittal plus coronal slices throughout the longitudinal axis. (author)

  10. Lesion activity assessment

    DEFF Research Database (Denmark)

    Ekstrand, K R; Zero, D T; Martignon, S

    2009-01-01

    in response to cariogenic plaque as well as lesion arrest. Based on this understanding, different clinical scoring systems have been developed to assess the severity/depth and activity of lesions. A recent system has been devised by the International Caries Detection and Assessment System Committee...

  11. 12 CFR 533.5 - Related agreements considered a single agreement.

    Science.gov (United States)

    2010-01-01

    ... DISCLOSURE AND REPORTING OF CRA-RELATED AGREEMENTS § 533.5 Related agreements considered a single agreement... entered into within the same 12-month period; and (3) Are each in fulfillment of the CRA. (b... in fulfillment of the CRA, if the contracts were negotiated in a coordinated fashion and a NGEP is a...

  12. 12 CFR 35.5 - Related agreements considered a single agreement.

    Science.gov (United States)

    2010-01-01

    ... DISCLOSURE AND REPORTING OF CRA-RELATED AGREEMENTS § 35.5 Related agreements considered a single agreement... into within the same 12-month period; and (3) Are each in fulfillment of the CRA. (b) Substantively... the CRA, if the contracts were negotiated in a coordinated fashion and a NGEP is a party to each...

  13. Reliability of a novel, semi-quantitative scale for classification of structural brain magnetic resonance imaging in children with cerebral palsy.

    Science.gov (United States)

    Fiori, Simona; Cioni, Giovanni; Klingels, Katrjin; Ortibus, Els; Van Gestel, Leen; Rose, Stephen; Boyd, Roslyn N; Feys, Hilde; Guzzetta, Andrea

    2014-09-01

    To describe the development of a novel rating scale for classification of brain structural magnetic resonance imaging (MRI) in children with cerebral palsy (CP) and to assess its interrater and intrarater reliability. The scale consists of three sections. Section 1 contains descriptive information about the patient and MRI. Section 2 contains the graphical template of brain hemispheres onto which the lesion is transposed. Section 3 contains the scoring system for the quantitative analysis of the lesion characteristics, grouped into different global scores and subscores that assess separately side, regions, and depth. A larger interrater and intrarater reliability study was performed in 34 children with CP (22 males, 12 females; mean age at scan of 9 y 5 mo [SD 3 y 3 mo], range 4 y-16 y 11 mo; Gross Motor Function Classification System level I, [n=22], II [n=10], and level III [n=2]). Very high interrater and intrarater reliability of the total score was found with indices above 0.87. Reliability coefficients of the lobar and hemispheric subscores ranged between 0.53 and 0.95. Global scores for hemispheres, basal ganglia, brain stem, and corpus callosum showed reliability coefficients above 0.65. This study presents the first visual, semi-quantitative scale for classification of brain structural MRI in children with CP. The high degree of reliability of the scale supports its potential application for investigating the relationship between brain structure and function and examining treatment response according to brain lesion severity in children with CP. © 2014 Mac Keith Press.

  14. Video training and certification program improves reliability of postischemic neurologic deficit measurement in the rat.

    Science.gov (United States)

    Taninishi, Hideki; Pearlstein, Molly; Sheng, Huaxin; Izutsu, Miwa; Chaparro, Rafael E; Goldstein, Larry B; Warner, David S

    2016-12-01

    Scoring systems are used to measure behavioral deficits in stroke research. Video-assisted training is used to standardize stroke-related neurologic deficit scoring in humans. We hypothesized that a video-assisted training and certification program can improve inter-rater reliability in assessing neurologic function after middle cerebral artery occlusion in rats. Three expert raters scored neurologic deficits in post-middle cerebral artery occlusion rats using three published systems having different complexity levels (3, 18, or 48 points). The system having the highest point estimate for the correlation between neurologic score and infarct size was selected to create a video-assisted training and certification program. Eight trainee raters completed the video-assisted training and certification program. Inter-rater agreement (κ score) and agreement with expert consensus scores were measured before and after video-assisted training and certification program completion. The 48-point system correlated best with infarct size. Video-assisted training and certification improved agreement with expert consensus scores (pretraining = 65 ± 10, posttraining = 87 ± 14, 112 possible scores, P [...] 0.4 (pretraining = 4, posttraining = 9), and number of categories with an improvement in the κ score from pretraining to posttraining (n = 6). Video-assisted training and certification improved trainee inter-rater reliability and agreement with expert consensus behavioral scores in rats after middle cerebral artery occlusion. Video-assisted training and certification may be useful in multilaboratory preclinical studies. © The Author(s) 2015.

  15. MALIGNANCY IN LARGE COLORECTAL LESIONS

    Directory of Open Access Journals (Sweden)

    Carlos Eduardo Oliveira dos SANTOS

    2014-09-01

    Context: The size of colorectal lesions, besides being a risk factor for malignancy, is a predictor of deeper invasion. Objectives: To evaluate the malignancy of colorectal lesions ≥20 mm. Methods: Between 2007 and 2011, 76 neoplasms ≥20 mm in 70 patients were analyzed. Results: The mean age of the patients was 67.4 years, and 41 were women. Mean lesion size was 24.7 mm ± 6.2 mm (range: 20 to 50 mm). Half of the neoplasms were polypoid and the other half were non-polypoid. Forty-two (55.3%) lesions were located in the left colon, and 34 in the right colon. There was a high prevalence of III L (39.5%) and IV (53.9%) pit patterns. There were 72 adenomas and 4 adenocarcinomas. Malignancy was observed in 5.3% of the lesions. Thirty-three lesions presented advanced histology (adenomas with high-grade dysplasia or early adenocarcinoma), with no difference in morphology and site. Only one lesion (1.3%) invaded the submucosa. Lesions larger than 30 mm had advanced histology (P = 0.001). The primary treatment was endoscopic resection, and invasive carcinoma was referred to surgery. Recurrence rate was 10.6%. Conclusions: Large colorectal neoplasms showed a low rate of malignancy. Endoscopic treatment is an effective therapy for these lesions.

  16. CT diagnosis of sellar and juxtasellar lesions, 3. Non-tumorous lesions

    Energy Technology Data Exchange (ETDEWEB)

    Kobayashi, Tatsuya [Nagoya Univ. (Japan). Faculty of Medicine

    1982-08-01

    A study is made of the usefulness and limitations of the CT diagnosis of sellar and juxtasellar lesions other than tumors. This study is based on 112 verified cases at Nagoya University Hospital from October, 1976, to December, 1981. The lesions included in this study are classified into four groups: vascular, inflammatory, traumatic lesion, and congenital anomaly. Although cerebral angiography is the cardinal method for the diagnosis of vascular lesions, CT is useful for the evaluation of a giant aneurysm, the localization of bleeding, or infarction by a ruptured aneurysm. Radiation brain necrosis, a special form of vascular lesion, can also be diagnosed if the critical analysis is made after previous irradiation. CT findings are helpful for the local diagnosis of acute inflammatory lesions, such as basal meningitis or abscess, but specific diagnosis is made on the basis of clinical signs and CSF study. Abnormal CT findings are obtained from a chronic inflammatory process, such as arachnoiditis adhesiva, granuloma, or mucocele. Differential diagnosis is necessary with brain tumors. The CT findings of an arachnoid cyst are often diagnostic. Metrizamide or air cisternography, either combined with CT or without it, is important for the diagnosis of basal meningoencephalocele and hypothalamic hamartoma. Pneumocephalus and an intracranial foreign body resulting from a head injury can be diagnosed by plain skull and CT. The diagnosis of CSF leakage or prolapse cerebri associated with a skull-base fracture has been most difficult, but even it is possible by a combination of polytomography and high-resolution CT with metrizamide cisternography.

  17. Clinical impact of confocal laser endomicroscopy in the management of gastrointestinal lesions with an uncertain diagnosis.

    Science.gov (United States)

    Robles-Medranda, Carlos; Vargas, Maria; Ospina, Jesenia; Puga-Tejada, Miguel; Valero, Manuel; Soria, Miguel; Bravo, Gladys; Robles-Jara, Carlos; Lukashok, Hannah Pitanga

    2017-08-16

    To evaluate the clinical impact of confocal laser endomicroscopy (CLE) in the diagnosis and management of patients with an uncertain diagnosis. A retrospective chart review was performed. Patients who underwent CLE between November 2013 and October 2015 and exhibited a poor correlation between endoscopic and histological findings were included. Baseline characteristics, indications, previous diagnostic studies, findings at the time of CLE, clinical management and histological results were analyzed. Interventions based on CLE findings were also analyzed. We compared the diagnostic accuracy of CLE and target biopsies of surgical specimens. A total of 144 patients were included. Of these, 51% (74/144) were female. The mean age was 51 years old. In all, 41/144 (28.4%) lesions were neoplastic (13 bile duct, 10 gastric, 8 esophageal, 6 colonic, 1 duodenal, 1 rectal, 1 ampulloma and 1 pancreatic). The sensitivity, specificity, positive predictive value, negative predictive value, and observed agreement when CLE was used to detect N-lesions were 85.37%, 87.38%, 72.92%, 93.75% and 86.81%, respectively. Cohen's Kappa was 69.20%, thus indicating good agreement. Changes in management were observed in 54% of the cases. CLE is a new diagnostic tool that has a significant clinical impact on the diagnosis and treatment of patients with uncertain diagnosis.
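
    The accuracy figures in the abstract above can be reproduced from a single 2×2 table. The following is a minimal sketch, not the authors' analysis: the abstract reports only percentages, so the four cell counts used here (35 true positives, 6 false negatives, 13 false positives, 90 true negatives) are an assumption, back-calculated so that they agree with the reported 144 patients, 41 neoplastic lesions, and the published percentages. The function name `diagnostic_summary` is hypothetical.

```python
def diagnostic_summary(tp, fn, fp, tn):
    """Accuracy measures and Cohen's kappa for a 2x2 test-vs-reference table."""
    n = tp + fn + fp + tn
    observed = (tp + tn) / n  # observed agreement between test and reference
    # Agreement expected by chance, from the row and column marginals.
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / n ** 2
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "observed_agreement": observed,
        "kappa": (observed - expected) / (1 - expected),
    }


# Assumed counts, back-calculated from the reported percentages (not given in the abstract).
stats = diagnostic_summary(tp=35, fn=6, fp=13, tn=90)
for name, value in stats.items():
    print(f"{name}: {100 * value:.2f}%")
```

    With these assumed counts, the five accuracy measures round to the reported values, and kappa comes out at about 0.692, which the abstract expresses as a percentage.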

  18. Reliability of cortical lesion detection on double inversion recovery MRI applying the MAGNIMS-Criteria in multiple sclerosis patients within a 16-months period.

    Directory of Open Access Journals (Sweden)

    Tobias Djamsched Faizy

    In patients with multiple sclerosis (MS), Double Inversion Recovery (DIR) magnetic resonance imaging (MRI) can be used to identify cortical lesions (CL). We sought to evaluate the reliability of CL detection on DIR longitudinally at multiple subsequent time-points applying the MAGNIMS scoring criteria for CLs. 26 MS patients received a 3T-MRI (Siemens, Skyra) with DIR at 12 time-points (TP) within a 16-month period. Scans were assessed in random order by two different raters. Both raters separately marked all CLs on each scan and total lesion numbers were obtained for each scan-TP and patient. After a retrospective re-evaluation, the number of consensus CLs (conL) was defined as the total number of CLs, which both raters finally agreed on. CLs volumes, relative signal intensities and CLs localizations were determined. Both ratings (conL vs. non-consensus scoring) were compared for further analysis. A total number of n = 334 CLs were identified by both raters in 26 MS patients with a first agreement of both raters on 160 out of 334 of the CLs found (κ = 0.48). After the retrospective re-evaluation, consensus agreement increased to 233 out of 334 CL (κ = 0.69). 93.8% of conL were visible in at least 2 consecutive TP. 74.7% of the conL were visible in all 12 consecutive TP. ConL had greater mean lesion volumes and higher mean signal intensities compared to lesions that were only detected by one of the raters (p<0.05). A higher number of CLs in the frontal, parietal, temporal and occipital lobe were identified by both raters than the number of those only identified by one of the raters (p<0.05). After a first assessment, slightly less than a half of the CL were considered as reliably detectable on longitudinal DIR images. A retrospective re-evaluation notably increased the consensus agreement. However, this finding is narrowed, considering the fact that retrospective evaluation steps might not be practicable in clinical routine. Lesions that were not reliably

  19. Diagnostic Value of CT Arthrography for Evaluation of Osteochondral Lesions at the Ankle

    Directory of Open Access Journals (Sweden)

    Jan S. Kirschke

    2016-01-01

    Background. To retrospectively determine the diagnostic value of computed tomography arthrography (CTA) of the ankle in the evaluation of (osteo)chondral lesions in comparison to conventional magnetic resonance imaging (MRI) and intraoperative findings. Methods. A total of N=79 patients had CTAs and MRI of the ankle; in 17/79 cases surgical reports with statements on cartilage integrity were available. Cartilage lesions and bony defects at talus and tibia were scored according to defect depth and size by two radiologists. Statistical analysis included sensitivity analyses and Cohen’s kappa calculations. Results. On CTA, 41/79 and 31/79 patients had full thickness cartilage defects at the talus and at the tibia, respectively. MRI was able to detect 54% of these defects. For the detection of full thickness cartilage lesions, interobserver agreement was substantial (0.72 ± 0.05) for CTA and moderate (0.55 ± 0.07) for MRI. In surgical reports, 88–92% and 46–62% of full thickness defects detected by CTA and MRI were described. CTA findings changed the further clinical management in 15.4% of cases. Conclusions. As compared to conventional MRI, CTA improves detection and visualization of cartilage defects at the ankle and is a relevant tool for treatment decisions in unclear cases.

  20. Comparison between biphasic helical CT and dynamic gadolinium-enhanced MR in the detection and characterization of focal hepatic lesions in cirrhotic patients

    International Nuclear Information System (INIS)

    Puig, J.; Martin, J.; Donoso, L.; Falco, J.; Rue, M.

    1997-01-01

    To assess the agreement between biphasic helical computerized tomography (BHCT) and dynamic gadolinium-enhanced magnetic resonance (MR) in the detection and characterization of focal hepatic lesions in a group of cirrhotic patients. A prospective study was done in 50 cirrhotic patients suspected of having hepatocarcinoma (HC) on the basis of ultrasonographic images evaluated by means of BHCT and dynamic gadolinium-enhanced MR using fast low-angle shot (FLASH) sequences (110-135/4/90°). The images were assessed jointly by four radiologists. Between the two techniques, a total of 83 lesions were detected. MR disclosed more lesions (n=79) than BHCT (n=67) (p<0.005). Moreover, 25 of the lesions that were visible by both techniques were more clearly evident in MR images (p<0.01). MR correctly classified 6 of 7 benign lesions (85%) and 49 of 66 malignant ones (74%). BHCT correctly classified 2 of 7 benign lesions (28%) and 32 of 66 malignant ones (48%). The sensitivities of MR and BHCT for the characterization of these lesions were 74% and 48%, respectively (p=0.0009), while the respective specificities were 86% and 29% (P<0.001). Dynamic gadolinium-enhanced MR with FLASH sequences is more efficient than BHCT in the detection and characterization of focal lesions in cirrhotic patients. (Author) 37 refs
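
    The sensitivities and specificities quoted above follow directly from the lesion counts given in the same abstract. As a small sketch (the names `percent`, `reported_counts`, and `results` are illustrative, and it assumes that a correctly classified malignant lesion counts as a true positive and a correctly classified benign lesion as a true negative), the arithmetic can be checked as follows:

```python
def percent(correct, total):
    """Share of correctly classified lesions, as a percentage."""
    return 100.0 * correct / total


# (correctly classified, total) per lesion type, as stated in the abstract.
reported_counts = {
    "MR": {"malignant": (49, 66), "benign": (6, 7)},
    "BHCT": {"malignant": (32, 66), "benign": (2, 7)},
}

results = {}
for technique, counts in reported_counts.items():
    results[technique] = {
        # Sensitivity: malignant lesions correctly called malignant.
        "sensitivity": percent(*counts["malignant"]),
        # Specificity: benign lesions correctly called benign.
        "specificity": percent(*counts["benign"]),
    }
    print(technique, {k: round(v, 1) for k, v in results[technique].items()})
```

    The benign-lesion proportions (6/7 and 2/7) round to 86% and 29%, which matches the specificities in the abstract; the 85% and 28% quoted alongside the raw counts appear to be the same values truncated rather than rounded.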

  1. Reliability of visual and instrumental color matching.

    Science.gov (United States)

    Igiel, Christopher; Lehmann, Karl Martin; Ghinea, Razvan; Weyhrauch, Michael; Hangx, Ysbrand; Scheller, Herbert; Paravina, Rade D

    2017-09-01

    The aim of this investigation was to evaluate intra-rater and inter-rater reliability of visual and instrumental shade matching. Forty individuals with normal color perception participated in this study. The right maxillary central incisor of a teaching model was prepared and restored with 10 feldspathic all-ceramic crowns of different shades. A shade matching session consisted of the observer (rater) visually selecting the best match by using VITA classical A1-D4 (VC) and VITA Toothguide 3D Master (3D) shade guides and the VITA Easyshade Advance intraoral spectrophotometer (ES) to obtain both VC and 3D matches. Three shade matching sessions were held with 4 to 6 weeks between sessions. Intra-rater reliability was assessed based on the percentage of agreement for the three sessions for the same observer, whereas the inter-rater reliability was calculated as mean percentage of agreement between different observers. The Fleiss' Kappa statistical analysis was used to evaluate visual inter-rater reliability. The mean intra-rater reliability for the visual shade selection was 64(11) for VC and 48(10) for 3D. The corresponding ES values were 96(4) for both VC and 3D. The percentages of observers who matched the same shade with VC and 3D were 55(10) and 43(12), respectively, while corresponding ES values were 88(8) for VC and 92(4) for 3D. The results for visual shade matching exhibited a high to moderate level of inconsistency for both intra-rater and inter-rater comparisons. The VITA Easyshade Advance intraoral spectrophotometer exhibited significantly better reliability compared with visual shade selection. This study evaluates the ability of observers to consistently match the same shade visually and with a dental spectrophotometer in different sessions. The intra-rater and inter-rater reliability (agreement of repeated shade matching) of visual and instrumental tooth color matching strongly suggest the use of color matching instruments as a supplementary tool in

  2. Breast lesion characterization using whole-lesion histogram analysis with stretched-exponential diffusion model.

    Science.gov (United States)

    Liu, Chunling; Wang, Kun; Li, Xiaodan; Zhang, Jine; Ding, Jie; Spuhler, Karl; Duong, Timothy; Liang, Changhong; Huang, Chuan

    2018-06-01

    Diffusion-weighted imaging (DWI) has been studied in breast imaging and can provide more information about diffusion, perfusion and other physiological interests than standard pulse sequences. The stretched-exponential model has previously been shown to be more reliable than conventional DWI techniques, but different diagnostic sensitivities were found from study to study. This work investigated the characteristics of whole-lesion histogram parameters derived from the stretched-exponential diffusion model for benign and malignant breast lesions, compared them with conventional apparent diffusion coefficient (ADC), and further determined which histogram metrics can be best used to differentiate malignant from benign lesions. This was a prospective study. Seventy females were included in the study. Multi-b value DWI was performed on a 1.5T scanner. Histogram parameters of whole lesions for distributed diffusion coefficient (DDC), heterogeneity index (α), and ADC were calculated by two radiologists and compared among benign lesions, ductal carcinoma in situ (DCIS), and invasive carcinoma confirmed by pathology. Nonparametric tests were performed for comparisons among invasive carcinoma, DCIS, and benign lesions. Comparisons of receiver operating characteristic (ROC) curves were performed to show the ability to discriminate malignant from benign lesions. The majority of histogram parameters (mean/min/max, skewness/kurtosis, 10-90 th percentile values) from DDC, α, and ADC were significantly different among invasive carcinoma, DCIS, and benign lesions. DDC 10% (area under curve [AUC] = 0.931), ADC 10% (AUC = 0.893), and α mean (AUC = 0.787) were found to be the best metrics in differentiating benign from malignant tumors among all histogram parameters derived from ADC and α, respectively. The combination of DDC 10% and α mean , using logistic regression, yielded the highest sensitivity (90.2%) and specificity (95.5%). DDC 10% and α mean derived from

  3. A new MRI grading system for chondromalacia patellae.

    Science.gov (United States)

    Özgen, Ali; Taşdelen, Neslihan; Fırat, Zeynep

    2017-04-01

    Background Chondromalacia patellae is a very common disorder. Although magnetic resonance imaging (MRI) is widely used to investigate patellar cartilage lesions, there is no descriptive MRI-based grading system for chondromalacia patellae. Purpose To propose a new MRI grading system for chondromalacia patellae with corresponding high resolution images which might be useful in precisely reporting and comparing knee examinations in routine daily practice and used in predicting natural course and clinical outcome of the patellar cartilage lesions. Material and Methods High resolution fat-saturated proton density (FS PD) images in the axial plane with corresponding T2 mapping images were reviewed. A detailed MRI grading system covering the deficiencies of the existing gradings has been set and presented on these images. Two experienced observers blinded to clinical data examined 44 knee MR images and evaluated patellar cartilage changes according to the proposed grading system. Inter- and intra-rater validity testing using kappa statistics were calculated. Results A descriptive and detailed grading system with corresponding FS PD and T2 mapping images has been presented. Inter-rater agreement was 0.80 (95% confidence interval [CI], 0.71-0.89). Intra-rater agreements were 0.83 (95% CI, 0.74-0.91) for observer A and 0.79 (95% CI, 0.70-0.88) for observer B (k-values). Conclusion We present a new MRI grading system for chondromalacia patellae with corresponding images and good inter- and intra-rater agreement which might be useful in reporting and comparing knee MRI examinations in daily practice and may also have the potential for using more precisely predicting prognosis and clinical outcome of the patients.

  4. Concordance between fine-needle aspiration and core biopsies for osseous lesions by lesion imaging appearance and CT attenuation.

    Science.gov (United States)

    Li, John; Weissberg, Zoe; Bevilacqua, Thomas A; Yu, Gordon; Weber, Kristy; Sebro, Ronnie

    2018-04-01

    To compare the concordance between fine-needle aspiration and core biopsies for osseous lesions by lesion imaging appearance and CT attenuation. Retrospective review of 215 FNAs of osseous lesions performed in conjunction with core biopsy at our institution over a 6-year period (2011-2016). FNAs were interpreted independently of core biopsies. We assessed if FNA in conjunction with core biopsy increased diagnostic accuracy compared to core biopsy alone. We also calculated the concordance between FNA and core biopsy by lesion appearance, lesion CT attenuation, lesion histology, lesion location and FNA needle gauge size. Core biopsy alone provided the diagnosis in 207/215 cases (96.3%), however, the FNA provided the diagnosis in the remaining 8/215 cases (3.7%) where the core biopsy was non-diagnostic. There were 154 (71.6%) lytic lesions, 21 (9.8%) blastic lesions, 25 (11.6%) mixed lytic and blastic lesions and 15 (7.0%) lesions that were neither lytic nor blastic. The concordance between FNA and core biopsy for lytic osseous lesions (136/154 cases, 88.3%) was statistically significantly higher than that for blastic osseous lesions (13/21 cases, 61.9%) [P = 4.2 × 10⁻³; 95% CI (0.02, 0.50)]. The concordance between FNA and core biopsy was higher for low-attenuation (110/126) than for high-attenuation (58/77) lesions (P = 0.028). The concordance between FNA and core biopsy was also higher for metastases (102/119 cases, 85.7%) than non-metastases (78/96, 81.3%) [P = 0.487; 95% CI (-0.15, 0.065)]. There was no difference in the rate of concordance between FNA and core biopsy by lesion location or FNA needle gauge size (P > 0.05). FNA with core biopsy increases diagnostic rate compared to core biopsy alone or FNA alone. The concordance between FNA and core biopsy is higher for lytic lesions than for blastic lesions; and higher for low-attenuation lesions than for high-attenuation lesions.
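
    The lytic versus blastic comparison above can be illustrated with a two-proportion test. The abstract does not name the test that was used, so the following is only a sketch under an assumption: a chi-square test with Yates' continuity correction, with a continuity-corrected Wald interval for the difference in proportions. The function name `compare_concordance` is hypothetical; the counts (136/154 and 13/21) are those reported in the abstract.

```python
import math
from statistics import NormalDist


def compare_concordance(agree_a, total_a, agree_b, total_b, level=0.95):
    """Continuity-corrected comparison of two concordance proportions."""
    # 2x2 table: rows are lesion groups, columns are concordant / discordant.
    a, b = agree_a, total_a - agree_a
    c, d = agree_b, total_b - agree_b
    n = total_a + total_b

    # Pearson chi-square with Yates' continuity correction (1 degree of freedom).
    corrected = max(0.0, abs(a * d - b * c) - n / 2)
    chi2 = n * corrected ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # Upper tail of the chi-square distribution with 1 df, via erfc.
    p_value = math.erfc(math.sqrt(chi2 / 2))

    # Wald interval for the difference, widened by the continuity correction.
    p1, p2 = a / total_a, c / total_b
    diff = p1 - p2
    se = math.sqrt(p1 * (1 - p1) / total_a + p2 * (1 - p2) / total_b)
    z = NormalDist().inv_cdf(0.5 + level / 2)
    cc = 0.5 * (1 / total_a + 1 / total_b)
    return diff, (diff - z * se - cc, diff + z * se + cc), p_value


# Lytic (136/154 concordant) versus blastic (13/21 concordant) lesions.
diff, (ci_low, ci_high), p_value = compare_concordance(136, 154, 13, 21)
print(f"difference = {diff:.3f}, 95% CI = ({ci_low:.2f}, {ci_high:.2f}), P = {p_value:.4f}")
```

    Under this assumed test, the interval rounds to (0.02, 0.50) and the P value is close to 0.004, in line with the figures in the abstract. A different test, such as Fisher's exact test, would give somewhat different numbers.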

  5. Epidermal hydrogen peroxide is not increased in lesional and non-lesional skin of vitiligo.

    Science.gov (United States)

    Zailaie, Mohammad Z

    2017-01-01

    It is widely believed that the loss of the epidermal melanocytes in vitiligo is basically due to excessive oxidative stress. Previous research work described abnormal elevation of the absolute concentration of the epidermal hydrogen peroxide (H2O2) in lesional and non-lesional skin of vitiligo. Based on this finding, our primary research objective was to use this feature as a screening marker in individuals at a great risk of developing vitiligo. Ninety-six patients of non-segmental vitiligo (NSV) of varying durations, skin phototypes, and treatment modalities (psoralen UVA-, narrow band UVB-treated) were recruited for this study. Raman spectroscopic measurements, using an external probehead, of the lesional and non-lesional skin were obtained, and the resulting spectra were analyzed using the Opus software package of the MultiRam spectrometer and the intensity of the peak at 875 cm⁻¹ that represents the absolute concentration of H2O2 was calculated. Contrary to previous reports, in patients of skin phototype IV, the absolute concentrations of H2O2 in non-lesional and lesional NSV of all groups were non-significantly decreased compared to normal control. In patients of NSV of skin phototype V, the decrease in the absolute concentrations of H2O2 was not significant in the untreated group, and a slight non-significant increase in the NBUVB-treated group was noted. However, in the PUVA-treated group, the non-lesional skin demonstrated significant increase in the absolute concentration of H2O2, whereas the lesional skin showed only a slight non-significant increase compared to normal control. In NSV patients of skin phototype VI who were previously treated with PUVA, the non-lesional skin showed a slight non-significant increase in the absolute concentration of H2O2; however, the lesional skin showed a marked significant decrease compared to normal control and the non-lesional skin. Thereof, one can conclude that the epidermal H2O2 is not

  6. The interrater reliability of rating non-exercise activity of inpatients with eating disorders using a visual analogue scale.

    Science.gov (United States)

    Mazloum, A; Johnston, M; Lundrigan, M; Birmingham, C L

    2008-12-01

    Non-exercise activity thermogenesis (NEAT) is the energy expended by body movement, other than sleeping, eating or sports-like activities. The obese have been reported to have a lower NEAT (walking, standing, and fidgeting) than controls. We hypothesize that an elevated NEAT could explain why some patients with anorexia nervosa are resistant to weight gain. To evaluate the interrater reliability of a rating of non-exercise activity of inpatients with eating disorders (ED) using a visual analogue scale (VAS). Health care providers were asked to rate the non-exercise activity of inpatients by marking a VAS. Eight patients were individually rated by 10 clinicians. Results were analyzed using the intraclass correlation coefficient (ICC) and Cohen's multi-rater kappa statistic (kappa). The ICC(3,k) was 0.257 (pexercise activity and physiological measurements should be used.

  7. 46 CFR 298.38 - Partnership agreements and limited liability company agreements.

    Science.gov (United States)

    2010-10-01

    ... 46 Shipping 8 2010-10-01 2010-10-01 false Partnership agreements and limited liability company... liability company agreements. Partnership and limited liability company agreements must be in form and...) Duration of the entity; (b) Adequate partnership or limited liability company funding requirements and...

  8. Classification of brain compartments and head injury lesions by neural networks applied to MRI

    International Nuclear Information System (INIS)

    Kischell, E.R.; Kehtarnavaz, N.; Hillman, G.R.; Levin, H.; Lilly, M.; Kent, T.A.

    1995-01-01

    An automatic, neural network-based approach was applied to segment normal brain compartments and lesions on MR images. Two supervised networks, backpropagation (BPN) and counterpropagation, and two unsupervised networks, Kohonen learning vector quantizer and analog adaptive resonance theory, were trained on registered T2-weighted and proton density images. The classes of interest were background, gray matter, white matter, cerebrospinal fluid, macrocystic encephalomalacia, gliosis, and 'unknown'. A comprehensive feature vector was chosen to discriminate these classes. The BPN combined with feature conditioning, multiple discriminant analysis followed by Hotelling transform, produced the most accurate and consistent classification results. Classifications of normal brain compartments were generally in agreement with expert interpretation of the images. Macrocystic encephalomalacia and gliosis were recognized and, except around the periphery, classified in agreement with the clinician's report used to train the neural network. (orig.)

  9. Classification of brain compartments and head injury lesions by neural networks applied to MRI

    Energy Technology Data Exchange (ETDEWEB)

    Kischell, E R [Dept. of Electrical Engineering, Texas A and M Univ., College Station, TX (United States); Kehtarnavaz, N [Dept. of Electrical Engineering, Texas A and M Univ., College Station, TX (United States); Hillman, G R [Dept. of Pharmacology, Univ. of Texas Medical Branch, Galveston, TX (United States); Levin, H [Dept. of Neurosurgery, Univ. of Texas Medical Branch, Galveston, TX (United States); Lilly, M [Dept. of Neurosurgery, Univ. of Texas Medical Branch, Galveston, TX (United States); Kent, T A [Dept. of Neurology and Psychiatry, Univ. of Texas Medical Branch, Galveston, TX (United States)

    1995-10-01

    An automatic, neural network-based approach was applied to segment normal brain compartments and lesions on MR images. Two supervised networks, backpropagation (BPN) and counterpropagation, and two unsupervised networks, Kohonen learning vector quantizer and analog adaptive resonance theory, were trained on registered T2-weighted and proton density images. The classes of interest were background, gray matter, white matter, cerebrospinal fluid, macrocystic encephalomalacia, gliosis, and 'unknown'. A comprehensive feature vector was chosen to discriminate these classes. The BPN combined with feature conditioning, multiple discriminant analysis followed by Hotelling transform, produced the most accurate and consistent classification results. Classifications of normal brain compartments were generally in agreement with expert interpretation of the images. Macrocystic encephalomalacia and gliosis were recognized and, except around the periphery, classified in agreement with the clinician's report used to train the neural network. (orig.)

  10. Improved efficiency in clinical workflow of reporting measured oncology lesions via PACS-integrated lesion tracking tool.

    Science.gov (United States)

    Sevenster, Merlijn; Travis, Adam R; Ganesh, Rajiv K; Liu, Peng; Kose, Ursula; Peters, Joost; Chang, Paul J

    2015-03-01

    OBJECTIVE. Imaging provides evidence for the response to oncology treatment by the serial measurement of reference lesions. Unfortunately, the identification, comparison, measurement, and documentation of several reference lesions can be an inefficient process. We tested the hypothesis that optimized workflow orchestration and tight integration of a lesion tracking tool into the PACS and speech recognition system can result in improvements in oncologic lesion measurement efficiency. SUBJECTS AND METHODS. A lesion management tool tightly integrated into the PACS workflow was developed. We evaluated the effect of the use of the tool on measurement reporting time by means of a prospective time-motion study on 86 body CT examinations with 241 measurable oncologic lesions with four radiologists. RESULTS. Aggregated measurement reporting time per lesion was 11.64 seconds in standard workflow, 16.67 seconds if readers had to register measurements de novo, and 6.36 seconds for each subsequent follow-up study. Differences were statistically significant (p workflow-integrated lesion management tool, especially for patients with multiple follow-up examinations, reversing the one-time efficiency penalty at baseline registration.
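
    The "one-time efficiency penalty" can be illustrated with the three per-lesion times reported above. The sketch assumes that the standard workflow costs 11.64 seconds at every study, baseline included, and that times simply add up across studies; neither assumption is stated in the abstract, and the function name is hypothetical.

```python
STANDARD = 11.64   # seconds per lesion, standard workflow (from the abstract)
BASELINE = 16.67   # seconds per lesion, de novo registration with the tool
FOLLOW_UP = 6.36   # seconds per lesion, each subsequent follow-up with the tool


def cumulative_seconds(n_followups):
    """Per-lesion reporting time for one baseline study plus n follow-ups.

    Returns (standard workflow total, tool-assisted total).
    """
    standard = STANDARD * (1 + n_followups)
    with_tool = BASELINE + FOLLOW_UP * n_followups
    return standard, with_tool


for n in range(4):
    standard, with_tool = cumulative_seconds(n)
    print(n, round(standard, 2), round(with_tool, 2), with_tool < standard)
```

    Under these assumptions the tool is slower at baseline only and is already ahead after the first follow-up study.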

  11. Approximal sealings on lesions in neighbouring teeth requiring operative treatment: an in vitro study.

    Science.gov (United States)

    Cartagena, Alvaro; Bakhshandeh, Azam; Ekstrand, Kim Rud

    2018-02-07

    With this in vitro study we aimed to assess the possibility of precise application of sealant on accessible artificial white spot lesions (WSL) on approximal surfaces next to a tooth surface under operative treatment. A secondary aim was to evaluate whether the use of magnifying glasses improved the application precision. Fifty-six extracted premolars were selected, approximal WSL were created with 15% HCl gel and standardized photographs were taken. The premolars were mounted in plaster-models in contact with a neighbouring molar with Class II/I-II restoration (Sample 1) or approximal, cavitated dentin lesion (Sample 2). The restorations or the lesion were removed, and Clinpro Sealant was placed over the WSL. Magnifying glasses were used when sealing half the study material. The sealed premolar was removed from the plaster-model and photographed. Adobe Photoshop was used to measure the size of WSL and sealed area. The degree of match between the areas was determined in Photoshop. Interclass agreement for WSL, sealed, and matched areas was excellent (κ = 0.98-0.99). The sealant covered 48-100% of the WSL-area (median = 93%) in Sample 1 and 68-100% of the WSL-area (median = 95%) in Sample 2. No statistical differences were observed concerning uncovered proportions of the WSL-area between groups with and without using magnifying glasses (p values ≥ .19). However, overextended sealed areas were more pronounced when magnification was used (p = .01). The precision did not differ between the samples (p = .31). It was possible to seal accessible approximal lesions with high precision. Use of magnifying glasses did not improve the precision.

  12. Technology Partnership Agreements | NREL

    Science.gov (United States)

    Partnership Agreements Technology Partnership Agreements Looking for Funding? We do not fund any projects under a technology partnership agreement. The partner provides the necessary resources and, in using technology partnership agreements. See a summary of our Fiscal Year 2017 technology partnership

  13. A disappearing neonatal skin lesion.

    LENUS (Irish Health Repository)

    Hawkes, Colin Patrick

    2012-01-31

    A preterm baby girl was noted at birth to have a firm, raised, non-tender skin lesion located over her right hip. She developed three similar smaller lesions on her ear, buttock and right knee. All lesions had resolved by 2 months of age.

  14. Diffuse cavitary lung lesions

    Energy Technology Data Exchange (ETDEWEB)

    Grunzke, Mindy; Garrington, Timothy [University of Colorado Denver, Department of Pediatrics, Aurora, CO (United States); The Children's Hospital, Rick Wilson Center for Cancer and Blood Disorders, Aurora, CO (United States); Hayes, Kari [The Children's Hospital, Pediatric Radiology, Aurora, CO (United States); Bourland, Wendy [Children's Hospital at St. Francis, Warren Clinic, Inc., Tulsa, OK (United States)

    2010-02-15

    An 11-year-old girl presented with a 2-month history of progressively worsening cough, daily fevers, and weight loss. A chest radiograph revealed multiple cystic cavitary lung lesions. An extensive infectious work-up was negative. Chest CT verified multiple cavitary lung lesions bilaterally, and [F-18]2-fluoro-2-deoxy-D-glucose (18F-FDG) positron emission tomography with CT (PET/CT) showed increased uptake in the lung lesions as well as regional lymph nodes. Subsequent biopsy of an involved lymph node confirmed classical Hodgkin lymphoma, nodular sclerosis type. This case represents an unusual presentation for a child with Hodgkin lymphoma and demonstrates a role for 18F-FDG PET/CT in evaluating a child with cavitary lung lesions. (orig.)

  15. Diffuse cavitary lung lesions

    International Nuclear Information System (INIS)

    Grunzke, Mindy; Garrington, Timothy; Hayes, Kari; Bourland, Wendy

    2010-01-01

    An 11-year-old girl presented with a 2-month history of progressively worsening cough, daily fevers, and weight loss. A chest radiograph revealed multiple cystic cavitary lung lesions. An extensive infectious work-up was negative. Chest CT verified multiple cavitary lung lesions bilaterally, and [F-18]2-fluoro-2-deoxy-D-glucose (18F-FDG) positron emission tomography with CT (PET/CT) showed increased uptake in the lung lesions as well as regional lymph nodes. Subsequent biopsy of an involved lymph node confirmed classical Hodgkin lymphoma, nodular sclerosis type. This case represents an unusual presentation for a child with Hodgkin lymphoma and demonstrates a role for 18F-FDG PET/CT in evaluating a child with cavitary lung lesions. (orig.)

  16. Energy deposition and the formation of biologically significant lesions by accelerated ions

    International Nuclear Information System (INIS)

    Kiefer, J.

    1985-01-01

    The assumption that the number of biologically significant lesions depends only on the amount of energy absorbed in a critical cellular site cannot explain the increase of RBE with LET and leads to large discrepancies between predicted and measured inactivation cross sections in the LET range between 20 and 200 keV·μm⁻¹. It must therefore be concluded that not only the amount of energy absorbed but also the spatial pattern of its deposition plays a decisive role. In the model presented, it is postulated that two or more energy deposition events in nanometre sites are required for the formation of biologically significant lesions. This cooperative action has to take place within very short times, so that only interactions within a single particle track contribute. The mathematical treatment is outlined, and it is shown qualitatively that the model is able to predict RBE-LET relationships. The calculations use a track structure model based on classical collision mechanics. It is compared with existing experimental results, showing good agreement at least for higher particle energies. (author)

  17. Interobserver agreement in the histologic diagnosis of colorectal polyps. the experience of the multicenter adenoma colorectal study (SMAC).

    Science.gov (United States)

    Costantini, Massimo; Sciallero, Stefania; Giannini, Augusto; Gatteschi, Beatrice; Rinaldi, Paolo; Lanzanova, Giuseppe; Bonelli, Luigina; Casetti, Tino; Bertinelli, Elisabetta; Giuliani, Orietta; Castiglione, Guido; Mantellini, Paola; Naldoni, Carlo; Bruzzi, Paolo

    2003-03-01

    Current clinical practice guidelines for patients with colorectal polyps are mainly based on the histologic characteristics of their lesions. However, interobserver variability in the assessment of specific polyp characteristics has been evaluated in very few studies. The purpose of this study was to evaluate the interobserver agreement of four pathologists in the diagnosis of histologic type of colorectal polyps and in the degree of dysplasia and of infiltrating carcinoma in adenomas. A stratified random sample of 100 polyps was obtained from the 4,889 polyps resected within the Multicentre Adenoma Colorectal Study (SMAC), and the slides were blindly reviewed by the four pathologists. Agreement was analyzed using kappa statistics. A median kappa of 0.89 (range 0.79-1.0) was estimated for the interobserver agreement for the diagnosis of hyperplastic polyp vs. adenoma. The agreement in the diagnosis of tubular, tubulovillous, and villous type was given by median kappa values of 0.50, 0.15, and 0.36, respectively. The median kappa for the diagnosis of infiltrating carcinoma was 0.78 (range 0.73-0.84). Agreement on diagnosis of adenoma histologic subtypes, degrees of dysplasia, or infiltrating carcinoma in adenoma was moderate. A simpler classification might help to better identify patients at different risk of colorectal cancer.
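
    A median kappa with a range across four readers can be sketched as the median of Cohen's kappa over all six reader pairs. The abstract does not state that the median was formed this way, so the pairwise scheme is an assumption, and the ratings below are invented for illustration ("H" for hyperplastic polyp, "A" for adenoma).

```python
from collections import Counter
from itertools import combinations
from statistics import median


def cohen_kappa(a, b):
    """Unweighted Cohen's kappa for two raters over the same cases."""
    n = len(a)
    observed = sum(x == y for x, y in zip(a, b)) / n
    count_a, count_b = Counter(a), Counter(b)
    # Chance agreement from the two raters' marginal frequencies.
    expected = sum(count_a[c] * count_b[c] for c in count_a) / n ** 2
    if expected == 1:
        raise ValueError("kappa is undefined when both raters use one category")
    return (observed - expected) / (1 - expected)


# Hypothetical ratings of 8 polyps by 4 pathologists (not study data).
ratings = {
    "P1": ["H", "A", "A", "H", "A", "A", "H", "A"],
    "P2": ["H", "A", "A", "H", "A", "A", "A", "A"],
    "P3": ["H", "A", "A", "H", "A", "H", "H", "A"],
    "P4": ["H", "A", "A", "H", "A", "A", "H", "A"],
}

pairwise = [cohen_kappa(ratings[r1], ratings[r2])
            for r1, r2 in combinations(ratings, 2)]
print("median kappa:", round(median(pairwise), 2))
print("range:", round(min(pairwise), 2), "to", round(max(pairwise), 2))
```

    A multi-rater statistic such as Fleiss' kappa would be an alternative to the pairwise median.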

  18. Correlation between model observer and human observer performance in CT imaging when lesion location is uncertain

    Energy Technology Data Exchange (ETDEWEB)

    Leng, Shuai; Yu, Lifeng; Zhang, Yi; McCollough, Cynthia H. [Department of Radiology, Mayo Clinic, 200 First Street Southwest, Rochester, Minnesota 55905 (United States); Carter, Rickey [Department of Biostatistics, Mayo Clinic, 200 First Street Southwest, Rochester, Minnesota 55905 (United States); Toledano, Alicia Y. [Biostatistics Consulting, LLC, 10606 Wheatley Street, Kensington, Maryland 20895 (United States)

    2013-08-15

    Purpose: The purpose of this study was to investigate the correlation between model observer and human observer performance in CT imaging for the task of lesion detection and localization when the lesion location is uncertain. Methods: Two cylindrical rods (3-mm and 5-mm diameters) were placed in a 35 × 26 cm torso-shaped water phantom to simulate lesions with −15 HU contrast at 120 kV. The phantom was scanned 100 times on a 128-slice CT scanner at each of four dose levels (CTDIvol = 5.7, 11.4, 17.1, and 22.8 mGy). Regions of interest (ROIs) around each lesion were extracted to generate images with signal-present, with each ROI containing 128 × 128 pixels. Corresponding ROIs of signal-absent images were generated from images without lesion mimicking rods. The location of the lesion (rod) in each ROI was randomly distributed by moving the ROIs around each lesion. Human observer studies were performed by having three trained observers identify the presence or absence of lesions, indicating the lesion location in each image and scoring confidence for the detection task on a 6-point scale. The same image data were analyzed using a channelized Hotelling model observer (CHO) with Gabor channels. Internal noise was added to the decision variables for the model observer study. Area under the curve (AUC) of ROC and localization ROC (LROC) curves were calculated using a nonparametric approach. The Spearman's rank order correlation between the average performance of the human observers and the model observer performance was calculated for the AUC of both ROC and LROC curves for both the 3- and 5-mm diameter lesions. Results: In both ROC and LROC analyses, AUC values for the model observer agreed well with the average values across the three human observers. The Spearman's rank order correlation values for both ROC and LROC analyses for both the 3- and 5-mm diameter lesions were all 1.0, indicating perfect rank ordering agreement of the figures of merit (AUC
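
    The two statistics named above can be sketched in a few lines: a nonparametric (Mann-Whitney) estimate of the ROC AUC from confidence scores, and Spearman's rank correlation between two sets of AUC values. This is only a sketch; the abstract does not give its exact estimator, the LROC variant (which also requires correct localization) is not covered, and the AUC values at the four dose levels are invented for illustration.

```python
from math import sqrt


def auc_nonparametric(present, absent):
    """Mann-Whitney estimate of ROC AUC; ties count as half a win."""
    wins = sum((p > a) + 0.5 * (p == a) for p in present for a in absent)
    return wins / (len(present) * len(absent))


def average_ranks(values):
    """Ranks starting at 1, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=values.__getitem__)
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            ranks[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman's rho: Pearson correlation of the rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / sqrt(var_x * var_y)


# Hypothetical AUCs at the four dose levels (not study data).
human_auc = [0.71, 0.80, 0.86, 0.91]
model_auc = [0.69, 0.83, 0.88, 0.93]
print(spearman(human_auc, model_auc))
```

    With only four dose levels, a correlation of 1.0 means that the two observers order the dose levels identically; it does not by itself say that the AUC values are equal.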

  19. Imaging inflammatory acne: lesion detection and tracking

    Science.gov (United States)

    Cula, Gabriela O.; Bargo, Paulo R.; Kollias, Nikiforos

    2010-02-01

    It is known that the effectiveness of acne treatment increases when lesions are detected early, before they progress into mature wound-like lesions that lead to scarring and discoloration. However, little is known about the evolution of acne from early signs until after the lesion heals. In this work we computationally characterize the evolution of inflammatory acne lesions by analyzing cross-polarized images that document acne-prone facial skin over time. Taking skin images over time and following skin features in these images present serious challenges because of changes in the appearance of the skin, difficulty in repositioning the subject, and involuntary movement such as breathing. A computational technique for automatic detection of lesions by separating the background normal skin from the acne lesions, based on fitting Gaussian distributions to the intensity histograms, is presented. In order to track and quantify the evolution of lesions, in terms of the degree of progress or regress, we designed a study to capture facial skin images from an acne-prone young individual at 3 different time points. Based on the behavior of the lesions between two consecutive time points, the automatically detected lesions are classified in four categories: new lesions, resolved lesions (i.e. lesions that disappear completely), lesions that are progressing, and lesions that are regressing (i.e. lesions in the process of healing). The classification achieved by our method correlates well with visual inspection by a trained human grader.
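
    The four-way classification between two consecutive time points can be sketched as follows. The abstract does not say which measurement defines progress or regress, so the use of detected lesion area, the lesion identifiers, and the extra "unchanged" label are all assumptions made for illustration.

```python
def classify_lesions(areas_t0, areas_t1):
    """Label each lesion by its change between two consecutive time points.

    areas_t0, areas_t1: dicts mapping a hypothetical lesion id to its
    detected area in pixels; a missing id means no lesion was detected.
    """
    labels = {}
    for lesion_id in sorted(set(areas_t0) | set(areas_t1)):
        before = areas_t0.get(lesion_id)
        after = areas_t1.get(lesion_id)
        if before is None:
            labels[lesion_id] = "new"
        elif after is None:
            labels[lesion_id] = "resolved"
        elif after > before:
            labels[lesion_id] = "progressing"
        elif after < before:
            labels[lesion_id] = "regressing"
        else:
            # Not one of the abstract's four categories; added so that
            # an equal area does not get forced into either direction.
            labels[lesion_id] = "unchanged"
    return labels


# Hypothetical areas at two visits (not study data).
visit_1 = {"a": 10, "b": 20, "c": 5}
visit_2 = {"b": 30, "c": 2, "d": 7}
print(classify_lesions(visit_1, visit_2))
```

    Matching lesion identifiers across visits presupposes that the images have been registered, which the abstract identifies as a main difficulty.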

  20. Solitary sternal lesions in breast cancer. Lesiones esternales unicas en cancer de mama

    Energy Technology Data Exchange (ETDEWEB)

    Morales, R; Cano, R; Mendoza, G [Instituto Peruano de Energia Nuclear, Lima (Peru); Guzman, C; Cotrina, M; Aguilar, C [Instituto Nacional de Enfermedades Neoplasicas, Lima (Peru)

    1993-12-01

    In a retrospective review of bone scans performed in 1740 patients with breast cancer from January 1988 to April 1993, twenty had a solitary sternal lesion. Etiology was found correlating this finding with pathology, x-rays and/or final outcome. Nineteen lesions were due to metastases and one to infection. This experience suggests that solitary sternal lesions in breast cancer patients are uncommon and are most frequently (95%) associated with malignant etiology. (Authors). 10 refs., 2 figs.

  1. 48 CFR 225.403 - World Trade Organization Government Procurement Agreement and Free Trade Agreements.

    Science.gov (United States)

    2010-10-01

    ... Government Procurement Agreement and Free Trade Agreements. 225.403 Section 225.403 Federal Acquisition... FOREIGN ACQUISITION Trade Agreements 225.403 World Trade Organization Government Procurement Agreement and... Government Procurement Agreement, acquire only U.S.-made, qualifying country, or designated country end...

  2. A study of the reliability of the Nociception Coma Scale.

    Science.gov (United States)

    Riganello, F; Cortese, M D; Arcuri, F; Candelieri, A; Guglielmino, F; Dolce, G; Sannita, W G; Schnakers, C

    2015-04-01

    In this study, we investigated the reliability of the Nociception Coma Scale which has recently been developed to assess nociception in non-communicative, severely brain-injured patients. Prospective cross-sequential study. Semi-intensive care unit and long-term brain injury care. Forty-four patients diagnosed as being in a vegetative state (n=26) or in a minimally conscious state (n=18). Patients were assessed by two experts (rater A and rater B) on two consecutive weeks to measure inter-rater agreement and test-retest reliability. Total scores and subscores of the Nociception Coma Scale. We performed a total of 176 assessments. The inter-rater agreement was moderate for the total scores (k = 0.57) and fair to substantial for the subscores (0.33 ≤ k ≤ 0.62) on week 2. The test-retest reliability was substantial for the total scores (k = 0.66) and moderate to almost perfect for the subscores (0.53 ≤ k ≤ 0.96) for rater A. The inter-rater agreement was weaker on week 1, whereas the test-retest reliability was lower for the least experienced rater (rater B). This study provides further evidence of the psychometric qualities of the Nociception Coma Scale. Future studies should assess the impact of practical experience and background on administration and scoring of the scale. © The Author(s) 2014.

  3. Ecologically relevant outcome measure for post-inpatient rehabilitation.

    Science.gov (United States)

    Marquez de la Plata, Carlos; Qualls, Devin; Plenger, Patrick; Malec, James F; Hayden, Mary Ellen

    2017-01-01

    Transfer of skills learned within the clinic environment to patients' home or community is important in post-inpatient brain injury rehabilitation (PBIR). Outcome measures used in PBIR assess level of independence during functional tasks; however, available functional instruments do not quantitate the environment in which the behaviors occur. To examine the reliability and validity of an instrument used to assess patients' functional abilities while quantifying the amount of structure and distractions in the environment. 2501 patients who sustained a traumatic brain injury (TBI) or cerebrovascular accident (CVA) and participated in a multidisciplinary PBIR program between 2006 and 2014 were identified retrospectively for this study. The PERPOS and MPAI-4 were used to assess functional abilities at admission and at discharge. Construct validity was assessed using a bivariate Spearman rho analysis. A subsample of 56 consecutive admissions during 2014 was examined to determine inter-rater reliability. Intra-class correlation coefficient (ICC) and Kappa coefficients assessed inter-rater agreement of the total PERPOS and PERPOS subscales, respectively. The PERPOS and MPAI-4 demonstrated a strong negative association among both TBI and CVA patients. Kappa scores for the three PERPOS scales each demonstrated good to excellent inter-rater agreement. The ICC for overall PERPOS scores fell in the good agreement range. The PERPOS can be used reliably in PBIR to quantify patients' functional abilities within the context of environmental demands.

  4. T2-Weighted Dixon Turbo Spin Echo for Accelerated Simultaneous Grading of Whole-Body Skeletal Muscle Fat Infiltration and Edema in Patients With Neuromuscular Diseases.

    Science.gov (United States)

    Schlaeger, Sarah; Klupp, Elisabeth; Weidlich, Dominik; Cervantes, Barbara; Foreman, Sarah C; Deschauer, Marcus; Schoser, Benedikt; Katemann, Christoph; Kooijman, Hendrik; Rummeny, Ernst J; Zimmer, Claus; Kirschke, Jan S; Karampinos, Dimitrios C

    2018-04-02

    The assessment of fatty infiltration and edema in the musculature of patients with neuromuscular diseases (NMDs) typically requires the separate performance of T1-weighted and fat-suppressed T2-weighted sequences. T2-weighted Dixon turbo spin echo (TSE) enables the generation of T2-weighted fat- and water-separated images, which can be used to assess both pathologies simultaneously. The present study examines the diagnostic performance of T2-weighted Dixon TSE compared with the standard sequences in 10 patients with NMDs and 10 healthy subjects. Whole-body magnetic resonance imaging was performed including T1-weighted Dixon fast field echo, T2-weighted short-tau inversion recovery, and T2-weighted Dixon TSE. Fatty infiltration and intramuscular edema were rated by 2 radiologists using visual semiquantitative rating scales. To assess intermethod and interrater agreement, weighted Cohen's κ coefficients were calculated. The ratings of fatty infiltration showed high intermethod and high interrater agreement (T1-weighted Dixon fast field echo vs T2-weighted Dixon TSE fat image). The evaluation of edematous changes showed high intermethod and good interrater agreement (T2-weighted short-tau inversion recovery vs T2-weighted Dixon TSE water image). T2-weighted Dixon TSE imaging is an alternative for accelerated simultaneous grading of whole-body skeletal muscle fat infiltration and edema in patients with NMDs.
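
    A weighted Cohen's kappa for ordinal ratings such as these semiquantitative scales can be sketched as below. The abstract does not say whether linear or quadratic weights were used, so linear weights are an assumption; grades are assumed to be coded as integers 0 to n_levels - 1.

```python
def weighted_kappa(a, b, n_levels):
    """Linearly weighted Cohen's kappa for two raters on an ordinal scale.

    a, b: ratings coded as integers in range(n_levels).
    """
    n = len(a)
    observed = [[0] * n_levels for _ in range(n_levels)]
    for x, y in zip(a, b):
        observed[x][y] += 1
    row = [sum(r) for r in observed]
    col = [sum(observed[i][j] for i in range(n_levels)) for j in range(n_levels)]
    disagreement = chance = 0.0
    for i in range(n_levels):
        for j in range(n_levels):
            weight = abs(i - j) / (n_levels - 1)  # 0 on the diagonal
            disagreement += weight * observed[i][j]
            chance += weight * row[i] * col[j] / n
    return 1 - disagreement / chance


# Hypothetical grades from two readers on a 4-point scale (not study data).
reader_1 = [0, 1, 2, 3, 2, 1, 0, 3]
reader_2 = [0, 1, 3, 3, 2, 0, 0, 2]
print(round(weighted_kappa(reader_1, reader_2, 4), 2))
```

    Unlike the unweighted statistic, a disagreement of one grade is penalised less than a disagreement of three grades.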

  5. Rater Reliability of the Hardy Classification for Pituitary Adenomas in the Magnetic Resonance Imaging Era.

    Science.gov (United States)

    Mooney, Michael A; Hardesty, Douglas A; Sheehy, John P; Bird, C Roger; Chapple, Kristina; White, William L; Little, Andrew S

    2017-10-01

    Objectives  The Hardy classification is used to classify pituitary tumors for clinical and research purposes. The scale was developed using lateral skull radiographs and encephalograms, and its reliability has not been evaluated in the magnetic resonance imaging (MRI) era. Design  Fifty preoperative MRI scans of biopsy-proven pituitary adenomas using the sellar invasion and suprasellar extension components of the Hardy scale were reviewed. Setting  This study was a cohort study set at a single institution. Participants  There were six independent raters. Main Outcome Measures  The main outcome measures of this study were interrater reliability, intrarater reliability, and percent agreement. Results  Overall interrater reliability of both Hardy subscales on MRI was strong. However, reliability of the intermediate scores was weak, and percent agreement among raters was poor (12-16%) using the full scales. Dichotomizing the scale into clinically useful groups maintained strong interrater reliability for the sellar invasion scale and increased the percent agreement for both scales. Conclusion  This study raises important questions about the reliability of the original Hardy classification. Editing the measure to a clinically relevant dichotomous scale simplifies the rating process and may be useful for preoperative tumor characterization in the MRI era. Future research studies should use the dichotomized Hardy scale (sellar invasion Grades 0-III versus Grade IV, suprasellar extension Types 0-C versus Type D).
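
    The effect of dichotomizing a scale on percent agreement can be illustrated with a small sketch. Percent agreement is taken here to mean the share of scans on which all raters gave the same grade, which is an assumption because the abstract does not define it; the grades are invented, with sellar invasion Grades 0-IV coded as 0 to 4.

```python
def percent_agreement(ratings_by_case):
    """Percentage of cases on which every rater gave the same rating."""
    unanimous = sum(len(set(case)) == 1 for case in ratings_by_case)
    return 100.0 * unanimous / len(ratings_by_case)


def dichotomize(ratings_by_case, cutoff=4):
    """Collapse grades into below-cutoff versus at-or-above-cutoff."""
    return [[grade >= cutoff for grade in case] for case in ratings_by_case]


# Hypothetical grades from six raters on four scans (not study data).
cases = [
    [1, 2, 2, 1, 2, 3],
    [4, 4, 4, 4, 4, 4],
    [0, 1, 0, 0, 1, 0],
    [3, 4, 4, 4, 4, 4],
]
print(percent_agreement(cases))               # full five-level scale
print(percent_agreement(dichotomize(cases)))  # Grades 0-III versus Grade IV
```

    Disagreements among the intermediate grades disappear after collapsing, which is why percent agreement can rise while the information carried by each rating falls.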

  6. Dynamic multislice helical CT of maxillomandibular lesions. Distinction of ameloblastomas from other cystic lesions

    Energy Technology Data Exchange (ETDEWEB)

    Tozaki, Mitsuhiro; Hayashi, Katsuhiko; Fukuda, Kunihiko [Jikei Univ., Tokyo (Japan). School of Medicine

    2001-10-01

    The purpose of this study was to evaluate the clinical usefulness of dynamic multislice helical CT in differentiating ameloblastoma from other cystic lesions in cases of maxillomandibular cystic lesions. The study included 32 patients with maxillomandibular cystic lesions (ameloblastoma [n=6], myxofibroma [n=1], odontogenic keratocyst [n=3], dentigerous cyst [n=11], radicular cyst [n=11], and paradental cyst [n=2]). Dynamic study was performed before and 30 sec, and 90 sec after intravenous contrast medium administration. CT density values and percentage of density increase were calculated at 30 and 90 sec. In five cases of ameloblastoma, a rapidly enhancing area was detected within the cystic lesions at 30 sec, while no apparent rapid enhancement was seen in the other cystic lesions. Three cysts showed gradual enhancement in the marginal area at 90 sec. Comparing ameloblastoma and other kinds of cysts, we found significant differences in the percentage of density increase at 30 sec (p<0.01) and 90 sec (p<0.05). Dynamic multislice helical CT is useful in the diagnosis of cystic lesions of the maxillomandibular region, especially in the detection of neovascularities in ameloblastoma. (author)

  7. Dynamic multislice helical CT of maxillomandibular lesions. Distinction of ameloblastomas from other cystic lesions

    International Nuclear Information System (INIS)

    Tozaki, Mitsuhiro; Hayashi, Katsuhiko; Fukuda, Kunihiko

    2001-01-01

    The purpose of this study was to evaluate the clinical usefulness of dynamic multislice helical CT in differentiating ameloblastoma from other cystic lesions in cases of maxillomandibular cystic lesions. The study included 32 patients with maxillomandibular cystic lesions (ameloblastoma [n=6], myxofibroma [n=1], odontogenic keratocyst [n=3], dentigerous cyst [n=11], radicular cyst [n=11], and paradental cyst [n=2]). Dynamic study was performed before and 30 sec, and 90 sec after intravenous contrast medium administration. CT density values and percentage of density increase were calculated at 30 and 90 sec. In five cases of ameloblastoma, a rapidly enhancing area was detected within the cystic lesions at 30 sec, while no apparent rapid enhancement was seen in the other cystic lesions. Three cysts showed gradual enhancement in the marginal area at 90 sec. Comparing ameloblastoma and other kinds of cysts, we found significant differences in the percentage of density increase at 30 sec (p<0.01) and 90 sec (p<0.05). Dynamic multislice helical CT is useful in the diagnosis of cystic lesions of the maxillomandibular region, especially in the detection of neovascularities in ameloblastoma. (author)

  8. FFA STUDY OF MACULAR LESIONS

    Directory of Open Access Journals (Sweden)

    K. Vinayagamurthy

    2017-08-01

    BACKGROUND Macula is an important portion of retina that occupies the posterior pole of retina. Any disease that affects macula results in significant loss of central vision, form vision and colour vision to an extent. Macular lesions can be hereditary as well as acquired. Macular lesions occur in both younger and older individuals. Anatomically, a macular lesion can vary from a simple lesion like an RPE defect to a vision-threatening lesion like choroidal neovascular membrane. Many sensitive and specific screening tests, called ‘macular function tests’, are available to assess the functioning of the macula. However, greater understanding of the retinal vasculature led to the use of fluorescein angiography in the detection and screening of macular, retinovascular and optic disc lesions. Though fundus fluorescein angiography is a thirty-year-old procedure, it is still in vogue in almost all parts of the world. It has its own merits. The aim of the study is to study the role of fluorescein angiography in the evaluation of macular lesions. MATERIALS AND METHODS A hospital-based prospective randomised study was done, which included 50 patients. Detailed patient history was taken and thorough ocular and systemic examination was done. All patients were examined by ophthalmoscopy (direct and indirect) and slit-lamp examination with 90D, followed by fluorescein angiography. Ophthalmoscopic and fluorescein angiography findings were analysed and categorised. Patients were advised proper ocular and systemic treatment and follow up. RESULTS 50 cases with macular lesions were analysed and categorised into conditions like ARMD, CSR, macular oedema, CME, degenerations and dystrophies and miscellaneous conditions. FFA altered the diagnosis in 8% of cases and categorised the lesions in all cases. 16% of patients developed adverse reactions like allergy, vomiting and nausea. On statistical analysis, FFA proved to be a cheap and superior diagnostic tool in confirming

  9. Subcategorization of Suspicious Breast Lesions (BI-RADS Category 4) According to MRI Criteria: Role of Dynamic Contrast-Enhanced and Diffusion-Weighted Imaging.

    Science.gov (United States)

    Maltez de Almeida, João Ricardo; Gomes, André Boechat; Barros, Thomas Pitangueira; Fahel, Paulo Eduardo; de Seixas Rocha, Mário

    2015-07-01

    The purposes of this study were to investigate whether dynamic contrast-enhanced MRI is adequate for subcategorization of suspicious lesions (BI-RADS category 4) and to evaluate whether use of DWI improves diagnostic performance. The study group was composed of 103 suspicious lesions found in 83 subjects. Patient ages and lesion sizes were compiled, and two radiologists reanalyzed the images; subcategorized the findings as BI-RADS 4A, 4B, or 4C; and calculated apparent diffusion coefficient (ADC) values. The stratified variables were tested by univariate analysis and inserted in two multivariate predictive models, which were used to generate ROC curves and compare AUCs. Positive predictive values (PPVs) for each subcategory and ADC level were calculated, and interobserver agreement was tested. Forty-four (42.7%) suspicious findings proved malignant. Except for age (p = 0.08), all stratified predictor variables were significant in univariate analyses (p BI-RADS 4 subcategory (4A, 0.15; 4B, 0.37; 4C, 0.84). ADC values of 1.10 × 10⁻³ mm²/s or less had the second highest PPV (0.77). Interobserver agreement was substantial at a kappa value of 0.80 (95% CI, 0.70-0.90; p BI-RADS category 4) can be satisfactorily performed with DCE-MRI and slightly improved when DWI is introduced.

  10. The Americleft Speech Project: A Training and Reliability Study.

    Science.gov (United States)

    Chapman, Kathy L; Baylis, Adriane; Trost-Cardamone, Judith; Cordero, Kelly Nett; Dixon, Angela; Dobbelsteyn, Cindy; Thurmes, Anna; Wilson, Kristina; Harding-Bell, Anne; Sweeney, Triona; Stoddard, Gregory; Sell, Debbie

    2016-01-01

    To describe the results of two reliability studies and to assess the effect of training on interrater reliability scores. The first study (1) examined interrater and intrarater reliability scores (weighted and unweighted kappas) and (2) compared interrater reliability scores before and after training on the use of the Cleft Audit Protocol for Speech-Augmented (CAPS-A) with British English-speaking children. The second study examined interrater and intrarater reliability on a modified version of the CAPS-A (CAPS-A Americleft Modification) with American and Canadian English-speaking children. Finally, comparisons were made between the interrater and intrarater reliability scores obtained for Study 1 and Study 2. The participants were speech-language pathologists from the Americleft Speech Project. In Study 1, interrater reliability scores improved for 6 of the 13 parameters following training on the CAPS-A protocol. Comparison of the reliability results for the two studies indicated lower scores for Study 2 compared with Study 1. However, this appeared to be an artifact of the kappa statistic that occurred due to insufficient variability in the reliability samples for Study 2. When percent agreement scores were also calculated, the ratings appeared similar across Study 1 and Study 2. The findings of this study suggested that improvements in interrater reliability could be obtained following a program of systematic training. However, improvements were not uniform across all parameters. Acceptable levels of reliability were achieved for those parameters most important for evaluation of velopharyngeal function.
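
    The kappa artifact described above can be illustrated with a minimal sketch. It assumes two raters and the unweighted Cohen's kappa; the ratings are invented for illustration and are not data from the study, and the function names are hypothetical. When nearly all samples fall into one category, chance-expected agreement approaches observed agreement, so kappa drops even though percent agreement is unchanged.

```python
from collections import Counter


def percent_agreement(r1, r2):
    """Fraction of items on which the two raters gave the same rating."""
    return sum(a == b for a, b in zip(r1, r2)) / len(r1)


def cohen_kappa(r1, r2):
    """Unweighted Cohen's kappa for two raters rating the same items."""
    n = len(r1)
    p_o = percent_agreement(r1, r2)
    c1, c2 = Counter(r1), Counter(r2)
    # Chance agreement from each rater's marginal category frequencies.
    p_e = sum(c1[k] * c2[k] for k in set(c1) | set(c2)) / (n * n)
    if p_e == 1.0:
        return float("nan")  # kappa is undefined when chance agreement is total
    return (p_o - p_e) / (1 - p_e)


# Illustrative (made-up) ratings: both pairs agree on 18 of 20 items.
balanced_a = [0] * 10 + [1] * 10
balanced_b = [0] * 9 + [1] + [1] * 9 + [0]
skewed_a = [0] * 19 + [1]
skewed_b = [0] * 18 + [1] + [0]

for name, a, b in [("balanced", balanced_a, balanced_b),
                   ("skewed", skewed_a, skewed_b)]:
    print(name, percent_agreement(a, b), round(cohen_kappa(a, b), 3))
```

    In this sketch both pairs of raters agree on 90% of items, yet kappa is high for the balanced sample and slightly negative for the skewed one, which is why reporting percent agreement alongside kappa can be informative.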

  11. Cartilage lesions of the glenohumeral joint: diagnostic effectiveness of multidetector spiral CT arthrography and comparison with arthroscopy

    Energy Technology Data Exchange (ETDEWEB)

    Lecouvet, Frederic E.; Dorzee, Benjamin; Berg, Bruno C. vande; Malghem, Jacques [Cliniques Universitaires St Luc, Universite Catholique de Louvain, Department of Radiology, Brussels (Belgium); Dubuc, Jean E. [Cliniques Universitaires St Luc, Universite Catholique de Louvain, Department of Orthopaedic Surgery, Brussels (Belgium); Jamart, Jacques [Mont Godinne University Hospital, Center of Biostatistics, Yvoir (Belgium)

    2007-07-15

    This study assessed the diagnostic effectiveness of multidetector spiral CT arthrography (MDCTa) in detecting hyaline cartilage abnormalities of the shoulder joint, with correlation to arthroscopy. Shoulder MDCTa images prospectively obtained in 22 consecutive patients (mean age, 50 years; age range, 23-74 years; 12 female, 10 male) were evaluated for glenohumeral cartilage lesions. Two musculoskeletal radiologists independently analysed the cartilage surfaces of the humeral head and of the glenoid fossa in nine anatomical surface areas. Observations of MDCTa were compared to arthroscopic findings. The sensitivity and specificity of MDCTa for grade 2 (substance loss <50%) or higher and grade 3 (substance loss ≥50%) or higher cartilage lesions, the Spearman correlation coefficient between arthrographic and arthroscopic grading, and κ statistics for assessing intra- and interobserver reproducibility were determined. At MDCTa, sensitivities and specificities ranged between 80% and 94% for the detection of grade 2 or higher cartilage lesions, and between 88% and 98% for the detection of grade 3 or higher cartilage lesions. Spearman correlation coefficients between MDCTa and arthroscopic grading of articular surfaces ranged between 0.532 and 0.651. Interobserver agreement was moderate for grading all articular surfaces (κ = 0.457), but substantial to almost perfect for detecting lesions with substance loss (κ, 0.618-0.876). In conclusion, MDCTa is accurate for the study of cartilage surface in the entire shoulder joint. This technique may beneficially impact patient management by means of selecting the proper treatment approach. (orig.)
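
    As a brief illustration of how sensitivity and specificity against a reference standard are computed, the sketch below treats arthroscopy as the reference and MDCTa as the index test. The per-surface findings are invented for illustration only and the function name is hypothetical; this is not the authors' analysis code.

```python
def sensitivity_specificity(index_test, reference):
    """Return (sensitivity, specificity) of an index test against a reference.

    Both arguments are equal-length sequences of booleans, one entry per
    assessed surface: True means a lesion was reported.
    """
    tp = fp = tn = fn = 0
    for test_pos, ref_pos in zip(index_test, reference):
        if ref_pos and test_pos:
            tp += 1          # lesion present, detected
        elif ref_pos and not test_pos:
            fn += 1          # lesion present, missed
        elif not ref_pos and test_pos:
            fp += 1          # no lesion, falsely reported
        else:
            tn += 1          # no lesion, correctly cleared
    return tp / (tp + fn), tn / (tn + fp)


# Made-up findings for ten cartilage surfaces (not data from the study).
arthroscopy = [True, True, True, True, True, False, False, False, False, False]
mdcta = [True, True, True, True, False, False, False, False, False, True]

sens, spec = sensitivity_specificity(mdcta, arthroscopy)
print(f"sensitivity={sens:.2f} specificity={spec:.2f}")
```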

  12. Quantitative angiography methods for bifurcation lesions

    DEFF Research Database (Denmark)

    Collet, Carlos; Onuma, Yoshinobu; Cavalcante, Rafael

    2017-01-01

    Bifurcation lesions represent one of the most challenging lesion subsets in interventional cardiology. The European Bifurcation Club (EBC) is an academic consortium whose goal has been to assess and recommend the appropriate strategies to manage bifurcation lesions. The quantitative coronary...... angiography (QCA) methods for the evaluation of bifurcation lesions have been subject to extensive research. Single-vessel QCA has been shown to be inaccurate for the assessment of bifurcation lesion dimensions. For this reason, dedicated bifurcation software has been developed and validated. These software...

  13. Retrospective analysis of nonendodontic periapical lesions misdiagnosed as endodontic apical periodontitis lesions in a population of Taiwanese patients.

    Science.gov (United States)

    Huang, Hsun-Yu; Chen, Yuk-Kwan; Ko, Edward Cheng-Chuan; Chuang, Fu-Hsiung; Chen, Ping-Ho; Chen, Ching-Yi; Wang, Wen-Chen

    2017-07-01

    We aimed to evaluate nonendodontic periapical lesions clinically misdiagnosed as endodontic periapical pathoses in a population of Taiwanese patients. Cases (2000-2014) of histopathological diagnoses of nonendodontic periapical lesions were retrieved from all cases with a clinical diagnosis of radicular cyst, apical granuloma, or apical periodontitis in the institution. These cases were regarded as misdiagnosed nonendodontic periapical lesions, of which the types and frequencies, in addition to the demographic data, were determined. Four thousand and four specimens were clinically diagnosed as endodontically associated pathoses, of which 118 cases (2.95%) received a histopathological diagnosis of a nonendodontic pathologic entity, the most frequent lesion being keratocystic odontogenic tumor (KCOT, n = 38, 32.20%), followed by fibro-osseous lesion (n = 18, 15.25%), and dentigerous cyst (n = 13, 11.02%). Nine malignant lesions in the periapical area [squamous cell carcinoma (n = 7, 5.93%), adenoid cystic carcinoma (n = 1, 0.85%), and Langerhans cell histiocytosis (n = 1, 0.85%)] were also noted. A wide variety of histopathological diagnoses, including benign odontogenic and non-odontogenic cystic and tumorous lesions and infectious diseases, as well as malignant lesions, was noted in these 118 cases of nonendodontic periapical lesions. Squamous cell carcinoma was the most predominant malignancy of nonendodontic periapical lesions misdiagnosed as apical periodontitis lesions from imaging examination overlooking the clinical findings. The current data form a useful basis for clinicopathological investigation and educational teaching regarding nonendodontic periapical lesions misdiagnosed as endodontic apical periodontitis lesions.

  14. Post-radiotherapeutic heart lesions

    International Nuclear Information System (INIS)

    Testart, F.M.

    1979-05-01

    Heart structures have traditionally been considered radioresistant. In fact, all tissues subjected to radiotherapy can develop lesions. Possible damage includes: pericardial fibrosis, the commonest and best characterized form, which when associated with constriction leads to a stoppage pattern usually occurring late, around the 18th month, and whose frequency depends directly on the total radiation dose; fibrous myocarditis from direct damage to the heart muscle; stenosis-type lesions of the large coronary trunks; and, in exceptional cases, lesions of the aorta (hyperplastic degeneration of the intima and adventitia) or of the aortic sigmoid valves and the mitral valves. Three observations are reported, concerning a coronary lesion, a pericardial lesion, and a combined coronary, myocardial and pericardial lesion. Following this account, the irradiation techniques and main experimental data are reviewed, and the prophylactic and therapeutic consequences to be derived from our observations and those of the literature are examined [fr

  15. Convergence Analysis of Micro-Lesions (CAML): An approach to mapping of diffuse lesions from carotid revascularization

    Directory of Open Access Journals (Sweden)

    Allyson C. Rosen

    Carotid revascularization (endarterectomy, stenting) prevents stroke; however, procedure-related embolization is common and results in small brain lesions easily identified by diffusion weighted magnetic resonance imaging (DWI). A crucial barrier to understanding the clinical significance of these lesions has been the lack of a statistical approach to identify vulnerable brain areas. The problem is that the lesions are small, numerous, and non-overlapping. Here we address this problem with a new method, the Convergence Analysis of Micro-Lesions (CAML) technique, an extension of the Anatomic Likelihood Analysis (ALE). The method combines manual lesion tracing, constraints based on known lesion patterns, and convergence analysis to represent regions vulnerable to lesions as probabilistic brain atlases. Two studies were conducted over the course of 12 years in an active vascular surgery clinic. An analysis in an initial group of 126 patients at 1.5 T MRI was cross-validated in a second group of 80 patients at 3 T MRI. In CAML, lesions were manually defined and center points identified. Brains were aligned according to side of surgery since this factor powerfully determines lesion distribution. A convergence-based analysis was performed on each of these groups. Results indicated the most consistent region of vulnerability was in motor and premotor cortex regions. Smaller regions common to both groups included the dorsolateral prefrontal cortex and medial parietal regions. Vulnerability of motor cortex is consistent with previous work showing changes in hand dexterity associated with these procedures. The consistency of CAML also demonstrates the feasibility of this new approach to characterize small, diffuse, non-overlapping lesions in patients with multifocal pathologies. Keywords: Embolization, DWI, ALE

  16. Inter-rater reliability of postnatal ultrasound interpretation in infants with congenital hydronephrosis.

    Science.gov (United States)

    Vemulakonda, V M; Wilcox, D T; Torok, M R; Hou, A; Campbell, J B; Kempe, A

    2015-09-01

    The most common measurements of hydronephrosis are the anterior-posterior (AP) diameter and the Society for Fetal Urology (SFU) grading systems. To date, the inter-rater reliability (IRR) of these measures has not been compared in the postnatal period. The objectives of this study were to compare the IRR of the AP diameter and the SFU grading system in infants and to determine whether ultrasound findings other than pelvicalyceal dilation are associated with higher SFU grades. Initial postnatal ultrasounds of infants seen from February 1, 2011, to January 31, 2012, with a primary diagnosis of congenital hydronephrosis were included for review. Ultrasound images were de-identified and reviewed by four pediatric urologists. IRR was calculated using the intraclass correlation (ICC) measure. A paired t test was used to compare ICCs. Associations between SFU grade and other ultrasound findings were tested using Chi-square or Fisher's exact tests. A total of 112 kidneys in 56 patients were reviewed. IRR of the SFU grading system was high (right kidney ICC = 0.83, left kidney ICC = 0.85); however, IRR of AP diameter measurement was higher (right kidney ICC = 0.97, left kidney ICC = 0.98; p hydronephrosis on bivariable and multivariable analysis. The SFU grading system is associated with excellent IRR, although the AP diameter appears to have higher IRR. Physicians may consider ultrasound findings that are not explicitly included in the SFU system when assigning hydronephrosis grade, which may lead to variability in use of this classification system.

  17. Shareholders' agreements in Denmark

    DEFF Research Database (Denmark)

    Werlauff, Erik

    2010-01-01

    The article warns that with effect from 1 March 2010, the new Danish Companies Act (on public and private limited companies) has seriously weakened the effect of shareholders’ agreements which have been entered into on Danish companies. These agreements (in the act’s new terminology: “owners’ agreements”) cannot bind the company within the sense of company law under the new state of law, and voting rights agreements, agreements on right of pre-emption etc. will therefore only apply at the level of contract law between the parties to the agreement. This article for European Business Law Review analyses the consequences of this. Rights and duties in the owners’ agreements must now be reiterated to the greatest possible extent in the company’s articles of association so that the precepts become binding on the company and its management. Whether the parties to the owners’ agreement can be required...

  18. Localization of lesions in aphasia, (2)

    International Nuclear Information System (INIS)

    Hojo, Kei; Watanabe, Shunzo; Tasaki, Hiroichi; Sato, Tokijiro; Metoki, Hirofumi.

    1985-01-01

    The relationship between the focus and the extent of the lesions and the various symptoms was investigated. 1. Broca aphasics: More than 80% of the group with obvious anarthric components had lesions of the third frontal gyrus involving Broca's area and the lower part of the precentral gyrus as well as opercular and insular regions. The size of the lesions of this group was significantly larger than that of the group without marked anarthric components, and the latter was proved to have little localizing value. 2. Wernicke aphasics: The group with poor reading comprehension had cortical and/or subcortical lesions, involving posterior parts of both superior and middle temporal gyri as well as the supramarginal gyrus. On the other hand, lesions of the group with poor auditory comprehension were more anteriorly located and localized in the deep structures. Lesions of the group with poor Token test scores were large and scattered more anteriorly and/or posteriorly compared with those of the group with good Token test scores. 3. Amnestic aphasics: The group with poor naming scores had somewhat larger lesions than the group with good naming scores, and the lesions were scattered about the left hemisphere. The finding has proved that both groups had little localizing value. 4. Conduction aphasics: Lesions of the non-fluent type were significantly larger than those of the fluent type and distributed more anteriorly. However, highly involved lesions were located in the supramarginal gyrus and posterior parts of superior and/or middle temporal gyri. 5. Global aphasics: Lesions of the group with good articulation and prosody were observed to distribute more posteriorly in comparison with those of the other global aphasics. (J.P.N.)

  19. Intra- and inter-rater reliabilities of measurement of ultrasound imaging for muscle thickness and pennation angle of tibialis anterior muscle in stroke patients.

    Science.gov (United States)

    Cho, Ki Hun; Lee, Hwang Jae; Lee, Wan Hee

    2017-07-01

    Dysfunction of skeletal muscle has been commonly reported in stroke patients. The purpose of this study was to investigate the intra- and inter-rater reliabilities of measurement of ultrasound imaging (USI) for pennation angle (PA) and muscle thickness (MT) of tibialis anterior muscle in stroke patients. Thirty-four stroke patients (19 men) participated in this study. USI was used for measurement of PA and MT of the tibialis anterior muscles at rest and during maximum voluntary contraction (MVC). Two examiners acquired images from all participants during two separate testing sessions, seven days apart. Intra-class correlation coefficients (ICCs), confidence interval (CI), standard error of measurement, minimal detectable change, and Bland-Altman plots were used for estimation of reliability. In the intra-rater reliability between measures, for all variables (PA and MT of the paretic and non-paretic sides of tibialis anterior muscles at rest and during MVC), the ICCs ranged between 0.639 and 0.998 and the CI was within an acceptable range of 0.388-0.999. In inter-rater reliability between examiners for the two tests, for all variables, the ICCs ranged between 0.690 and 0.995 and the CI was within an acceptable range of 0.463-0.997. In addition, significant difference was observed between the paretic and non-paretic sides of the tibialis anterior muscle architecture (p stroke patients. In addition, objective and quantitative measurements of tibialis anterior muscle using USI may provide appropriate management for the walking recovery of stroke patients.
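
    The standard error of measurement (SEM) and minimal detectable change (MDC) named above are commonly derived from the ICC and the between-subject standard deviation. The sketch below uses the widely cited formulas SEM = SD × √(1 − ICC) and MDC95 = 1.96 × √2 × SEM; the abstract does not state which formulas the authors applied, so this choice is an assumption, and the input values are hypothetical.

```python
import math


def standard_error_of_measurement(sd, icc):
    """SEM = SD * sqrt(1 - ICC), with SD the between-subject standard deviation."""
    return sd * math.sqrt(1.0 - icc)


def minimal_detectable_change_95(sem):
    """MDC at the 95% confidence level: 1.96 * sqrt(2) * SEM."""
    return 1.96 * math.sqrt(2.0) * sem


# Hypothetical values (not taken from the study): muscle thickness SD of
# 2.0 mm and an ICC of 0.91.
sem = standard_error_of_measurement(sd=2.0, icc=0.91)
mdc = minimal_detectable_change_95(sem)
print(f"SEM={sem:.2f} mm, MDC95={mdc:.2f} mm")
```

    Under these assumptions, a change between sessions smaller than the MDC95 cannot be distinguished from measurement error.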

  20. Tracking the Evolution of Cerebral Gadolinium-Enhancing Lesions to Persistent T1 Black Holes in Multiple Sclerosis: Validation of a Semiautomated Pipeline.

    Science.gov (United States)

    Andermatt, Simon; Papadopoulou, Athina; Radue, Ernst-Wilhelm; Sprenger, Till; Cattin, Philippe

    2017-09-01

    Some gadolinium-enhancing multiple sclerosis (MS) lesions remain T1-hypointense over months ("persistent black holes, BHs") and represent areas of pronounced tissue loss. A reduced conversion of enhancing lesions to persistent BHs could suggest a favorable effect of a medication on tissue repair. However, the individual tracking of enhancing lesions can be very time-consuming in large clinical trials. We created a semiautomated workflow for tracking the evolution of individual MS lesions, to calculate the proportion of enhancing lesions becoming persistent BHs at follow-up. Our workflow automatically coregisters, compares, and detects overlaps between lesion masks at different time points. We tested the algorithm in a data set of Magnetic Resonance images (1.5 and 3T; spin-echo T1-sequences) from a phase 3 clinical trial (n = 1,272), in which all enhancing lesions and all BHs had been previously segmented at baseline and year 2. The algorithm analyzed the segmentation masks in a longitudinal fashion to determine which enhancing lesions at baseline turned into BHs at year 2. Images of 50 patients (192 enhancing lesions) were also reviewed by an experienced MRI rater, blinded to the algorithm results. In this MRI data set, there were no cases that could not be processed by the algorithm. At year 2, 417 lesions were classified as persistent BHs (417/1,613 = 25.9%). The agreement between the rater and the algorithm was > 98%. Due to the semiautomated procedure, this algorithm can be of great value in the analysis of large clinical trials, when a rater-based analysis would be time-consuming. Copyright © 2017 by the American Society of Neuroimaging.
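
    The overlap step of such a workflow can be sketched as follows. This is a simplified illustration, not the authors' pipeline: it assumes the masks are already coregistered and that each lesion is represented as a set of voxel coordinates, and the function names are hypothetical. The final line reproduces the proportion reported in the abstract (417 of 1,613 enhancing lesions).

```python
def count_converted(enhancing_lesions, follow_up_black_holes):
    """Count baseline enhancing lesions overlapping any follow-up black hole.

    Each lesion is a set of (x, y, z) voxel coordinates in a common,
    already coregistered space.
    """
    bh_voxels = set().union(*follow_up_black_holes)
    return sum(1 for lesion in enhancing_lesions if lesion & bh_voxels)


def conversion_percent(converted, total):
    """Percentage of enhancing lesions that became persistent black holes."""
    return 100.0 * converted / total


# Toy masks (made up): only the first baseline lesion overlaps a black hole.
baseline = [{(1, 1, 1), (1, 1, 2)}, {(5, 5, 5)}, {(9, 9, 9), (9, 9, 8)}]
follow_up = [{(1, 1, 2), (1, 2, 2)}, {(20, 20, 20)}]

converted = count_converted(baseline, follow_up)
print(converted, "of", len(baseline), "lesions converted")

# Proportion reported in the abstract: 417 of 1,613 lesions.
print(round(conversion_percent(417, 1613), 1))
```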

  1. Reliability of the detailed assessment of speed of handwriting on Flemish children.

    Science.gov (United States)

    Simons, Johan; Probst, Michel

    2014-01-01

    This study evaluates the reliability of the Detailed Assessment of Speed of Handwriting (DASH) in a Dutch-speaking sample of children. The sample included 650 boys and 513 girls (age range = 9-16 years). Handwriting speed measurements were obtained using the DASH. Interrater agreement, test-retest reliability, and internal consistency were calculated; gender and age effects were analyzed. Interrater agreement shows excellent reliability with intraclass correlation coefficients of at least 0.94. Test-retest correlations ranged from r = 0.65 to r = 0.81. The internal consistency measures, calculated with Cronbach's alpha, were between 0.88 and 0.94. Both gender and age have a significant effect on handwriting speed, with F (7.1144) = 17.43 (P handwriting speed of Dutch-speaking children. There is a tendency of girls to write faster than boys.

  2. Nora's lesion, a distinct radiological entity?

    International Nuclear Information System (INIS)

    Dhondt, E.; Oudenhoven, L.; Khan, S.; Kroon, H.M.; Hogendoorn, P.C.; Nieborg, A.; Bloem, J.L.; Schepper, A. de

    2006-01-01

    To describe the radiological findings of “Bizarre parosteal osteochondromatous proliferation” (BPOP), otherwise known as Nora's lesion, to describe the natural evolution of BPOP and to assess radiologically if BPOP is indeed part of a spectrum of reactive lesions including florid reactive periostitis and turret exostosis. Four experienced musculoskeletal radiologists studied plain radiographs and other imaging documents of histologically-proven Nora's lesions, looking for soft-tissue changes, periosteal reaction/calcification and calcified/ossified pseudotumours, and compared those findings with findings on pathology reviewed by a peer group of pathologists. Twenty-four Nora's lesions originating from a series of 200 consecutive, histologically-verified bone (pseudo)tumours of the hand, seen by the “Netherlands Committee on Bone Tumours” for review and second opinion. Nora's lesions have a recognised presentation on radiographs without specific MR characteristics. Natural evolution could be assessed retrospectively in four cases. Recurrent lesions were seen in seven cases and are difficult to differentiate from primary lesions. (orig.)

  3. Voxel-based lesion-symptom mapping of stroke lesions underlying somatosensory deficits

    Directory of Open Access Journals (Sweden)

    Sarah Meyer

    2016-01-01

    The aim of this study was to investigate the relationship between stroke lesion location and the resulting somatosensory deficit. We studied exteroceptive and proprioceptive somatosensory symptoms and stroke lesions in 38 patients with first-ever acute stroke. The Erasmus modified Nottingham Sensory Assessment was used to clinically evaluate somatosensory functioning in the arm and hand within the first week after stroke onset. Additionally, more objective measures such as the perceptual threshold of touch and somatosensory evoked potentials were recorded. Non-parametric voxel-based lesion-symptom mapping was performed to investigate lesion contribution to different somatosensory deficits in the upper limb. Additionally, structural connectivity of brain areas that demonstrated the strongest association with somatosensory symptoms was determined, using probabilistic fiber tracking based on diffusion tensor imaging data from a healthy age-matched sample. Voxels with a significant association to somatosensory deficits were clustered in two core brain regions: the central parietal white matter, also referred to as the sensory component of the superior thalamic radiation, and the parietal operculum close to the insular cortex, representing the secondary somatosensory cortex. Our objective recordings confirmed findings from clinical assessments. Probabilistic tracking connected the first region to thalamus, internal capsule, brain stem, postcentral gyrus, cerebellum, and frontal pathways, while the second region demonstrated structural connections to thalamus, insular and primary somatosensory cortex. This study reveals that stroke lesions in the sensory fibers of the superior thalamocortical radiation and the parietal operculum are significantly associated with multiple exteroceptive and proprioceptive deficits in the arm and hand.

  4. Voxel-based lesion-symptom mapping of stroke lesions underlying somatosensory deficits.

    Science.gov (United States)

    Meyer, Sarah; Kessner, Simon S; Cheng, Bastian; Bönstrup, Marlene; Schulz, Robert; Hummel, Friedhelm C; De Bruyn, Nele; Peeters, Andre; Van Pesch, Vincent; Duprez, Thierry; Sunaert, Stefan; Schrooten, Maarten; Feys, Hilde; Gerloff, Christian; Thomalla, Götz; Thijs, Vincent; Verheyden, Geert

    2016-01-01

    The aim of this study was to investigate the relationship between stroke lesion location and the resulting somatosensory deficit. We studied exteroceptive and proprioceptive somatosensory symptoms and stroke lesions in 38 patients with first-ever acute stroke. The Erasmus modified Nottingham Sensory Assessment was used to clinically evaluate somatosensory functioning in the arm and hand within the first week after stroke onset. Additionally, more objective measures such as the perceptual threshold of touch and somatosensory evoked potentials were recorded. Non-parametric voxel-based lesion-symptom mapping was performed to investigate lesion contribution to different somatosensory deficits in the upper limb. Additionally, structural connectivity of brain areas that demonstrated the strongest association with somatosensory symptoms was determined, using probabilistic fiber tracking based on diffusion tensor imaging data from a healthy age-matched sample. Voxels with a significant association to somatosensory deficits were clustered in two core brain regions: the central parietal white matter, also referred to as the sensory component of the superior thalamic radiation, and the parietal operculum close to the insular cortex, representing the secondary somatosensory cortex. Our objective recordings confirmed findings from clinical assessments. Probabilistic tracking connected the first region to thalamus, internal capsule, brain stem, postcentral gyrus, cerebellum, and frontal pathways, while the second region demonstrated structural connections to thalamus, insular and primary somatosensory cortex. This study reveals that stroke lesions in the sensory fibers of the superior thalamocortical radiation and the parietal operculum are significantly associated with multiple exteroceptive and proprioceptive deficits in the arm and hand.

  5. Transcriptome Analysis Reveals Markers of Aberrantly Activated Innate Immunity in Vitiligo Lesional and Non-Lesional Skin

    Science.gov (United States)

    Huang, Yuanshen; Wang, Yang; Yu, Jie; Gao, Min; Levings, Megan; Wei, Shencai; Zhang, Shengquan; Xu, Aie; Su, Mingwan; Dutz, Jan; Zhang, Xuejun; Zhou, Youwen

    2012-01-01

    Background Vitiligo is characterized by the death of melanocytes in the skin. This is associated with the presence of T cell infiltrates in the lesional borders. However, at present, there is no detailed and systematic characterization on whether additional cellular or molecular changes are present inside vitiligo lesions. Further, it is unknown if the normal appearing non-lesional skin of vitiligo patients is in fact normal. The purpose of this study is to systematically characterize the molecular and cellular characteristics of the lesional and non-lesional skin of vitiligo patients. Methods and Materials Paired lesional and non-lesional skin biopsies from twenty-three vitiligo patients and normal skin biopsies from sixteen healthy volunteers were obtained with informed consent. The following aspects were analyzed: (1) transcriptome changes present in vitiligo skin using DNA microarrays and qRT-PCR; (2) abnormal cellular infiltrates in vitiligo skin explant cultures using flow cytometry; and (3) distribution of the abnormal cellular infiltrates in vitiligo skin using immunofluorescence microscopy. Results Compared with normal skin, vitiligo lesional skin contained 17 genes (mostly melanocyte-specific genes) whose expression was decreased or absent. In contrast, the relative expression of 13 genes was up-regulated. The up-regulated genes point to aberrant activity of the innate immune system, especially natural killer cells in vitiligo. Strikingly, the markers of heightened innate immune responses were also found to be up-regulated in the non-lesional skin of vitiligo patients. Conclusions and Clinical Implications As the first systematic transcriptome characterization of the skin in vitiligo patients, this study revealed previously unknown molecular markers that strongly suggest aberrant innate immune activation in the microenvironment of vitiligo skin. Since these changes involve both lesional and non-lesional skin, our results suggest that therapies targeting

  6. Silent ischemic brain lesions after transcatheter aortic valve replacement : lesion distribution and predictors

    NARCIS (Netherlands)

    Samim, Mariam; Hendrikse, Jeroen; van der Worp, H. Bart; Agostoni, Pierfrancesco; Nijhoff, Freek; Doevendans, Pieter A.; Stella, Pieter R.

    Silent ischemic brain lesions and ischemic stroke are known complications of transcatheter aortic valve replacement (TAVR). We aimed to investigate the occurrence and distribution of TAVR-related silent ischemic brain lesions using diffusion-weighted magnetic resonance imaging (DWI). Consecutive

  7. Sonographic Findings of Morel-Lavallée Lesions

    International Nuclear Information System (INIS)

    Oh, Yu Jin; Yang, Ik; Lee, Yul; Woo, Ji Young; Hong, Hye Suk; Jung, Ah Young; Jeh, Su Kyung

    2011-01-01

    We reviewed the sonographic features of Morel-Lavallée lesions by correlating the US image findings with lesion age. We obtained the sonography reports of 20 Morel-Lavallée lesions of the hip and extremities from 18 patients with a history of trauma. The US images were reviewed to characterize the echogenicity, shape, homogeneity, margins, location and size of the lesions. The results were correlated with the age of the lesions and the clinical histories. All the Morel-Lavallée lesions were hypoechoic or anechoic fluid collections located between the subcutaneous fat and the underlying fascia. Regarding shape, the lobular lesions were all less than 21 days old, and the flat fluid collections were all more than 1 month old. Regarding homogeneity, the heterogeneous fluid collections were all less than 25 days old, and the homogeneous fluid collections were all more than 1 month old. On ultrasound examination, a Morel-Lavallée lesion appears as a posttraumatic fluid collection in the potential space between the subcutaneous fat and the underlying fascia. Acute Morel-Lavallée lesions tended to be heterogeneous and lobular, and they became more homogeneous and flat as the lesions evolved. Awareness of these imaging findings will help in properly diagnosing Morel-Lavallée lesions.

  8. Solitary lucent epiphyseal lesions in children

    Energy Technology Data Exchange (ETDEWEB)

    Gardner, D.J.; Azouz, E.M.

    1988-10-01

    We retrospectively evaluated the varying radiographic appearances of 15 solitary lucent epiphyseal lesions occurring in children. Imaging modalities used included plain films, conventional tomography, nuclear scintigraphy, and computed tomography. Six of the lesions (40%) were due to osteomyelitis. The remaining lesions included tuberculosis (1), foreign body granuloma (1), chondroblastoma (2), chondromyxoid fibroma (1), enchondroma (1), osteoid osteoma (2), and eosinophilic granuloma (1). Although the radiographic appearances of such lesions may be particularly characteristic, pathologic correlation is frequently necessary. The high incidence of osteomyelitis in our cases emphasizes its importance as a cause for a lucent epiphyseal lesion.

  9. Correlation of contrast agent kinetics between iodinated contrast-enhanced spectral tomosynthesis and gadolinium-enhanced MRI of breast lesions

    International Nuclear Information System (INIS)

    Froeling, Vera; Diekmann, Felix; Renz, Diane M.; Fallenberg, Eva M.; Steffen, Ingo G.; Diekmann, Susanne; Schmitzberger, Florian F.; Lawaczeck, Ruediger

    2013-01-01

    Assessment of contrast agent kinetics in contrast-enhanced MRI (CE-MRI) with gadolinium-containing contrast agents offers the opportunity to predict breast lesion malignancy. The goal of our study was to determine if similar patterns exist for spectral contrast-enhanced digital breast tomosynthesis (CE-DBT) using an iodinated contrast agent. The protocol of our prospective study was approved by the relevant institutional review board and the German Federal Office for Radiation Protection. All patients provided written informed consent. We included 21 women with a mean age of 62.4 years. All underwent ultrasound-guided biopsy of a suspect breast lesion, spectral CE-DBT and CE-MRI. For every breast lesion, contrast agent kinetics was assessed by signal intensity-time curves for spectral CE-DBT and CE-MRI. Statistical comparison used Cohen's kappa and Spearman's rho test. Spearman's rho of 0.49 showed significant (P = 0.036) correlation regarding the contrast agent kinetics in signal intensity-time curves for spectral CE-DBT and CE-MRI. Cohen's kappa indicated moderate agreement (kappa = 0.438). There is a statistically significant correlation between contrast agent kinetics in the signal intensity-time curves for spectral CE-DBT and CE-MRI. Observing intralesional contrast agent kinetics in spectral CE-DBT may aid evaluation of malignant breast lesions. (orig.)
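
    The abstract above reports Cohen's kappa as its agreement measure. As a minimal sketch of how that statistic is computed from two sets of categorical ratings, the following function may help; the function name and the example labels are hypothetical illustrations and are not data from the study.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two equal-length sequences of categorical ratings."""
    if len(ratings_a) != len(ratings_b) or len(ratings_a) == 0:
        raise ValueError("both rating sequences must be non-empty and equal in length")
    n = len(ratings_a)
    # Observed agreement: fraction of items given the same category by both methods.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Chance agreement: product of the marginal proportions, summed over categories.
    counts_a = Counter(ratings_a)
    counts_b = Counter(ratings_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    if expected == 1.0:
        # Degenerate case: both sequences use one identical category throughout.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical curve-type labels (1 = washout, 0 = no washout) for eight lesions.
method_1 = [1, 1, 1, 0, 0, 0, 1, 0]
method_2 = [1, 1, 0, 0, 0, 1, 1, 0]
kappa = cohen_kappa(method_1, method_2)
print(round(kappa, 3))
```

    In this illustrative case the two methods agree on 6 of 8 items (0.75) against a chance agreement of 0.5, which gives kappa = 0.5.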

  10. Histomorphological spectrum of breast lesions.

    Science.gov (United States)

    Parajuli, S; Koirala, U; Khatri, R; Acharya, L; Suwal, A

    2011-04-01

    Cancer of the breast is the second most common cause of cancer in women. A mass in the breast, whether benign or malignant, is a cause of anxiety for patients and their family members. All breast lumps are considered to be carcinomas until proved otherwise and are a cause of concern for both the patient and the surgeon. This is a retrospective study conducted in Kathmandu Model Hospital over three years, from August 2007 to August 2010. A total of 114 samples of breast tissue sent for histopathology were studied. The peak incidence of benign lesions was between 21 and 30 years and that of malignant lesions between 31 and 50 years. No breast lesions were seen in the first decade of life. Cancer of the breast was seen in 12.28% of cases. Fibroadenoma and fibrocystic disease were the commonest benign lesions and infiltrating ductal carcinoma was the commonest malignant lesion. Specimens from 10 male breasts were received. Gynaecomastia was the most common lesion encountered in males. Infiltrating ductal carcinoma was seen in a 70-year-old male. Breast cancer is one of the commonest causes of breast lump, particularly in women, and is a growing public health problem in Nepal.

  11. Petrous apex lesions in the pediatric population

    Energy Technology Data Exchange (ETDEWEB)

    Radhakrishnan, Rupa [University of Cincinnati College of Medicine, Department of Radiology, Cincinnati, OH (United States); Cincinnati Children's Hospital Medical Center, Department of Radiology, Cincinnati, OH (United States)]; Son, Hwa Jung [University of Cincinnati College of Medicine, Department of Otolaryngology-Head and Neck Surgery, Cincinnati, OH (United States)]; Koch, Bernadette L. [Cincinnati Children's Hospital Medical Center, Department of Radiology, Cincinnati, OH (United States)]

    2014-03-15

    A variety of abnormal imaging findings of the petrous apex are encountered in children. Many petrous apex lesions are identified incidentally while images of the brain or head and neck are being obtained for indications unrelated to the temporal bone. Differential considerations of petrous apex lesions in children include "leave me alone" lesions, infectious or inflammatory lesions, fibro-osseous lesions, neoplasms and neoplasm-like lesions, as well as a few rare miscellaneous conditions. Some lesions are similar to those encountered in adults, and some are unique to children. Langerhans cell histiocytosis (LCH) and primary and metastatic pediatric malignancies such as neuroblastoma, rhabdomyosarcoma and Ewing sarcoma are more likely to be encountered in children. Lesions such as petrous apex cholesterol granuloma, cholesteatoma and chondrosarcoma are more common in adults and are rarely a diagnostic consideration in children. We present a comprehensive pictorial review of CT and MRI appearances of pediatric petrous apex lesions. (orig.)

  12. Noninvasive differential diagnosis of dental periapical lesions in cone-beam CT scans

    Energy Technology Data Exchange (ETDEWEB)

    Okada, Kazunori, E-mail: kazokada@sfsu.edu [Department of Computer Science, San Francisco State University, San Francisco, California 94132 (United States)]; Rysavy, Steven [Biomedical and Health Informatics Program, University of Washington, Seattle, Washington 98195 (United States)]; Flores, Arturo [Computer Science and Engineering, University of California, San Diego, California 92093 (United States)]; Linguraru, Marius George [Sheikh Zayed Institute for Pediatric Surgical Innovation, Children’s National Medical Center, Washington, DC 20010 and Departments of Radiology and Pediatrics, George Washington University, Washington, DC 20037 (United States)]

    2015-04-15

    Purpose: This paper proposes a novel application of computer-aided diagnosis (CAD) to an everyday clinical dental challenge: the noninvasive differential diagnosis of periapical lesions between periapical cysts and granulomas. A histological biopsy is the most reliable method currently available for this differential diagnosis; however, this invasive procedure prevents the lesions from healing noninvasively despite a report that they may heal without surgical treatment. A CAD using cone-beam computed tomography (CBCT) offers an alternative noninvasive diagnostic tool which helps to avoid potentially unnecessary surgery and to investigate the unknown healing process and rate for the lesions. Methods: The proposed semiautomatic solution combines graph-based random walks segmentation with machine learning-based boosted classifiers and offers a robust clinical tool with minimal user interaction. As part of this CAD framework, the authors provide two novel technical contributions: (1) probabilistic extension of the random walks segmentation with likelihood ratio test and (2) LDA-AdaBoost: a new integration of weighted linear discriminant analysis to AdaBoost. Results: A dataset of 28 CBCT scans is used to validate the approach and compare it with other popular segmentation and classification methods. The results show the effectiveness of the proposed method with 94.1% correct classification rate and an improvement of the performance by comparison with the Simon’s state-of-the-art method by 17.6%. The authors also compare classification performances with two independent ground-truth sets from the histopathology and CBCT diagnoses provided by endodontic experts. Conclusions: Experimental results of the authors show that the proposed CAD system behaves in clearer agreement with the CBCT ground-truth than with histopathology, supporting the Simon’s conjecture that CBCT diagnosis can be as accurate as histopathology for differentiating the periapical lesions.

  13. Noninvasive differential diagnosis of dental periapical lesions in cone-beam CT scans

    International Nuclear Information System (INIS)

    Okada, Kazunori; Rysavy, Steven; Flores, Arturo; Linguraru, Marius George

    2015-01-01

    Purpose: This paper proposes a novel application of computer-aided diagnosis (CAD) to an everyday clinical dental challenge: the noninvasive differential diagnosis of periapical lesions between periapical cysts and granulomas. A histological biopsy is the most reliable method currently available for this differential diagnosis; however, this invasive procedure prevents the lesions from healing noninvasively despite a report that they may heal without surgical treatment. A CAD using cone-beam computed tomography (CBCT) offers an alternative noninvasive diagnostic tool which helps to avoid potentially unnecessary surgery and to investigate the unknown healing process and rate for the lesions. Methods: The proposed semiautomatic solution combines graph-based random walks segmentation with machine learning-based boosted classifiers and offers a robust clinical tool with minimal user interaction. As part of this CAD framework, the authors provide two novel technical contributions: (1) probabilistic extension of the random walks segmentation with likelihood ratio test and (2) LDA-AdaBoost: a new integration of weighted linear discriminant analysis to AdaBoost. Results: A dataset of 28 CBCT scans is used to validate the approach and compare it with other popular segmentation and classification methods. The results show the effectiveness of the proposed method with 94.1% correct classification rate and an improvement of the performance by comparison with the Simon’s state-of-the-art method by 17.6%. The authors also compare classification performances with two independent ground-truth sets from the histopathology and CBCT diagnoses provided by endodontic experts. Conclusions: Experimental results of the authors show that the proposed CAD system behaves in clearer agreement with the CBCT ground-truth than with histopathology, supporting the Simon’s conjecture that CBCT diagnosis can be as accurate as histopathology for differentiating the periapical lesions

  14. Noninvasive differential diagnosis of dental periapical lesions in cone-beam CT scans.

    Science.gov (United States)

    Okada, Kazunori; Rysavy, Steven; Flores, Arturo; Linguraru, Marius George

    2015-04-01

    This paper proposes a novel application of computer-aided diagnosis (CAD) to an everyday clinical dental challenge: the noninvasive differential diagnosis of periapical lesions between periapical cysts and granulomas. A histological biopsy is the most reliable method currently available for this differential diagnosis; however, this invasive procedure prevents the lesions from healing noninvasively despite a report that they may heal without surgical treatment. A CAD using cone-beam computed tomography (CBCT) offers an alternative noninvasive diagnostic tool which helps to avoid potentially unnecessary surgery and to investigate the unknown healing process and rate for the lesions. The proposed semiautomatic solution combines graph-based random walks segmentation with machine learning-based boosted classifiers and offers a robust clinical tool with minimal user interaction. As part of this CAD framework, the authors provide two novel technical contributions: (1) probabilistic extension of the random walks segmentation with likelihood ratio test and (2) LDA-AdaBoost: a new integration of weighted linear discriminant analysis to AdaBoost. A dataset of 28 CBCT scans is used to validate the approach and compare it with other popular segmentation and classification methods. The results show the effectiveness of the proposed method with 94.1% correct classification rate and an improvement of the performance by comparison with the Simon's state-of-the-art method by 17.6%. The authors also compare classification performances with two independent ground-truth sets from the histopathology and CBCT diagnoses provided by endodontic experts. Experimental results of the authors show that the proposed CAD system behaves in clearer agreement with the CBCT ground-truth than with histopathology, supporting the Simon's conjecture that CBCT diagnosis can be as accurate as histopathology for differentiating the periapical lesions.

  15. First International Consensus Conference on lesions of uncertain malignant potential in the breast (B3 lesions).

    Science.gov (United States)

    Rageth, Christoph J; O'Flynn, Elizabeth Am; Comstock, Christopher; Kurtz, Claudia; Kubik, Rahel; Madjar, Helmut; Lepori, Domenico; Kampmann, Gert; Mundinger, Alexander; Baege, Astrid; Decker, Thomas; Hosch, Stefanie; Tausch, Christoph; Delaloye, Jean-François; Morris, Elisabeth; Varga, Zsuzsanna

    2016-09-01

    The purpose of this study is to obtain a consensus for the therapy of B3 lesions. The first International Consensus Conference on lesions of uncertain malignant potential in the breast (B3 lesions), including atypical ductal hyperplasia (ADH), flat epithelial atypia (FEA), classical lobular neoplasia (LN), papillary lesions (PL), benign phyllodes tumors (PT), and radial scars (RS), took place in January 2016 in Zurich, Switzerland. It was organized by the International Breast Ultrasound School and the Swiss Minimally Invasive Breast Biopsy group, a subgroup of the Swiss Society of Senology. Consensus recommendations for the management and follow-up surveillance of these B3 lesions were developed and areas of research priorities were identified. The consensus recommendation for FEA, LN, PL, and RS diagnosed on core needle biopsy or vacuum-assisted biopsy (VAB) is to therapeutically excise the lesion seen on imaging by VAB and no longer by open surgery, with follow-up surveillance imaging for 5 years. The consensus recommendation for ADH and PT is, with some exceptions, therapeutic first-line open surgical excision. Minimally invasive management of selected B3 lesions with therapeutic VAB is acceptable as an alternative to first-line surgical excision.

  16. Morel-Lavallee lesion.

    Science.gov (United States)

    Li, Hui; Zhang, Fangjie; Lei, Guanghua

    2014-01-01

    To review current knowledge of the Morel-Lavallee lesion (MLL) to help clinicians become familiar with this entity. Familiarization may decrease missed diagnoses and misdiagnoses. It could also help steer the clinician to the proper treatment choice. A search was performed via PubMed and EMBASE from 1966 to July 2013 using the following keywords: Morel-Lavallee lesion, closed degloving injury, concealed degloving injury, Morel-Lavallee effusion, Morel-Lavallee hematoma, posttraumatic pseudocyst, posttraumatic soft tissue cyst. Chinese and English language literature relevant to the subject was collected, and the references of these publications were also reviewed. Morel-Lavallee lesion is a relatively rare condition involving a closed degloving injury. It is characterized by a filled cystic cavity created by separation of the subcutaneous tissue from the underlying fascia. Apart from the classic location over the region of the greater trochanter, MLLs have been described in other parts of the body. The natural history of MLL has not yet been established. The lesion may decrease in volume, remain stable, enlarge progressively or show a recurrent pattern. Diagnosis of MLL has often been missed or delayed. Ultrasonography, computed tomography, and magnetic resonance imaging have great value in the diagnosis of MLL. Treatment of MLL has included compression, local aspiration, open debridement, and sclerodesis. No standard treatment has been established. A diagnosis of MLL should be suspected when a soft, fluctuant area of skin or chronic recurrent fluid collection is found in a region exposed to a previous shear injury. Clinicians and radiologists should be aware of both the acute and chronic appearances to make the correct diagnosis. Treatment decisions should be based on any association with fractures, the condition of the lesion, and the symptoms and wishes of the patient.

  17. Correlation of the SLAP lesion with lesions of the medial sheath of the biceps tendon and intra-articular subscapularis tendon

    Directory of Open Access Journals (Sweden)

    Bennett William

    2009-01-01

    Background: Superior labral anterior to posterior (SLAP) lesions have been well described in the literature and are thought to be secondary to traction injuries to the biceps anchor and/or falls on the outstretched arm. The pulley has recently been described as a structure that aids in the prevention of biceps instability. The intra-articular subscapularis insertion (IASS) has been noted to contribute to the robust nature of the medial sheath. The purpose of the study was to determine a potential correlation of SLAP lesions and pulley lesions with/without IASS lesions (hereafter referred to as medial sheath), as forces that can disrupt the biceps anchor may also disrupt structures of the medial sheath or vice versa. Materials and Methods: Three hundred and sixteen consecutive shoulder arthroscopies performed by one surgeon were reviewed retrospectively. Operative reports and arthroscopic pictures were carefully reviewed with particular attention paid to the labral and pulley pathology. Selection bias was noted, as the author had never operated primarily for a Type 1 SLAP lesion. Accordingly, the exclusion criterion was a Type 1 SLAP lesion. Results: There were a total of 30 SLAP lesions and a total of 126 medial sheath lesions. There were 13 patients who had both SLAP and medial sheath lesions. There were 17 patients who had a SLAP lesion without a medial sheath lesion. There were 96 medial sheath lesions without a SLAP. A comparison of rates between patients who had a medial sheath lesion with a SLAP and those who had a medial sheath lesion without a SLAP, for the 316 patients, tested with a Fisher exact test, revealed no statistical significance (P = 0.673). The prevalence of SLAP lesions in this population of 316 patients was 9.4%, Buford 1%, medial sheath lesions 39%, and SLAP and medial sheath lesions 4%. Interestingly, there were three Buford complexes, all associated with a SLAP and one Buford complex

  18. Bilateral agreements

    International Nuclear Information System (INIS)

    Anon.

    2004-01-01

    The bilateral agreements concern Brazil and the United States, on cooperation in nuclear energy, and Germany and the Russian Federation, on the elimination and disposal of nuclear weapons. The multilateral agreements concern the signature of the Protocols to amend the Paris and Brussels Conventions, the multilateral nuclear environmental programme in the Russian Federation, and the status of Conventions in the field of nuclear energy. (N.C.)

  19. The performance of multimodal hyperspectral spectroscopy in the detection of precancerous cervical lesions

    Science.gov (United States)

    Trahmono; Lusiana, N.; Indarti, J.

    2017-08-01

    The aim of this study was to compare the performance of multimodal hyperspectral spectroscopy (MHS), which combines fluorescence and reflectance spectroscopy, with that of conventional laboratory-based screening tests, such as the Papanicolaou (Pap) smear test and human papilloma virus (HPV) DNA test, for detecting precancerous lesions of the cervix. The study utilized a cross-sectional design, and the kappa test was used in the analytical assessment. MHS scans were obtained from a sample of 70 consecutive patients, followed by sample collection for Pap and HPV DNA analysis and colposcopy referral, if indicated. Of the 70 patients evaluated, the results of cervical spectroscopy were normal in 38 (54.3%) patients, and they were abnormal in 32 (45.7%) patients. Based on the cytology results, 45 (64.3%) samples were normal, and 25 (35.7%) samples were abnormal. According to the results of the HPV DNA test, 47 (67.14%) samples were normal, and 17 (24.28%) samples were abnormal. Based on the results of the kappa test, the agreement between MHS and cytology was 0.793 (p < 0.001). The agreement between MHS and the HPV DNA test was 0.195 (p = 0.086), and the agreement between MHS and colposcopy was 0.479 (p < 0.001).

  20. Validating the Danish adaptation of the World Health Organization's International Classification for Patient Safety classification of patient safety incident types

    DEFF Research Database (Denmark)

    Mikkelsen, Kim Lyngby; Thommesen, Jacob; Andersen, Henning Boje

    2013-01-01

    Objectives Validation of a Danish patient safety incident classification adapted from the World Health Organization's International Classification for Patient Safety (ICPS-WHO). Design Thirty-three hospital safety management experts classified 58 safety incident cases selected to represent all types.......513 (range: 0.193–0.804). Kappa and ICC showed high correlation (r = 0.99). An inverse correlation was found between the prevalence of type and inter-rater reliability. Results are discussed according to four factors known to determine the inter-rater agreement: skill and motivation of raters; clarity...

  1. 48 CFR 25.403 - World Trade Organization Government Procurement Agreement and Free Trade Agreements.

    Science.gov (United States)

    2010-10-01

    ... Government Procurement Agreement and Free Trade Agreements. 25.403 Section 25.403 Federal Acquisition... 25.403 World Trade Organization Government Procurement Agreement and Free Trade Agreements. (a... in 25.402(a)(1). The WTO GPA and FTAs specify procurement procedures designed to ensure fairness (see...

  2. The value of digital tomosynthesis of the chest as a problem-solving tool for suspected pulmonary nodules and hilar lesions detected on chest radiography.

    Science.gov (United States)

    Galea, Angela; Dubbins, Paul; Riordan, Richard; Adlan, Tarig; Roobottom, Carl; Gay, David

    2015-05-01

    To assess the capability of digital tomosynthesis (DTS) of the chest compared to a postero-anterior (PA) and lateral chest radiograph (CXR) in the diagnosis of suspected but unconfirmed pulmonary nodules and hilar lesions detected on a CXR. Computed tomography (CT) was used as the reference standard. 78 patients with suspected non-calcified pulmonary nodules or hilar lesions on their CXR were included in the study. Two radiologists, blinded to the history and CT, prospectively analysed the CXR (PA and lateral) and the DTS images using a picture archiving and communication workstation and were asked to designate one of two outcomes: true intrapulmonary lesion or false intrapulmonary lesion. A CT of the chest performed within 4 weeks of the CXR was used as the reference standard. Inter-observer agreement and time to report the modalities were calculated for CXR and DTS. There were 34 true lesions confirmed on CT, 12 were hilar lesions and 22 were peripheral nodules. Of the 44 false lesions, 37 lesions were artefactual or due to composite shadow and 7 lesions were real but extrapulmonary simulating non-calcified intrapulmonary lesions. The PA and lateral CXR correctly classified 39/78 (50%) of the lesions, this improved to 75/78 (96%) with DTS. The sensitivity and specificity was 0.65 and 0.39 for CXR and 0.91 and 1 for DTS. Based on the DTS images, readers correctly classified all the false lesions but missed 3/34 true lesions. Two of the missed lesions were hilar in location and one was a peripheral nodule. All three missed lesions were incorrectly classified on DTS as composite shadow. DTS improves diagnostic confidence when compared to a repeat PA and lateral CXR in the diagnosis of both suspected hilar lesions and pulmonary nodules detected on CXR. DTS is able to exclude most peripheral pulmonary nodules but caution and further studies are needed to assess its ability to exclude hilar lesions. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
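
    The DTS figures in this abstract can be checked from the counts it gives. The following is a minimal sketch, assuming that the 3 missed true lesions are the only false negatives (31 of 34 true lesions detected) and that all 44 false lesions were correctly classified; the function name is a hypothetical helper and not part of the study.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Return sensitivity, specificity and accuracy from confusion-matrix counts."""
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one reference-positive and one reference-negative case")
    return {
        "sensitivity": tp / (tp + fn),  # true lesions that were detected
        "specificity": tn / (tn + fp),  # false lesions that were correctly excluded
        "accuracy": (tp + tn) / (tp + fn + tn + fp),
    }


# Counts inferred from the abstract for DTS (assumption: 34 true lesions, 3 missed,
# 44 false lesions all correctly classified).
dts = diagnostic_metrics(tp=31, fn=3, tn=44, fp=0)
for name, value in dts.items():
    print(f"{name}: {value:.2f}")
```

    With these counts the sketch yields a sensitivity of 0.91, a specificity of 1.00 and an accuracy of 0.96 (75/78), consistent with the values reported for DTS.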

  3. Evaluation of the SLAP lesion using a low-field (0.2T) magnetic resonance system

    International Nuclear Information System (INIS)

    Cho, Yong Soo; Back, Chang Hee; Lee, Kyung Rae; Shin, Yun Hack

    2007-01-01

    To evaluate the diagnostic capabilities of the low-field (0.2T) magnetic resonance (MR) system in the detection of the superior labrum anterior to posterior (SLAP) lesion. One hundred fifty patients underwent magnetic resonance imaging of the shoulder over a 7-month period. Forty-six patients underwent arthroscopic surgery, and the surgical results were correlated with the findings of the MR imaging. Arthroscopic procedures were performed within a mean of 8 days after MR imaging. MR imaging of the shoulder was conducted as follows: shoulder coil; T1-weighted spin echo, coronal-oblique images; T2-weighted gradient echo, coronal-oblique and axial images; and T2-weighted spin echo, coronal-oblique and sagittal-oblique images. One radiologist interpreted the MR images prospectively. The results of surgery were as follows: SLAP II in 26 shoulders, SLAP III in 1 shoulder, SLAP IV in 1 shoulder, normal labrum in 6 shoulders. For SLAP lesions with a higher grade than type 2, the sensitivity, specificity, positive predictive value, negative predictive value, and accuracy of the low-field MRI were 85.7%, 55.5%, 75%, 71%, and 74%, respectively. There was relatively good agreement between the MR results obtained using a low-field MR system and the surgical findings for identifying SLAP lesions.

  4. Nuclear cooperation agreements

    International Nuclear Information System (INIS)

    Nuclear cooperation agreements are reviewed in tabular form, especially agreements with developing countries. The reporting countries are the USA, the Federal Republic of Germany, Canada, Australia, Japan, and France. A separate EURATOM list is annexed

  5. Characterisation of focal liver lesions with unenhanced and contrast enhanced low MI real time ultrasound: On-site unblinded versus off-site blinded reading

    International Nuclear Information System (INIS)

    Hohmann, Joachim; Skrok, Jan; Basilico, Raffaella; Jennett, Manfred; Müller, Anja; Wolf, Karl-Jürgen; Albrecht, Thomas

    2012-01-01

    Objective: To compare on-site and blinded off-site reading of baseline ultrasound (US) and contrast enhanced ultrasound (CEUS) for classification and characterisation of focal liver lesions. Materials and methods: 99 patients (57 women and 42 men, age range 18–89 years, mean age: 59 years) with 53 malignant and 46 benign liver lesions were studied with unenhanced US followed by contrast enhanced US after injection of 2.4 ml SonoVue® (Bracco, Milano, Italy). Image interpretation was performed on-site with clinical information available by consensus of two readers and off-site by two independent blinded readers at two different centers. Comparison of pre and post contrast scans and of the different readers was performed. Reference examinations were histology, intraoperative US, MRI or CT. Results: Sensitivity for malignancy improved from 81/89/66% (on-site/off-site reader 1/2) before to 100/96/96% post contrast administration (p < 0.05, except for reader 1). Specificity improved from 48/48/54% on baseline US to 89/80/76% on CEUS (p < 0.05). Accuracy for specific lesion diagnosis was 62/59/50% pre and 90/77/72% post contrast (p < 0.05). Classification and characterisation post contrast were mildly inferior for off-site reading. Agreement between on-site and off-site readers of unenhanced scans was fair (κ = 0.29–0.39) while it was good for CEUS (κ = 0.63–0.79). Conclusions: CEUS improves classification and characterisation of focal liver lesions and interobserver agreement compared to conventional US. Classification and characterisation post contrast were mildly but statistically significantly better for on-site than for off-site reading.

  6. Stroke Lesions in a Large Upper Limb Rehabilitation Trial Cohort Rarely Match Lesions in Common Preclinical Models

    Science.gov (United States)

    Edwardson, Matthew A.; Wang, Ximing; Liu, Brent; Ding, Li; Lane, Christianne J.; Park, Caron; Nelsen, Monica A.; Jones, Theresa A; Wolf, Steven L; Winstein, Carolee J; Dromerick, Alexander W.

    2017-01-01

    Background Stroke patients with mild-moderate upper extremity (UE) motor impairments and minimal sensory and cognitive deficits provide a useful model to study recovery and improve rehabilitation. Laboratory-based investigators use lesioning techniques for similar goals. Objective Determine whether stroke lesions in an UE rehabilitation trial cohort match lesions from the preclinical stroke recovery models used to drive translational research. Methods Clinical neuroimages from 297 participants enrolled in the Interdisciplinary Comprehensive Arm Rehabilitation Evaluation (ICARE) study were reviewed. Images were characterized based on lesion type (ischemic or hemorrhagic), volume, vascular territory, depth (cortical gray matter, cortical white matter, subcortical), old strokes, and leukoaraiosis. Lesions were compared with those of preclinical stroke models commonly used to study upper limb recovery. Results Among the ischemic stroke participants, median infarct volume was 1.8 mL, with most lesions confined to subcortical structures (61%) including the anterior choroidal artery territory (30%) and the pons (23%). Of ICARE participants, stroke patients, but they represent a clinically and scientifically important subgroup. Compared to lesions in general stroke populations and widely-studied animal models of recovery, ICARE participants had smaller, more subcortically-based strokes. Improved preclinical-clinical translational efforts may require better alignment of lesions between preclinical and human stroke recovery models. PMID:28337932

  7. Radiologic appearance of primary jaw lesions in children

    Energy Technology Data Exchange (ETDEWEB)

    Gupta, Malini; Kaste, Sue C. [Department of Diagnostic Imaging, St. Jude Children's Research Hospital, Memphis, TN (United States)]; Hopkins, Kenneth P. [Department of Surgery, Division of Dentistry, St. Jude Children's Research Hospital, Memphis, TN (United States)]

    2002-03-01

    Background: The jaw (an unusual site of primary tumors in children and adolescents) has lesions often found incidentally by dentists on routine panoramic radiographs or during examination of a child who has swelling or tooth pain. Objective: This pictorial seeks to familiarize pediatric radiologists with the radiographic appearance of a variety of primary jaw lesions. Materials and methods: We retrospectively searched institutional records for cases of primary jaw lesions in children and adolescents. Jaw lesions were characterized as: I, well-circumscribed radiolucent lesions; II, lesions with mixed or variable appearance; III, poorly circumscribed radiolucent lesions; and IV, radiopaque lesions. Results: Although most oral and maxillofacial lesions in children are benign, a broad spectrum of tumors was identified; lesions may occur in patients with unrelated prior malignancy. Conclusion: Because radiologic studies may identify jaw lesions and direct further care, familiarity with the appearance of these entities is prudent. (orig.)

  8. Diagnostic agreement between a primary care physician and a teledermatologist for common dermatological conditions in North India

    Directory of Open Access Journals (Sweden)

    Binod Kumar Patro

    2015-01-01

    Background: Primary care physicians (PCPs) encounter a large number of patients with dermatological diseases. However, delivering appropriate management is a challenge considering the inadequate dermatology training offered during the undergraduate medical curriculum. Teledermatology is the clinical evaluation of skin lesions by dermatologists and allows patients to be diagnosed and treated from a distant site. It is seen as a potential solution to the shortage of specialists and a means of providing equitable service in remote areas. Aim: The study was aimed at estimating the diagnostic agreement for common dermatological conditions between a PCP and a teledermatologist. Materials and Methods: Consecutive patients with dermatological ailments who attended a primary health care clinic were recruited into the study, examined by the PCP and offered a diagnosis. The clinical images and patients' history were collected and transferred to a dermatologist at a tertiary center who also made a diagnosis. Agreement between the diagnoses made by the PCP and the teledermatologist was measured using kappa (κ) statistics. Results: Overall agreement between the diagnoses made by a PCP and the dermatologist was found to be 56%. Poor κ agreement (<0.4) was seen in the diagnosis of psoriasis and eczema. Conclusion: Teledermatology can supplement specialist dermatology service in remote areas. There was poor agreement in the diagnosis of psoriasis, in classifying various types of eczematous conditions, and in fungal infections. Scarce manpower in dermatology at the primary health care level, compounded by the burden of skin ailments, necessitates training of PCPs in common dermatological conditions.
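
    For two raters, the kappa statistic mentioned above corrects the raw percentage agreement for the agreement expected by chance. The following is a minimal sketch of Cohen's kappa, assuming that this is the variant used (the abstract does not say); the function name and the ten diagnoses are hypothetical and serve only to show how a raw agreement of 60% can correspond to a kappa below 0.4.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same cases."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty set of cases")
    n = len(rater_a)
    # Observed proportion of cases on which the raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement from each rater's marginal label frequencies.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1.0:
        # Both raters used one identical label throughout; kappa is
        # undefined here, and returning 1.0 is a convention of this sketch.
        return 1.0
    return (observed - expected) / (1 - expected)


# Hypothetical diagnoses for ten patients (illustrative only).
pcp = ["eczema", "eczema", "psoriasis", "fungal", "fungal",
       "eczema", "psoriasis", "fungal", "eczema", "fungal"]
derm = ["eczema", "psoriasis", "psoriasis", "fungal", "eczema",
        "eczema", "eczema", "fungal", "fungal", "fungal"]
kappa = cohens_kappa(pcp, derm)
print(f"kappa = {kappa:.3f}")
```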

  9. Lesion localization in aphasia without hemiparesis

    International Nuclear Information System (INIS)

    Komatsu, Midori; Senoh, Yoko; Okamoto, Koichi; Morimatsu, Mitsunori; Hirai, Shunsaku

    1983-01-01

    The distribution of the lesions responsible for aphasia without associated right-sided hemiparesis was evaluated by cranial computed tomography (CT) in stroke patients. In the Broca aphasia group, atypical aphasic symptoms were observed, and the lesions were far more localized than in ordinary Broca aphasia. The Wernicke aphasia group showed relatively large lesions in the left superior temporal gyrus, sometimes extending to the supramarginal and angular gyri, which caused additional symptoms such as apraxia without motor paresis in some cases. In the transcortical motor aphasia group, one patient showed occlusion of the left internal carotid artery, though without obvious abnormality on CT; in another patient, a circumscribed low-density lesion was disclosed in the area anterior and superior to the so-called Broca's area. In the transcortical sensory aphasia group, the lesion involved the border zone supplied by the left middle and posterior cerebral arteries. In the amnestic aphasia group, one patient showed a lesion in the left parietal lobe, while in another no remarkable change was demonstrated. In the global aphasia group, one patient had multiple isolated lesions in both the anterior and posterior speech areas. Another showed a large lesion involving the whole territory of the left middle cerebral artery. In the remaining one, a high-density area was observed in the left superior temporal, supramarginal and angular gyri, not extending to the frontal lobe beyond the sylvian fissure. Therefore, in interpreting CT scans of such aphasic patients, we must take account of not only the extent of the lesion but also the severity of destruction. (J.P.N.)

  10. Lesion localization in aphasia without hemiparesis

    Energy Technology Data Exchange (ETDEWEB)

    Komatsu, Midori; Senoh, Yoko; Okamoto, Koichi; Morimatsu, Mitsunori; Hirai, Shunsaku (Gunma Univ., Maebashi (Japan). School of Medicine)

    1983-06-01

    The distribution of the lesions responsible for aphasia without associated right-sided hemiparesis was evaluated by cranial computed tomography (CT) in stroke patients. In the Broca aphasia group, atypical aphasic symptoms were observed, and the lesions were far more localized than in ordinary Broca aphasia. The Wernicke aphasia group showed relatively large lesions in the left superior temporal gyrus, sometimes extending to the supramarginal and angular gyri, which caused additional symptoms such as apraxia without motor paresis in some cases. In the transcortical motor aphasia group, one patient showed occlusion of the left internal carotid artery, though without obvious abnormality on CT; in another patient, a circumscribed low-density lesion was disclosed in the area anterior and superior to the so-called Broca's area. In the transcortical sensory aphasia group, the lesion involved the border zone supplied by the left middle and posterior cerebral arteries. In the amnestic aphasia group, one patient showed a lesion in the left parietal lobe, while in another no remarkable change was demonstrated. In the global aphasia group, one patient had multiple isolated lesions in both the anterior and posterior speech areas. Another showed a large lesion involving the whole territory of the left middle cerebral artery. In the remaining one, a high-density area was observed in the left superior temporal, supramarginal and angular gyri, not extending to the frontal lobe beyond the sylvian fissure. Therefore, in interpreting CT scans of such aphasic patients, we must take account of not only the extent of the lesion but also the severity of destruction.

  11. International Fisheries Agreements

    DEFF Research Database (Denmark)

    Pintassilgo, Pedro; Kronbak, Lone Grønbæk; Lindroos, Marko

    2015-01-01

    This paper surveys the application of game theory to the economic analysis of international fisheries agreements. The relevance of this study comes not only from the existence of a vast literature on the topic but especially from the specific features of these agreements. The emphasis of the survey...... is on coalition games, an approach that has become prominent in the fisheries economics literature over the last decade. It is shown that coalition games were first applied to international fisheries agreements in the late 1990s addressing cooperative issues under the framework of characteristic function games...... and stability of international fisheries agreements. A key message that emerges from this literature strand is that self-enforcing cooperative management of internationally shared fish stocks is generally difficult to achieve. Hence, the international legal framework and regulations play a decisive role...

  12. Juxtacortical Lesions and Cortical Thinning in Multiple Sclerosis.

    Science.gov (United States)

    Pareto, D; Sastre-Garriga, J; Auger, C; Vives-Gilabert, Y; Delgado, J; Tintoré, M; Montalban, X; Rovira, A

    2015-12-01

    The role of juxtacortical lesions in brain volume loss in multiple sclerosis has not been fully clarified. The aim of this study was to explore the role of juxtacortical lesions on cortical atrophy and to investigate whether the presence of juxtacortical lesions is related to local cortical thinning in the early stages of MS. A total of 131 patients with clinically isolated syndrome or with relapsing-remitting MS were scanned on a 3T system. Patients with clinically isolated syndrome were classified into 3 groups based on the presence and topography of brain lesions: no lesions (n = 24), only non-juxtacortical lesions (n = 33), and juxtacortical lesions and non-juxtacortical lesions (n = 34). Patients with relapsing-remitting MS were classified into 2 groups: only non-juxtacortical lesions (n = 10) and with non-juxtacortical lesions and juxtacortical lesions (n = 30). A juxtacortical lesion probability map was generated, and cortical thickness was measured by using FreeSurfer. Juxtacortical lesion volume in relapsing-remitting MS was double that of patients with clinically isolated syndrome. The insula showed the highest density of juxtacortical lesions, followed by the temporal, parietal, frontal, and occipital lobes. Patients with relapsing-remitting MS with juxtacortical lesions showed significantly thinner cortices overall and in the parietal and temporal lobes compared with those with clinically isolated syndrome with normal brain MR imaging. The volume of subcortical structures (thalamus, pallidum, putamen, and accumbens) was significantly decreased in relapsing-remitting MS with juxtacortical lesions compared with clinically isolated syndrome with normal brain MR imaging. The spatial distribution of juxtacortical lesions was not found to overlap with areas of cortical thinning. Cortical thinning and subcortical gray matter volume loss in patients with a clinically isolated syndrome or relapsing-remitting MS was related to the presence of juxtacortical

  13. DW-MRI of liver lesions: Can a single ADC-value represent the entire lesion?

    International Nuclear Information System (INIS)

    Schmid-Tannwald, C.; Dahi, F.; Jiang, Y.; Ivancevic, M.K.; Rist, C.; Sethi, I.; Oommen, J.; Oto, A.

    2014-01-01

    Aim: To evaluate whether focal liver lesions (FLLs) exhibit a homogeneous appearance on apparent diffusion coefficient (ADC) maps and whether there is inter-section variation in the calculated ADC values of FLLs (inter-section range). Materials and methods: Eighty-eight patients with 128 FLLs (70 benign, 58 malignant) who underwent abdominal magnetic resonance imaging (MRI) including diffusion-weighted (DW)-MRI were included. Two observers evaluated variation of signal intensity of each FLL within each ADC map image (intra-section) and among different ADC map images through the lesion (inter-section). ADC values of each FLL and neighbouring liver parenchyma were measured on all sections. The inter-section range of FLLs was compared with the neighbouring liver parenchyma. Results: Intra-section inhomogeneity was noted in 39.8% (97/244 sections) and 38.9% (95/244) of benign lesions, and 61% (114/187 sections) and 61.5% (115/187) of malignant lesions, by observer 1 and observer 2, respectively. Inter-section inhomogeneity was noted in 25.7% (18/70) and 27.1% (19/70) of benign lesions, and 51.7% (30/58) and 50% (29/58) of malignant lesions, by observer 1 and observer 2, respectively. The inter-section ranges for both benign (0.28 × 10⁻³ mm²/s) and malignant (0.25 × 10⁻³ mm²/s) FLLs were significantly greater than those of liver parenchyma surrounding benign (0.16 × 10⁻³ mm²/s, p < 0.001) and malignant (0.14 × 10⁻³ mm²/s, p = 0.01) FLLs. Conclusion: Due to intra-/inter-section variations in ADC values of benign and malignant FLLs, a single ADC value may not reliably represent the entire lesion.
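
    As a rough illustration of the inter-section range, the sketch below assumes it is the difference between the largest and smallest per-section ADC measurement of the same structure; the abstract does not spell out the definition, so this is an assumption. The function name and all ADC values (in units of 10⁻³ mm²/s) are hypothetical.

```python
def inter_section_range(adc_values):
    """Spread of per-section ADC values, taken here as max minus min."""
    if not adc_values:
        raise ValueError("at least one section measurement is required")
    return max(adc_values) - min(adc_values)


# Hypothetical per-section ADC values, in units of 10^-3 mm^2/s.
lesion_adc = [1.10, 1.32, 1.25, 1.38]
parenchyma_adc = [1.05, 1.12, 1.18, 1.21]

lesion_range = inter_section_range(lesion_adc)
parenchyma_range = inter_section_range(parenchyma_adc)
print(f"lesion range: {lesion_range:.2f}, parenchyma range: {parenchyma_range:.2f}")
```

    A larger range for the lesion than for the neighbouring parenchyma is the pattern the study reports; a single-section measurement would hide that spread.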

  14. Evaluation of the Melanocytic Pathology Assessment Tool and Hierarchy for Diagnosis (MPATH-Dx) classification scheme for diagnosis of cutaneous melanocytic neoplasms: Results from the International Melanoma Pathology Study Group.

    Science.gov (United States)

    Lott, Jason P; Elmore, Joann G; Zhao, Ge A; Knezevich, Stevan R; Frederick, Paul D; Reisch, Lisa M; Chu, Emily Y; Cook, Martin G; Duncan, Lyn M; Elenitsas, Rosalie; Gerami, Pedram; Landman, Gilles; Lowe, Lori; Messina, Jane L; Mihm, Martin C; van den Oord, Joost J; Rabkin, Michael S; Schmidt, Birgitta; Shea, Christopher R; Yun, Sook Jung; Xu, George X; Piepkorn, Michael W; Elder, David E; Barnhill, Raymond L

    2016-08-01

    Pathologists use diverse terminology when interpreting melanocytic neoplasms, potentially compromising quality of care. We sought to evaluate the Melanocytic Pathology Assessment Tool and Hierarchy for Diagnosis (MPATH-Dx) scheme, a 5-category classification system for melanocytic lesions. Participants (n = 16) of the 2013 International Melanoma Pathology Study Group Workshop provided independent case-level diagnoses and treatment suggestions for 48 melanocytic lesions. Individual diagnoses (including, when necessary, least and most severe diagnoses) were mapped to corresponding MPATH-Dx classes. Interrater agreement and correlation between MPATH-Dx categorization and treatment suggestions were evaluated. Most participants were board-certified dermatopathologists (n = 15), age 50 years or older (n = 12), male (n = 9), based in the United States (n = 11), and primary academic faculty (n = 14). Overall, participants generated 634 case-level diagnoses with treatment suggestions. Mean weighted kappa coefficients for diagnostic agreement after MPATH-Dx mapping (assuming least and most severe diagnoses, when necessary) were 0.70 (95% confidence interval 0.68-0.71) and 0.72 (95% confidence interval 0.71-0.73), respectively, whereas correlation between MPATH-Dx categorization and treatment suggestions was 0.91. This was a small sample size of experienced pathologists in a testing situation. Varying diagnostic nomenclature can be classified into a concise hierarchy using the MPATH-Dx scheme. Further research is needed to determine whether this classification system can facilitate diagnostic concordance in general pathology practice and improve patient care. Copyright © 2016 American Academy of Dermatology, Inc. Published by Elsevier Inc. All rights reserved.
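
    A weighted kappa, as reported above, gives partial credit when two raters choose neighbouring classes on an ordinal scale such as the five MPATH-Dx categories. The sketch below uses linear disagreement weights, which is an assumption (the abstract does not state the weighting scheme); the function name and the eight paired ratings are hypothetical.

```python
def linear_weighted_kappa(rater_a, rater_b, n_categories):
    """Linearly weighted kappa for ordinal classes coded 1..n_categories."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must rate the same non-empty set of cases")
    n, k = len(rater_a), n_categories
    # Cross-tabulate the paired ratings (rows: rater A, columns: rater B).
    table = [[0] * k for _ in range(k)]
    for a, b in zip(rater_a, rater_b):
        table[a - 1][b - 1] += 1
    row = [sum(r) for r in table]
    col = [sum(table[i][j] for i in range(k)) for j in range(k)]
    # Disagreement weight |i - j|; its normalising constant cancels in the ratio.
    observed = sum(abs(i - j) * table[i][j] for i in range(k) for j in range(k))
    expected = sum(abs(i - j) * row[i] * col[j] for i in range(k) for j in range(k)) / n
    if expected == 0:
        # Both raters used one identical class throughout; treated as
        # perfect agreement by convention in this sketch.
        return 1.0
    return 1 - observed / expected


# Hypothetical class assignments for eight cases (illustrative only).
pathologist_1 = [1, 2, 3, 4, 5, 1, 3, 5]
pathologist_2 = [1, 2, 3, 4, 5, 2, 3, 4]
kappa_w = linear_weighted_kappa(pathologist_1, pathologist_2, n_categories=5)
print(f"weighted kappa = {kappa_w:.2f}")
```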

  15. Tratamiento no quirúrgico de lesiones periapicales (Non-surgical treatment of periapical lesions)

    Directory of Open Access Journals (Sweden)

    Pedro Pablo Ferro Benítez

    2005-08-01

    The treatment of teeth with chronic periapical lesions has been studied by various authors in the field of endodontics. Based on the results obtained in different experimental studies, we undertook the present investigation using 30 human teeth that, at the time of pulpo-radicular treatment, presented periapical lesions of variable diameter greater than 5 mm. Two groups were considered: in one, the root canals were temporarily filled with a paste of calcium hydroxide and distilled water; in the other, the canals were temporarily filled with a paste of calcium hydroxide, distilled water and camphorated paramonochlorophenol. The reduction of the periapical lesions was evaluated in both groups at 3 and 6 months after the start of treatment, and no statistically significant differences were observed. The same was true of the evaluation of periapical bone repair at 9 months after the start of treatment. The results were satisfactory with both techniques, which demonstrates an active role of calcium hydroxide in the repair of periapical lesions, regardless of its association with camphorated paramonochlorophenol.

  16. Atypical idiopathic inflammatory demyelinating lesions

    DEFF Research Database (Denmark)

    Wallner-Blazek, Mirja; Rovira, Alex; Fillipp, Massimo

    2013-01-01

    Atypical lesions of a presumably idiopathic inflammatory demyelinating origin present quite variably and may pose diagnostic problems. The subsequent clinical course is also uncertain. We, therefore, wanted to clarify if atypical idiopathic inflammatory demyelinating lesions (AIIDLs) can be classified according to previously suggested radiologic characteristics and how this classification relates to prognosis. Searching the databases of eight tertiary referral centres we identified 90 adult patients (61 women, 29 men; mean age 34 years) with ≥1 AIIDL. We collected their demographic, clinical...

  17. Measuring Cutaneous Lesions: Trends in Clinical Practice.

    Science.gov (United States)

    Zhang, Shali; Blalock, Travis W

    2018-03-01

    Knowing the size of a cutaneous lesion can be important for tracking its progression over time, selecting the proper treatment modality, surgical planning, determining prognosis, and accurate billing. However, providers vary in their consistency, accuracy, and methods of measuring cutaneous lesions. To investigate the clinical practices of US dermatologists and dermatologic surgeons regarding how they determine the size of cutaneous lesions. A survey was electronically distributed to members of the American Society for Dermatologic Surgery. Four hundred twenty-six dermatologists completed the online survey. When a lesion is suspected to be malignant, 85% of respondents obtained exact measurements most, if not all, of the time; however, only 8% did for benign lesions. Most providers determined lesion sizes themselves rather than delegating to staff. When performing visual estimation, approximately three-quarters believed that they were accurate to within 1 to 2 mm. The top reasons for obtaining exact measurements were for tracking atypical pigmented lesions, determining treatment pathways, and accurate billing. The majority of respondents believed that lesion size affected management decisions; however, the need for exact measurement remains controversial, particularly for benign lesions. Future studies may investigate whether taking exact versus estimated measurements has an effect on outcomes.

  18. Male breast lesions

    International Nuclear Information System (INIS)

    Matushita, J.P.K.; Andrade, L.G. de; Carregal, E.; Marimatsu, R.I.; Matushita, J.S.

    1989-01-01

    Roentgenographic examination of the male breast is an important aspect of the continued, intensive investigation of the radiologic morphology of the normal and diseased breast, conducted in 17 cases examined at the Instituto Nacional do Cancer - RJ. It is the purpose of this report to present the Roentgen appearance of various lesions of the male breast as they have been found in our practice and also to stress some of the difficulties in the differential diagnosis of these lesions. (author) [pt]

  19. Effects of consensus training on the reliability of auditory perceptual ratings of voice quality.

    Science.gov (United States)

    Iwarsson, Jenny; Reinholt Petersen, Niels

    2012-05-01

    This study investigates the effect of consensus training of listeners on intrarater and interrater reliability and agreement of perceptual voice analysis. The use of such training, including a reference voice sample, could be assumed to make the internal standards held in memory common and more robust, which is of great importance to reduce the variability of auditory perceptual ratings. A prospective design with testing before and after training. Thirteen students of audiologopedics served as listening subjects. The ratings were made using a multidimensional protocol with four-point equal-appearing interval scales. The stimuli consisted of text reading by authentic dysphonic patients. The consensus training for each perceptual voice parameter included (1) definition, (2) underlying physiology, (3) presentation of carefully selected sound examples representing the parameter in three different grades followed by group discussions of perceived characteristics, and (4) practical exercises including imitation to make use of the listeners' proprioception. Intrarater reliability and agreement showed a marked improvement for intermittent aphonia but not for vocal fry. Interrater reliability was high for most parameters before training with a slight increase after training. Interrater agreement showed marked increases for most voice quality parameters as a result of the training. The results support the recommendation of specific consensus training, including use of a reference voice sample material, to calibrate, equalize, and stabilize the internal standards held in memory by the listeners. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

  20. Thermophysical lesions caused by HZE particles

    International Nuclear Information System (INIS)

    Tobias, C.A.; Malachowski, M.; Nelson, A.; Philpott, D.E.

    1980-01-01

    This paper deals with a type of damage caused by heavy particles that may occur in subcellular structures. These lesions are called thermophysical radiation injury and are similar to damage produced in solids by HZE particles. This chapter summarizes some of the experimental evidence for the presence of these lesions in certain mammalian tissues, including the retina, brain, cornea, and lens of mice, and in seeds of corn. Of all tissues examined, only the cornea exhibited a type of lesion which would fulfill the criteria of thermophysical lesions.